Search | arXiv e-print repository

Origin of the Anisotropic Beer-Lambert Law from Dichroism and Birefringence in $β$-Ga$_2$O$_3$

Authors: Md Mohsinur Rahman Adnan, Mathias Schubert, Roberto C. Myers

Abstract: The anisotropic optical absorption edge of $β$-Ga$_2$O$_3$ follows a modified Beer-Lambert law having two effective absorption coefficients. The absorption coefficient of linearly polarized light reduces to the least absorbing direction beyond a critical penetration depth, which itself depends on polarization and wavelength. To understand this behavior, a Stokes vector analysis is performed to tra… ▽ More The anisotropic optical absorption edge of $β$-Ga$_2$O$_3$ follows a modified Beer-Lambert law having two effective absorption coefficients. The absorption coefficient of linearly polarized light reduces to the least absorbing direction beyond a critical penetration depth, which itself depends on polarization and wavelength. To understand this behavior, a Stokes vector analysis is performed to track the polarization state as a function of depth. The weakening of the absorption coefficient is associated with a gradual shift of linear polarization to the least absorbing crystallographic direction in the plane, which is along the a-exciton within the (010) plane or along the b-exciton in the (001) plane. We show that strong linear dichroism near the optical absorption edge causes this shift in $β$-Ga$_2$O$_3$, which arises from the anisotropy and spectral splitting of the physical absorbers i.e., excitons. The linear polarization shift is accompanied by a variation in the ellipticity due to the birefringence of $β$-Ga$_2$O$_3$. Analysis of the phase relationship between the incoming electric field to that at a certain depth reveals the phase speed as an effective refractive index, which varies along different crystallographic directions. The critical penetration depth is shown to be correlated with the depth at which the ellipticity is maximal. Thus, the anisotropic Beer-Lambert law arises from the interplay of both the dichroic and birefringent properties of $β$-Ga$_2$O$_3$. △ Less

Submitted 28 June, 2024; originally announced July 2024.

Comments: 11 pages, 5 figures

arXiv:2405.16307 [pdf, other]

The strain-stress relationships for coherent in-plane strain in heterostructures with monoclinic crystal systems: $β$-(Al$_x$Ga$_{1-x}$)$_2$O$_3$ on $(h0l)$ $β$-Ga$_2$O$_3$ as example

Authors: Mathias Schubert, Rafal Korlacki, Vanya Darakchieva

Abstract: In this work we derive the state of strain or stress under symmetry conserving conditions in pseudomorphic lattices with monoclinic symmetry. We compare surface vectors across the template epitaxial layer interface and impose conditions of a stress free epitaxial layer. As a result, we demonstrate the existence, in theory, of exactly three possible unit cells which can establish onto a given templ… ▽ More In this work we derive the state of strain or stress under symmetry conserving conditions in pseudomorphic lattices with monoclinic symmetry. We compare surface vectors across the template epitaxial layer interface and impose conditions of a stress free epitaxial layer. As a result, we demonstrate the existence, in theory, of exactly three possible unit cells which can establish onto a given template. We demonstrate this approach for a class of templates with $(h0l)$ planes and $β$-(Al$_x$Ga$_{1-x}$)$_2$O$_3$ on $(h0l)$ $β$-Ga$_2$O$_3$. We discuss the effects of composition $x$ and surface orientation onto the formation of three elastically stable unit cells, their strain and stress tensors, unit cell axes, unit cell volumes, lattice spacing, elastic potential energies, and stress free directions. The previous paradigm for epitaxial layer growth where the stress free direction is always perpendicular to the growing surface is not generally valid for low symmetry materials. In the example here, we find two possible competing domains with stress free direction oblique to the surface of the template for almost all planes $(h0l)$. We calculate the band-to-band transitions for $β$-(Al$_{0.1}$Ga$_{0.9}$)$_2$O$_3$ on $(h0l)$ $β$-Ga$_2$O$_3$ using the composition dependent deformation parameters and elastic coefficients reported prevoiously [Korlacki~\textit{et al.} Phys. Rev. Appl.~\textbf{18}, 064019 (2022)]. △ Less

Submitted 25 May, 2024; originally announced May 2024.

Comments: 10 pages, 8 figures

arXiv:2405.15382 [pdf, other]

The paramagnetic Lyddane-Sachs-Teller relation

Authors: Viktor Rindert, Vanya Darakchieva, Tapati Sarkar, Mathias Schubert

Abstract: In this letter, we derive an expression for magnetic dipole transitions that is analogous to the Lyddane-Sachs-Teller relation for dielectric polar lattice vibrations. We thereby define transverse and longitudinal optical frequencies at which paramagnetic resonance and antiresonance occurs, respectively. The relation found here thus permits non-invasive optical analysis of static magnetization pro… ▽ More In this letter, we derive an expression for magnetic dipole transitions that is analogous to the Lyddane-Sachs-Teller relation for dielectric polar lattice vibrations. We thereby define transverse and longitudinal optical frequencies at which paramagnetic resonance and antiresonance occurs, respectively. The relation found here thus permits non-invasive optical analysis of static magnetization properties in paramagnetic materials. Validated through terahertz electron paramagnetic resonance ellipsometry and superconducting quantum interference device measurements on Iron-doped Gallium Nitride, our findings show very good agreement between theory and experiment. We term the excitations associated with the paramagnetic transitions as paramagnetic polaritons which may find use in future photonic applications △ Less

Submitted 24 May, 2024; originally announced May 2024.

arXiv:2405.09683 [pdf]

Plasmonic Nanocavity to Boost Single Photon Emission from Defects in Thin Hexagonal Boron Nitride

Authors: Mohammadjavad Dowran, Ufuk Kilic, Suvechhya Lamichhane, Adam Erickson, Joshua Barker, Mathias Schubert, Sy-Hwang Liou, Christos Argyropoulos, Abdelghani Laraoui

Abstract: Efficient and compact single photon emission platforms operating at room temperature with ultrafast speed and high brightness will be fundamental components of the emerging quantum communications and computing fields. However, so far, it has been very challenging to design practical deterministic single photon emitters based on nanoscale solid state materials that meet the fast emission rate and s… ▽ More Efficient and compact single photon emission platforms operating at room temperature with ultrafast speed and high brightness will be fundamental components of the emerging quantum communications and computing fields. However, so far, it has been very challenging to design practical deterministic single photon emitters based on nanoscale solid state materials that meet the fast emission rate and strong brightness demands. Here we provide a solution to this longstanding problem by using metallic nanocavities integrated with hexagonal boron nitride (hBN) flakes with defects acting as nanoscale single photon emitters (SPEs) at room temperature. The presented hybrid nanophotonic structure creates a rapid speedup and large enhancement in single photon emission at room temperature. Hence, the nonclassical light emission performance is substantially improved compared to plain hBN flakes and hBN on gold layered structures without nanocavity. Extensive theoretical calculations are also performed to accurately model the new hybrid nanophotonic system and prove that the incorporation of plasmonic nanocavity is key to the efficient SPE performance. The proposed quantum nanocavity single photon source is expected to be an element of paramount importance to the envisioned room temperature integrated quantum photonic networks. △ Less

Submitted 15 May, 2024; originally announced May 2024.

arXiv:2404.18583 [pdf, other]

Context Matters: Leveraging Spatiotemporal Metadata for Semi-Supervised Learning on Remote Sensing Images

Authors: Maximilian Bernhard, Tanveer Hannan, Niklas Strauß, Matthias Schubert

Abstract: Remote sensing projects typically generate large amounts of imagery that can be used to train powerful deep neural networks. However, the amount of labeled images is often small, as remote sensing applications generally require expert labelers. Thus, semi-supervised learning (SSL), i.e., learning with a small pool of labeled and a larger pool of unlabeled data, is particularly useful in this domai… ▽ More Remote sensing projects typically generate large amounts of imagery that can be used to train powerful deep neural networks. However, the amount of labeled images is often small, as remote sensing applications generally require expert labelers. Thus, semi-supervised learning (SSL), i.e., learning with a small pool of labeled and a larger pool of unlabeled data, is particularly useful in this domain. Current SSL approaches generate pseudo-labels from model predictions for unlabeled samples. As the quality of these pseudo-labels is crucial for performance, utilizing additional information to improve pseudo-label quality yields a promising direction. For remote sensing images, geolocation and recording time are generally available and provide a valuable source of information as semantic concepts, such as land cover, are highly dependent on spatiotemporal context, e.g., due to seasonal effects and vegetation zones. In this paper, we propose to exploit spatiotemporal metainformation in SSL to improve the quality of pseudo-labels and, therefore, the final model performance. We show that directly adding the available metadata to the input of the predictor at test time degenerates the prediction quality for metadata outside the spatiotemporal distribution of the training set. Thus, we propose a teacher-student SSL framework where only the teacher network uses metainformation to improve the quality of pseudo-labels on the training set. Correspondingly, our student network benefits from the improved pseudo-labels but does not receive metadata as input, making it invariant to spatiotemporal shifts at test time. Furthermore, we propose methods for encoding and injecting spatiotemporal information into the model and introduce a novel distillation mechanism to enhance the knowledge transfer between teacher and student. Our framework dubbed Spatiotemporal SSL can be easily combined with several stat... △ Less

Submitted 29 April, 2024; originally announced April 2024.

arXiv:2404.12805 [pdf, other]

Bloch equations in Terahertz magnetic-resonance ellipsometry

Authors: Viktor Rindert, Steffen Richter, Philipp Kühne, Alexander Ruder, Vanya Darakchieva, Mathias Schubert

Abstract: A generalized approach derived from Blochs equation of motion of nuclear magnetic moments is presented to model the frequency, magnetic field, spin density, and temperature dependencies in the electromagnetic permeability tensor for materials with magnetic resonances. The resulting tensor model predicts characteristic polarization signatures which can be observed, for example, in fully polarizatio… ▽ More A generalized approach derived from Blochs equation of motion of nuclear magnetic moments is presented to model the frequency, magnetic field, spin density, and temperature dependencies in the electromagnetic permeability tensor for materials with magnetic resonances. The resulting tensor model predicts characteristic polarization signatures which can be observed, for example, in fully polarization-resolved Mueller matrix element spectra measured across magnetic resonances as a function of frequency, magnetic field, magnetic moment density, and temperature. When augmented with thermodynamic considerations and suitable Hamiltonian description of the magnetic eigenvalue spectrum, important parameters such as zero-frequency magnetization, spectral amplitude distribution, relaxation time constants, and geometrical orientation parameters of the magnetic moment density can be obtained from comparing the generalized model approach to experimental data. We demonstrate our approach by comparing model calculations with full Mueller matrix element spectra measured at oblique angle of incidence in the terahertz spectral range, across electron spin resonance quintuplet transitions observed in wurtzite-structure GaN doped with iron. Measurements were performed by ellipsometry, using a superconducting cryostat magnet at magnetic fields of 7.23 T and at temperatures of 20 K and 30 K. We detail the occurrence of linear and circular birefringence and dichroism associated with each of the zero-field split spin transitions in the S = 5/2 defect system. We derive the spectral dependence of the magnetic susceptibility function and obtain the temperature and magnetic field dependence of the spin Hamiltonian. Our model correctly predicts the complexity of the polarization signatures observed in the 15 independent elements of the normalized Mueller matrix for both positive and negative magnetic fields. △ Less

Submitted 19 April, 2024; originally announced April 2024.

arXiv:2404.12240 [pdf, other]

doi 10.1145/3274895.3274945

A Time-Inhomogeneous Markov Model for Resource Availability under Sparse Observations

Authors: Lukas Rottkamp, Matthias Schubert

Abstract: Accurate spatio-temporal information about the current situation is crucial for smart city applications such as modern routing algorithms. Often, this information describes the state of stationary resources, e.g. the availability of parking bays, charging stations or the amount of people waiting for a vehicle to pick them up near a given location. To exploit this kind of information, predicting fu… ▽ More Accurate spatio-temporal information about the current situation is crucial for smart city applications such as modern routing algorithms. Often, this information describes the state of stationary resources, e.g. the availability of parking bays, charging stations or the amount of people waiting for a vehicle to pick them up near a given location. To exploit this kind of information, predicting future states of the monitored resources is often mandatory because a resource might change its state within the time until it is needed. To train an accurate predictive model, it is often not possible to obtain a continuous time series on the state of the resource. For example, the information might be collected from traveling agents visiting the resource with an irregular frequency. Thus, it is necessary to develop methods which work on sparse observations for training and prediction. In this paper, we propose time-inhomogeneous discrete Markov models to allow accurate prediction even when the frequency of observation is very rare. Our new model is able to blend recent observations with historic data and also provide useful probabilistic estimates for future states. Since resources availability in a city is typically time-dependent, our Markov model is time-inhomogeneous and cyclic within a predefined time interval. To train our model, we propose a modified Baum-Welch algorithm. Evaluations on real-world datasets of parking bay availability show that our new method indeed yields good results compared to methods being trained on complete data and non-cyclic variants. △ Less

Submitted 18 April, 2024; originally announced April 2024.

Comments: 11 pages, long version of a paper published at 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (SIGSPATIAL 2018)

Journal ref: Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (pp. 460-463) 2018

arXiv:2404.10683 [pdf, other]

doi 10.3233/FAIA230573

Simplex Decomposition for Portfolio Allocation Constraints in Reinforcement Learning

Authors: David Winkel, Niklas Strauß, Matthias Schubert, Thomas Seidl

Abstract: Portfolio optimization tasks describe sequential decision problems in which the investor's wealth is distributed across a set of assets. Allocation constraints are used to enforce minimal or maximal investments into particular subsets of assets to control for objectives such as limiting the portfolio's exposure to a certain sector due to environmental concerns. Although methods for constrained Rei… ▽ More Portfolio optimization tasks describe sequential decision problems in which the investor's wealth is distributed across a set of assets. Allocation constraints are used to enforce minimal or maximal investments into particular subsets of assets to control for objectives such as limiting the portfolio's exposure to a certain sector due to environmental concerns. Although methods for constrained Reinforcement Learning (CRL) can optimize policies while considering allocation constraints, it can be observed that these general methods yield suboptimal results. In this paper, we propose a novel approach to handle allocation constraints based on a decomposition of the constraint action space into a set of unconstrained allocation problems. In particular, we examine this approach for the case of two constraints. For example, an investor may wish to invest at least a certain percentage of the portfolio into green technologies while limiting the investment in the fossil energy sector. We show that the action space of the task is equivalent to the decomposed action space, and introduce a new reinforcement learning (RL) approach CAOSD, which is built on top of the decomposition. The experimental evaluation on real-world Nasdaq-100 data demonstrates that our approach consistently outperforms state-of-the-art CRL benchmarks for portfolio optimization. △ Less

Submitted 16 April, 2024; originally announced April 2024.

Journal ref: ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Krakow, Poland

arXiv:2404.10646 [pdf, other]

doi 10.1109/MDM52706.2021.00026

Efficient Parking Search using Shared Fleet Data

Authors: Niklas Strauß, Lukas Rottkamp, Sebatian Schmoll, Matthias Schubert

Abstract: Finding an available on-street parking spot is a relevant problem of day-to-day life. In recent years, cities such as Melbourne and San Francisco deployed sensors that provide real-time information about the occupation of parking spots. Finding a free parking spot in such a smart environment can be modeled and solved as a Markov decision process (MDP). The problem has to consider uncertainty as av… ▽ More Finding an available on-street parking spot is a relevant problem of day-to-day life. In recent years, cities such as Melbourne and San Francisco deployed sensors that provide real-time information about the occupation of parking spots. Finding a free parking spot in such a smart environment can be modeled and solved as a Markov decision process (MDP). The problem has to consider uncertainty as available parking spots might not remain available until arrival due to other vehicles also claiming spots in the meantime. Knowing the parking intention of every vehicle in the environment would eliminate this uncertainty. Unfortunately, it does currently not seem realistic to have such data from all vehicles. In contrast, acquiring data from a subset of vehicles or a vehicle fleet appears feasible and has the potential to reduce uncertainty. In this paper, we examine the question of how useful sharing data within a vehicle fleet might be for the search times of particular drivers. We use fleet data to better estimate the availability of parking spots at arrival. Since optimal solutions for large scenarios are infeasible, we base our method on approximate solutions, which have been shown to perform well in single-agent settings. Our experiments are conducted on a simulation using real-world and synthetic data from the city of Melbourne. The results indicate that fleet data can significantly reduce search times for an available parking spot. △ Less

Submitted 16 April, 2024; originally announced April 2024.

Comments: Long Version; published at 2021 22nd IEEE International Conference on Mobile Data Management (MDM)

Journal ref: 2021 22nd IEEE International Conference on Mobile Data Management (MDM)

arXiv:2404.07300 [pdf, other]

Effective uniaxial dielectric function tensor and optical phonons in ($\bar{2}01$)-plane oriented $β$-Ga$_2$O$_3$ films with equally-distributed six-fold rotation domains

Authors: Alyssa Mock, Steffen Richter, Alexis Papamichail, Vallery Stanishev, Misagh Ghezellou, Jawad Ul-Hassan, Andreas Popp, Saud Bin Anooz, Daniella Gogova, Praneeth Ranga, Sriram Krishnamoorthy, Rafal Korlacki, Mathias Schubert, Vanya Darakchieva

Abstract: Monoclinic $β$-Ga$_2$O$_3$ films grown on $c$-plane sapphire have been shown to exhibit six $(\bar{2}01)$-plane oriented domains, which are equally-spaced-by-rotation around the surface normal and equally-sized-by-volume that render the film optical response effectively uniaxial. We derive and discuss an optical model suitable for ellipsometry data analysis of such films. We model mid- and far-inf… ▽ More Monoclinic $β$-Ga$_2$O$_3$ films grown on $c$-plane sapphire have been shown to exhibit six $(\bar{2}01)$-plane oriented domains, which are equally-spaced-by-rotation around the surface normal and equally-sized-by-volume that render the film optical response effectively uniaxial. We derive and discuss an optical model suitable for ellipsometry data analysis of such films. We model mid- and far-infrared ellipsometry data from undoped and electrically insulating films with an effective uniaxial dielectric tensor based on projections of all phonon modes within the rotation domains parallel and perpendicular to the sample normal, i.e., to the reciprocal lattice vector $\mathbf{g}_{\bar{2}01}$. Two effective response functions are described by model, and found sufficient to calculate ellipsometry data that best-match measured ellipsometry data from a representative film. We propose to render either effective dielectric functions, or inverse effective dielectric functions, each separately for electric field directions parallel and perpendicular to $\mathbf{g}_{\bar{2}01}$, by sums of Lorentz oscillators, which permit to determine either sets of transverse optical phonon mode parameters, or sets of longitudinal optical phonon mode parameters, respectively. Transverse optical modes common to both dielectric functions can be traced back to single crystal modes with $B_{\mathrm{u}}$ character, while modes with $A_{\mathrm{u}}$ character only appear within the dielectric function for polarization perpendicular to the sample surface. The thereby obtained parameter sets reveal all phonon modes anticipated from averaging over the six-fold rotation domains of single crystal $β$-Ga$_2$O$_3$, but with slightly shifted transverse optical, and completely different longitudinal optical phonon modes. △ Less

Submitted 10 April, 2024; originally announced April 2024.

Comments: 14 pgaes, 8 figures

arXiv:2403.18950 [pdf]

doi 10.1038/s41467-024-48051-4

Controlling the broadband enhanced light chirality with L-shaped dielectric metamaterials

Authors: Ufuk Kilic, Matthew Hilfiker, Shawn Wimer, Alexander Ruder, Eva Schubert, Mathias Schubert, Christos Argyropoulos

Abstract: The inherently weak chiroptical responses of natural materials limit their usage for controlling and enhancing chiral light-matter interactions. Recently, several nanostructures with subwavelength scale dimensions were demonstrated, mainly due to the advent of nanofabrication technologies, as a potential alternative to efficiently enhance chirality. However, the intrinsic lossy nature of metals an… ▽ More The inherently weak chiroptical responses of natural materials limit their usage for controlling and enhancing chiral light-matter interactions. Recently, several nanostructures with subwavelength scale dimensions were demonstrated, mainly due to the advent of nanofabrication technologies, as a potential alternative to efficiently enhance chirality. However, the intrinsic lossy nature of metals and inherent narrowband response of dielectric planar thin films or metasurface structures pose severe limitations toward the practical realization of broadband and tailorable chiral systems. Here, we tackle these problems by designing all-dielectric silicon-based L-shaped optical metamaterials based on tilted nanopillars that exhibit broadband and enhanced chiroptical response in transmission operation. We use an emerging bottom-up fabrication approach, named glancing angle deposition, to assemble these dielectric metamaterials on a wafer scale. The reported strong chirality and optical anisotropic properties are controllable in terms of both amplitude and operating frequency by simply varying the shape and dimensions of the nanopillars. The presented nanostructures can be used in a plethora of emerging nanophotonic applications, such as chiral sensors, polarization filters, and spin-locked nanowaveguides. △ Less

Submitted 27 March, 2024; originally announced March 2024.

Journal ref: U. Kilic, M. Hilfiker, S. Wimer, A. Ruder, E. Schubert, M. Schubert, and C. Argyropoulos, Controlling the broadband enhanced light chirality with L-shaped dielectric metamaterials, Nature Communications, vol. 15, p. 3757, 2024

arXiv:2402.19227 [pdf]

All epitaxial self-assembly of vertically-confined silicon color centers using ultra-low temperature epitaxy

Authors: Johannes Aberl, Enrique Prado Navarrete, Merve Karaman, Diego Haya Enriquez, Christoph Wilflingseder, Andreas Salomon, Daniel Primetzhofer, Markus Andreas Schubert, Giovanni Capellini, Thomas Fromherz, Peter Deák, Péter Udvarhelyi, Li Song, Ádám Gali, Moritz Brehm

Abstract: Silicon-based color-centers (SiCCs) have recently emerged as quantum-light sources that can be combined with telecom-range Si Photonics platforms. Unfortunately, using current SiCC fabrication, deterministic control over the vertical emitter position is impossible due to ion-implantation's stochastic nature. To overcome this bottleneck towards high-yield integration, we demonstrate a radically inn… ▽ More Silicon-based color-centers (SiCCs) have recently emerged as quantum-light sources that can be combined with telecom-range Si Photonics platforms. Unfortunately, using current SiCC fabrication, deterministic control over the vertical emitter position is impossible due to ion-implantation's stochastic nature. To overcome this bottleneck towards high-yield integration, we demonstrate a radically innovative creation method for various SiCCs, solely relying on epitaxial growth of Si and C-doped Si at atypically-low temperatures in a ultra-clean growth environment. These telecom emitters can be confined within sub-1nm thick layers embedded at arbitrary vertical positions within a highly crystalline Si matrix. Tuning growth conditions and do**, different SiCC types, e.g., W-centers, T-centers, G-centers, or derivatives like G'-centers can be created, which are particularly promising as Si-based single-photon sources and spin-photon interfaces. The zero-phonon emission from G'-centers can be conveniently tuned by the C-concentration, leading to a systematic wavelength shift and linewidth narrowing towards low emitter densities. △ Less

Submitted 21 May, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

arXiv:2402.06520 [pdf, other]

Accelerating Innovation in 6G Research: Real-Time Capable SDR System Architecture for Rapid Prototy**

Authors: Maximilian Engelhardt, Sebastian Giehl, Michael Schubert, Alexander Ihlow, Christian Schneider, Alexander Ebert, Markus Landmann, Giovanni Del Galdo, Carsten Andrich

Abstract: The upcoming 3GPP global mobile communication standard 6G strives to push the technological limits of radio frequency (RF) communication even further than its predecessors: Sum data rates beyond 100 Gbit/s, RF bandwidths above 1 GHz per link, and sub-millisecond latency necessitate very high performance development tools. We propose a new SDR firmware and software architecture designed explicitly… ▽ More The upcoming 3GPP global mobile communication standard 6G strives to push the technological limits of radio frequency (RF) communication even further than its predecessors: Sum data rates beyond 100 Gbit/s, RF bandwidths above 1 GHz per link, and sub-millisecond latency necessitate very high performance development tools. We propose a new SDR firmware and software architecture designed explicitly to meet these challenging requirements. It relies on Ethernet and commercial off-the-shelf network and server components to maximize flexibility and to reduce costs. We analyze state-of-the-art solutions (USRP X440 and other RFSoC-based systems), derive architectural design goals, explain resulting design decision in detail, and exemplify our architecture's implementation on the XCZU48DR RFSoC. Finally, we validate its performance via measurements and outline how the architecture surpasses the state-of-the-art with respect to sustained RF recording, while maintaining high Ethernet bandwidth efficiency. Building a micro-Doppler radar example, we demonstrate its real-time and rapid application development capabilities. △ Less

Submitted 23 May, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

arXiv:2401.06669 [pdf, other]

User-Centric Cell-Free Wireless Networks for 6G: Communication Theoretic Models and Research Challenges

Authors: Fabian Göttsch, Giuseppe Caire, Wen Xu, Martin Schubert

Abstract: This paper presents a comprehensive communication theoretic model for the physical layer of a cell-free user-centric network, formed by user equipments (UEs), radio units (RUs), and decentralized units (DUs), uniformly spatially distributed over a given coverage area. We consider RUs equipped with multiple antennas, and focus on the regime where the UE, RU, and DU densities are constant and theref… ▽ More This paper presents a comprehensive communication theoretic model for the physical layer of a cell-free user-centric network, formed by user equipments (UEs), radio units (RUs), and decentralized units (DUs), uniformly spatially distributed over a given coverage area. We consider RUs equipped with multiple antennas, and focus on the regime where the UE, RU, and DU densities are constant and therefore the number of such nodes grows with the coverage area. A system is said scalable if the computing load and information rate at any node in the network converges to a constant as the network size (coverage area) grows to infinity. This imposes that each UE must be processed by a (user-centric) finite-size cluster of RUs, and that such cluster processors are dynamically allocated to the DUs (e.g., as software defined virtual network functions) in order to achieve a balanced computation load. We also assume that the RUs are connected to the DUs through a packet switching network, in order to achieve adaptive routing and load balance. For this model, we define in details the dynamic cluster formation and uplink pilot allocation. As a consequence of the pilot allocation and the scalability constraint, each cluster processor has a partial view of the network channel state information. We define the condition of ``ideal partial CSI'' when the channel vectors that can be estimated are perfectly known (while the ones that cannot be estimated are not know at all). We develop two attractive cluster-based linear receiver schemes for the uplink, and an uplink-downlink duality that allows to reuse such vectors as precoders for the downlink. △ Less

Submitted 12 January, 2024; originally announced January 2024.

arXiv:2401.05969 [pdf, other]

Spatial-Aware Deep Reinforcement Learning for the Traveling Officer Problem

Authors: Niklas Strauß, Matthias Schubert

Abstract: The traveling officer problem (TOP) is a challenging stochastic optimization task. In this problem, a parking officer is guided through a city equipped with parking sensors to fine as many parking offenders as possible. A major challenge in TOP is the dynamic nature of parking offenses, which randomly appear and disappear after some time, regardless of whether they have been fined. Thus, solutions… ▽ More The traveling officer problem (TOP) is a challenging stochastic optimization task. In this problem, a parking officer is guided through a city equipped with parking sensors to fine as many parking offenders as possible. A major challenge in TOP is the dynamic nature of parking offenses, which randomly appear and disappear after some time, regardless of whether they have been fined. Thus, solutions need to dynamically adjust to currently fineable parking offenses while also planning ahead to increase the likelihood that the officer arrives during the offense taking place. Though various solutions exist, these methods often struggle to take the implications of actions on the ability to fine future parking violations into account. This paper proposes SATOP, a novel spatial-aware deep reinforcement learning approach for TOP. Our novel state encoder creates a representation of each action, leveraging the spatial relationships between parking spots, the agent, and the action. Furthermore, we propose a novel message-passing module for learning future inter-action correlations in the given environment. Thus, the agent can estimate the potential to fine further parking violations after executing an action. We evaluate our method using an environment based on real-world data from Melbourne. Our results show that SATOP consistently outperforms state-of-the-art TOP agents and is able to fine up to 22% more parking offenses. △ Less

Submitted 11 January, 2024; originally announced January 2024.

Comments: SIAM SDM 2024

arXiv:2312.00779 [pdf, other]

Nanocolumnar Material Platforms:Universal structural parameters revealed from optical anisotropy

Authors: Ufuk Kilic, Yousra Traouli, Matthew Hilfiker, Khalil Bryant, Stefan Schoeche, Rene Feder, Christos Argyropoulos, Eva Schubert, Mathias Schubert

Abstract: Nanostructures represent a frontier where meticulous attention to the control and assessment of structural dimensions becomes a linchpin for their seamless integration into diverse technological applications. By using integrative and comprehensive methodical series of studies, we investigate the evolution of the depolarization factors in the anisotropic Bruggeman effective medium approximation, th… ▽ More Nanostructures represent a frontier where meticulous attention to the control and assessment of structural dimensions becomes a linchpin for their seamless integration into diverse technological applications. By using integrative and comprehensive methodical series of studies, we investigate the evolution of the depolarization factors in the anisotropic Bruggeman effective medium approximation, that are extremely sensitive to the changes in critical dimensions of the nanostructure platforms. To this end, we fabricate spatially coherent highly-ordered slanted nanocolumns from zirconia, silicon, titanium, and permalloy on silicon substrates with varying column lengths using glancing angle deposition. In tandem, broad-spectral range Mueller matrix spectroscopic ellipsometry data, spanning from the near-infrared to the vacuum ultraviolet (0.72 eV to 6.5 eV), is analyzed with a best-match model approach based on the anisotropic Bruggeman effective medium theory. We thereby extracted the anisotropic optical properties including complex dielectric function, birefringence, and dichroism. Most notably, our research unveils a universal, material-independent inverse relationship between depolarization factors and column length. We envision that the presented universal relationship will permit accurate prediction of optical properties of nanocolumnar thin films improving their integration and optimization for optoelectronic and photonic device applications. △ Less

Submitted 1 December, 2023; originally announced December 2023.

Comments: 20 pages, 10 figures

arXiv:2311.15913 [pdf, other]

doi 10.1007/s11044-023-09934-4

Discrete Adjoint Method for Variational Integration of Constrained ODEs and its application to Optimal Control of Geometrically Exact Beam Dynamics

Authors: Matthias Schubert, Rodrigo T. Sato Martín de Almagro, Karin Nachbagauer, Sina Ober-Blöbaum, Sigrid Leyendecker

Abstract: Direct methods for the simulation of optimal control problems apply a specific discretization to the dynamics of the problem, and the discrete adjoint method is suitable to calculate corresponding conditions to approximate an optimal solution. While the benefits of structure preserving or geometric methods have been known for decades, their exploration in the context of optimal control problems is… ▽ More Direct methods for the simulation of optimal control problems apply a specific discretization to the dynamics of the problem, and the discrete adjoint method is suitable to calculate corresponding conditions to approximate an optimal solution. While the benefits of structure preserving or geometric methods have been known for decades, their exploration in the context of optimal control problems is a relatively recent field of research. In this work, the discrete adjoint method is derived for variational integrators yielding structure preserving approximations of the dynamics firstly in the ODE case and secondly for the case in which the dynamics is subject to holonomic constraints. The convergence rates are illustrated by numerical examples. Thirdly, the discrete adjoint method is applied to geometrically exact beam dynamics, represented by a holonomically constrained PDE. △ Less

Submitted 27 November, 2023; originally announced November 2023.

Comments: Funding: H2020 Marie-Skłodowska-Curie 860124

MSC Class: 34; 35; 49; 70; 74

Journal ref: Multibody System Dynamics, 5 September 2023

arXiv:2311.13324 [pdf, other]

Optically manipulated micromirrors for precise excitation of WGM microlasers

Authors: Tomasz Plaskocinski, Libin Yan, Marcel Schubert, Malte C. Gather, Andrea Di Falco

Abstract: Whispering gallery mode microlasers are highly sensitive refractive index sensors widely explored for biophotonic and biomedical applications. Microlaser excitation and collection of the emitted light typically utilize microscope objectives at normal incidence, limiting the choice of the oscillation plane of the modes. Here, we present a platform that enables the excitation of microlasers from var… ▽ More Whispering gallery mode microlasers are highly sensitive refractive index sensors widely explored for biophotonic and biomedical applications. Microlaser excitation and collection of the emitted light typically utilize microscope objectives at normal incidence, limiting the choice of the oscillation plane of the modes. Here, we present a platform that enables the excitation of microlasers from various directions using an optically manipulated micromirror. The scheme enables precise sensing of the environment surrounding the microlasers along different well-controlled planes. We further demonstrate the capability of the platform to perform a time-resolved experiment of dynamic sensing using a polystyrene probe bead orbiting the microlaser. △ Less

Submitted 22 November, 2023; originally announced November 2023.

arXiv:2310.00372 [pdf, other]

Deep Active Learning with Noisy Oracle in Object Detection

Authors: Marius Schubert, Tobias Riedlinger, Karsten Kahl, Matthias Rottmann

Abstract: Obtaining annotations for complex computer vision tasks such as object detection is an expensive and time-intense endeavor involving a large number of human workers or expert opinions. Reducing the amount of annotations required while maintaining algorithm performance is, therefore, desirable for machine learning practitioners and has been successfully achieved by active learning algorithms. Howev… ▽ More Obtaining annotations for complex computer vision tasks such as object detection is an expensive and time-intense endeavor involving a large number of human workers or expert opinions. Reducing the amount of annotations required while maintaining algorithm performance is, therefore, desirable for machine learning practitioners and has been successfully achieved by active learning algorithms. However, it is not merely the amount of annotations which influences model performance but also the annotation quality. In practice, the oracles that are queried for new annotations frequently contain significant amounts of noise. Therefore, cleansing procedures are oftentimes necessary to review and correct given labels. This process is subject to the same budget as the initial annotation itself since it requires human workers or even domain experts. Here, we propose a composite active learning framework including a label review module for deep object detection. We show that utilizing part of the annotation budget to correct the noisy annotations partially in the active dataset leads to early improvements in model performance, especially when coupled with uncertainty-based query strategies. The precision of the label error proposals has a significant influence on the measured effect of the label review. In our experiments we achieve improvements of up to 4.5 mAP points of object detection performance by incorporating label reviews at equal annotation budget. △ Less

Submitted 30 September, 2023; originally announced October 2023.

arXiv:2308.08573 [pdf, other]

Fourier modal method for inverse design of metasurface-enhanced micro-LEDs

Authors: Martin F. Schubert, Alec M. Hammond

Abstract: We present a simulation capability for micro-scale light-emitting diodes (uLEDs) that achieves comparable accuracy to CPU-based finite-difference time-domain simulation but is more than 10^7 times faster. Our approach is based on the Fourier modal method (FMM) -- which, as we demonstrate, is well suited to modeling thousands of incoherent sources -- with extensions that allow rapid convergence for… ▽ More We present a simulation capability for micro-scale light-emitting diodes (uLEDs) that achieves comparable accuracy to CPU-based finite-difference time-domain simulation but is more than 10^7 times faster. Our approach is based on the Fourier modal method (FMM) -- which, as we demonstrate, is well suited to modeling thousands of incoherent sources -- with extensions that allow rapid convergence for uLED structures that are challenging to model with standard approaches. The speed of our method makes the inverse design of uLEDs tractable, which we demonstrate by designing a metasurface-enhanced uLED that doubles the light extraction efficiency of an unoptimized device. △ Less

Submitted 15 August, 2023; originally announced August 2023.

Comments: 15 pages, 10 figures

arXiv:2306.07835 [pdf, other]

LMD: Light-weight Prediction Quality Estimation for Object Detection in Lidar Point Clouds

Authors: Tobias Riedlinger, Marius Schubert, Sarina Penquitt, Jan-Marcel Kezmann, Pascal Colling, Karsten Kahl, Lutz Roese-Koerner, Michael Arnold, Urs Zimmermann, Matthias Rottmann

Abstract: Object detection on Lidar point cloud data is a promising technology for autonomous driving and robotics which has seen a significant rise in performance and accuracy during recent years. Particularly uncertainty estimation is a crucial component for down-stream tasks and deep neural networks remain error-prone even for predictions with high confidence. Previously proposed methods for quantifying… ▽ More Object detection on Lidar point cloud data is a promising technology for autonomous driving and robotics which has seen a significant rise in performance and accuracy during recent years. Particularly uncertainty estimation is a crucial component for down-stream tasks and deep neural networks remain error-prone even for predictions with high confidence. Previously proposed methods for quantifying prediction uncertainty tend to alter the training scheme of the detector or rely on prediction sampling which results in vastly increased inference time. In order to address these two issues, we propose LidarMetaDetect (LMD), a light-weight post-processing scheme for prediction quality estimation. Our method can easily be added to any pre-trained Lidar object detector without altering anything about the base model and is purely based on post-processing, therefore, only leading to a negligible computational overhead. Our experiments show a significant increase of statistical reliability in separating true from false predictions. We propose and evaluate an additional application of our method leading to the detection of annotation errors. Explicit samples and a conservative count of annotation error proposals indicates the viability of our method for large-scale datasets like KITTI and nuScenes. On the widely-used nuScenes test dataset, 43 out of the top 100 proposals of our method indicate, in fact, erroneous annotations. △ Less

Submitted 15 June, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

Comments: 19 pages, 11 figures, 11 tables

arXiv:2305.19431 [pdf]

doi 10.1103/PhysRevApplied.21.054059

The anisotropic Beer-Lambert law in $β$-Ga$_{2}$O$_{3}$: Spectral and polarization dependent absorption and photoresponsivity

Authors: Md Mohsinur Rahman Adnan, Darpan Verma, Chris Sturm, Matthias Schubert, Roberto C. Myers

Abstract: Due to its low symmetry, $β$-Ga$_{2}$O$_{3}$ exhibits a strongly anisotropic optical response. As a result, the absorption spectra change with the polarization state of the incoming photons. To understand this phenomenon, here we calculate the complete electromagnetic wave equation solutions as a function of linear polarization angle and photon energy for $β$-Ga$_{2}$O$_{3}$ using its previously m… ▽ More Due to its low symmetry, $β$-Ga$_{2}$O$_{3}$ exhibits a strongly anisotropic optical response. As a result, the absorption spectra change with the polarization state of the incoming photons. To understand this phenomenon, here we calculate the complete electromagnetic wave equation solutions as a function of linear polarization angle and photon energy for $β$-Ga$_{2}$O$_{3}$ using its previously measured complex dielectric function tensor. The significant off-diagonal terms in this tensor can result in a non-exponential decay in the photon flux, indicating that the Beer-Lambert law is not generally valid in this anisotropic material. However, for above-band-gap spectral regions which depend on crystallographic orientations (> 5.8 eV (001 plane),>5.2 eV (010 plane)) an effective absorption coefficient well approximates the photon flux decay with depth. On the other hand, near the optical absorption edge (4.9 - 5.8 eV (001 plane),4.65 - 5.2 eV (010 plane)) the photon flux decay exhibits a sum of two exponential decays, such that two effective absorption coefficients are necessary to model the loss behavior versus the absorption depth. This behavior manifests from the presence of dichroism in $β$-Ga$_{2}$O$_{3}$. A single effective absorption coefficient can only be recovered for this energy range by augmenting the isotropic Beer-Lambert law with a critical penetration depth and polarization dependence. Using these results, we calculate the polarization-dependent photoresponsivity spectra for light polarized along different crystallographic directions. △ Less

Submitted 30 January, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

Comments: 28 pages, 10 figures

Journal ref: Phys. Rev. Applied 21, 054059 (2024)

arXiv:2305.17096 [pdf, other]

GRAtt-VIS: Gated Residual Attention for Auto Rectifying Video Instance Segmentation

Authors: Tanveer Hannan, Rajat Koner, Maximilian Bernhard, Suprosanna Shit, Bjoern Menze, Volker Tresp, Matthias Schubert, Thomas Seidl

Abstract: Recent trends in Video Instance Segmentation (VIS) have seen a growing reliance on online methods to model complex and lengthy video sequences. However, the degradation of representation and noise accumulation of the online methods, especially during occlusion and abrupt changes, pose substantial challenges. Transformer-based query propagation provides promising directions at the cost of quadratic… ▽ More Recent trends in Video Instance Segmentation (VIS) have seen a growing reliance on online methods to model complex and lengthy video sequences. However, the degradation of representation and noise accumulation of the online methods, especially during occlusion and abrupt changes, pose substantial challenges. Transformer-based query propagation provides promising directions at the cost of quadratic memory attention. However, they are susceptible to the degradation of instance features due to the above-mentioned challenges and suffer from cascading effects. The detection and rectification of such errors remain largely underexplored. To this end, we introduce \textbf{GRAtt-VIS}, \textbf{G}ated \textbf{R}esidual \textbf{Att}ention for \textbf{V}ideo \textbf{I}nstance \textbf{S}egmentation. Firstly, we leverage a Gumbel-Softmax-based gate to detect possible errors in the current frame. Next, based on the gate activation, we rectify degraded features from its past representation. Such a residual configuration alleviates the need for dedicated memory and provides a continuous stream of relevant instance features. Secondly, we propose a novel inter-instance interaction using gate activation as a mask for self-attention. This masking strategy dynamically restricts the unrepresentative instance queries in the self-attention and preserves vital information for long-term tracking. We refer to this novel combination of Gated Residual Connection and Masked Self-Attention as \textbf{GRAtt} block, which can easily be integrated into the existing propagation-based framework. Further, GRAtt blocks significantly reduce the attention overhead and simplify dynamic temporal modeling. GRAtt-VIS achieves state-of-the-art performance on YouTube-VIS and the highly challenging OVIS dataset, significantly improving over previous methods. Code is available at \url{https://github.com/Tanveer81/GRAttVIS}. △ Less

Submitted 26 May, 2023; originally announced May 2023.

Comments: 14 pages, 5 tables, 9 figures

arXiv:2303.17859 [pdf, other]

MapFormer: Boosting Change Detection by Using Pre-change Information

Authors: Maximilian Bernhard, Niklas Strauß, Matthias Schubert

Abstract: Change detection in remote sensing imagery is essential for a variety of applications such as urban planning, disaster management, and climate research. However, existing methods for identifying semantically changed areas overlook the availability of semantic information in the form of existing maps describing features of the earth's surface. In this paper, we leverage this information for change… ▽ More Change detection in remote sensing imagery is essential for a variety of applications such as urban planning, disaster management, and climate research. However, existing methods for identifying semantically changed areas overlook the availability of semantic information in the form of existing maps describing features of the earth's surface. In this paper, we leverage this information for change detection in bi-temporal images. We show that the simple integration of the additional information via concatenation of latent representations suffices to significantly outperform state-of-the-art change detection methods. Motivated by this observation, we propose the new task of *Conditional Change Detection*, where pre-change semantic information is used as input next to bi-temporal images. To fully exploit the extra information, we propose *MapFormer*, a novel architecture based on a multi-modal feature fusion module that allows for feature processing conditioned on the available semantic information. We further employ a supervised, cross-modal contrastive loss to guide the learning of visual representations. Our approach outperforms existing change detection methods by an absolute 11.7\% and 18.4\% in terms of binary change IoU on DynamicEarthNet and HRSCD, respectively. Furthermore, we demonstrate the robustness of our approach to the quality of the pre-change semantic information and the absence pre-change imagery. The code is available at https://github.com/mxbh/mapformer. △ Less

Submitted 7 December, 2023; v1 submitted 31 March, 2023; originally announced March 2023.

Comments: accepted at ICCV 2023

arXiv:2303.06999 [pdf, other]

Identifying Label Errors in Object Detection Datasets by Loss Inspection

Authors: Marius Schubert, Tobias Riedlinger, Karsten Kahl, Daniel Kröll, Sebastian Schoenen, Siniša Šegvić, Matthias Rottmann

Abstract: Labeling datasets for supervised object detection is a dull and time-consuming task. Errors can be easily introduced during annotation and overlooked during review, yielding inaccurate benchmarks and performance degradation of deep neural networks trained on noisy labels. In this work, we for the first time introduce a benchmark for label error detection methods on object detection datasets as wel… ▽ More Labeling datasets for supervised object detection is a dull and time-consuming task. Errors can be easily introduced during annotation and overlooked during review, yielding inaccurate benchmarks and performance degradation of deep neural networks trained on noisy labels. In this work, we for the first time introduce a benchmark for label error detection methods on object detection datasets as well as a label error detection method and a number of baselines. We simulate four different types of randomly introduced label errors on train and test sets of well-labeled object detection datasets. For our label error detection method we assume a two-stage object detector to be given and consider the sum of both stages' classification and regression losses. The losses are computed with respect to the predictions and the noisy labels including simulated label errors, aiming at detecting the latter. We compare our method to three baselines: a naive one without deep learning, the object detector's score and the entropy of the classification softmax distribution. We outperform all baselines and demonstrate that among the considered methods, ours is the only one that detects label errors of all four types efficiently. Furthermore, we detect real label errors a) on commonly used test datasets in object detection and b) on a proprietary dataset. In both cases we achieve low false positives rates, i.e., we detect label errors with a precision for a) of up to 71.5% and for b) with 97%. △ Less

Submitted 19 December, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

arXiv:2302.11069 [pdf]

Quantum Composites with the Functionality Defined by the Charge-Density-Wave Phase Transitions

Authors: Zahra Barani, Tekwam Geremew, Megan Stokey, Nicholas Sesing, Maedeh Taheri, Matthew J. Hilfiker, Fariborz Kargar, Mathias Schubert, Tina T. Salguero, Alexander A. Balandin

Abstract: We demonstrate a unique class of advanced materials - quantum composites based on polymers with fillers comprised of a van der Waals quantum material that reveals multiple charge-density-wave quantum condensate phases. Materials that exhibit quantum phenomena are typically crystalline, pure, and have few defects because disorder destroys the coherence of the electrons and phonons, leading to colla… ▽ More We demonstrate a unique class of advanced materials - quantum composites based on polymers with fillers comprised of a van der Waals quantum material that reveals multiple charge-density-wave quantum condensate phases. Materials that exhibit quantum phenomena are typically crystalline, pure, and have few defects because disorder destroys the coherence of the electrons and phonons, leading to collapses of the quantum states. We succeeded in preserving the macroscopic charge-density-wave phases of filler particles after multiple composite processing steps. The prepared composites manifest strong charge-density-wave phenomena even above room temperature. The dielectric constant experiences more than two orders of magnitude enhancement while the material maintains its electrically insulating properties, opening a venue for advanced applications in energy storage and electronics. The results present a conceptually different approach for engineering the properties of materials, extending the application domain for van der Waals materials. △ Less

Submitted 21 February, 2023; originally announced February 2023.

Comments: 23 pages, 5 figures

arXiv:2212.11636

Towards Causal Credit Assignment

Authors: Mátyás Schubert

Abstract: Adequately assigning credit to actions for future outcomes based on their contributions is a long-standing open challenge in Reinforcement Learning. The assumptions of the most commonly used credit assignment method are disadvantageous in tasks where the effects of decisions are not immediately evident. Furthermore, this method can only evaluate actions that have been selected by the agent, making… ▽ More Adequately assigning credit to actions for future outcomes based on their contributions is a long-standing open challenge in Reinforcement Learning. The assumptions of the most commonly used credit assignment method are disadvantageous in tasks where the effects of decisions are not immediately evident. Furthermore, this method can only evaluate actions that have been selected by the agent, making it highly inefficient. Still, no alternative methods have been widely adopted in the field. Hindsight Credit Assignment is a promising, but still unexplored candidate, which aims to solve the problems of both long-term and counterfactual credit assignment. In this thesis, we empirically investigate Hindsight Credit Assignment to identify its main benefits, and key points to improve. Then, we apply it to factored state representations, and in particular to state representations based on the causal structure of the environment. In this setting, we propose a variant of Hindsight Credit Assignment that effectively exploits a given causal structure. We show that our modification greatly decreases the workload of Hindsight Credit Assignment, making it more efficient and enabling it to outperform the baseline credit assignment method on various tasks. This opens the way to other methods based on given or learned causal structures. △ Less

Submitted 17 May, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

Comments: In case I write a paper about this thesis, I do not want them to be duplicates of each other

arXiv:2212.10836 [pdf, other]

Towards Rapid Prototy** and Comparability in Active Learning for Deep Object Detection

Authors: Tobias Riedlinger, Marius Schubert, Karsten Kahl, Hanno Gottschalk, Matthias Rottmann

Abstract: Active learning as a paradigm in deep learning is especially important in applications involving intricate perception tasks such as object detection where labels are difficult and expensive to acquire. Development of active learning methods in such fields is highly computationally expensive and time consuming which obstructs the progression of research and leads to a lack of comparability between… ▽ More Active learning as a paradigm in deep learning is especially important in applications involving intricate perception tasks such as object detection where labels are difficult and expensive to acquire. Development of active learning methods in such fields is highly computationally expensive and time consuming which obstructs the progression of research and leads to a lack of comparability between methods. In this work, we propose and investigate a sandbox setup for rapid development and transparent evaluation of active learning in deep object detection. Our experiments with commonly used configurations of datasets and detection architectures found in the literature show that results obtained in our sandbox environment are representative of results on standard configurations. The total compute time to obtain results and assess the learning behavior can thereby be reduced by factors of up to 14 when comparing with Pascal VOC and up to 32 when comparing with BDD100k. This allows for testing and evaluating data acquisition and labeling strategies in under half a day and contributes to the transparency and development speed in the field of active learning for object detection. △ Less

Submitted 21 December, 2022; originally announced December 2022.

Comments: 17 pages, 12 figures, 9 tables

arXiv:2210.15319 [pdf, other]

doi 10.1051/0004-6361/202244640

Effects of solar evolution on finite acquisition time of Fabry-Perot-Interferometers in high resolution solar physics

Authors: Rolf Schlichenmaier, Daniel Pitters, Juan Manuel Borrero, Matthias Schubert

Abstract: The imaging spectro-polarimeter VTF (Visible Tunable Filter) will be operated at the Daniel K. Inouye Solar Telescope (DKIST). Due to its capability of resolving dynamic fine structure of smaller than 0.05'', the finite acquisition time of typically 11 s affects the measurement process and potentially causes errors in deduced physical parameters. We estimate those errors and investigate ways of mi… ▽ More The imaging spectro-polarimeter VTF (Visible Tunable Filter) will be operated at the Daniel K. Inouye Solar Telescope (DKIST). Due to its capability of resolving dynamic fine structure of smaller than 0.05'', the finite acquisition time of typically 11 s affects the measurement process and potentially causes errors in deduced physical parameters. We estimate those errors and investigate ways of minimising them. We mimic the solar surface using a magneto-hydrodynamic simulation with a spatially averaged vertical field strength of 200 G. We simulate the measurement process scanning through successive wavelength points with a temporal cadence of 1 s. We synthesise FeI 617.3 nm. Besides the classical composition of the line profile, we introduce a novel method in which the intensity in each wavelength point is normalised using the simultaneous continuum intensity. Milne-Eddington inversions are used to infer the line-of-sight velocity, v(los), and the vertical (longitudinal) component of the magnetic field, B(los). We quantify systematic errors, defining the temporal average of the simulation during the measurement as the truth. We find that with the classical composition of the line profiles, errors exceed the sensitivity for v(los) and in filigree regions also for B(los). The novel method that includes normalisation reduces the measurement errors in all cases. Spatial binning without reducing the acquisition time decreases the measurement error slightly. The evolutionary time-scale in inter-granular lanes, in particular in areas with magnetic features (filigree), is shorter than the time-scale within granules. Hence less accumulations could be used for strong magnetic field in inter-granular lanes and more accumulations could be used for the weak granular magnetic fields. As a key result, we suggest to include the novel method of normalisation in corresponding data pipelines. △ Less

Submitted 27 October, 2022; originally announced October 2022.

Comments: 11 pages, 11 figures, accepted for publication in A&A

Journal ref: A&A 669, A78 (2023)

arXiv:2210.12989 [pdf, other]

Robust Object Detection in Remote Sensing Imagery with Noisy and Sparse Geo-Annotations (Full Version)

Authors: Maximilian Bernhard, Matthias Schubert

Abstract: Recently, the availability of remote sensing imagery from aerial vehicles and satellites constantly improved. For an automated interpretation of such data, deep-learning-based object detectors achieve state-of-the-art performance. However, established object detectors require complete, precise, and correct bounding box annotations for training. In order to create the necessary training annotations… ▽ More Recently, the availability of remote sensing imagery from aerial vehicles and satellites constantly improved. For an automated interpretation of such data, deep-learning-based object detectors achieve state-of-the-art performance. However, established object detectors require complete, precise, and correct bounding box annotations for training. In order to create the necessary training annotations for object detectors, imagery can be georeferenced and combined with data from other sources, such as points of interest localized by GPS sensors. Unfortunately, this combination often leads to poor object localization and missing annotations. Therefore, training object detectors with such data often results in insufficient detection performance. In this paper, we present a novel approach for training object detectors with extremely noisy and incomplete annotations. Our method is based on a teacher-student learning framework and a correction module accounting for imprecise and missing annotations. Thus, our method is easy to use and can be combined with arbitrary object detectors. We demonstrate that our approach improves standard detectors by 37.1\% $AP_{50}$ on a noisy real-world remote-sensing dataset. Furthermore, our method achieves great performance gains on two datasets with synthetic noise. Code is available at \url{https://github.com/mxbh/robust_object_detection}. △ Less

Submitted 24 October, 2022; originally announced October 2022.

arXiv:2210.12547 [pdf, other]

SurCo: Learning Linear Surrogates For Combinatorial Nonlinear Optimization Problems

Authors: Aaron Ferber, Taoan Huang, Daochen Zha, Martin Schubert, Benoit Steiner, Bistra Dilkina, Yuandong Tian

Abstract: Optimization problems with nonlinear cost functions and combinatorial constraints appear in many real-world applications but remain challenging to solve efficiently compared to their linear counterparts. To bridge this gap, we propose $\textbf{SurCo}$ that learns linear $\underline{\text{Sur}}$rogate costs which can be used in existing $\underline{\text{Co}}$mbinatorial solvers to output good solu… ▽ More Optimization problems with nonlinear cost functions and combinatorial constraints appear in many real-world applications but remain challenging to solve efficiently compared to their linear counterparts. To bridge this gap, we propose $\textbf{SurCo}$ that learns linear $\underline{\text{Sur}}$rogate costs which can be used in existing $\underline{\text{Co}}$mbinatorial solvers to output good solutions to the original nonlinear combinatorial optimization problem. The surrogate costs are learned end-to-end with nonlinear loss by differentiating through the linear surrogate solver, combining the flexibility of gradient-based methods with the structure of linear combinatorial optimization. We propose three $\texttt{SurCo}$ variants: $\texttt{SurCo}-\texttt{zero}$ for individual nonlinear problems, $\texttt{SurCo}-\texttt{prior}$ for problem distributions, and $\texttt{SurCo}-\texttt{hybrid}$ to combine both distribution and problem-specific information. We give theoretical intuition motivating $\texttt{SurCo}$, and evaluate it empirically. Experiments show that $\texttt{SurCo}$ finds better solutions faster than state-of-the-art and domain expert approaches in real-world optimization problems such as embedding table sharding, inverse photonic design, and nonlinear route planning. △ Less

Submitted 19 July, 2023; v1 submitted 22 October, 2022; originally announced October 2022.

arXiv:2210.10680 [pdf]

Local sensing of absolute refractive index during protein-binding using microlasers with spectral encoding

Authors: Soraya Caixeiro, Casper Kunstmann-Olsen, Marcel Schubert, Joseph Hill, Isla R. M. Barnard, Matthew D. Simmons, Steven Johnson, Malte C. Gather

Abstract: Multiplexed, specific and sensitive detection of antigens is critical for the rapid and accurate diagnosis of disease and the informed development of personalized treatment plans. Here, we show that polymer microsphere lasers can be used as photonic sensors to monitor and quantify direct surface binding of biomolecules via changes in the refractive index. The unique spectral signature of each indi… ▽ More Multiplexed, specific and sensitive detection of antigens is critical for the rapid and accurate diagnosis of disease and the informed development of personalized treatment plans. Here, we show that polymer microsphere lasers can be used as photonic sensors to monitor and quantify direct surface binding of biomolecules via changes in the refractive index. The unique spectral signature of each individual laser can be used to find their size and effective refractive index which adds a new encoding dimension when compared to conventional fluorescent beads. We utilize antibody-functionalized microlasers to selectively detect protein binding. Different stages of the multilayer surface modification can be resolved, and protein binding is demonstrated for two different proteins, IgG and CRP. Moreover, by continuously monitoring single lasers, we demonstrate the possibility of real-time monitoring of binding dynamics between antigens in solution phase and the immobilized antibodies. For multiplexed detection, the microlasers are employed in a flow cytometer configuration, with fast spectral detection and identification of microlasers with and without antigen binding. We envision that by combining microlasers with well-established surface modification chemistries and flow geometries, the multiplexing ability of microbead immunoassays can be strongly increased while also opening avenues for single cell profiling within heterogenous cell populations. △ Less

Submitted 19 October, 2022; originally announced October 2022.

Comments: 18 pages 5 figures and SI

arXiv:2210.07821 [pdf, ps, other]

doi 10.23919/EuCAP57121.2023.10133262

Receiver Bandwidth Extension Beyond Nyquist Using Channel Bonding

Authors: Sebastian Giehl, Carsten Andrich, Michael Schubert, Maximilian Engelhardt, Alexander Ihlow

Abstract: Current and upcoming communication and sensing technologies require ever larger bandwidths. Channel bonding can be utilized to extend a receiver's instantaneous bandwidth beyond a single converter's Nyquist limit. Two potential joint front-end and converter design approaches are theoretically introduced, realized and evaluated in this paper. The Xilinx RFSoC platform with its 5 GSa/s analog to dig… ▽ More Current and upcoming communication and sensing technologies require ever larger bandwidths. Channel bonding can be utilized to extend a receiver's instantaneous bandwidth beyond a single converter's Nyquist limit. Two potential joint front-end and converter design approaches are theoretically introduced, realized and evaluated in this paper. The Xilinx RFSoC platform with its 5 GSa/s analog to digital converters (ADCs) is used to implement both a hybrid coupler based in-phase/quadrature (I/Q) sampling and a time-interleaved sampling approach along with channel bonding. Both realizations are demonstrated to be able to reconstruct instantaneous bandwidths of 5 GHz with up to 49 dB image rejection ratio (IRR) typically within 4 to 8 dB the front-ends' theoretical limits. △ Less

Submitted 12 February, 2024; v1 submitted 14 October, 2022; originally announced October 2022.

Comments: 5 pages, 10 figures, 2 tables, conference paper

MSC Class: 94A20 (Primary); 94A12 (Secondary)

Journal ref: 2023 17th European Conference on Antennas and Propagation (EuCAP)

arXiv:2210.06101 [pdf, other]

Federated Continual Learning for Text Classification via Selective Inter-client Transfer

Authors: Yatin Chaudhary, Pranav Rai, Matthias Schubert, Hinrich Schütze, Pankaj Gupta

Abstract: In this work, we combine the two paradigms: Federated Learning (FL) and Continual Learning (CL) for text classification task in cloud-edge continuum. The objective of Federated Continual Learning (FCL) is to improve deep learning models over life time at each client by (relevant and efficient) knowledge transfer without sharing data. Here, we address challenges in minimizing inter-client interfere… ▽ More In this work, we combine the two paradigms: Federated Learning (FL) and Continual Learning (CL) for text classification task in cloud-edge continuum. The objective of Federated Continual Learning (FCL) is to improve deep learning models over life time at each client by (relevant and efficient) knowledge transfer without sharing data. Here, we address challenges in minimizing inter-client interference while knowledge sharing due to heterogeneous tasks across clients in FCL setup. In doing so, we propose a novel framework, Federated Selective Inter-client Transfer (FedSeIT) which selectively combines model parameters of foreign clients. To further maximize knowledge transfer, we assess domain overlap and select informative tasks from the sequence of historical tasks at each foreign client while preserving privacy. Evaluating against the baselines, we show improved performance, a gain of (average) 12.4\% in text classification over a sequence of tasks using five datasets from diverse domains. To the best of our knowledge, this is the first work that applies FCL to NLP. △ Less

Submitted 12 February, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

Comments: EMNLP2022 (Findings): 11 pages, 5 figures, 4 tables

arXiv:2209.15527 [pdf]

doi 10.1063/5.0106939

Remote Surface Optical Phonon Scattering in Ferroelectric Ba$_{0.6}$Sr$_{0.4}$TiO$_{3}$ Gated Graphene

Authors: Hanying Chen, Tianlin Li, Yifei Hao, Anil Rajapitamahuni, Zhiyong Xiao, Stefan Schoeche, Mathias Schubert, Xia Hong

Abstract: We report the effect of remote surface optical (RSO) phonon scattering on carrier mobility in monolayer graphene gated by ferroelectric oxide. We fabricate monolayer graphene transistors back-gated by epitaxial (001) Ba$_{0.6}$Sr$_{0.4}$TiO$_{3}$ films, with field effect mobility up to 23,000 cm$^{2}$V$^{-1}$s$^{-1}$ achieved. Switching the ferroelectric polarization induces nonvolatile modulation… ▽ More We report the effect of remote surface optical (RSO) phonon scattering on carrier mobility in monolayer graphene gated by ferroelectric oxide. We fabricate monolayer graphene transistors back-gated by epitaxial (001) Ba$_{0.6}$Sr$_{0.4}$TiO$_{3}$ films, with field effect mobility up to 23,000 cm$^{2}$V$^{-1}$s$^{-1}$ achieved. Switching the ferroelectric polarization induces nonvolatile modulation of resistance and quantum Hall effect in graphene at low temperatures. Ellipsometry spectroscopy studies reveal four pairs of optical phonon modes in Ba$_{0.6}$Sr$_{0.4}$TiO$_{3}$, from which we extract the RSO phonon frequencies. The temperature dependence of resistivity in graphene can be well accounted for by considering the scattering from the intrinsic longitudinal acoustic phonon and the RSO phonon, with the latter dominated by the mode at 35.8 meV. Our study reveals the room temperature mobility limit of ferroelectric-gated graphene transistors imposed by RSO phonon scattering. △ Less

Submitted 30 September, 2022; originally announced September 2022.

Comments: 15 pages, 6 figures

arXiv:2209.06025 [pdf, other]

Strain and composition dependencies of the near bandgap optical transitions in monoclinic (Al$_x$Ga$_{1-x}$)$_2$O$_3$ alloys with coherent biaxial in-plane strain on (010) Ga$_2$O$_3$

Authors: Rafał Korlacki, Matthew Hilfiker, Jenna Knudtson, Megan Stokey, Ufuk Kilic, Akhil Mauze, Yuewei Zhang, James Speck, Vanya Darakchieva, Mathias Schubert

Abstract: The bowing of the energy of the three lowest band-to-band transitions in $β$-(Al$_{x}$Ga$_{1-x}$)$_2$O$_3$ alloys was resolved using a combined density functional theory (DFT) and generalized spectroscopic ellipsometry (GSE) approach. The DFT calculations of the electronic band structure of both, $β$-Ga$_2$O$_3$ and $θ$-Al$_2$O$_3$, allow extracting of the linear portion of the energy shift in the… ▽ More The bowing of the energy of the three lowest band-to-band transitions in $β$-(Al$_{x}$Ga$_{1-x}$)$_2$O$_3$ alloys was resolved using a combined density functional theory (DFT) and generalized spectroscopic ellipsometry (GSE) approach. The DFT calculations of the electronic band structure of both, $β$-Ga$_2$O$_3$ and $θ$-Al$_2$O$_3$, allow extracting of the linear portion of the energy shift in the alloys, and provide a method for quantifying the role of coherent strain present in the $β$-(Al$_{x}$Ga$_{1-x}$)$_2$O$_3$ thin films on (010) $β$-Ga$_2$O$_3$ substrates. The energies of band-to-band transitions were obtained using the spectroscopic ellipsometry eigenpolarization model approach [A. Mock et al., Phys. Rev. B 95, 165202 (2017)]. After subtracting the effects of strain which also induces additional bowing and after subtraction of the linear portion of the energy shift due to alloying, the bowing parameters associated with the three lowest band-to-band transitions in monoclinic $β$-(Al$_{x}$Ga$_{1-x}$)$_2$O$_3$ are found. △ Less

Submitted 13 September, 2022; originally announced September 2022.

Comments: 12 pages, 5 figures

arXiv:2208.10547 [pdf, other]

InstanceFormer: An Online Video Instance Segmentation Framework

Authors: Rajat Koner, Tanveer Hannan, Suprosanna Shit, Sahand Sharifzadeh, Matthias Schubert, Thomas Seidl, Volker Tresp

Abstract: Recent transformer-based offline video instance segmentation (VIS) approaches achieve encouraging results and significantly outperform online approaches. However, their reliance on the whole video and the immense computational complexity caused by full Spatio-temporal attention limit them in real-life applications such as processing lengthy videos. In this paper, we propose a single-stage transfor… ▽ More Recent transformer-based offline video instance segmentation (VIS) approaches achieve encouraging results and significantly outperform online approaches. However, their reliance on the whole video and the immense computational complexity caused by full Spatio-temporal attention limit them in real-life applications such as processing lengthy videos. In this paper, we propose a single-stage transformer-based efficient online VIS framework named InstanceFormer, which is especially suitable for long and challenging videos. We propose three novel components to model short-term and long-term dependency and temporal coherence. First, we propagate the representation, location, and semantic information of prior instances to model short-term changes. Second, we propose a novel memory cross-attention in the decoder, which allows the network to look into earlier instances within a certain temporal window. Finally, we employ a temporal contrastive loss to impose coherence in the representation of an instance across all frames. Memory attention and temporal coherence are particularly beneficial to long-range dependency modeling, including challenging scenarios like occlusion. The proposed InstanceFormer outperforms previous online benchmark methods by a large margin across multiple datasets. Most importantly, InstanceFormer surpasses offline approaches for challenging and long datasets such as YouTube-VIS-2021 and OVIS. Code is available at https://github.com/rajatkoner08/InstanceFormer. △ Less

Submitted 22 August, 2022; originally announced August 2022.

Report number: InstanceFormer:08-22

Journal ref: Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI-2023)

arXiv:2208.01735 [pdf, other]

V-Coder: Adaptive AutoEncoder for Semantic Disclosure in Knowledge Graphs

Authors: Christian M. M. Frey, Matthias Schubert

Abstract: Semantic Web or Knowledge Graphs (KG) emerged to one of the most important information source for intelligent systems requiring access to structured knowledge. One of the major challenges is the extraction and processing of unambiguous information from textual data. Following the human perception, overlap** semantic linkages between two named entities become clear due to our common-sense about t… ▽ More Semantic Web or Knowledge Graphs (KG) emerged to one of the most important information source for intelligent systems requiring access to structured knowledge. One of the major challenges is the extraction and processing of unambiguous information from textual data. Following the human perception, overlap** semantic linkages between two named entities become clear due to our common-sense about the context a relationship lives in which is not the case when we look at it from an automatically driven process of a machine. In this work, we are interested in the problem of Relational Resolution within the scope of KGs, i.e, we are investigating the inherent semantic of relationships between entities within a network. We propose a new adaptive AutoEncoder, called V-Coder, to identify relations inherently connecting entities from different domains. Those relations can be considered as being ambiguous and are candidates for disentanglement. Likewise to the Adaptive Learning Theory (ART), our model learns new patterns from the KG by increasing units in a competitive layer without discarding the previous observed patterns whilst learning the quality of each relation separately. The evaluation on real-world datasets of Freebase, Yago and NELL shows that the V-Coder is not only able to recover links from corrupted input data, but also shows that the semantic disclosure of relations in a KG show the tendency to improve link prediction. A semantic evaluation wraps the evaluation up. △ Less

Submitted 22 July, 2022; originally announced August 2022.

arXiv:2206.04615 [pdf, other]

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-future capabilities and limitations of language models. To address this challenge, we introduce the Beyond the Imitation Game benchmark (BIG-bench). BIG-bench currently consists of 204 tasks, contributed by 450 authors across 132 institutions. Task topics are diverse, drawing problems from linguistics, childhood development, math, common-sense reasoning, biology, physics, social bias, software development, and beyond. BIG-bench focuses on tasks that are believed to be beyond the capabilities of current language models. We evaluate the behavior of OpenAI's GPT models, Google-internal dense transformer architectures, and Switch-style sparse transformers on BIG-bench, across model sizes spanning millions to hundreds of billions of parameters. In addition, a team of human expert raters performed all tasks in order to provide a strong baseline. Findings include: model performance and calibration both improve with scale, but are poor in absolute terms (and when compared with rater performance); performance is remarkably similar across model classes, though with benefits from sparsity; tasks that improve gradually and predictably commonly involve a large knowledge or memorization component, whereas tasks that exhibit "breakthrough" behavior at a critical scale often involve multiple steps or components, or brittle metrics; social bias typically increases with scale in settings with ambiguous context, but this can be improved with prompting. △ Less

Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

arXiv:2206.00766 [pdf, other]

Simulating the 1976 Teton Dam Failure using Geoclaw and HEC-RAS and comparing with Historical Observations

Authors: Hannah Spero, Donna Calhoun, Michael Schubert

Abstract: Dam failures occur worldwide, often from factors including aging structures, extreme hydrologic loading, and design oversights related to the changing climate. Understanding and mitigating risk to downstream inhabited areas require develo** and improving low-cost high-fidelity tools, such as numerical models, which allow emergency managers to predict the consequences of dam failures better. Two-… ▽ More Dam failures occur worldwide, often from factors including aging structures, extreme hydrologic loading, and design oversights related to the changing climate. Understanding and mitigating risk to downstream inhabited areas require develo** and improving low-cost high-fidelity tools, such as numerical models, which allow emergency managers to predict the consequences of dam failures better. Two-dimensional (2D) depth-averaged hydraulic models can provide valuable insights into the importance of breach parameters or downstream flow characteristics, but historical studies considering historic failures using real topographies are less common in literature. This study compares Geoclaw, a 2D hydraulic model with adaptive mesh refinement capabilities, to an industry-standard software HEC-RAS (Hydrologic Engineering Center - River Analysis System) using the 1976 Teton Dam failure as a case study. The suitability of Geoclaw for dam failure modeling is determined based on its capability to resolve inundation extent and flood wave arrival times. This study performs sensitivity analyses of the HEC-RAS model to compare an instantaneous dam breach assumption with a time-dependent breach formation for quantifying the model uncertainty. We find the 2D Geoclaw dam-break model results compare reasonably with historical gauge and field observational data and HEC-RAS results. The model demonstrates stability and relatively low computational costs. Our findings highlight opportunities for future work, with the Geoclaw software performance supporting continued studies to evaluate performance. Outcomes of this study will assist dam owners, floodplain managers, and emergency managers by providing an additional tool for estimating the impacts of dam failures to protect lives and infrastructure downstream. △ Less

Submitted 17 July, 2022; v1 submitted 15 May, 2022; originally announced June 2022.

arXiv:2205.14917 [pdf, other]

Uncertainty Quantification and Resource-Demanding Computer Vision Applications of Deep Learning

Authors: Julian Burghoff, Robin Chan, Hanno Gottschalk, Annika Muetze, Tobias Riedlinger, Matthias Rottmann, Marius Schubert

Abstract: Bringing deep neural networks (DNNs) into safety critical applications such as automated driving, medical imaging and finance, requires a thorough treatment of the model's uncertainties. Training deep neural networks is already resource demanding and so is also their uncertainty quantification. In this overview article, we survey methods that we developed to teach DNNs to be uncertain when they en… ▽ More Bringing deep neural networks (DNNs) into safety critical applications such as automated driving, medical imaging and finance, requires a thorough treatment of the model's uncertainties. Training deep neural networks is already resource demanding and so is also their uncertainty quantification. In this overview article, we survey methods that we developed to teach DNNs to be uncertain when they encounter new object classes. Additionally, we present training methods to learn from only a few labels with help of uncertainty quantification. Note that this is typically paid with a massive overhead in computation of an order of magnitude and more compared to ordinary network training. Finally, we survey our work on neural architecture search which is also an order of magnitude more resource demanding then ordinary network training. △ Less

Submitted 30 May, 2022; originally announced May 2022.

MSC Class: 68T45; 62-07

arXiv:2205.00429 [pdf, ps, other]

Closed-form max-min power control for some cellular and cell-free massive MIMO networks

Authors: Lorenzo Miretti, Renato L. G. Cavalcante, Slawomir Stanczak, Martin Schubert, Ronald Boehnke, Wen Xu

Abstract: Many common instances of power control problems for cellular and cell-free massive MIMO networks can be interpreted as max-min utility optimization problems involving affine interference map**s and polyhedral constraints. We show that these problems admit a closed-form solution which depends on the spectral radius of known matrices. In contrast, previous solutions in the literature have been ind… ▽ More Many common instances of power control problems for cellular and cell-free massive MIMO networks can be interpreted as max-min utility optimization problems involving affine interference map**s and polyhedral constraints. We show that these problems admit a closed-form solution which depends on the spectral radius of known matrices. In contrast, previous solutions in the literature have been indirectly obtained using iterative algorithms based on the bisection method, or on fixed-point iterations. Furthermore, we also show an asymptotically tight bound for the optimal utility, which in turn provides a simple rule of thumb for evaluating whether the network is operating in the noise or interference limited regime. We finally illustrate our results by focusing on classical max-min fair power control for cell-free massive MIMO networks. △ Less

Submitted 3 May, 2022; v1 submitted 1 May, 2022; originally announced May 2022.

arXiv:2202.07025 [pdf, other]

Box Supervised Video Segmentation Proposal Network

Authors: Tanveer Hannan, Rajat Koner, Jonathan Kobold, Matthias Schubert

Abstract: Video Object Segmentation (VOS) has been targeted by various fully-supervised and self-supervised approaches. While fully-supervised methods demonstrate excellent results, self-supervised ones, which do not use pixel-level ground truth, attract much attention. However, self-supervised approaches pose a significant performance gap. Box-level annotations provide a balanced compromise between labelin… ▽ More Video Object Segmentation (VOS) has been targeted by various fully-supervised and self-supervised approaches. While fully-supervised methods demonstrate excellent results, self-supervised ones, which do not use pixel-level ground truth, attract much attention. However, self-supervised approaches pose a significant performance gap. Box-level annotations provide a balanced compromise between labeling effort and result quality for image segmentation but have not been exploited for the video domain. In this work, we propose a box-supervised video object segmentation proposal network, which takes advantage of intrinsic video properties. Our method incorporates object motion in the following way: first, motion is computed using a bidirectional temporal difference and a novel bounding box-guided motion compensation. Second, we introduce a novel motion-aware affinity loss that encourages the network to predict positive pixel pairs if they share similar motion and color. The proposed method outperforms the state-of-the-art self-supervised benchmark by 16.4% and 6.9% $\mathcal{J}$ &$\mathcal{F}$ score and the majority of fully supervised methods on the DAVIS and Youtube-VOS dataset without imposing network architectural specifications. We provide extensive tests and ablations on the datasets, demonstrating the robustness of our method. △ Less

Submitted 16 February, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

arXiv:2202.05388 [pdf, other]

Massively parallel pixel-by-pixel nanophotonic optimization using a Green's function formalism

Authors: Jiahui Wang, Alfred K. C. Cheung, Aleksandra Spyra, Ian A. D. Williamson, Jian Guan, Martin F. Schubert

Abstract: We introduce an efficient parallelization scheme to implement pixel-by-pixel nanophotonic optimization using a Green's function based formalism. The crucial insight in our proposal is the reframing of the optimization algorithm as a large-scale data processing pipeline, which allows for the efficient distribution of computational tasks across thousands of workers. We demonstrate the utility of our… ▽ More We introduce an efficient parallelization scheme to implement pixel-by-pixel nanophotonic optimization using a Green's function based formalism. The crucial insight in our proposal is the reframing of the optimization algorithm as a large-scale data processing pipeline, which allows for the efficient distribution of computational tasks across thousands of workers. We demonstrate the utility of our implementation by exercising it to optimize a high numerical aperture focusing metalens at problem sizes that would otherwise be far out of reach for the Green's function based method. Finally, we highlight the connection to powerful ideas from reinforcement learning as a natural corollary of reinterpreting the nanophotonic inverse design problem as a graph traversal enabled by the pixel-by-pixel optimization paradigm. △ Less

Submitted 10 February, 2022; originally announced February 2022.

Comments: 10 pages, 7 figures

arXiv:2201.12965 [pdf, other]

doi 10.1021/acsphotonics.2c00313

Inverse design of photonic devices with strict foundry fabrication constraints

Authors: Martin F. Schubert, Alfred K. C. Cheung, Ian A. D. Williamson, Aleksandra Spyra, David H. Alexander

Abstract: We introduce a new method for inverse design of nanophotonic devices which guarantees that resulting designs satisfy strict length scale constraints - including minimum width and spacing constraints required by commercial semiconductor foundries. The method adopts several concepts from machine learning to transform the problem of topology optimization with strict length scale constraints to an unc… ▽ More We introduce a new method for inverse design of nanophotonic devices which guarantees that resulting designs satisfy strict length scale constraints - including minimum width and spacing constraints required by commercial semiconductor foundries. The method adopts several concepts from machine learning to transform the problem of topology optimization with strict length scale constraints to an unconstrained stochastic gradient optimization problem. Specifically, we introduce a conditional generator for feasible designs and adopt a straight-through estimator for backpropagation of gradients to a latent design. We demonstrate the performance and reliability of our method by designing several common integrated photonic components. △ Less

Submitted 13 June, 2022; v1 submitted 30 January, 2022; originally announced January 2022.

Comments: 16 pages, 17 figures

Journal ref: ACS Photonics, vol. 9, no. 7, pp. 2327-2336, Jun. 2022

arXiv:2201.06695 [pdf, other]

doi 10.1063/5.0082353

Terahertz electron paramagnetic resonance generalized spectroscopic ellipsometry: The magnetic response of the nitrogen defect in 4H-SiC

Authors: Mathias Schubert, Sean Knight, Steffen Richter, Philipp Kühne, Vallery Stanishev, Alexander Ruder, Megan Stokey, Rafal Korlacki, Klaus Irmscher, Petr Neugebauer, Vanya Darakchieva

Abstract: We report on terahertz (THz) electron paramagnetic resonance generalized spectroscopic ellipsometry (THz-EPR-GSE). Measurements of the field and frequency dependencies of the magnetic response due to the spin transitions associated with the nitrogen defect in 4H-SiC are shown as an example. THz-EPR-GSE dispenses with the need of a cavity, permits independently scanning field and frequency paramete… ▽ More We report on terahertz (THz) electron paramagnetic resonance generalized spectroscopic ellipsometry (THz-EPR-GSE). Measurements of the field and frequency dependencies of the magnetic response due to the spin transitions associated with the nitrogen defect in 4H-SiC are shown as an example. THz-EPR-GSE dispenses with the need of a cavity, permits independently scanning field and frequency parameters, and does not require field or frequency modulation. We investigate spin transitions of hexagonal ($h$) and cubic ($k$) coordinated nitrogen including coupling with its nuclear spin (I=1), and we propose a model approach for the magnetic susceptibility to account for the spin transitions. From the THz-EPR-GSE measurements we can fully determine the polarization properties of the spin transitions and we obtain $g$ and hyperfine splitting parameters using magnetic field and frequency dependent Lorentzian oscillator lineshape functions. We propose frequency-scanning THz-EPR-GSE as a new and versatile method to study properties of spins in solid state materials. △ Less

Submitted 17 January, 2022; originally announced January 2022.

Comments: 6 pages, 5 figures

arXiv:2111.11388 [pdf, other]

Conifer Seedling Detection in UAV-Imagery with RGB-Depth Information

Authors: Jason Jooste, Michael Fromm, Matthias Schubert

Abstract: Monitoring of reforestation is currently being considerably streamlined through the use of drones and image recognition algorithms, which have already proven to be effective on colour imagery. In addition to colour imagery, elevation data is often also available. The primary aim of this work was to improve the performance of the faster-RCNN object detection algorithm by integrating this height inf… ▽ More Monitoring of reforestation is currently being considerably streamlined through the use of drones and image recognition algorithms, which have already proven to be effective on colour imagery. In addition to colour imagery, elevation data is often also available. The primary aim of this work was to improve the performance of the faster-RCNN object detection algorithm by integrating this height information, which showed itself to notably improve performance. Interestingly, the structure of the network played a key role, with direct addition of the height information as a fourth image channel showing no improvement, while integration after the backbone network and before the region proposal network led to marked improvements. This effect persisted with very long training regimes. Increasing the resolution of this height information also showed little effect. △ Less

Submitted 22 November, 2021; originally announced November 2021.

arXiv:2110.14284 [pdf, other]

APPTeK: Agent-Based Predicate Prediction in Temporal Knowledge Graphs

Authors: Christian M. M. Frey, Yunpu Ma, Matthias Schubert

Abstract: In temporal Knowledge Graphs (tKGs), the temporal dimension is attached to facts in a knowledge base resulting in quadruples between entities such as (Nintendo, released, Super Mario, Sep-13-1985), where the predicate holds within a time interval or at a timestamp. We propose a reinforcement learning agent gathering temporal relevant information about the query entities' neighborhoods, simultaneou… ▽ More In temporal Knowledge Graphs (tKGs), the temporal dimension is attached to facts in a knowledge base resulting in quadruples between entities such as (Nintendo, released, Super Mario, Sep-13-1985), where the predicate holds within a time interval or at a timestamp. We propose a reinforcement learning agent gathering temporal relevant information about the query entities' neighborhoods, simultaneously. We refer to the encodings of the explored graph structures as fingerprints which are used as input to a Q-network. Our agent decides sequentially which relation type needs to be explored next to expand the local subgraphs of the query entities. Our evaluation shows that the proposed method yields competitive results compared to state-of-the-art embedding algorithms for tKGs, and we additionally gain information about the relevant structures between subjects and objects. △ Less

Submitted 21 July, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

arXiv:2110.10674 [pdf, other]

SEA: Graph Shell Attention in Graph Neural Networks

Authors: Christian M. M. Frey, Yunpu Ma, Matthias Schubert

Abstract: A common issue in Graph Neural Networks (GNNs) is known as over-smoothing. By increasing the number of iterations within the message-passing of GNNs, the nodes' representations of the input graph align with each other and become indiscernible. Recently, it has been shown that increasing a model's complexity by integrating an attention mechanism yields more expressive architectures. This is majorly… ▽ More A common issue in Graph Neural Networks (GNNs) is known as over-smoothing. By increasing the number of iterations within the message-passing of GNNs, the nodes' representations of the input graph align with each other and become indiscernible. Recently, it has been shown that increasing a model's complexity by integrating an attention mechanism yields more expressive architectures. This is majorly contributed to steering the nodes' representations only towards nodes that are more informative than others. Transformer models in combination with GNNs result in architectures including Graph Transformer Layers (GTL), where layers are entirely based on the attention operation. However, the calculation of a node's representation is still restricted to the computational working flow of a GNN. In our work, we relax the GNN architecture by means of implementing a routing heuristic. Specifically, the nodes' representations are routed to dedicated experts. Each expert calculates the representations according to their respective GNN workflow. The definitions of distinguishable GNNs result from k-localized views starting from the central node. We call this procedure Graph Shell Attention (SEA), where experts process different subgraphs in a transformer-motivated fashion. Intuitively, by increasing the number of experts, the models gain in expressiveness such that a node's representation is solely based on nodes that are located within the receptive field of an expert. We evaluate our architecture on various benchmark datasets showing competitive results compared to state-of-the-art models. △ Less

Submitted 20 October, 2021; originally announced October 2021.

arXiv:2108.12058 [pdf, ps, other]

Infrared dielectric functions and Brillouin zone center phonons of $α$-Ga$_2$O$_3$ compared to $α$-Al$_2$O$_3$

Authors: Megan Stokey, Rafal Korlacki, Matthew Hilfiker, Sean Knight, Steffen Richter, Vanya Darakchieva, Riena **no, Yong** Cho, Huili Grace Xing, Debdeep Jena, Yuichi Oshima, Kamruzzaman Khan, Elaheh Ahmadi, Mathias Schubert

Abstract: We determine the anisotropic dielectric functions of rhombohedral $α$-Ga$_2$O$_3$ by far-infrared and infrared generalized spectroscopic ellipsometry and derive all transverse optical and longitudinal optical phonon mode frequencies and broadening parameters. We also determine the high frequency and static dielectric constants. We perform density functional theory computations and determine the ph… ▽ More We determine the anisotropic dielectric functions of rhombohedral $α$-Ga$_2$O$_3$ by far-infrared and infrared generalized spectroscopic ellipsometry and derive all transverse optical and longitudinal optical phonon mode frequencies and broadening parameters. We also determine the high frequency and static dielectric constants. We perform density functional theory computations and determine the phonon dispersion for all branches in the Brillouin zone, and we derive all phonon mode parameters at the Brillouin zone center including Raman-active, infrared-active, and silent modes. Excellent agreement is obtained between our experimental and computation results as well as among all previously reported partial information from experiment and theory. We also compute the same information for $α$-Al$_2$O$_3$, the binary parent compound for the emerging alloy of $α$-(Al$_{x}$Ga$_{1-x}$)$_2$O$_3$, and use results from previous investigations [Schubert, Tiwald, and Herzinger, Phys. Rev. B 61, 8187 (2000)] to compare all properties among the two isostructural compounds. From both experimental and theoretical investigations we compute the frequency shifts of all modes between the two compounds. Additionally, we calculate overlap parameters between phonon mode eigenvectors and discuss the possible evolution of all phonon modes into the ternary alloy system and whether modes may form single mode or more complex mode behaviors. △ Less

Submitted 26 August, 2021; originally announced August 2021.

Showing 1–50 of 126 results for author: Schubert, M