-
Origin of the Anisotropic Beer-Lambert Law from Dichroism and Birefringence in $β$-Ga$_2$O$_3$
Authors:
Md Mohsinur Rahman Adnan,
Mathias Schubert,
Roberto C. Myers
Abstract:
The anisotropic optical absorption edge of $β$-Ga$_2$O$_3$ follows a modified Beer-Lambert law having two effective absorption coefficients. The absorption coefficient of linearly polarized light reduces to the least absorbing direction beyond a critical penetration depth, which itself depends on polarization and wavelength. To understand this behavior, a Stokes vector analysis is performed to tra…
▽ More
The anisotropic optical absorption edge of $β$-Ga$_2$O$_3$ follows a modified Beer-Lambert law having two effective absorption coefficients. The absorption coefficient of linearly polarized light reduces to the least absorbing direction beyond a critical penetration depth, which itself depends on polarization and wavelength. To understand this behavior, a Stokes vector analysis is performed to track the polarization state as a function of depth. The weakening of the absorption coefficient is associated with a gradual shift of linear polarization to the least absorbing crystallographic direction in the plane, which is along the a-exciton within the (010) plane or along the b-exciton in the (001) plane. We show that strong linear dichroism near the optical absorption edge causes this shift in $β$-Ga$_2$O$_3$, which arises from the anisotropy and spectral splitting of the physical absorbers i.e., excitons. The linear polarization shift is accompanied by a variation in the ellipticity due to the birefringence of $β$-Ga$_2$O$_3$. Analysis of the phase relationship between the incoming electric field to that at a certain depth reveals the phase speed as an effective refractive index, which varies along different crystallographic directions. The critical penetration depth is shown to be correlated with the depth at which the ellipticity is maximal. Thus, the anisotropic Beer-Lambert law arises from the interplay of both the dichroic and birefringent properties of $β$-Ga$_2$O$_3$.
△ Less
Submitted 28 June, 2024;
originally announced July 2024.
-
The strain-stress relationships for coherent in-plane strain in heterostructures with monoclinic crystal systems: $β$-(Al$_x$Ga$_{1-x}$)$_2$O$_3$ on $(h0l)$ $β$-Ga$_2$O$_3$ as example
Authors:
Mathias Schubert,
Rafal Korlacki,
Vanya Darakchieva
Abstract:
In this work we derive the state of strain or stress under symmetry conserving conditions in pseudomorphic lattices with monoclinic symmetry. We compare surface vectors across the template epitaxial layer interface and impose conditions of a stress free epitaxial layer. As a result, we demonstrate the existence, in theory, of exactly three possible unit cells which can establish onto a given templ…
▽ More
In this work we derive the state of strain or stress under symmetry conserving conditions in pseudomorphic lattices with monoclinic symmetry. We compare surface vectors across the template epitaxial layer interface and impose conditions of a stress free epitaxial layer. As a result, we demonstrate the existence, in theory, of exactly three possible unit cells which can establish onto a given template. We demonstrate this approach for a class of templates with $(h0l)$ planes and $β$-(Al$_x$Ga$_{1-x}$)$_2$O$_3$ on $(h0l)$ $β$-Ga$_2$O$_3$. We discuss the effects of composition $x$ and surface orientation onto the formation of three elastically stable unit cells, their strain and stress tensors, unit cell axes, unit cell volumes, lattice spacing, elastic potential energies, and stress free directions. The previous paradigm for epitaxial layer growth where the stress free direction is always perpendicular to the growing surface is not generally valid for low symmetry materials. In the example here, we find two possible competing domains with stress free direction oblique to the surface of the template for almost all planes $(h0l)$. We calculate the band-to-band transitions for $β$-(Al$_{0.1}$Ga$_{0.9}$)$_2$O$_3$ on $(h0l)$ $β$-Ga$_2$O$_3$ using the composition dependent deformation parameters and elastic coefficients reported prevoiously [Korlacki~\textit{et al.} Phys. Rev. Appl.~\textbf{18}, 064019 (2022)].
△ Less
Submitted 25 May, 2024;
originally announced May 2024.
-
The paramagnetic Lyddane-Sachs-Teller relation
Authors:
Viktor Rindert,
Vanya Darakchieva,
Tapati Sarkar,
Mathias Schubert
Abstract:
In this letter, we derive an expression for magnetic dipole transitions that is analogous to the Lyddane-Sachs-Teller relation for dielectric polar lattice vibrations. We thereby define transverse and longitudinal optical frequencies at which paramagnetic resonance and antiresonance occurs, respectively. The relation found here thus permits non-invasive optical analysis of static magnetization pro…
▽ More
In this letter, we derive an expression for magnetic dipole transitions that is analogous to the Lyddane-Sachs-Teller relation for dielectric polar lattice vibrations. We thereby define transverse and longitudinal optical frequencies at which paramagnetic resonance and antiresonance occurs, respectively. The relation found here thus permits non-invasive optical analysis of static magnetization properties in paramagnetic materials. Validated through terahertz electron paramagnetic resonance ellipsometry and superconducting quantum interference device measurements on Iron-doped Gallium Nitride, our findings show very good agreement between theory and experiment. We term the excitations associated with the paramagnetic transitions as paramagnetic polaritons which may find use in future photonic applications
△ Less
Submitted 24 May, 2024;
originally announced May 2024.
-
Plasmonic Nanocavity to Boost Single Photon Emission from Defects in Thin Hexagonal Boron Nitride
Authors:
Mohammadjavad Dowran,
Ufuk Kilic,
Suvechhya Lamichhane,
Adam Erickson,
Joshua Barker,
Mathias Schubert,
Sy-Hwang Liou,
Christos Argyropoulos,
Abdelghani Laraoui
Abstract:
Efficient and compact single photon emission platforms operating at room temperature with ultrafast speed and high brightness will be fundamental components of the emerging quantum communications and computing fields. However, so far, it has been very challenging to design practical deterministic single photon emitters based on nanoscale solid state materials that meet the fast emission rate and s…
▽ More
Efficient and compact single photon emission platforms operating at room temperature with ultrafast speed and high brightness will be fundamental components of the emerging quantum communications and computing fields. However, so far, it has been very challenging to design practical deterministic single photon emitters based on nanoscale solid state materials that meet the fast emission rate and strong brightness demands. Here we provide a solution to this longstanding problem by using metallic nanocavities integrated with hexagonal boron nitride (hBN) flakes with defects acting as nanoscale single photon emitters (SPEs) at room temperature. The presented hybrid nanophotonic structure creates a rapid speedup and large enhancement in single photon emission at room temperature. Hence, the nonclassical light emission performance is substantially improved compared to plain hBN flakes and hBN on gold layered structures without nanocavity. Extensive theoretical calculations are also performed to accurately model the new hybrid nanophotonic system and prove that the incorporation of plasmonic nanocavity is key to the efficient SPE performance. The proposed quantum nanocavity single photon source is expected to be an element of paramount importance to the envisioned room temperature integrated quantum photonic networks.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
Context Matters: Leveraging Spatiotemporal Metadata for Semi-Supervised Learning on Remote Sensing Images
Authors:
Maximilian Bernhard,
Tanveer Hannan,
Niklas Strauß,
Matthias Schubert
Abstract:
Remote sensing projects typically generate large amounts of imagery that can be used to train powerful deep neural networks. However, the amount of labeled images is often small, as remote sensing applications generally require expert labelers. Thus, semi-supervised learning (SSL), i.e., learning with a small pool of labeled and a larger pool of unlabeled data, is particularly useful in this domai…
▽ More
Remote sensing projects typically generate large amounts of imagery that can be used to train powerful deep neural networks. However, the amount of labeled images is often small, as remote sensing applications generally require expert labelers. Thus, semi-supervised learning (SSL), i.e., learning with a small pool of labeled and a larger pool of unlabeled data, is particularly useful in this domain. Current SSL approaches generate pseudo-labels from model predictions for unlabeled samples. As the quality of these pseudo-labels is crucial for performance, utilizing additional information to improve pseudo-label quality yields a promising direction. For remote sensing images, geolocation and recording time are generally available and provide a valuable source of information as semantic concepts, such as land cover, are highly dependent on spatiotemporal context, e.g., due to seasonal effects and vegetation zones. In this paper, we propose to exploit spatiotemporal metainformation in SSL to improve the quality of pseudo-labels and, therefore, the final model performance. We show that directly adding the available metadata to the input of the predictor at test time degenerates the prediction quality for metadata outside the spatiotemporal distribution of the training set. Thus, we propose a teacher-student SSL framework where only the teacher network uses metainformation to improve the quality of pseudo-labels on the training set. Correspondingly, our student network benefits from the improved pseudo-labels but does not receive metadata as input, making it invariant to spatiotemporal shifts at test time. Furthermore, we propose methods for encoding and injecting spatiotemporal information into the model and introduce a novel distillation mechanism to enhance the knowledge transfer between teacher and student. Our framework dubbed Spatiotemporal SSL can be easily combined with several stat...
△ Less
Submitted 29 April, 2024;
originally announced April 2024.
-
Bloch equations in Terahertz magnetic-resonance ellipsometry
Authors:
Viktor Rindert,
Steffen Richter,
Philipp Kühne,
Alexander Ruder,
Vanya Darakchieva,
Mathias Schubert
Abstract:
A generalized approach derived from Blochs equation of motion of nuclear magnetic moments is presented to model the frequency, magnetic field, spin density, and temperature dependencies in the electromagnetic permeability tensor for materials with magnetic resonances. The resulting tensor model predicts characteristic polarization signatures which can be observed, for example, in fully polarizatio…
▽ More
A generalized approach derived from Blochs equation of motion of nuclear magnetic moments is presented to model the frequency, magnetic field, spin density, and temperature dependencies in the electromagnetic permeability tensor for materials with magnetic resonances. The resulting tensor model predicts characteristic polarization signatures which can be observed, for example, in fully polarization-resolved Mueller matrix element spectra measured across magnetic resonances as a function of frequency, magnetic field, magnetic moment density, and temperature. When augmented with thermodynamic considerations and suitable Hamiltonian description of the magnetic eigenvalue spectrum, important parameters such as zero-frequency magnetization, spectral amplitude distribution, relaxation time constants, and geometrical orientation parameters of the magnetic moment density can be obtained from comparing the generalized model approach to experimental data. We demonstrate our approach by comparing model calculations with full Mueller matrix element spectra measured at oblique angle of incidence in the terahertz spectral range, across electron spin resonance quintuplet transitions observed in wurtzite-structure GaN doped with iron. Measurements were performed by ellipsometry, using a superconducting cryostat magnet at magnetic fields of 7.23 T and at temperatures of 20 K and 30 K. We detail the occurrence of linear and circular birefringence and dichroism associated with each of the zero-field split spin transitions in the S = 5/2 defect system. We derive the spectral dependence of the magnetic susceptibility function and obtain the temperature and magnetic field dependence of the spin Hamiltonian. Our model correctly predicts the complexity of the polarization signatures observed in the 15 independent elements of the normalized Mueller matrix for both positive and negative magnetic fields.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
A Time-Inhomogeneous Markov Model for Resource Availability under Sparse Observations
Authors:
Lukas Rottkamp,
Matthias Schubert
Abstract:
Accurate spatio-temporal information about the current situation is crucial for smart city applications such as modern routing algorithms. Often, this information describes the state of stationary resources, e.g. the availability of parking bays, charging stations or the amount of people waiting for a vehicle to pick them up near a given location. To exploit this kind of information, predicting fu…
▽ More
Accurate spatio-temporal information about the current situation is crucial for smart city applications such as modern routing algorithms. Often, this information describes the state of stationary resources, e.g. the availability of parking bays, charging stations or the amount of people waiting for a vehicle to pick them up near a given location. To exploit this kind of information, predicting future states of the monitored resources is often mandatory because a resource might change its state within the time until it is needed. To train an accurate predictive model, it is often not possible to obtain a continuous time series on the state of the resource. For example, the information might be collected from traveling agents visiting the resource with an irregular frequency. Thus, it is necessary to develop methods which work on sparse observations for training and prediction. In this paper, we propose time-inhomogeneous discrete Markov models to allow accurate prediction even when the frequency of observation is very rare. Our new model is able to blend recent observations with historic data and also provide useful probabilistic estimates for future states. Since resources availability in a city is typically time-dependent, our Markov model is time-inhomogeneous and cyclic within a predefined time interval. To train our model, we propose a modified Baum-Welch algorithm. Evaluations on real-world datasets of parking bay availability show that our new method indeed yields good results compared to methods being trained on complete data and non-cyclic variants.
△ Less
Submitted 18 April, 2024;
originally announced April 2024.
-
Simplex Decomposition for Portfolio Allocation Constraints in Reinforcement Learning
Authors:
David Winkel,
Niklas Strauß,
Matthias Schubert,
Thomas Seidl
Abstract:
Portfolio optimization tasks describe sequential decision problems in which the investor's wealth is distributed across a set of assets. Allocation constraints are used to enforce minimal or maximal investments into particular subsets of assets to control for objectives such as limiting the portfolio's exposure to a certain sector due to environmental concerns. Although methods for constrained Rei…
▽ More
Portfolio optimization tasks describe sequential decision problems in which the investor's wealth is distributed across a set of assets. Allocation constraints are used to enforce minimal or maximal investments into particular subsets of assets to control for objectives such as limiting the portfolio's exposure to a certain sector due to environmental concerns. Although methods for constrained Reinforcement Learning (CRL) can optimize policies while considering allocation constraints, it can be observed that these general methods yield suboptimal results. In this paper, we propose a novel approach to handle allocation constraints based on a decomposition of the constraint action space into a set of unconstrained allocation problems. In particular, we examine this approach for the case of two constraints. For example, an investor may wish to invest at least a certain percentage of the portfolio into green technologies while limiting the investment in the fossil energy sector. We show that the action space of the task is equivalent to the decomposed action space, and introduce a new reinforcement learning (RL) approach CAOSD, which is built on top of the decomposition. The experimental evaluation on real-world Nasdaq-100 data demonstrates that our approach consistently outperforms state-of-the-art CRL benchmarks for portfolio optimization.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Efficient Parking Search using Shared Fleet Data
Authors:
Niklas Strauß,
Lukas Rottkamp,
Sebatian Schmoll,
Matthias Schubert
Abstract:
Finding an available on-street parking spot is a relevant problem of day-to-day life. In recent years, cities such as Melbourne and San Francisco deployed sensors that provide real-time information about the occupation of parking spots. Finding a free parking spot in such a smart environment can be modeled and solved as a Markov decision process (MDP). The problem has to consider uncertainty as av…
▽ More
Finding an available on-street parking spot is a relevant problem of day-to-day life. In recent years, cities such as Melbourne and San Francisco deployed sensors that provide real-time information about the occupation of parking spots. Finding a free parking spot in such a smart environment can be modeled and solved as a Markov decision process (MDP). The problem has to consider uncertainty as available parking spots might not remain available until arrival due to other vehicles also claiming spots in the meantime. Knowing the parking intention of every vehicle in the environment would eliminate this uncertainty. Unfortunately, it does currently not seem realistic to have such data from all vehicles. In contrast, acquiring data from a subset of vehicles or a vehicle fleet appears feasible and has the potential to reduce uncertainty.
In this paper, we examine the question of how useful sharing data within a vehicle fleet might be for the search times of particular drivers. We use fleet data to better estimate the availability of parking spots at arrival. Since optimal solutions for large scenarios are infeasible, we base our method on approximate solutions, which have been shown to perform well in single-agent settings. Our experiments are conducted on a simulation using real-world and synthetic data from the city of Melbourne. The results indicate that fleet data can significantly reduce search times for an available parking spot.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Effective uniaxial dielectric function tensor and optical phonons in ($\bar{2}01$)-plane oriented $β$-Ga$_2$O$_3$ films with equally-distributed six-fold rotation domains
Authors:
Alyssa Mock,
Steffen Richter,
Alexis Papamichail,
Vallery Stanishev,
Misagh Ghezellou,
Jawad Ul-Hassan,
Andreas Popp,
Saud Bin Anooz,
Daniella Gogova,
Praneeth Ranga,
Sriram Krishnamoorthy,
Rafal Korlacki,
Mathias Schubert,
Vanya Darakchieva
Abstract:
Monoclinic $β$-Ga$_2$O$_3$ films grown on $c$-plane sapphire have been shown to exhibit six $(\bar{2}01)$-plane oriented domains, which are equally-spaced-by-rotation around the surface normal and equally-sized-by-volume that render the film optical response effectively uniaxial. We derive and discuss an optical model suitable for ellipsometry data analysis of such films. We model mid- and far-inf…
▽ More
Monoclinic $β$-Ga$_2$O$_3$ films grown on $c$-plane sapphire have been shown to exhibit six $(\bar{2}01)$-plane oriented domains, which are equally-spaced-by-rotation around the surface normal and equally-sized-by-volume that render the film optical response effectively uniaxial. We derive and discuss an optical model suitable for ellipsometry data analysis of such films. We model mid- and far-infrared ellipsometry data from undoped and electrically insulating films with an effective uniaxial dielectric tensor based on projections of all phonon modes within the rotation domains parallel and perpendicular to the sample normal, i.e., to the reciprocal lattice vector $\mathbf{g}_{\bar{2}01}$. Two effective response functions are described by model, and found sufficient to calculate ellipsometry data that best-match measured ellipsometry data from a representative film. We propose to render either effective dielectric functions, or inverse effective dielectric functions, each separately for electric field directions parallel and perpendicular to $\mathbf{g}_{\bar{2}01}$, by sums of Lorentz oscillators, which permit to determine either sets of transverse optical phonon mode parameters, or sets of longitudinal optical phonon mode parameters, respectively. Transverse optical modes common to both dielectric functions can be traced back to single crystal modes with $B_{\mathrm{u}}$ character, while modes with $A_{\mathrm{u}}$ character only appear within the dielectric function for polarization perpendicular to the sample surface. The thereby obtained parameter sets reveal all phonon modes anticipated from averaging over the six-fold rotation domains of single crystal $β$-Ga$_2$O$_3$, but with slightly shifted transverse optical, and completely different longitudinal optical phonon modes.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Controlling the broadband enhanced light chirality with L-shaped dielectric metamaterials
Authors:
Ufuk Kilic,
Matthew Hilfiker,
Shawn Wimer,
Alexander Ruder,
Eva Schubert,
Mathias Schubert,
Christos Argyropoulos
Abstract:
The inherently weak chiroptical responses of natural materials limit their usage for controlling and enhancing chiral light-matter interactions. Recently, several nanostructures with subwavelength scale dimensions were demonstrated, mainly due to the advent of nanofabrication technologies, as a potential alternative to efficiently enhance chirality. However, the intrinsic lossy nature of metals an…
▽ More
The inherently weak chiroptical responses of natural materials limit their usage for controlling and enhancing chiral light-matter interactions. Recently, several nanostructures with subwavelength scale dimensions were demonstrated, mainly due to the advent of nanofabrication technologies, as a potential alternative to efficiently enhance chirality. However, the intrinsic lossy nature of metals and inherent narrowband response of dielectric planar thin films or metasurface structures pose severe limitations toward the practical realization of broadband and tailorable chiral systems. Here, we tackle these problems by designing all-dielectric silicon-based L-shaped optical metamaterials based on tilted nanopillars that exhibit broadband and enhanced chiroptical response in transmission operation. We use an emerging bottom-up fabrication approach, named glancing angle deposition, to assemble these dielectric metamaterials on a wafer scale. The reported strong chirality and optical anisotropic properties are controllable in terms of both amplitude and operating frequency by simply varying the shape and dimensions of the nanopillars. The presented nanostructures can be used in a plethora of emerging nanophotonic applications, such as chiral sensors, polarization filters, and spin-locked nanowaveguides.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
All epitaxial self-assembly of vertically-confined silicon color centers using ultra-low temperature epitaxy
Authors:
Johannes Aberl,
Enrique Prado Navarrete,
Merve Karaman,
Diego Haya Enriquez,
Christoph Wilflingseder,
Andreas Salomon,
Daniel Primetzhofer,
Markus Andreas Schubert,
Giovanni Capellini,
Thomas Fromherz,
Peter Deák,
Péter Udvarhelyi,
Li Song,
Ádám Gali,
Moritz Brehm
Abstract:
Silicon-based color-centers (SiCCs) have recently emerged as quantum-light sources that can be combined with telecom-range Si Photonics platforms. Unfortunately, using current SiCC fabrication, deterministic control over the vertical emitter position is impossible due to ion-implantation's stochastic nature. To overcome this bottleneck towards high-yield integration, we demonstrate a radically inn…
▽ More
Silicon-based color-centers (SiCCs) have recently emerged as quantum-light sources that can be combined with telecom-range Si Photonics platforms. Unfortunately, using current SiCC fabrication, deterministic control over the vertical emitter position is impossible due to ion-implantation's stochastic nature. To overcome this bottleneck towards high-yield integration, we demonstrate a radically innovative creation method for various SiCCs, solely relying on epitaxial growth of Si and C-doped Si at atypically-low temperatures in a ultra-clean growth environment. These telecom emitters can be confined within sub-1nm thick layers embedded at arbitrary vertical positions within a highly crystalline Si matrix. Tuning growth conditions and do**, different SiCC types, e.g., W-centers, T-centers, G-centers, or derivatives like G'-centers can be created, which are particularly promising as Si-based single-photon sources and spin-photon interfaces. The zero-phonon emission from G'-centers can be conveniently tuned by the C-concentration, leading to a systematic wavelength shift and linewidth narrowing towards low emitter densities.
△ Less
Submitted 21 May, 2024; v1 submitted 29 February, 2024;
originally announced February 2024.
-
Accelerating Innovation in 6G Research: Real-Time Capable SDR System Architecture for Rapid Prototy**
Authors:
Maximilian Engelhardt,
Sebastian Giehl,
Michael Schubert,
Alexander Ihlow,
Christian Schneider,
Alexander Ebert,
Markus Landmann,
Giovanni Del Galdo,
Carsten Andrich
Abstract:
The upcoming 3GPP global mobile communication standard 6G strives to push the technological limits of radio frequency (RF) communication even further than its predecessors: Sum data rates beyond 100 Gbit/s, RF bandwidths above 1 GHz per link, and sub-millisecond latency necessitate very high performance development tools. We propose a new SDR firmware and software architecture designed explicitly…
▽ More
The upcoming 3GPP global mobile communication standard 6G strives to push the technological limits of radio frequency (RF) communication even further than its predecessors: Sum data rates beyond 100 Gbit/s, RF bandwidths above 1 GHz per link, and sub-millisecond latency necessitate very high performance development tools. We propose a new SDR firmware and software architecture designed explicitly to meet these challenging requirements. It relies on Ethernet and commercial off-the-shelf network and server components to maximize flexibility and to reduce costs. We analyze state-of-the-art solutions (USRP X440 and other RFSoC-based systems), derive architectural design goals, explain resulting design decision in detail, and exemplify our architecture's implementation on the XCZU48DR RFSoC. Finally, we validate its performance via measurements and outline how the architecture surpasses the state-of-the-art with respect to sustained RF recording, while maintaining high Ethernet bandwidth efficiency. Building a micro-Doppler radar example, we demonstrate its real-time and rapid application development capabilities.
△ Less
Submitted 23 May, 2024; v1 submitted 9 February, 2024;
originally announced February 2024.
-
User-Centric Cell-Free Wireless Networks for 6G: Communication Theoretic Models and Research Challenges
Authors:
Fabian Göttsch,
Giuseppe Caire,
Wen Xu,
Martin Schubert
Abstract:
This paper presents a comprehensive communication theoretic model for the physical layer of a cell-free user-centric network, formed by user equipments (UEs), radio units (RUs), and decentralized units (DUs), uniformly spatially distributed over a given coverage area. We consider RUs equipped with multiple antennas, and focus on the regime where the UE, RU, and DU densities are constant and theref…
▽ More
This paper presents a comprehensive communication theoretic model for the physical layer of a cell-free user-centric network, formed by user equipments (UEs), radio units (RUs), and decentralized units (DUs), uniformly spatially distributed over a given coverage area. We consider RUs equipped with multiple antennas, and focus on the regime where the UE, RU, and DU densities are constant and therefore the number of such nodes grows with the coverage area. A system is said scalable if the computing load and information rate at any node in the network converges to a constant as the network size (coverage area) grows to infinity. This imposes that each UE must be processed by a (user-centric) finite-size cluster of RUs, and that such cluster processors are dynamically allocated to the DUs (e.g., as software defined virtual network functions) in order to achieve a balanced computation load. We also assume that the RUs are connected to the DUs through a packet switching network, in order to achieve adaptive routing and load balance. For this model, we define in details the dynamic cluster formation and uplink pilot allocation. As a consequence of the pilot allocation and the scalability constraint, each cluster processor has a partial view of the network channel state information. We define the condition of ``ideal partial CSI'' when the channel vectors that can be estimated are perfectly known (while the ones that cannot be estimated are not know at all). We develop two attractive cluster-based linear receiver schemes for the uplink, and an uplink-downlink duality that allows to reuse such vectors as precoders for the downlink.
△ Less
Submitted 12 January, 2024;
originally announced January 2024.
-
Spatial-Aware Deep Reinforcement Learning for the Traveling Officer Problem
Authors:
Niklas Strauß,
Matthias Schubert
Abstract:
The traveling officer problem (TOP) is a challenging stochastic optimization task. In this problem, a parking officer is guided through a city equipped with parking sensors to fine as many parking offenders as possible. A major challenge in TOP is the dynamic nature of parking offenses, which randomly appear and disappear after some time, regardless of whether they have been fined. Thus, solutions…
▽ More
The traveling officer problem (TOP) is a challenging stochastic optimization task. In this problem, a parking officer is guided through a city equipped with parking sensors to fine as many parking offenders as possible. A major challenge in TOP is the dynamic nature of parking offenses, which randomly appear and disappear after some time, regardless of whether they have been fined. Thus, solutions need to dynamically adjust to currently fineable parking offenses while also planning ahead to increase the likelihood that the officer arrives during the offense taking place. Though various solutions exist, these methods often struggle to take the implications of actions on the ability to fine future parking violations into account. This paper proposes SATOP, a novel spatial-aware deep reinforcement learning approach for TOP. Our novel state encoder creates a representation of each action, leveraging the spatial relationships between parking spots, the agent, and the action. Furthermore, we propose a novel message-passing module for learning future inter-action correlations in the given environment. Thus, the agent can estimate the potential to fine further parking violations after executing an action. We evaluate our method using an environment based on real-world data from Melbourne. Our results show that SATOP consistently outperforms state-of-the-art TOP agents and is able to fine up to 22% more parking offenses.
△ Less
Submitted 11 January, 2024;
originally announced January 2024.
-
Nanocolumnar Material Platforms:Universal structural parameters revealed from optical anisotropy
Authors:
Ufuk Kilic,
Yousra Traouli,
Matthew Hilfiker,
Khalil Bryant,
Stefan Schoeche,
Rene Feder,
Christos Argyropoulos,
Eva Schubert,
Mathias Schubert
Abstract:
Nanostructures represent a frontier where meticulous attention to the control and assessment of structural dimensions becomes a linchpin for their seamless integration into diverse technological applications. By using integrative and comprehensive methodical series of studies, we investigate the evolution of the depolarization factors in the anisotropic Bruggeman effective medium approximation, th…
▽ More
Nanostructures represent a frontier where meticulous attention to the control and assessment of structural dimensions becomes a linchpin for their seamless integration into diverse technological applications. By using integrative and comprehensive methodical series of studies, we investigate the evolution of the depolarization factors in the anisotropic Bruggeman effective medium approximation, that are extremely sensitive to the changes in critical dimensions of the nanostructure platforms. To this end, we fabricate spatially coherent highly-ordered slanted nanocolumns from zirconia, silicon, titanium, and permalloy on silicon substrates with varying column lengths using glancing angle deposition. In tandem, broad-spectral range Mueller matrix spectroscopic ellipsometry data, spanning from the near-infrared to the vacuum ultraviolet (0.72 eV to 6.5 eV), is analyzed with a best-match model approach based on the anisotropic Bruggeman effective medium theory. We thereby extracted the anisotropic optical properties including complex dielectric function, birefringence, and dichroism. Most notably, our research unveils a universal, material-independent inverse relationship between depolarization factors and column length. We envision that the presented universal relationship will permit accurate prediction of optical properties of nanocolumnar thin films improving their integration and optimization for optoelectronic and photonic device applications.
△ Less
Submitted 1 December, 2023;
originally announced December 2023.
-
Discrete Adjoint Method for Variational Integration of Constrained ODEs and its application to Optimal Control of Geometrically Exact Beam Dynamics
Authors:
Matthias Schubert,
Rodrigo T. Sato Martín de Almagro,
Karin Nachbagauer,
Sina Ober-Blöbaum,
Sigrid Leyendecker
Abstract:
Direct methods for the simulation of optimal control problems apply a specific discretization to the dynamics of the problem, and the discrete adjoint method is suitable to calculate corresponding conditions to approximate an optimal solution. While the benefits of structure preserving or geometric methods have been known for decades, their exploration in the context of optimal control problems is…
▽ More
Direct methods for the simulation of optimal control problems apply a specific discretization to the dynamics of the problem, and the discrete adjoint method is suitable to calculate corresponding conditions to approximate an optimal solution. While the benefits of structure preserving or geometric methods have been known for decades, their exploration in the context of optimal control problems is a relatively recent field of research. In this work, the discrete adjoint method is derived for variational integrators yielding structure preserving approximations of the dynamics firstly in the ODE case and secondly for the case in which the dynamics is subject to holonomic constraints. The convergence rates are illustrated by numerical examples. Thirdly, the discrete adjoint method is applied to geometrically exact beam dynamics, represented by a holonomically constrained PDE.
△ Less
Submitted 27 November, 2023;
originally announced November 2023.
-
Optically manipulated micromirrors for precise excitation of WGM microlasers
Authors:
Tomasz Plaskocinski,
Libin Yan,
Marcel Schubert,
Malte C. Gather,
Andrea Di Falco
Abstract:
Whispering gallery mode microlasers are highly sensitive refractive index sensors widely explored for biophotonic and biomedical applications. Microlaser excitation and collection of the emitted light typically utilize microscope objectives at normal incidence, limiting the choice of the oscillation plane of the modes. Here, we present a platform that enables the excitation of microlasers from var…
▽ More
Whispering gallery mode microlasers are highly sensitive refractive index sensors widely explored for biophotonic and biomedical applications. Microlaser excitation and collection of the emitted light typically utilize microscope objectives at normal incidence, limiting the choice of the oscillation plane of the modes. Here, we present a platform that enables the excitation of microlasers from various directions using an optically manipulated micromirror. The scheme enables precise sensing of the environment surrounding the microlasers along different well-controlled planes. We further demonstrate the capability of the platform to perform a time-resolved experiment of dynamic sensing using a polystyrene probe bead orbiting the microlaser.
△ Less
Submitted 22 November, 2023;
originally announced November 2023.
-
Deep Active Learning with Noisy Oracle in Object Detection
Authors:
Marius Schubert,
Tobias Riedlinger,
Karsten Kahl,
Matthias Rottmann
Abstract:
Obtaining annotations for complex computer vision tasks such as object detection is an expensive and time-intense endeavor involving a large number of human workers or expert opinions. Reducing the amount of annotations required while maintaining algorithm performance is, therefore, desirable for machine learning practitioners and has been successfully achieved by active learning algorithms. Howev…
▽ More
Obtaining annotations for complex computer vision tasks such as object detection is an expensive and time-intense endeavor involving a large number of human workers or expert opinions. Reducing the amount of annotations required while maintaining algorithm performance is, therefore, desirable for machine learning practitioners and has been successfully achieved by active learning algorithms. However, it is not merely the amount of annotations which influences model performance but also the annotation quality. In practice, the oracles that are queried for new annotations frequently contain significant amounts of noise. Therefore, cleansing procedures are oftentimes necessary to review and correct given labels. This process is subject to the same budget as the initial annotation itself since it requires human workers or even domain experts. Here, we propose a composite active learning framework including a label review module for deep object detection. We show that utilizing part of the annotation budget to correct the noisy annotations partially in the active dataset leads to early improvements in model performance, especially when coupled with uncertainty-based query strategies. The precision of the label error proposals has a significant influence on the measured effect of the label review. In our experiments we achieve improvements of up to 4.5 mAP points of object detection performance by incorporating label reviews at equal annotation budget.
△ Less
Submitted 30 September, 2023;
originally announced October 2023.
-
Fourier modal method for inverse design of metasurface-enhanced micro-LEDs
Authors:
Martin F. Schubert,
Alec M. Hammond
Abstract:
We present a simulation capability for micro-scale light-emitting diodes (uLEDs) that achieves comparable accuracy to CPU-based finite-difference time-domain simulation but is more than 10^7 times faster. Our approach is based on the Fourier modal method (FMM) -- which, as we demonstrate, is well suited to modeling thousands of incoherent sources -- with extensions that allow rapid convergence for…
▽ More
We present a simulation capability for micro-scale light-emitting diodes (uLEDs) that achieves comparable accuracy to CPU-based finite-difference time-domain simulation but is more than 10^7 times faster. Our approach is based on the Fourier modal method (FMM) -- which, as we demonstrate, is well suited to modeling thousands of incoherent sources -- with extensions that allow rapid convergence for uLED structures that are challenging to model with standard approaches. The speed of our method makes the inverse design of uLEDs tractable, which we demonstrate by designing a metasurface-enhanced uLED that doubles the light extraction efficiency of an unoptimized device.
△ Less
Submitted 15 August, 2023;
originally announced August 2023.
-
LMD: Light-weight Prediction Quality Estimation for Object Detection in Lidar Point Clouds
Authors:
Tobias Riedlinger,
Marius Schubert,
Sarina Penquitt,
Jan-Marcel Kezmann,
Pascal Colling,
Karsten Kahl,
Lutz Roese-Koerner,
Michael Arnold,
Urs Zimmermann,
Matthias Rottmann
Abstract:
Object detection on Lidar point cloud data is a promising technology for autonomous driving and robotics which has seen a significant rise in performance and accuracy during recent years. Particularly uncertainty estimation is a crucial component for down-stream tasks and deep neural networks remain error-prone even for predictions with high confidence. Previously proposed methods for quantifying…
▽ More
Object detection on Lidar point cloud data is a promising technology for autonomous driving and robotics which has seen a significant rise in performance and accuracy during recent years. Particularly uncertainty estimation is a crucial component for down-stream tasks and deep neural networks remain error-prone even for predictions with high confidence. Previously proposed methods for quantifying prediction uncertainty tend to alter the training scheme of the detector or rely on prediction sampling which results in vastly increased inference time. In order to address these two issues, we propose LidarMetaDetect (LMD), a light-weight post-processing scheme for prediction quality estimation. Our method can easily be added to any pre-trained Lidar object detector without altering anything about the base model and is purely based on post-processing, therefore, only leading to a negligible computational overhead. Our experiments show a significant increase of statistical reliability in separating true from false predictions. We propose and evaluate an additional application of our method leading to the detection of annotation errors. Explicit samples and a conservative count of annotation error proposals indicates the viability of our method for large-scale datasets like KITTI and nuScenes. On the widely-used nuScenes test dataset, 43 out of the top 100 proposals of our method indicate, in fact, erroneous annotations.
△ Less
Submitted 15 June, 2023; v1 submitted 13 June, 2023;
originally announced June 2023.
-
The anisotropic Beer-Lambert law in $β$-Ga$_{2}$O$_{3}$: Spectral and polarization dependent absorption and photoresponsivity
Authors:
Md Mohsinur Rahman Adnan,
Darpan Verma,
Chris Sturm,
Matthias Schubert,
Roberto C. Myers
Abstract:
Due to its low symmetry, $β$-Ga$_{2}$O$_{3}$ exhibits a strongly anisotropic optical response. As a result, the absorption spectra change with the polarization state of the incoming photons. To understand this phenomenon, here we calculate the complete electromagnetic wave equation solutions as a function of linear polarization angle and photon energy for $β$-Ga$_{2}$O$_{3}$ using its previously m…
▽ More
Due to its low symmetry, $β$-Ga$_{2}$O$_{3}$ exhibits a strongly anisotropic optical response. As a result, the absorption spectra change with the polarization state of the incoming photons. To understand this phenomenon, here we calculate the complete electromagnetic wave equation solutions as a function of linear polarization angle and photon energy for $β$-Ga$_{2}$O$_{3}$ using its previously measured complex dielectric function tensor. The significant off-diagonal terms in this tensor can result in a non-exponential decay in the photon flux, indicating that the Beer-Lambert law is not generally valid in this anisotropic material. However, for above-band-gap spectral regions which depend on crystallographic orientations (> 5.8 eV (001 plane),>5.2 eV (010 plane)) an effective absorption coefficient well approximates the photon flux decay with depth. On the other hand, near the optical absorption edge (4.9 - 5.8 eV (001 plane),4.65 - 5.2 eV (010 plane)) the photon flux decay exhibits a sum of two exponential decays, such that two effective absorption coefficients are necessary to model the loss behavior versus the absorption depth. This behavior manifests from the presence of dichroism in $β$-Ga$_{2}$O$_{3}$. A single effective absorption coefficient can only be recovered for this energy range by augmenting the isotropic Beer-Lambert law with a critical penetration depth and polarization dependence. Using these results, we calculate the polarization-dependent photoresponsivity spectra for light polarized along different crystallographic directions.
△ Less
Submitted 30 January, 2024; v1 submitted 30 May, 2023;
originally announced May 2023.
-
GRAtt-VIS: Gated Residual Attention for Auto Rectifying Video Instance Segmentation
Authors:
Tanveer Hannan,
Rajat Koner,
Maximilian Bernhard,
Suprosanna Shit,
Bjoern Menze,
Volker Tresp,
Matthias Schubert,
Thomas Seidl
Abstract:
Recent trends in Video Instance Segmentation (VIS) have seen a growing reliance on online methods to model complex and lengthy video sequences. However, the degradation of representation and noise accumulation of the online methods, especially during occlusion and abrupt changes, pose substantial challenges. Transformer-based query propagation provides promising directions at the cost of quadratic…
▽ More
Recent trends in Video Instance Segmentation (VIS) have seen a growing reliance on online methods to model complex and lengthy video sequences. However, the degradation of representation and noise accumulation of the online methods, especially during occlusion and abrupt changes, pose substantial challenges. Transformer-based query propagation provides promising directions at the cost of quadratic memory attention. However, they are susceptible to the degradation of instance features due to the above-mentioned challenges and suffer from cascading effects. The detection and rectification of such errors remain largely underexplored. To this end, we introduce \textbf{GRAtt-VIS}, \textbf{G}ated \textbf{R}esidual \textbf{Att}ention for \textbf{V}ideo \textbf{I}nstance \textbf{S}egmentation. Firstly, we leverage a Gumbel-Softmax-based gate to detect possible errors in the current frame. Next, based on the gate activation, we rectify degraded features from its past representation. Such a residual configuration alleviates the need for dedicated memory and provides a continuous stream of relevant instance features. Secondly, we propose a novel inter-instance interaction using gate activation as a mask for self-attention. This masking strategy dynamically restricts the unrepresentative instance queries in the self-attention and preserves vital information for long-term tracking. We refer to this novel combination of Gated Residual Connection and Masked Self-Attention as \textbf{GRAtt} block, which can easily be integrated into the existing propagation-based framework. Further, GRAtt blocks significantly reduce the attention overhead and simplify dynamic temporal modeling. GRAtt-VIS achieves state-of-the-art performance on YouTube-VIS and the highly challenging OVIS dataset, significantly improving over previous methods. Code is available at \url{https://github.com/Tanveer81/GRAttVIS}.
△ Less
Submitted 26 May, 2023;
originally announced May 2023.
-
MapFormer: Boosting Change Detection by Using Pre-change Information
Authors:
Maximilian Bernhard,
Niklas Strauß,
Matthias Schubert
Abstract:
Change detection in remote sensing imagery is essential for a variety of applications such as urban planning, disaster management, and climate research. However, existing methods for identifying semantically changed areas overlook the availability of semantic information in the form of existing maps describing features of the earth's surface. In this paper, we leverage this information for change…
▽ More
Change detection in remote sensing imagery is essential for a variety of applications such as urban planning, disaster management, and climate research. However, existing methods for identifying semantically changed areas overlook the availability of semantic information in the form of existing maps describing features of the earth's surface. In this paper, we leverage this information for change detection in bi-temporal images. We show that the simple integration of the additional information via concatenation of latent representations suffices to significantly outperform state-of-the-art change detection methods. Motivated by this observation, we propose the new task of *Conditional Change Detection*, where pre-change semantic information is used as input next to bi-temporal images. To fully exploit the extra information, we propose *MapFormer*, a novel architecture based on a multi-modal feature fusion module that allows for feature processing conditioned on the available semantic information. We further employ a supervised, cross-modal contrastive loss to guide the learning of visual representations. Our approach outperforms existing change detection methods by an absolute 11.7\% and 18.4\% in terms of binary change IoU on DynamicEarthNet and HRSCD, respectively. Furthermore, we demonstrate the robustness of our approach to the quality of the pre-change semantic information and the absence pre-change imagery. The code is available at https://github.com/mxbh/mapformer.
△ Less
Submitted 7 December, 2023; v1 submitted 31 March, 2023;
originally announced March 2023.
-
Identifying Label Errors in Object Detection Datasets by Loss Inspection
Authors:
Marius Schubert,
Tobias Riedlinger,
Karsten Kahl,
Daniel Kröll,
Sebastian Schoenen,
Siniša Šegvić,
Matthias Rottmann
Abstract:
Labeling datasets for supervised object detection is a dull and time-consuming task. Errors can be easily introduced during annotation and overlooked during review, yielding inaccurate benchmarks and performance degradation of deep neural networks trained on noisy labels. In this work, we for the first time introduce a benchmark for label error detection methods on object detection datasets as wel…
▽ More
Labeling datasets for supervised object detection is a dull and time-consuming task. Errors can be easily introduced during annotation and overlooked during review, yielding inaccurate benchmarks and performance degradation of deep neural networks trained on noisy labels. In this work, we for the first time introduce a benchmark for label error detection methods on object detection datasets as well as a label error detection method and a number of baselines. We simulate four different types of randomly introduced label errors on train and test sets of well-labeled object detection datasets. For our label error detection method we assume a two-stage object detector to be given and consider the sum of both stages' classification and regression losses. The losses are computed with respect to the predictions and the noisy labels including simulated label errors, aiming at detecting the latter. We compare our method to three baselines: a naive one without deep learning, the object detector's score and the entropy of the classification softmax distribution. We outperform all baselines and demonstrate that among the considered methods, ours is the only one that detects label errors of all four types efficiently. Furthermore, we detect real label errors a) on commonly used test datasets in object detection and b) on a proprietary dataset. In both cases we achieve low false positives rates, i.e., we detect label errors with a precision for a) of up to 71.5% and for b) with 97%.
△ Less
Submitted 19 December, 2023; v1 submitted 13 March, 2023;
originally announced March 2023.
-
Quantum Composites with the Functionality Defined by the Charge-Density-Wave Phase Transitions
Authors:
Zahra Barani,
Tekwam Geremew,
Megan Stokey,
Nicholas Sesing,
Maedeh Taheri,
Matthew J. Hilfiker,
Fariborz Kargar,
Mathias Schubert,
Tina T. Salguero,
Alexander A. Balandin
Abstract:
We demonstrate a unique class of advanced materials - quantum composites based on polymers with fillers comprised of a van der Waals quantum material that reveals multiple charge-density-wave quantum condensate phases. Materials that exhibit quantum phenomena are typically crystalline, pure, and have few defects because disorder destroys the coherence of the electrons and phonons, leading to colla…
▽ More
We demonstrate a unique class of advanced materials - quantum composites based on polymers with fillers comprised of a van der Waals quantum material that reveals multiple charge-density-wave quantum condensate phases. Materials that exhibit quantum phenomena are typically crystalline, pure, and have few defects because disorder destroys the coherence of the electrons and phonons, leading to collapses of the quantum states. We succeeded in preserving the macroscopic charge-density-wave phases of filler particles after multiple composite processing steps. The prepared composites manifest strong charge-density-wave phenomena even above room temperature. The dielectric constant experiences more than two orders of magnitude enhancement while the material maintains its electrically insulating properties, opening a venue for advanced applications in energy storage and electronics. The results present a conceptually different approach for engineering the properties of materials, extending the application domain for van der Waals materials.
△ Less
Submitted 21 February, 2023;
originally announced February 2023.
-
Towards Causal Credit Assignment
Authors:
Mátyás Schubert
Abstract:
Adequately assigning credit to actions for future outcomes based on their contributions is a long-standing open challenge in Reinforcement Learning. The assumptions of the most commonly used credit assignment method are disadvantageous in tasks where the effects of decisions are not immediately evident. Furthermore, this method can only evaluate actions that have been selected by the agent, making…
▽ More
Adequately assigning credit to actions for future outcomes based on their contributions is a long-standing open challenge in Reinforcement Learning. The assumptions of the most commonly used credit assignment method are disadvantageous in tasks where the effects of decisions are not immediately evident. Furthermore, this method can only evaluate actions that have been selected by the agent, making it highly inefficient. Still, no alternative methods have been widely adopted in the field. Hindsight Credit Assignment is a promising, but still unexplored candidate, which aims to solve the problems of both long-term and counterfactual credit assignment. In this thesis, we empirically investigate Hindsight Credit Assignment to identify its main benefits, and key points to improve. Then, we apply it to factored state representations, and in particular to state representations based on the causal structure of the environment. In this setting, we propose a variant of Hindsight Credit Assignment that effectively exploits a given causal structure. We show that our modification greatly decreases the workload of Hindsight Credit Assignment, making it more efficient and enabling it to outperform the baseline credit assignment method on various tasks. This opens the way to other methods based on given or learned causal structures.
△ Less
Submitted 17 May, 2023; v1 submitted 22 December, 2022;
originally announced December 2022.
-
Towards Rapid Prototy** and Comparability in Active Learning for Deep Object Detection
Authors:
Tobias Riedlinger,
Marius Schubert,
Karsten Kahl,
Hanno Gottschalk,
Matthias Rottmann
Abstract:
Active learning as a paradigm in deep learning is especially important in applications involving intricate perception tasks such as object detection where labels are difficult and expensive to acquire. Development of active learning methods in such fields is highly computationally expensive and time consuming which obstructs the progression of research and leads to a lack of comparability between…
▽ More
Active learning as a paradigm in deep learning is especially important in applications involving intricate perception tasks such as object detection where labels are difficult and expensive to acquire. Development of active learning methods in such fields is highly computationally expensive and time consuming which obstructs the progression of research and leads to a lack of comparability between methods. In this work, we propose and investigate a sandbox setup for rapid development and transparent evaluation of active learning in deep object detection. Our experiments with commonly used configurations of datasets and detection architectures found in the literature show that results obtained in our sandbox environment are representative of results on standard configurations. The total compute time to obtain results and assess the learning behavior can thereby be reduced by factors of up to 14 when comparing with Pascal VOC and up to 32 when comparing with BDD100k. This allows for testing and evaluating data acquisition and labeling strategies in under half a day and contributes to the transparency and development speed in the field of active learning for object detection.
△ Less
Submitted 21 December, 2022;
originally announced December 2022.
-
Effects of solar evolution on finite acquisition time of Fabry-Perot-Interferometers in high resolution solar physics
Authors:
Rolf Schlichenmaier,
Daniel Pitters,
Juan Manuel Borrero,
Matthias Schubert
Abstract:
The imaging spectro-polarimeter VTF (Visible Tunable Filter) will be operated at the Daniel K. Inouye Solar Telescope (DKIST). Due to its capability of resolving dynamic fine structure of smaller than 0.05'', the finite acquisition time of typically 11 s affects the measurement process and potentially causes errors in deduced physical parameters. We estimate those errors and investigate ways of mi…
▽ More
The imaging spectro-polarimeter VTF (Visible Tunable Filter) will be operated at the Daniel K. Inouye Solar Telescope (DKIST). Due to its capability of resolving dynamic fine structure of smaller than 0.05'', the finite acquisition time of typically 11 s affects the measurement process and potentially causes errors in deduced physical parameters. We estimate those errors and investigate ways of minimising them.
We mimic the solar surface using a magneto-hydrodynamic simulation with a spatially averaged vertical field strength of 200 G. We simulate the measurement process scanning through successive wavelength points with a temporal cadence of 1 s. We synthesise FeI 617.3 nm. Besides the classical composition of the line profile, we introduce a novel method in which the intensity in each wavelength point is normalised using the simultaneous continuum intensity. Milne-Eddington inversions are used to infer the line-of-sight velocity, v(los), and the vertical (longitudinal) component of the magnetic field, B(los).
We quantify systematic errors, defining the temporal average of the simulation during the measurement as the truth. We find that with the classical composition of the line profiles, errors exceed the sensitivity for v(los) and in filigree regions also for B(los). The novel method that includes normalisation reduces the measurement errors in all cases. Spatial binning without reducing the acquisition time decreases the measurement error slightly.
The evolutionary time-scale in inter-granular lanes, in particular in areas with magnetic features (filigree), is shorter than the time-scale within granules. Hence less accumulations could be used for strong magnetic field in inter-granular lanes and more accumulations could be used for the weak granular magnetic fields. As a key result, we suggest to include the novel method of normalisation in corresponding data pipelines.
△ Less
Submitted 27 October, 2022;
originally announced October 2022.
-
Robust Object Detection in Remote Sensing Imagery with Noisy and Sparse Geo-Annotations (Full Version)
Authors:
Maximilian Bernhard,
Matthias Schubert
Abstract:
Recently, the availability of remote sensing imagery from aerial vehicles and satellites constantly improved. For an automated interpretation of such data, deep-learning-based object detectors achieve state-of-the-art performance. However, established object detectors require complete, precise, and correct bounding box annotations for training. In order to create the necessary training annotations…
▽ More
Recently, the availability of remote sensing imagery from aerial vehicles and satellites constantly improved. For an automated interpretation of such data, deep-learning-based object detectors achieve state-of-the-art performance. However, established object detectors require complete, precise, and correct bounding box annotations for training. In order to create the necessary training annotations for object detectors, imagery can be georeferenced and combined with data from other sources, such as points of interest localized by GPS sensors. Unfortunately, this combination often leads to poor object localization and missing annotations. Therefore, training object detectors with such data often results in insufficient detection performance. In this paper, we present a novel approach for training object detectors with extremely noisy and incomplete annotations. Our method is based on a teacher-student learning framework and a correction module accounting for imprecise and missing annotations. Thus, our method is easy to use and can be combined with arbitrary object detectors. We demonstrate that our approach improves standard detectors by 37.1\% $AP_{50}$ on a noisy real-world remote-sensing dataset. Furthermore, our method achieves great performance gains on two datasets with synthetic noise. Code is available at \url{https://github.com/mxbh/robust_object_detection}.
△ Less
Submitted 24 October, 2022;
originally announced October 2022.
-
SurCo: Learning Linear Surrogates For Combinatorial Nonlinear Optimization Problems
Authors:
Aaron Ferber,
Taoan Huang,
Daochen Zha,
Martin Schubert,
Benoit Steiner,
Bistra Dilkina,
Yuandong Tian
Abstract:
Optimization problems with nonlinear cost functions and combinatorial constraints appear in many real-world applications but remain challenging to solve efficiently compared to their linear counterparts. To bridge this gap, we propose $\textbf{SurCo}$ that learns linear $\underline{\text{Sur}}$rogate costs which can be used in existing $\underline{\text{Co}}$mbinatorial solvers to output good solu…
▽ More
Optimization problems with nonlinear cost functions and combinatorial constraints appear in many real-world applications but remain challenging to solve efficiently compared to their linear counterparts. To bridge this gap, we propose $\textbf{SurCo}$ that learns linear $\underline{\text{Sur}}$rogate costs which can be used in existing $\underline{\text{Co}}$mbinatorial solvers to output good solutions to the original nonlinear combinatorial optimization problem. The surrogate costs are learned end-to-end with nonlinear loss by differentiating through the linear surrogate solver, combining the flexibility of gradient-based methods with the structure of linear combinatorial optimization. We propose three $\texttt{SurCo}$ variants: $\texttt{SurCo}-\texttt{zero}$ for individual nonlinear problems, $\texttt{SurCo}-\texttt{prior}$ for problem distributions, and $\texttt{SurCo}-\texttt{hybrid}$ to combine both distribution and problem-specific information. We give theoretical intuition motivating $\texttt{SurCo}$, and evaluate it empirically. Experiments show that $\texttt{SurCo}$ finds better solutions faster than state-of-the-art and domain expert approaches in real-world optimization problems such as embedding table sharding, inverse photonic design, and nonlinear route planning.
△ Less
Submitted 19 July, 2023; v1 submitted 22 October, 2022;
originally announced October 2022.
-
Local sensing of absolute refractive index during protein-binding using microlasers with spectral encoding
Authors:
Soraya Caixeiro,
Casper Kunstmann-Olsen,
Marcel Schubert,
Joseph Hill,
Isla R. M. Barnard,
Matthew D. Simmons,
Steven Johnson,
Malte C. Gather
Abstract:
Multiplexed, specific and sensitive detection of antigens is critical for the rapid and accurate diagnosis of disease and the informed development of personalized treatment plans. Here, we show that polymer microsphere lasers can be used as photonic sensors to monitor and quantify direct surface binding of biomolecules via changes in the refractive index. The unique spectral signature of each indi…
▽ More
Multiplexed, specific and sensitive detection of antigens is critical for the rapid and accurate diagnosis of disease and the informed development of personalized treatment plans. Here, we show that polymer microsphere lasers can be used as photonic sensors to monitor and quantify direct surface binding of biomolecules via changes in the refractive index. The unique spectral signature of each individual laser can be used to find their size and effective refractive index which adds a new encoding dimension when compared to conventional fluorescent beads. We utilize antibody-functionalized microlasers to selectively detect protein binding. Different stages of the multilayer surface modification can be resolved, and protein binding is demonstrated for two different proteins, IgG and CRP. Moreover, by continuously monitoring single lasers, we demonstrate the possibility of real-time monitoring of binding dynamics between antigens in solution phase and the immobilized antibodies. For multiplexed detection, the microlasers are employed in a flow cytometer configuration, with fast spectral detection and identification of microlasers with and without antigen binding. We envision that by combining microlasers with well-established surface modification chemistries and flow geometries, the multiplexing ability of microbead immunoassays can be strongly increased while also opening avenues for single cell profiling within heterogenous cell populations.
△ Less
Submitted 19 October, 2022;
originally announced October 2022.
-
Receiver Bandwidth Extension Beyond Nyquist Using Channel Bonding
Authors:
Sebastian Giehl,
Carsten Andrich,
Michael Schubert,
Maximilian Engelhardt,
Alexander Ihlow
Abstract:
Current and upcoming communication and sensing technologies require ever larger bandwidths. Channel bonding can be utilized to extend a receiver's instantaneous bandwidth beyond a single converter's Nyquist limit. Two potential joint front-end and converter design approaches are theoretically introduced, realized and evaluated in this paper. The Xilinx RFSoC platform with its 5 GSa/s analog to dig…
▽ More
Current and upcoming communication and sensing technologies require ever larger bandwidths. Channel bonding can be utilized to extend a receiver's instantaneous bandwidth beyond a single converter's Nyquist limit. Two potential joint front-end and converter design approaches are theoretically introduced, realized and evaluated in this paper. The Xilinx RFSoC platform with its 5 GSa/s analog to digital converters (ADCs) is used to implement both a hybrid coupler based in-phase/quadrature (I/Q) sampling and a time-interleaved sampling approach along with channel bonding. Both realizations are demonstrated to be able to reconstruct instantaneous bandwidths of 5 GHz with up to 49 dB image rejection ratio (IRR) typically within 4 to 8 dB the front-ends' theoretical limits.
△ Less
Submitted 12 February, 2024; v1 submitted 14 October, 2022;
originally announced October 2022.
-
Federated Continual Learning for Text Classification via Selective Inter-client Transfer
Authors:
Yatin Chaudhary,
Pranav Rai,
Matthias Schubert,
Hinrich Schütze,
Pankaj Gupta
Abstract:
In this work, we combine the two paradigms: Federated Learning (FL) and Continual Learning (CL) for text classification task in cloud-edge continuum. The objective of Federated Continual Learning (FCL) is to improve deep learning models over life time at each client by (relevant and efficient) knowledge transfer without sharing data. Here, we address challenges in minimizing inter-client interfere…
▽ More
In this work, we combine the two paradigms: Federated Learning (FL) and Continual Learning (CL) for text classification task in cloud-edge continuum. The objective of Federated Continual Learning (FCL) is to improve deep learning models over life time at each client by (relevant and efficient) knowledge transfer without sharing data. Here, we address challenges in minimizing inter-client interference while knowledge sharing due to heterogeneous tasks across clients in FCL setup. In doing so, we propose a novel framework, Federated Selective Inter-client Transfer (FedSeIT) which selectively combines model parameters of foreign clients. To further maximize knowledge transfer, we assess domain overlap and select informative tasks from the sequence of historical tasks at each foreign client while preserving privacy. Evaluating against the baselines, we show improved performance, a gain of (average) 12.4\% in text classification over a sequence of tasks using five datasets from diverse domains. To the best of our knowledge, this is the first work that applies FCL to NLP.
△ Less
Submitted 12 February, 2023; v1 submitted 12 October, 2022;
originally announced October 2022.
-
Remote Surface Optical Phonon Scattering in Ferroelectric Ba$_{0.6}$Sr$_{0.4}$TiO$_{3}$ Gated Graphene
Authors:
Hanying Chen,
Tianlin Li,
Yifei Hao,
Anil Rajapitamahuni,
Zhiyong Xiao,
Stefan Schoeche,
Mathias Schubert,
Xia Hong
Abstract:
We report the effect of remote surface optical (RSO) phonon scattering on carrier mobility in monolayer graphene gated by ferroelectric oxide. We fabricate monolayer graphene transistors back-gated by epitaxial (001) Ba$_{0.6}$Sr$_{0.4}$TiO$_{3}$ films, with field effect mobility up to 23,000 cm$^{2}$V$^{-1}$s$^{-1}$ achieved. Switching the ferroelectric polarization induces nonvolatile modulation…
▽ More
We report the effect of remote surface optical (RSO) phonon scattering on carrier mobility in monolayer graphene gated by ferroelectric oxide. We fabricate monolayer graphene transistors back-gated by epitaxial (001) Ba$_{0.6}$Sr$_{0.4}$TiO$_{3}$ films, with field effect mobility up to 23,000 cm$^{2}$V$^{-1}$s$^{-1}$ achieved. Switching the ferroelectric polarization induces nonvolatile modulation of resistance and quantum Hall effect in graphene at low temperatures. Ellipsometry spectroscopy studies reveal four pairs of optical phonon modes in Ba$_{0.6}$Sr$_{0.4}$TiO$_{3}$, from which we extract the RSO phonon frequencies. The temperature dependence of resistivity in graphene can be well accounted for by considering the scattering from the intrinsic longitudinal acoustic phonon and the RSO phonon, with the latter dominated by the mode at 35.8 meV. Our study reveals the room temperature mobility limit of ferroelectric-gated graphene transistors imposed by RSO phonon scattering.
△ Less
Submitted 30 September, 2022;
originally announced September 2022.
-
Strain and composition dependencies of the near bandgap optical transitions in monoclinic (Al$_x$Ga$_{1-x}$)$_2$O$_3$ alloys with coherent biaxial in-plane strain on (010) Ga$_2$O$_3$
Authors:
Rafał Korlacki,
Matthew Hilfiker,
Jenna Knudtson,
Megan Stokey,
Ufuk Kilic,
Akhil Mauze,
Yuewei Zhang,
James Speck,
Vanya Darakchieva,
Mathias Schubert
Abstract:
The bowing of the energy of the three lowest band-to-band transitions in $β$-(Al$_{x}$Ga$_{1-x}$)$_2$O$_3$ alloys was resolved using a combined density functional theory (DFT) and generalized spectroscopic ellipsometry (GSE) approach. The DFT calculations of the electronic band structure of both, $β$-Ga$_2$O$_3$ and $θ$-Al$_2$O$_3$, allow extracting of the linear portion of the energy shift in the…
▽ More
The bowing of the energy of the three lowest band-to-band transitions in $β$-(Al$_{x}$Ga$_{1-x}$)$_2$O$_3$ alloys was resolved using a combined density functional theory (DFT) and generalized spectroscopic ellipsometry (GSE) approach. The DFT calculations of the electronic band structure of both, $β$-Ga$_2$O$_3$ and $θ$-Al$_2$O$_3$, allow extracting of the linear portion of the energy shift in the alloys, and provide a method for quantifying the role of coherent strain present in the $β$-(Al$_{x}$Ga$_{1-x}$)$_2$O$_3$ thin films on (010) $β$-Ga$_2$O$_3$ substrates. The energies of band-to-band transitions were obtained using the spectroscopic ellipsometry eigenpolarization model approach [A. Mock et al., Phys. Rev. B 95, 165202 (2017)]. After subtracting the effects of strain which also induces additional bowing and after subtraction of the linear portion of the energy shift due to alloying, the bowing parameters associated with the three lowest band-to-band transitions in monoclinic $β$-(Al$_{x}$Ga$_{1-x}$)$_2$O$_3$ are found.
△ Less
Submitted 13 September, 2022;
originally announced September 2022.
-
InstanceFormer: An Online Video Instance Segmentation Framework
Authors:
Rajat Koner,
Tanveer Hannan,
Suprosanna Shit,
Sahand Sharifzadeh,
Matthias Schubert,
Thomas Seidl,
Volker Tresp
Abstract:
Recent transformer-based offline video instance segmentation (VIS) approaches achieve encouraging results and significantly outperform online approaches. However, their reliance on the whole video and the immense computational complexity caused by full Spatio-temporal attention limit them in real-life applications such as processing lengthy videos. In this paper, we propose a single-stage transfor…
▽ More
Recent transformer-based offline video instance segmentation (VIS) approaches achieve encouraging results and significantly outperform online approaches. However, their reliance on the whole video and the immense computational complexity caused by full Spatio-temporal attention limit them in real-life applications such as processing lengthy videos. In this paper, we propose a single-stage transformer-based efficient online VIS framework named InstanceFormer, which is especially suitable for long and challenging videos. We propose three novel components to model short-term and long-term dependency and temporal coherence. First, we propagate the representation, location, and semantic information of prior instances to model short-term changes. Second, we propose a novel memory cross-attention in the decoder, which allows the network to look into earlier instances within a certain temporal window. Finally, we employ a temporal contrastive loss to impose coherence in the representation of an instance across all frames. Memory attention and temporal coherence are particularly beneficial to long-range dependency modeling, including challenging scenarios like occlusion. The proposed InstanceFormer outperforms previous online benchmark methods by a large margin across multiple datasets. Most importantly, InstanceFormer surpasses offline approaches for challenging and long datasets such as YouTube-VIS-2021 and OVIS. Code is available at https://github.com/rajatkoner08/InstanceFormer.
△ Less
Submitted 22 August, 2022;
originally announced August 2022.
-
V-Coder: Adaptive AutoEncoder for Semantic Disclosure in Knowledge Graphs
Authors:
Christian M. M. Frey,
Matthias Schubert
Abstract:
Semantic Web or Knowledge Graphs (KG) emerged to one of the most important information source for intelligent systems requiring access to structured knowledge. One of the major challenges is the extraction and processing of unambiguous information from textual data. Following the human perception, overlap** semantic linkages between two named entities become clear due to our common-sense about t…
▽ More
Semantic Web or Knowledge Graphs (KG) emerged to one of the most important information source for intelligent systems requiring access to structured knowledge. One of the major challenges is the extraction and processing of unambiguous information from textual data. Following the human perception, overlap** semantic linkages between two named entities become clear due to our common-sense about the context a relationship lives in which is not the case when we look at it from an automatically driven process of a machine. In this work, we are interested in the problem of Relational Resolution within the scope of KGs, i.e, we are investigating the inherent semantic of relationships between entities within a network. We propose a new adaptive AutoEncoder, called V-Coder, to identify relations inherently connecting entities from different domains. Those relations can be considered as being ambiguous and are candidates for disentanglement. Likewise to the Adaptive Learning Theory (ART), our model learns new patterns from the KG by increasing units in a competitive layer without discarding the previous observed patterns whilst learning the quality of each relation separately. The evaluation on real-world datasets of Freebase, Yago and NELL shows that the V-Coder is not only able to recover links from corrupted input data, but also shows that the semantic disclosure of relations in a KG show the tendency to improve link prediction. A semantic evaluation wraps the evaluation up.
△ Less
Submitted 22 July, 2022;
originally announced August 2022.
-
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
Authors:
Aarohi Srivastava,
Abhinav Rastogi,
Abhishek Rao,
Abu Awal Md Shoeb,
Abubakar Abid,
Adam Fisch,
Adam R. Brown,
Adam Santoro,
Aditya Gupta,
Adrià Garriga-Alonso,
Agnieszka Kluska,
Aitor Lewkowycz,
Akshat Agarwal,
Alethea Power,
Alex Ray,
Alex Warstadt,
Alexander W. Kocurek,
Ali Safaya,
Ali Tazarv,
Alice Xiang,
Alicia Parrish,
Allen Nie,
Aman Hussain,
Amanda Askell,
Amanda Dsouza
, et al. (426 additional authors not shown)
Abstract:
Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…
▽ More
Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-future capabilities and limitations of language models. To address this challenge, we introduce the Beyond the Imitation Game benchmark (BIG-bench). BIG-bench currently consists of 204 tasks, contributed by 450 authors across 132 institutions. Task topics are diverse, drawing problems from linguistics, childhood development, math, common-sense reasoning, biology, physics, social bias, software development, and beyond. BIG-bench focuses on tasks that are believed to be beyond the capabilities of current language models. We evaluate the behavior of OpenAI's GPT models, Google-internal dense transformer architectures, and Switch-style sparse transformers on BIG-bench, across model sizes spanning millions to hundreds of billions of parameters. In addition, a team of human expert raters performed all tasks in order to provide a strong baseline. Findings include: model performance and calibration both improve with scale, but are poor in absolute terms (and when compared with rater performance); performance is remarkably similar across model classes, though with benefits from sparsity; tasks that improve gradually and predictably commonly involve a large knowledge or memorization component, whereas tasks that exhibit "breakthrough" behavior at a critical scale often involve multiple steps or components, or brittle metrics; social bias typically increases with scale in settings with ambiguous context, but this can be improved with prompting.
△ Less
Submitted 12 June, 2023; v1 submitted 9 June, 2022;
originally announced June 2022.
-
Simulating the 1976 Teton Dam Failure using Geoclaw and HEC-RAS and comparing with Historical Observations
Authors:
Hannah Spero,
Donna Calhoun,
Michael Schubert
Abstract:
Dam failures occur worldwide, often from factors including aging structures, extreme hydrologic loading, and design oversights related to the changing climate. Understanding and mitigating risk to downstream inhabited areas require develo** and improving low-cost high-fidelity tools, such as numerical models, which allow emergency managers to predict the consequences of dam failures better. Two-…
▽ More
Dam failures occur worldwide, often from factors including aging structures, extreme hydrologic loading, and design oversights related to the changing climate. Understanding and mitigating risk to downstream inhabited areas require develo** and improving low-cost high-fidelity tools, such as numerical models, which allow emergency managers to predict the consequences of dam failures better. Two-dimensional (2D) depth-averaged hydraulic models can provide valuable insights into the importance of breach parameters or downstream flow characteristics, but historical studies considering historic failures using real topographies are less common in literature. This study compares Geoclaw, a 2D hydraulic model with adaptive mesh refinement capabilities, to an industry-standard software HEC-RAS (Hydrologic Engineering Center - River Analysis System) using the 1976 Teton Dam failure as a case study. The suitability of Geoclaw for dam failure modeling is determined based on its capability to resolve inundation extent and flood wave arrival times. This study performs sensitivity analyses of the HEC-RAS model to compare an instantaneous dam breach assumption with a time-dependent breach formation for quantifying the model uncertainty. We find the 2D Geoclaw dam-break model results compare reasonably with historical gauge and field observational data and HEC-RAS results. The model demonstrates stability and relatively low computational costs. Our findings highlight opportunities for future work, with the Geoclaw software performance supporting continued studies to evaluate performance. Outcomes of this study will assist dam owners, floodplain managers, and emergency managers by providing an additional tool for estimating the impacts of dam failures to protect lives and infrastructure downstream.
△ Less
Submitted 17 July, 2022; v1 submitted 15 May, 2022;
originally announced June 2022.
-
Uncertainty Quantification and Resource-Demanding Computer Vision Applications of Deep Learning
Authors:
Julian Burghoff,
Robin Chan,
Hanno Gottschalk,
Annika Muetze,
Tobias Riedlinger,
Matthias Rottmann,
Marius Schubert
Abstract:
Bringing deep neural networks (DNNs) into safety critical applications such as automated driving, medical imaging and finance, requires a thorough treatment of the model's uncertainties. Training deep neural networks is already resource demanding and so is also their uncertainty quantification. In this overview article, we survey methods that we developed to teach DNNs to be uncertain when they en…
▽ More
Bringing deep neural networks (DNNs) into safety critical applications such as automated driving, medical imaging and finance, requires a thorough treatment of the model's uncertainties. Training deep neural networks is already resource demanding and so is also their uncertainty quantification. In this overview article, we survey methods that we developed to teach DNNs to be uncertain when they encounter new object classes. Additionally, we present training methods to learn from only a few labels with help of uncertainty quantification. Note that this is typically paid with a massive overhead in computation of an order of magnitude and more compared to ordinary network training. Finally, we survey our work on neural architecture search which is also an order of magnitude more resource demanding then ordinary network training.
△ Less
Submitted 30 May, 2022;
originally announced May 2022.
-
Closed-form max-min power control for some cellular and cell-free massive MIMO networks
Authors:
Lorenzo Miretti,
Renato L. G. Cavalcante,
Slawomir Stanczak,
Martin Schubert,
Ronald Boehnke,
Wen Xu
Abstract:
Many common instances of power control problems for cellular and cell-free massive MIMO networks can be interpreted as max-min utility optimization problems involving affine interference map**s and polyhedral constraints. We show that these problems admit a closed-form solution which depends on the spectral radius of known matrices. In contrast, previous solutions in the literature have been ind…
▽ More
Many common instances of power control problems for cellular and cell-free massive MIMO networks can be interpreted as max-min utility optimization problems involving affine interference map**s and polyhedral constraints. We show that these problems admit a closed-form solution which depends on the spectral radius of known matrices. In contrast, previous solutions in the literature have been indirectly obtained using iterative algorithms based on the bisection method, or on fixed-point iterations. Furthermore, we also show an asymptotically tight bound for the optimal utility, which in turn provides a simple rule of thumb for evaluating whether the network is operating in the noise or interference limited regime. We finally illustrate our results by focusing on classical max-min fair power control for cell-free massive MIMO networks.
△ Less
Submitted 3 May, 2022; v1 submitted 1 May, 2022;
originally announced May 2022.
-
Box Supervised Video Segmentation Proposal Network
Authors:
Tanveer Hannan,
Rajat Koner,
Jonathan Kobold,
Matthias Schubert
Abstract:
Video Object Segmentation (VOS) has been targeted by various fully-supervised and self-supervised approaches. While fully-supervised methods demonstrate excellent results, self-supervised ones, which do not use pixel-level ground truth, attract much attention. However, self-supervised approaches pose a significant performance gap. Box-level annotations provide a balanced compromise between labelin…
▽ More
Video Object Segmentation (VOS) has been targeted by various fully-supervised and self-supervised approaches. While fully-supervised methods demonstrate excellent results, self-supervised ones, which do not use pixel-level ground truth, attract much attention. However, self-supervised approaches pose a significant performance gap. Box-level annotations provide a balanced compromise between labeling effort and result quality for image segmentation but have not been exploited for the video domain. In this work, we propose a box-supervised video object segmentation proposal network, which takes advantage of intrinsic video properties. Our method incorporates object motion in the following way: first, motion is computed using a bidirectional temporal difference and a novel bounding box-guided motion compensation. Second, we introduce a novel motion-aware affinity loss that encourages the network to predict positive pixel pairs if they share similar motion and color. The proposed method outperforms the state-of-the-art self-supervised benchmark by 16.4% and 6.9% $\mathcal{J}$ &$\mathcal{F}$ score and the majority of fully supervised methods on the DAVIS and Youtube-VOS dataset without imposing network architectural specifications. We provide extensive tests and ablations on the datasets, demonstrating the robustness of our method.
△ Less
Submitted 16 February, 2022; v1 submitted 14 February, 2022;
originally announced February 2022.
-
Massively parallel pixel-by-pixel nanophotonic optimization using a Green's function formalism
Authors:
Jiahui Wang,
Alfred K. C. Cheung,
Aleksandra Spyra,
Ian A. D. Williamson,
Jian Guan,
Martin F. Schubert
Abstract:
We introduce an efficient parallelization scheme to implement pixel-by-pixel nanophotonic optimization using a Green's function based formalism. The crucial insight in our proposal is the reframing of the optimization algorithm as a large-scale data processing pipeline, which allows for the efficient distribution of computational tasks across thousands of workers. We demonstrate the utility of our…
▽ More
We introduce an efficient parallelization scheme to implement pixel-by-pixel nanophotonic optimization using a Green's function based formalism. The crucial insight in our proposal is the reframing of the optimization algorithm as a large-scale data processing pipeline, which allows for the efficient distribution of computational tasks across thousands of workers. We demonstrate the utility of our implementation by exercising it to optimize a high numerical aperture focusing metalens at problem sizes that would otherwise be far out of reach for the Green's function based method. Finally, we highlight the connection to powerful ideas from reinforcement learning as a natural corollary of reinterpreting the nanophotonic inverse design problem as a graph traversal enabled by the pixel-by-pixel optimization paradigm.
△ Less
Submitted 10 February, 2022;
originally announced February 2022.
-
Inverse design of photonic devices with strict foundry fabrication constraints
Authors:
Martin F. Schubert,
Alfred K. C. Cheung,
Ian A. D. Williamson,
Aleksandra Spyra,
David H. Alexander
Abstract:
We introduce a new method for inverse design of nanophotonic devices which guarantees that resulting designs satisfy strict length scale constraints - including minimum width and spacing constraints required by commercial semiconductor foundries. The method adopts several concepts from machine learning to transform the problem of topology optimization with strict length scale constraints to an unc…
▽ More
We introduce a new method for inverse design of nanophotonic devices which guarantees that resulting designs satisfy strict length scale constraints - including minimum width and spacing constraints required by commercial semiconductor foundries. The method adopts several concepts from machine learning to transform the problem of topology optimization with strict length scale constraints to an unconstrained stochastic gradient optimization problem. Specifically, we introduce a conditional generator for feasible designs and adopt a straight-through estimator for backpropagation of gradients to a latent design. We demonstrate the performance and reliability of our method by designing several common integrated photonic components.
△ Less
Submitted 13 June, 2022; v1 submitted 30 January, 2022;
originally announced January 2022.
-
Terahertz electron paramagnetic resonance generalized spectroscopic ellipsometry: The magnetic response of the nitrogen defect in 4H-SiC
Authors:
Mathias Schubert,
Sean Knight,
Steffen Richter,
Philipp Kühne,
Vallery Stanishev,
Alexander Ruder,
Megan Stokey,
Rafal Korlacki,
Klaus Irmscher,
Petr Neugebauer,
Vanya Darakchieva
Abstract:
We report on terahertz (THz) electron paramagnetic resonance generalized spectroscopic ellipsometry (THz-EPR-GSE). Measurements of the field and frequency dependencies of the magnetic response due to the spin transitions associated with the nitrogen defect in 4H-SiC are shown as an example. THz-EPR-GSE dispenses with the need of a cavity, permits independently scanning field and frequency paramete…
▽ More
We report on terahertz (THz) electron paramagnetic resonance generalized spectroscopic ellipsometry (THz-EPR-GSE). Measurements of the field and frequency dependencies of the magnetic response due to the spin transitions associated with the nitrogen defect in 4H-SiC are shown as an example. THz-EPR-GSE dispenses with the need of a cavity, permits independently scanning field and frequency parameters, and does not require field or frequency modulation. We investigate spin transitions of hexagonal ($h$) and cubic ($k$) coordinated nitrogen including coupling with its nuclear spin (I=1), and we propose a model approach for the magnetic susceptibility to account for the spin transitions. From the THz-EPR-GSE measurements we can fully determine the polarization properties of the spin transitions and we obtain $g$ and hyperfine splitting parameters using magnetic field and frequency dependent Lorentzian oscillator lineshape functions. We propose frequency-scanning THz-EPR-GSE as a new and versatile method to study properties of spins in solid state materials.
△ Less
Submitted 17 January, 2022;
originally announced January 2022.
-
Conifer Seedling Detection in UAV-Imagery with RGB-Depth Information
Authors:
Jason Jooste,
Michael Fromm,
Matthias Schubert
Abstract:
Monitoring of reforestation is currently being considerably streamlined through the use of drones and image recognition algorithms, which have already proven to be effective on colour imagery. In addition to colour imagery, elevation data is often also available. The primary aim of this work was to improve the performance of the faster-RCNN object detection algorithm by integrating this height inf…
▽ More
Monitoring of reforestation is currently being considerably streamlined through the use of drones and image recognition algorithms, which have already proven to be effective on colour imagery. In addition to colour imagery, elevation data is often also available. The primary aim of this work was to improve the performance of the faster-RCNN object detection algorithm by integrating this height information, which showed itself to notably improve performance. Interestingly, the structure of the network played a key role, with direct addition of the height information as a fourth image channel showing no improvement, while integration after the backbone network and before the region proposal network led to marked improvements. This effect persisted with very long training regimes. Increasing the resolution of this height information also showed little effect.
△ Less
Submitted 22 November, 2021;
originally announced November 2021.
-
APPTeK: Agent-Based Predicate Prediction in Temporal Knowledge Graphs
Authors:
Christian M. M. Frey,
Yunpu Ma,
Matthias Schubert
Abstract:
In temporal Knowledge Graphs (tKGs), the temporal dimension is attached to facts in a knowledge base resulting in quadruples between entities such as (Nintendo, released, Super Mario, Sep-13-1985), where the predicate holds within a time interval or at a timestamp. We propose a reinforcement learning agent gathering temporal relevant information about the query entities' neighborhoods, simultaneou…
▽ More
In temporal Knowledge Graphs (tKGs), the temporal dimension is attached to facts in a knowledge base resulting in quadruples between entities such as (Nintendo, released, Super Mario, Sep-13-1985), where the predicate holds within a time interval or at a timestamp. We propose a reinforcement learning agent gathering temporal relevant information about the query entities' neighborhoods, simultaneously. We refer to the encodings of the explored graph structures as fingerprints which are used as input to a Q-network. Our agent decides sequentially which relation type needs to be explored next to expand the local subgraphs of the query entities. Our evaluation shows that the proposed method yields competitive results compared to state-of-the-art embedding algorithms for tKGs, and we additionally gain information about the relevant structures between subjects and objects.
△ Less
Submitted 21 July, 2022; v1 submitted 27 October, 2021;
originally announced October 2021.
-
SEA: Graph Shell Attention in Graph Neural Networks
Authors:
Christian M. M. Frey,
Yunpu Ma,
Matthias Schubert
Abstract:
A common issue in Graph Neural Networks (GNNs) is known as over-smoothing. By increasing the number of iterations within the message-passing of GNNs, the nodes' representations of the input graph align with each other and become indiscernible. Recently, it has been shown that increasing a model's complexity by integrating an attention mechanism yields more expressive architectures. This is majorly…
▽ More
A common issue in Graph Neural Networks (GNNs) is known as over-smoothing. By increasing the number of iterations within the message-passing of GNNs, the nodes' representations of the input graph align with each other and become indiscernible. Recently, it has been shown that increasing a model's complexity by integrating an attention mechanism yields more expressive architectures. This is majorly contributed to steering the nodes' representations only towards nodes that are more informative than others. Transformer models in combination with GNNs result in architectures including Graph Transformer Layers (GTL), where layers are entirely based on the attention operation. However, the calculation of a node's representation is still restricted to the computational working flow of a GNN. In our work, we relax the GNN architecture by means of implementing a routing heuristic. Specifically, the nodes' representations are routed to dedicated experts. Each expert calculates the representations according to their respective GNN workflow. The definitions of distinguishable GNNs result from k-localized views starting from the central node. We call this procedure Graph Shell Attention (SEA), where experts process different subgraphs in a transformer-motivated fashion. Intuitively, by increasing the number of experts, the models gain in expressiveness such that a node's representation is solely based on nodes that are located within the receptive field of an expert. We evaluate our architecture on various benchmark datasets showing competitive results compared to state-of-the-art models.
△ Less
Submitted 20 October, 2021;
originally announced October 2021.
-
Infrared dielectric functions and Brillouin zone center phonons of $α$-Ga$_2$O$_3$ compared to $α$-Al$_2$O$_3$
Authors:
Megan Stokey,
Rafal Korlacki,
Matthew Hilfiker,
Sean Knight,
Steffen Richter,
Vanya Darakchieva,
Riena **no,
Yong** Cho,
Huili Grace Xing,
Debdeep Jena,
Yuichi Oshima,
Kamruzzaman Khan,
Elaheh Ahmadi,
Mathias Schubert
Abstract:
We determine the anisotropic dielectric functions of rhombohedral $α$-Ga$_2$O$_3$ by far-infrared and infrared generalized spectroscopic ellipsometry and derive all transverse optical and longitudinal optical phonon mode frequencies and broadening parameters. We also determine the high frequency and static dielectric constants. We perform density functional theory computations and determine the ph…
▽ More
We determine the anisotropic dielectric functions of rhombohedral $α$-Ga$_2$O$_3$ by far-infrared and infrared generalized spectroscopic ellipsometry and derive all transverse optical and longitudinal optical phonon mode frequencies and broadening parameters. We also determine the high frequency and static dielectric constants. We perform density functional theory computations and determine the phonon dispersion for all branches in the Brillouin zone, and we derive all phonon mode parameters at the Brillouin zone center including Raman-active, infrared-active, and silent modes. Excellent agreement is obtained between our experimental and computation results as well as among all previously reported partial information from experiment and theory. We also compute the same information for $α$-Al$_2$O$_3$, the binary parent compound for the emerging alloy of $α$-(Al$_{x}$Ga$_{1-x}$)$_2$O$_3$, and use results from previous investigations [Schubert, Tiwald, and Herzinger, Phys. Rev. B 61, 8187 (2000)] to compare all properties among the two isostructural compounds. From both experimental and theoretical investigations we compute the frequency shifts of all modes between the two compounds. Additionally, we calculate overlap parameters between phonon mode eigenvectors and discuss the possible evolution of all phonon modes into the ternary alloy system and whether modes may form single mode or more complex mode behaviors.
△ Less
Submitted 26 August, 2021;
originally announced August 2021.