-
Decoherence of dielectric particles by thermal emission
Authors:
Jonas Schäfer,
Benjamin A. Stickler,
Klaus Hornberger
Abstract:
Levitated nanoparticles are a promising platform for sensing applications and for macroscopic quantum experiments. While the nanoparticles' motional temperatures can be reduced to near absolute zero, their uncontrolled internal degrees of freedom remain much hotter, inevitably leading to the emission of heat radiation. The decoherence and motional heating caused by this thermal emission process is…
▽ More
Levitated nanoparticles are a promising platform for sensing applications and for macroscopic quantum experiments. While the nanoparticles' motional temperatures can be reduced to near absolute zero, their uncontrolled internal degrees of freedom remain much hotter, inevitably leading to the emission of heat radiation. The decoherence and motional heating caused by this thermal emission process is still poorly understood beyond the case of the center-of-mass motion of point particles. Here, we present the master equation describing the impact of heat radiation on the motional quantum state of arbitrarily sized and shaped dielectric rigid rotors. It predicts the localization of spatio-orientational superpositions only based on the bulk material properties and the particle geometry. A counter-intuitive and experimentally relevant implication of the presented theory is that orientational superpositions of optically isotropic bodies are not protected by their symmetry, even in the small-particle limit.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
H.E.S.S. observations of the 2021 periastron passage of PSR B1259-63/LS 2883
Authors:
H. E. S. S. Collaboration,
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
M. Bouyahiaoui,
R. Brose,
A. Brown,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin,
S. Caroff,
S. Casanova
, et al. (119 additional authors not shown)
Abstract:
PSR B1259-63 is a gamma-ray binary system that hosts a pulsar in an eccentric orbit, with a 3.4 year period, around an O9.5Ve star. At orbital phases close to periastron passages, the system radiates bright and variable non-thermal emission. We report on an extensive VHE observation campaign conducted with the High Energy Stereoscopic System, comprised of ~100 hours of data taken from $t_p-24$ day…
▽ More
PSR B1259-63 is a gamma-ray binary system that hosts a pulsar in an eccentric orbit, with a 3.4 year period, around an O9.5Ve star. At orbital phases close to periastron passages, the system radiates bright and variable non-thermal emission. We report on an extensive VHE observation campaign conducted with the High Energy Stereoscopic System, comprised of ~100 hours of data taken from $t_p-24$ days to $t_p+127$ days around the system's 2021 periastron passage. We also present the timing and spectral analyses of the source. The VHE light curve in 2021 is consistent with the stacked light curve of all previous observations. Within the light curve, we report a VHE maximum at times coincident with the third X-ray peak first detected in the 2021 X-ray light curve. In the light curve -- although sparsely sampled in this time period -- we see no VHE enhancement during the second disc crossing. In addition, we see no correspondence to the 2021 GeV flare in the VHE light curve. The VHE spectrum obtained from the analysis of the 2021 dataset is best described by a power law of spectral index $Γ= 2.65 \pm 0.04_{\text{stat}}$ $\pm 0.04_{\text{sys}}$, a value consistent with the previous H.E.S.S. observations of the source. We report spectral variability with a difference of $ΔΓ= 0.56 ~\pm~ 0.18_{\text{stat}}$ $~\pm~0.10_{\text{sys}}$ at 95% c.l., between sub-periods of the 2021 dataset. We also find a linear correlation between contemporaneous flux values of X-ray and TeV datasets, detected mainly after $t_p+25$ days, suggesting a change in the available energy for non-thermal radiation processes. We detect no significant correlation between GeV and TeV flux points, within the uncertainties of the measurements, from $\sim t_p-23$ days to $\sim t_p+126$ days. This suggests that the GeV and TeV emission originate from different electron populations.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Triple products of eigenfunctions and spectral geometry
Authors:
Joe Schaefer
Abstract:
Using elementary techniques from Geometric Analysis, Partial Differential Equations, and Abelian $C^*$ Algebras, we uncover a novel, yet familiar, global geometric invariant -- namely the indexed set of integrals of triple products of eigenfunctions of the Laplace-Beltrami operator, to precisely characterize which isospectral closed Riemannian manifolds are isometric.
Using elementary techniques from Geometric Analysis, Partial Differential Equations, and Abelian $C^*$ Algebras, we uncover a novel, yet familiar, global geometric invariant -- namely the indexed set of integrals of triple products of eigenfunctions of the Laplace-Beltrami operator, to precisely characterize which isospectral closed Riemannian manifolds are isometric.
△ Less
Submitted 16 April, 2024;
originally announced June 2024.
-
Combinatorial Printing of Functionally Graded Solid-State Electrolyte for High-Voltage Lithium Metal Batteries
Authors:
Qiang Jiang,
Stephanie Atampugre,
Yipu Du,
Lingyu Yang,
Jennifer L. Schaefer,
Yanliang Zhang
Abstract:
Heterogeneous multilayered solid-state electrolyte (HMSSE) has been widely explored for their broadened working voltage range and compatibility with electrodes. However, due to the limitations of traditional manufacturing methods such as casting, the interface between electrolyte layers in HMSSE can decrease the ionic conductivity severely. Here, a novel combinatory aerosol jet printing (CAJP) is…
▽ More
Heterogeneous multilayered solid-state electrolyte (HMSSE) has been widely explored for their broadened working voltage range and compatibility with electrodes. However, due to the limitations of traditional manufacturing methods such as casting, the interface between electrolyte layers in HMSSE can decrease the ionic conductivity severely. Here, a novel combinatory aerosol jet printing (CAJP) is introduced to fabricate functionally graded solid-state electrolyte (FGSSE) without sharp interface. Owing to the unique ability of CAJP (in-situ mixing and instantaneous tuning of the mixing ratio), FGSSE with smooth microscale compositional gradation is achieved. Electrochemical tests show that FGSSE has excellent oxidative stability exceeding 5.5 V and improved conductivity (>7 times of an analogous HMSSE). By decoupling the total resistance, we show that the resistance from the electrolyte/electrolyte interface of HMSSE is 5.7 times of the total resistance of FGSSE. The Li/FGSSE/NCM622 cell can be stably run for more than 200 cycles along with improved rate performance.
△ Less
Submitted 13 April, 2024;
originally announced April 2024.
-
Echo Chambers in the Age of Algorithms: An Audit of Twitter's Friend Recommender System
Authors:
Kayla Duskin,
Joseph S. Schafer,
Jevin D. West,
Emma S. Spiro
Abstract:
The presence of political misinformation and ideological echo chambers on social media platforms is concerning given the important role that these sites play in the public's exposure to news and current events. Algorithmic systems employed on these platforms are presumed to play a role in these phenomena, but little is known about their mechanisms and effects. In this work, we conduct an algorithm…
▽ More
The presence of political misinformation and ideological echo chambers on social media platforms is concerning given the important role that these sites play in the public's exposure to news and current events. Algorithmic systems employed on these platforms are presumed to play a role in these phenomena, but little is known about their mechanisms and effects. In this work, we conduct an algorithmic audit of Twitter's Who-To-Follow friend recommendation system, the first empirical audit that investigates the impact of this algorithm in-situ. We create automated Twitter accounts that initially follow left and right affiliated U.S. politicians during the 2022 U.S. midterm elections and then grow their information networks using the platform's recommender system. We pair the experiment with an observational study of Twitter users who already follow the same politicians. Broadly, we find that while following the recommendation algorithm leads accounts into dense and reciprocal neighborhoods that structurally resemble echo chambers, the recommender also results in less political homogeneity of a user's network compared to accounts growing their networks through social endorsement. Furthermore, accounts that exclusively followed users recommended by the algorithm had fewer opportunities to encounter content centered on false or misleading election narratives compared to choosing friends based on social endorsement.
△ Less
Submitted 9 April, 2024;
originally announced April 2024.
-
An image-computable model of speeded decision-making
Authors:
Paul I. Jaffe,
Gustavo X. Santiago-Reyes,
Robert J. Schafer,
Patrick G. Bissett,
Russell A. Poldrack
Abstract:
Evidence accumulation models (EAMs) are the dominant framework for modeling response time (RT) data from speeded decision-making tasks. While providing a good quantitative description of RT data in terms of abstract perceptual representations, EAMs do not explain how the visual system extracts these representations in the first place. To address this limitation, we introduce the visual accumulator…
▽ More
Evidence accumulation models (EAMs) are the dominant framework for modeling response time (RT) data from speeded decision-making tasks. While providing a good quantitative description of RT data in terms of abstract perceptual representations, EAMs do not explain how the visual system extracts these representations in the first place. To address this limitation, we introduce the visual accumulator model (VAM), in which convolutional neural network models of visual processing and traditional EAMs are jointly fitted to trial-level RTs and raw (pixel-space) visual stimuli from individual subjects. Models fitted to large-scale cognitive training data from a stylized flanker task captured individual differences in congruency effects, RTs, and accuracy. We find evidence that the selection of task-relevant information occurs through the orthogonalization of relevant and irrelevant representations, demonstrating how our framework can be used to relate visual representations to behavioral outputs. Together, our work provides a probabilistic framework for both constraining neural network models of vision with behavioral data and studying how the visual system extracts representations that guide decisions.
△ Less
Submitted 24 March, 2024;
originally announced March 2024.
-
On Integer Programs with Irrational Data
Authors:
Seyedmohammadhossein Hosseinian,
Andrew J. Schaefer
Abstract:
An integer program (IP) with a finite number of feasible solutions may have an unbounded linear programming relaxation if it contains irrational parameters, due to implicit constraints enforced by the irrational numbers. We show that those constraints can be obtained if the irrational parameters are polynomials of roots of integers over the field of rational numbers, leading to an equivalent ratio…
▽ More
An integer program (IP) with a finite number of feasible solutions may have an unbounded linear programming relaxation if it contains irrational parameters, due to implicit constraints enforced by the irrational numbers. We show that those constraints can be obtained if the irrational parameters are polynomials of roots of integers over the field of rational numbers, leading to an equivalent rational formulation. We also establish a weaker result for IPs involving the general class of algebraic irrational parameters, which extends to IPs with a particular form of transcendental numbers.
△ Less
Submitted 11 February, 2024;
originally announced February 2024.
-
Acceleration and transport of relativistic electrons in the jets of the microquasar SS 433
Authors:
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
M. Bouyahiaou,
M. Breuhau,
R. Brose,
A. M. Brown,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin,
S. Caroff
, et al. (140 additional authors not shown)
Abstract:
SS 433 is a microquasar, a stellar binary system with collimated relativistic jets. We observed SS 433 in gamma rays using the High Energy Stereoscopic System (H.E.S.S.), finding an energy-dependent shift in the apparent position of the gamma-ray emission of the parsec-scale jets. These observations trace the energetic electron population and indicate the gamma rays are produced by inverse-Compton…
▽ More
SS 433 is a microquasar, a stellar binary system with collimated relativistic jets. We observed SS 433 in gamma rays using the High Energy Stereoscopic System (H.E.S.S.), finding an energy-dependent shift in the apparent position of the gamma-ray emission of the parsec-scale jets. These observations trace the energetic electron population and indicate the gamma rays are produced by inverse-Compton scattering. Modelling of the energy-dependent gamma-ray morphology constrains the location of particle acceleration and requires an abrupt deceleration of the jet flow. We infer the presence of shocks on either side of the binary system at distances of 25 to 30 parsecs and conclude that self-collimation of the precessing jets forms the shocks, which then efficiently accelerate electrons.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
Viral Privacy: Contextual Integrity as a Lens to Understand Content Creators' Privacy Perceptions and Needs After Sudden Attention
Authors:
Joseph S. Schafer,
Annie Denton,
Chloe Seelhoff,
Jordyn Vo,
Kate Starbird
Abstract:
When designing multi-stakeholder privacy systems, it is important to consider how different groups of social media users have different goals and requirements for privacy. Additionally, we must acknowledge that it is important to keep in mind that even a single creator's needs can change as their online visibility and presence shifts, and that robust multi-stakeholder privacy systems should accoun…
▽ More
When designing multi-stakeholder privacy systems, it is important to consider how different groups of social media users have different goals and requirements for privacy. Additionally, we must acknowledge that it is important to keep in mind that even a single creator's needs can change as their online visibility and presence shifts, and that robust multi-stakeholder privacy systems should account for these shifts. Using the framework of contextual integrity, we explain a theoretical basis for how to evaluate the potential changing privacy needs of users as their profiles undergo a sudden rise in online attention, and ongoing projects to understand these potential shifts in perspectives.
△ Less
Submitted 18 December, 2023;
originally announced December 2023.
-
Towards Incorporating Researcher Safety into Information Integrity Research Ethics
Authors:
Joseph S. Schafer,
Kate Starbird
Abstract:
Traditional research ethics has mainly and rightly been focused on making sure that participants are treated safely, justly, and ethically, to avoid the violation of their rights or putting participants in harm's way. Information integrity research within CSCW has also correspondingly mainly focused on these issues, and the focus of internet research ethics has primarily focused on increasing prot…
▽ More
Traditional research ethics has mainly and rightly been focused on making sure that participants are treated safely, justly, and ethically, to avoid the violation of their rights or putting participants in harm's way. Information integrity research within CSCW has also correspondingly mainly focused on these issues, and the focus of internet research ethics has primarily focused on increasing protections of participant data. However, as branches of internet research focus on more fraught contexts such as information integrity and problematic information, more explicit consideration of other ethical frames and subjects is warranted. In this workshop paper, we argue that researcher protections should be more explicitly considered and acknowledged in these studies, and should be considered alongside more standard ethical considerations for participants and for broader society.
△ Less
Submitted 28 December, 2023; v1 submitted 14 December, 2023;
originally announced December 2023.
-
Estimating Post-Synaptic Effects for Online Training of Feed-Forward SNNs
Authors:
Thomas Summe,
Clemens JS Schaefer,
Siddharth Joshi
Abstract:
Facilitating online learning in spiking neural networks (SNNs) is a key step in develo** event-based models that can adapt to changing environments and learn from continuous data streams in real-time. Although forward-mode differentiation enables online learning, its computational requirements restrict scalability. This is typically addressed through approximations that limit learning in deep mo…
▽ More
Facilitating online learning in spiking neural networks (SNNs) is a key step in develo** event-based models that can adapt to changing environments and learn from continuous data streams in real-time. Although forward-mode differentiation enables online learning, its computational requirements restrict scalability. This is typically addressed through approximations that limit learning in deep models. In this study, we propose Online Training with Postsynaptic Estimates (OTPE) for training feed-forward SNNs, which approximates Real-Time Recurrent Learning (RTRL) by incorporating temporal dynamics not captured by current approximations, such as Online Training Through Time (OTTT) and Online Spatio-Temporal Learning (OSTL). We show improved scaling for multi-layer networks using a novel approximation of temporal effects on the subsequent layer's activity. This approximation incurs minimal overhead in the time and space complexity compared to similar algorithms, and the calculation of temporal effects remains local to each layer. We characterize the learning performance of our proposed algorithms on multiple SNN model configurations for rate-based and time-based encoding. OTPE exhibits the highest directional alignment to exact gradients, calculated with backpropagation through time (BPTT), in deep networks and, on time-based encoding, outperforms other approximate methods. We also observe sizeable gains in average performance over similar algorithms in offline training of Spiking Heidelberg Digits with equivalent hyper-parameters (OTTT/OSTL - 70.5%; OTPE - 75.2%; BPTT - 78.1%).
△ Less
Submitted 7 November, 2023;
originally announced November 2023.
-
Hadamard Domain Training with Integers for Class Incremental Quantized Learning
Authors:
Martin Schiemer,
Clemens JS Schaefer,
Jayden Parker Vap,
Mark James Horeni,
Yu Emma Wang,
Juan Ye,
Siddharth Joshi
Abstract:
Continual learning is a desirable feature in many modern machine learning applications, which allows in-field adaptation and updating, ranging from accommodating distribution shift, to fine-tuning, and to learning new tasks. For applications with privacy and low latency requirements, the compute and memory demands imposed by continual learning can be cost-prohibitive for resource-constraint edge p…
▽ More
Continual learning is a desirable feature in many modern machine learning applications, which allows in-field adaptation and updating, ranging from accommodating distribution shift, to fine-tuning, and to learning new tasks. For applications with privacy and low latency requirements, the compute and memory demands imposed by continual learning can be cost-prohibitive for resource-constraint edge platforms. Reducing computational precision through fully quantized training (FQT) simultaneously reduces memory footprint and increases compute efficiency for both training and inference. However, aggressive quantization especially integer FQT typically degrades model accuracy to unacceptable levels. In this paper, we propose a technique that leverages inexpensive Hadamard transforms to enable low-precision training with only integer matrix multiplications. We further determine which tensors need stochastic rounding and propose tiled matrix multiplication to enable low-bit width accumulators. We demonstrate the effectiveness of our technique on several human activity recognition datasets and CIFAR100 in a class incremental learning setting. We achieve less than 0.5% and 3% accuracy degradation while we quantize all matrix multiplications inputs down to 4-bits with 8-bit accumulators.
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
Emergent mechanisms for long timescales depend on training curriculum and affect performance in memory tasks
Authors:
Sina Khajehabdollahi,
Roxana Zeraati,
Emmanouil Giannakakis,
Tim Jakob Schäfer,
Georg Martius,
Anna Levina
Abstract:
Recurrent neural networks (RNNs) in the brain and in silico excel at solving tasks with intricate temporal dependencies. Long timescales required for solving such tasks can arise from properties of individual neurons (single-neuron timescale, $τ$, e.g., membrane time constant in biological neurons) or recurrent interactions among them (network-mediated timescale). However, the contribution of each…
▽ More
Recurrent neural networks (RNNs) in the brain and in silico excel at solving tasks with intricate temporal dependencies. Long timescales required for solving such tasks can arise from properties of individual neurons (single-neuron timescale, $τ$, e.g., membrane time constant in biological neurons) or recurrent interactions among them (network-mediated timescale). However, the contribution of each mechanism for optimally solving memory-dependent tasks remains poorly understood. Here, we train RNNs to solve $N$-parity and $N$-delayed match-to-sample tasks with increasing memory requirements controlled by $N$ by simultaneously optimizing recurrent weights and $τ$s. We find that for both tasks RNNs develop longer timescales with increasing $N$, but depending on the learning objective, they use different mechanisms. Two distinct curricula define learning objectives: sequential learning of a single-$N$ (single-head) or simultaneous learning of multiple $N$s (multi-head). Single-head networks increase their $τ$ with $N$ and are able to solve tasks for large $N$, but they suffer from catastrophic forgetting. However, multi-head networks, which are explicitly required to hold multiple concurrent memories, keep $τ$ constant and develop longer timescales through recurrent connectivity. Moreover, we show that the multi-head curriculum increases training speed and network stability to ablations and perturbations, and allows RNNs to generalize better to tasks beyond their training regime. This curriculum also significantly improves training GRUs and LSTMs for large-$N$ tasks. Our results suggest that adapting timescales to task requirements via recurrent interactions allows learning more complex objectives and improves the RNN's performance.
△ Less
Submitted 22 September, 2023;
originally announced September 2023.
-
Relaxations and Duality for Multiobjective Integer Programming
Authors:
Alex Dunbar,
Saumya Sinha,
Andrew J Schaefer
Abstract:
Multiobjective integer programs (MOIPs) simultaneously optimize multiple objective functions over a set of linear constraints and integer variables. In this paper, we present continuous, convex hull and Lagrangian relaxations for MOIPs and examine the relationship among them. The convex hull relaxation is tight at supported solutions, i.e., those that can be derived via a weighted-sum scalarizatio…
▽ More
Multiobjective integer programs (MOIPs) simultaneously optimize multiple objective functions over a set of linear constraints and integer variables. In this paper, we present continuous, convex hull and Lagrangian relaxations for MOIPs and examine the relationship among them. The convex hull relaxation is tight at supported solutions, i.e., those that can be derived via a weighted-sum scalarization of the MOIP. At unsupported solutions, the convex hull relaxation is not tight and a Lagrangian relaxation may provide a tighter bound. Using the Lagrangian relaxation, we define a Lagrangian dual of an MOIP that satisfies weak duality and is strong at supported solutions under certain conditions on the primal feasible region. We include a numerical experiment to illustrate that bound sets obtained via Lagrangian duality may yield tighter bounds than those from a convex hull relaxation. Subsequently, we generalize the integer programming value function to MOIPs and use its properties to motivate a set-valued superadditive dual that is strong at supported solutions. We also define a simpler vector-valued superadditive dual that exhibits weak duality but is strongly dual if and only if the primal has a unique nondominated point.
△ Less
Submitted 15 September, 2023;
originally announced September 2023.
-
Locally Adaptive Shrinkage Priors for Trends and Breaks in Count Time Series
Authors:
Toryn L. J. Schafer,
David S. Matteson
Abstract:
Non-stationary count time series characterized by features such as abrupt changes and fluctuations about the trend arise in many scientific domains including biophysics, ecology, energy, epidemiology, and social science domains. Current approaches for integer-valued time series lack the flexibility to capture local transient features while more flexible models for continuous data types are inadequ…
▽ More
Non-stationary count time series characterized by features such as abrupt changes and fluctuations about the trend arise in many scientific domains including biophysics, ecology, energy, epidemiology, and social science domains. Current approaches for integer-valued time series lack the flexibility to capture local transient features while more flexible models for continuous data types are inadequate for universal applications to integer-valued responses such as settings with small counts. We present a modeling framework, the negative binomial Bayesian trend filter (NB-BTF), that offers an adaptive model-based solution to capturing multiscale features with valid integer-valued inference for trend filtering. The framework is a hierarchical Bayesian model with a dynamic global-local shrinkage process. The flexibility of the global-local process allows for the necessary local regularization while the temporal dependence induces a locally smooth trend. In simulation, the NB-BTF outperforms a number of alternative trend filtering methods. Then, we demonstrate the method on weekly power outage frequency in Massachusetts townships. Power outage frequency is characterized by a nominal low level with occasional spikes. These illustrations show the estimation of a smooth, non-stationary trend with adequate uncertainty quantification.
△ Less
Submitted 31 August, 2023;
originally announced September 2023.
-
Programmable Quantum Processors based on Spin Qubits with Mechanically-Mediated Interactions and Transport
Authors:
F. Fung,
E. Rosenfeld,
J. D. Schaefer,
A. Kabcenell,
J. Gieseler,
T. X. Zhou,
T. Madhavan,
N. Aslam,
A. Yacoby,
M. D. Lukin
Abstract:
Solid state spin qubits are promising candidates for quantum information processing, but controlled interactions and entanglement in large, multi-qubit systems are currently difficult to achieve. We describe a method for programmable control of multi-qubit spin systems, in which individual nitrogen-vacancy (NV) centers in diamond nanopillars are coupled to magnetically functionalized silicon nitri…
▽ More
Solid state spin qubits are promising candidates for quantum information processing, but controlled interactions and entanglement in large, multi-qubit systems are currently difficult to achieve. We describe a method for programmable control of multi-qubit spin systems, in which individual nitrogen-vacancy (NV) centers in diamond nanopillars are coupled to magnetically functionalized silicon nitride mechanical resonators in a scanning probe configuration. Qubits can be entangled via interactions with nanomechanical resonators while programmable connectivity is realized via mechanical transport of qubits in nanopillars. To demonstrate the feasibility of this approach, we characterize both the mechanical properties and the magnetic field gradients around the micromagnet placed on the nanobeam resonator. Furthermore, we show coherent manipulation and mechanical transport of a proximal spin qubit by utilizing nuclear spin memory, and use the NV center to detect the time-varying magnetic field from the oscillating micromagnet, extracting a spin-mechanical coupling of 7.7(9) Hz. With realistic improvements the high-cooperativity regime can be reached, offering a new avenue towards scalable quantum information processing with spin qubits.
△ Less
Submitted 22 July, 2023;
originally announced July 2023.
-
Multiwavelength Observations of the Blazar PKS 0735+178 in Spatial and Temporal Coincidence with an Astrophysical Neutrino Candidate IceCube-211208A
Authors:
A. Acharyya,
C. B. Adams,
A. Archer,
P. Bangale,
J. T. Bartkoske,
P. Batista,
W. Benbow,
A. Brill,
J. H. Buckley,
J. L. Christiansen,
A. J. Chromey,
M. Errando,
A. Falcone,
Q. Feng,
G. M. Foote,
L. Fortson,
A. Furniss,
G. Gallagher,
W. Hanlon,
D. Hanna,
O. Hervet,
C. E. Hinrichs,
J. Hoang,
J. Holder,
T. B. Humensky
, et al. (185 additional authors not shown)
Abstract:
We report on multiwavelength target-of-opportunity observations of the blazar PKS 0735+178, located 2.2$^\circ$ away from the best-fit position of the IceCube neutrino event IceCube-211208A detected on December 8, 2021. The source was in a high-flux state in the optical, ultraviolet, X-ray, and GeV gamma-ray bands around the time of the neutrino event, exhibiting daily variability in the soft X-ra…
▽ More
We report on multiwavelength target-of-opportunity observations of the blazar PKS 0735+178, located 2.2$^\circ$ away from the best-fit position of the IceCube neutrino event IceCube-211208A detected on December 8, 2021. The source was in a high-flux state in the optical, ultraviolet, X-ray, and GeV gamma-ray bands around the time of the neutrino event, exhibiting daily variability in the soft X-ray flux. The X-ray data from Swift-XRT and NuSTAR characterize the transition between the low-energy and high-energy components of the broadband spectral energy distribution (SED), and the gamma-ray data from Fermi -LAT, VERITAS, and H.E.S.S. require a spectral cut-off near 100 GeV. Both X-ray and gamma-ray measurements provide strong constraints on the leptonic and hadronic models. We analytically explore a synchrotron self-Compton model, an external Compton model, and a lepto-hadronic model. Models that are entirely based on internal photon fields face serious difficulties in matching the observed SED. The existence of an external photon field in the source would instead explain the observed gamma-ray spectral cut-off in both leptonic and lepto-hadronic models and allow a proton jet power that marginally agrees with the Eddington limit in the lepto-hadronic model. We show a numerical lepto-hadronic model with external target photons that reproduces the observed SED and is reasonably consistent with the neutrino event despite requiring a high jet power.
△ Less
Submitted 30 June, 2023;
originally announced June 2023.
-
Tracking Charge Migration with Frequency-Matched Strobo-Spectroscopy
Authors:
Kyle A. Hamer,
Aderonke S. Folorunso,
Kenneth Lopata,
Kenneth J. Schafer,
Mette B. Gaarde,
Francois Mauger
Abstract:
We present frequency-matched strobo-spectroscopy (FMSS) of charge migration (CM) in bromobutadiyne, simulated with time-dependent density-functional theory. CM+FMSS is a pump-probe scheme that uses a frequency-matched HHG-driving laser as an independent probe step following the creation of a localized hole on the bromine atom that induces CM dynamics. We show that the delay-dependent harmonic yiel…
▽ More
We present frequency-matched strobo-spectroscopy (FMSS) of charge migration (CM) in bromobutadiyne, simulated with time-dependent density-functional theory. CM+FMSS is a pump-probe scheme that uses a frequency-matched HHG-driving laser as an independent probe step following the creation of a localized hole on the bromine atom that induces CM dynamics. We show that the delay-dependent harmonic yield tracks the phase of the CM dynamics through its sensitivity to the amount of electron density on the bromine end of the molecule. FMSS takes advantage of the intrinsic attosecond time resolution of the HHG process, in which different harmonics are emitted at different times and thus probe different locations of the electron hole. Finally, we show that the CM-induced modulation of the HHG signal is dominated by the recombination step of the HHG process, with negligible contribution from the ionization step.
△ Less
Submitted 10 October, 2023; v1 submitted 26 June, 2023;
originally announced June 2023.
-
Augmenting Hessians with Inter-Layer Dependencies for Mixed-Precision Post-Training Quantization
Authors:
Clemens JS Schaefer,
Navid Lambert-Shirzad,
Xiaofan Zhang,
Chiachen Chou,
Tom Jablin,
Jian Li,
Elfie Guo,
Caitlin Stanton,
Siddharth Joshi,
Yu Emma Wang
Abstract:
Efficiently serving neural network models with low latency is becoming more challenging due to increasing model complexity and parameter count. Model quantization offers a solution which simultaneously reduces memory footprint and compute requirements. However, aggressive quantization may lead to an unacceptable loss in model accuracy owing to differences in sensitivity to numerical imperfection a…
▽ More
Efficiently serving neural network models with low latency is becoming more challenging due to increasing model complexity and parameter count. Model quantization offers a solution which simultaneously reduces memory footprint and compute requirements. However, aggressive quantization may lead to an unacceptable loss in model accuracy owing to differences in sensitivity to numerical imperfection across different layers in the model. To address this challenge, we propose a mixed-precision post training quantization (PTQ) approach that assigns different numerical precisions to tensors in a network based on their specific needs, for a reduced memory footprint and improved latency while preserving model accuracy. Previous works rely on layer-wise Hessian information to determine numerical precision, but as we demonstrate, Hessian estimation is typically insufficient in determining an effective ordering of layer sensitivities. We address this by augmenting the estimated Hessian with additional information to capture inter-layer dependencies. We demonstrate that this consistently improves PTQ performance along the accuracy-latency Pareto frontier across multiple models. Our method combines second-order information and inter-layer dependencies to guide a bisection search, finding quantization configurations within a user-configurable model accuracy degradation range. We evaluate the effectiveness of our method on the ResNet50, MobileNetV2, and BERT models. Our experiments demonstrate latency reductions compared to a 16-bit baseline of $25.48\%$, $21.69\%$, and $33.28\%$ respectively, while maintaining model accuracy to within $99.99\%$ of the baseline model.
△ Less
Submitted 7 June, 2023;
originally announced June 2023.
-
Constraining the cosmic-ray pressure in the inner Virgo Cluster using H.E.S.S. observations of M 87
Authors:
H. E. S. S. Collaboration,
:,
F. Aharonian,
F. Ait Benkhali,
C. Arcaro,
J. Aschersleben,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
J. Borowska,
F. Bradascio,
M. Breuhaus,
R. Brose,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin,
T. Bylund
, et al. (139 additional authors not shown)
Abstract:
The origin of the gamma-ray emission from M87 is currently a matter of debate. This work aims to localize the VHE (100 GeV-100 TeV) gamma-ray emission from M87 and probe a potential extended hadronic emission component in the inner Virgo Cluster. The search for a steady and extended gamma-ray signal around M87 can constrain the cosmic-ray energy density and the pressure exerted by the cosmic rays…
▽ More
The origin of the gamma-ray emission from M87 is currently a matter of debate. This work aims to localize the VHE (100 GeV-100 TeV) gamma-ray emission from M87 and probe a potential extended hadronic emission component in the inner Virgo Cluster. The search for a steady and extended gamma-ray signal around M87 can constrain the cosmic-ray energy density and the pressure exerted by the cosmic rays onto the intra-cluster medium, and allow us to investigate the role of the cosmic rays in the active galactic nucleus feedback as a heating mechanism in the Virgo Cluster. H.E.S.S. telescopes are sensitive to VHE gamma rays and have been utilized to observe M87 since 2004. We utilized a Bayesian block analysis to identify M87 emission states with H.E.S.S. observations from 2004 until 2021, dividing them into low, intermediate, and high states. Because of the causality argument, an extended ($\gtrsim$kpc) signal is allowed only in steady emission states. Hence, we fitted the morphology of the 120h low state data and found no significant gamma-ray extension. Therefore, we derived for the low state an upper limit of 58"(corresponding to $\approx$4.6kpc) in the extension of a single-component morphological model described by a rotationally symmetric 2D Gaussian model at 99.7% confidence level. Our results exclude the radio lobes ($\approx$30 kpc) as the principal component of the VHE gamma-ray emission from the low state of M87. The gamma-ray emission is compatible with a single emission region at the radio core of M87. These results, with the help of two multiple-component models, constrain the maximum cosmic-ray to thermal pressure ratio $X_{CR,max.}$$\lesssim$$0.32$ and the total energy in cosmic-ray protons (CRp) to $U_{CR}$$\lesssim$5$\times10^{58}$ erg in the inner 20kpc of the Virgo Cluster for an assumed CRp power-law distribution in momentum with spectral index $α_{p}$=2.1.
△ Less
Submitted 16 May, 2023;
originally announced May 2023.
-
Survey on Unsupervised Domain Adaptation for Semantic Segmentation for Visual Perception in Automated Driving
Authors:
Manuel Schwonberg,
Joshua Niemeijer,
Jan-Aike Termöhlen,
Jörg P. Schäfer,
Nico M. Schmidt,
Hanno Gottschalk,
Tim Fingscheidt
Abstract:
Deep neural networks (DNNs) have proven their capabilities in many areas in the past years, such as robotics, or automated driving, enabling technological breakthroughs. DNNs play a significant role in environment perception for the challenging application of automated driving and are employed for tasks such as detection, semantic segmentation, and sensor fusion. Despite this progress and tremendo…
▽ More
Deep neural networks (DNNs) have proven their capabilities in many areas in the past years, such as robotics, or automated driving, enabling technological breakthroughs. DNNs play a significant role in environment perception for the challenging application of automated driving and are employed for tasks such as detection, semantic segmentation, and sensor fusion. Despite this progress and tremendous research efforts, several issues still need to be addressed that limit the applicability of DNNs in automated driving. The bad generalization of DNNs to new, unseen domains is a major problem on the way to a safe, large-scale application, because manual annotation of new domains is costly, particularly for semantic segmentation. For this reason, methods are required to adapt DNNs to new domains without labeling effort. The task, which these methods aim to solve is termed unsupervised domain adaptation (UDA). While several different domain shifts can challenge DNNs, the shift between synthetic and real data is of particular importance for automated driving, as it allows the use of simulation environments for DNN training. In this work, we present an overview of the current state of the art in this field of research. We categorize and explain the different approaches for UDA. The number of considered publications is larger than any other survey on this topic. The scope of this survey goes far beyond the description of the UDA state-of-the-art. Based on our large data and knowledge base, we present a quantitative comparison of the approaches and use the observations to point out the latest trends in this field. In the following, we conduct a critical analysis of the state-of-the-art and highlight promising future research directions. With this survey, we aim to facilitate UDA research further and encourage scientists to exploit novel research directions to generalize DNNs better.
△ Less
Submitted 24 April, 2023;
originally announced April 2023.
-
NeuroBench: A Framework for Benchmarking Neuromorphic Computing Algorithms and Systems
Authors:
Jason Yik,
Korneel Van den Berghe,
Douwe den Blanken,
Younes Bouhadjar,
Maxime Fabre,
Paul Hueber,
Denis Kleyko,
Noah Pacik-Nelson,
Pao-Sheng Vincent Sun,
Guangzhi Tang,
Shenqi Wang,
Biyan Zhou,
Soikat Hasan Ahmed,
George Vathakkattil Joseph,
Benedetto Leto,
Aurora Micheli,
Anurag Kumar Mishra,
Gregor Lenz,
Tao Sun,
Zergham Ahmed,
Mahmoud Akl,
Brian Anderson,
Andreas G. Andreou,
Chiara Bartolozzi,
Arindam Basu
, et al. (73 additional authors not shown)
Abstract:
Neuromorphic computing shows promise for advancing computing efficiency and capabilities of AI applications using brain-inspired principles. However, the neuromorphic research field currently lacks standardized benchmarks, making it difficult to accurately measure technological advancements, compare performance with conventional methods, and identify promising future research directions. Prior neu…
▽ More
Neuromorphic computing shows promise for advancing computing efficiency and capabilities of AI applications using brain-inspired principles. However, the neuromorphic research field currently lacks standardized benchmarks, making it difficult to accurately measure technological advancements, compare performance with conventional methods, and identify promising future research directions. Prior neuromorphic computing benchmark efforts have not seen widespread adoption due to a lack of inclusive, actionable, and iterative benchmark design and guidelines. To address these shortcomings, we present NeuroBench: a benchmark framework for neuromorphic computing algorithms and systems. NeuroBench is a collaboratively-designed effort from an open community of nearly 100 co-authors across over 50 institutions in industry and academia, aiming to provide a representative structure for standardizing the evaluation of neuromorphic approaches. The NeuroBench framework introduces a common set of tools and systematic methodology for inclusive benchmark measurement, delivering an objective reference framework for quantifying neuromorphic approaches in both hardware-independent (algorithm track) and hardware-dependent (system track) settings. In this article, we present initial performance baselines across various model architectures on the algorithm track and outline the system track benchmark tasks and guidelines. NeuroBench is intended to continually expand its benchmarks and features to foster and track the progress made by the research community.
△ Less
Submitted 17 January, 2024; v1 submitted 10 April, 2023;
originally announced April 2023.
-
Markov Decision Process Design: A Framework for Integrating Strategic and Operational Decisions
Authors:
Seth Brown,
Saumya Sinha,
Andrew J Schaefer
Abstract:
We consider the problem of optimally designing a system for repeated use under uncertainty. We develop a modeling framework that integrates design and operational phases, which are represented by a mixed-integer program and discounted-cost infinite-horizon Markov decision processes, respectively. We seek to simultaneously minimize the design costs and the subsequent expected operational costs. Thi…
▽ More
We consider the problem of optimally designing a system for repeated use under uncertainty. We develop a modeling framework that integrates design and operational phases, which are represented by a mixed-integer program and discounted-cost infinite-horizon Markov decision processes, respectively. We seek to simultaneously minimize the design costs and the subsequent expected operational costs. This problem setting arises naturally in several application areas, as we illustrate through examples. We derive a bilevel mixed-integer linear programming formulation for the problem and perform a computational study to demonstrate that realistic instances can be solved numerically.
△ Less
Submitted 21 March, 2024; v1 submitted 7 April, 2023;
originally announced April 2023.
-
The Challenges of Studying Misinformation on Video-Sharing Platforms During Crises and Mass-Convergence Events
Authors:
Sukrit Venkatagiri,
Joseph S. Schafer,
Stephen Prochaska
Abstract:
Mis- and disinformation can spread rapidly on video-sharing platforms (VSPs). Despite the growing use of VSPs, there has not been a proportional increase in our ability to understand this medium and the messages conveyed through it. In this work, we draw on our prior experiences to outline three core challenges faced in studying VSPs in high-stakes and fast-paced settings: (1) navigating the uniqu…
▽ More
Mis- and disinformation can spread rapidly on video-sharing platforms (VSPs). Despite the growing use of VSPs, there has not been a proportional increase in our ability to understand this medium and the messages conveyed through it. In this work, we draw on our prior experiences to outline three core challenges faced in studying VSPs in high-stakes and fast-paced settings: (1) navigating the unique affordances of VSPs, (2) understanding VSP content and determining its authenticity, and (3) novel user behaviors on VSPs for spreading misinformation. By highlighting these challenges, we hope that researchers can reflect on how to adapt existing research methods and tools to these new contexts, or develop entirely new ones.
△ Less
Submitted 24 March, 2023;
originally announced March 2023.
-
H.E.S.S. follow-up observations of GRB221009A
Authors:
H. E. S. S. Collaboration,
:,
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
A. Baktash,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
M. Bouyahiaoui,
F. Bradascio,
M. Breuhaus,
R. Brose,
F. Brun,
B. Bruno
, et al. (138 additional authors not shown)
Abstract:
GRB221009A is the brightest gamma-ray burst ever detected. To probe the very-high-energy (VHE, $>$\!100 GeV) emission, the High Energy Stereoscopic System (H.E.S.S.) began observations 53 hours after the triggering event, when the brightness of the moonlight no longer precluded observations. We derive differential and integral upper limits using H.E.S.S. data from the third, fourth, and ninth nigh…
▽ More
GRB221009A is the brightest gamma-ray burst ever detected. To probe the very-high-energy (VHE, $>$\!100 GeV) emission, the High Energy Stereoscopic System (H.E.S.S.) began observations 53 hours after the triggering event, when the brightness of the moonlight no longer precluded observations. We derive differential and integral upper limits using H.E.S.S. data from the third, fourth, and ninth nights after the initial GRB detection, after applying atmospheric corrections. The combined observations yield an integral energy flux upper limit of $Φ_\mathrm{UL}^{95\%} = 9.7 \times 10^{-12}~\mathrm{erg\,cm^{-2}\,s^{-1}}$ above $E_\mathrm{thr} = 650$ GeV. The constraints derived from the H.E.S.S. observations complement the available multiwavelength data. The radio to X-ray data are consistent with synchrotron emission from a single electron population, with the peak in the SED occurring above the X-ray band. Compared to the VHE-bright GRB190829A, the upper limits for GRB221009A imply a smaller gamma-ray to X-ray flux ratio in the afterglow. Even in the absence of a detection, the H.E.S.S. upper limits thus contribute to the multiwavelength picture of GRB221009A, effectively ruling out an IC dominated scenario.
△ Less
Submitted 18 March, 2023;
originally announced March 2023.
-
Followback Clusters, Satellite Audiences, and Bridge Nodes: Coengagement Networks for the 2020 US Election
Authors:
Andrew Beers,
Joseph S. Schafer,
Ian Kennedy,
Morgan Wack,
Emma S. Spiro,
Kate Starbird
Abstract:
The 2020 United States presidential election was, and has continued to be, the focus of pervasive and persistent mis- and disinformation spreading through our media ecosystems, including social media. This event has driven the collection and analysis of large, directed social network datasets, but such datasets can resist intuitive understanding. In such large datasets, the overwhelming number of…
▽ More
The 2020 United States presidential election was, and has continued to be, the focus of pervasive and persistent mis- and disinformation spreading through our media ecosystems, including social media. This event has driven the collection and analysis of large, directed social network datasets, but such datasets can resist intuitive understanding. In such large datasets, the overwhelming number of nodes and edges present in typical representations create visual artifacts, such as densely overlap** edges and tightly-packed formations of low-degree nodes, which obscure many features of more practical interest. We apply a method, coengagement transformations, to convert such networks of social data into tractable images. Intuitively, this approach allows for parameterized network visualizations that make shared audiences of engaged viewers salient to viewers. Using the interpretative capabilities of this method, we perform an extensive case study of the 2020 United States presidential election on Twitter, contributing an empirical analysis of coengagement. By creating and contrasting different networks at different parameter sets, we define and characterize several structures in this discourse network, including bridging accounts, satellite audiences, and followback communities. We discuss the importance and implications of these empirical network features in this context. In addition, we release open-source code for creating coengagement networks from Twitter and other structured interaction data.
△ Less
Submitted 30 May, 2023; v1 submitted 28 February, 2023;
originally announced March 2023.
-
Validating Monte Carlo simulations for an analysis chain in H.E.S.S
Authors:
Fabian Leuschner,
Johannes Schäfer,
Simon Steinmassl,
Tim Lukas Holch,
Konrad Bernlöhr,
Stefan Funk,
Jim Hinton,
Stefan Ohm,
Gerd Pühlhofer
Abstract:
Imaging Air Cherenkov Telescopes (IACTs) detect very high energetic (VHE) gamma rays. They observe the Cherenkov light emitted in electromagnetic shower cascades that gamma rays induce in the atmosphere. A precise reconstruction of the primary photon energy and the source flux depends heavily on accurate Monte Carlo (MC) simulations of the shower propagation and the detector response, and therefor…
▽ More
Imaging Air Cherenkov Telescopes (IACTs) detect very high energetic (VHE) gamma rays. They observe the Cherenkov light emitted in electromagnetic shower cascades that gamma rays induce in the atmosphere. A precise reconstruction of the primary photon energy and the source flux depends heavily on accurate Monte Carlo (MC) simulations of the shower propagation and the detector response, and therefore also on adequate assumptions about the atmosphere at the site and time of a measurement. Here, we present the results of an extensive validation of the MC simulations for an analysis chain of the H.E.S.S. experiment with special focus on the recently installed FlashCam camera on the large 28 m telescope. One goal of this work was to create a flexible and easy-to-use framework to facilitate the detailed validation of MC simulations also for past and future phases of the H.E.S.S. experiment. Guided by the underlying physics, the detector simulation and the atmospheric transmission profiles were gradually improved until low level parameters such as cosmic ray (CR) trigger rates matched within a few percent between simulations and observational data. This led to instrument response functions (IRFs) with which the analysis of current H.E.S.S. data can ultimately be carried out within percent accuracy, substantially improving earlier simulations.
△ Less
Submitted 1 March, 2023;
originally announced March 2023.
-
The Hardware Impact of Quantization and Pruning for Weights in Spiking Neural Networks
Authors:
Clemens JS Schaefer,
Pooria Taheri,
Mark Horeni,
Siddharth Joshi
Abstract:
Energy efficient implementations and deployments of Spiking neural networks (SNNs) have been of great interest due to the possibility of develo** artificial systems that can achieve the computational powers and energy efficiency of the biological brain. Efficient implementations of SNNs on modern digital hardware are also inspired by advances in machine learning and deep neural networks (DNNs).…
▽ More
Energy efficient implementations and deployments of Spiking neural networks (SNNs) have been of great interest due to the possibility of develo** artificial systems that can achieve the computational powers and energy efficiency of the biological brain. Efficient implementations of SNNs on modern digital hardware are also inspired by advances in machine learning and deep neural networks (DNNs). Two techniques widely employed in the efficient deployment of DNNs -- the quantization and pruning of parameters, can both compress the model size, reduce memory footprints, and facilitate low-latency execution. The interaction between quantization and pruning and how they might impact model performance on SNN accelerators is currently unknown. We study various combinations of pruning and quantization in isolation, cumulatively, and simultaneously (jointly) to a state-of-the-art SNN targeting gesture recognition for dynamic vision sensor cameras (DVS). We show that this state-of-the-art model is amenable to aggressive parameter quantization, not suffering from any loss in accuracy down to ternary weights. However, pruning only maintains iso-accuracy up to 80% sparsity, which results in 45% more energy than the best quantization on our architectural model. Applying both pruning and quantization can result in an accuracy loss to offer a favourable trade-off on the energy-accuracy Pareto-frontier for the given hardware configuration.
△ Less
Submitted 8 February, 2023;
originally announced February 2023.
-
Best Practices in Active Learning for Semantic Segmentation
Authors:
Sudhanshu Mittal,
Joshua Niemeijer,
Jörg P. Schäfer,
Thomas Brox
Abstract:
Active learning is particularly of interest for semantic segmentation, where annotations are costly. Previous academic studies focused on datasets that are already very diverse and where the model is trained in a supervised manner with a large annotation budget. In contrast, data collected in many driving scenarios is highly redundant, and most medical applications are subject to very constrained…
▽ More
Active learning is particularly of interest for semantic segmentation, where annotations are costly. Previous academic studies focused on datasets that are already very diverse and where the model is trained in a supervised manner with a large annotation budget. In contrast, data collected in many driving scenarios is highly redundant, and most medical applications are subject to very constrained annotation budgets. This work investigates the various types of existing active learning methods for semantic segmentation under diverse conditions across three dimensions - data distribution w.r.t. different redundancy levels, integration of semi-supervised learning, and different labeling budgets. We find that these three underlying factors are decisive for the selection of the best active learning approach. As an outcome of our study, we provide a comprehensive usage guide to obtain the best performance for each case. We also propose an exemplary evaluation task for driving scenarios, where data has high redundancy, to showcase the practical implications of our research findings.
△ Less
Submitted 15 March, 2023; v1 submitted 8 February, 2023;
originally announced February 2023.
-
Mixed Precision Post Training Quantization of Neural Networks with Sensitivity Guided Search
Authors:
Clemens JS Schaefer,
Elfie Guo,
Caitlin Stanton,
Xiaofan Zhang,
Tom Jablin,
Navid Lambert-Shirzad,
Jian Li,
Chiachen Chou,
Siddharth Joshi,
Yu Emma Wang
Abstract:
Serving large-scale machine learning (ML) models efficiently and with low latency has become challenging owing to increasing model size and complexity. Quantizing models can simultaneously reduce memory and compute requirements, facilitating their widespread access. However, for large models not all layers are equally amenable to the same numerical precision and aggressive quantization can lead to…
▽ More
Serving large-scale machine learning (ML) models efficiently and with low latency has become challenging owing to increasing model size and complexity. Quantizing models can simultaneously reduce memory and compute requirements, facilitating their widespread access. However, for large models not all layers are equally amenable to the same numerical precision and aggressive quantization can lead to unacceptable loss in model accuracy. One approach to prevent this accuracy degradation is mixed-precision quantization, which allows different tensors to be quantized to varying levels of numerical precision, leveraging the capabilities of modern hardware. Such mixed-precision quantiztaion can more effectively allocate numerical precision to different tensors `as needed' to preserve model accuracy while reducing footprint and compute latency. In this paper, we propose a method to efficiently determine quantization configurations of different tensors in ML models using post-training mixed precision quantization. We analyze three sensitivity metrics and evaluate them for guiding configuration search of two algorithms. We evaluate our method for computer vision and natural language processing and demonstrate latency reductions of up to 27.59% and 34.31% compared to the baseline 16-bit floating point model while guaranteeing no more than 1% accuracy degradation.
△ Less
Submitted 6 February, 2023; v1 submitted 2 February, 2023;
originally announced February 2023.
-
Hamiltonian formulation and symplectic split-operator schemes for time-dependent density-functional-theory equations of electron dynamics in molecules
Authors:
Francois Mauger,
Cristel Chandre,
Mette B. Gaarde,
Kenneth Lopata,
Kenneth J. Schafer
Abstract:
We revisit Kohn-Sham time-dependent density-functional theory (TDDFT) equations and show that they derive from a canonical Hamiltonian formalism. We use this geometric description of the TDDFT dynamics to define families of symplectic split-operator schemes that accurately and efficiently simulate the time propagation for certain classes of DFT functionals. We illustrate these with numerical simul…
▽ More
We revisit Kohn-Sham time-dependent density-functional theory (TDDFT) equations and show that they derive from a canonical Hamiltonian formalism. We use this geometric description of the TDDFT dynamics to define families of symplectic split-operator schemes that accurately and efficiently simulate the time propagation for certain classes of DFT functionals. We illustrate these with numerical simulations of the far-from-equilibrium electronic dynamics of a one-dimensional carbon chain. In these examples, we find that an optimized 4th order scheme provides a good compromise between the numerical complexity of each time step and the accuracy of the scheme. We also discuss how the Hamiltonian structure changes when using a basis set to discretize TDDFT and the challenges this raises for using symplectic split-operator propagation schemes.
△ Less
Submitted 7 September, 2023; v1 submitted 1 February, 2023;
originally announced February 2023.
-
Training one model to detect heart and lung sound events from single point auscultations
Authors:
Leander Melms,
Robert R. Ilesan,
Ulrich Köhler,
Olaf Hildebrandt,
Regina Conradt,
Jens Eckstein,
Cihan Atila,
Sami Matrood,
Bernhard Schieffer,
Jürgen R. Schaefer,
Tobias Müller,
Julius Obergassel,
Nadine Schlicker,
Martin C. Hirsch
Abstract:
Objective: This work proposes a semi-supervised training approach for detecting lung and heart sounds simultaneously with only one trained model and in invariance to the auscultation point. Methods: We use open-access data from the 2016 Physionet/CinC Challenge, the 2022 George Moody Challenge, and from the lung sound database HF_V1. We first train specialist single-task models using foreground gr…
▽ More
Objective: This work proposes a semi-supervised training approach for detecting lung and heart sounds simultaneously with only one trained model and in invariance to the auscultation point. Methods: We use open-access data from the 2016 Physionet/CinC Challenge, the 2022 George Moody Challenge, and from the lung sound database HF_V1. We first train specialist single-task models using foreground ground truth (GT) labels from different auscultation databases to identify background sound events in the respective lung and heart auscultation databases. The pseudo-labels generated in this way were combined with the ground truth labels in a new training iteration, such that a new model was subsequently trained to detect foreground and background signals. Benchmark tests ensured that the newly trained model could detect both, lung, and heart sound events in different auscultation sites without regressing on the original task. We also established hand-validated labels for the respective background signal in heart and lung sound auscultations to evaluate the models. Results: In this work, we report for the first time results for i) a multi-class prediction for lung sound events and ii) for simultaneous detection of heart and lung sound events and achieve competitive results using only one model. The combined multi-task model regressed slightly in heart sound detection and gained significantly in lung sound detection accuracy with an overall macro F1 score of 39.2% over six classes, representing a 6.7% improvement over the single-task baseline models. Conclusion/Significance: To the best of our knowledge, this is the first approach developed to date for measuring heart and lung sound events invariant to both, the auscultation site and capturing device. Hence, our model is capable of performing lung and heart sound detection from any auscultation location.
△ Less
Submitted 15 January, 2023;
originally announced January 2023.
-
Gamma-ray observations of MAXI J1820+070 during the 2018 outburst
Authors:
H. Abe,
S. Abe,
V. A. Acciari,
T. Aniello,
S. Ansoldi,
L. A. Antonelli,
A. Arbet Engels,
C. Arcaro,
M. Artero,
K. Asano,
D. Baack,
A. Babić,
A. Baquero,
U. Barres de Almeida,
J. A. Barrio,
I. Batković,
J. Baxter,
J. Becerra González,
W. Bednarek,
E. Bernardini,
M. Bernardos,
A. Berti,
J. Besenrieder,
W. Bhattacharyya,
C. Bigongiari
, et al. (418 additional authors not shown)
Abstract:
MAXI J1820+070 is a low-mass X-ray binary with a black hole as a compact object. This binary underwent an exceptionally bright X-ray outburst from March to October 2018, showing evidence of a non-thermal particle population through its radio emission during this whole period. The combined results of 59.5 hours of observations of the MAXI J1820+070 outburst with the H.E.S.S., MAGIC and VERITAS expe…
▽ More
MAXI J1820+070 is a low-mass X-ray binary with a black hole as a compact object. This binary underwent an exceptionally bright X-ray outburst from March to October 2018, showing evidence of a non-thermal particle population through its radio emission during this whole period. The combined results of 59.5 hours of observations of the MAXI J1820+070 outburst with the H.E.S.S., MAGIC and VERITAS experiments at energies above 200 GeV are presented, together with Fermi-LAT data between 0.1 and 500 GeV, and multiwavelength observations from radio to X-rays. Gamma-ray emission is not detected from MAXI J1820+070, but the obtained upper limits and the multiwavelength data allow us to put meaningful constraints on the source properties under reasonable assumptions regarding the non-thermal particle population and the jet synchrotron spectrum. In particular, it is possible to show that, if a high-energy gamma-ray emitting region is present during the hard state of the source, its predicted flux should be at most a factor of 20 below the obtained Fermi-LAT upper limits, and closer to them for magnetic fields significantly below equipartition. During the state transitions, under the plausible assumption that electrons are accelerated up to ~ 500 GeV, the multiwavelength data and the gamma-ray upper limits lead consistently to the conclusion that a potential high-energy and very-high-energy gamma-ray emitting region should be located at a distance from the black hole ranging between 10^11 and 10^13 cm. Similar outbursts from low-mass X-ray binaries might be detectable in the near future with upcoming instruments such as CTA.
△ Less
Submitted 6 October, 2022; v1 submitted 20 September, 2022;
originally announced September 2022.
-
All-Electron, Density Functional-Based Method for Angle-Resolved Tunneling Ionization in the Adiabatic Regime
Authors:
Imam S. Wahyutama,
Denawakage D. Jayasinghe,
François Mauger,
Kenneth Lopata,
Mette B. Gaarde,
Kenneth J. Schafer
Abstract:
We develop and test a method that integrates many-electron weak-field asymptotic theory (ME-WFAT) [Phys. Rev. A 89, 013421 (2014)] in the integral representation (IR) into the density functional theory (DFT) framework. In particular, we present modifications of the integral formula in the IR ME-WFAT to incorporate the potential terms unique to DFT. By solving an adiabatic rate equation for the ang…
▽ More
We develop and test a method that integrates many-electron weak-field asymptotic theory (ME-WFAT) [Phys. Rev. A 89, 013421 (2014)] in the integral representation (IR) into the density functional theory (DFT) framework. In particular, we present modifications of the integral formula in the IR ME-WFAT to incorporate the potential terms unique to DFT. By solving an adiabatic rate equation for the angle-resolved ionization yield in our DFT-based ME-WFAT method, we show that the results are in excellent agreement with those of real-time time-dependent density functional theory (RT-TDDFT) simulations for NO, OCS, CH$_3$Br, and CH$_3$Cl interacting with one- and two- color laser fields with a fundamental wavelength of $800$ nm. This agreement is significant because the WFAT calculations take only a small fraction of the time of full TDDFT calculations. These results suggest that in the wavelength region commonly used in strong-field experiments ($800$ nm and longer), our DFT-based WFAT treatment can be used to rapidly screen for the ionization properties of a large number of molecules as a function of alignment or orientation between the molecule and the strong field.
△ Less
Submitted 7 October, 2022; v1 submitted 13 August, 2022;
originally announced August 2022.
-
Attochemistry Regulation of Charge Migration
Authors:
Aderonke S. Folorunso,
François Mauger,
Kyle A. Hamer,
Denawakage D Jayasinghe,
Imam Wahyutama,
Justin R. Ragains,
Robert R. Jones,
Louis F. DiMauro,
Mette B. Gaarde,
Kenneth J. Schafer,
Kenneth Lopata
Abstract:
Charge migration (CM) is a coherent attosecond process that involves the movement of localized holes across a molecule. To determine the relationship between a molecule's structure and the CM dynamics it exhibits, we perform systematic studies of para-functionalized bromobenzene molecules (X-C$_6$H$_4$-R) using real-time time-dependent density functional theory. We initiate valence-electron dynami…
▽ More
Charge migration (CM) is a coherent attosecond process that involves the movement of localized holes across a molecule. To determine the relationship between a molecule's structure and the CM dynamics it exhibits, we perform systematic studies of para-functionalized bromobenzene molecules (X-C$_6$H$_4$-R) using real-time time-dependent density functional theory. We initiate valence-electron dynamics by emulating rapid strong-field ionization leading to a localized hole on the bromine atom. The resulting CM, which takes on the order of 1 fs, occurs via an X localized to C$_6$H$_4$ delocalized to R localized mechanism. Interestingly, the hole contrast on the acceptor functional group increases with increasing electron donating strength. This trend is well-described by the Hammett sigma value of the group, which is a commonly used metric for quantifying the effect of functionalization on the chemical reactivity of benzene derivatives. These results suggest that simple attochemistry principles and a density-based picture can be used to predict and understand CM.
△ Less
Submitted 12 July, 2022;
originally announced July 2022.
-
Edge Inference with Fully Differentiable Quantized Mixed Precision Neural Networks
Authors:
Clemens JS Schaefer,
Siddharth Joshi,
Shan Li,
Raul Blazquez
Abstract:
The large computing and memory cost of deep neural networks (DNNs) often precludes their use in resource-constrained devices. Quantizing the parameters and operations to lower bit-precision offers substantial memory and energy savings for neural network inference, facilitating the use of DNNs on edge computing platforms. Recent efforts at quantizing DNNs have employed a range of techniques encompa…
▽ More
The large computing and memory cost of deep neural networks (DNNs) often precludes their use in resource-constrained devices. Quantizing the parameters and operations to lower bit-precision offers substantial memory and energy savings for neural network inference, facilitating the use of DNNs on edge computing platforms. Recent efforts at quantizing DNNs have employed a range of techniques encompassing progressive quantization, step-size adaptation, and gradient scaling. This paper proposes a new quantization approach for mixed precision convolutional neural networks (CNNs) targeting edge-computing. Our method establishes a new pareto frontier in model accuracy and memory footprint demonstrating a range of quantized models, delivering best-in-class accuracy below 4.3 MB of weights (wgts.) and activations (acts.). Our main contributions are: (i) hardware-aware heterogeneous differentiable quantization with tensor-sliced learned precision, (ii) targeted gradient modification for wgts. and acts. to mitigate quantization errors, and (iii) a multi-phase learning schedule to address instability in learning arising from updates to the learned quantizer and model parameters. We demonstrate the effectiveness of our techniques on the ImageNet dataset across a range of models including EfficientNet-Lite0 (e.g., 4.14MB of wgts. and acts. at 67.66% accuracy) and MobileNetV2 (e.g., 3.51MB wgts. and acts. at 65.39% accuracy).
△ Less
Submitted 29 August, 2023; v1 submitted 15 June, 2022;
originally announced June 2022.
-
Temperature dependent ARPES of the metallic-like bands in Si(553)-Au
Authors:
Lenart Dudy,
Julian Aulbach,
Jörg Schäfer,
Ralph Claessen,
Victor Rogalev,
Piotr Chudzinski
Abstract:
We conducted a thorough investigation into the temperature dependence of the metallic-like bands of Si(553)-Au using angular-resolved photoemission spectroscopy (ARPES). Our study addresses the challenges posed by the short-term stability of the surface and photo-voltage effects, which we overcame to extract changes in the band-filling and Fermi-velocity. Our findings shed light on the low-tempera…
▽ More
We conducted a thorough investigation into the temperature dependence of the metallic-like bands of Si(553)-Au using angular-resolved photoemission spectroscopy (ARPES). Our study addresses the challenges posed by the short-term stability of the surface and photo-voltage effects, which we overcame to extract changes in the band-filling and Fermi-velocity. Our findings shed light on the low-temperature phase of the step edge in Si(553)-Au, which has been a topic of ongoing debate regarding its structural or electronic nature. Through comparison with theoretical predictions of a structural-related low-temperature to high-temperature phase transition, we discovered that the band-filling and Fermi-velocity do not change accordingly, thereby ruling out this scenario. Our study contributes to a better understanding of this material system and provides an important reference for future research.
△ Less
Submitted 14 June, 2023; v1 submitted 25 March, 2022;
originally announced March 2022.
-
Time-resolved hadronic particle acceleration in the recurrent Nova RS Ophiuchi
Authors:
H. E. S. S. Collaboration,
F. Aharonian,
F. Ait Benkhali,
E. O. Angüner,
H. Ashkar,
M. Backes,
V. Baghmanyan,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
M. Breuhaus,
R. Brose,
F. Brun,
S. Caroff,
S. Casanova,
M. Cerruti,
T. Chand,
A. Chen
, et al. (150 additional authors not shown)
Abstract:
Recurrent Novae are repeating thermonuclear explosions in the outer layers of white dwarfs, due to the accretion of fresh material from a binary companion. The shock generated by ejected material slamming into the companion star's wind, accelerates particles to very-high-energies. We report very-high-energy (VHE, $\gtrsim100$\,GeV) gamma rays from the recurrent nova RS\,Ophiuchi up to a month afte…
▽ More
Recurrent Novae are repeating thermonuclear explosions in the outer layers of white dwarfs, due to the accretion of fresh material from a binary companion. The shock generated by ejected material slamming into the companion star's wind, accelerates particles to very-high-energies. We report very-high-energy (VHE, $\gtrsim100$\,GeV) gamma rays from the recurrent nova RS\,Ophiuchi up to a month after its 2021 outburst, using the High Energy Stereoscopic System. The VHE emission has a similar temporal profile to lower-energy GeV emission, indicating a common origin, with a two-day delay in peak flux. These observations constrain models of time-dependent particle energization, favouring a hadronic emission scenario over the leptonic alternative. This confirms that shocks in dense winds provide favourable environments for efficient cosmic-ray acceleration to very-high-energies.
△ Less
Submitted 28 March, 2022; v1 submitted 16 February, 2022;
originally announced February 2022.
-
Characterizing Particle-Like Charge Migration Dynamics with High-Harmonic Sideband Spectroscopy
Authors:
Kyle A. Hamer,
Francois Mauger,
Aderonke S. Folorunso,
Kenneth Lopata,
Robert R. Jones,
Louis F. DiMauro,
Kenneth J. Schafer,
Mette B. Gaarde
Abstract:
We introduce high-harmonic sideband spectroscopy (HHSS) and show that it can be a robust probe of attosecond charge migration (CM) in a halogenated carbon-chain molecule. We simulate both the CM and harmonic-generation (HHG) dynamics using ab initio time-dependent density-functional theory. We find that CM dynamics initiated along the molecular backbone induces sidebands in the HHG spectrum driven…
▽ More
We introduce high-harmonic sideband spectroscopy (HHSS) and show that it can be a robust probe of attosecond charge migration (CM) in a halogenated carbon-chain molecule. We simulate both the CM and harmonic-generation (HHG) dynamics using ab initio time-dependent density-functional theory. We find that CM dynamics initiated along the molecular backbone induces sidebands in the HHG spectrum driven by a delayed laser pulse that is polarized perpendicular to the molecular axis. Monitoring the spectrum as either the HHG laser frequency or the relative delay is scanned allows for the extraction of detailed information about the time-domain characteristics of the CM process.
△ Less
Submitted 12 May, 2022; v1 submitted 1 February, 2022;
originally announced February 2022.
-
Drift vs Shift: Decoupling Trends and Changepoint Analysis
Authors:
Haoxuan Wu,
Toryn L. J. Schafer,
Sean Ryan,
David S. Matteson
Abstract:
We introduce a new approach for decoupling trends (drift) and changepoints (shifts) in time series. Our locally adaptive model-based approach for robustly decoupling combines Bayesian trend filtering and machine learning based regularization. An over-parameterized Bayesian dynamic linear model (DLM) is first applied to characterize drift. Then a weighted penalized likelihood estimator is paired wi…
▽ More
We introduce a new approach for decoupling trends (drift) and changepoints (shifts) in time series. Our locally adaptive model-based approach for robustly decoupling combines Bayesian trend filtering and machine learning based regularization. An over-parameterized Bayesian dynamic linear model (DLM) is first applied to characterize drift. Then a weighted penalized likelihood estimator is paired with the estimated DLM posterior distribution to identify shifts. We show how Bayesian DLMs specified with so-called shrinkage priors can provide smooth estimates of underlying trends in the presence of complex noise components. However, their inability to shrink exactly to zero inhibits direct changepoint detection. In contrast, penalized likelihood methods are highly effective in locating changepoints. However, they require data with simple patterns in both signal and noise. The proposed decoupling approach combines the strengths of both, i.e. the flexibility of Bayesian DLMs with the hard thresholding property of penalized likelihood estimators, to provide changepoint analysis in complex, modern settings. The proposed framework is outlier robust and can identify a variety of changes, including in mean and slope. It is also easily extended for analysis of parameter shifts in time-varying parameter models like dynamic regressions. We illustrate the flexibility and contrast the performance and robustness of our approach with several alternative methods across a wide range of simulations and application examples.
△ Less
Submitted 6 January, 2024; v1 submitted 17 January, 2022;
originally announced January 2022.
-
Evidence for gamma-ray emission from the remnant of Kepler's supernova based on deep H.E.S.S. observations
Authors:
H. E. S. S. Collaboration,
F. Aharonian,
F. Ait Benkhali,
E. O. Anguner,
H. Ashkar,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernloehr,
M. Boettcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
M. Breuhaus,
R. Brose,
F. Brun,
T. Bulik,
T. Bylund,
F. Cangemi,
S. Caroff,
S. Casanova,
M. Cerruti,
T. Chand
, et al. (136 additional authors not shown)
Abstract:
Observations with imaging atmospheric Cherenkov telescopes (IACTs) have enhanced our knowledge of nearby supernova (SN) remnants with ages younger than 500 years by establishing Cassiopeia A and the remnant of Tycho's SN as very-high-energy (VHE) gamma-ray sources. The remnant of Kepler's SN, which is the product of the most recent naked-eye supernova in our Galaxy, is comparable in age to the oth…
▽ More
Observations with imaging atmospheric Cherenkov telescopes (IACTs) have enhanced our knowledge of nearby supernova (SN) remnants with ages younger than 500 years by establishing Cassiopeia A and the remnant of Tycho's SN as very-high-energy (VHE) gamma-ray sources. The remnant of Kepler's SN, which is the product of the most recent naked-eye supernova in our Galaxy, is comparable in age to the other two, but is significantly more distant. If the gamma-ray luminosities of the remnants of Tycho's and Kepler's SNe are similar, then the latter is expected to be one of the faintest gamma-ray sources within reach of the current generation IACT arrays.
Here we report evidence at a statistical level of 4.6 sigma for a VHE signal from the remnant of Kepler's SN based on deep observations by the High Energy Stereoscopic System (H.E.S.S.) with an exposure of 152 hours. The measured integral flux above an energy of 226 GeV is ~0.3% of the flux of the Crab Nebula. The spectral energy distribution (SED) reveals a gamma-ray emitting component connecting the VHE emission observed with H.E.S.S. to the emission observed at GeV energies with Fermi-LAT. The overall SED is similar to that of the remnant of Tycho's SN, possibly indicating the same non-thermal emission processes acting in both these young remnants of thermonuclear SNe.
△ Less
Submitted 23 March, 2024; v1 submitted 15 January, 2022;
originally announced January 2022.
-
Analysis of animal-related electric outages using species distribution models and community science data
Authors:
Mei-Ling E. Feng,
Olukunle O. Owolabi,
Toryn L. J. Schafer,
Sanhita Sengupta,
Lan Wang,
David S. Matteson,
Judy P. Che-Castaldo,
Deborah A. Sunter
Abstract:
Animal-related outages (AROs) are a prevalent form of outages in electrical distribution systems. Animal-infrastructure interactions vary across focal species and regions, underlining the need to study the animal-outage relationship in more species and diverse systems. Animal activity has been used as an indicator of reliability in the electrical grid system and to describe temporal patterns in AR…
▽ More
Animal-related outages (AROs) are a prevalent form of outages in electrical distribution systems. Animal-infrastructure interactions vary across focal species and regions, underlining the need to study the animal-outage relationship in more species and diverse systems. Animal activity has been used as an indicator of reliability in the electrical grid system and to describe temporal patterns in AROs. However, these ARO models have been limited by a lack of available estimates of species activity, instead approximating activity based on seasonal and weather patterns in animal-related outage records and characteristics of broad taxonomic groups, e.g., squirrels. We highlight publicly available resources to fill the ecological data gap that is limiting joint analyses between ecology and energy sectors. Species distribution models (SDMs), a common technique to model the distribution of a species across geographic space and time, paired with data sourced from eBird, a community science database for bird observations, provided us with species-specific estimates of activity to model spatio-temporal patterns of AROs. These flexible, species-specific estimates can allow future animal-indicators of grid reliability to be investigated in more diverse regions and ecological communities, providing a better understanding of the variation that exists in animal-outage relationship. AROs were best modeled by accounting for multiple outage-prone species activity patterns and their unique relationships with seasonality and habitat availability. Different species were important for modeling outages in different landscapes and seasons depending on their distribution and migration behavior. We recommend that future models of AROs include species-specific activity data that account for the diverse spectrum of spatio-temporal activity patterns that outage-prone animals exhibit.
△ Less
Submitted 22 December, 2021;
originally announced December 2021.
-
Role of Variable Renewable Energy Penetration on Electricity Price and its Volatility Across Independent System Operators in the United States
Authors:
Olukunle O. Owolabi,
Toryn L. J. Schafer,
Georgia E. Smits,
Sanhita Sengupta,
Sean E. Ryan,
Lan Wang,
David S. Matteson,
Mila Getmansky Sherman,
Deborah A. Sunter
Abstract:
The U.S. electrical grid has undergone substantial transformation with increased penetration of wind and solar -- forms of variable renewable energy (VRE). Despite the benefits of VRE for decarbonization, it has garnered some controversy for inducing unwanted effects in regional electricity markets. In this study, the role of VRE penetration is examined on the system electricity price and price vo…
▽ More
The U.S. electrical grid has undergone substantial transformation with increased penetration of wind and solar -- forms of variable renewable energy (VRE). Despite the benefits of VRE for decarbonization, it has garnered some controversy for inducing unwanted effects in regional electricity markets. In this study, the role of VRE penetration is examined on the system electricity price and price volatility based on hourly, real-time, historical data from six Independent System Operators (ISOs) in the U.S. using quantile and skew t-distribution regressions. After correcting for temporal effects, we found an increase in VRE penetration is associated with decrease in system electricity price in all ISOs studied. The increase in VRE penetration is associated with decrease in temporal price volatility in five out of six ISOs studied. The relationships are non-linear. These results are consistent with the modern portfolio theory where diverse volatile assets may lead to more stable and less risky portfolios.
△ Less
Submitted 28 November, 2022; v1 submitted 10 November, 2021;
originally announced December 2021.
-
Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages
Authors:
Thomas Mandl,
Sandip Modha,
Gautam Kishore Shahi,
Hiren Madhu,
Shrey Satapara,
Prasenjit Majumder,
Johannes Schaefer,
Tharindu Ranasinghe,
Marcos Zampieri,
Durgesh Nandini,
Amit Kumar Jaiswal
Abstract:
The widespread of offensive content online such as hate speech poses a growing societal problem. AI tools are necessary for supporting the moderation process at online platforms. For the evaluation of these identification tools, continuous experimentation with data sets in different languages are necessary. The HASOC track (Hate Speech and Offensive Content Identification) is dedicated to develop…
▽ More
The widespread of offensive content online such as hate speech poses a growing societal problem. AI tools are necessary for supporting the moderation process at online platforms. For the evaluation of these identification tools, continuous experimentation with data sets in different languages are necessary. The HASOC track (Hate Speech and Offensive Content Identification) is dedicated to develop benchmark data for this purpose. This paper presents the HASOC subtrack for English, Hindi, and Marathi. The data set was assembled from Twitter. This subtrack has two sub-tasks. Task A is a binary classification problem (Hate and Not Offensive) offered for all three languages. Task B is a fine-grained classification problem for three classes (HATE) Hate speech, OFFENSIVE and PROFANITY offered for English and Hindi. Overall, 652 runs were submitted by 65 teams. The performance of the best classification algorithms for task A are F1 measures 0.91, 0.78 and 0.83 for Marathi, Hindi and English, respectively. This overview presents the tasks and the data development as well as the detailed results. The systems submitted to the competition applied a variety of technologies. The best performing algorithms were mainly variants of transformer architectures.
△ Less
Submitted 16 December, 2021;
originally announced December 2021.
-
Lifting topological protection in a quantum spin Hall insulator by edge coupling
Authors:
Raul Stühler,
André Kowalewski,
Felix Reis,
Dimitri Jungblut,
Fernando Dominguez,
Benedikt Scharf,
Gang Li,
Jörg Schäfer,
Ewelina M. Hankiewicz,
Ralph Claessen
Abstract:
The scientific interest in two-dimensional topological insulators (2D TIs) is currently shifting from a more fundamental perspective to the exploration and design of novel functionalities. Key concepts for the use of 2D TIs in spintronics are based on the topological protection and spin-momentum locking of their helical edge states. In this study we present experimental evidence that topological p…
▽ More
The scientific interest in two-dimensional topological insulators (2D TIs) is currently shifting from a more fundamental perspective to the exploration and design of novel functionalities. Key concepts for the use of 2D TIs in spintronics are based on the topological protection and spin-momentum locking of their helical edge states. In this study we present experimental evidence that topological protection can be (partially) lifted by pairwise coupling of 2D TI edges in close proximity. Using direct wave function map** via scanning tunneling microscopy/spectroscopy (STM/STS) we compare isolated and coupled topological edges in the 2D TI bismuthene. The latter situation is realized by natural lattice line defects and reveals distinct quasi-particle interference (QPI) patterns, identified as electronic Fabry-Pérot resonator modes. In contrast, free edges show no sign of any single-particle backscattering. These results pave the way for novel device concepts based on active control of topological protection through inter-edge hybridization for, e.g., electronic Fabry-Pérot interferometry.
△ Less
Submitted 8 November, 2021;
originally announced November 2021.
-
Combination Chemotherapy Optimization with Discrete Dosing
Authors:
Temitayo Ajayi,
Seyedmohammadhossein Hosseinian,
Andrew J. Schaefer,
Clifton D. Fuller
Abstract:
Chemotherapy is one of the primary modalities of cancer treatment. Chemotherapy drug administration is a complex problem that often requires expensive clinical trials to evaluate potential regimens. One way to alleviate this burden and better inform future trials is to build reliable models for drug administration. Previous chemotherapy optimization models have mainly relied on optimal control, wh…
▽ More
Chemotherapy is one of the primary modalities of cancer treatment. Chemotherapy drug administration is a complex problem that often requires expensive clinical trials to evaluate potential regimens. One way to alleviate this burden and better inform future trials is to build reliable models for drug administration. Previous chemotherapy optimization models have mainly relied on optimal control, which does not lend itself to capturing complex and vital operational constraints in chemotherapy planning involving discrete decisions, such as doses via pills and rest periods. In addition, most of the existing models for chemotherapy optimization lack an explicit toxicity measure and impose toxicity constraints primarily through (fixed) limits on drug concentration. The existing stochastic optimization models also focus on maximizing the probability of cure when tumor heterogeneity is uncertain. In this paper, we develop a mixed-integer program for combination chemotherapy (utilization of multiple drugs) optimization that incorporates various important operational constraints and, besides dose and concentration limits, controls treatment toxicity based on its effect on the count of white blood cells. To address the uncertainty of tumor heterogeneity, we propose chance constraints that guarantee reaching an operable tumor size with a high probability in a neoadjuvant setting. We present analytical results pertinent to the accuracy of the model in representing biological processes of chemotherapy and establish its merit for clinical applications through a numerical study of breast cancer.
△ Less
Submitted 2 November, 2021;
originally announced November 2021.
-
Searching for TeV gamma-ray emission from SGR\,1935+2154 during its 2020 X-ray and radio bursting phase
Authors:
H. E. S. S. Collaboration,
:,
H. Abdalla,
F. Aharonian,
F. Ait Benkhali,
E. O. Anguner,
C. Arcaro,
C. Armand,
T. Armstrong,
H. Ashkar,
M. Backes,
V. Baghmanyan,
V. Barbosa Martins,
A. Barnacka,
M. Barnard,
Y. Becherini,
D. Berge,
K. Bernlohr,
B. Bi,
M. Bottcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
M. Breuhaus,
R. Brose
, et al. (230 additional authors not shown)
Abstract:
Magnetar hyperflares are the most plausible explanation for fast radio bursts (FRB) -- enigmatic powerful radio pulses with durations of several milliseconds and high brightness temperatures. The first observational evidence for this scenario was obtained in 2020 April when a FRB was detected from the direction of the Galactic magnetar and soft gamma-ray repeater SGR\,1935+2154. The FRB was preced…
▽ More
Magnetar hyperflares are the most plausible explanation for fast radio bursts (FRB) -- enigmatic powerful radio pulses with durations of several milliseconds and high brightness temperatures. The first observational evidence for this scenario was obtained in 2020 April when a FRB was detected from the direction of the Galactic magnetar and soft gamma-ray repeater SGR\,1935+2154. The FRB was preceded by two gamma-ray outburst alerts by the BAT instrument aboard the Swift satellite, which triggered follow-up observations by the High Energy Stereoscopic System (H.E.S.S.). H.E.S.S. has observed SGR\,1935+2154 for 2 hr on 2020 April 28. The observations are coincident with X-ray bursts from the magnetar detected by INTEGRAL and Fermi-GBM, thus providing the first very high energy (VHE) gamma-ray observations of a magnetar in a flaring state. High-quality data acquired during these follow-up observations allow us to perform a search for short-time transients. No significant signal at energies $E>0.6$~TeV is found and upper limits on the persistent and transient emission are derived. We here present the analysis of these observations and discuss the obtained results and prospects of the H.E.S.S. follow-up program for soft gamma-ray repeaters.
△ Less
Submitted 1 October, 2021;
originally announced October 2021.
-
Overview of the HASOC track at FIRE 2020: Hate Speech and Offensive Content Identification in Indo-European Languages
Authors:
Thomas Mandla,
Sandip Modha,
Gautam Kishore Shahi,
Amit Kumar Jaiswal,
Durgesh Nandini,
Daksh Patel,
Prasenjit Majumder,
Johannes Schäfer
Abstract:
With the growth of social media, the spread of hate speech is also increasing rapidly. Social media are widely used in many countries. Also Hate Speech is spreading in these countries. This brings a need for multilingual Hate Speech detection algorithms. Much research in this area is dedicated to English at the moment. The HASOC track intends to provide a platform to develop and optimize Hate Spee…
▽ More
With the growth of social media, the spread of hate speech is also increasing rapidly. Social media are widely used in many countries. Also Hate Speech is spreading in these countries. This brings a need for multilingual Hate Speech detection algorithms. Much research in this area is dedicated to English at the moment. The HASOC track intends to provide a platform to develop and optimize Hate Speech detection algorithms for Hindi, German and English. The dataset is collected from a Twitter archive and pre-classified by a machine learning system. HASOC has two sub-task for all three languages: task A is a binary classification problem (Hate and Not Offensive) while task B is a fine-grained classification problem for three classes (HATE) Hate speech, OFFENSIVE and PROFANITY. Overall, 252 runs were submitted by 40 teams. The performance of the best classification algorithms for task A are F1 measures of 0.51, 0.53 and 0.52 for English, Hindi, and German, respectively. For task B, the best classification algorithms achieved F1 measures of 0.26, 0.33 and 0.29 for English, Hindi, and German, respectively. This article presents the tasks and the data development as well as the results. The best performing algorithms were mainly variants of the transformer architecture BERT. However, also other systems were applied with good success
△ Less
Submitted 12 August, 2021;
originally announced August 2021.
-
Hybrid cosmic ray measurements using the IceAct telescopes in coincidence with the IceCube and IceTop detectors
Authors:
Larissa Paul,
Matthias Plum,
Merlin Schaufel,
Thomas Bretz,
Giang Do,
John W. Hewitt,
Frank Maslowski,
Florian Rehbein,
Johannes Schäfer,
Adrian Zink
Abstract:
IceAct is a proposed surface array of compact (50 cm diameter) and cost-effective Imaging Air Cherenkov Telescopes installed at the site of the IceCube Neutrino Observatory at the geographic South Pole. Since January 2019, two IceAct telescope demonstrators, featuring 61 silicon pho- tomultiplier (SiPM) pixels have been taking data in the center of the IceTop surface array during the austral winte…
▽ More
IceAct is a proposed surface array of compact (50 cm diameter) and cost-effective Imaging Air Cherenkov Telescopes installed at the site of the IceCube Neutrino Observatory at the geographic South Pole. Since January 2019, two IceAct telescope demonstrators, featuring 61 silicon pho- tomultiplier (SiPM) pixels have been taking data in the center of the IceTop surface array during the austral winter. We present the first analysis of hybrid cosmic ray events detected by the IceAct imaging air-Cherenkov telescopes in coincidence with the IceCube Neutrino Observatory, includ- ing the IceTop surface array and the IceCube in-ice array. By featuring an energy threshold of about 10 TeV and a wide field-of-view, the IceAct telescopes show promising capabilities of im- proving current cosmic ray composition studies: measuring the Cherenkov light emissions in the atmosphere adds new information about the shower development not accessible with the current detectors, enabling significantly better primary particle type discrimination on a statistical basis. The hybrid measurement also allows for detailed feasibility studies of detector cross-calibration and of cosmic ray veto capabilities for neutrino analyses. We present the performance of the telescopes, the results from the analysis of two years of data, and an outlook of a hybrid simulation for a future telescope array.
△ Less
Submitted 12 August, 2021;
originally announced August 2021.
-
LMC N132D: A mature supernova remnant with a power-law gamma-ray spectrum extending beyond 8 TeV
Authors:
H. E. S. S. Collaboration,
:,
H. Abdalla,
F. Aharonian,
F. Ait Benkhali,
E. O. Angüner,
C. Arcaro,
C. Armand,
T. Armstrong,
H. Ashkar,
M. Backes,
V. Baghmanyan,
V. Barbosa Martins,
A. Barnacka,
M. Barnard,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
M. Breuhaus,
F. Brun
, et al. (212 additional authors not shown)
Abstract:
We analyzed 252 hours of High Energy Stereoscopic System (H.E.S.S.) observations towards the supernova remnant (SNR) LMC N132D that were accumulated between December 2004 and March 2016 during a deep survey of the Large Magellanic Cloud, adding 104 hours of observations to the previously published data set to ensure a > 5 sigma detection. To broaden the gamma-ray spectral coverage required for mod…
▽ More
We analyzed 252 hours of High Energy Stereoscopic System (H.E.S.S.) observations towards the supernova remnant (SNR) LMC N132D that were accumulated between December 2004 and March 2016 during a deep survey of the Large Magellanic Cloud, adding 104 hours of observations to the previously published data set to ensure a > 5 sigma detection. To broaden the gamma-ray spectral coverage required for modeling the spectral energy distribution, an analysis of Fermi-LAT Pass 8 data was also included. We unambiguously detect N132D at very high energies (VHE) with a significance of 5.7 sigma. We report the results of a detailed analysis of its spectrum and localization based on the extended H.E.S.S. data set. The joint analysis of the extended H.E.S.S and Fermi-LAT data results in a spectral energy distribution in the energy range from 1.7 GeV to 14.8 TeV, which suggests a high luminosity of N132D at GeV and TeV energies. We set a lower limit on a gamma-ray cutoff energy of 8 TeV with a confidence level of 95%. The new gamma-ray spectrum as well as multiwavelength observations of N132D when compared to physical models suggests a hadronic origin of the VHE gamma-ray emission. SNR N132D is a VHE gamma-ray source that shows a spectrum extending to the VHE domain without a spectral cutoff at a few TeV, unlike the younger oxygen-rich SNR Cassiopeia A. The gamma-ray properties of N132D may be affected by an interaction with a nearby molecular cloud that partially lies inside the 95% confidence region of the source position. [Abridged]
△ Less
Submitted 4 August, 2021;
originally announced August 2021.