-
Electronic spin susceptibility in metallic strontium titanate
Authors:
A. Najev,
N. Somun,
M. Spaić,
I. Khayr,
M. Greven,
A. Klein,
M. N. Gastiasoro,
D. Pelc
Abstract:
Metallic strontium titanate (SrTiO$_3$) is known to have both normal-state and superconducting properties that vary strongly over a wide range of charge carrier densities. This indicates the importance of nonlinear dynamics, and has hindered the development of a clear qualitative description of the observed behaviour. A major challenge is to understand how the charge carriers themselves evolve wit…
▽ More
Metallic strontium titanate (SrTiO$_3$) is known to have both normal-state and superconducting properties that vary strongly over a wide range of charge carrier densities. This indicates the importance of nonlinear dynamics, and has hindered the development of a clear qualitative description of the observed behaviour. A major challenge is to understand how the charge carriers themselves evolve with do** and temperature, with possible polaronic effects and evidence of an effective mass that strongly increases with temperature. Here we use $^{47,49}$Ti nuclear magnetic resonance (NMR) to perform a comprehensive study of the electronic spin susceptibility in the dilute metallic state of strontium titanate across the do**-temperature phase diagram. We find a temperature-dependent Knight shift that can be quantitatively understood within a non-degenerate Fermi gas model that fully takes into account the complex band structure of SrTiO$_3$. Our data are consistent with a temperature-independent effective mass, and we show that the behavior of the spin susceptibility is universal in a wide range of temperatures and carrier concentrations. These results provide a microscopic foundation for the understanding of the properties of the unconventional low-density metallic state in strontium titanate and related materials.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Measurement of $J/ψ$ and $ψ\left(2S\right)$ production in $p+p$ and $p+d$ interactions at 120 GeV
Authors:
C. H. Leung,
K. Nagai,
K. Nakano,
D. Nawarathne,
J. Dove,
S. Prasad,
N. Wuerfel,
C. A. Aidala,
J. Arrington,
C. Ayuso,
C. L. Barker,
C. N. Brown,
W. C. Chang,
A. Chen,
D. C. Christian,
B. P. Dannowitz,
M. Daugherity,
L. El Fassi,
D. F. Geesaman,
R. Gilman,
Y. Goto,
R. Guo,
T. J. Hague,
R. J. Holt,
M. F. Hossain
, et al. (36 additional authors not shown)
Abstract:
We report the $p+p$ and $p+d$ differential cross sections measured in the SeaQuest experiment for $J/ψ$ and $ψ\left(2S\right)$ production at 120 GeV beam energy covering the forward $x$-Feynman ($x_F$) range of $0.5 < x_F <0.9$. The measured cross sections are in good agreement with theoretical calculations based on the nonrelativistic QCD (NRQCD) using the long-distance matrix elements deduced fr…
▽ More
We report the $p+p$ and $p+d$ differential cross sections measured in the SeaQuest experiment for $J/ψ$ and $ψ\left(2S\right)$ production at 120 GeV beam energy covering the forward $x$-Feynman ($x_F$) range of $0.5 < x_F <0.9$. The measured cross sections are in good agreement with theoretical calculations based on the nonrelativistic QCD (NRQCD) using the long-distance matrix elements deduced from a recent global analysis of proton- and pion-induced charmonium production data. The $σ_{ψ\left(2S\right)} / σ_{J/ψ}$ cross section ratios are found to increase as $x_F$ increases, indicating that the $q \bar{q}$ annihilation process has larger contributions in the $ψ\left(2S\right)$ production than the $J/ψ$ production. The $σ_{pd}/2σ_{pp}$ cross section ratios are observed to be significantly different for the Drell-Yan process and $J/ψ$ production, reflecting their different production mechanisms. We find that the $σ_{pd}/2σ_{pp}$ ratios for $J/ψ$ production at the forward $x_F$ region are sensitive to the $\bar{d}/ \bar{u}$ flavor asymmetry of the proton sea, analogous to the Drell-Yan process. The transverse momentum ($p_T$) distributions for $J/ψ$ and $ψ\left(2S\right)$ production are also presented and compared with data collected at higher center-of-mass energies.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Designing metasurface optical interfaces for solid-state qubits using many-body adjoint shape optimization
Authors:
Amelia R. Klein,
Nader Engheta,
Lee C. Bassett
Abstract:
We present a general strategy for the inverse design of metasurfaces composed of elementary shapes. We use it to design a structure that collects and collimates light from nitrogen-vacancy centers in diamond. Such metasurfaces constitute scalable optical interfaces for solid-state qubits, enabling efficient photon coupling into optical fibers and eliminating free-space collection optics. The many-…
▽ More
We present a general strategy for the inverse design of metasurfaces composed of elementary shapes. We use it to design a structure that collects and collimates light from nitrogen-vacancy centers in diamond. Such metasurfaces constitute scalable optical interfaces for solid-state qubits, enabling efficient photon coupling into optical fibers and eliminating free-space collection optics. The many-body shape optimization strategy is a practical alternative to topology optimization that explicitly enforces material and fabrication constraints throughout the optimization, while still achieving high performance. The metasurface is easily adaptable to other solid-state qubits, and the optimization method is broadly applicable to fabrication-constrained photonic design problems.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models
Authors:
Rhea Sanjay Sukthanker,
Arber Zela,
Benedikt Staffler,
Aaron Klein,
Lennart Purucker,
Joerg K. H. Franke,
Frank Hutter
Abstract:
The increasing size of language models necessitates a thorough analysis across multiple dimensions to assess trade-offs among crucial hardware metrics such as latency, energy consumption, GPU memory usage, and performance. Identifying optimal model configurations under specific hardware constraints is becoming essential but remains challenging due to the computational load of exhaustive training a…
▽ More
The increasing size of language models necessitates a thorough analysis across multiple dimensions to assess trade-offs among crucial hardware metrics such as latency, energy consumption, GPU memory usage, and performance. Identifying optimal model configurations under specific hardware constraints is becoming essential but remains challenging due to the computational load of exhaustive training and evaluation on multiple devices. To address this, we introduce HW-GPT-Bench, a hardware-aware benchmark that utilizes surrogate predictions to approximate various hardware metrics across 13 devices of architectures in the GPT-2 family, with architectures containing up to 774M parameters. Our surrogates, via calibrated predictions and reliable uncertainty estimates, faithfully model the heteroscedastic noise inherent in the energy and latency measurements. To estimate perplexity, we employ weight-sharing techniques from Neural Architecture Search (NAS), inheriting pretrained weights from the largest GPT-2 model. Finally, we demonstrate the utility of HW-GPT-Bench by simulating optimization trajectories of various multi-objective optimization algorithms in just a few seconds.
△ Less
Submitted 21 June, 2024; v1 submitted 16 May, 2024;
originally announced May 2024.
-
Structural Pruning of Pre-trained Language Models via Neural Architecture Search
Authors:
Aaron Klein,
Jacek Golebiowski,
Xingchen Ma,
Valerio Perrone,
Cedric Archambeau
Abstract:
Pre-trained language models (PLM), for example BERT or RoBERTa, mark the state-of-the-art for natural language understanding task when fine-tuned on labeled data. However, their large size poses challenges in deploying them for inference in real-world applications, due to significant GPU memory requirements and high inference latency. This paper explores neural architecture search (NAS) for struct…
▽ More
Pre-trained language models (PLM), for example BERT or RoBERTa, mark the state-of-the-art for natural language understanding task when fine-tuned on labeled data. However, their large size poses challenges in deploying them for inference in real-world applications, due to significant GPU memory requirements and high inference latency. This paper explores neural architecture search (NAS) for structural pruning to find sub-parts of the fine-tuned network that optimally trade-off efficiency, for example in terms of model size or latency, and generalization performance. We also show how we can utilize more recently developed two-stage weight-sharing NAS approaches in this setting to accelerate the search process. Unlike traditional pruning methods with fixed thresholds, we propose to adopt a multi-objective approach that identifies the Pareto optimal set of sub-networks, allowing for a more flexible and automated compression process.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
Singlet, triplet, and mixed all-to-all pairing states emerging from incoherent fermions
Authors:
Jagannath Sutradhar,
Jonathan Ruhman,
Avraham Klein
Abstract:
The electron-electron and electron-phonon coupling in complex materials can be more complicated than simple density-density interactions, involving intertwined dynamics of spin, charge, and spatial symmetries. This motivates studying universal models with complex interactions, and studying whether in this case BCS-type singlet pairing is still the ``natural'' fate of the system. To this end, we co…
▽ More
The electron-electron and electron-phonon coupling in complex materials can be more complicated than simple density-density interactions, involving intertwined dynamics of spin, charge, and spatial symmetries. This motivates studying universal models with complex interactions, and studying whether in this case BCS-type singlet pairing is still the ``natural'' fate of the system. To this end, we construct a Yukawa-SYK model with nonlocal couplings in both spin and charge channels. Furthermore, we provide for time-reversal-symmetry breaking dynamics by averaging over the Gaussian Unitary ensemble rather than the Orthogonal ensemble. We find that the ground state of the system can be an orbitally nonlocal superconducting state arising from incoherent fermions with no BCS-like analog. The superconductivity has an equal tendency to triplet and singlet pairing states separated by a non-Fermi liquid phase. We further study the fate of the system within the superconducting phase and find that the expected ground state, away from the critical point, is a mixed singlet/triplet state. Finally, we find that while at $T_c$ the triplet and singlet transitions are dual to one another, below $T_c$ the duality is broken, with the triplet state more susceptible to orbital fluctuations just by virtue of its symmetry. Our results indicate that such fluctuation-induced mixed states may be an inherent feature of strongly correlated materials.
△ Less
Submitted 15 April, 2024; v1 submitted 4 April, 2024;
originally announced April 2024.
-
NL2KQL: From Natural Language to Kusto Query
Authors:
Amir H. Abdi,
Xinye Tang,
Jeremias Eichelbaum,
Mahan Das,
Alex Klein,
Nihal Irmak Pakis,
William Blum,
Daniel L Mace,
Tanvi Raja,
Namrata Padmanabhan,
Ye Xing
Abstract:
Data is growing rapidly in volume and complexity. Proficiency in database query languages is pivotal for crafting effective queries. As coding assistants become more prevalent, there is significant opportunity to enhance database query languages. The Kusto Query Language (KQL) is a widely used query language for large semi-structured data such as logs, telemetries, and time-series for big data ana…
▽ More
Data is growing rapidly in volume and complexity. Proficiency in database query languages is pivotal for crafting effective queries. As coding assistants become more prevalent, there is significant opportunity to enhance database query languages. The Kusto Query Language (KQL) is a widely used query language for large semi-structured data such as logs, telemetries, and time-series for big data analytics platforms. This paper introduces NL2KQL an innovative framework that uses large language models (LLMs) to convert natural language queries (NLQs) to KQL queries. The proposed NL2KQL framework includes several key components: Schema Refiner which narrows down the schema to its most pertinent elements; the Few-shot Selector which dynamically selects relevant examples from a few-shot dataset; and the Query Refiner which repairs syntactic and semantic errors in KQL queries. Additionally, this study outlines a method for generating large datasets of synthetic NLQ-KQL pairs which are valid within a specific database contexts. To validate NL2KQL's performance, we utilize an array of online (based on query execution) and offline (based on query parsing) metrics. Through ablation studies, the significance of each framework component is examined, and the datasets used for benchmarking are made publicly available. This work is the first of its kind and is compared with available baselines to demonstrate its effectiveness.
△ Less
Submitted 15 April, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
Nano/micro-plastics effects in agricultural landscapes: an overlooked threat to pollination, biological pest control, and food security
Authors:
Dong Sheng,
Siyuan **g,
Xueqing He,
Alexandra-Maria Klein,
Heinz-R. Köhler,
Thomas C. Wanger
Abstract:
Biodiversity-associated ecosystem services such as pollination and biocontrol may be severely affected by emerging nano/micro-plastics (NMP) pollution. We synthesized the little-explored effects of NMP on pollinators and biocontrol agents on the organismal, farm and landscape scale. For instance ingested NMP trigger organismal changes from gene expression, organ damage to behavior modifications. A…
▽ More
Biodiversity-associated ecosystem services such as pollination and biocontrol may be severely affected by emerging nano/micro-plastics (NMP) pollution. We synthesized the little-explored effects of NMP on pollinators and biocontrol agents on the organismal, farm and landscape scale. For instance ingested NMP trigger organismal changes from gene expression, organ damage to behavior modifications. At the farm and landscape level, NMP will likely amplify synergistic effects with other threats such as pathogens and antibiotics, and may alter landscape properties such as floral resource distributions in high NMP concentration areas, what we call NMP islands. It is essential to understand the functional exposure pathways of NMP on pollinators and biocontrol agents to comprehensively evaluate the risks for agricultural ecosystems and global food security.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
A novel methodological framework for the analysis of health trajectories and survival outcomes in heart failure patients
Authors:
Juliette Murris,
Tristan Amadei,
Tristan Kirscher,
Antoine Klein,
Anne-Isabelle Tropeano,
Sandrine Katsahian
Abstract:
Heart failure (HF) contributes to circa 200,000 annual hospitalizations in France. With the increasing age of HF patients, elucidating the specific causes of inpatient mortality became a public health problematic. We introduce a novel methodological framework designed to identify prevalent health trajectories and investigate their impact on death. The initial step involves applying sequential patt…
▽ More
Heart failure (HF) contributes to circa 200,000 annual hospitalizations in France. With the increasing age of HF patients, elucidating the specific causes of inpatient mortality became a public health problematic. We introduce a novel methodological framework designed to identify prevalent health trajectories and investigate their impact on death. The initial step involves applying sequential pattern mining to characterize patients' trajectories, followed by an unsupervised clustering algorithm based on a new metric for measuring the distance between hospitalization diagnoses. Finally, a survival analysis is conducted to assess survival outcomes. The application of this framework to HF patients from a representative sample of the French population demonstrates its methodological significance in enhancing the analysis of healthcare trajectories.
△ Less
Submitted 20 March, 2024; v1 submitted 5 March, 2024;
originally announced March 2024.
-
LISA Definition Study Report
Authors:
Monica Colpi,
Karsten Danzmann,
Martin Hewitson,
Kelly Holley-Bockelmann,
Philippe Jetzer,
Gijs Nelemans,
Antoine Petiteau,
David Shoemaker,
Carlos Sopuerta,
Robin Stebbins,
Nial Tanvir,
Henry Ward,
William Joseph Weber,
Ira Thorpe,
Anna Daurskikh,
Atul Deep,
Ignacio Fernández Núñez,
César García Marirrodriga,
Martin Gehler,
Jean-Philippe Halain,
Oliver Jennrich,
Uwe Lammers,
Jonan Larrañaga,
Maike Lieser,
Nora Lützgendorf
, et al. (86 additional authors not shown)
Abstract:
The Laser Interferometer Space Antenna (LISA) is the first scientific endeavour to detect and study gravitational waves from space. LISA will survey the sky for Gravitational Waves in the 0.1 mHz to 1 Hz frequency band which will enable the study of a vast number of objects ranging from Galactic binaries and stellar mass black holes in the Milky Way, to distant massive black-hole mergers and the e…
▽ More
The Laser Interferometer Space Antenna (LISA) is the first scientific endeavour to detect and study gravitational waves from space. LISA will survey the sky for Gravitational Waves in the 0.1 mHz to 1 Hz frequency band which will enable the study of a vast number of objects ranging from Galactic binaries and stellar mass black holes in the Milky Way, to distant massive black-hole mergers and the expansion of the Universe. This definition study report, or Red Book, presents a summary of the very large body of work that has been undertaken on the LISA mission over the LISA definition phase.
△ Less
Submitted 12 February, 2024;
originally announced February 2024.
-
Efficient Gravitational-Wave Model for Fully-Precessing and Moderately-Eccentric, Compact Binary Inspirals
Authors:
J. Nijaid Arredondo,
Antoine Klein,
Nicolás Yunes
Abstract:
Future gravitational-wave detectors, especially the Laser Interferometer Space Antenna (LISA), will be sensitive to black hole binaries formed in astrophysical environments that promote large eccentricities and spin precession. Gravitational-wave templates that include both effects have only recently begun to be developed. The Efficient Fully Precessing Eccentric (EFPE) family is one such model, c…
▽ More
Future gravitational-wave detectors, especially the Laser Interferometer Space Antenna (LISA), will be sensitive to black hole binaries formed in astrophysical environments that promote large eccentricities and spin precession. Gravitational-wave templates that include both effects have only recently begun to be developed. The Efficient Fully Precessing Eccentric (EFPE) family is one such model, covering the inspiral stage with small-eccentricity-expanded gravitational-wave amplitudes accurate for eccentricities $e < 0.3$. In this work, we extend this model to cover a larger range of eccentricities. The new EFPE_ME model is able to accurately represent the leading-order gravitational-wave amplitudes to $e \leq 0.8$. Comparing the EFPE and the EFPE_ME models in the LISA band, however, reveals that there is no significant difference when $e_0 \leq 0.5$ for binaries at 4 years before merger, as radiation reaction circularizes supermassive black hole binaries too quickly. This suggests that the EFPE model may have a larger regime of validity in eccentricity space than previously thought, making it suitable for some inspiral parameter estimation with LISA data. On the other hand, for systems with $e_0 > 0.5$, the deviations between the models are significant, particularly for binaries with total masses below $10^5\, \mathrm{M}_{\odot}$. This suggests that the EFPE_ME model will be crucial to avoid systematic bias in parameter estimation with LISA in the future, once this model has been hybridized to include the merger and ringdown.
△ Less
Submitted 9 February, 2024;
originally announced February 2024.
-
An ion trap design for a space-deployable strontium-ion optical clock
Authors:
Alessio Spampinato,
Jonathan Stacey,
Sean Mulholland,
Billy I. Robertson,
Hugh A. Klein,
Guilong Huang,
Geoffrey P. Barwood,
Patrick Gill
Abstract:
Optical atomic clocks demonstrate a better stability and lower systematic uncertainty than the highest performance microwave atomic clocks. However, the best performing optical clocks have a large footprint in a laboratory environment and require specialist skills to maintain continuous operation. Growing and evolving needs across several sectors are increasing the demand for compact robust and po…
▽ More
Optical atomic clocks demonstrate a better stability and lower systematic uncertainty than the highest performance microwave atomic clocks. However, the best performing optical clocks have a large footprint in a laboratory environment and require specialist skills to maintain continuous operation. Growing and evolving needs across several sectors are increasing the demand for compact robust and portable devices at this capability level. In this paper we discuss the design of a physics package for a compact laser-cooled 88Sr+ optical clock that would, with further development, be suitable for space deployment. We review the design parameters to target a relative frequency uncertainty at the low parts in 10^18 with this system. We then explain the results of finite element modelling to simulate the response of the ion trap and vacuum chamber to vibration, shock and thermal conditions expected during launch and space deployment. Additionally, an electrostatic model has been developed to investigate the relationship between the ion trap geometrical tolerances and the trap** efficiency. We present the results from these analyses that have led to the design of a more robust prototype ready for experimental testing.
△ Less
Submitted 23 January, 2024;
originally announced January 2024.
-
The Best Time for an Update: Risk-Sensitive Minimization of Age-Based Metrics
Authors:
Wanja de Sombre,
Andrea Ortiz,
Frank Aurzada,
Anja Klein
Abstract:
Popular methods to quantify transmitted data quality are the Age of Information (AoI), the Query Age of Information (QAoI), and the Age of Incorrect Information (AoII). We consider these metrics in a point-to-point wireless communication system, where the transmitter monitors a process and sends status updates to a receiver. The challenge is to decide on the best time for an update, balancing the…
▽ More
Popular methods to quantify transmitted data quality are the Age of Information (AoI), the Query Age of Information (QAoI), and the Age of Incorrect Information (AoII). We consider these metrics in a point-to-point wireless communication system, where the transmitter monitors a process and sends status updates to a receiver. The challenge is to decide on the best time for an update, balancing the transmission energy and the age-based metric at the receiver. Due to the inherent risk of high age-based metric values causing complications such as unstable system states, we introduce the new concept of risky states to denote states with high age-based metric. We use this new notion of risky states to quantify and minimize this risk of experiencing high age-based metrics by directly deriving the frequency of risky states as a novel risk-metric. Building on this foundation, we introduce two risk-sensitive strategies for AoI, QAoI and AoII. The first strategy uses system knowledge, i.e., channel quality and packet arrival probability, to find an optimal strategy that transmits when the age-based metric exceeds a tunable threshold. A lower threshold leads to higher risk-sensitivity. The second strategy uses an enhanced Q-learning approach and balances the age-based metric, the transmission energy and the frequency of risky states without requiring knowledge about the system. Numerical results affirm our risk-sensitive strategies' high effectiveness.
△ Less
Submitted 3 January, 2024;
originally announced January 2024.
-
Bulk synthesis of Zn$_3$WN$_4$ via solid-state metathesis
Authors:
Christopher L. Rom,
Shaun O'Donnell,
Kayla Huang,
Ryan A. Klein,
Morgan J. Kramer,
Rebecca W. Smaha,
Andriy Zakutayev
Abstract:
Ternary nitrides are of growing technological importance, with applications as semiconductors, catalysts, and magnetic materials; however, new synthetic tools are needed to advance materials discovery efforts. Here, we show that Zn$_3$WN$_4$ can be synthesized via metathesis reactions between Li$_6$WN$_4$ and Zn$X_2$ ($X$ = Br, Cl, F). In situ synchrotron powder X-ray diffraction and differential…
▽ More
Ternary nitrides are of growing technological importance, with applications as semiconductors, catalysts, and magnetic materials; however, new synthetic tools are needed to advance materials discovery efforts. Here, we show that Zn$_3$WN$_4$ can be synthesized via metathesis reactions between Li$_6$WN$_4$ and Zn$X_2$ ($X$ = Br, Cl, F). In situ synchrotron powder X-ray diffraction and differential scanning calorimetry show that the reaction onset is correlated with the Zn$X_2$ melting point and that product purity is inversely correlated with the reaction's exothermicity. High resolution synchrotron powder X-ray diffraction measurements show that this bulk synthesis produces a structure with substantial cation ordering, as opposed to the disordered structure initially discovered via thin film sputtering. Diffuse reflectance spectroscopy reveals that Zn$_3$WN$_4$ powders exhibit two optical absorption onsets at 2.5 eV and 4.0 eV, indicating wide-bandgap semiconducting behavior and suggesting a small amount of structural disorder. We hypothesize that this synthesis strategy is generalizable because many potential Li-$M$-N precursors (where $M$ is a metal) are available for synthesizing new ternary nitride materials. This work introduces a promising synthesis strategy that will accelerate the discovery of novel functional ternary nitrides and other currently inaccessible materials.
△ Less
Submitted 5 January, 2024; v1 submitted 3 January, 2024;
originally announced January 2024.
-
Tuning the spontaneous exchange bias effect in La1.5Sr0.5CoMnO6 with sintering temperature
Authors:
C. Macchiutti,
J. R. Jesus,
F. B. Carneiro,
L. Bufaical,
R. A. Klein,
Q. Zhang,
M. Kirkham,
C. M. Brown,
R. D. dos Reis,
G. Perez,
E. M. Bittar
Abstract:
Here, we present a study of the influence of microstructure on the magnetic properties of polycrystalline samples of the La1.5Sr0.5CoMnO6 double perovskite, with primary attention to the spontaneous exchange bias effect, a fascinating recently discovered phenomena for which some materials exhibit unidirectional magnetic anisotropy after being cooled in zero magnetic fields. By sintering La1.5Sr0.5…
▽ More
Here, we present a study of the influence of microstructure on the magnetic properties of polycrystalline samples of the La1.5Sr0.5CoMnO6 double perovskite, with primary attention to the spontaneous exchange bias effect, a fascinating recently discovered phenomena for which some materials exhibit unidirectional magnetic anisotropy after being cooled in zero magnetic fields. By sintering La1.5Sr0.5CoMnO6 at different temperatures, we obtained samples with distinct average grain sizes, ranging from 1.54 to 6.65 mu_m. A detailed investigation of the material's structural, morphologic, electronic, and magnetic properties using X-ray powder diffraction, powder neutron diffraction, X-ray absorption near edge structure spectroscopy, scanning electron microscopy, and AC and DC magnetometry has revealed a systematic enhancement of the exchange bias effect with increasing the average grain size. This evolution is discussed in terms of changes in the material's porosity and grain morphology and its influence on the exchange couplings at the magnetic interfaces.
△ Less
Submitted 29 April, 2024; v1 submitted 14 December, 2023;
originally announced December 2023.
-
The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024
Authors:
Benjamin Kiefer,
Lojze Žust,
Matej Kristan,
Janez Perš,
Matija Teršek,
Arnold Wiliem,
Martin Messmer,
Cheng-Yen Yang,
Hsiang-Wei Huang,
Zhongyu Jiang,
Heng-Cheng Kuo,
Jie Mei,
Jenq-Neng Hwang,
Daniel Stadler,
Lars Sommer,
Kaer Huang,
Aiguo Zheng,
Weitu Chong,
Kanokphan Lertniphonphan,
Jun Xie,
Feng Chen,
Jian Li,
Zhepeng Wang,
Luca Zedda,
Andrea Loddo
, et al. (24 additional authors not shown)
Abstract:
The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024 addresses maritime computer vision for Unmanned Aerial Vehicles (UAV) and Unmanned Surface Vehicles (USV). Three challenges categories are considered: (i) UAV-based Maritime Object Tracking with Re-identification, (ii) USV-based Maritime Obstacle Segmentation and Detection, (iii) USV-based Maritime Boat Tracking. The USV-based Maritime Obst…
▽ More
The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024 addresses maritime computer vision for Unmanned Aerial Vehicles (UAV) and Unmanned Surface Vehicles (USV). Three challenges categories are considered: (i) UAV-based Maritime Object Tracking with Re-identification, (ii) USV-based Maritime Obstacle Segmentation and Detection, (iii) USV-based Maritime Boat Tracking. The USV-based Maritime Obstacle Segmentation and Detection features three sub-challenges, including a new embedded challenge addressing efficicent inference on real-world embedded devices. This report offers a comprehensive overview of the findings from the challenges. We provide both statistical and qualitative analyses, evaluating trends from over 195 submissions. All datasets, evaluation code, and the leaderboard are available to the public at https://macvi.org/workshop/macvi24.
△ Less
Submitted 23 November, 2023;
originally announced November 2023.
-
Slow propagation of information on the random XXZ quantum spin chain
Authors:
Alexander Elgart,
Abel Klein
Abstract:
The random XXZ quantum spin chain manifests localization (in the form of quasi-locality) in any fixed energy interval, as previously proved by the authors. In this article it is shown that this property implies slow propagation of information, one of the putative signatures of many-body localization, in the same energy interval.
The random XXZ quantum spin chain manifests localization (in the form of quasi-locality) in any fixed energy interval, as previously proved by the authors. In this article it is shown that this property implies slow propagation of information, one of the putative signatures of many-body localization, in the same energy interval.
△ Less
Submitted 25 June, 2024; v1 submitted 23 November, 2023;
originally announced November 2023.
-
Discovering neutron stars with LISA via measurements of orbital eccentricity in Galactic binaries
Authors:
Christopher J. Moore,
Eliot Finch,
Antoine Klein,
Valeriya Korol,
Nhat Pham,
Daniel Robins
Abstract:
LISA will detect $\sim \! 10^4$ Galactic binaries, the majority being double white dwarfs. However, approximately $\sim \! 1 \textrm{--} 5 \%$ of these systems will contain neutron stars which, if they can be correctly identified, will provide new opportunities for studying binary evolution pathways involving mass reversal and supernovae as well as being promising targets for multi-messenger obser…
▽ More
LISA will detect $\sim \! 10^4$ Galactic binaries, the majority being double white dwarfs. However, approximately $\sim \! 1 \textrm{--} 5 \%$ of these systems will contain neutron stars which, if they can be correctly identified, will provide new opportunities for studying binary evolution pathways involving mass reversal and supernovae as well as being promising targets for multi-messenger observations. Eccentricity, expected from neutron star natal kicks, will be a key identifying signature for binaries containing a neutron star. Eccentric binaries radiate at widely-spaced frequency harmonics that must first be identified as originating from a single source and then analysed coherently. A multi-harmonic heterodyning approach for this type of data analysis is used to perform Bayesian parameter estimation on a range of simulated eccentric LISA signals. This is used to: (i) investigate LISA's ability to measure orbital eccentricity and to quantify the minimum detectable eccentricity; (ii) demonstrate how eccentricity and periastron precession help to break the mass degeneracy allowing the individual component masses to be inferred, potentially confirming the presence of a neutron star; (iii) investigate the possibility of source misidentification when the individual harmonics of an eccentric binary masquerade as separate circular binaries; and (iv) investigate the possibility of source reclassification, where parameter estimation results of multiple circular analyses are combined in postprocessing to quickly infer the parameters of an eccentric source. The broader implications of this for the ongoing design of the LISA global fit are also discussed.
△ Less
Submitted 8 June, 2024; v1 submitted 10 October, 2023;
originally announced October 2023.
-
Neutron Star - White Dwarf Binaries: Probing Formation Pathways and Natal Kicks with LISA
Authors:
Valeriya Korol,
Andrei P. Igoshev,
Silvia Toonen,
Nikolaos Karnesis,
Christopher J. Moore,
Eliot Finch,
Antoine Klein
Abstract:
Neutron star-white dwarf (NS+WD) binaries offer a unique opportunity for studying NS-specific phenomena with gravitational waves. In this paper, we employ the binary population synthesis technique to study the Galactic population of NS+WDs with the future Laser Interferometer Space Antenna (LISA). We anticipate approximately $\mathcal{O}(10^2)$ detectable NS+WDs by LISA, encompassing both circular…
▽ More
Neutron star-white dwarf (NS+WD) binaries offer a unique opportunity for studying NS-specific phenomena with gravitational waves. In this paper, we employ the binary population synthesis technique to study the Galactic population of NS+WDs with the future Laser Interferometer Space Antenna (LISA). We anticipate approximately $\mathcal{O}(10^2)$ detectable NS+WDs by LISA, encompassing both circular and eccentric binaries formed via different pathways. Despite the challenge of distinguishing NS+WDs from more prevalent double white dwarfs in the LISA data (especially at frequencies below 2 mHz), we show that their eccentricity and chirp mass distributions may provide avenues to explore the NS natal kicks and common envelope evolution. Additionally, we investigate the spatial distribution of detectable NS+WDs relative to the Galactic plane and discuss prospects for identifying electromagnetic counterparts at radio wavelengths. Our results emphasise LISA's capability to detect and characterise NS+WDs and to offer insights into the properties of the underlying population. Our conclusions carry significant implications for sha** LISA data analysis strategies and future data interpretation.
△ Less
Submitted 9 April, 2024; v1 submitted 10 October, 2023;
originally announced October 2023.
-
Map** of Internet "Coastlines" via Large Scale Anonymized Network Source Correlations
Authors:
Hayden Jananthan,
Jeremy Kepner,
Michael Jones,
William Arcand,
David Bestor,
William Bergeron,
Chansup Byun,
Timothy Davis,
Vijay Gadepally,
Daniel Grant,
Michael Houle,
Matthew Hubbell,
Anna Klein,
Lauren Milechin,
Guillermo Morales,
Andrew Morris,
Julie Mullen,
Ritesh Patel,
Alex Pentland,
Sandeep Pisharody,
Andrew Prout,
Albert Reuther,
Antonio Rosa,
Siddharth Samsi,
Tyler Trigg
, et al. (3 additional authors not shown)
Abstract:
Expanding the scientific tools available to protect computer networks can be aided by a deeper understanding of the underlying statistical distributions of network traffic and their potential geometric interpretations. Analyses of large scale network observations provide a unique window into studying those underlying statistics. Newly developed GraphBLAS hypersparse matrices and D4M associative ar…
▽ More
Expanding the scientific tools available to protect computer networks can be aided by a deeper understanding of the underlying statistical distributions of network traffic and their potential geometric interpretations. Analyses of large scale network observations provide a unique window into studying those underlying statistics. Newly developed GraphBLAS hypersparse matrices and D4M associative array technologies enable the efficient anonymized analysis of network traffic on the scale of trillions of events. This work analyzes over 100,000,000,000 anonymized packets from the largest Internet telescope (CAIDA) and over 10,000,000 anonymized sources from the largest commercial honeyfarm (GreyNoise). Neither CAIDA nor GreyNoise actively emit Internet traffic and provide distinct observations of unsolicited Internet traffic (primarily botnets and scanners). Analysis of these observations confirms the previously observed Cauchy-like distributions describing temporal correlations between Internet sources. The Gull lighthouse problem is a well-known geometric characterization of the standard Cauchy distribution and motivates a potential geometric interpretation for Internet observations. This work generalizes the Gull lighthouse problem to accommodate larger classes of coastlines, deriving a closed-form solution for the resulting probability distributions, stating and examining the inverse problem of identifying an appropriate coastline given a continuous probability distribution, identifying a geometric heuristic for solving this problem computationally, and applying that heuristic to examine the temporal geometry of different subsets of network observations. Application of this method to the CAIDA and GreyNoise data reveals a several orders of magnitude difference between known benign and other traffic which can lead to potentially novel ways to protect networks.
△ Less
Submitted 30 September, 2023;
originally announced October 2023.
-
Decentralized Online Learning in Task Assignment Games for Mobile Crowdsensing
Authors:
Bernd Simon,
Andrea Ortiz,
Walid Saad,
Anja Klein
Abstract:
The problem of coordinated data collection is studied for a mobile crowdsensing (MCS) system. A mobile crowdsensing platform (MCSP) sequentially publishes sensing tasks to the available mobile units (MUs) that signal their willingness to participate in a task by sending sensing offers back to the MCSP. From the received offers, the MCSP decides the task assignment. A stable task assignment must ad…
▽ More
The problem of coordinated data collection is studied for a mobile crowdsensing (MCS) system. A mobile crowdsensing platform (MCSP) sequentially publishes sensing tasks to the available mobile units (MUs) that signal their willingness to participate in a task by sending sensing offers back to the MCSP. From the received offers, the MCSP decides the task assignment. A stable task assignment must address two challenges: the MCSP's and MUs' conflicting goals, and the uncertainty about the MUs' required efforts and preferences. To overcome these challenges a novel decentralized approach combining matching theory and online learning, called collision-avoidance multi-armed bandit with strategic free sensing (CA-MAB-SFS), is proposed. The task assignment problem is modeled as a matching game considering the MCSP's and MUs' individual goals while the MUs learn their efforts online. Our innovative "free-sensing" mechanism significantly improves the MU's learning process while reducing collisions during task allocation. The stable regret of CA-MAB-SFS, i.e., the loss of learning, is analytically shown to be bounded by a sublinear function, ensuring the convergence to a stable optimal solution. Simulation results show that CA-MAB-SFS increases the MUs' and the MCSP's satisfaction compared to state-of-the-art methods while reducing the average task completion time by at least 16%.
△ Less
Submitted 19 September, 2023;
originally announced September 2023.
-
pPython Performance Study
Authors:
Chansup Byun,
William Arcand,
David Bestor,
Bill Bergeron,
Vijay Gadepally,
Michael Houle,
Matthew Hubbell,
Hayden Jananthan,
Michael Jones,
Anna Klein,
Peter Michaleas,
Lauren Milechin,
Guillermo Morales,
Julie Mullen,
Andrew Prout,
Albert Reuther,
Antonio Rosa,
Siddharth Samsi,
Charles Yee,
Jeremy Kepner
Abstract:
pPython seeks to provide a parallel capability that provides good speed-up without sacrificing the ease of programming in Python by implementing partitioned global array semantics (PGAS) on top of a simple file-based messaging library (PythonMPI) in pure Python. pPython follows a SPMD (single program multiple data) model of computation. pPython runs on a single-node (e.g., a laptop) running Window…
▽ More
pPython seeks to provide a parallel capability that provides good speed-up without sacrificing the ease of programming in Python by implementing partitioned global array semantics (PGAS) on top of a simple file-based messaging library (PythonMPI) in pure Python. pPython follows a SPMD (single program multiple data) model of computation. pPython runs on a single-node (e.g., a laptop) running Windows, Linux, or MacOS operating systems or on any combination of heterogeneous systems that support Python, including on a cluster through a Slurm scheduler interface so that pPython can be executed in a massively parallel computing environment. It is interesting to see what performance pPython can achieve compared to the traditional socket-based MPI communication because of its unique file-based messaging implementation. In this paper, we present the point-to-point and collective communication performances of pPython and compare them with those obtained by using mpi4py with OpenMPI. For large messages, pPython demonstrates comparable performance as compared to mpi4py.
△ Less
Submitted 7 September, 2023;
originally announced September 2023.
-
Deployment of Real-Time Network Traffic Analysis using GraphBLAS Hypersparse Matrices and D4M Associative Arrays
Authors:
Michael Jones,
Jeremy Kepner,
Andrew Prout,
Timothy Davis,
William Arcand,
David Bestor,
William Bergeron,
Chansup Byun,
Vijay Gadepally,
Micheal Houle,
Matthew Hubbell,
Hayden Jananthan,
Anna Klein,
Lauren Milechin,
Guillermo Morales,
Julie Mullen,
Ritesh Patel,
Sandeep Pisharody,
Albert Reuther,
Antonio Rosa,
Siddharth Samsi,
Charles Yee,
Peter Michaleas
Abstract:
Matrix/array analysis of networks can provide significant insight into their behavior and aid in their operation and protection. Prior work has demonstrated the analytic, performance, and compression capabilities of GraphBLAS (graphblas.org) hypersparse matrices and D4M (d4m.mit.edu) associative arrays (a mathematical superset of matrices). Obtaining the benefits of these capabilities requires int…
▽ More
Matrix/array analysis of networks can provide significant insight into their behavior and aid in their operation and protection. Prior work has demonstrated the analytic, performance, and compression capabilities of GraphBLAS (graphblas.org) hypersparse matrices and D4M (d4m.mit.edu) associative arrays (a mathematical superset of matrices). Obtaining the benefits of these capabilities requires integrating them into operational systems, which comes with its own unique challenges. This paper describes two examples of real-time operational implementations. First, is an operational GraphBLAS implementation that constructs anonymized hypersparse matrices on a high-bandwidth network tap. Second, is an operational D4M implementation that analyzes daily cloud gateway logs. The architectures of these implementations are presented. Detailed measurements of the resources and the performance are collected and analyzed. The implementations are capable of meeting their operational requirements using modest computational resources (a couple of processing cores). GraphBLAS is well-suited for low-level analysis of high-bandwidth connections with relatively structured network data. D4M is well-suited for higher-level analysis of more unstructured data. This work demonstrates that these technologies can be implemented in operational settings.
△ Less
Submitted 8 December, 2023; v1 submitted 4 September, 2023;
originally announced September 2023.
-
Focusing and Calibration of Large Scale Network Sensors using GraphBLAS Anonymized Hypersparse Matrices
Authors:
Jeremy Kepner,
Michael Jones,
Phil Dykstra,
Chansup Byun,
Timothy Davis,
Hayden Jananthan,
William Arcand,
David Bestor,
William Bergeron,
Vijay Gadepally,
Micheal Houle,
Matthew Hubbell,
Anna Klein,
Lauren Milechin,
Guillermo Morales,
Julie Mullen,
Ritesh Patel,
Alex Pentland,
Sandeep Pisharody,
Andrew Prout,
Albert Reuther,
Antonio Rosa,
Siddharth Samsi,
Tyler Trigg,
Charles Yee
, et al. (1 additional authors not shown)
Abstract:
Defending community-owned cyber space requires community-based efforts. Large-scale network observations that uphold the highest regard for privacy are key to protecting our shared cyberspace. Deployment of the necessary network sensors requires careful sensor placement, focusing, and calibration with significant volumes of network observations. This paper demonstrates novel focusing and calibrati…
▽ More
Defending community-owned cyber space requires community-based efforts. Large-scale network observations that uphold the highest regard for privacy are key to protecting our shared cyberspace. Deployment of the necessary network sensors requires careful sensor placement, focusing, and calibration with significant volumes of network observations. This paper demonstrates novel focusing and calibration procedures on a multi-billion packet dataset using high-performance GraphBLAS anonymized hypersparse matrices. The run-time performance on a real-world data set confirms previously observed real-time processing rates for high-bandwidth links while achieving significant data compression. The output of the analysis demonstrates the effectiveness of these procedures at focusing the traffic matrix and revealing the underlying stable heavy-tail statistical distributions that are necessary for anomaly detection. A simple model of the corresponding probability of detection ($p_{\rm d}$) and probability of false alarm ($p_{\rm fa}$) for these distributions highlights the criticality of network sensor focusing and calibration. Once a sensor is properly focused and calibrated it is then in a position to carry out two of the central tenets of good cybersecurity: (1) continuous observation of the network and (2) minimizing unbrokered network connections.
△ Less
Submitted 4 September, 2023;
originally announced September 2023.
-
Multiferroicity in plastically deformed SrTiO$_3$
Authors:
Xi Wang,
Anirban Kundu,
Bochao Xu,
Sajna Hameed,
Ilya Sochnikov,
Damjan Pelc,
Martin Greven,
Avraham Klein,
Beena Kalisky
Abstract:
A major challenge in the development of quantum technologies is to induce additional types of ferroic orders into materials that exhibit other useful quantum properties. Various techniques have been applied to this end, such as elastically straining, do**, or interfacing a compound with other materials. Plastic deformation introduces permanent topological defects and large local strains into a m…
▽ More
A major challenge in the development of quantum technologies is to induce additional types of ferroic orders into materials that exhibit other useful quantum properties. Various techniques have been applied to this end, such as elastically straining, do**, or interfacing a compound with other materials. Plastic deformation introduces permanent topological defects and large local strains into a material, which can give rise to qualitatively new functionality. Here we show via local magnetic imaging that plastic deformation induces robust magnetism in the quantum paraelectric SrTiO3, in both conducting and insulating samples. Our analysis indicates that the magnetic order is localized along dislocation walls and coexists with polar order along the walls. The magnetic signals can be switched on and off in a controllable manner with external stress, which demonstrates that plastically deformed SrTiO3 is a quantum multiferroic. These results establish plastic deformation as a versatile platform for quantum materials engineering.
△ Less
Submitted 28 August, 2023;
originally announced August 2023.
-
Unleash the Power of Context: Enhancing Large-Scale Recommender Systems with Context-Based Prediction Models
Authors:
Jan Hartman,
Assaf Klein,
Davorin Kopič,
Natalia Silberstein
Abstract:
In this work, we introduce the notion of Context-Based Prediction Models. A Context-Based Prediction Model determines the probability of a user's action (such as a click or a conversion) solely by relying on user and contextual features, without considering any specific features of the item itself. We have identified numerous valuable applications for this modeling approach, including training an…
▽ More
In this work, we introduce the notion of Context-Based Prediction Models. A Context-Based Prediction Model determines the probability of a user's action (such as a click or a conversion) solely by relying on user and contextual features, without considering any specific features of the item itself. We have identified numerous valuable applications for this modeling approach, including training an auxiliary context-based model to estimate click probability and incorporating its prediction as a feature in CTR prediction models. Our experiments indicate that this enhancement brings significant improvements in offline and online business metrics while having minimal impact on the cost of serving. Overall, our work offers a simple and scalable, yet powerful approach for enhancing the performance of large-scale commercial recommender systems, with broad implications for the field of personalized recommendations.
△ Less
Submitted 25 July, 2023;
originally announced August 2023.
-
Obeying the Order: Introducing Ordered Transfer Hyperparameter Optimisation
Authors:
Sigrid Passano Hellan,
Huibin Shen,
François-Xavier Aubet,
David Salinas,
Aaron Klein
Abstract:
We introduce ordered transfer hyperparameter optimisation (OTHPO), a version of transfer learning for hyperparameter optimisation (HPO) where the tasks follow a sequential order. Unlike for state-of-the-art transfer HPO, the assumption is that each task is most correlated to those immediately before it. This matches many deployed settings, where hyperparameters are retuned as more data is collecte…
▽ More
We introduce ordered transfer hyperparameter optimisation (OTHPO), a version of transfer learning for hyperparameter optimisation (HPO) where the tasks follow a sequential order. Unlike for state-of-the-art transfer HPO, the assumption is that each task is most correlated to those immediately before it. This matches many deployed settings, where hyperparameters are retuned as more data is collected; for instance tuning a sequence of movie recommendation systems as more movies and ratings are added. We propose a formal definition, outline the differences to related problems and propose a basic OTHPO method that outperforms state-of-the-art transfer HPO. We empirically show the importance of taking order into account using ten benchmarks. The benchmarks are in the setting of gradually accumulating data, and span XGBoost, random forest, approximate k-nearest neighbor, elastic net, support vector machines and a separate real-world motivated optimisation problem. We open source the benchmarks to foster future research on ordered transfer HPO.
△ Less
Submitted 29 June, 2023;
originally announced June 2023.
-
Glitch systematics on the observation of massive black-hole binaries with LISA
Authors:
Alice Spadaro,
Riccardo Buscicchio,
Daniele Vetrugno,
Antoine Klein,
Davide Gerosa,
Stefano Vitale,
Rita Dolesi,
William Joseph Weber,
Monica Colpi
Abstract:
Detecting and coherently characterizing thousands of gravitational-wave signals is a core data-analysis challenge for the Laser Interferometer Space Antenna (LISA). Transient artifacts, or "glitches", with disparate morphologies are expected to be present in the data, potentially affecting the scientific return of the mission. We present the first joint reconstruction of short-lived astrophysical…
▽ More
Detecting and coherently characterizing thousands of gravitational-wave signals is a core data-analysis challenge for the Laser Interferometer Space Antenna (LISA). Transient artifacts, or "glitches", with disparate morphologies are expected to be present in the data, potentially affecting the scientific return of the mission. We present the first joint reconstruction of short-lived astrophysical signals and noise artifacts. Our analysis is inspired by glitches observed by the LISA Pathfinder mission, including both acceleration and fast displacement transients. We perform full Bayesian inference using LISA time-delay interferometric data and gravitational waveforms describing mergers of massive black holes. We focus on a representative binary with a detector-frame total mass of $6 \times 10^7 M_\odot$ at redshift $5$, yielding a signal lasting $\sim 30~\mathrm{h}$ in the LISA sensitivity band. We explore two glitch models of different flexibility, namely a fixed parametric family and a shapelet decomposition. In the most challenging scenario, we report a complete loss of the gravitational-wave signal if the glitch is ignored; more modest glitches induce biases on the black-hole parameters. On the other hand, a joint inference approach fully sanitizes the reconstruction of both the astrophysical and the glitch signal. We also inject a variety of glitch morphologies in isolation, without a superimposed gravitational signal, and show we can identify the correct transient model. Our analysis is an important step** stone toward a realistic treatment of LISA data in the context of the highly sought-after "global fit".
△ Less
Submitted 21 December, 2023; v1 submitted 6 June, 2023;
originally announced June 2023.
-
Matching Game for Optimized Association in Quantum Communication Networks
Authors:
Mahdi Chehimi,
Bernd Simon,
Walid Saad,
Anja Klein,
Don Towsley,
Mérouane Debbah
Abstract:
Enabling quantum switches (QSs) to serve requests submitted by quantum end nodes in quantum communication networks (QCNs) is a challenging problem due to the heterogeneous fidelity requirements of the submitted requests and the limited resources of the QCN. Effectively determining which requests are served by a given QS is fundamental to foster developments in practical QCN applications, like quan…
▽ More
Enabling quantum switches (QSs) to serve requests submitted by quantum end nodes in quantum communication networks (QCNs) is a challenging problem due to the heterogeneous fidelity requirements of the submitted requests and the limited resources of the QCN. Effectively determining which requests are served by a given QS is fundamental to foster developments in practical QCN applications, like quantum data centers. However, the state-of-the-art on QS operation has overlooked this association problem, and it mainly focused on QCNs with a single QS. In this paper, the request-QS association problem in QCNs is formulated as a matching game that captures the limited QCN resources, heterogeneous application-specific fidelity requirements, and scheduling of the different QS operations. To solve this game, a swap-stable request-QS association (RQSA) algorithm is proposed while considering partial QCN information availability. Extensive simulations are conducted to validate the effectiveness of the proposed RQSA algorithm. Simulation results show that the proposed RQSA algorithm achieves a near-optimal (within 5%) performance in terms of the percentage of served requests and overall achieved fidelity, while outperforming benchmark greedy solutions by over 13%. Moreover, the proposed RQSA algorithm is shown to be scalable and maintain its near-optimal performance even when the size of the QCN increases.
△ Less
Submitted 21 May, 2023;
originally announced May 2023.
-
Implications of pulsar timing array observations for LISA detections of massive black hole binaries
Authors:
Nathan Steinle,
Hannah Middleton,
Christopher J. Moore,
Siyuan Chen,
Antoine Klein,
Geraint Pratten,
Riccardo Buscicchio,
Eliot Finch,
Alberto Vecchio
Abstract:
Pulsar timing arrays (PTAs) and the Laser Interferometer Space Antenna (LISA) will open complementary observational windows on massive black-hole binaries (MBHBs), i.e., with masses in the range $\sim 10^6 - 10^{10}\,$ M$_{\odot}$. While PTAs may detect a stochastic gravitational-wave background from a population of MBHBs, during operation LISA will detect individual merging MBHBs. To demonstrate…
▽ More
Pulsar timing arrays (PTAs) and the Laser Interferometer Space Antenna (LISA) will open complementary observational windows on massive black-hole binaries (MBHBs), i.e., with masses in the range $\sim 10^6 - 10^{10}\,$ M$_{\odot}$. While PTAs may detect a stochastic gravitational-wave background from a population of MBHBs, during operation LISA will detect individual merging MBHBs. To demonstrate the profound interplay between LISA and PTAs, we estimate the number of MBHB mergers that one can expect to observe with LISA by extrapolating direct observational constraints on the MBHB merger rate inferred from PTA data. For this, we postulate that the common signal observed by PTAs (and consistent with the increased evidence recently reported) is an astrophysical background sourced by a single MBHB population. We then constrain the LISA detection rate, $\mathcal{R}$, in the mass-redshift space by combining our Bayesian-inferred merger rate with LISA's sensitivity to spin-aligned, inspiral-merger-ringdown waveforms. Using an astrophysically-informed formation model, we predict a 95$\%$ upper limit on the detection rate of $\mathcal{R} < 134\,{\rm yr}^{-1}$ for binaries with total masses in the range $10^7 - 10^8\,$ M$_{\odot}$. For higher masses, i.e., $>10^8\,$ M$_{\odot}$, we find $\mathcal{R} < 2\,(1)\,\mathrm{yr}^{-1}$ using an astrophysically-informed (agnostic) formation model, rising to $11\,(6)\,\mathrm{yr}^{-1}$ if the LISA sensitivity bandwidth extends down to $10^{-5}$ Hz. Forecasts of LISA science potential with PTA background measurements should improve as PTAs continue their search.
△ Less
Submitted 4 August, 2023; v1 submitted 10 May, 2023;
originally announced May 2023.
-
Optimizing Hyperparameters with Conformal Quantile Regression
Authors:
David Salinas,
Jacek Golebiowski,
Aaron Klein,
Matthias Seeger,
Cedric Archambeau
Abstract:
Many state-of-the-art hyperparameter optimization (HPO) algorithms rely on model-based optimizers that learn surrogate models of the target function to guide the search. Gaussian processes are the de facto surrogate model due to their ability to capture uncertainty but they make strong assumptions about the observation noise, which might not be warranted in practice. In this work, we propose to le…
▽ More
Many state-of-the-art hyperparameter optimization (HPO) algorithms rely on model-based optimizers that learn surrogate models of the target function to guide the search. Gaussian processes are the de facto surrogate model due to their ability to capture uncertainty but they make strong assumptions about the observation noise, which might not be warranted in practice. In this work, we propose to leverage conformalized quantile regression which makes minimal assumptions about the observation noise and, as a result, models the target function in a more realistic and robust fashion which translates to quicker HPO convergence on empirical benchmarks. To apply our method in a multi-fidelity setting, we propose a simple, yet effective, technique that aggregates observed results across different resource levels and outperforms conventional methods across many empirical tasks.
△ Less
Submitted 5 May, 2023;
originally announced May 2023.
-
ESID: Exploring the Design and Development of a Visual Analytics Tool for Epidemiological Emergencies
Authors:
Pawandeep Kaur Betz,
Julien Stoll,
Valerie Grappendorf,
Jonas Gilg,
Moritz Zeumer,
Margrit Klitz,
Luca Spataro,
Anna Klein,
Lena Rothenhäusler,
Hartmut Bohnacker,
Hans Krämer,
Michael Meyer-Hermann,
Sybille Somogyi,
Andreas Gerndt,
Martin J. Kühn
Abstract:
Visual analytics tools can help illustrate the spread of infectious diseases and enable informed decisions on epidemiological and public health issues. To create visualisation tools that are intuitive, easy to use, and effective in communicating information, continued research and development focusing on user-centric and methodological design models is extremely important. As a contribution to thi…
▽ More
Visual analytics tools can help illustrate the spread of infectious diseases and enable informed decisions on epidemiological and public health issues. To create visualisation tools that are intuitive, easy to use, and effective in communicating information, continued research and development focusing on user-centric and methodological design models is extremely important. As a contribution to this topic, this paper presents the design and development process of the visual analytics application ESID (Epidemiological Scenarios for Infectious Diseases). ESID is a visual analytics tool aimed at projecting the future developments of infectious disease spread using reported and simulated data based on sound mathematical-epidemiological models. The development process involved a collaborative and participatory design approach with project partners from diverse scientific fields. The findings from these studies, along with the guidelines derived from them, played a pivotal role in sha** the visualisation tool.
△ Less
Submitted 29 August, 2023; v1 submitted 10 April, 2023;
originally announced April 2023.
-
Calculation of Thermodynamic Equilibria with the Predictive Electrolyte Model COSMO-RS-ES: Improvements for Low Permittivity Systems
Authors:
Simon Müller,
Andrés González de Castilla,
Christoph Taeschler,
Andreas Klein,
Irina Smirnova
Abstract:
The predictive electrolyte model COSMO-RS-ES is refined to improve the description of systems at 25°C in which strong ion pairing is expected due to a low static permittivity of the liquid phase. Furthermore, the short-range ion energy interaction equations have been modified to better describe the misfit and energy interaction terms between ions and solvent molecules. In addition, the salt solubi…
▽ More
The predictive electrolyte model COSMO-RS-ES is refined to improve the description of systems at 25°C in which strong ion pairing is expected due to a low static permittivity of the liquid phase. Furthermore, the short-range ion energy interaction equations have been modified to better describe the misfit and energy interaction terms between ions and solvent molecules. In addition, the salt solubility database is extended with additional non-aqueous systems containing solvents that have a low (ε_s<15) dielectric constant and promote near to full ion association. Throughout this work it is demonstrated that liquid-liquid equilibrium calculations and solid-liquid equilibrium predictions for electrolyte systems can be markedly improved with the inclusion of Bjerrum treatment based phenomenological considerations while introducing only one general additional parameter. Our modified approach reinforces the capabilities of COSMO-RS ES as a powerful predictive tool for the calculation of phase equilibria in systems with scarce experimental data.
△ Less
Submitted 4 April, 2023; v1 submitted 29 March, 2023;
originally announced March 2023.
-
Evaluation and Refinement of the novel predictive electrolyte model COSMO-RS-ES based on solid-liquid equilibria of salts and Gibbs free Energies of Transfer of Ions
Authors:
Simon Müller,
Christoph Taeschler,
Andreas Klein,
Irina Smirnova
Abstract:
The new predictive electrolyte model COSMO-RS-ES is evaluated and refined for the calculation of solubilities of salts in mixed solvent systems. It is demonstrated that the model is capable of predicting solid-liquid equilibria at 25 °C for ammonium and alkali metal salts quite accurately in a wide variety of solvent mixtures. Furthermore, through the introduction of Gibbs free energies of transfe…
▽ More
The new predictive electrolyte model COSMO-RS-ES is evaluated and refined for the calculation of solubilities of salts in mixed solvent systems. It is demonstrated that the model is capable of predicting solid-liquid equilibria at 25 °C for ammonium and alkali metal salts quite accurately in a wide variety of solvent mixtures. Furthermore, through the introduction of Gibbs free energies of transfer of single ions it is shown that the model performance can be improved even further. This new data type also allows for an ion-specific way of evaluating the model for the first time. For some systems when calculating the solubility, larger deviations are observed, but for the vast majority of systems the model delivers good predictions. This shows that COSMO-RS-ES is a valuable tool for calculation of phase equilibria in electrolyte systems especially when the scarcity of data impede the application of models that require a higher number of parameters.
△ Less
Submitted 4 April, 2023; v1 submitted 29 March, 2023;
originally announced March 2023.
-
Safehaul: Risk-Averse Learning for Reliable mmWave Self-Backhauling in 6G Networks
Authors:
Amir Ashtari Gargari,
Andrea Ortiz,
Matteo Pagin,
Anja Klein,
Matthias Hollick,
Michele Zorzi,
Arash Asadi
Abstract:
Wireless backhauling at millimeter-wave frequencies (mmWave) in static scenarios is a well-established practice in cellular networks. However, highly directional and adaptive beamforming in today's mmWave systems have opened new possibilities for self-backhauling. Tap** into this potential, 3GPP has standardized Integrated Access and Backhaul (IAB) allowing the same base station serve both acces…
▽ More
Wireless backhauling at millimeter-wave frequencies (mmWave) in static scenarios is a well-established practice in cellular networks. However, highly directional and adaptive beamforming in today's mmWave systems have opened new possibilities for self-backhauling. Tap** into this potential, 3GPP has standardized Integrated Access and Backhaul (IAB) allowing the same base station serve both access and backhaul traffic. Although much more cost-effective and flexible, resource allocation and path selection in IAB mmWave networks is a formidable task. To date, prior works have addressed this challenge through a plethora of classic optimization and learning methods, generally optimizing a Key Performance Indicator (KPI) such as throughput, latency, and fairness, and little attention has been paid to the reliability of the KPI. We propose Safehaul, a risk-averse learning-based solution for IAB mmWave networks. In addition to optimizing average performance, Safehaul ensures reliability by minimizing the losses in the tail of the performance distribution. We develop a novel simulator and show via extensive simulations that Safehaul not only reduces the latency by up to 43.2% compared to the benchmarks but also exhibits significantly more reliable performance (e.g., 71.4% less variance in achieved latency).
△ Less
Submitted 12 January, 2023; v1 submitted 9 January, 2023;
originally announced January 2023.
-
Measurement of flavor asymmetry of light-quark sea in the proton with Drell-Yan dimuon production in $p+p$ and $p+d$ collisions at 120 GeV
Authors:
J. Dove,
B. Kerns,
C. Leung,
R. E. McClellan,
S. Miyasaka,
D. H. Morton,
K. Nagai,
S. Prasad,
F. Sanftl,
M. B. C. Scott,
A. S. Tadepalli,
C. A. Aidala,
J. Arrington,
C. Ayuso,
C. T. Barker,
C. N. Brown,
T. H. Chang,
W. C. Chang,
A. Chen,
D. C. Christian,
B. P. Dannowitz,
M. Daugherity,
M. Diefenthaler,
L. El Fassi,
D. F. Geesaman
, et al. (44 additional authors not shown)
Abstract:
Evidence for a flavor asymmetry between the $\bar u$ and $\bar d$ quark distributions in the proton has been found in deep-inelastic scattering and Drell-Yan experiments. The pronounced dependence of this flavor asymmetry on $x$ (fraction of nucleon momentum carried by partons) observed in the Fermilab E866 Drell-Yan experiment suggested a drop of the $\bar d\left(x\right) / \bar u\left(x\right)$…
▽ More
Evidence for a flavor asymmetry between the $\bar u$ and $\bar d$ quark distributions in the proton has been found in deep-inelastic scattering and Drell-Yan experiments. The pronounced dependence of this flavor asymmetry on $x$ (fraction of nucleon momentum carried by partons) observed in the Fermilab E866 Drell-Yan experiment suggested a drop of the $\bar d\left(x\right) / \bar u\left(x\right)$ ratio in the $x > 0.15$ region. We report results from the SeaQuest Fermilab E906 experiment with improved statistical precision for $\bar d\left(x\right) / \bar u\left(x\right)$ in the large $x$ region up to $x=0.45$ using the 120 GeV proton beam. Two different methods for extracting the Drell-Yan cross section ratios, $σ^{pd} /2 σ^{pp}$, from the SeaQuest data give consistent results. The $\bar{d}\left(x\right) / \bar{u}\left(x\right)$ ratios and the $\bar d\left(x\right) - \bar u\left(x\right)$ differences are deduced from these cross section ratios for $0.13 < x < 0.45$. The SeaQuest and E866/NuSea $\bar{d}\left(x\right) / \bar{u}\left(x\right)$ ratios are in good agreement for the $x\lesssim 0.25$ region. The new SeaQuest data, however, show that $\bar d\left(x\right)$ continues to be greater than $\bar u\left(x\right)$ up to the highest $x$ value ($x = 0.45$). The new results on $\bar{d}\left(x\right) / \bar{u}\left(x\right)$ and $\bar{d}\left(x\right) - \bar{u}\left(x\right)$ are compared with various parton distribution functions and theoretical calculations.
△ Less
Submitted 2 October, 2023; v1 submitted 23 December, 2022;
originally announced December 2022.
-
On the LISA science performance in observations of short-lived signals from massive black hole binary coalescences
Authors:
Geraint Pratten,
Antoine Klein,
Christopher J. Moore,
Hannah Middleton,
Nathan Steinle,
Patricia Schmidt,
Alberto Vecchio
Abstract:
The observation of massive black hole binary systems is one of the main science objectives of the Laser Interferometer Space Antenna (LISA). The instrument's design requirements have recently been revised: they set a requirement at $0.1\,\mathrm{mHz}$, with no additional explicit requirements at lower frequencies. This has implications for observations of the short-lived signals produced by the co…
▽ More
The observation of massive black hole binary systems is one of the main science objectives of the Laser Interferometer Space Antenna (LISA). The instrument's design requirements have recently been revised: they set a requirement at $0.1\,\mathrm{mHz}$, with no additional explicit requirements at lower frequencies. This has implications for observations of the short-lived signals produced by the coalescence of massive and high-redshift binaries. Here we consider the most pessimistic scenario: the (unlikely) case in which LISA has no sensitivity below $0.1\,\mathrm{mHz}$. We show that the presence of higher multipoles (beyond the dominant $\ell = |m| = 2$ mode) in the gravitational radiation from these systems, which will be detectable with a total signal-to-noise ratio $\sim 10^3$, allows LISA to retain the capability to accurately measure the physical parameters, the redshift, and to constrain the sky location. To illustrate this point, we consider a few select binaries in a total (redshifted) mass range of $4 \times10^6 - 4 \times 10^7\,M_\odot$ whose ($\ell = |m| = 2$) gravitational-wave signals last between $\approx 12$ hours and $\approx 20$ days in band. We model the emitted gravitational radiation using the highly accurate (spin-aligned) waveform approximant IMRPhenomXHM and carry out a fully coherent Bayesian analysis on the LISA noise-orthogonal time-delay-interferometry channels.
△ Less
Submitted 5 December, 2022;
originally announced December 2022.
-
Localization in the random XXZ quantum spin chain
Authors:
Alexander Elgart,
Abel Klein
Abstract:
We study the many-body localization (MBL) properties of the Heisenberg XXZ spin-$\frac12$ chain in a random magnetic field. We prove that the system exhibits localization in any given energy interval at the bottom of the spectrum in a nontrivial region of the parameter space. This region, which includes weak interaction and strong disorder regimes, is independent of the size of the system and depe…
▽ More
We study the many-body localization (MBL) properties of the Heisenberg XXZ spin-$\frac12$ chain in a random magnetic field. We prove that the system exhibits localization in any given energy interval at the bottom of the spectrum in a nontrivial region of the parameter space. This region, which includes weak interaction and strong disorder regimes, is independent of the size of the system and depends only on the energy interval. Our approach is based on the reformulation of the localization problem as an expression of quasi-locality for functions of the random many-body XXZ Hamiltonian. This allows us to extend the fractional moment method for proving localization, previously derived in a single-particle localization context, to the many-body setting.
△ Less
Submitted 25 January, 2024; v1 submitted 26 October, 2022;
originally announced October 2022.
-
Identifying LISA verification binaries among the Galactic population of double white dwarfs
Authors:
Eliot Finch,
Giorgia Bartolucci,
Daniel Chucherko,
Ben G. Patterson,
Valeriya Korol,
Antoine Klein,
Diganta Bandopadhyay,
Hannah Middleton,
Christopher J. Moore,
Alberto Vecchio
Abstract:
Double white dwarfs (DWDs) will be the most numerous gravitational-wave (GW) sources for the Laser Interferometer Space Antenna (LISA). Most of the Galactic DWDs will be unresolved and will superpose to form a confusion noise foreground, the dominant LISA noise source around $\sim 0.5\mathrm{-}3\,\mathrm{mHz}$. A small fraction of these sources will stand out from the background and be individuall…
▽ More
Double white dwarfs (DWDs) will be the most numerous gravitational-wave (GW) sources for the Laser Interferometer Space Antenna (LISA). Most of the Galactic DWDs will be unresolved and will superpose to form a confusion noise foreground, the dominant LISA noise source around $\sim 0.5\mathrm{-}3\,\mathrm{mHz}$. A small fraction of these sources will stand out from the background and be individually detectable. Uniquely among GW sources, a handful of these binaries will be known in advance from electromagnetic (EM) observations and will be guaranteed sources of detectable GWs in the LISA band; these are known as verification binaries (VBs). High-cadence photometric surveys are continuously discovering new VB systems, and their number will continue to grow ahead of the launch of LISA. We analyse, in a fully Bayesian framework, all the currently known VB candidates with the latest design requirements for the LISA mission and find that 25 of the considered sources can be detected within a $4\,\mathrm{yr}$ observation time. We explore what can be expected from GW observations, both alone and in combination with EM observations, and estimate the VB's time to detection in the early months of LISA operations. We also show how VBs can be analysed in the case where their GW signals compete with many other unknown binary signals (both resolved and unresolved) from a realistic Galactic population of DWDs.
△ Less
Submitted 17 May, 2023; v1 submitted 19 October, 2022;
originally announced October 2022.
-
Device Tracking via Linux's New TCP Source Port Selection Algorithm (Extended Version)
Authors:
Moshe Kol,
Amit Klein,
Yossi Gilad
Abstract:
We describe a tracking technique for Linux devices, exploiting a new TCP source port generation mechanism recently introduced to the Linux kernel. This mechanism is based on an algorithm, standardized in RFC 6056, for boosting security by better randomizing port selection. Our technique detects collisions in a hash function used in the said algorithm, based on sampling TCP source ports generated i…
▽ More
We describe a tracking technique for Linux devices, exploiting a new TCP source port generation mechanism recently introduced to the Linux kernel. This mechanism is based on an algorithm, standardized in RFC 6056, for boosting security by better randomizing port selection. Our technique detects collisions in a hash function used in the said algorithm, based on sampling TCP source ports generated in an attacker-prescribed manner. These hash collisions depend solely on a per-device key, and thus the set of collisions forms a device ID that allows tracking devices across browsers, browser privacy modes, containers, and IPv4/IPv6 networks (including some VPNs). It can distinguish among devices with identical hardware and software, and lasts until the device restarts.
We implemented this technique and then tested it using tracking servers in two different locations and with Linux devices on various networks. We also tested it on an Android device that we patched to introduce the new port selection algorithm. The tracking technique works in real-life conditions, and we report detailed findings about it, including its dwell time, scalability, and success rate in different network types. We worked with the Linux kernel team to mitigate the exploit, resulting in a security patch introduced in May 2022 to the Linux kernel, and we provide recommendations for better securing the port selection algorithm in the paper.
△ Less
Submitted 22 December, 2022; v1 submitted 26 September, 2022;
originally announced September 2022.
-
Hypersparse Network Flow Analysis of Packets with GraphBLAS
Authors:
Tyler Trigg,
Chad Meiners,
Sandeep Pisharody,
Hayden Jananthan,
Michael Jones,
Adam Michaleas,
Timothy Davis,
Erik Welch,
William Arcand,
David Bestor,
William Bergeron,
Chansup Byun,
Vijay Gadepally,
Micheal Houle,
Matthew Hubbell,
Anna Klein,
Peter Michaleas,
Lauren Milechin,
Julie Mullen,
Andrew Prout,
Albert Reuther,
Antonio Rosa,
Siddharth Samsi,
Doug Stetson,
Charles Yee
, et al. (1 additional authors not shown)
Abstract:
Internet analysis is a major challenge due to the volume and rate of network traffic. In lieu of analyzing traffic as raw packets, network analysts often rely on compressed network flows (netflows) that contain the start time, stop time, source, destination, and number of packets in each direction. However, many traffic analyses benefit from temporal aggregation of multiple simultaneous netflows,…
▽ More
Internet analysis is a major challenge due to the volume and rate of network traffic. In lieu of analyzing traffic as raw packets, network analysts often rely on compressed network flows (netflows) that contain the start time, stop time, source, destination, and number of packets in each direction. However, many traffic analyses benefit from temporal aggregation of multiple simultaneous netflows, which can be computationally challenging. To alleviate this concern, a novel netflow compression and resampling method has been developed leveraging GraphBLAS hyperspace traffic matrices that preserve anonymization while enabling subrange analysis. Standard multitemporal spatial analyses are then performed on each subrange to generate detailed statistical aggregates of the source packets, source fan-out, unique links, destination fan-in, and destination packets of each subrange which can then be used for background modeling and anomaly detection. A simple file format based on GraphBLAS sparse matrices is developed for storing these statistical aggregates. This method is scale tested on the MIT SuperCloud using a 50 trillion packet netflow corpus from several hundred sites collected over several months. The resulting compression achieved is significant (<0.1 bit per packet) enabling extremely large netflow analyses to be stored and transported. The single node parallel performance is analyzed in terms of both processors and threads showing that a single node can perform hundreds of simultaneous analyses at over a million packets/sec (roughly equivalent to a 10 Gigabit link).
△ Less
Submitted 13 September, 2022;
originally announced September 2022.
-
Optimal Offloading Strategies for Edge-Computing via Mean-Field Games and Control
Authors:
Kai Cui,
Mustafa Burak Yilmaz,
Anam Tahir,
Anja Klein,
Heinz Koeppl
Abstract:
The optimal offloading of tasks in heterogeneous edge-computing scenarios is of great practical interest, both in the selfish and fully cooperative setting. In practice, such systems are typically very large, rendering exact solutions in terms of cooperative optima or Nash equilibria intractable. For this purpose, we adopt a general mean-field formulation in order to solve the competitive and coop…
▽ More
The optimal offloading of tasks in heterogeneous edge-computing scenarios is of great practical interest, both in the selfish and fully cooperative setting. In practice, such systems are typically very large, rendering exact solutions in terms of cooperative optima or Nash equilibria intractable. For this purpose, we adopt a general mean-field formulation in order to solve the competitive and cooperative offloading problems in the limit of infinitely large systems. We give theoretical guarantees for the approximation properties of the limiting solution and solve the resulting mean-field problems numerically. Furthermore, we verify our solutions numerically and find that our approximations are accurate for systems with dozens of edge devices. As a result, we obtain a tractable approach to the design of offloading strategies in large edge-computing scenarios with many users.
△ Less
Submitted 8 September, 2022;
originally announced September 2022.
-
A theory of criticality for quantum ferroelectric metals
Authors:
Avraham Klein,
Vladyslav Kozii,
Jonathan Ruhman,
Rafael M. Fernandes
Abstract:
A variety of compounds, for example doped paraelectrics and polar metals, exhibit both ferroelectricity and correlated electronic phenomena such as low-density superconductivity and anomalous transport. Characterizing such properties is tied to understanding the quantum dynamics of inversion symmetry breaking in the presence of itinerant electrons. Here, we present a comprehensive analysis of the…
▽ More
A variety of compounds, for example doped paraelectrics and polar metals, exhibit both ferroelectricity and correlated electronic phenomena such as low-density superconductivity and anomalous transport. Characterizing such properties is tied to understanding the quantum dynamics of inversion symmetry breaking in the presence of itinerant electrons. Here, we present a comprehensive analysis of the normal state properties of a metal near a quantum critical transition to a ferroelectric state, in both two and three dimensions. Starting from a minimal model of electrons coupled to a \emph{transverse} polar phonon via a Rashba-type spin-orbit interaction, we compute the dynamical response of both electrons and phonons. We find that the system can evince both Fermi and non-Fermi liquid phases, as well as enhanced pairing in both singlet and triplet channels. Furthermore, we systematically compute corrections to one-loop theory and find a tendency to quantum order-by-disorder, leading to a phase diagram that can include second order, first order, and finite-momentum phase transitions. Finally, we show that the entire phase diagram can be controlled via application of external strain, either compressive or volume-preserving. Our results provide a map of the dynamical and thermodynamical phase space of quantum ferroelectic metals, which can serve in characterizing existing materials and in seeking applications for quantum technologies.
△ Less
Submitted 17 July, 2023; v1 submitted 6 September, 2022;
originally announced September 2022.
-
Python Implementation of the Dynamic Distributed Dimensional Data Model
Authors:
Hayden Jananthan,
Lauren Milechin,
Michael Jones,
William Arcand,
William Bergeron,
David Bestor,
Chansup Byun,
Michael Houle,
Matthew Hubbell,
Vijay Gadepally,
Anna Klein,
Peter Michaleas,
Guillermo Morales,
Julie Mullen,
Andrew Prout,
Albert Reuther,
Antonio Rosa,
Siddharth Samsi,
Charles Yee,
Jeremy Kepner
Abstract:
Python has become a standard scientific computing language with fast-growing support of machine learning and data analysis modules, as well as an increasing usage of big data. The Dynamic Distributed Dimensional Data Model (D4M) offers a highly composable, unified data model with strong performance built to handle big data fast and efficiently. In this work we present an implementation of D4M in P…
▽ More
Python has become a standard scientific computing language with fast-growing support of machine learning and data analysis modules, as well as an increasing usage of big data. The Dynamic Distributed Dimensional Data Model (D4M) offers a highly composable, unified data model with strong performance built to handle big data fast and efficiently. In this work we present an implementation of D4M in Python. $D4M.py$ implements all foundational functionality of D4M and includes Accumulo and SQL database support via Graphulo. We describe the mathematical background and motivation, an explanation of the approaches made for its fundamental functions and building blocks, and performance results which compare $D4M.py$'s performance to D4M-MATLAB and D4M.jl.
△ Less
Submitted 22 November, 2022; v1 submitted 1 September, 2022;
originally announced September 2022.
-
pPython for Parallel Python Programming
Authors:
Chansup Byun,
William Arcand,
David Bestor,
Bill Bergeron,
Vijay Gadepally,
Michael Houle,
Matthew Hubbell,
Hayden Jananthan,
Michael Jones,
Kurt Keville,
Anna Klein,
Peter Michaleas,
Lauren Milechin,
Guillermo Morales,
Julie Mullen,
Andrew Prout,
Albert Reuther,
Antonio Rosa,
Siddharth Samsi,
Charles Yee,
Jeremy Kepner
Abstract:
pPython seeks to provide a parallel capability that provides good speed-up without sacrificing the ease of programming in Python by implementing partitioned global array semantics (PGAS) on top of a simple file-based messaging library (PythonMPI) in pure Python. The core data structure in pPython is a distributed numerical array whose distribution onto multiple processors is specified with a map c…
▽ More
pPython seeks to provide a parallel capability that provides good speed-up without sacrificing the ease of programming in Python by implementing partitioned global array semantics (PGAS) on top of a simple file-based messaging library (PythonMPI) in pure Python. The core data structure in pPython is a distributed numerical array whose distribution onto multiple processors is specified with a map construct. Communication operations between distributed arrays are abstracted away from the user and pPython transparently supports redistribution between any block-cyclic-overlapped distributions in up to four dimensions. pPython follows a SPMD (single program multiple data) model of computation. pPython runs on any combination of heterogeneous systems that support Python, including Windows, Linux, and MacOS operating systems. In addition to running transparently on single-node (e.g., a laptop), pPython provides a scheduler interface, so that pPython can be executed in a massively parallel computing environment. The initial implementation uses the Slurm scheduler. Performance of pPython on the HPC Challenge benchmark suite demonstrates both ease of programming and scalability.
△ Less
Submitted 31 August, 2022;
originally announced August 2022.
-
Chiral to Nematic Crossover in the Superconducting State of 4Hb-TaS$_2$
Authors:
I. Silber,
S. Mathimalar,
I. Mangel,
O. Green,
N. Avraham,
H. Beidenkopf,
I. Feldman,
A. Kanigel,
A. Klein,
M. Goldstein,
A. Banerjee,
E. Sela,
Y. Dagan
Abstract:
Most superconductors have an isotropic, single component order parameter, and are well described by the BCS theory for superconductivity. Unconventional, multiple components superconductors are exceptionally rare and are much less understood. Here, we combine scanning tunneling microscopy and angle-resolved macroscopic transport to study the candidate chiral superconductor, 4Hb-TaS$_2$. We reveal…
▽ More
Most superconductors have an isotropic, single component order parameter, and are well described by the BCS theory for superconductivity. Unconventional, multiple components superconductors are exceptionally rare and are much less understood. Here, we combine scanning tunneling microscopy and angle-resolved macroscopic transport to study the candidate chiral superconductor, 4Hb-TaS$_2$. We reveal quasi-periodic one-dimensional modulations in the tunneling conductance accompanied by two-fold symmetric superconducting critical field. The strong modulation of the in-plane critical field, points to a nematic, unconventional order parameter. However, the imaged vortex core is nearly circular symmetric, suggesting an isotropic order parameter. We reconcile this apparent discrepancy by modeling a competition between a dominating chiral superconducting order parameter and a nematic one, the latter emerges close to the normal phase. Our results strongly support the existence of two-component superconductivity in 4Hb-TaS$_2$ and can provide useful insights to other systems with coexistent charge order and superconductivity.
△ Less
Submitted 30 August, 2022;
originally announced August 2022.
-
QASem Parsing: Text-to-text Modeling of QA-based Semantics
Authors:
Ayal Klein,
Eran Hirsch,
Ron Eliav,
Valentina Pyatkin,
Avi Caciularu,
Ido Dagan
Abstract:
Several recent works have suggested to represent semantic relations with questions and answers, decomposing textual information into separate interrogative natural language statements. In this paper, we consider three QA-based semantic tasks - namely, QA-SRL, QANom and QADiscourse, each targeting a certain type of predication - and propose to regard them as jointly providing a comprehensive repres…
▽ More
Several recent works have suggested to represent semantic relations with questions and answers, decomposing textual information into separate interrogative natural language statements. In this paper, we consider three QA-based semantic tasks - namely, QA-SRL, QANom and QADiscourse, each targeting a certain type of predication - and propose to regard them as jointly providing a comprehensive representation of textual information. To promote this goal, we investigate how to best utilize the power of sequence-to-sequence (seq2seq) pre-trained language models, within the unique setup of semi-structured outputs, consisting of an unordered set of question-answer pairs. We examine different input and output linearization strategies, and assess the effect of multitask learning and of simple data augmentation techniques in the setting of imbalanced training data. Consequently, we release the first unified QASem parsing tool, practical for downstream applications who can benefit from an explicit, QA-based account of information units in a text.
△ Less
Submitted 14 February, 2023; v1 submitted 23 May, 2022;
originally announced May 2022.
-
The MIT Supercloud Workload Classification Challenge
Authors:
Benny J. Tang,
Qiqi Chen,
Matthew L. Weiss,
Nathan Frey,
Joseph McDonald,
David Bestor,
Charles Yee,
William Arcand,
Chansup Byun,
Daniel Edelman,
Matthew Hubbell,
Michael Jones,
Jeremy Kepner,
Anna Klein,
Adam Michaleas,
Peter Michaleas,
Lauren Milechin,
Julia Mullen,
Andrew Prout,
Albert Reuther,
Antonio Rosa,
Andrew Bowne,
Lindsey McEvoy,
Baolin Li,
Devesh Tiwari
, et al. (2 additional authors not shown)
Abstract:
High-Performance Computing (HPC) centers and cloud providers support an increasingly diverse set of applications on heterogenous hardware. As Artificial Intelligence (AI) and Machine Learning (ML) workloads have become an increasingly larger share of the compute workloads, new approaches to optimized resource usage, allocation, and deployment of new AI frameworks are needed. By identifying compute…
▽ More
High-Performance Computing (HPC) centers and cloud providers support an increasingly diverse set of applications on heterogenous hardware. As Artificial Intelligence (AI) and Machine Learning (ML) workloads have become an increasingly larger share of the compute workloads, new approaches to optimized resource usage, allocation, and deployment of new AI frameworks are needed. By identifying compute workloads and their utilization characteristics, HPC systems may be able to better match available resources with the application demand. By leveraging datacenter instrumentation, it may be possible to develop AI-based approaches that can identify workloads and provide feedback to researchers and datacenter operators for improving operational efficiency. To enable this research, we released the MIT Supercloud Dataset, which provides detailed monitoring logs from the MIT Supercloud cluster. This dataset includes CPU and GPU usage by jobs, memory usage, and file system logs. In this paper, we present a workload classification challenge based on this dataset. We introduce a labelled dataset that can be used to develop new approaches to workload classification and present initial results based on existing approaches. The goal of this challenge is to foster algorithmic innovations in the analysis of compute workloads that can achieve higher accuracy than existing methods. Data and code will be made publicly available via the Datacenter Challenge website : https://dcc.mit.edu.
△ Less
Submitted 13 April, 2022; v1 submitted 12 April, 2022;
originally announced April 2022.
-
The last three years: multiband gravitational-wave observations of stellar-mass binary black holes
Authors:
Antoine Klein,
Geraint Pratten,
Riccardo Buscicchio,
Patricia Schmidt,
Christopher J. Moore,
Eliot Finch,
Alice Bonino,
Lucy M. Thomas,
Natalie Williams,
Davide Gerosa,
Sean McGee,
Matt Nicholl,
Alberto Vecchio
Abstract:
Understanding the formation and evolution of the stellar-mass binary black holes discovered by LIGO and Virgo is a challenge that spans many areas of astrophysics, from stellar evolution, dynamics and accretion disks, to possible exotic early universe processes. Over the final years of their lives, stellar-mass binaries radiate gravitational waves that are first observable by space-based detectors…
▽ More
Understanding the formation and evolution of the stellar-mass binary black holes discovered by LIGO and Virgo is a challenge that spans many areas of astrophysics, from stellar evolution, dynamics and accretion disks, to possible exotic early universe processes. Over the final years of their lives, stellar-mass binaries radiate gravitational waves that are first observable by space-based detectors (such as LISA) and then ground-based instruments (such as LIGO, Virgo and the next generation observatories Cosmic Explorer and the Einstein Telescope). Using state-of-the-art waveform models and parameter-estimation pipelines for both ground- and space-based observations, we show that (the expected handful of) these multiband observations will allow at least percent-level measurements of all 17 parameters that describe the binary, the possible identification of a likely host galaxy, and the forewarning of the merger days in advance allowing telescopes at multiple wavelengths to search for any electromagnetic signature associated to it. Multiband sources will therefore be a gold mine for astrophysics, but we also show that they could be less useful as laboratories for fundamental tests of general relativity than has been previously suggested.
△ Less
Submitted 14 April, 2022; v1 submitted 7 April, 2022;
originally announced April 2022.
-
GraphBLAS on the Edge: Anonymized High Performance Streaming of Network Traffic
Authors:
Michael Jones,
Jeremy Kepner,
Daniel Andersen,
Aydin Buluc,
Chansup Byun,
K Claffy,
Timothy Davis,
William Arcand,
Jonathan Bernays,
David Bestor,
William Bergeron,
Vijay Gadepally,
Micheal Houle,
Matthew Hubbell,
Hayden Jananthan,
Anna Klein,
Chad Meiners,
Lauren Milechin,
Julie Mullen,
Sandeep Pisharody,
Andrew Prout,
Albert Reuther,
Antonio Rosa,
Siddharth Samsi,
Jon Sreekanth
, et al. (3 additional authors not shown)
Abstract:
Long range detection is a cornerstone of defense in many operating domains (land, sea, undersea, air, space, ..,). In the cyber domain, long range detection requires the analysis of significant network traffic from a variety of observatories and outposts. Construction of anonymized hypersparse traffic matrices on edge network devices can be a key enabler by providing significant data compression i…
▽ More
Long range detection is a cornerstone of defense in many operating domains (land, sea, undersea, air, space, ..,). In the cyber domain, long range detection requires the analysis of significant network traffic from a variety of observatories and outposts. Construction of anonymized hypersparse traffic matrices on edge network devices can be a key enabler by providing significant data compression in a rapidly analyzable format that protects privacy. GraphBLAS is ideally suited for both constructing and analyzing anonymized hypersparse traffic matrices. The performance of GraphBLAS on an Accolade Technologies edge network device is demonstrated on a near worse case traffic scenario using a continuous stream of CAIDA Telescope darknet packets. The performance for varying numbers of traffic buffers, threads, and processor cores is explored. Anonymized hypersparse traffic matrices can be constructed at a rate of over 50,000,000 packets per second; exceeding a typical 400 Gigabit network link. This performance demonstrates that anonymized hypersparse traffic matrices are readily computable on edge network devices with minimal compute resources and can be a viable data product for such devices.
△ Less
Submitted 5 September, 2022; v1 submitted 25 March, 2022;
originally announced March 2022.