-
Imaging of single barium atoms in a second matrix site in solid xenon for barium tagging in a $^{136}$Xe double beta decay experiment
Authors:
M. Yvaine,
D. Fairbank,
J. Soderstrom,
C. Taylor,
J. Stanley,
T. Walton,
C. Chambers,
A. Iverson,
W. Fairbank,
S. Al Kharusi,
A. Amy,
E. Angelico,
A. Anker,
I. J. Arnquist,
A. Atencio,
J. Bane,
V. Belov,
E. P. Bernard,
T. Bhatta,
A. Bolotnikov,
J. Breslin,
P. A. Breur,
J. P. Brodsky,
E. Brown,
T. Brunner
, et al. (112 additional authors not shown)
Abstract:
Neutrinoless double beta decay is one of the most sensitive probes for new physics beyond the Standard Model of particle physics. One of the isotopes under investigation is $^{136}$Xe, which would double beta decay into $^{136}$Ba. Detecting the single $^{136}$Ba daughter provides a sort of ultimate tool in the discrimination against backgrounds. Previous work demonstrated the ability to perform single atom imaging of Ba atoms in a single-vacancy site of a solid xenon matrix. In this paper, the effort to identify signal from individual barium atoms is extended to Ba atoms in a hexa-vacancy site in the matrix and is achieved despite increased photobleaching in this site. Abrupt fluorescence turn-off of a single Ba atom is also observed. Significant recovery of fluorescence signal lost through photobleaching is demonstrated upon annealing of Ba deposits in the Xe ice. Following annealing, it is observed that Ba atoms in the hexa-vacancy site exhibit antibleaching while Ba atoms in the tetra-vacancy site exhibit bleaching. This may be evidence for a matrix site transfer upon laser excitation. Our findings offer a path of continued research toward tagging of Ba daughters in all significant sites in solid xenon.
Submitted 28 June, 2024;
originally announced July 2024.
-
Supernova Electron-Neutrino Interactions with Xenon in the nEXO Detector
Authors:
nEXO Collaboration,
S. Hedges,
S. Al Kharusi,
E. Angelico,
J. P. Brodsky,
G. Richardson,
S. Wilde,
A. Amy,
A. Anker,
I. J. Arnquist,
P. Arsenault,
A. Atencio,
I. Badhrees,
J. Bane,
V. Belov,
E. P. Bernard,
T. Bhatta,
A. Bolotnikov,
J. Breslin,
P. A. Breur,
E. Brown,
T. Brunner,
E. Caden,
G. F. Cao,
L. Q. Cao
, et al. (121 additional authors not shown)
Abstract:
Electron-neutrino charged-current interactions with xenon nuclei were modeled in the nEXO neutrinoless double-beta decay detector (~5-tonne, 90% ${}^{136}$Xe, 10% ${}^{134}$Xe) to evaluate its sensitivity to supernova neutrinos. Predictions for event rates and detectable signatures were modeled using the MARLEY event generator. We find good agreement between MARLEY's predictions and existing theoretical calculations of the inclusive cross sections at supernova neutrino energies. The interactions modeled by MARLEY were simulated within the nEXO simulation framework and were run through an example reconstruction algorithm to determine the detector's efficiency for reconstructing these events. The simulated data, incorporating the detector response, were used to study the ability of nEXO to reconstruct the incident electron-neutrino spectrum and these results were extended to a larger xenon detector of the same isotope enrichment. We estimate that nEXO will be able to observe electron-neutrino interactions with xenon from supernovae as far as 5 to 8 kpc from earth, while the ability to reconstruct incident electron-neutrino spectrum parameters from observed interactions in nEXO is limited to closer supernovae.
Submitted 29 May, 2024;
originally announced May 2024.
-
Double Robust Variance Estimation
Authors:
Bonnie E. Shook-Sa,
Paul N. Zivich,
Chanhwa Lee,
Keyi Xue,
Rachael K. Ross,
Jessie K. Edwards,
Jeffrey S. A. Stringer,
Stephen R. Cole
Abstract:
Doubly robust estimators have gained popularity in the field of causal inference due to their ability to provide consistent point estimates when either an outcome or exposure model is correctly specified. However, the influence function based variance estimator frequently used with doubly robust estimators is only consistent when both the outcome and exposure models are correctly specified. Here, use of M-estimation and the empirical sandwich variance estimator for doubly robust point and variance estimation is demonstrated. Simulation studies illustrate the properties of the influence function based and empirical sandwich variance estimators. Estimators are applied to data from the Improving Pregnancy Outcomes with Progesterone (IPOP) trial to estimate the effect of maternal anemia on birth weight among women with HIV. In the example, birth weights if all women had anemia were estimated to be lower than birth weights if no women had anemia, though estimates were imprecise. Variance estimates were more stable under varying model specifications for the empirical sandwich variance estimator than the influence function based variance estimator.
Submitted 24 April, 2024;
originally announced April 2024.
-
Associations between pain-management treatments and opioid use disorder risk among Medicaid patients
Authors:
Kara E. Rudolph,
Nicholas T. Williams,
Ivan Diaz,
Sarah Forrest,
Katherine L. Hoffman,
Hillary Samples,
Mark Olfson,
Lisa Doan,
Magdalena Cerda,
Rachael Ross
Abstract:
Introduction: Chronic pain patients are at increased risk of opioid-misuse. Less is known about the unique risk conferred by each pain-management treatment, as treatments are typically implemented together, confounding their independent effects. We estimated the extent to which pain-management strategies were associated with risk of incident opioid use disorder (OUD) for those with chronic pain, controlling for baseline demographic and clinical confounding variables and holding other pain-management treatments at their observed levels.
Methods: We used data from two chronic pain subgroups within a cohort of non-pregnant Medicaid patients aged 35-64 years, 2016-2019, from 25 states: 1) those with a chronic pain condition co-morbid with physical disability (N=6,133) or 2) those with chronic pain without disability (N=67,438). We considered 9 pain-management treatments: prescription opioid i) dose and ii) duration; iii) number of opioid prescribers; opioid co-prescription with iv) benzodiazepines, v) muscle relaxants, and vi) gabapentinoids; vii) non-opioid pain prescription, viii) physical therapy, and ix) other pain treatment modality. Our outcome was incident OUD.
Results: Having an opioid and gabapentin co-prescription or an opioid and benzodiazepine co-prescription was statistically significantly associated with a 16-46% increased risk of OUD. Opioid dose and duration also were significantly associated with increased risk of OUD. Physical therapy was significantly associated with an 11% decreased risk of OUD in the subgroup with chronic pain but no disability.
Conclusions: Co-prescription of opioids with either gabapentin or benzodiazepines may substantially increase risk of OUD. More positively, physical therapy may be a relatively accessible and safe pain-management strategy.
Submitted 17 April, 2024;
originally announced April 2024.
-
Union: An Automatic Workload Manager for Accelerating Network Simulation
Authors:
Xin Wang,
Misbah Mubarak,
Yao Kang,
Robert B. Ross,
Zhiling Lan
Abstract:
With the rapid growth of the machine learning applications, the workloads of future HPC systems are anticipated to be a mix of scientific simulation, big data analytics, and machine learning applications. Simulation is a great research vehicle to understand the performance implications of co-running scientific applications with big data and machine learning workloads on large-scale systems. In this paper, we present Union, a workload manager that provides an automatic framework to facilitate hybrid workload simulation in CODES. Furthermore, we use Union, along with CODES, to investigate various hybrid workloads composed of traditional simulation applications and emerging learning applications on two dragonfly systems. The experiment results show that both message latency and communication time are important performance metrics to evaluate network interference. Network interference on HPC applications is more reflected by the message latency variation, whereas ML application performance depends more on the communication time.
Submitted 3 April, 2024; v1 submitted 24 March, 2024;
originally announced March 2024.
-
WildfireGPT: Tailored Large Language Model for Wildfire Analysis
Authors:
Yangxinyu Xie,
Tanwi Mallick,
Joshua David Bergerson,
John K. Hutchison,
Duane R. Verner,
Jordan Branham,
M. Ross Alexander,
Robert B. Ross,
Yan Feng,
Leslie-Anne Levy,
Weijie Su
Abstract:
The recent advancement of large language models (LLMs) represents a transformational capability at the frontier of artificial intelligence (AI) and machine learning (ML). However, LLMs are generalized models, trained on extensive text corpus, and often struggle to provide context-specific information, particularly in areas requiring specialized knowledge such as wildfire details within the broader context of climate change. For decision-makers and policymakers focused on wildfire resilience and adaptation, it is crucial to obtain responses that are not only precise but also domain-specific, rather than generic. To that end, we developed WildfireGPT, a prototype LLM agent designed to transform user queries into actionable insights on wildfire risks. We enrich WildfireGPT by providing additional context such as climate projections and scientific literature to ensure its information is current, relevant, and scientifically accurate. This enables WildfireGPT to be an effective tool for delivering detailed, user-specific insights on wildfire risks to support a diverse set of end users, including researchers, engineers, urban planners, emergency managers, and infrastructure operators.
Submitted 12 February, 2024;
originally announced February 2024.
-
Validity of Complete Case Analysis Depends on the Target Population
Authors:
Michael Webster-Clark,
Rachael K Ross
Abstract:
Missing data is a pernicious problem in epidemiologic research. Research on the validity of complete case analysis for missing data has typically focused on estimating the average treatment effect (ATE) in the whole population. However, other target populations like the treated (ATT) or external targets can be of substantive interest. In such cases, whether missing covariate data occurs within or outside the target population may impact the validity of complete case analysis. We sought to assess bias in complete case analysis when covariate data is missing outside the target (e.g., missing covariate data among the untreated when estimating the ATT). We simulated a study of the effect of a binary treatment X on a binary outcome Y in the presence of 3 confounders C1-C3 that modified the risk difference (RD). We induced missingness in C1 only among the untreated under 4 scenarios: completely randomly (similar to MCAR); randomly based on C2 and C3 (similar to MAR); randomly based on C1 (similar to MNAR); or randomly based on Y (similar to MAR). We estimated the ATE and ATT using weighting and averaged results across the replicates. We conducted a parallel simulation transporting trial results to a target population in the presence of missing covariate data in the trial. In the complete case analysis, estimated ATE was unbiased only when C1 was MCAR among the untreated. The estimated ATT, on the other hand, was unbiased in all scenarios except when Y caused missingness. The parallel simulation of generalizing and transporting trial results saw similar bias patterns. If missing covariate data is only present outside the target population, complete case analysis is unbiased except when missingness is associated with the outcome.
Submitted 27 January, 2024;
originally announced January 2024.
-
Who Are We Missing? A Principled Approach to Characterizing the Underrepresented Population
Authors:
Harsh Parikh,
Rachael Ross,
Elizabeth Stuart,
Kara Rudolph
Abstract:
Randomized controlled trials (RCTs) serve as the cornerstone for understanding causal effects, yet extending inferences to target populations presents challenges due to effect heterogeneity and underrepresentation. Our paper addresses the critical issue of identifying and characterizing underrepresented subgroups in RCTs, proposing a novel framework for refining target populations to improve generalizability. We introduce an optimization-based approach, Rashomon Set of Optimal Trees (ROOT), to characterize underrepresented groups. ROOT optimizes the target subpopulation distribution by minimizing the variance of the target average treatment effect estimate, ensuring more precise treatment effect estimations. Notably, ROOT generates interpretable characteristics of the underrepresented population, aiding researchers in effective communication. Our approach demonstrates improved precision and interpretability compared to alternatives, as illustrated with synthetic data experiments. We apply our methodology to extend inferences from the Starting Treatment with Agonist Replacement Therapies (START) trial -- investigating the effectiveness of medication for opioid use disorder -- to the real-world population represented by the Treatment Episode Dataset: Admissions (TEDS-A). By refining target populations using ROOT, our framework offers a systematic approach to enhance decision-making accuracy and inform future trials in diverse populations.
Submitted 10 March, 2024; v1 submitted 25 January, 2024;
originally announced January 2024.
-
Towards Continually Learning Application Performance Models
Authors:
Ray A. O. Sinurat,
Anurag Daram,
Haryadi S. Gunawi,
Robert B. Ross,
Sandeep Madireddy
Abstract:
Machine learning-based performance models are increasingly being used to build critical job scheduling and application optimization decisions. Traditionally, these models assume that data distribution does not change as more samples are collected over time. However, owing to the complexity and heterogeneity of production HPC systems, they are susceptible to hardware degradation, replacement, and/or software patches, which can lead to drift in the data distribution that can adversely affect the performance models. To this end, we develop continually learning performance models that account for the distribution drift, alleviate catastrophic forgetting, and improve generalizability. Our best model was able to retain accuracy, regardless of having to learn the new distribution of data inflicted by system changes, while demonstrating a 2x improvement in the prediction accuracy of the whole data sequence in comparison to the naive approach.
Submitted 25 October, 2023;
originally announced October 2023.
-
Empirical sandwich variance estimator for iterated conditional expectation g-computation
Authors:
Paul N Zivich,
Rachael K Ross,
Bonnie E Shook-Sa,
Stephen R Cole,
Jessie K Edwards
Abstract:
Iterated conditional expectation (ICE) g-computation is an estimation approach for addressing time-varying confounding for both longitudinal and time-to-event data. Unlike other g-computation implementations, ICE avoids the need to specify models for each time-varying covariate. For variance estimation, previous work has suggested the bootstrap. However, bootstrapping can be computationally intense and sensitive to the number of resamples used. Here, we present ICE g-computation as a set of stacked estimating equations. Therefore, the variance for the ICE g-computation estimator can be consistently estimated using the empirical sandwich variance estimator. Performance of the variance estimator was evaluated empirically with a simulation study. The proposed approach is also demonstrated with an illustrative example on the effect of cigarette smoking on the prevalence of hypertension. In the simulation study, the empirical sandwich variance estimator appropriately estimated the variance. When comparing runtimes between the sandwich variance estimator and the bootstrap for the applied example, the sandwich estimator was substantially faster, even when bootstraps were run in parallel. The empirical sandwich variance estimator is a viable option for variance estimation with ICE g-computation.
Submitted 4 March, 2024; v1 submitted 19 June, 2023;
originally announced June 2023.
-
Inverse Design of Nanophotonic Devices using Dynamic Binarization
Authors:
Marco Butz,
Adrian S. Abazi,
Rene Ross,
Benjamin Risse,
Carsten Schuck
Abstract:
The complexity of applications addressed with photonic integrated circuits is steadily rising and poses increasingly challenging demands on individual component functionality, performance and footprint. Inverse design methods have recently shown great promise to address these demands using fully automated design procedures that enable access to non-intuitive device layouts beyond conventional nanophotonic design concepts. Here we present a dynamic binarization method for the objective-first algorithm that lies at the core of the currently most successful inverse design algorithms. Our results demonstrate significant performance advantages over previous implementations of objective first algorithms, which we show for a fundamental TE00 to TE20 waveguide mode converter both in simulation and in experiments with fabricated devices.
Submitted 18 November, 2022;
originally announced November 2022.
-
HPC Storage Service Autotuning Using Variational-Autoencoder-Guided Asynchronous Bayesian Optimization
Authors:
Matthieu Dorier,
Romain Egele,
Prasanna Balaprakash,
Jaehoon Koo,
Sandeep Madireddy,
Srinivasan Ramesh,
Allen D. Malony,
Rob Ross
Abstract:
Distributed data storage services tailored to specific applications have grown popular in the high-performance computing (HPC) community as a way to address I/O and storage challenges. These services offer a variety of specific interfaces, semantics, and data representations. They also expose many tuning parameters, making it difficult for their users to find the best configuration for a given workload and platform.
To address this issue, we develop a novel variational-autoencoder-guided asynchronous Bayesian optimization method to tune HPC storage service parameters. Our approach uses transfer learning to leverage prior tuning results and use a dynamically updated surrogate model to explore the large parameter search space in a systematic way.
We implement our approach within the DeepHyper open-source framework, and apply it to the autotuning of a high-energy physics workflow on Argonne's Theta supercomputer. We show that our transfer-learning approach enables a more than $40\times$ search speedup over random search, compared with a $2.5\times$ to $10\times$ speedup when not using transfer learning. Additionally, we show that our approach is on par with state-of-the-art autotuning frameworks in speed and outperforms them in resource utilization and parallelization capabilities.
Submitted 3 October, 2022;
originally announced October 2022.
-
Dialogue Policies for Confusion Mitigation in Situated HRI
Authors:
Na Li,
Robert Ross
Abstract:
Confusion is a mental state triggered by cognitive disequilibrium that can occur in many types of task-oriented interaction, including Human-Robot Interaction (HRI). People may become confused while interacting with robots due to communicative or even task-centred challenges. To build a smooth and engaging HRI, it is insufficient for an agent to simply detect confusion; instead, the system should aim to mitigate the situation. In light of this, in this paper, we present our approach to a linguistic design of dialogue policies to build a dialogue framework to alleviate interlocutor confusion. We also outline our sketch and discuss challenges with respect to its operationalisation.
Submitted 19 August, 2022;
originally announced August 2022.
-
Detecting Interlocutor Confusion in Situated Human-Avatar Dialogue: A Pilot Study
Authors:
Na Li,
John D. Kelleher,
Robert Ross
Abstract:
In order to enhance levels of engagement with conversational systems, our long term research goal seeks to monitor the confusion state of a user and adapt dialogue policies in response to such user confusion states. To this end, in this paper, we present our initial research centred on a user-avatar dialogue scenario that we have developed to study the manifestation of confusion and in the long term its mitigation. We present a new definition of confusion that is particularly tailored to the requirements of intelligent conversational system development for task-oriented dialogue. We also present the details of our Wizard-of-Oz based data collection scenario wherein users interacted with a conversational avatar and were presented with stimuli that were in some cases designed to invoke a confused state in the user. Post study analysis of this data is also presented. Here, three pre-trained deep learning models were deployed to estimate base emotion, head pose and eye gaze. Despite a small pilot study group, our analysis demonstrates a significant relationship between these indicators and confusion states. We understand this as a useful step forward in the automated analysis of the pragmatics of dialogue.
Submitted 6 June, 2022;
originally announced June 2022.
-
Transferring Studies Across Embodiments: A Case Study in Confusion Detection
Authors:
Na Li,
Robert Ross
Abstract:
Human-robot studies are expensive to conduct and difficult to control, and as such researchers sometimes turn to human-avatar interaction in the hope of faster and cheaper data collection that can be transferred to the robot domain. In terms of our work, we are particularly interested in the challenge of detecting and modelling user confusion in interaction, and as part of this research programme, we conducted situated dialogue studies to investigate users' reactions in confusing scenarios that we give in both physical and virtual environments. In this paper, we present a combined review of these studies and the results that we observed across these two embodiments. For the physical embodiment, we used a Pepper Robot, while for the virtual modality, we used a 3D avatar. Our study shows that despite attitudinal differences and technical control limitations, there were a number of similarities detected in user behaviour and self-reporting results across embodiment options. This work suggests that, while avatar interaction is no true substitute for robot interaction studies, sufficient care in study design may allow well executed human-avatar studies to supplement more challenging human-robot studies.
Submitted 3 June, 2022;
originally announced June 2022.
-
Self-Supervised Learning for Invariant Representations from Multi-Spectral and SAR Images
Authors:
Pallavi Jain,
Bianca Schoen-Phelan,
Robert Ross
Abstract:
Self-Supervised learning (SSL) has become the new state-of-art in several domain classification and segmentation tasks. Of these, one popular category in SSL is distillation networks such as BYOL. This work proposes RSDnet, which applies the distillation network (BYOL) in the remote sensing (RS) domain where data is non-trivially different from natural RGB images. Since Multi-spectral (MS) and synthetic aperture radar (SAR) sensors provide varied spectral and spatial resolution information, we utilised them as an implicit augmentation to learn invariant feature embeddings. In order to learn RS based invariant features with SSL, we trained RSDnet in two ways, i.e., single channel feature learning and three channel feature learning. This work explores the usefulness of single channel feature learning from random MS and SAR bands compared to the common notion of using three or more bands. In our linear evaluation, these single channel features reached a 0.92 F1 score on the EuroSAT classification task and 59.6 mIoU on the DFC segmentation task for certain single bands. We also compared our results with ImageNet weights and showed that the RS based SSL model outperforms the supervised ImageNet based model. We further explored the usefulness of multi-modal data compared to single modality data, and it is shown that utilising MS and SAR data learn better invariant representations than utilising only MS data.
△ Less
Submitted 5 September, 2022; v1 submitted 4 May, 2022;
originally announced May 2022.
-
A Case Study on Parallel HDF5 Dataset Concatenation for High Energy Physics Data Analysis
Authors:
Sunwoo Lee,
Kai-yuan Hou,
Kewei Wang,
Saba Sehrish,
Marc Paterno,
James Kowalkowski,
Quincey Koziol,
Robert Ross,
Ankit Agrawal,
Alok Choudhary,
Wei-keng Liao
Abstract:
In High Energy Physics (HEP), experimentalists generate large volumes of data that, when analyzed, helps us better understand the fundamental particles and their interactions. This data is often captured in many files of small size, creating a data management challenge for scientists. In order to better facilitate data management, transfer, and analysis on large scale platforms, it is advantageous to aggregate data further into a smaller number of larger files. However, this translation process can consume significant time and resources, and if performed incorrectly the resulting aggregated files can be inefficient for highly parallel access during analysis on large scale platforms. In this paper, we present our case study on parallel I/O strategies and HDF5 features for reducing data aggregation time, making effective use of compression, and ensuring efficient access to the resulting data during analysis at scale. We focus on NOvA detector data in this case study, a large-scale HEP experiment generating many terabytes of data. The lessons learned from our case study inform the handling of similar datasets, thus expanding community knowledge related to this common data management task.
Submitted 2 May, 2022;
originally announced May 2022.
-
A Taxonomy of Error Sources in HPC I/O Machine Learning Models
Authors:
Mihailo Isakov,
Mikaela Currier,
Eliakin del Rosario,
Sandeep Madireddy,
Prasanna Balaprakash,
Philip Carns,
Robert B. Ross,
Glenn K. Lockwood,
Michel A. Kinsy
Abstract:
I/O efficiency is crucial to productivity in scientific computing, but the increasing complexity of the system and the applications makes it difficult for practitioners to understand and optimize I/O behavior at scale. Data-driven machine learning-based I/O throughput models offer a solution: they can be used to identify bottlenecks, automate I/O tuning, or optimize job scheduling with minimal human intervention. Unfortunately, current state-of-the-art I/O models are not robust enough for production use and underperform after being deployed.
We analyze multiple years of application, scheduler, and storage system logs on two leadership-class HPC platforms to understand why I/O models underperform in practice. We propose a taxonomy consisting of five categories of I/O modeling errors: poor application modeling, poor system modeling, inadequate dataset coverage, I/O contention, and I/O noise. We develop litmus tests to quantify each category, allowing researchers to narrow down failure modes, enhance I/O throughput models, and improve future generations of HPC logging and analysis tools.
Submitted 18 April, 2022;
originally announced April 2022.
-
High-precision real-space simulation of electrostatically-confined few-electron states
Authors:
Christopher R. Anderson,
Mark F. Gyure,
Sam Quinn,
Andrew Pan,
Richard S. Ross,
Andrey A. Kiselev
Abstract:
In this paper we present a computational procedure that utilizes real-space grids to obtain high precision approximations of electrostatically confined few-electron states such as those that arise in gated semiconductor quantum dots. We use the Full Configuration Interaction (FCI) method with a continuously adapted orthonormal orbital basis to approximate the ground and excited states of such systems. We also introduce a benchmark problem based on a realistic analytical electrostatic potential for quantum dot devices. We show that our approach leads to highly precise computed energies and energy differences over a wide range of model parameters. The analytic definition of the benchmark allows for a collection of tests that are easily replicated, thus facilitating comparisons with other computational approaches.
Submitted 28 February, 2022;
originally announced March 2022.
-
Nonlinear Anti-(Parity-Time) symmetric dimer
Authors:
A. S. Rodrigues,
R. M. Ross,
V. V. Konotop,
A. Saxena,
P. G. Kevrekidis
Abstract:
In the present work we propose a nonlinear anti-$\mathcal{PT}$-symmetric dimer, that at the linear level has been experimentally created in the realm of electric circuit resonators. We find four families of solutions, the so-called upper and lower branches, both in a symmetric and in an asymmetric (symmetry-broken) form. We unveil analytically and confirm numerically the critical thresholds for the existence of such branches and explore the bifurcations (such as saddle-node ones) that delimit their existence, as well as transcritical ones that lead to their potential exchange of stability. We find that out of the four relevant branches, only one, the upper symmetric branch, corresponds to a spectrally and dynamically robust solution. We subsequently leverage detailed direct numerical computations in order to explore the dynamics of the different states, corroborating our spectral analysis results.
Submitted 25 January, 2022;
originally announced January 2022.
-
Ground States in Spatially Discrete Nonlinear Schrödinger Lattices
Authors:
Atanas G. Stefanov,
Ryan M. Ross,
Panayotis G. Kevrekidis
Abstract:
In his seminal work, Weinstein considered the question of the ground states for discrete Schrödinger equations with power law nonlinearities, posed on ${\mathbb Z}^d$. More specifically, he constructed the so-called normalized waves, by minimizing the Hamiltonian functional, for fixed power $P$ (i.e. $l^2$ mass). This type of variational method allows one to claim, in a straightforward manner, set stability for such waves.
In this work, we revisit and build upon Weinstein's work in several directions. First, for the normalized waves, we show that they are in fact spectrally stable as solutions of the corresponding discrete NLS evolution equation. Next, we construct the so-called homogeneous waves, by using a different constrained optimization problem. Importantly, this construction works for all values of the parameters, e.g. $l^2$ supercritical problems. We establish a rigorous criterion for stability, which decides the stability of the homogeneous waves, based on the classical Grillakis-Shatah-Strauss/Vakhitov-Kolokolov quantity $\partial_ω\|\varphi_ω\|_{l^2}^2$. In addition, we provide some symmetry results for the solitons. Finally, we complement our results with numerical computations, which showcase the full agreement between the conclusion from the GSS/VK criterion vis-à-vis the linearized problem. In particular, one observes that it is possible for the stability of the wave to change as the spectral parameter $ω$ varies, in contrast with the corresponding continuous NLS model.
Submitted 29 October, 2021;
originally announced November 2021.
-
Solitary waves with intensity-dependent dispersion: variational characterization
Authors:
D. E. Pelinovsky,
R. M. Ross,
P. G. Kevrekidis
Abstract:
A continuous family of singular solitary waves exists in a prototypical system with intensity-dependent dispersion. The family has a cusped soliton as the limiting lowest energy state and is formed by the solitary waves with bell-shaped heads of different lengths. We show that this family can be obtained variationally by minimization of mass at fixed energy and fixed length of the bell-shaped head. We develop a weak formulation for the singular solitary waves and prove that they are stable under perturbations which do not change the length of the bell-shaped head. Numerical simulations confirm the stability of the singular solitary waves.
Submitted 9 June, 2021;
originally announced June 2021.
-
Data-Driven Reinforcement Learning for Virtual Character Animation Control
Authors:
Vihanga Gamage,
Cathy Ennis,
Robert Ross
Abstract:
Virtual character animation control is a problem for which Reinforcement Learning (RL) is a viable approach. While current work has applied RL effectively to portray physics-based skills, social behaviours are challenging to design reward functions for, due to their lack of physical interaction with the world. On the other hand, data-driven implementations for these skills have been limited to supervised learning methods which require extensive training data and carry constraints on generalisability. In this paper, we propose RLAnimate, a novel data-driven deep RL approach to address this challenge, where we combine the strengths of RL together with an ability to learn from a motion dataset when creating agents. We formalise a mathematical structure for training agents by refining the conceptual roles of elements such as agents, environments, states and actions, in a way that leverages attributes of the character animation domain and model-based RL. An agent trained using our approach learns versatile animation dynamics to portray multiple behaviours, using an iterative RL training process, which becomes aware of valid behaviours via representations learnt from motion capture clips. We demonstrate, by training agents that portray realistic pointing and waving behaviours, that our approach requires a significantly lower training time, and substantially fewer sample episodes to be generated during training relative to state-of-the-art physics-based RL methods. Also, compared to existing supervised learning-based animation agents, RLAnimate needs a limited dataset of motion clips to generate representations of valid behaviours during training.
Submitted 13 April, 2021;
originally announced April 2021.
-
Localization in optical systems with an intensity-dependent dispersion
Authors:
R. M. Ross,
P. G. Kevrekidis,
D. E. Pelinovsky
Abstract:
We address the nonlinear Schrödinger equation with intensity-dependent dispersion which was recently proposed in the context of nonlinear optical systems. Contrary to the previous findings, we prove that no solitary wave solutions exist if the sign of the intensity-dependent dispersion coincides with the sign of the constant dispersion, whereas a continuous family of such solutions exists in the case of the opposite signs. The family includes two particular solutions, namely cusped and bell-shaped solitons, where the former represents the lowest energy state in the family and the latter is a limit of solitary waves in a regularized system. We further analyze the delicate analytical properties of these solitary waves such as the asymptotic behavior near singularities, the spectral stability, and the convergence of the fixed-point iterations near such solutions. The analytical theory is corroborated by means of numerical approximations.
Submitted 22 March, 2021;
originally announced March 2021.
-
Language-Driven Region Pointer Advancement for Controllable Image Captioning
Authors:
Annika Lindh,
Robert J. Ross,
John D. Kelleher
Abstract:
Controllable Image Captioning is a recent sub-field in the multi-modal task of Image Captioning wherein constraints are placed on which regions in an image should be described in the generated natural language caption. This puts a stronger focus on producing more detailed descriptions, and opens the door for more end-user control over results. A vital component of the Controllable Image Captioning architecture is the mechanism that decides the timing of attending to each region through the advancement of a region pointer. In this paper, we propose a novel method for predicting the timing of region pointer advancement by treating the advancement step as a natural part of the language structure via a NEXT-token, motivated by a strong correlation to the sentence structure in the training data. We find that our timing agrees with the ground-truth timing in the Flickr30k Entities test data with a precision of 86.55% and a recall of 97.92%. Our model implementing this technique improves the state-of-the-art on standard captioning metrics while additionally demonstrating a considerably larger effective vocabulary size.
Submitted 30 November, 2020;
originally announced November 2020.
-
Balancing conservative and disruptive growth in the voter model
Authors:
Robert J. H. Ross,
Walter Fontana
Abstract:
We are concerned with how the implementation of growth determines the expected number of state-changes in a growing self-organizing process. With this problem in mind, we examine two versions of the voter model on a one-dimensional growing lattice. Our main result asserts that the expected number of state-changes before an absorbing state is found can be controlled by balancing the conservative and disruptive forces of growth. This is because conservative growth preserves the self-organization of the voter model as it searches for an absorbing state, whereas disruptive growth undermines this self-organization. In particular, we focus on controlling the expected number of state-changes as the rate of growth tends to zero or infinity in the limit. These results illustrate how growth can affect the costs of self-organization and so are pertinent to the physics of growing active matter.
Submitted 20 March, 2021; v1 submitted 21 October, 2020;
originally announced November 2020.
-
Achieving a quantum smart workforce
Authors:
Clarice D. Aiello,
D. D. Awschalom,
Hannes Bernien,
Tina Brower-Thomas,
Kenneth R. Brown,
Todd A. Brun,
Justin R. Caram,
Eric Chitambar,
Rosa Di Felice,
Michael F. J. Fox,
Stephan Haas,
Alexander W. Holleitner,
Eric R. Hudson,
Jeffrey H. Hunt,
Robert Joynt,
Scott Koziol,
H. J. Lewandowski,
Douglas T. McClure,
Jens Palsberg,
Gina Passante,
Kristen L. Pudenz,
Christopher J. K. Richardson,
Jessica L. Rosenberg,
R. S. Ross,
Mark Saffman
, et al. (7 additional authors not shown)
Abstract:
Interest in building dedicated Quantum Information Science and Engineering (QISE) education programs has greatly expanded in recent years. These programs are inherently convergent, complex, often resource intensive and likely require collaboration with a broad variety of stakeholders. In order to address this combination of challenges, we have captured ideas from many members in the community. This manuscript not only addresses policy makers and funding agencies (both public and private and from the regional to the international level) but also contains needs identified by industry leaders and discusses the difficulties inherent in creating an inclusive QISE curriculum. We report on the status of eighteen post-secondary education programs in QISE and provide guidance for building new programs. Lastly, we encourage the development of a comprehensive strategic plan for quantum education and workforce development as a means to make the most of the ongoing substantial investments being made in QISE.
Submitted 23 October, 2020;
originally announced October 2020.
-
Detuning Axis Pulsed Spectroscopy of Valley-Orbital States in Si/SiGe Quantum Dots
Authors:
Edward H. Chen,
Kate Raach,
Andrew Pan,
Andrey A. Kiselev,
Edwin Acuna,
Jacob Z. Blumoff,
Teresa Brecht,
Maxwell Choi,
Wonill Ha,
Daniel Hulbert,
Michael P. Jura,
Tyler Keating,
Ramsey Noah,
Bo Sun,
Bryan J. Thomas,
Matthew Borselli,
C. A. C. Jackson,
Matthew T. Rakher,
Richard S. Ross
Abstract:
Silicon quantum dot qubits must contend with low-lying valley excited states which are sensitive functions of the quantum well heterostructure and disorder; quantifying and maximizing the energies of these states are critical to improving device performance. We describe a spectroscopic method for probing excited states in isolated Si/SiGe double quantum dots using standard baseband pulsing techniques, easing the extraction of energy spectra in multiple-dot devices. We use this method to measure dozens of valley excited state energies spanning multiple wafers, quantum dots, and orbital states, crucial for evaluating the dependence of valley splitting on quantum well width and other epitaxial conditions. Our results suggest that narrower wells can be beneficial for improving valley splittings, but this effect can be confounded by variations in growth and fabrication conditions. These results underscore the importance of valley splitting measurements for guiding the development of Si qubits.
Submitted 26 February, 2021; v1 submitted 9 October, 2020;
originally announced October 2020.
-
Magnetic Gradient Fluctuations from Quadrupolar $^{73}$Ge in Si/SiGe Exchange-Only Qubits
Authors:
J. Kerckhoff,
B. Sun,
B. H. Fong,
C. Jones,
A. A. Kiselev,
D. W. Barnes,
R. S. Noah,
E. Acuna,
M. Akmal,
S. D. Ha,
J. A. Wright,
B. J. Thomas,
C. A. C. Jackson,
L. F. Edge,
K. Eng,
R. S. Ross,
T. D. Ladd
Abstract:
We study the time-fluctuating magnetic gradient noise mechanisms in pairs of Si/SiGe quantum dots using exchange echo noise spectroscopy. We find through a combination of spectral inversion and correspondence to theoretical modeling that quadrupolar precession of the $^{73}$Ge nuclei play a key role in the spin-echo decay time $T_2$, with a characteristic dependence on magnetic field and the width of the Si quantum well. The $^{73}$Ge noise peaks appear at the fundamental and first harmonic of the $^{73}$Ge Larmor resonance, superimposed over $1/f$ noise due to $^{29}$Si dipole-dipole dynamics, and are dependent on material epitaxy and applied magnetic field. These results may inform the needs of dynamical decoupling when using Si/SiGe quantum dots as qubits in quantum information processing devices.
Submitted 17 September, 2020;
originally announced September 2020.
-
GymFG: A Framework with a Gym Interface for FlightGear
Authors:
Andrew Wood,
Ali Sydney,
Peter Chin,
Bishal Thapa,
Ryan Ross
Abstract:
Over the past decades, progress in deployable autonomous flight systems has slowly stagnated. This is reflected in today's production aircraft, where pilots only enable simple physics-based systems such as autopilot for takeoff, landing, navigation, and terrain/traffic avoidance. Evidently, autonomy has not gained the trust of the community where higher problem complexity and cognitive workload are required. To address trust, we must revisit the process for developing autonomous capabilities: modeling and simulation. Given the prohibitive costs for live tests, we need to prototype and evaluate autonomous aerial agents in a high fidelity flight simulator with autonomous learning capabilities applicable to flight systems: such an open-source development platform is not available. As a result, we have developed GymFG: GymFG couples and extends a high fidelity, open-source flight simulator and a robust agent learning framework to facilitate learning of more complex tasks. Furthermore, we have demonstrated the use of GymFG to train an autonomous aerial agent using Imitation Learning. With GymFG, we can now deploy innovative ideas to address complex problems and build the trust necessary to move prototypes to the real-world.
Submitted 26 April, 2020;
originally announced April 2020.
-
Split-Gate Cavity Coupler for Silicon Circuit Quantum Electrodynamics
Authors:
F. Borjans,
X. Croot,
S. Putz,
X. Mi,
S. M. Quinn,
A. Pan,
J. Kerckhoff,
E. J. Pritchett,
C. A. Jackson,
L. F. Edge,
R. S. Ross,
T. D. Ladd,
M. G. Borselli,
M. F. Gyure,
J. R. Petta
Abstract:
Coherent charge-photon and spin-photon coupling has recently been achieved in silicon double quantum dots (DQD). Here we demonstrate a versatile split-gate cavity-coupler that allows more than one DQD to be coupled to the same microwave cavity. Measurements of the cavity transmission as a function of level detuning yield a charge cavity coupling rate $g_c/2π$ = 58 MHz, charge decoherence rate $γ_c/2π$ = 36 MHz, and cavity decay rate $κ/2π$ = 1.2 MHz. The charge cavity coupling rate is in good agreement with device simulations. Our coupling technique can be extended to enable simultaneous coupling of multiple DQDs to the same cavity mode, opening the door to long-range coupling of semiconductor qubits using microwave frequency photons.
Submitted 2 March, 2020;
originally announced March 2020.
-
A Visual Analytics Framework for Reviewing Streaming Performance Data
Authors:
Suraj P. Kesavan,
Takanori Fujiwara,
Jianping Kelvin Li,
Caitlin Ross,
Misbah Mubarak,
Christopher D. Carothers,
Robert B. Ross,
Kwan-Liu Ma
Abstract:
Understanding and tuning the performance of extreme-scale parallel computing systems demands a streaming approach due to the computational cost of applying offline algorithms to vast amounts of performance log data. Analyzing large streaming data is challenging because the rate of receiving data and limited time to comprehend data make it difficult for the analysts to sufficiently examine the data without missing important changes or patterns. To support streaming data analysis, we introduce a visual analytic framework comprising three modules: data management, analysis, and interactive visualization. The data management module collects various computing and communication performance metrics from the monitored system using streaming data processing techniques and feeds the data to the other two modules. The analysis module automatically identifies important changes and patterns at the required latency. In particular, we introduce a set of online and progressive analysis methods for not only controlling the computational costs but also helping analysts better follow the critical aspects of the analysis results. Finally, the interactive visualization module provides the analysts with a coherent view of the changes and patterns in the continuously captured performance data. Through a multi-faceted case study on performance analysis of parallel discrete-event simulation, we demonstrate the effectiveness of our framework for identifying bottlenecks and locating outliers.
Submitted 25 January, 2020;
originally announced January 2020.
-
Resonant Exchange Operation in Triple-Quantum-Dot Qubits for Spin-Photon Transduction
Authors:
Andrew Pan,
Tyler E. Keating,
Mark F. Gyure,
Emily J. Pritchett,
Samuel Quinn,
Richard S. Ross,
Thaddeus D. Ladd,
Joseph Kerckhoff
Abstract:
Triple quantum dots (TQDs) are promising semiconductor spin qubits because of their all-electrical control via fast, tunable exchange interactions and immunity to global magnetic fluctuations. These qubits can experience strong transverse interaction with photons in the resonant exchange (RX) regime, when exchange is simultaneously active on both qubit axes. However, most theoretical work has been based on phenomenological Fermi-Hubbard models, which may not fully capture the complexity of the qubit spin-charge states in this regime. Here we investigate exchange in Si/SiGe and GaAs TQDs using full configuration interaction (FCI) calculations which better describe practical device operation. We show that high exchange operation in general, and the RX regime in particular, can differ significantly from simple models, presenting new challenges and opportunities for spin-photon coupling. We highlight the impact of device electrostatics and effective mass on exchange and identify a new operating point (XRX) where strong spin-photon coupling is most likely to occur in Si/SiGe TQDs. Based on our numerical results, we analyze the feasibility of a remote entanglement cavity iSWAP protocol and discuss design pathways for improving fidelity. Our analysis provides insight into the requirements for TQD spin-photon transduction and demonstrates more generally the necessity of accurate modeling of exchange in spin qubits.
Submitted 8 May, 2020; v1 submitted 24 January, 2020;
originally announced January 2020.
-
Improving MPI Collective I/O Performance With Intra-node Request Aggregation
Authors:
Qiao Kang,
Sunwoo Lee,
Kai-yuan Hou,
Robert Ross,
Ankit Agrawal,
Alok Choudhary,
Wei-keng Liao
Abstract:
Two-phase I/O is a well-known strategy for implementing collective MPI-IO functions. It redistributes I/O requests among the calling processes into a form that minimizes the file access costs. As modern parallel computers continue to grow into the exascale era, the communication cost of such request redistribution can quickly overwhelm collective I/O performance. This effect has been observed from parallel jobs that run on multiple compute nodes with a high count of MPI processes on each node. To reduce the communication cost, we present a new design for collective I/O by adding an extra communication layer that performs request aggregation among processes within the same compute nodes. This approach can significantly reduce inter-node communication congestion when redistributing the I/O requests. We evaluate the performance and compare with the original two-phase I/O on a Cray XC40 parallel computer with Intel KNL processors. Using I/O patterns from two large-scale production applications and an I/O benchmark, we show the performance improvement of up to 29 times when running 16384 MPI processes on 256 compute nodes.
Submitted 29 July, 2019;
originally announced July 2019.
-
The Alt-Right and Global Information Warfare
Authors:
Emmi Bevensee,
Alexander Reid Ross
Abstract:
The Alt-Right is a neo-fascist white supremacist movement that is involved in violent extremism and shows signs of engagement in extensive disinformation campaigns. Using social media data mining, this study develops a deeper understanding of such targeted disinformation campaigns and the ways they spread. It also adds to the available literature on the endogenous and exogenous influences within the US far right, as well as motivating factors that drive disinformation campaigns, such as geopolitical strategy. This study is to be taken as a preliminary analysis to indicate future methods and follow-on research that will help develop an integrated approach to understanding the strategies and associations of the modern fascist movement.
Submitted 7 May, 2019;
originally announced May 2019.
-
A power series method for solving ordinary and partial differentials equations motivated by domain growth
Authors:
Robert Ross
Abstract:
In this work we present a power series method for solving ordinary and partial differential equations. To demonstrate our method we solve a system of ordinary differential equations describing the movement of a random walker on a one-dimensional lattice, two nonlinear ordinary differential equations, a wave and diffusion equation (linear partial differential equations), and a nonlinear partial differential equation (quasilinear). The treatment of boundary conditions and the general solutions to other equations of interest are given in the Supplementary material.
Submitted 11 January, 2019;
originally announced January 2019.
-
Modeling random walkers on growing random networks
Authors:
Robert Ross,
Walter Fontana
Abstract:
We present continuum models that describe the evolution of the position of a random walker on a growing network using four different growth algorithms. Three of these involve a random element, including one in which the motility rate of the random walker controls the network topology. For motility rates in which the position of the walker can be treated as quasi-stationary, we present accurate approximations to replace pair probabilities that allow us to numerically solve an otherwise intractable system of equations.
Submitted 19 April, 2019; v1 submitted 22 December, 2018;
originally announced December 2018.
-
Spatially Resolved Spectroscopic Study of nearby Seyfert Galaxies: Implications for a Population of "Missed" Seyferts at High-$\textit{z}$
Authors:
Junjie Xia,
Matthew A. Malkan,
Nathaniel R. Ross,
Agnes J. Ancheta
Abstract:
We present mosaicked long-slit spectral maps of 18 nearby Active Galactic Nuclei (AGNs), 2 LINERs, and 4 star-forming galaxies. With the resulting data cubes taken using the Kast dual spectrograph on the 3 m Shane telescope of the Lick Observatory, we measure the aperture effects on the spectroscopic classification of AGNs. With more starlight included in a larger aperture, the nuclear spectrum that is Seyfert-like may become contaminated. We generated standard spectroscopic classification diagrams in different observing apertures. These show quantitatively how the ensemble of Seyferts migrates toward the H $\scriptsize{\textrm{II}}$ region classification when observed with increasing aperture sizes. But the effect ranges widely in individual active galaxies. Some of the less luminous Seyferts shift by a large amount, while others barely move or even shift in different directions. We find that those Seyfert galaxies with a fraction of nuclear H$α$ emission lower than 0.2 of the host galaxy, 2-10 keV hard X-ray luminosity lower than $10^{43}$ erg s$^{-1}$, and observed nuclear [O $\scriptsize{\textrm{III}}$] luminosity lower than $10^{40.5}$ erg s$^{-1}$ are more likely to change activity classification type when the entire host galaxy is included. Overall, 4 of our 24 galaxies (18 Seyferts) change their spectral activity classification type when observed with a very large aperture.
Submitted 19 December, 2018;
originally announced December 2018.
-
Generating Diverse and Meaningful Captions
Authors:
Annika Lindh,
Robert J. Ross,
Abhijit Mahalunkar,
Giancarlo Salton,
John D. Kelleher
Abstract:
Image Captioning is a task that requires models to acquire a multi-modal understanding of the world and to express this understanding in natural language text. While the state-of-the-art for this task has rapidly improved in terms of n-gram metrics, these models tend to output the same generic captions for similar images. In this work, we address this limitation and train a model that generates more diverse and specific captions through an unsupervised training approach that incorporates a learning signal from an Image Retrieval model. We summarize previous results and improve the state-of-the-art on caption diversity and novelty. We make our source code publicly available online.
Submitted 19 December, 2018;
originally announced December 2018.
-
A random walker's view of networks whose growth it shapes
Authors:
Robert J. H. Ross,
Charlotte Strandkvist,
Walter Fontana
Abstract:
We study a simple model in which the growth of a network is determined by the location of one or more random walkers. Depending on walker speed, the model generates a spectrum of structures situated between well-known limiting cases. We demonstrate that the average degree observed by a walker is related to the global variance. Modulating the extent to which the location of node attachment is determined by the walker as opposed to random selection is akin to scaling the speed of the walker and generates new limiting behavior. The model raises questions about energetic and computational resource requirements in a physical instantiation.
Submitted 24 January, 2020; v1 submitted 21 November, 2018;
originally announced November 2018.
-
Compressibility of random walker trajectories on growing networks
Authors:
Robert J. H. Ross,
Charlotte Strandkvist,
Walter Fontana
Abstract:
We find that the simple coupling of network growth to the position of a random walker on the network generates a traveling wave in the probability distribution of nodes visited by the walker. We argue that the entropy of this probability distribution is bounded as the network size tends to infinity. This means that the growth of a space coupled to a random walker situated in it constrains its dynamics to a set of typical random walker trajectories, and walker trajectories inside the growing space are compressible.
Submitted 4 April, 2019; v1 submitted 21 November, 2018;
originally announced November 2018.
-
Exploring the Use of Attention within an Neural Machine Translation Decoder States to Translate Idioms
Authors:
Giancarlo D. Salton,
Robert J. Ross,
John D. Kelleher
Abstract:
Idioms pose problems for almost all Machine Translation systems. This type of language is very frequent in day-to-day language use and cannot simply be ignored. Recent interest in memory-augmented models in the field of Language Modelling has helped systems achieve good results by bridging long-distance dependencies. In this paper we explore the use of such techniques in a Neural Machine Translation system to help translate idiomatic language.
Submitted 10 October, 2018;
originally announced October 2018.
-
Spin-Blockade Spectroscopy of Si/SiGe Quantum Dots
Authors:
A. M. Jones,
E. J. Pritchett,
E. H. Chen,
T. E. Keating,
R. W. Andrews,
J. Z. Blumoff,
L. A. De Lorenzo,
K. Eng,
S. D. Ha,
A. A. Kiselev,
S. M. Meenehan,
S. T. Merkel,
J. A. Wright,
L. F. Edge,
R. S. Ross,
M. T. Rakher,
M. G. Borselli,
A. Hunter
Abstract:
We implement a technique for measuring the singlet-triplet energy splitting responsible for spin-to-charge conversion in semiconductor quantum dots. This method, which requires fast, single-shot charge measurement, reliably extracts an energy in the limits of both large and small splittings. We perform this technique on an undoped, accumulation-mode Si/SiGe triple-quantum dot and find that the measured splitting varies smoothly as a function of confinement gate biases. Not only does this demonstration prove the value of having an $in~situ$ excited-state measurement technique as part of a standard tune-up procedure, it also suggests that in typical Si/SiGe quantum dot devices, spin-blockade can be limited by lateral orbital excitation energy rather than valley splitting.
Submitted 21 September, 2018;
originally announced September 2018.
-
Beef Cattle Instance Segmentation Using Fully Convolutional Neural Network
Authors:
Aram Ter-Sarkisov,
Robert Ross,
John Kelleher,
Bernadette Earley,
Michael Keane
Abstract:
We present an instance segmentation algorithm trained and applied to a CCTV recording of beef cattle during a winter finishing period. A fully convolutional network was transformed into an instance segmentation network that learns to label each instance of an animal separately. We introduce a conceptually simple framework that the network uses to output a single prediction for every animal. These results are a contribution towards behaviour analysis in winter finishing beef cattle for early detection of animal welfare-related problems.
Submitted 20 September, 2018; v1 submitted 5 July, 2018;
originally announced July 2018.
-
A Cross-Layer Solution in Scientific Workflow System for Tackling Data Movement Challenge
Authors:
Dong Dai,
Robert Ross,
Dounia Khaldi,
Yonghong Yan,
Matthieu Dorier,
Neda Tavakoli,
Yong Chen
Abstract:
Scientific applications in HPC environments are becoming more complex and more data-intensive. Scientists usually rely on workflow systems to manage the complexity: they define multiple processing steps in a single script and let the workflow system compile it and schedule all tasks accordingly. Numerous workflow systems have been proposed and are widely used, such as Galaxy, Pegasus, Taverna, Kepler, Swift, and AWE.
Traditionally, scientific workflow systems work with parallel file systems, like Lustre, PVFS, Ceph, or other forms of remote shared storage systems. As such, the data (including the intermediate data generated during workflow execution) need to be transferred back and forth between compute nodes and storage systems, which introduces a significant performance bottleneck on I/O operations. As the performance gap between CPUs and storage devices widens, this bottleneck is expected to worsen.
Recently, we introduced the concept of Compute-on-Data-Path, which binds tasks and data more efficiently to reduce data movement cost. For workflow systems, the key is to exploit data locality in the HPC storage hierarchy: if the datasets are stored on compute nodes, near the workflow tasks, then the tasks can access them directly with better performance and less network usage. Several recent studies have built such shared storage systems, utilizing compute node resources, to serve HPC workflows with locality, such as Hercules [1] and WOSS [2]. In this research, we further argue that providing a compute-node-side storage system is not sufficient to fully exploit data locality. A cross-layer solution combining storage system, compiler, and runtime is necessary. We take Swift/T [3], a workflow system for data-intensive applications, as a prototype platform to demonstrate such a cross-layer solution.
Submitted 16 May, 2018;
originally announced May 2018.
-
A Software-Defined Approach for QoS Control in High-Performance Computing Storage Systems
Authors:
Neda Tavakoli,
Dong Dai,
John Jenkins,
Philip Carns,
Robert Ross,
Yong Chen
Abstract:
High-performance computing (HPC) storage systems are becoming increasingly critical to scientific applications given the shift to a data-driven discovery paradigm. In large-scale HPC systems, dozens of applications share the same storage system and can compete and interfere with each other. Application interference can dramatically degrade the overall storage system performance. Therefore, developing a flexible and effective storage solution that assures a certain level of resources per application, i.e., Quality-of-Service (QoS) support, is critical. One common way to achieve QoS assurance for storage systems is provisioning~\cite{3}. Provisioning refers to the ability to provide a certain amount of resources for applications and expected workloads. However, provisioning has limitations, such as requiring detailed knowledge of the expected workloads. In addition, storage workloads are transient and hence expensive to satisfy. Due to these limitations, providing QoS in storage systems through provisioning is challenging.
In this research, a software-defined approach~\cite{0} is proposed as a flexible solution to achieve QoS guarantees for storage systems. The driving force for using a software-defined approach instead of traditional approaches is its ability to enable a more flexible, scalable, and efficient platform. For example, if changes occur in the system, thousands of devices need not be reconfigured individually; instead, reconfiguring a logically centralized component automatically notifies the other devices.
Submitted 16 May, 2018;
originally announced May 2018.
-
φ^4 Solitary Waves in a Parabolic Potential: Existence, Stability, and Collisional Dynamics
Authors:
R. M. Ross,
P. G. Kevrekidis,
D. K. Campbell,
R. Decker,
A. Demirkaya
Abstract:
We explore a φ^4 model with an added external parabolic potential term. This term dramatically alters the spectral properties of the system. We identify single and multiple kink solutions and examine their stability features; importantly, all of the stationary structures turn out to be unstable. We complement these with a dynamical study of the evolution of a single kink in the trap, as well as of the scattering of kink and anti-kink solutions of the model. We see that some of the key characteristics of kink-antikink collisions, such as the critical velocity and the multi-bounce windows, are sensitively dependent on the trap strength parameter, as well as the initial displacement of the kink and antikink.
Submitted 1 May, 2018;
originally announced May 2018.
-
NaSn2As2: An Exfoliatable Layered van der Waals Zintl Phase
Authors:
Maxx Q. Arguilla,
Jyoti Katoch,
Kevin Krymowski,
Nicholas D. Cultrara,
Jinsong Xu,
Xiaoxiang Xi,
Amanda Hanks,
Shishi Jiang,
Richard D. Ross,
Roland J. Koch,
Søren Ulstrup,
Aaron Bostwick,
Chris Jozwiak,
Dave McComb,
Eli Rotenberg,
Jie Shan,
Wolfgang Windl,
Roland K. Kawakami,
Joshua E. Goldberger
Abstract:
The discovery of new families of exfoliatable 2D crystals that have diverse sets of electronic, optical, and spin-orbit coupling properties enables the realization of unique physical phenomena in these few-atom thick building blocks and in proximity to other materials. Herein, using NaSn2As2 as a model system, we demonstrate that layered Zintl phases having the stoichiometry ATt2Pn2 (A = Group 1 or 2 element, Tt = Group 14 tetrel element and Pn = Group 15 pnictogen element) and featuring networks separated by van der Waals gaps can be readily exfoliated with both mechanical and liquid-phase methods. We identified the symmetries of the Raman active modes of the bulk crystals via polarized Raman spectroscopy. The bulk and mechanically exfoliated NaSn2As2 samples are resistant towards oxidation, with only the top surface oxidizing in ambient conditions over a couple of days, while the liquid-exfoliated samples oxidize much more quickly in ambient conditions. Employing angle-resolved photoemission spectroscopy (ARPES), density functional theory (DFT), and transport on bulk and exfoliated samples, we show that NaSn2As2 is a highly conducting 2D semimetal, with resistivities on the order of $10^{-6}$ Ω m. Due to peculiarities in the band structure, the dominating p-type carriers at low temperature are nearly compensated by the opening of n-type conduction channels as temperature increases. This work further expands the family of exfoliatable 2D materials to layered van der Waals Zintl phases, opening up opportunities in electronics and spintronics.
Submitted 24 October, 2017;
originally announced October 2017.
-
Bootstrapping Labelled Dataset Construction for Cow Tracking and Behavior Analysis
Authors:
Aram Ter-Sarkisov,
Robert Ross,
John Kelleher
Abstract:
This paper introduces a new approach to the long-term tracking of an object in a challenging environment. The object is a cow and the environment is an enclosure in a cowshed. Some of the key challenges in this domain are a cluttered background, low contrast, and high similarity between moving objects, which greatly reduce the efficiency of most existing approaches, including those based on background subtraction. Our approach is split into object localization, instance segmentation, learning, and tracking stages. Our solution is compared to a range of semi-supervised object tracking algorithms and we show that the performance is strong and well suited to subsequent analysis. We present our solution as a first step towards broader tracking and behavior monitoring for cows in precision agriculture, with the ultimate objective of early detection of lameness.
Submitted 30 March, 2017;
originally announced March 2017.
-
Hα Imaging of Nearby Seyfert Host Galaxies
Authors:
R. L. Theios,
M. A. Malkan,
N. R. Ross
Abstract:
We used narrowband interference filters with the CCD imaging camera on the Nickel 1.0 meter telescope at Lick Observatory to observe 31 nearby (z < 0.03) Seyfert galaxies in the 12 μm Active Galaxy Sample. We obtained pure emission line images of each galaxy in order to separate Hα emission from the nucleus from that of the host galaxy. The extended Hα emission is expected to be powered by newly formed hot stars, and correlates well with other indicators of current star formation in these galaxies: 7.7 μm PAH, far-infrared, and radio luminosity. Relative to what would be expected from recent star formation, there is a 0.8 dex excess of radio emission in our Seyfert galaxies. The nuclear Hα luminosity is dominated by the AGN, and is correlated with the hard X-ray luminosity. There is an upward offset of 1 dex in this correlation for the Seyfert 1s due to a strong contribution from the Broad Line Region. We found a correlation between star formation rate and AGN luminosity. In spite of selection effects, we concluded that the absence of bright Seyfert nuclei in galaxies with low SFRs is real, albeit only weakly significant. We used our measured spatial distributions of Hα emission to determine what these Seyfert galaxies would look like when observed through fixed apertures at high redshifts. Although all would be detectable emission line galaxies at any redshift, most would appear dominated by HII region emission. Only the most luminous AGN would still be identified at z~0.3.
Submitted 31 March, 2016;
originally announced April 2016.