-
WeatherQA: Can Multimodal Language Models Reason about Severe Weather?
Authors:
Chengqian Ma,
Zhanxiang Hua,
Alexandra Anderson-Frey,
Vikram Iyer,
Xin Liu,
Lianhui Qin
Abstract:
Severe convective weather events, such as hail, tornadoes, and thunderstorms, often occur quickly yet cause significant damage, costing billions of dollars every year. This highlights the importance of forecasting severe weather threats hours in advance to better prepare meteorologists and residents in at-risk areas. Can modern large foundation models perform such forecasting? Existing weather ben…
▽ More
Severe convective weather events, such as hail, tornadoes, and thunderstorms, often occur quickly yet cause significant damage, costing billions of dollars every year. This highlights the importance of forecasting severe weather threats hours in advance to better prepare meteorologists and residents in at-risk areas. Can modern large foundation models perform such forecasting? Existing weather benchmarks typically focus only on predicting time-series changes in certain weather parameters (e.g., temperature, moisture) with text-only features. In this work, we introduce WeatherQA, the first multimodal dataset designed for machines to reason about complex combinations of weather parameters (a.k.a., ingredients) and predict severe weather in real-world scenarios. The dataset includes over 8,000 (multi-images, text) pairs for diverse severe weather events. Each pair contains rich information crucial for forecasting -- the images describe the ingredients capturing environmental instability, surface observations, and radar reflectivity, and the text contains forecast analyses written by human experts. With WeatherQA, we evaluate state-of-the-art vision language models, including GPT4, Claude3.5, Gemini-1.5, and a fine-tuned Llama3-based VLM, by designing two challenging tasks: (1) multi-choice QA for predicting affected area and (2) classification of the development potential of severe convection. These tasks require deep understanding of domain knowledge (e.g., atmospheric dynamics) and complex reasoning over multimodal data (e.g., interactions between weather parameters). We show a substantial gap between the strongest VLM, GPT4o, and human reasoning. Our comprehensive case study with meteorologists further reveals the weaknesses of the models, suggesting that better training and data integration are necessary to bridge this gap. WeatherQA link: https://github.com/chengqianma/WeatherQA.
△ Less
Submitted 23 June, 2024; v1 submitted 17 June, 2024;
originally announced June 2024.
-
mHuBERT-147: A Compact Multilingual HuBERT Model
Authors:
Marcely Zanon Boito,
Vivek Iyer,
Nikolaos Lagos,
Laurent Besacier,
Ioan Calapodescu
Abstract:
We present mHuBERT-147, the first general-purpose massively multilingual HuBERT speech representation model trained on 90K hours of clean, open-license data. To scale up the multi-iteration HuBERT approach, we use faiss-based clustering, achieving 5.2x faster label assignment than the original method. We also apply a new multilingual batching up-sampling strategy, leveraging both language and data…
▽ More
We present mHuBERT-147, the first general-purpose massively multilingual HuBERT speech representation model trained on 90K hours of clean, open-license data. To scale up the multi-iteration HuBERT approach, we use faiss-based clustering, achieving 5.2x faster label assignment than the original method. We also apply a new multilingual batching up-sampling strategy, leveraging both language and dataset diversity. After 3 training iterations, our compact 95M parameter mHuBERT-147 outperforms larger models trained on substantially more data. We rank second and first on the ML-SUPERB 10min and 1h leaderboards, with SOTA scores for 3 tasks. Across ASR/LID tasks, our model consistently surpasses XLS-R (300M params; 436K hours) and demonstrates strong competitiveness against the much larger MMS (1B params; 491K hours). Our findings indicate that mHuBERT-147 is a promising model for multilingual speech tasks, offering an unprecedented balance between high performance and parameter efficiency.
△ Less
Submitted 27 June, 2024; v1 submitted 10 June, 2024;
originally announced June 2024.
-
Agnostic Tomography of Stabilizer Product States
Authors:
Sabee Grewal,
Vishnu Iyer,
William Kretschmer,
Daniel Liang
Abstract:
We define a quantum learning task called agnostic tomography, where given copies of an arbitrary state $ρ$ and a class of quantum states $\mathcal{C}$, the goal is to output a succinct description of a state that approximates $ρ$ at least as well as any state in $\mathcal{C}$ (up to some small error $\varepsilon$). This task generalizes ordinary quantum tomography of states in $\mathcal{C}$ and is…
▽ More
We define a quantum learning task called agnostic tomography, where given copies of an arbitrary state $ρ$ and a class of quantum states $\mathcal{C}$, the goal is to output a succinct description of a state that approximates $ρ$ at least as well as any state in $\mathcal{C}$ (up to some small error $\varepsilon$). This task generalizes ordinary quantum tomography of states in $\mathcal{C}$ and is more challenging because the learning algorithm must be robust to perturbations of $ρ$.
We give an efficient agnostic tomography algorithm for the class $\mathcal{C}$ of $n$-qubit stabilizer product states. Assuming $ρ$ has fidelity at least $τ$ with a stabilizer product state, the algorithm runs in time $n^{O(1 + \log(1/τ))} / \varepsilon^2$. This runtime is quasipolynomial in all parameters, and polynomial if $τ$ is a constant.
△ Less
Submitted 4 April, 2024;
originally announced April 2024.
-
Biodegradable Interactive Materials
Authors:
Zhihan Zhang,
Mallory Parker,
Kuotian Liao,
Jerry Cao,
Anandghan Waghmare,
Joseph Breda,
Chris Matsumura,
Serena Eley,
Eleftheria Roumeli,
Shwetak Patel,
Vikram Iyer
Abstract:
The sense of touch is fundamental to how we interact with the physical and digital world. Conventional interactive surfaces and tactile interfaces use electronic sensors embedded into objects, however this approach poses serious challenges both for environmental sustainability and a future of truly ubiquitous interaction systems where information is encoded into everyday objects. In this work, we…
▽ More
The sense of touch is fundamental to how we interact with the physical and digital world. Conventional interactive surfaces and tactile interfaces use electronic sensors embedded into objects, however this approach poses serious challenges both for environmental sustainability and a future of truly ubiquitous interaction systems where information is encoded into everyday objects. In this work, we present Biodegradable Interactive Materials: backyard-compostable interactive interfaces that leverage information encoded in material properties. Inspired by natural systems, we propose an architecture that programmatically encodes multidimensional information into materials themselves and combines them with wearable devices that extend human senses to perceive the embedded data. We combine unrefined biological matter from plants and algae like chlorella with natural minerals like graphite and magnetite to produce materials with varying electrical, magnetic, and surface properties. We perform in-depth analysis using physics models, computational simulations, and real-world experiments to characterize their information density and develop decoding methods. Our passive, chip-less materials can robustly encode 12 bits of information, equivalent to 4096 unique classes. We further develop wearable device prototypes that can decode this information during touch interactions using off-the-shelf sensors. We demonstrate sample applications such as customized buttons, tactile maps, and interactive surfaces. We further demonstrate the natural degradation of these interactive materials in degrade outdoors within 21 days and perform a comparative environmental analysis of the benefits of this approach.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
Pseudoentanglement Ain't Cheap
Authors:
Sabee Grewal,
Vishnu Iyer,
William Kretschmer,
Daniel Liang
Abstract:
We show that any pseudoentangled state ensemble with a gap of $t$ bits of entropy requires $Ω(t)$ non-Clifford gates to prepare. This bound is tight up to polylogarithmic factors if linear-time quantum-secure pseudorandom functions exist. Our result follows from a polynomial-time algorithm to estimate the entanglement entropy of a quantum state across any cut of qubits. When run on an $n$-qubit st…
▽ More
We show that any pseudoentangled state ensemble with a gap of $t$ bits of entropy requires $Ω(t)$ non-Clifford gates to prepare. This bound is tight up to polylogarithmic factors if linear-time quantum-secure pseudorandom functions exist. Our result follows from a polynomial-time algorithm to estimate the entanglement entropy of a quantum state across any cut of qubits. When run on an $n$-qubit state that is stabilized by at least $2^{n-t}$ Pauli operators, our algorithm produces an estimate that is within an additive factor of $\frac{t}{2}$ bits of the true entanglement entropy.
△ Less
Submitted 11 April, 2024; v1 submitted 29 March, 2024;
originally announced April 2024.
-
LabelAId: Just-in-time AI Interventions for Improving Human Labeling Quality and Domain Knowledge in Crowdsourcing Systems
Authors:
Chu Li,
Zhihan Zhang,
Michael Saugstad,
Esteban Safranchik,
Minchu Kulkarni,
Xiaoyu Huang,
Shwetak Patel,
Vikram Iyer,
Tim Althoff,
Jon E. Froehlich
Abstract:
Crowdsourcing platforms have transformed distributed problem-solving, yet quality control remains a persistent challenge. Traditional quality control measures, such as prescreening workers and refining instructions, often focus solely on optimizing economic output. This paper explores just-in-time AI interventions to enhance both labeling quality and domain-specific knowledge among crowdworkers. W…
▽ More
Crowdsourcing platforms have transformed distributed problem-solving, yet quality control remains a persistent challenge. Traditional quality control measures, such as prescreening workers and refining instructions, often focus solely on optimizing economic output. This paper explores just-in-time AI interventions to enhance both labeling quality and domain-specific knowledge among crowdworkers. We introduce LabelAId, an advanced inference model combining Programmatic Weak Supervision (PWS) with FT-Transformers to infer label correctness based on user behavior and domain knowledge. Our technical evaluation shows that our LabelAId pipeline consistently outperforms state-of-the-art ML baselines, improving mistake inference accuracy by 36.7% with 50 downstream samples. We then implemented LabelAId into Project Sidewalk, an open-source crowdsourcing platform for urban accessibility. A between-subjects study with 34 participants demonstrates that LabelAId significantly enhances label precision without compromising efficiency while also increasing labeler confidence. We discuss LabelAId's success factors, limitations, and its generalizability to other crowdsourced science domains.
△ Less
Submitted 14 March, 2024;
originally announced March 2024.
-
PDQMA = DQMA = NEXP: QMA With Hidden Variables and Non-collapsing Measurements
Authors:
Scott Aaronson,
Sabee Grewal,
Vishnu Iyer,
Simon C. Marshall,
Ronak Ramachandran
Abstract:
We define and study a variant of QMA (Quantum Merlin Arthur) in which Arthur can make multiple non-collapsing measurements to Merlin's witness state, in addition to ordinary collapsing measurements. By analogy to the class PDQP defined by Aaronson, Bouland, Fitzsimons, and Lee (2014), we call this class PDQMA. Our main result is that PDQMA = NEXP; this result builds on the MIP = NEXP Theorem and c…
▽ More
We define and study a variant of QMA (Quantum Merlin Arthur) in which Arthur can make multiple non-collapsing measurements to Merlin's witness state, in addition to ordinary collapsing measurements. By analogy to the class PDQP defined by Aaronson, Bouland, Fitzsimons, and Lee (2014), we call this class PDQMA. Our main result is that PDQMA = NEXP; this result builds on the MIP = NEXP Theorem and complements the result of Aaronson (2018) that PDQP/qpoly = ALL. While the result has little to do with quantum mechanics, we also show a more "quantum" result: namely, that QMA with the ability to inspect the entire history of a hidden variable is equal to NEXP, under mild assumptions on the hidden-variable theory. We also observe that a quantum computer, augmented with quantum advice and the ability to inspect the history of a hidden variable, can solve any decision problem in polynomial time.
△ Less
Submitted 4 March, 2024;
originally announced March 2024.
-
AVELA -- A Vision for Engineering Literacy & Access: Understanding Why Technology Alone Is Not Enough
Authors:
Kyle Johnson,
Vicente Arroyos,
Celeste Garcia,
Liban Hussein,
Aisha Cora,
Tsewone Melaku,
Jay L. Cunningham,
R. Benjamin Shapiro,
Vikram Iyer
Abstract:
Unequal technology access for Black and Latine communities has been a persistent economic, social justice, and human rights issue despite increased technology accessibility due to advancements in consumer electronics like phones, tablets, and computers. We contextualize socio-technical access inequalities for Black and Latine urban communities and find that many students are hesitant to engage wit…
▽ More
Unequal technology access for Black and Latine communities has been a persistent economic, social justice, and human rights issue despite increased technology accessibility due to advancements in consumer electronics like phones, tablets, and computers. We contextualize socio-technical access inequalities for Black and Latine urban communities and find that many students are hesitant to engage with available technologies due to a lack of engaging support systems. We present a holistic student-led STEM engagement model through AVELA - A Vision for Engineering Literacy and Access leveraging culturally responsive lessons, mentor embodied community representation, and service learning. To evaluate the model's impact after 4 years of mentoring 200+ university student instructors in teaching to 2,500+ secondary school students in 100+ classrooms, we conducted 24 semi-structured interviews with college AnonymizedOrganization members. We identify access barriers and provide principled recommendations for designing future STEM education programs.
△ Less
Submitted 29 January, 2024; v1 submitted 25 January, 2024;
originally announced January 2024.
-
A review on different techniques used to combat the non-IID and heterogeneous nature of data in FL
Authors:
Venkataraman Natarajan Iyer
Abstract:
Federated Learning (FL) is a machine-learning approach enabling collaborative model training across multiple decentralized edge devices that hold local data samples, all without exchanging these samples. This collaborative process occurs under the supervision of a central server orchestrating the training or via a peer-to-peer network. The significance of FL is particularly pronounced in industrie…
▽ More
Federated Learning (FL) is a machine-learning approach enabling collaborative model training across multiple decentralized edge devices that hold local data samples, all without exchanging these samples. This collaborative process occurs under the supervision of a central server orchestrating the training or via a peer-to-peer network. The significance of FL is particularly pronounced in industries such as healthcare and finance, where data privacy holds paramount importance. However, training a model under the Federated learning setting brings forth several challenges, with one of the most prominent being the heterogeneity of data distribution among the edge devices. The data is typically non-independently and non-identically distributed (non-IID), thereby presenting challenges to model convergence. This report delves into the issues arising from non-IID and heterogeneous data and explores current algorithms designed to address these challenges.
△ Less
Submitted 1 January, 2024;
originally announced January 2024.
-
From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language Models
Authors:
Zachary Englhardt,
Chengqian Ma,
Margaret E. Morris,
Xuhai "Orson" Xu,
Chun-Cheng Chang,
Lianhui Qin,
Daniel McDuff,
Xin Liu,
Shwetak Patel,
Vikram Iyer
Abstract:
Passively collected behavioral health data from ubiquitous sensors holds significant promise to provide mental health professionals insights from patient's daily lives; however, develo** analysis tools to use this data in clinical practice requires addressing challenges of generalization across devices and weak or ambiguous correlations between the measured signals and an individual's mental hea…
▽ More
Passively collected behavioral health data from ubiquitous sensors holds significant promise to provide mental health professionals insights from patient's daily lives; however, develo** analysis tools to use this data in clinical practice requires addressing challenges of generalization across devices and weak or ambiguous correlations between the measured signals and an individual's mental health. To address these challenges, we take a novel approach that leverages large language models (LLMs) to synthesize clinically useful insights from multi-sensor data. We develop chain of thought prompting methods that use LLMs to generate reasoning about how trends in data such as step count and sleep relate to conditions like depression and anxiety. We first demonstrate binary depression classification with LLMs achieving accuracies of 61.1% which exceed the state of the art. While it is not robust for clinical use, this leads us to our key finding: even more impactful and valued than classification is a new human-AI collaboration approach in which clinician experts interactively query these tools and combine their domain expertise and context about the patient with AI generated reasoning to support clinical decision-making. We find models like GPT-4 correctly reference numerical data 75% of the time, and clinician participants express strong interest in using this approach to interpret self-tracking data.
△ Less
Submitted 25 November, 2023; v1 submitted 21 November, 2023;
originally announced November 2023.
-
DeltaLCA: Comparative Life-Cycle Assessment for Electronics Design
Authors:
Zhihan Zhang,
Felix Hähnlein,
Yuxuan Mei,
Zachary Englhardt,
Shwetak Patel,
Adriana Schulz,
Vikram Iyer
Abstract:
Reducing the environmental footprint of electronics and computing devices requires new tools that empower designers to make informed decisions about sustainability during the design process itself. This is not possible with current tools for life cycle assessment (LCA) which require substantial domain expertise and time to evaluate the numerous chips and other components that make up a device. We…
▽ More
Reducing the environmental footprint of electronics and computing devices requires new tools that empower designers to make informed decisions about sustainability during the design process itself. This is not possible with current tools for life cycle assessment (LCA) which require substantial domain expertise and time to evaluate the numerous chips and other components that make up a device. We observe first that informed decision-making does not require absolute metrics and can instead be done by comparing designs. Second, we can use domain-specific heuristics to perform these comparisons. We combine these insights to develop DeltaLCA, an open-source interactive design tool that addresses the dual challenges of automating life cycle inventory generation and data availability by performing comparative analyses of electronics designs. Users can upload standard design files from Electronic Design Automation (EDA) software and the tool will guide them through determining which one has greater carbon footprint. DeltaLCA leverages electronics-specific LCA datasets and heuristics and tries to automatically rank the two designs, prompting users to provide additional information only when necessary. We show through case studies DeltaLCA achieves the same result as evaluating full LCAs, and that it accelerates LCA comparisons from eight expert-hours to a single click for devices with ~30 components, and 15 minutes for more complex devices with ~100 components.
△ Less
Submitted 16 November, 2023;
originally announced November 2023.
-
Code-Switching with Word Senses for Pretraining in Neural Machine Translation
Authors:
Vivek Iyer,
Edoardo Barba,
Alexandra Birch,
Jeff Z. Pan,
Roberto Navigli
Abstract:
Lexical ambiguity is a significant and pervasive challenge in Neural Machine Translation (NMT), with many state-of-the-art (SOTA) NMT systems struggling to handle polysemous words (Campolungo et al., 2022). The same holds for the NMT pretraining paradigm of denoising synthetic "code-switched" text (Pan et al., 2021; Iyer et al., 2023), where word senses are ignored in the noising stage -- leading…
▽ More
Lexical ambiguity is a significant and pervasive challenge in Neural Machine Translation (NMT), with many state-of-the-art (SOTA) NMT systems struggling to handle polysemous words (Campolungo et al., 2022). The same holds for the NMT pretraining paradigm of denoising synthetic "code-switched" text (Pan et al., 2021; Iyer et al., 2023), where word senses are ignored in the noising stage -- leading to harmful sense biases in the pretraining data that are subsequently inherited by the resulting models. In this work, we introduce Word Sense Pretraining for Neural Machine Translation (WSP-NMT) - an end-to-end approach for pretraining multilingual NMT models leveraging word sense-specific information from Knowledge Bases. Our experiments show significant improvements in overall translation quality. Then, we show the robustness of our approach to scale to various challenging data and resource-scarce scenarios and, finally, report fine-grained accuracy improvements on the DiBiMT disambiguation benchmark. Our studies yield interesting and novel insights into the merits and challenges of integrating word sense information and structured knowledge in multilingual pretraining for NMT.
△ Less
Submitted 21 October, 2023;
originally announced October 2023.
-
On the Rational Degree of Boolean Functions and Applications
Authors:
Vishnu Iyer,
Siddhartha Jain,
Matt Kovacs-Deak,
Vinayak M. Kumar,
Luke Schaeffer,
Daochen Wang,
Michael Whitmeyer
Abstract:
We study a natural complexity measure of Boolean functions known as the (exact) rational degree. For total functions $f$, it is conjectured that $\mathrm{rdeg}(f)$ is polynomially related to $\mathrm{deg}(f)$, where $\mathrm{deg}(f)$ is the Fourier degree. Towards this conjecture, we show that symmetric functions have rational degree at least $\mathrm{deg}(f)/2$ and monotone functions have rationa…
▽ More
We study a natural complexity measure of Boolean functions known as the (exact) rational degree. For total functions $f$, it is conjectured that $\mathrm{rdeg}(f)$ is polynomially related to $\mathrm{deg}(f)$, where $\mathrm{deg}(f)$ is the Fourier degree. Towards this conjecture, we show that symmetric functions have rational degree at least $\mathrm{deg}(f)/2$ and monotone functions have rational degree at least $\sqrt{\mathrm{deg}(f)}$. We observe that both of these lower bounds are tight. In addition, we show that all read-once depth-$d$ Boolean formulae have rational degree at least $Ω(\mathrm{deg}(f)^{1/d})$. Furthermore, we show that almost every Boolean function on $n$ variables has rational degree at least $n/2 - O(\sqrt{n})$.
In contrast to total functions, we exhibit partial functions that witness unbounded separations between rational and approximate degree, in both directions. As a consequence, we show that for quantum computers, post-selection and bounded-error are incomparable resources in the black-box model.
△ Less
Submitted 11 October, 2023;
originally announced October 2023.
-
SeMAnD: Self-Supervised Anomaly Detection in Multimodal Geospatial Datasets
Authors:
Daria Reshetova,
Swetava Ganguli,
C. V. Krishnakumar Iyer,
Vipul Pandey
Abstract:
We propose a Self-supervised Anomaly Detection technique, called SeMAnD, to detect geometric anomalies in Multimodal geospatial datasets. Geospatial data comprises of acquired and derived heterogeneous data modalities that we transform to semantically meaningful, image-like tensors to address the challenges of representation, alignment, and fusion of multimodal data. SeMAnD is comprised of (i) a s…
▽ More
We propose a Self-supervised Anomaly Detection technique, called SeMAnD, to detect geometric anomalies in Multimodal geospatial datasets. Geospatial data comprises of acquired and derived heterogeneous data modalities that we transform to semantically meaningful, image-like tensors to address the challenges of representation, alignment, and fusion of multimodal data. SeMAnD is comprised of (i) a simple data augmentation strategy, called RandPolyAugment, capable of generating diverse augmentations of vector geometries, and (ii) a self-supervised training objective with three components that incentivize learning representations of multimodal data that are discriminative to local changes in one modality which are not corroborated by the other modalities. Detecting local defects is crucial for geospatial anomaly detection where even small anomalies (e.g., shifted, incorrectly connected, malformed, or missing polygonal vector geometries like roads, buildings, landcover, etc.) are detrimental to the experience and safety of users of geospatial applications like map**, routing, search, and recommendation systems. Our empirical study on test sets of different types of real-world geometric geospatial anomalies across 3 diverse geographical regions demonstrates that SeMAnD is able to detect real-world defects and outperforms domain-agnostic anomaly detection strategies by 4.8-19.7% as measured using anomaly classification AUC. We also show that model performance increases (i) up to 20.4% as the number of input modalities increase and (ii) up to 22.9% as the diversity and strength of training data augmentations increase.
△ Less
Submitted 26 September, 2023;
originally announced September 2023.
-
Towards Effective Disambiguation for Machine Translation with Large Language Models
Authors:
Vivek Iyer,
Pinzhen Chen,
Alexandra Birch
Abstract:
Resolving semantic ambiguity has long been recognised as a central challenge in the field of Machine Translation. Recent work on benchmarking translation performance on ambiguous sentences has exposed the limitations of conventional Neural Machine Translation (NMT) systems, which fail to handle many such cases. Large language models (LLMs) have emerged as a promising alternative, demonstrating com…
▽ More
Resolving semantic ambiguity has long been recognised as a central challenge in the field of Machine Translation. Recent work on benchmarking translation performance on ambiguous sentences has exposed the limitations of conventional Neural Machine Translation (NMT) systems, which fail to handle many such cases. Large language models (LLMs) have emerged as a promising alternative, demonstrating comparable performance to traditional NMT models while introducing new paradigms for controlling the target outputs. In this paper, we study the capabilities of LLMs to translate "ambiguous sentences" - i.e. those containing highly polysemous words and/or rare word senses. We also propose two ways to improve their disambiguation capabilities, through a) in-context learning and b) fine-tuning on carefully curated ambiguous datasets. Experiments show that our methods can match or outperform state-of-the-art systems such as DeepL and NLLB in four out of five language directions. Our research provides valuable insights into effectively adapting LLMs to become better disambiguators during Machine Translation. We release our curated disambiguation corpora and resources at https://data.statmt.org/ambiguous-europarl.
△ Less
Submitted 21 October, 2023; v1 submitted 20 September, 2023;
originally announced September 2023.
-
Solar-powered shape-changing origami microfliers
Authors:
Kyle Johnson,
Vicente Arroyos,
Amélie Ferran,
Tilboon Elberier,
Raul Villanueva,
Dennis Yin,
Alberto Aliseda,
Sawyer Fuller,
Vikram Iyer,
Shyamnath Gollakota
Abstract:
Using wind to disperse microfliers that fall like seeds and leaves can help automate large-scale sensor deployments. Here, we present battery-free microfliers that can change shape in mid-air to vary their dispersal distance. We design origami microfliers using bi-stable leaf-out structures and uncover an important property: a simple change in the shape of these origami structures causes two drama…
▽ More
Using wind to disperse microfliers that fall like seeds and leaves can help automate large-scale sensor deployments. Here, we present battery-free microfliers that can change shape in mid-air to vary their dispersal distance. We design origami microfliers using bi-stable leaf-out structures and uncover an important property: a simple change in the shape of these origami structures causes two dramatically different falling behaviors. When unfolded and flat, the microfliers exhibit a tumbling behavior that increases lateral displacement in the wind. When folded inward, their orientation is stabilized, resulting in a downward descent that is less influenced by wind. To electronically transition between these two shapes, we designed a low-power electromagnetic actuator that produces peak forces of up to 200 millinewtons within 25 milliseconds while powered by solar cells. We fabricated a circuit directly on the folded origami structure that includes a programmable microcontroller, Bluetooth radio, solar power harvesting circuit, a pressure sensor to estimate altitude and a temperature sensor. Outdoor evaluations show that our 414 milligram origami microfliers are able to electronically change their shape mid-air, travel up to 98 meters in a light breeze, and wirelessly transmit data via Bluetooth up to 60 meters away, using only power collected from the sun.
△ Less
Submitted 13 September, 2023;
originally announced September 2023.
-
Recyclable vitrimer-based printed circuit board for circular electronics
Authors:
Zhihan Zhang,
Agni K. Biswal,
Ankush Nandi,
Kali Frost,
Jake A. Smith,
Bichlien H. Nguyen,
Shwetak Patel,
Aniruddh Vashisth,
Vikram Iyer
Abstract:
Electronics are integral to modern life; however, at their end-of-life these devices produce environmentally hazardous electronic waste (e-waste). Recycling the ubiquitous printed circuit boards (PCBs) that make up a substantial mass and volume fraction of e-waste is challenging due to their use of irreversibly cured thermoset epoxies. We present a PCB formulation using transesterification vitrime…
▽ More
Electronics are integral to modern life; however, at their end-of-life these devices produce environmentally hazardous electronic waste (e-waste). Recycling the ubiquitous printed circuit boards (PCBs) that make up a substantial mass and volume fraction of e-waste is challenging due to their use of irreversibly cured thermoset epoxies. We present a PCB formulation using transesterification vitrimers (vPCBs), and an end-to-end fabrication process compatible with standard manufacturing ecosystems. We create functional prototypes of IoT devices transmitting 2.4 GHz radio signals on vPCBs with electrical and mechanical properties meeting industry standards. Fractures and holes in vPCBs can be repaired while retaining comparable performance over more than four repair cycles. We further demonstrate non-destructive decomposition of transesterification vitrimer composites with solid inclusions and metal attachments by polymer swelling with small molecule solvents. We hypothesize that unlike traditional solvolysis recycling, swelling does not degrade the materials. Through dynamic mechanical analysis we find negligible catalyst loss, minimal changes in storage modulus, and equivalent polymer backbone composition across multiple recycling cycles. We achieve 98% polymer recovery, 100% fiber recovery, and 91% solvent recovery which we reuse to create new vPCBs without degraded performance. Our cradle-to-cradle life-cycle assessment shows substantial environmental impact reduction over conventional PCBs in 11 categories.
△ Less
Submitted 23 August, 2023;
originally announced August 2023.
-
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
Authors:
Neel Guha,
Julian Nyarko,
Daniel E. Ho,
Christopher Ré,
Adam Chilton,
Aditya Narayana,
Alex Chohlas-Wood,
Austin Peters,
Brandon Waldon,
Daniel N. Rockmore,
Diego Zambrano,
Dmitry Talisman,
Enam Hoque,
Faiz Surani,
Frank Fagan,
Galit Sarfaty,
Gregory M. Dickinson,
Haggai Porat,
Jason Hegland,
Jessica Wu,
Joe Nudell,
Joel Niklaus,
John Nay,
Jonathan H. Choi,
Kevin Tobia
, et al. (15 additional authors not shown)
Abstract:
The advent of large language models (LLMs) and their adoption by the legal community has given rise to the question: what types of legal reasoning can LLMs perform? To enable greater study of this question, we present LegalBench: a collaboratively constructed legal reasoning benchmark consisting of 162 tasks covering six different types of legal reasoning. LegalBench was built through an interdisc…
▽ More
The advent of large language models (LLMs) and their adoption by the legal community has given rise to the question: what types of legal reasoning can LLMs perform? To enable greater study of this question, we present LegalBench: a collaboratively constructed legal reasoning benchmark consisting of 162 tasks covering six different types of legal reasoning. LegalBench was built through an interdisciplinary process, in which we collected tasks designed and hand-crafted by legal professionals. Because these subject matter experts took a leading role in construction, tasks either measure legal reasoning capabilities that are practically useful, or measure reasoning skills that lawyers find interesting. To enable cross-disciplinary conversations about LLMs in the law, we additionally show how popular legal frameworks for describing legal reasoning -- which distinguish between its many forms -- correspond to LegalBench tasks, thus giving lawyers and LLM developers a common vocabulary. This paper describes LegalBench, presents an empirical evaluation of 20 open-source and commercial LLMs, and illustrates the types of research explorations LegalBench enables.
△ Less
Submitted 20 August, 2023;
originally announced August 2023.
-
Efficient Learning of Quantum States Prepared With Few Non-Clifford Gates II: Single-Copy Measurements
Authors:
Sabee Grewal,
Vishnu Iyer,
William Kretschmer,
Daniel Liang
Abstract:
Recent work has shown that $n$-qubit quantum states output by circuits with at most $t$ single-qubit non-Clifford gates can be learned to trace distance $ε$ using $\mathsf{poly}(n,2^t,1/ε)$ time and samples. All prior algorithms achieving this runtime use entangled measurements across two copies of the input state. In this work, we give a similarly efficient algorithm that learns the same class of…
▽ More
Recent work has shown that $n$-qubit quantum states output by circuits with at most $t$ single-qubit non-Clifford gates can be learned to trace distance $ε$ using $\mathsf{poly}(n,2^t,1/ε)$ time and samples. All prior algorithms achieving this runtime use entangled measurements across two copies of the input state. In this work, we give a similarly efficient algorithm that learns the same class of states using only single-copy measurements.
△ Less
Submitted 4 April, 2024; v1 submitted 14 August, 2023;
originally announced August 2023.
-
Exploring and Characterizing Large Language Models For Embedded System Development and Debugging
Authors:
Zachary Englhardt,
Richard Li,
Dilini Nissanka,
Zhihan Zhang,
Girish Narayanswamy,
Joseph Breda,
Xin Liu,
Shwetak Patel,
Vikram Iyer
Abstract:
Large language models (LLMs) have shown remarkable abilities to generate code, however their ability to develop software for embedded systems, which requires cross-domain knowledge of hardware and software has not been studied. In this paper we develop an extensible, open source hardware-in-the-loop framework to systematically evaluate leading LLMs (GPT-3.5, GPT-4, PaLM 2) to assess their capabili…
▽ More
Large language models (LLMs) have shown remarkable abilities to generate code, however their ability to develop software for embedded systems, which requires cross-domain knowledge of hardware and software has not been studied. In this paper we develop an extensible, open source hardware-in-the-loop framework to systematically evaluate leading LLMs (GPT-3.5, GPT-4, PaLM 2) to assess their capabilities and limitations for embedded system development. We observe through our study that even when these tools fail to produce working code, they consistently generate helpful reasoning about embedded design tasks. We leverage this finding to study how human programmers interact with these tools, and develop an human-AI based software engineering workflow for building embedded systems.
Our evaluation platform for verifying LLM generated programs uses sensor actuator pairs for physical evaluation. We compare all three models with N=450 experiments and find surprisingly that GPT-4 especially shows an exceptional level of cross-domain understanding and reasoning, in some cases generating fully correct programs from a single prompt. In N=50 trials, GPT-4 produces functional I2C interfaces 66% of the time. GPT-4 also produces register-level drivers, code for LoRa communication, and context-specific power optimizations for an nRF52 program resulting in over 740x current reduction to 12.2uA. We also characterize the models' limitations to develop a generalizable human-AI workflow for using LLMs in embedded system development. We evaluate our workflow with 15 users including novice and expert programmers. We find that our workflow improves productivity for all users and increases the success rate for building a LoRa environmental sensor from 25% to 100%, including for users with zero hardware or C/C++ experience.
△ Less
Submitted 21 November, 2023; v1 submitted 7 July, 2023;
originally announced July 2023.
-
ParaAMR: A Large-Scale Syntactically Diverse Paraphrase Dataset by AMR Back-Translation
Authors:
Kuan-Hao Huang,
Varun Iyer,
I-Hung Hsu,
Anoop Kumar,
Kai-Wei Chang,
Aram Galstyan
Abstract:
Paraphrase generation is a long-standing task in natural language processing (NLP). Supervised paraphrase generation models, which rely on human-annotated paraphrase pairs, are cost-inefficient and hard to scale up. On the other hand, automatically annotated paraphrase pairs (e.g., by machine back-translation), usually suffer from the lack of syntactic diversity -- the generated paraphrase sentenc…
▽ More
Paraphrase generation is a long-standing task in natural language processing (NLP). Supervised paraphrase generation models, which rely on human-annotated paraphrase pairs, are cost-inefficient and hard to scale up. On the other hand, automatically annotated paraphrase pairs (e.g., by machine back-translation), usually suffer from the lack of syntactic diversity -- the generated paraphrase sentences are very similar to the source sentences in terms of syntax. In this work, we present ParaAMR, a large-scale syntactically diverse paraphrase dataset created by abstract meaning representation back-translation. Our quantitative analysis, qualitative examples, and human evaluation demonstrate that the paraphrases of ParaAMR are syntactically more diverse compared to existing large-scale paraphrase datasets while preserving good semantic similarity. In addition, we show that ParaAMR can be used to improve on three NLP tasks: learning sentence embeddings, syntactically controlled paraphrase generation, and data augmentation for few-shot learning. Our results thus showcase the potential of ParaAMR for improving various NLP applications.
△ Less
Submitted 25 May, 2023;
originally announced May 2023.
-
Efficient Learning of Quantum States Prepared With Few Non-Clifford Gates
Authors:
Sabee Grewal,
Vishnu Iyer,
William Kretschmer,
Daniel Liang
Abstract:
We give a pair of algorithms that efficiently learn a quantum state prepared by Clifford gates and $O(\log n)$ non-Clifford gates. Specifically, for an $n$-qubit state $|ψ\rangle$ prepared with at most $t$ non-Clifford gates, our algorithms use $\mathsf{poly}(n,2^t,1/\varepsilon)$ time and copies of $|ψ\rangle$ to learn $|ψ\rangle$ to trace distance at most $\varepsilon$.
The first algorithm for…
▽ More
We give a pair of algorithms that efficiently learn a quantum state prepared by Clifford gates and $O(\log n)$ non-Clifford gates. Specifically, for an $n$-qubit state $|ψ\rangle$ prepared with at most $t$ non-Clifford gates, our algorithms use $\mathsf{poly}(n,2^t,1/\varepsilon)$ time and copies of $|ψ\rangle$ to learn $|ψ\rangle$ to trace distance at most $\varepsilon$.
The first algorithm for this task is more efficient, but requires entangled measurements across two copies of $|ψ\rangle$. The second algorithm uses only single-copy measurements at the cost of polynomial factors in runtime and sample complexity. Our algorithms more generally learn any state with sufficiently large stabilizer dimension, where a quantum state has stabilizer dimension $k$ if it is stabilized by an abelian group of $2^k$ Pauli operators. We also develop an efficient property testing algorithm for stabilizer dimension, which may be of independent interest.
△ Less
Submitted 4 April, 2024; v1 submitted 22 May, 2023;
originally announced May 2023.
-
Improved Stabilizer Estimation via Bell Difference Sampling
Authors:
Sabee Grewal,
Vishnu Iyer,
William Kretschmer,
Daniel Liang
Abstract:
We study the complexity of learning quantum states in various models with respect to the stabilizer formalism and obtain the following results:
- We prove that $Ω(n)$ $T$-gates are necessary for any Clifford+$T$ circuit to prepare computationally pseudorandom quantum states, an exponential improvement over the previously known bound. This bound is asymptotically tight if linear-time quantum-secu…
▽ More
We study the complexity of learning quantum states in various models with respect to the stabilizer formalism and obtain the following results:
- We prove that $Ω(n)$ $T$-gates are necessary for any Clifford+$T$ circuit to prepare computationally pseudorandom quantum states, an exponential improvement over the previously known bound. This bound is asymptotically tight if linear-time quantum-secure pseudorandom functions exist.
- Given an $n$-qubit pure quantum state $|ψ\rangle$ that has fidelity at least $τ$ with some stabilizer state, we give an algorithm that outputs a succinct description of a stabilizer state that witnesses fidelity at least $τ- \varepsilon$. The algorithm uses $O(n/(\varepsilon^2τ^4))$ samples and $\exp\left(O(n/τ^4)\right) / \varepsilon^2$ time. In the regime of $τ$ constant, this algorithm estimates stabilizer fidelity substantially faster than the naïve $\exp(O(n^2))$-time brute-force algorithm over all stabilizer states.
- In the special case of $τ> \cos^2(π/8)$, we show that a modification of the above algorithm runs in polynomial time.
- We exhibit a tolerant property testing algorithm for stabilizer states.
The underlying algorithmic primitive in all of our results is Bell difference sampling. To prove our results, we establish and/or strengthen connections between Bell difference sampling, symplectic Fourier analysis, and graph theory.
△ Less
Submitted 29 March, 2024; v1 submitted 26 April, 2023;
originally announced April 2023.
-
First measurement of the nuclear-recoil ionization yield in silicon at 100 eV
Authors:
M. F. Albakry,
I. Alkhatib,
D. Alonso,
D. W. P. Amaral,
P. An,
T. Aralis,
T. Aramaki,
I. J. Arnquist,
I. Ataee Langroudy,
E. Azadbakht,
S. Banik,
P. S. Barbeau,
C. Bathurst,
R. Bhattacharyya,
P. L. Brink,
R. Bunker,
B. Cabrera,
R. Calkins,
R. A. Cameron,
C. Cartaro,
D. G. Cerdeño,
Y. -Y. Chang,
M. Chaudhuri,
R. Chen,
N. Chott
, et al. (115 additional authors not shown)
Abstract:
We measured the nuclear--recoil ionization yield in silicon with a cryogenic phonon-sensitive gram-scale detector. Neutrons from a mono-energetic beam scatter off of the silicon nuclei at angles corresponding to energy depositions from 4\,keV down to 100\,eV, the lowest energy probed so far. The results show no sign of an ionization production threshold above 100\,eV. These results call for furthe…
▽ More
We measured the nuclear--recoil ionization yield in silicon with a cryogenic phonon-sensitive gram-scale detector. Neutrons from a mono-energetic beam scatter off of the silicon nuclei at angles corresponding to energy depositions from 4\,keV down to 100\,eV, the lowest energy probed so far. The results show no sign of an ionization production threshold above 100\,eV. These results call for further investigation of the ionization yield theory and a comprehensive determination of the detector response function at energies below the keV scale.
△ Less
Submitted 3 March, 2023;
originally announced March 2023.
-
A Search for Low-mass Dark Matter via Bremsstrahlung Radiation and the Migdal Effect in SuperCDMS
Authors:
M. F. Albakry,
I. Alkhatib,
D. Alonso,
D. W. P. Amaral,
T. Aralis,
T. Aramaki,
I. J. Arnquist,
I. Ataee Langroudy,
E. Azadbakht,
S. Banik,
C. Bathurst,
R. Bhattacharyya,
P. L. Brink,
R. Bunker,
B. Cabrera,
R. Calkins,
R. A. Cameron,
C. Cartaro,
D. G. Cerdeño,
Y. -Y. Chang,
M. Chaudhuri,
R. Chen,
N. Chott,
J. Cooley,
H. Coombes
, et al. (108 additional authors not shown)
Abstract:
We present a new analysis of previously published of SuperCDMS data using a profile likelihood framework to search for sub-GeV dark matter (DM) particles through two inelastic scattering channels: bremsstrahlung radiation and the Migdal effect. By considering these possible inelastic scattering channels, experimental sensitivity can be extended to DM masses that are undetectable through the DM-nuc…
▽ More
We present a new analysis of previously published of SuperCDMS data using a profile likelihood framework to search for sub-GeV dark matter (DM) particles through two inelastic scattering channels: bremsstrahlung radiation and the Migdal effect. By considering these possible inelastic scattering channels, experimental sensitivity can be extended to DM masses that are undetectable through the DM-nucleon elastic scattering channel, given the energy threshold of current experiments. We exclude DM masses down to $220~\textrm{MeV}/c^2$ at $2.7 \times 10^{-30}~\textrm{cm}^2$ via the bremsstrahlung channel. The Migdal channel search provides overall considerably more stringent limits and excludes DM masses down to $30~\textrm{MeV}/c^2$ at $5.0 \times 10^{-30}~\textrm{cm}^2$.
△ Less
Submitted 17 February, 2023;
originally announced February 2023.
-
Photon bunching in cathodoluminescence induced by indirect electron excitation
Authors:
Vasudevan Iyer,
Kevin Roccapriore,
Jacob Ng,
Bernadeta Srijanto,
David Lingerfelt,
Benjamin Lawrie
Abstract:
The impulsive excitation of ensembles of excitons or color centers by a high-energy electron beam results in the observation of photon bunching in the second-order correlation function of the cathodoluminescence generated by those emitters. Photon bunching in cathodoluminescence microscopy can be used to resolve the excited-state dynamics and the excitation and emission efficiency of nanoscale mat…
▽ More
The impulsive excitation of ensembles of excitons or color centers by a high-energy electron beam results in the observation of photon bunching in the second-order correlation function of the cathodoluminescence generated by those emitters. Photon bunching in cathodoluminescence microscopy can be used to resolve the excited-state dynamics and the excitation and emission efficiency of nanoscale materials, and it can be used to probe interactions between emitters and nanophotonic cavities. Here, we report substantial changes in the measured bunching induced by indirect electron interactions (with indirect electron excitation inducing $g^{2}(0)$ values approaching $10^4$). This result is critical to the interpretation of $g^{2}(τ)$ in cathodoluminescence microscopies, and, more importantly, it provides a foundation for the nanoscale characterization of optical properties in beam-sensitive materials.
△ Less
Submitted 25 January, 2023;
originally announced January 2023.
-
Development of a large-mass, low-threshold detector system with simultaneous measurements of athermal phonons and scintillation light
Authors:
M. Chaudhuri,
G. Agnolet,
V. Iyer,
V. K. S. Kashyap,
M. Lee,
R. Mahapatra,
S. Maludze,
N. Mirabolfathi,
B. Mohanty,
M. Platt,
A. Upadhyay,
S. Sahoo,
S. Verma
Abstract:
We have combined two low-threshold detector technologies to develop a large-mass, low-threshold detector system that simultaneously measures the athermal phonons in a sapphire detector while an adjacent silicon high-voltage detector detects the scintillation light from the sapphire detector. This detector system could provide event-by-event discrimination between electron and nuclear events due to…
▽ More
We have combined two low-threshold detector technologies to develop a large-mass, low-threshold detector system that simultaneously measures the athermal phonons in a sapphire detector while an adjacent silicon high-voltage detector detects the scintillation light from the sapphire detector. This detector system could provide event-by-event discrimination between electron and nuclear events due to the difference in their scintillation light yield. While such systems with simultaneous phonon and light detection have been demonstrated earlier with smaller detectors, our system is designed to provide a large detector mass with high amplification for the limited scintillation light. Future work will focus on at least an order of magnitude improvement in the light collection efficiency by having a highly reflective detector housing and custom phonon mask design to maximize light collection by the silicon high-voltage detector.
△ Less
Submitted 8 December, 2022;
originally announced December 2022.
-
Unsupervised Syntactically Controlled Paraphrase Generation with Abstract Meaning Representations
Authors:
Kuan-Hao Huang,
Varun Iyer,
Anoop Kumar,
Sriram Venkatapathy,
Kai-Wei Chang,
Aram Galstyan
Abstract:
Syntactically controlled paraphrase generation has become an emerging research direction in recent years. Most existing approaches require annotated paraphrase pairs for training and are thus costly to extend to new domains. Unsupervised approaches, on the other hand, do not need paraphrase pairs but suffer from relatively poor performance in terms of syntactic control and quality of generated par…
▽ More
Syntactically controlled paraphrase generation has become an emerging research direction in recent years. Most existing approaches require annotated paraphrase pairs for training and are thus costly to extend to new domains. Unsupervised approaches, on the other hand, do not need paraphrase pairs but suffer from relatively poor performance in terms of syntactic control and quality of generated paraphrases. In this paper, we demonstrate that leveraging Abstract Meaning Representations (AMR) can greatly improve the performance of unsupervised syntactically controlled paraphrase generation. Our proposed model, AMR-enhanced Paraphrase Generator (AMRPG), separately encodes the AMR graph and the constituency parse of the input sentence into two disentangled semantic and syntactic embeddings. A decoder is then learned to reconstruct the input sentence from the semantic and syntactic embeddings. Our experiments show that AMRPG generates more accurate syntactically controlled paraphrases, both quantitatively and qualitatively, compared to the existing unsupervised approaches. We also demonstrate that the paraphrases generated by AMRPG can be used for data augmentation to improve the robustness of NLP models.
△ Less
Submitted 2 November, 2022;
originally announced November 2022.
-
The University of Edinburgh's Submission to the WMT22 Code-Mixing Shared Task (MixMT)
Authors:
Faheem Kirefu,
Vivek Iyer,
Pinzhen Chen,
Laurie Burchell
Abstract:
The University of Edinburgh participated in the WMT22 shared task on code-mixed translation. This consists of two subtasks: i) generating code-mixed Hindi/English (Hinglish) text generation from parallel Hindi and English sentences and ii) machine translation from Hinglish to English. As both subtasks are considered low-resource, we focused our efforts on careful data generation and curation, espe…
▽ More
The University of Edinburgh participated in the WMT22 shared task on code-mixed translation. This consists of two subtasks: i) generating code-mixed Hindi/English (Hinglish) text generation from parallel Hindi and English sentences and ii) machine translation from Hinglish to English. As both subtasks are considered low-resource, we focused our efforts on careful data generation and curation, especially the use of backtranslation from monolingual resources. For subtask 1 we explored the effects of constrained decoding on English and transliterated subwords in order to produce Hinglish. For subtask 2, we investigated different pretraining techniques, namely comparing simple initialisation from existing machine translation models and aligned augmentation. For both subtasks, we found that our baseline systems worked best. Our systems for both subtasks were one of the overall top-performing submissions.
△ Less
Submitted 20 October, 2022;
originally announced October 2022.
-
Map** the Pathways of Photo-induced Ion Migration in Organic-inorganic Hybrid Halide Perovskites
Authors:
Taeyong Kim,
Soyeon Park,
Vasudevan Iyer,
Qi Jiang,
Usama Choudhry,
Gage Eichman,
Ryan Gnabasik,
Benjamin Lawrie,
Kai Zhu,
Bolin Liao
Abstract:
Organic-inorganic hybrid perovskites (OIHPs) exhibiting exceptional photovoltaic and optoelectronic properties are of fundamental and practical interest, owing to their tunability and low manufacturing cost. For practical applications, however, challenges such as material instability and the photocurrent hysteresis occurring in perovskite solar cells under light exposure need to be understood and…
▽ More
Organic-inorganic hybrid perovskites (OIHPs) exhibiting exceptional photovoltaic and optoelectronic properties are of fundamental and practical interest, owing to their tunability and low manufacturing cost. For practical applications, however, challenges such as material instability and the photocurrent hysteresis occurring in perovskite solar cells under light exposure need to be understood and addressed. While extensive investigations have suggested that ion migration is a plausible origin of these detrimental effects, detailed understanding of the ion migration pathways remains elusive. Here, we report the characterization of photo-induced ion migration in OIHPs using \textit{in situ} laser illumination inside a scanning electron microscope, coupled with secondary electron imaging, energy-dispersive X-ray spectroscopy and cathodoluminescence with varying primary electron energies. Using methylammonium lead iodide (MAPbI$_3$), formamidinium lead iodide (FAPbI$_3$) and hybrid formamidinium-methylammonium lead iodide as model systems, we observed photo-induced long-range migration of halide ions over hundreds of micrometers and elucidated the transport pathways of various ions both near the surface and inside the bulk of the OIHPs, including a surprising finding of the vertical migration of lead ions. Our study provides insights into ion migration processes in OIHPs that can aid OIHP material design and processing in future applications.
△ Less
Submitted 10 October, 2022;
originally announced October 2022.
-
Scalable Self-Supervised Representation Learning from Spatiotemporal Motion Trajectories for Multimodal Computer Vision
Authors:
Swetava Ganguli,
C. V. Krishnakumar Iyer,
Vipul Pandey
Abstract:
Self-supervised representation learning techniques utilize large datasets without semantic annotations to learn meaningful, universal features that can be conveniently transferred to solve a wide variety of downstream supervised tasks. In this work, we propose a self-supervised method for learning representations of geographic locations from unlabeled GPS trajectories to solve downstream geospatia…
▽ More
Self-supervised representation learning techniques utilize large datasets without semantic annotations to learn meaningful, universal features that can be conveniently transferred to solve a wide variety of downstream supervised tasks. In this work, we propose a self-supervised method for learning representations of geographic locations from unlabeled GPS trajectories to solve downstream geospatial computer vision tasks. Tiles resulting from a raster representation of the earth's surface are modeled as nodes on a graph or pixels of an image. GPS trajectories are modeled as allowed Markovian paths on these nodes. A scalable and distributed algorithm is presented to compute image-like representations, called reachability summaries, of the spatial connectivity patterns between tiles and their neighbors implied by the observed Markovian paths. A convolutional, contractive autoencoder is trained to learn compressed representations, called reachability embeddings, of reachability summaries for every tile. Reachability embeddings serve as task-agnostic, feature representations of geographic locations. Using reachability embeddings as pixel representations for five different downstream geospatial tasks, cast as supervised semantic segmentation problems, we quantitatively demonstrate that reachability embeddings are semantically meaningful representations and result in 4-23% gain in performance, as measured using area under the precision-recall curve (AUPRC) metric, when compared to baseline models that use pixel representations that do not account for the spatial connectivity between tiles. Reachability embeddings transform sequential, spatiotemporal mobility data into semantically meaningful tensor representations that can be combined with other sources of imagery and are designed to facilitate multimodal learning in geospatial computer vision.
△ Less
Submitted 6 October, 2022;
originally announced October 2022.
-
DIAGNOSE: Avoiding Out-of-distribution Data using Submodular Information Measures
Authors:
Suraj Kothawade,
Akshit Srivastava,
Venkat Iyer,
Ganesh Ramakrishnan,
Rishabh Iyer
Abstract:
Avoiding out-of-distribution (OOD) data is critical for training supervised machine learning models in the medical imaging domain. Furthermore, obtaining labeled medical data is difficult and expensive since it requires expert annotators like doctors, radiologists, etc. Active learning (AL) is a well-known method to mitigate labeling costs by selecting the most diverse or uncertain samples. Howeve…
▽ More
Avoiding out-of-distribution (OOD) data is critical for training supervised machine learning models in the medical imaging domain. Furthermore, obtaining labeled medical data is difficult and expensive since it requires expert annotators like doctors, radiologists, etc. Active learning (AL) is a well-known method to mitigate labeling costs by selecting the most diverse or uncertain samples. However, current AL methods do not work well in the medical imaging domain with OOD data. We propose Diagnose (avoiDing out-of-dIstribution dAta usinG submodular iNfOrmation meaSurEs), a novel active learning framework that can jointly model similarity and dissimilarity, which is crucial in mining in-distribution data and avoiding OOD data at the same time. Particularly, we use a small number of data points as exemplars that represent a query set of in-distribution data points and a private set of OOD data points. We illustrate the generalizability of our framework by evaluating it on a wide variety of real-world OOD scenarios. Our experiments verify the superiority of Diagnose over the state-of-the-art AL methods across multiple domains of medical imaging.
△ Less
Submitted 4 October, 2022;
originally announced October 2022.
-
CLINICAL: Targeted Active Learning for Imbalanced Medical Image Classification
Authors:
Suraj Kothawade,
Atharv Savarkar,
Venkat Iyer,
Lakshman Tamil,
Ganesh Ramakrishnan,
Rishabh Iyer
Abstract:
Training deep learning models on medical datasets that perform well for all classes is a challenging task. It is often the case that a suboptimal performance is obtained on some classes due to the natural class imbalance issue that comes with medical data. An effective way to tackle this problem is by using targeted active learning, where we iteratively add data points to the training data that be…
▽ More
Training deep learning models on medical datasets that perform well for all classes is a challenging task. It is often the case that a suboptimal performance is obtained on some classes due to the natural class imbalance issue that comes with medical data. An effective way to tackle this problem is by using targeted active learning, where we iteratively add data points to the training data that belong to the rare classes. However, existing active learning methods are ineffective in targeting rare classes in medical datasets. In this work, we propose Clinical (targeted aCtive Learning for ImbalaNced medICal imAge cLassification) a framework that uses submodular mutual information functions as acquisition functions to mine critical data points from rare classes. We apply our framework to a wide-array of medical imaging datasets on a variety of real-world class imbalance scenarios - namely, binary imbalance and long-tail imbalance. We show that Clinical outperforms the state-of-the-art active learning methods by acquiring a diverse set of data points that belong to the rare classes.
△ Less
Submitted 4 October, 2022;
originally announced October 2022.
-
Low-Stabilizer-Complexity Quantum States Are Not Pseudorandom
Authors:
Sabee Grewal,
Vishnu Iyer,
William Kretschmer,
Daniel Liang
Abstract:
We show that quantum states with "low stabilizer complexity" can be efficiently distinguished from Haar-random. Specifically, given an $n$-qubit pure state $|ψ\rangle$, we give an efficient algorithm that distinguishes whether $|ψ\rangle$ is (i) Haar-random or (ii) a state with stabilizer fidelity at least $\frac{1}{k}$ (i.e., has fidelity at least $\frac{1}{k}$ with some stabilizer state), promis…
▽ More
We show that quantum states with "low stabilizer complexity" can be efficiently distinguished from Haar-random. Specifically, given an $n$-qubit pure state $|ψ\rangle$, we give an efficient algorithm that distinguishes whether $|ψ\rangle$ is (i) Haar-random or (ii) a state with stabilizer fidelity at least $\frac{1}{k}$ (i.e., has fidelity at least $\frac{1}{k}$ with some stabilizer state), promised that one of these is the case. With black-box access to $|ψ\rangle$, our algorithm uses $O\!\left( k^{12} \log(1/δ)\right)$ copies of $|ψ\rangle$ and $O\!\left(n k^{12} \log(1/δ)\right)$ time to succeed with probability at least $1-δ$, and, with access to a state preparation unitary for $|ψ\rangle$ (and its inverse), $O\!\left( k^{3} \log(1/δ)\right)$ queries and $O\!\left(n k^{3} \log(1/δ)\right)$ time suffice.
As a corollary, we prove that $ω(\log(n))$ $T$-gates are necessary for any Clifford+$T$ circuit to prepare computationally pseudorandom quantum states, a first-of-its-kind lower bound.
△ Less
Submitted 28 September, 2022;
originally announced September 2022.
-
Asymmetric Light Bending in the Equatorial Kerr Metric
Authors:
Arthur B. Congdon,
Savitri V. Iyer,
Charles R. Keeton
Abstract:
The observation of the bending of light by mass, now known as gravitational lensing, was key in establishing general relativity as one of the pillars of modern physics. In the past couple of decades, there has been increasing interest in using gravitational lensing to test general relativity beyond the weak deflection limit. Black holes and neutron stars produce the strong gravitational fields nee…
▽ More
The observation of the bending of light by mass, now known as gravitational lensing, was key in establishing general relativity as one of the pillars of modern physics. In the past couple of decades, there has been increasing interest in using gravitational lensing to test general relativity beyond the weak deflection limit. Black holes and neutron stars produce the strong gravitational fields needed for such tests. For a rotating compact object, the distinction between prograde and retrograde photon trajectories becomes important. In this paper, we explore subtleties that arise in interpreting the bending angle in this context and address the origin of seemingly contradictory results in the literature. We argue that analogies that cannot be precisely quantified present a source of confusion.
△ Less
Submitted 18 July, 2022;
originally announced July 2022.
-
A Relative Church-Turing-Deutsch Thesis from Special Relativity and Undecidability
Authors:
Blake Wilson,
Ethan Dickey,
Vaishnavi Iyer,
Sabre Kais
Abstract:
Beginning with Turing's seminal work in 1950, artificial intelligence proposes that consciousness can be simulated by a Turing machine. This implies a potential theory of everything where the universe is a simulation on a computer, which begs the question of whether we can prove we exist in a simulation. In this work, we construct a relative model of computation where a computable \textit{local} m…
▽ More
Beginning with Turing's seminal work in 1950, artificial intelligence proposes that consciousness can be simulated by a Turing machine. This implies a potential theory of everything where the universe is a simulation on a computer, which begs the question of whether we can prove we exist in a simulation. In this work, we construct a relative model of computation where a computable \textit{local} machine is simulated by a \textit{global}, classical Turing machine. We show that the problem of the local machine computing \textbf{simulation properties} of its global simulator is undecidable in the same sense as the Halting problem. Then, we show that computing the time, space, or error accumulated by the global simulator are simulation properties and therefore are undecidable. These simulation properties give rise to special relativistic effects in the relative model which we use to construct a relative Church-Turing-Deutsch thesis where a global, classical Turing machine computes quantum mechanics for a local machine with the same constant-time local computational complexity as experienced in our universe.
△ Less
Submitted 13 June, 2022;
originally announced June 2022.
-
Effective Field Theory Analysis of CDMSlite Run 2 Data
Authors:
SuperCDMS Collaboration,
M. F. Albakry,
I. Alkhatib,
D. W. P. Amaral,
T. Aralis,
T. Aramaki,
I. J. Arnquist,
I. Ataee Langroudy,
E. Azadbakht,
S. Banik,
C. Bathurst,
D. A. Bauer,
L. V. S. Bezerra,
R. Bhattacharyya,
P. L. Brink,
R. Bunker,
B. Cabrera,
R. Calkins,
R. A. Cameron,
C. Cartaro,
D. G. Cerdeño,
Y. -Y. Chang,
M. Chaudhuri,
R. Chen,
N. Chott
, et al. (105 additional authors not shown)
Abstract:
CDMSlite Run 2 was a search for weakly interacting massive particles (WIMPs) with a cryogenic 600 g Ge detector operated in a high-voltage mode to optimize sensitivity to WIMPs of relatively low mass from 2 - 20 GeV/$c^2$. In this article, we present an effective field theory (EFT) analysis of the CDMSlite Run 2 data using an extended energy range and a comprehensive treatment of the expected back…
▽ More
CDMSlite Run 2 was a search for weakly interacting massive particles (WIMPs) with a cryogenic 600 g Ge detector operated in a high-voltage mode to optimize sensitivity to WIMPs of relatively low mass from 2 - 20 GeV/$c^2$. In this article, we present an effective field theory (EFT) analysis of the CDMSlite Run 2 data using an extended energy range and a comprehensive treatment of the expected background. A binned likelihood Bayesian analysis was performed on the recoil energy data, taking into account the parameters of the EFT interactions and optimizing the data selection with respect to the dominant background components. Energy regions within 5$σ$ of known activation peaks were removed from the analysis. The Bayesian evidences resulting from the different operator hypotheses show that the CDMSlite Run 2 data are consistent with the background-only models and do not allow for a signal interpretation assuming any additional EFT interaction. Consequently, upper limits on the WIMP mass and coupling-coefficient amplitudes and phases are presented for each EFT operator. These limits improve previous CDMSlite Run 2 bounds for WIMP masses above 5 GeV/$c^2$.
△ Less
Submitted 23 May, 2022;
originally announced May 2022.
-
Investigating the sources of low-energy events in a SuperCDMS-HVeV detector
Authors:
SuperCDMS Collaboration,
M. F. Albakry,
I. Alkhatib,
D. W. P. Amaral,
T. Aralis,
T. Aramaki,
I. J. Arnquist,
I. Ataee Langroudy,
E. Azadbakht,
S. Banik,
C. Bathurst,
D. A. Bauer,
R. Bhattacharyya,
P. L. Brink,
R. Bunker,
B. Cabrera,
R. Calkins,
R. A. Cameron,
C. Cartaro,
D. G. Cerdeño,
Y. -Y. Chang,
M. Chaudhuri,
R. Chen,
N. Chott,
J. Cooley
, et al. (104 additional authors not shown)
Abstract:
Recent experiments searching for sub-GeV/$c^2$ dark matter have observed event excesses close to their respective energy thresholds. Although specific to the individual technologies, the measured excess event rates have been consistently reported at or below event energies of a few-hundred eV, or with charges of a few electron-hole pairs. In the present work, we operated a 1-gram silicon SuperCDMS…
▽ More
Recent experiments searching for sub-GeV/$c^2$ dark matter have observed event excesses close to their respective energy thresholds. Although specific to the individual technologies, the measured excess event rates have been consistently reported at or below event energies of a few-hundred eV, or with charges of a few electron-hole pairs. In the present work, we operated a 1-gram silicon SuperCDMS-HVeV detector at three voltages across the crystal (0 V, 60 V and 100 V). The 0 V data show an excess of events in the tens of eV region. Despite this event excess, we demonstrate the ability to set a competitive exclusion limit on the spin-independent dark matter--nucleon elastic scattering cross section for dark matter masses of $\mathcal{O}(100)$ MeV/$c^2$, enabled by operation of the detector at 0 V potential and achievement of a very low $\mathcal{O}(10)$ eV threshold for nuclear recoils. Comparing the data acquired at 0 V, 60 V and 100 V potentials across the crystal, we investigated possible sources of the unexpected events observed at low energy. The data indicate that the dominant contribution to the excess is consistent with a hypothesized luminescence from the printed circuit boards used in the detector holder.
△ Less
Submitted 11 October, 2022; v1 submitted 17 April, 2022;
originally announced April 2022.
-
Large-mass, low-threshold sapphire detector for rare event searches
Authors:
S. Verma,
S. Maludze,
M. Lee,
M. Chaudhuri,
V. Iyer,
V. K. S. Kashyap,
A. Kubik,
T. Lin,
R. Mahapatra,
N. Mirabolfathi,
N. Mishra,
B. Mohanty,
H. Neog,
A. Jastram,
M. Platt Platta
Abstract:
Low mass nuclear recoil dark matter and coherent-elastic-neutrino-nucleus-scattering (CENNS) searches confront similar challenges in choosing ultra-low threshold and large-mass detectors. We report experimental results from the first-of-its-kind 100 g single-crystal sapphire detector design with a diameter of 76 mm and thickness of 4 mm. The detector is designed to be sensitive for low-energy rare…
▽ More
Low mass nuclear recoil dark matter and coherent-elastic-neutrino-nucleus-scattering (CENNS) searches confront similar challenges in choosing ultra-low threshold and large-mass detectors. We report experimental results from the first-of-its-kind 100 g single-crystal sapphire detector design with a diameter of 76 mm and thickness of 4 mm. The detector is designed to be sensitive for low-energy rare interactions with an intention to investigate the low mass region of dark matter phase-space and search for CENNS at the reactor site. Sapphire is a crystal of aluminum oxide (Al2O3) and has been found to be a good candidate for light mass spin-dependent dark matter search experiments due to its lower atomic mass compared to other detector materials such as germanium and silicon. Using the data collected from the test facility at Texas A&M University, we were able to resolve low energy lines from calibration sources and estimated that our newly developed sapphire detector has a baseline recoil energy resolution of 18 eV. These detectors are operated at 0 V with the phonon-assisted detection providing a quenching-free low-threshold operation.
△ Less
Submitted 26 March, 2022;
originally announced March 2022.
-
Correlative nanoscale imaging of strained hBN spin defects
Authors:
David Curie,
Jaron T. Krogel,
Lukas Cavar,
Abhishek Solanki,
Pramey Upadhyaya,
Tongcang Li,
Yun-Yi Pai,
Michael Chilcote,
Vasudevan Iyer,
Alex Puretzky,
Ilia Ivanov,
Mao-Hua Du,
Fernando Reboredo,
Benjamin Lawrie
Abstract:
Spin defects like the negatively charged boron vacancy color center ($V_B^-$) in hexagonal boron nitride (hBN) may enable new forms of quantum sensing with near-surface defects in layered van der Waals heterostructures. Here, we reveal the effect of strain associated with creases in hBN flakes on $V_B^-$ and $V_B$ color centers in hBN with correlative cathodoluminescence and photoluminescence micr…
▽ More
Spin defects like the negatively charged boron vacancy color center ($V_B^-$) in hexagonal boron nitride (hBN) may enable new forms of quantum sensing with near-surface defects in layered van der Waals heterostructures. Here, we reveal the effect of strain associated with creases in hBN flakes on $V_B^-$ and $V_B$ color centers in hBN with correlative cathodoluminescence and photoluminescence microscopies. We observe strong localized enhancement and redshifting of the $V_B^-$ luminescence at creases, consistent with density functional theory calculations showing $V_B^-$ migration toward regions with moderate uniaxial compressive strain. The ability to manipulate these spin defects with highly localized strain offers intriguing possibilities for future 2D quantum sensors.
△ Less
Submitted 18 March, 2022;
originally announced March 2022.
-
A Strategy for Low-Mass Dark Matter Searches with Cryogenic Detectors in the SuperCDMS SNOLAB Facility
Authors:
SuperCDMS Collaboration,
M. F. Albakry,
I. Alkhatib,
D. W. P. Amaral,
T. Aralis,
T. Aramaki,
I. J. Arnquist,
I. Ataee Langroudy,
E. Azadbakht,
S. Banik,
C. Bathurst,
D. A. Bauer,
R. Bhattacharyya,
P. L. Brink,
R. Bunker,
B. Cabrera,
R. Calkins,
R. A. Cameron,
C. Cartaro,
D. G. Cerdeno,
Y. -Y. Chang,
M. Chaudhuri,
R. Chen,
N. Chott,
J. Cooley
, et al. (103 additional authors not shown)
Abstract:
The SuperCDMS Collaboration is currently building SuperCDMS SNOLAB, a dark matter search focused on nucleon-coupled dark matter in the 1-5 GeV/c$^2$ mass range. Looking to the future, the Collaboration has developed a set of experience-based upgrade scenarios, as well as novel directions, to extend the search for dark matter using the SuperCDMS technology in the SNOLAB facility. The experienced-ba…
▽ More
The SuperCDMS Collaboration is currently building SuperCDMS SNOLAB, a dark matter search focused on nucleon-coupled dark matter in the 1-5 GeV/c$^2$ mass range. Looking to the future, the Collaboration has developed a set of experience-based upgrade scenarios, as well as novel directions, to extend the search for dark matter using the SuperCDMS technology in the SNOLAB facility. The experienced-based scenarios are forecasted to probe many square decades of unexplored dark matter parameter space below 5 GeV/c$^2$, covering over 6 decades in mass: 1-100 eV/c$^2$ for dark photons and axion-like particles, 1-100 MeV/c$^2$ for dark-photon-coupled light dark matter, and 0.05-5 GeV/c$^2$ for nucleon-coupled dark matter. They will reach the neutrino fog in the 0.5-5 GeV/c$^2$ mass range and test a variety of benchmark models and sharp targets. The novel directions involve greater departures from current SuperCDMS technology but promise even greater reach in the long run, and their development must begin now for them to be available in a timely fashion.
The experienced-based upgrade scenarios rely mainly on dramatic improvements in detector performance based on demonstrated scaling laws and reasonable extrapolations of current performance. Importantly, these improvements in detector performance obviate significant reductions in background levels beyond current expectations for the SuperCDMS SNOLAB experiment. Given that the dominant limiting backgrounds for SuperCDMS SNOLAB are cosmogenically created radioisotopes in the detectors, likely amenable only to isotopic purification and an underground detector life-cycle from before crystal growth to detector testing, the potential cost and time savings are enormous and the necessary improvements much easier to prototype.
△ Less
Submitted 1 April, 2023; v1 submitted 16 March, 2022;
originally announced March 2022.
-
A Search for Low-mass Dark Matter via Bremsstrahlung Radiation and the Migdal Effect in SuperCDMS
Authors:
SuperCDMS Collaboration,
Musaab Al-Bakry,
Imran Alkhatib,
Dorian Praia do Amaral,
Taylor Aralis,
Tsuguo Aramaki,
Isaac Arnquist,
Iman Ataee Langroudy,
Elham Azadbakht,
Samir Banik,
Corey Bathurst,
Dan Bauer,
Lucas Bezerra,
Rik Bhattacharyya,
Paul Brink,
Ray Bunker,
Blas Cabrera,
Robert Calkins,
Robert Cameron,
Concetta Cartaro,
David Cerdeno,
Yen-Yung Chang,
Mouli Chaudhuri,
Ran Chen,
Nicholas Chott
, et al. (106 additional authors not shown)
Abstract:
In this paper, we present a re-analysis of SuperCDMS data using a profile likelihood approach to search for sub-GeV dark matter particles (DM) through two inelastic scattering channels: bremsstrahlung radiation and the Migdal effect. By considering possible inelastic scattering channels, experimental sensitivity can be extended to DM masses that would otherwise be undetectable through the DM-nucle…
▽ More
In this paper, we present a re-analysis of SuperCDMS data using a profile likelihood approach to search for sub-GeV dark matter particles (DM) through two inelastic scattering channels: bremsstrahlung radiation and the Migdal effect. By considering possible inelastic scattering channels, experimental sensitivity can be extended to DM masses that would otherwise be undetectable through the DM-nucleon elastic scattering channel, given the energy threshold of current experiments. We exclude DM masses down to $220~\textrm{MeV}/c^2$ at $2.7 \times 10^{-30}~\textrm{cm}^2$ via the bremsstrahlung channel. The Migdal channel search excludes DM masses down to $30~\textrm{MeV}/c^2$ at $5.0 \times 10^{-30}~\textrm{cm}^2$.
△ Less
Submitted 19 May, 2022; v1 submitted 4 March, 2022;
originally announced March 2022.
-
A novel active veto prototype detector with an inner target for improved rare event searches
Authors:
M. Chaudhuri,
A. Jastram,
G. Agnolet,
S. Banik,
H. Chen,
V. Iyer,
V. K. S. Kashyap,
A. Kubik,
M. Lee,
R. Mahapatra,
S. Maludze,
N. Mirabolfathi,
N. Mishra,
B. Mohanty,
H. Neog,
M. Platt
Abstract:
We report the fabrication and performance of an annular, cryogenic, phonon-mediated veto detector that can host an inner target detector, allowing substantial reduction in radiogenic backgrounds for rare event search experiments. A germanium veto detector of mass $\sim$500 g with an outer diameter of 76 mm and an inner diameter of 28 mm was produced. A 25 mm diameter germanium inner target detecto…
▽ More
We report the fabrication and performance of an annular, cryogenic, phonon-mediated veto detector that can host an inner target detector, allowing substantial reduction in radiogenic backgrounds for rare event search experiments. A germanium veto detector of mass $\sim$500 g with an outer diameter of 76 mm and an inner diameter of 28 mm was produced. A 25 mm diameter germanium inner target detector of mass $\sim$10 g was mounted inside the veto detector. The detector was designed using inputs from a GEANT4 based simulation, where it was modeled to be sandwiched between two germanium detectors. The simulation showed that the background rates (dominantly gamma interactions) could be reduced by $>$ 90$\%$, and that such an arrangement is sufficient for aggressive background reduction needed for neutrino and dark matter search experiments. During testing at the experimental site the veto detector prototype achieved a baseline resolution of 1.24 $\pm$ 0.02 keV while hosting a functional inner target detector. The baseline resolution of the inner target detector was 147 $\pm$ 2 eV. The detectors were operated at mK temperatures. The experimental results of an identical detector arrangement are in excellent agreement with the simulation.
△ Less
Submitted 22 February, 2022;
originally announced February 2022.
-
Ionization yield measurement in a germanium CDMSlite detector using photo-neutron sources
Authors:
SuperCDMS Collaboration,
M. F. Albakry,
I. Alkhatib,
D. W. P. Amaral,
T. Aralis,
T. Aramaki,
I. J. Arnquist,
I. Ataee Langroudy,
E. Azadbakht,
S. Banik,
C. Bathurst,
D. A. Bauer,
L. V. S. Bezerra,
R. Bhattacharyya,
M. A. Bowles,
P. L. Brink,
R. Bunker,
B. Cabrera,
R. Calkins,
R. A. Cameron,
C. Cartaro,
D. G. Cerdeño,
Y. -Y. Chang,
M. Chaudhuri,
R. Chen
, et al. (104 additional authors not shown)
Abstract:
Two photo-neutron sources, $^{88}$Y$^{9}$Be and $^{124}$Sb$^{9}$Be, have been used to investigate the ionization yield of nuclear recoils in the CDMSlite germanium detectors by the SuperCDMS collaboration. This work evaluates the yield for nuclear recoil energies between 1 keV and 7 keV at a temperature of $\sim$ 50 mK. We use a Geant4 simulation to model the neutron spectrum assuming a charge yie…
▽ More
Two photo-neutron sources, $^{88}$Y$^{9}$Be and $^{124}$Sb$^{9}$Be, have been used to investigate the ionization yield of nuclear recoils in the CDMSlite germanium detectors by the SuperCDMS collaboration. This work evaluates the yield for nuclear recoil energies between 1 keV and 7 keV at a temperature of $\sim$ 50 mK. We use a Geant4 simulation to model the neutron spectrum assuming a charge yield model that is a generalization of the standard Lindhard model and consists of two energy dependent parameters. We perform a likelihood analysis using the simulated neutron spectrum, modeled background, and experimental data to obtain the best fit values of the yield model. The ionization yield between recoil energies of 1 keV and 7 keV is shown to be significantly lower than predicted by the standard Lindhard model for germanium. There is a general lack of agreement among different experiments using a variety of techniques studying the low-energy range of the nuclear recoil yield, which is most critical for interpretation of direct dark matter searches. This suggests complexity in the physical process that many direct detection experiments use to model their primary signal detection mechanism and highlights the need for further studies to clarify underlying systematic effects that have not been well understood up to this point.
△ Less
Submitted 27 June, 2022; v1 submitted 14 February, 2022;
originally announced February 2022.
-
EXCESS workshop: Descriptions of rising low-energy spectra
Authors:
P. Adari,
A. Aguilar-Arevalo,
D. Amidei,
G. Angloher,
E. Armengaud,
C. Augier,
L. Balogh,
S. Banik,
D. Baxter,
C. Beaufort,
G. Beaulieu,
V. Belov,
Y. Ben Gal,
G. Benato,
A. Benoît,
A. Bento,
L. Bergé,
A. Bertolini,
R. Bhattacharyya,
J. Billard,
I. M. Bloch,
A. Botti,
R. Breier,
G. Bres,
J-. L. Bret
, et al. (281 additional authors not shown)
Abstract:
Many low-threshold experiments observe sharply rising event rates of yet unknown origins below a few hundred eV, and larger than expected from known backgrounds. Due to the significant impact of this excess on the dark matter or neutrino sensitivity of these experiments, a collective effort has been started to share the knowledge about the individual observations. For this, the EXCESS Workshop was…
▽ More
Many low-threshold experiments observe sharply rising event rates of yet unknown origins below a few hundred eV, and larger than expected from known backgrounds. Due to the significant impact of this excess on the dark matter or neutrino sensitivity of these experiments, a collective effort has been started to share the knowledge about the individual observations. For this, the EXCESS Workshop was initiated. In its first iteration in June 2021, ten rare event search collaborations contributed to this initiative via talks and discussions. The contributing collaborations were CONNIE, CRESST, DAMIC, EDELWEISS, MINER, NEWS-G, NUCLEUS, RICOCHET, SENSEI and SuperCDMS. They presented data about their observed energy spectra and known backgrounds together with details about the respective measurements. In this paper, we summarize the presented information and give a comprehensive overview of the similarities and differences between the distinct measurements. The provided data is furthermore publicly available on the workshop's data repository together with a plotting tool for visualization.
△ Less
Submitted 4 March, 2022; v1 submitted 10 February, 2022;
originally announced February 2022.
-
Hyperspectral nanoscale map** of hybrid perovskite photophysics at the single grain level
Authors:
Ethan J. Taylor,
Vasudevan Iyer,
Bibek S. Dhami,
Clay Klein,
Benjamin J. Lawrie,
Kannatassen Appavoo
Abstract:
Hybrid organic-inorganic perovskites have drawn significant interest for applications in optoelectronics over the last few years. Despite rapid progress in understanding the photophysics of perovskite, there remains a need for improved understanding of the effect of microstructure on perovskite photophysical processes. Here, we combine unsupervised machine learning and cathodoluminescence microsco…
▽ More
Hybrid organic-inorganic perovskites have drawn significant interest for applications in optoelectronics over the last few years. Despite rapid progress in understanding the photophysics of perovskite, there remains a need for improved understanding of the effect of microstructure on perovskite photophysical processes. Here, we combine unsupervised machine learning and cathodoluminescence microscopy of a prototypical hybrid perovskite film to decode photophysical processes that are otherwise lost with conventional Gaussian image processing. Hyperspectral maps are decoded with non-negative matrix factorization, revealing components relating to primary band-edge emission, photon recycling, and defect emission. A blind-spectral non-negative matrix factorization procedure provides additional understanding of changes in an intermediate perovskite phase under electron beam exposure and illustrates how traditional Gaussian techniques may hide relevant emission features that are critical to the development of environmentally robust perovskite devices
△ Less
Submitted 17 January, 2022;
originally announced January 2022.
-
Angle-Resolved Cathodoluminescence Polarimetry of Hybrid Perovskites
Authors:
Bibek S. Dhami,
Vasudevan Iyer,
Aniket Pant,
Ravi P. N. Tripathi,
Benjamin J. Lawrie,
Kannatassen Appavoo
Abstract:
Coupling between light and matter strongly depends on the polarization of the electromagnetic field and the nature of the excitations in the material. As hybrid perovskites emerge as a promising class of materials for light-based technologies like LEDs, lasers, and photodetectors, understanding the microscopic details of how photons couple to matter is critical. While most optical studies have foc…
▽ More
Coupling between light and matter strongly depends on the polarization of the electromagnetic field and the nature of the excitations in the material. As hybrid perovskites emerge as a promising class of materials for light-based technologies like LEDs, lasers, and photodetectors, understanding the microscopic details of how photons couple to matter is critical. While most optical studies have focused on the spectral content and quantum efficiency of emitted photons in various hybrid perovskite thin-film and nanoscale structures, few studies have explored other properties of the emitted photons such as polarization and emission angle. Here, we use angle-resolved cathodoluminescence microscopy to access the full polarization state of photons emitted from large-grain hybrid perovskite films with spatial resolution well below the optical diffraction limit. Map** the Stokes parameters as a function of the emission angle in a thin film, we reveal the effect of a grain boundary on the degree of polarization and angle at which the photons are emitted. This exploration of angle- and polarization-resolved emission near grain boundaries provides an improved understanding of the emission properties of hybrid perovskites in thin film geometries -- a necessary investigation for subsequent engineering of subwavelength nanophotonic structures using the hybrid perovskite class of materials.
△ Less
Submitted 17 January, 2022;
originally announced January 2022.
-
A Deep Learning Approach for Ontology Enrichment from Unstructured Text
Authors:
Lalit Mohan Sanagavarapu,
Vivek Iyer,
Raghu Reddy
Abstract:
Information Security in the cyber world is a major cause for concern, with a significant increase in the number of attack surfaces. Existing information on vulnerabilities, attacks, controls, and advisories available on the web provides an opportunity to represent knowledge and perform security analytics to mitigate some of the concerns. Representing security knowledge in the form of ontology faci…
▽ More
Information Security in the cyber world is a major cause for concern, with a significant increase in the number of attack surfaces. Existing information on vulnerabilities, attacks, controls, and advisories available on the web provides an opportunity to represent knowledge and perform security analytics to mitigate some of the concerns. Representing security knowledge in the form of ontology facilitates anomaly detection, threat intelligence, reasoning and relevance attribution of attacks, and many more. This necessitates dynamic and automated enrichment of information security ontologies. However, existing ontology enrichment algorithms based on natural language processing and ML models have issues with contextual extraction of concepts in words, phrases, and sentences. This motivates the need for sequential Deep Learning architectures that traverse through dependency paths in text and extract embedded vulnerabilities, threats, controls, products, and other security-related concepts and instances from learned path representations. In the proposed approach, Bidirectional LSTMs trained on a large DBpedia dataset and Wikipedia corpus of 2.8 GB along with Universal Sentence Encoder is deployed to enrich ISO 27001-based information security ontology. The model is trained and tested on a high-performance computing (HPC) environment to handle Wiki text dimensionality. The approach yielded a test accuracy of over 80% when tested with knocked-out concepts from ontology and web page instances to validate the robustness.
△ Less
Submitted 15 December, 2021;
originally announced December 2021.
-
A framework for syntactic and semantic quality evaluation of ontologies
Authors:
Vivek Iyer,
Lalit Mohan Sanagavarapu,
Raghu Reddy
Abstract:
The increasing focus on Web 3.0 is leading to automated creation and enrichment of ontologies and other linked datasets. Alongside automation, quality evaluation of enriched ontologies can impact software reliability and reuse. Current quality evaluation approaches oftentimes seek to evaluate ontologies in either syntactic (degree of following ontology development guidelines) or semantic (degree o…
▽ More
The increasing focus on Web 3.0 is leading to automated creation and enrichment of ontologies and other linked datasets. Alongside automation, quality evaluation of enriched ontologies can impact software reliability and reuse. Current quality evaluation approaches oftentimes seek to evaluate ontologies in either syntactic (degree of following ontology development guidelines) or semantic (degree of semantic validity of enriched concepts/relations) aspects. This paper proposes an ontology quality evaluation framework consisting of: (a) SynEvaluator and (b) SemValidator for evaluating syntactic and semantic aspects of ontologies respectively. SynEvaluator allows dynamic task-specific creation and updation of syntactic rules at run-time without any need for programming. SemValidator uses Twitter-based expertise of validators for semantic evaluation. The efficacy and validity of the framework is shown empirically on multiple ontologies.
△ Less
Submitted 15 December, 2021;
originally announced December 2021.
-
Reachability Embeddings: Scalable Self-Supervised Representation Learning from Mobility Trajectories for Multimodal Geospatial Computer Vision
Authors:
Swetava Ganguli,
C. V. Krishnakumar Iyer,
Vipul Pandey
Abstract:
Self-supervised representation learning techniques utilize large datasets without semantic annotations to learn meaningful, universal features that can be conveniently transferred to solve a wide variety of downstream supervised tasks. In this paper, we propose a self-supervised method for learning representations of geographic locations from unlabeled GPS trajectories to solve downstream geospati…
▽ More
Self-supervised representation learning techniques utilize large datasets without semantic annotations to learn meaningful, universal features that can be conveniently transferred to solve a wide variety of downstream supervised tasks. In this paper, we propose a self-supervised method for learning representations of geographic locations from unlabeled GPS trajectories to solve downstream geospatial computer vision tasks. Tiles resulting from a raster representation of the earth's surface are modeled as nodes on a graph or pixels of an image. GPS trajectories are modeled as allowed Markovian paths on these nodes. A scalable and distributed algorithm is presented to compute image-like tensors, called reachability summaries, of the spatial connectivity patterns between tiles and their neighbors implied by the observed Markovian paths. A convolutional, contractive autoencoder is trained to learn compressed representations, called reachability embeddings, of reachability summaries for every tile. Reachability embeddings serve as task-agnostic, feature representations of geographic locations. Using reachability embeddings as pixel representations for five different downstream geospatial tasks, cast as supervised semantic segmentation problems, we quantitatively demonstrate that reachability embeddings are semantically meaningful representations and result in 4-23% gain in performance, while using upto 67% less trajectory data, as measured using area under the precision-recall curve (AUPRC) metric, when compared to baseline models that use pixel representations that do not account for the spatial connectivity between tiles. Reachability embeddings transform sequential, spatiotemporal mobility data into semantically meaningful image-like tensor representations that can be combined with other sources of imagery and are designed to facilitate multimodal learning in geospatial computer vision.
△ Less
Submitted 15 July, 2022; v1 submitted 24 October, 2021;
originally announced October 2021.