-
Hybrid approach predicts a lower binding energy for benzene on water ice
Authors:
Victoria H. J. Clark,
David M. Benoit,
Marie Van de Sande,
Catherine Walsh
Abstract:
In this paper we provide a highly accurate value for the binding energy of benzene to proton-ordered crystalline water ice (XIh), as a model for interstellar ices. We compare our computed value to the latest experimental data available from temperature programmed desorption (TPD) experiments and find that our binding energy value agrees well with data obtained from binding to either crystalline or amorphous ice. Importantly, our new value is lower than that used in most astrochemical networks by nearly half. We explore the impact of this revised binding energy value for both an AGB outflow and a protoplanetary disk. We find that the lower value of the binding energy predicted here compared with values used in the literature (4050 K versus 7587 K) leads to less depletion of gas-phase benzene in an AGB outflow, and leads to a shift outwards in the benzene snowline in the midplane of a protoplanetary disk. Using this new value, the AGB model predicts lower abundances of benzene in the solid phase throughout the outflow. The disk model also predicts a larger reservoir of gas-phase benzene in the inner disk, which is consistent with the recent detections of benzene for the first time in protoplanetary disks with JWST.
Submitted 27 June, 2024;
originally announced June 2024.
-
Curved detectors for future X-ray astrophysics missions
Authors:
Eric D. Miller,
James A. Gregory,
Marshall W. Bautz,
Harry R. Clark,
Michael Cooper,
Kevan Donlon,
Richard F. Foster,
Catherine E. Grant,
Mallory Jensen,
Beverly LaMarr,
Renee Lambert,
Christopher Leitz,
Andrew Malonis,
Mo Neak,
Gregory Prigozhin,
Kevin Ryu,
Benjamin Schneider,
Keith Warner,
Douglas J. Young,
William W. Zhang
Abstract:
Future X-ray astrophysics missions will survey large areas of the sky with unparalleled sensitivity, enabled by lightweight, high-resolution optics. These optics inherently produce curved focal surfaces with radii as small as 2 m, requiring a large area detector system that closely conforms to the curved focal surface. We have embarked on a project using a curved charge-coupled device (CCD) detector technology developed at MIT Lincoln Laboratory to provide large-format, curved detectors for such missions, improving performance and simplifying design. We present the current status of this work, which aims to curve back-illuminated, large-format (5 cm x 4 cm) CCDs to 2.5-m radius and confirm X-ray performance. We detail the design of fixtures and the curving process, and present initial results on curving bare silicon samples and monitor devices and characterizing the surface geometric accuracy. The tests meet our accuracy requirement of <5 $μ$m RMS surface non-conformance for samples of similar thickness to the functional detectors. We finally show X-ray performance measurements of planar CCDs that will serve as a baseline to evaluate the curved detectors. The detectors exhibit low noise, good charge-transfer efficiency, and excellent, uniform spectroscopic performance, including in the important soft X-ray band.
Submitted 26 June, 2024;
originally announced June 2024.
-
Granular temperature controls local rheology of vibrated granular flows
Authors:
Mitchell G. Irmer,
Emily E. Brodsky,
Abram H. Clark
Abstract:
We use numerical simulations to demonstrate a local rheology for sheared, vibrated granular flows. We consider a granular assembly that is subjected to simple shear and harmonic vibration at the boundary. This configuration allows us to isolate the effects of vibration, as parameterized by granular temperature. We find that friction is reduced due to local velocity fluctuations of grains. All data obey a local rheology that relates the material friction coefficient, the granular temperature, and the dimensionless shear rate. We also observe that reduction in material friction due to granular temperature is associated with reduction in fabric anisotropy. We demonstrate that the temperature can be modeled by a heat equation with dissipation and appropriate boundary conditions, which provides complete closure of the system and allows a fully local continuum description of sheared, vibrated granular flows. This success suggests that a local rheology based on temperature, as proposed previously, combined with the new, empirical heat diffusion equation, may provide a general strategy for dense granular flows.
Submitted 21 May, 2024;
originally announced May 2024.
-
Elements of World Knowledge (EWOK): A cognition-inspired framework for evaluating basic world knowledge in language models
Authors:
Anna A. Ivanova,
Aalok Sathe,
Benjamin Lipkin,
Unnathi Kumar,
Setayesh Radkani,
Thomas H. Clark,
Carina Kauf,
Jennifer Hu,
R. T. Pramod,
Gabriel Grand,
Vivian Paulun,
Maria Ryskina,
Ekin Akyürek,
Ethan Wilcox,
Nafisa Rashid,
Leshem Choshen,
Roger Levy,
Evelina Fedorenko,
Joshua Tenenbaum,
Jacob Andreas
Abstract:
The ability to build and leverage world models is essential for a general-purpose AI agent. Testing such capabilities is hard, in part because the building blocks of world models are ill-defined. We present Elements of World Knowledge (EWOK), a framework for evaluating world modeling in language models by testing their ability to use knowledge of a concept to match a target text with a plausible/implausible context. EWOK targets specific concepts from multiple knowledge domains known to be vital for world modeling in humans. Domains range from social interactions (help/hinder) to spatial relations (left/right). Both contexts and targets are minimal pairs. Objects, agents, and locations in the items can be flexibly filled in, enabling easy generation of multiple controlled datasets. We then introduce EWOK-CORE-1.0, a dataset of 4,374 items covering 11 world knowledge domains. We evaluate 20 open-weights large language models (1.3B--70B parameters) across a battery of evaluation paradigms along with a human norming study comprising 12,480 measurements. The overall performance of all tested models is worse than human performance, with results varying drastically across domains. These data highlight simple cases where even large models fail and present rich avenues for targeted research on LLM world modeling capabilities.
Submitted 15 May, 2024;
originally announced May 2024.
-
An explicit granular-mechanics approach to marine sediment acoustics
Authors:
Abram H. Clark,
Derek R. Olson,
Andrew J. Swartz,
W. Mason Starnes
Abstract:
Here we theoretically and computationally study the frequency dependence of phase speed and attenuation for marine sediments from the perspective of granular mechanics. We leverage recent theoretical insights from the granular physics community as well as discrete-element method simulations, where the granular material is treated as a packing of discrete objects that interact via pairwise forces. These pairwise forces include both repulsive contact forces and dissipative terms, which may include losses from the fluid as well as losses from inelasticity at grain-grain contacts. We show that the structure of disordered granular packings leads to anomalous scaling laws for frequency-dependent phase speed and attenuation that do not follow from a continuum treatment. Our results demonstrate that granular packing structure, which is not explicitly considered in existing models, may play a crucial role in a complete theory of sediment acoustics. While this simple approach does not explicitly treat sound propagation or inertial effects in the interstitial fluid, it provides a starting point for future models that include these and other more complex features.
Submitted 10 May, 2024;
originally announced May 2024.
-
Automatic Modulation Classification using a Waveform Signature
Authors:
William H. Clark IV,
Joseph M. Ernst,
Robert W. McGwier
Abstract:
Cognitive Radios (CRs) build upon Software Defined Radios (SDRs) to allow for autonomous reconfiguration of communication architectures. In recent years, CRs have been identified as an enabler for Dynamic Spectrum Access (DSA) applications in which secondary users opportunistically share licensed spectrum. A major challenge for DSA is accurately characterizing the spectral environment, which requires blind signal classification. Existing work in this area has focused on simplistic channel models; however, more challenging fading channels (e.g., frequency selective fading channels) cause existing methods to be computationally complex or insufficient. This paper develops a novel blind modulation classification algorithm, which uses a set of higher order statistics to overcome these challenges. The set of statistics forms a signature, which can either be used directly for classification or can be processed using big data analytical techniques, such as principal component analysis (PCA), to learn the environment. The algorithm is tested in simulation on both flat fading and selective fading channel models. Results of this blind classification algorithm are shown to improve upon those which use single value higher order statistical methods.
Submitted 1 April, 2024;
originally announced April 2024.
-
Multitask Multilingual Model Adaptation with Featurized Low-Rank Mixtures
Authors:
Chu-Cheng Lin,
Xinyi Wang,
Jonathan H. Clark,
Han Lu,
Yun Zhu,
Chenxi Whitehouse,
Hongkun Yu
Abstract:
Adapting pretrained large language models (LLMs) to various downstream tasks in tens or hundreds of human languages is computationally expensive. Parameter-efficient fine-tuning (PEFT) significantly reduces the adaptation cost by tuning only a small number of parameters. However, directly applying PEFT methods such as LoRA (Hu et al., 2022) on diverse dataset mixtures could lead to suboptimal performance due to limited parameter capacity and negative interference among different datasets. In this work, we propose Featurized Low-rank Mixtures (FLix), a novel PEFT method designed for effective multitask multilingual tuning. FLix associates each unique dataset feature, such as the dataset's language or task, with its own low-rank weight update parameters. By composing feature-specific parameters for each dataset, FLix can accommodate diverse dataset mixtures and generalize better to unseen datasets. Our experiments show that FLix leads to significant improvements over a variety of tasks for both supervised learning and zero-shot settings using different training data mixtures.
Submitted 27 February, 2024;
originally announced February 2024.
-
Investigating heterogeneous PSMA ligand uptake inside parotid glands
Authors:
Caleb Sample,
Carlos Uribe,
Arman Rahmim,
François Bénard,
Jonn Wu,
Haley Clark
Abstract:
The purpose was to investigate the spatial heterogeneity of prostate-specific membrane antigen (PSMA) positron emission tomography (PET) uptake within parotid glands. We aim to quantify patterns in well-defined regions to facilitate further investigations. Furthermore, we investigate whether uptake is correlated with computed tomography (CT) texture features. Parotid glands from [18F]DCFPyL PSMA PET/CT images of 30 prostate cancer patients were analyzed. Thresholding was used to define high-uptake regions, and uptake statistics were computed within various divisions. Spearman's rank correlation coefficient was calculated between PSMA PET uptake and the Grey Level Run Length Matrix (GLRLM) using a long and short run length emphasis (GLRLML and GLRLMS) in subregions of parotid glands. PSMA PET uptake was significantly higher (p < 0.001) in lateral/posterior regions of the glands than anterior/medial regions. Maximum uptake was found in the lateral half of parotid glands in 50 out of 60 glands. The difference in SUV between parotid halves is greatest when parotids are divided by a plane separating the anterior/medial and posterior/lateral halves symmetrically. PSMA PET uptake was significantly correlated with CT GLRLML (p < 0.001), and anti-correlated with CT GLRLMS (p < 0.001). Uptake of PSMA PET is heterogeneous within parotid glands, with uptake biased towards lateral and posterior regions. Uptake patterns within parotid glands were found to be strongly correlated with CT texture features, suggesting the possible future use of CT texture features as a proxy for inferring PSMA PET uptake in salivary glands.
Submitted 4 January, 2024;
originally announced January 2024.
-
Image denoising and model-independent parameterization for improving IVIM MRI
Authors:
Caleb Sample,
Jonn Wu,
Haley Clark
Abstract:
Variability of IVIM parameters throughout the literature is a long-standing issue, and perfusion-related parameters are difficult to interpret. We demonstrate a method for improving the analysis of intravoxel incoherent motion imaging (IVIM) magnetic resonance (MR) images, using image denoising and a quantitative approach that does not require imposing specific exponential models. IVIM images were acquired for 13 head-and-neck patients prior to radiotherapy. Of these, 5 patients also had post-radiotherapy scans acquired. Image quality was improved prior to parameter fitting via denoising. For this, we employed neural blind deconvolution, a method of undertaking the ill-posed mathematical problem of blind deconvolution using neural networks. The signal decay curve was then quantified in terms of area under the curve ($AUC$) parameters. Denoised images were assessed in terms of blind image quality metrics, and correlations of their derived parameters in parotid glands with radiotherapy dose levels. We assessed the method's ability to recover artificial pseudokernels which had been applied to denoised images. $AUC$ parameters were compared with the apparent diffusion coefficient ($ADC$), biexponential, and triexponential model parameters, in terms of their correlations with dose, and their relative contributions to the total variance of the dataset, obtained through singular value decomposition. Image denoising resulted in improved blind image quality metrics, and higher correlations between IVIM parameters and dose. $AUC$ parameters were more correlated with dose than traditional IVIM parameters, and captured the highest proportion of the dataset's variance. This method of describing the signal decay curve with model-independent parameters like the $AUC$, and preprocessing images with denoising techniques, shows potential for improving reproducibility and utility of IVIM imaging.
Submitted 4 January, 2024;
originally announced January 2024.
-
PSMA PET/CT as a predictive tool for sub-regional importance estimates in the parotid gland
Authors:
Caleb Sample,
Arman Rahmim,
François Bénard,
Jonn Wu,
Haley Clark
Abstract:
Xerostomia and radiation-induced salivary gland dysfunction remain a common side effect for head-and-neck radiotherapy patients, and attempts have been made to quantify the heterogeneous dose response within parotid glands. Here several models of parotid gland subregional importance are compared with prostate specific membrane antigen (PSMA) positron emission tomography (PET) uptake. PSMA ligands show high concentrations in salivary glands, whose uptake has been previously found to relate to gland functionality. We develop a predictive model for relative importance estimates using PSMA PET and CT radiomic features, and demonstrate a methodology for predicting patient-specific importance deviations from the population. Intra-parotid gland uptake was compared with four regional importance models using 30 [18F]DCFPyL PSMA PET images. A radiomics-based predictive model of population importance was developed using a double cross-validation methodology. Population importance estimates were supplemented using patient-specific radiomic features. Anticorrelative relationships were found to exist between PSMA PET uptake and four independent models of subregional parotid gland importance from the literature. Kernel Ridge Regression with principal component analysis feature selection performed best over test sets (MAE = 0.08), with GLCM features being particularly important. Deblurring PSMA PET images strengthened correlations and improved model performance. This study suggests that regions of relatively low PSMA PET concentration in parotid glands may exhibit relatively high dose-sensitivity. We've demonstrated the utility of PSMA PET radiomic features for predicting relative importance within the parotid glands. PSMA PET appears promising for analyzing salivary gland functionality.
Submitted 4 January, 2024; v1 submitted 17 September, 2023;
originally announced September 2023.
-
FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning
Authors:
Xinyi Wang,
John Wieting,
Jonathan H. Clark
Abstract:
Learning paradigms for large language models (LLMs) currently tend to fall within either in-context learning (ICL) or full fine-tuning. Each of these comes with its own trade-offs based on available data, model size, compute cost, ease-of-use, and final quality, with neither solution performing well across the board. In this article, we first describe ICL and fine-tuning paradigms in a way that highlights their natural connections. Based on these connections, we propose a new learning paradigm called FIAT that fuses the best of these paradigms together, enabling prompt-engineered instructions and chain-of-thought reasoning with the very largest models while also using similar methods to perform parameter updates on a modestly-sized LLM with parameter-efficient tuning. We evaluate FIAT's effectiveness on a variety of multilingual tasks and observe that FIAT performs better than both ICL and fine-tuning at scales ranging from 100-10,000 training examples. We hope that FIAT provides a practical way of harnessing the full potential of LLMs without needing to make a hard choice between learning paradigms.
Submitted 12 September, 2023; v1 submitted 8 September, 2023;
originally announced September 2023.
-
Neural blind deconvolution for deblurring and supersampling PSMA PET
Authors:
Caleb Sample,
Arman Rahmim,
Carlos Uribe,
François Bénard,
Jonn Wu,
Roberto Fedrigo,
Haley Clark
Abstract:
Objective: To simultaneously deblur and supersample prostate specific membrane antigen (PSMA) positron emission tomography (PET) images using neural blind deconvolution. Approach: Blind deconvolution is a method of estimating the hypothetical "deblurred" image along with the blur kernel (related to the point spread function) simultaneously. Traditional maximum a posteriori blind deconvolution methods require stringent assumptions and suffer from convergence to a trivial solution. A method of modelling the deblurred image and kernel with independent neural networks, called "neural blind deconvolution", demonstrated success for deblurring 2D natural images in 2020. In this work, we adapt neural blind deconvolution for partial volume effect (PVE) correction of PSMA PET images with simultaneous supersampling. We compare this methodology with several interpolation methods, using blind image quality metrics, and test the model's ability to predict kernels by re-running the model after applying artificial "pseudokernels" to deblurred images. The methodology was tested on a retrospective set of 30 prostate patients as well as phantom images containing spherical lesions of various volumes. Results: Neural blind deconvolution led to improvements in image quality over other interpolation methods in terms of blind image quality metrics, recovery coefficients, and visual assessment. Predicted kernels were similar between patients, and the model accurately predicted several artificially-applied pseudokernels. Localization of activity in phantom spheres was improved after deblurring, allowing small lesions to be more accurately defined. Significance: The intrinsically low spatial resolution of PSMA PET leads to PVEs which negatively impact uptake quantification in small regions. The proposed method can be used to mitigate this issue, and can be straightforwardly adapted for other imaging modalities.
Submitted 2 March, 2024; v1 submitted 1 September, 2023;
originally announced September 2023.
-
The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation
Authors:
Patrick Fernandes,
Daniel Deutsch,
Mara Finkelstein,
Parker Riley,
André F. T. Martins,
Graham Neubig,
Ankush Garg,
Jonathan H. Clark,
Markus Freitag,
Orhan Firat
Abstract:
Automatic evaluation of machine translation (MT) is a critical tool driving the rapid iterative development of MT systems. While considerable progress has been made on estimating a single scalar quality score, current metrics lack the informativeness of more detailed schemes that annotate individual errors, such as Multidimensional Quality Metrics (MQM). In this paper, we help fill this gap by proposing AutoMQM, a prompting technique which leverages the reasoning and in-context learning capabilities of large language models (LLMs) and asks them to identify and categorize errors in translations. We start by evaluating recent LLMs, such as PaLM and PaLM-2, through simple score prediction prompting, and we study the impact of labeled data through in-context learning and finetuning. We then evaluate AutoMQM with PaLM-2 models, and we find that it improves performance compared to just prompting for scores (with particularly large gains for larger models) while providing interpretability through error spans that align with human annotations.
Submitted 14 August, 2023;
originally announced August 2023.
-
A Cross-Linguistic Pressure for Uniform Information Density in Word Order
Authors:
Thomas Hikaru Clark,
Clara Meister,
Tiago Pimentel,
Michael Hahn,
Ryan Cotterell,
Richard Futrell,
Roger Levy
Abstract:
While natural languages differ widely in both canonical word order and word order flexibility, their word orders still follow shared cross-linguistic statistical patterns, often attributed to functional pressures. In the effort to identify these pressures, prior work has compared real and counterfactual word orders. Yet one functional pressure has been overlooked in such investigations: the uniform information density (UID) hypothesis, which holds that information should be spread evenly throughout an utterance. Here, we ask whether a pressure for UID may have influenced word order patterns cross-linguistically. To this end, we use computational models to test whether real orders lead to greater information uniformity than counterfactual orders. In our empirical study of 10 typologically diverse languages, we find that: (i) among SVO languages, real word orders consistently have greater uniformity than reverse word orders, and (ii) only linguistically implausible counterfactual orders consistently exceed the uniformity of real orders. These findings are compatible with a pressure for information uniformity in the development and usage of natural languages.
Submitted 9 July, 2023; v1 submitted 6 June, 2023;
originally announced June 2023.
-
Evaluating and Modeling Attribution for Cross-Lingual Question Answering
Authors:
Benjamin Muller,
John Wieting,
Jonathan H. Clark,
Tom Kwiatkowski,
Sebastian Ruder,
Livio Baldini Soares,
Roee Aharoni,
Jonathan Herzig,
Xinyi Wang
Abstract:
Trustworthy answer content is abundant in many high-resource languages and is instantly accessible through question answering systems, yet this content can be hard to access for those that do not speak these languages. The leap forward in cross-lingual modeling quality offered by generative language models offers much promise, yet their raw generations often fall short in factuality. To improve trustworthiness in these systems, a promising direction is to attribute the answer to a retrieved source, possibly in a content-rich language different from the query. Our work is the first to study attribution for cross-lingual question answering. First, we collect data in 5 languages to assess the attribution level of a state-of-the-art cross-lingual QA system. To our surprise, we find that a substantial portion of the answers is not attributable to any retrieved passages (up to 50% of answers exactly matching a gold reference) despite the system being able to attend directly to the retrieved text. Second, to address this poor attribution level, we experiment with a wide range of attribution detection techniques. We find that Natural Language Inference models and PaLM 2 fine-tuned on a very small amount of attribution data can accurately detect attribution. Based on these models, we improve the attribution level of a cross-lingual question-answering system. Overall, we show that current academic generative cross-lingual QA systems have substantial shortcomings in attribution and we build tooling to mitigate these issues.
Submitted 15 November, 2023; v1 submitted 23 May, 2023;
originally announced May 2023.
-
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Authors:
Sebastian Ruder,
Jonathan H. Clark,
Alexander Gutkin,
Mihir Kale,
Min Ma,
Massimo Nicosia,
Shruti Rijhwani,
Parker Riley,
Jean-Michel A. Sarr,
Xinyi Wang,
John Wieting,
Nitish Gupta,
Anna Katanova,
Christo Kirov,
Dana L. Dickinson,
Brian Roark,
Bidisha Samanta,
Connie Tao,
David I. Adelani,
Vera Axelrod,
Isaac Caswell,
Colin Cherry,
Dan Garrette,
Reeve Ingle,
Melvin Johnson
, et al. (2 additional authors not shown)
Abstract:
Data scarcity is a crucial issue for the development of highly multilingual NLP systems. Yet for many under-represented languages (ULs) -- languages for which NLP research is particularly far behind in meeting user needs -- it is feasible to annotate small amounts of data. Motivated by this, we propose XTREME-UP, a benchmark defined by: its focus on the scarce-data scenario rather than zero-shot; its focus on user-centric tasks -- tasks with broad adoption by speakers of high-resource languages; and its focus on under-represented languages where this scarce-data scenario tends to be most realistic. XTREME-UP evaluates the capabilities of language models across 88 under-represented languages over 9 key user-centric technologies including ASR, OCR, MT, and information access tasks that are of general utility. We create new datasets for OCR, autocomplete, semantic parsing, and transliteration, and build on and refine existing datasets for other tasks. XTREME-UP provides methodology for evaluating many modeling scenarios including text-only, multi-modal (vision, audio, and text), supervised parameter tuning, and in-context learning. We evaluate commonly used models on the benchmark. We release all code and scripts to train and evaluate models.
Submitted 24 May, 2023; v1 submitted 19 May, 2023;
originally announced May 2023.
-
Improving the modeling of the Agility multi-leaf collimator
Authors:
Mohammad Hussein,
Agnes Angerud,
Jordi Saez,
Evelien Bogaert,
Matthieu Lemire,
Miriam Barry,
Ileana Silvestre Patallo,
David Shipley,
Catharine H Clark,
Victor Hernandez
Abstract:
Robust fine tuning of multi-leaf collimator (MLC) Treatment Planning System (TPS) modeling parameters is crucial for creating an optimal beam model, particularly with the ever-increasing accuracy required for advancing techniques. Challenges arise from balancing the trade-off between multiple parameters and therefore the quality of tuning depends on the experience of the physicist and the procedures used. This is in part due to limitations of the MLC modeling within the TPS. As a result, the actual values used have been shown to vary widely between centers. We present and evaluate two different MLC transmission maps to improve the modeling of the Elekta Agility MLC in the RayStation TPS. The model prototypes were developed with discrete and continuous MLC transmission maps assigned to the tongue-and-groove and leaf tip regions. The prototypes aimed to replicate the average doses for synchronous and asynchronous sweeping gap fields, measured using a Farmer chamber in a solid water phantom. This study investigated the impact of these improvements using test fields and a wide variety of measured clinical plans in three different centers. The models achieved good accuracy and the improvements facilitated the standardization of the configuration and commissioning processes and extended the range of validity of TPS dose calculations in a wide variety of treatment plans and measuring systems. The simpler MLC prototype with discrete transmission maps performed similarly to the more sophisticated one both in tests and clinical plans and constitutes a good option for routine clinical practice. The need for trade-offs was reduced and the models were successfully configured using a common set of parameters across the three centers, which is useful for reducing the workload and the risks associated with the configuration process, thus improving the accuracy and safety of radiotherapy treatments.
Submitted 19 May, 2023;
originally announced May 2023.
-
PaLM 2 Technical Report
Authors:
Rohan Anil,
Andrew M. Dai,
Orhan Firat,
Melvin Johnson,
Dmitry Lepikhin,
Alexandre Passos,
Siamak Shakeri,
Emanuel Taropa,
Paige Bailey,
Zhifeng Chen,
Eric Chu,
Jonathan H. Clark,
Laurent El Shafey,
Yanping Huang,
Kathy Meier-Hellstern,
Gaurav Mishra,
Erica Moreira,
Mark Omernick,
Kevin Robinson,
Sebastian Ruder,
Yi Tay,
Kefan Xiao,
Yuanzhong Xu,
Yujing Zhang,
Gustavo Hernandez Abrego,
et al. (103 additional authors not shown)
Abstract:
We introduce PaLM 2, a new state-of-the-art language model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. PaLM 2 is a Transformer-based model trained using a mixture of objectives. Through extensive evaluations on English and multilingual language, and reasoning tasks, we demonstrate that PaLM 2 has significantly improved quality on downstream tasks across different model sizes, while simultaneously exhibiting faster and more efficient inference compared to PaLM. This improved efficiency enables broader deployment while also allowing the model to respond faster, for a more natural pace of interaction. PaLM 2 demonstrates robust reasoning capabilities exemplified by large improvements over PaLM on BIG-Bench and other reasoning tasks. PaLM 2 exhibits stable performance on a suite of responsible AI evaluations, and enables inference-time control over toxicity without additional overhead or impact on other capabilities. Overall, PaLM 2 achieves state-of-the-art performance across a diverse set of tasks and capabilities.
When discussing the PaLM 2 family, it is important to distinguish between pre-trained models (of various sizes), fine-tuned variants of these models, and the user-facing products that use these models. In particular, user-facing products typically include additional pre- and post-processing steps. Additionally, the underlying models may evolve over time. Therefore, one should not expect the performance of user-facing products to exactly match the results reported in this report.
Submitted 13 September, 2023; v1 submitted 17 May, 2023;
originally announced May 2023.
-
AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages
Authors:
Odunayo Ogundepo,
Tajuddeen R. Gwadabe,
Clara E. Rivera,
Jonathan H. Clark,
Sebastian Ruder,
David Ifeoluwa Adelani,
Bonaventure F. P. Dossou,
Abdou Aziz DIOP,
Claytone Sikasote,
Gilles Hacheme,
Happy Buzaaba,
Ignatius Ezeani,
Rooweither Mabuya,
Salomey Osei,
Chris Emezue,
Albert Njoroge Kahira,
Shamsuddeen H. Muhammad,
Akintunde Oladipo,
Abraham Toluwase Owodunni,
Atnafu Lambebo Tonja,
Iyanuoluwa Shode,
Akari Asai,
Tunde Oluwaseyi Ajayi,
Clemencia Siro,
Steven Arthur,
et al. (27 additional authors not shown)
Abstract:
African languages have far less in-language content available digitally, making it challenging for question answering systems to satisfy the information needs of users. Cross-lingual open-retrieval question answering (XOR QA) systems -- those that retrieve answer content from other languages while serving people in their native language -- offer a means of filling this gap. To this end, we create AfriQA, the first cross-lingual QA dataset with a focus on African languages. AfriQA includes 12,000+ XOR QA examples across 10 African languages. While previous datasets have focused primarily on languages where cross-lingual QA augments coverage from the target language, AfriQA focuses on languages where cross-lingual answer content is the only high-coverage source of answer content. Because of this, we argue that African languages are one of the most important and realistic use cases for XOR QA. Our experiments demonstrate the poor performance of automatic translation and multilingual retrieval methods. Overall, AfriQA proves challenging for state-of-the-art QA models. We hope that the dataset enables the development of more equitable QA technology.
Submitted 11 May, 2023;
originally announced May 2023.
-
The Earliest Stage of Galactic Star Formation
Authors:
Charles L. Steinhardt,
Vadim Rusakov,
Thomas H. Clark,
Andrei Diaconu,
Conor McPartland,
John Forbes,
Albert Sneppen,
John Weaver
Abstract:
Using a recently-developed technique to estimate gas temperatures ($T_\textrm{SF}$) in star-forming regions from large photometric surveys, we propose a diagram, analogous to the Hertzsprung-Russell diagram for individual stars, to probe the evolution of individual galaxies. On this $T_\textrm{SF}$-sSFR (specific star formation rate) diagram, a small fraction of star-forming galaxies appear to be dominated by different feedback mechanisms than typical star-forming galaxies. These galaxies generically have younger stellar populations, lower stellar masses and increase in relative abundance towards higher redshifts, so we argue that these objects are in an earlier stage of galactic star formation. Further, Hubble observations find that these "core-forming" galaxies also exhibit distinct morphology, and that tracks on the $T_\textrm{SF}$-sSFR diagram are also a morphological sequence. Thus, unlike starburst phases which can be triggered environmentally, these earliest, core-forming galaxies appear to be a stage that typical galaxies go through early in their star formation history. We therefore argue that most galaxies first go through a core formation stage, then subsequently disk formation, and finally become quiescent.
Submitted 22 June, 2023; v1 submitted 4 January, 2023;
originally announced January 2023.
-
Beyond Contrastive Learning: A Variational Generative Model for Multilingual Retrieval
Authors:
John Wieting,
Jonathan H. Clark,
William W. Cohen,
Graham Neubig,
Taylor Berg-Kirkpatrick
Abstract:
Contrastive learning has been successfully used for retrieval of semantically aligned sentences, but it often requires large batch sizes or careful engineering to work well. In this paper, we instead propose a generative model for learning multilingual text embeddings which can be used to retrieve or score sentence pairs. Our model operates on parallel data in $N$ languages and, through an approximation we introduce, efficiently encourages source separation in this multilingual setting, separating semantic information that is shared between translations from stylistic or language-specific variation. We show careful large-scale comparisons between contrastive and generation-based approaches for learning multilingual text embeddings, a comparison that has not been done to the best of our knowledge despite the popularity of these approaches. We evaluate this method on a suite of tasks including semantic similarity, bitext mining, and cross-lingual question retrieval -- the last of which we introduce in this paper. Overall, our Variational Multilingual Source-Separation Transformer (VMSST) model outperforms both a strong contrastive and generative baseline on these tasks.
Submitted 4 June, 2023; v1 submitted 20 December, 2022;
originally announced December 2022.
-
Rastreo muscular móvil usando magnetomicrometría -- traducción al español del articulo "Untethered Muscle Tracking Using Magnetomicrometry" por el autor Cameron R. Taylor
Authors:
Cameron R. Taylor,
Seong Ho Yeon,
William H. Clark,
Ellen G. Clarrissimeaux,
Mary Kate O'Donnell,
Thomas J. Roberts,
Hugh M. Herr
Abstract:
Muscle tissue drives nearly all movement in the animal kingdom, providing power, mobility, and dexterity. Technologies for measuring muscle tissue motion, such as sonomicrometry, fluoromicrometry, and ultrasound, have significantly advanced our understanding of biomechanics. Yet, the field lacks the ability to monitor muscle tissue motion for animal behavior outside the lab. Towards addressing this issue, we previously introduced magnetomicrometry, a method that uses magnetic beads to wirelessly monitor muscle tissue length changes, and we validated magnetomicrometry via tightly-controlled in situ testing. In this study we validate the accuracy of magnetomicrometry against fluoromicrometry during untethered running in an in vivo turkey model. We demonstrate real-time muscle tissue length tracking of the freely-moving turkeys executing various motor activities, including ramp ascent and descent, vertical ascent and descent, and free roaming movement. Given the demonstrated capacity of magnetomicrometry to track muscle movement in untethered animals, we feel that this technique will enable new scientific explorations and an improved understanding of muscle function. -- --
Muscle tissue is the engine of nearly all movement in the animal kingdom, providing strength, mobility, and dexterity. Technologies for measuring muscle tissue motion, such as sonomicrometry, fluoromicrometry, and ultrasound, have considerably advanced the understanding of biomechanics. However, the field lacks the ability to track muscle tissue motion during animal behavior outside the laboratory. To address this problem, we previously introduced magnetomicrometry, a method that uses small magnets to wirelessly track changes in muscle tissue length, and we validated magnetomicrometry through tightly controlled in situ testing. In this study we validate the accuracy of magnetomicrometry against fluoromicrometry using an in vivo turkey model running freely. We demonstrate real-time tracking of muscle tissue length in freely moving turkeys performing various motor activities, including ramp ascent and descent, vertical ascent and descent, and free movement. Given the demonstrated capacity of magnetomicrometry to track muscle movement in animals in a mobile context, we believe that this technique will enable new scientific explorations and a better understanding of muscle function.
Submitted 19 November, 2022;
originally announced November 2022.
-
Angular Diameters and Fundamental Parameters of Forty-Four Stars from the Navy Precision Optical Interferometer
Authors:
Ellyn K. Baines,
J. Thomas Armstrong,
James H. Clark III,
Jim Gorney,
Donald J. Hutter,
Anders M. Jorgensen,
Casey Kyte,
David Mozurkewich,
Ishara Nisley,
Jason Sanborn,
Henrique R. Schmitt,
Gerard T. van Belle
Abstract:
We measured the angular diameters of 44 stars with the Navy Precision Optical Interferometer, obtaining uncertainties on the limb darkened diameter of 2% or less for all but four stars. We then used our diameters with Gaia or Hipparcos parallaxes to calculate each star's physical radius. We gathered information from the literature to determine bolometric flux and luminosity, and combined that with our diameters to produce an effective temperature. Our sample consists of mostly giant stars, and spans a wide range of spectral classes from B to M.
Submitted 16 November, 2022;
originally announced November 2022.
-
Detecting Topological phase transitions in a double kicked quantum rotor
Authors:
Nikolai Bolik,
Caspar Groiseau,
Jerry H. Clark,
Gil S. Summy,
Yingmei Liu,
Sandro Wimberger
Abstract:
We present a concrete theoretical proposal for detecting topological phase transitions in double kicked atom-optics kicked rotors with internal spin-1/2 degree of freedom. The implementation utilizes a kicked Bose-Einstein condensate evolving in one-dimensional momentum space. To reduce influence of atom loss and phase decoherence we aim to keep experimental durations short while maintaining a resonant experimental protocol. Experimental limitations induced by phase noise, quasimomentum distributions, symmetries, and the AC-Stark shift are considered. Our results thus suggest a feasible and optimized procedure for observing topological phase transitions in quantum kicked rotors.
Submitted 21 October, 2022;
originally announced October 2022.
-
Frictional weakening of vibrated granular flows
Authors:
Abram H. Clark,
H. John Nasrin,
Stephanie E. Taylor,
Emily E. Brodsky
Abstract:
We computationally study the frictional properties of sheared granular media subjected to harmonic vibration applied at the boundary. Such vibrations are thought to play an important role in weakening flows, yet the independent effects of amplitude, frequency, and pressure on the process have remained unclear. Based on a dimensional analysis and DEM simulations, we show that, in addition to a previously proposed criterion for peak acceleration that leads to breaking of contacts, weakening requires that the absolute amplitude squared of the displacement be sufficiently large relative to the confining pressure. The analysis provides a basis for predicting flows subjected to arbitrary external vibration and demonstrates that a previously unrecognized second process that is dependent on dissipation contributes to shear weakening under vibrations.
Submitted 1 March, 2023; v1 submitted 11 July, 2022;
originally announced July 2022.
-
MIA 2022 Shared Task: Evaluating Cross-lingual Open-Retrieval Question Answering for 16 Diverse Languages
Authors:
Akari Asai,
Shayne Longpre,
Jungo Kasai,
Chia-Hsuan Lee,
Rui Zhang,
Junjie Hu,
Ikuya Yamada,
Jonathan H. Clark,
Eunsol Choi
Abstract:
We present the results of the Workshop on Multilingual Information Access (MIA) 2022 Shared Task, evaluating cross-lingual open-retrieval question answering (QA) systems in 16 typologically diverse languages. In this task, we adapted two large-scale cross-lingual open-retrieval QA datasets in 14 typologically diverse languages, and newly annotated open-retrieval QA data in 2 underrepresented languages: Tagalog and Tamil. Four teams submitted their systems. The best system leveraging iteratively mined diverse negative examples and larger pretrained models achieves 32.2 F1, outperforming our baseline by 4.5 points. The second best system uses entity-aware contextualized representations for document retrieval, and achieves significant improvements in Tamil (20.8 F1), whereas most of the other systems yield nearly zero scores.
Submitted 2 July, 2022;
originally announced July 2022.
-
Implications of a Temperature Dependent IMF III: Mass Growth and Quiescence
Authors:
Charles L. Steinhardt,
Albert Sneppen,
Hagan Hensley,
Adam S. Jermyn,
Basel Mostafa,
John R. Weaver,
Gabriel Brammer,
Thomas H. Clark,
Iary Davidzon,
Andrei C. Diaconu,
Bahram Mobasher,
Vadim Rusakov,
Sune Toft
Abstract:
The stellar initial mass function (IMF) is predicted to depend upon the temperature of gas in star-forming molecular clouds. The introduction of an additional parameter, $T_{IMF}$, into photometric template fitting suggests that most galaxies obey an IMF top-heavier than the Galactic IMF. The implications of these revised fits on mass functions, quiescence and turnoff are discussed. At all redshifts, the highest mass galaxies become quiescent first, with the turnoff mass decreasing towards the present. The synchronous turnoff mass across galaxies suggests quiescence is driven by universal mechanisms rather than by stochastic or environmental processes.
Submitted 3 June, 2022;
originally announced June 2022.
-
Implications of a Temperature Dependent IMF II: An Updated View of the Star-Forming Main Sequence
Authors:
Charles L. Steinhardt,
Albert Sneppen,
Basel Mostafa,
Hagan Hensley,
Adam S. Jermyn,
Adrian Lopez,
John Weaver,
Gabriel Brammer,
Thomas H. Clark,
Iary Davidzon,
Andrei C. Diaconu,
Bahram Mobasher,
Vadim Rusakov,
Sune Toft
Abstract:
The stellar initial mass function (IMF) is predicted to depend upon the temperature of gas in star-forming molecular clouds. The introduction of an additional parameter, $T_{IMF}$, into photometric template fitting allows galaxies to be fit with a range of IMFs. Three surprising new features appear: (1) most star-forming galaxies are best fit with a bottom-lighter IMF than the Milky Way; (2) most star-forming galaxies at fixed redshift are fit with a very similar IMF; and (3) the most massive star-forming galaxies at fixed redshift instead exhibit a less bottom-light IMF, similar to that measured in quiescent galaxies. Additionally, since stellar masses and star formation rates both depend on the IMF, these results slightly modify the resulting relationship, while yielding similar qualitative characteristics to previous studies.
Submitted 27 May, 2022;
originally announced May 2022.
-
Light-shift induced behaviors observed in momentum-space quantum walks
Authors:
Nikolai Bolik,
Caspar Groiseau,
Jerry H. Clark,
Alexander Gresch,
Siamak Dadras,
Gil S. Summy,
Yingmei Liu,
Sandro Wimberger
Abstract:
Over the last decade there have been many advances in studies of quantum walks (QWs) including a momentum-space QW recently realized in our spinor Bose-Einstein condensate system. This QW possessed behaviors that generally agreed with theoretical predictions; however, it also showed momentum distributions that were not adequately explained by the theory. We present a theoretical model which proves that the coherent dynamics of the spinor condensate is sufficient to explain the experimental data without invoking the presence of a thermal cloud of atoms as in the original theory. Our numerical findings are supported by an analytical prediction for the momentum distributions in the limit of zero-temperature condensates. This current model provides more complete explanations of the momentum-space QWs that can be applied to study quantum search algorithms and topological phases in Floquet-driven systems.
Submitted 26 September, 2022; v1 submitted 16 May, 2022;
originally announced May 2022.
-
Training from Zero: Radio Frequency Machine Learning Data Quantity Forecasting
Authors:
William H. Clark IV,
Alan J. Michaels
Abstract:
The data used during training in any given application space is directly tied to the performance of the system once deployed. While there are many other factors that go into producing high performance models within machine learning, there is no doubt that the data used to train a system provides the foundation from which to build. One of the underlying rule of thumb heuristics used within the machine learning space is that more data leads to better models, but there is no easy answer for the question, "How much data is needed?" This work examines a modulation classification problem in the Radio Frequency domain space, attempting to answer the question of how much training data is required to achieve a desired level of performance, but the procedure readily applies to classification problems across modalities. The ultimate goal is determining an approach that requires the least amount of data collection to better inform a more thorough collection effort to achieve the desired performance metric. While this approach will require an initial dataset that is germane to the problem space to act as a \textit{target} dataset on which metrics are extracted, the goal is to allow for the initial data to be orders of magnitude smaller than what is required for delivering a system that achieves the desired performance. An additional benefit of the techniques presented here is that the quality of different datasets can be numerically evaluated and tied together with the quantity of data, and ultimately, the performance of the architecture in the problem domain.
Submitted 14 June, 2024; v1 submitted 7 May, 2022;
originally announced May 2022.
-
Analyzing Wrap-Up Effects through an Information-Theoretic Lens
Authors:
Clara Meister,
Tiago Pimentel,
Thomas Hikaru Clark,
Ryan Cotterell,
Roger Levy
Abstract:
Numerous analyses of reading time (RT) data have been implemented -- all in an effort to better understand the cognitive processes driving reading comprehension. However, data measured on words at the end of a sentence -- or even at the end of a clause -- is often omitted due to the confounding factors introduced by so-called "wrap-up effects," which manifest as a skewed distribution of RTs for these words. Consequently, the understanding of the cognitive processes that might be involved in these wrap-up effects is limited. In this work, we attempt to learn more about these processes by examining the relationship between wrap-up effects and information-theoretic quantities, such as word and context surprisals. We find that the distribution of information in prior contexts is often predictive of sentence- and clause-final RTs (while not of sentence-medial RTs). This lends support to several prior hypotheses about the processes involved in wrap-up effects.
Submitted 5 January, 2024; v1 submitted 31 March, 2022;
originally announced March 2022.
-
Scaling Up Models and Data with $\texttt{t5x}$ and $\texttt{seqio}$
Authors:
Adam Roberts,
Hyung Won Chung,
Anselm Levskaya,
Gaurav Mishra,
James Bradbury,
Daniel Andor,
Sharan Narang,
Brian Lester,
Colin Gaffney,
Afroz Mohiuddin,
Curtis Hawthorne,
Aitor Lewkowycz,
Alex Salcianu,
Marc van Zee,
Jacob Austin,
Sebastian Goodman,
Livio Baldini Soares,
Haitang Hu,
Sasha Tsvyashchenko,
Aakanksha Chowdhery,
Jasmijn Bastings,
Jannis Bulian,
Xavier Garcia,
Jianmo Ni,
Andrew Chen,
et al. (18 additional authors not shown)
Abstract:
Recent neural network-based language models have benefited greatly from scaling up the size of training datasets and the number of parameters in the models themselves. Scaling can be complicated due to various factors including the need to distribute computation on supercomputer clusters (e.g., TPUs), prevent bottlenecks when infeeding data, and ensure reproducible results. In this work, we present two software libraries that ease these issues: $\texttt{t5x}$ simplifies the process of building and training large language models at scale while maintaining ease of use, and $\texttt{seqio}$ provides a task-based API for simple creation of fast and reproducible training data and evaluation pipelines. These open-source libraries have been used to train models with hundreds of billions of parameters on datasets with multiple terabytes of training data.
Along with the libraries, we release configurations and instructions for T5-like encoder-decoder models as well as GPT-like decoder-only architectures.
$\texttt{t5x}$ and $\texttt{seqio}$ are open source and available at https://github.com/google-research/t5x and https://github.com/google/seqio, respectively.
Submitted 31 March, 2022;
originally announced March 2022.
-
Crystal-field states and defect levels in candidate quantum spin ice Ce$_{2}$Hf$_{2}$O$_{7}$
Authors:
Victor Porée,
Elsa Lhotel,
Sylvain Petit,
Aleksandra Krajewska,
Pascal Puphal,
Adam H. Clark,
Vladimir Pomjakushin,
Helen C. Walker,
Nicolas Gauthier,
Dariusz J. Gawryluk,
Romain Sibille
Abstract:
We report the synthesis of powder and single-crystal samples of the cerium pyrohafnate and their characterization using neutron diffraction, thermogravimetry and X-ray absorption spectroscopy. We evaluate the amount of non-magnetic Ce$^{4+}$ defects and use this result to interpret the spectrum of crystal-electric field transitions observed using inelastic neutron scattering. The analysis of these single-ion transitions indicates the dipole-octupole nature of the ground state doublet and a significant degree of spin-lattice coupling. The single-ion properties calculated from the crystal-electric field parameters obtained spectroscopically are in good agreement with bulk magnetic susceptibility data down to about 1 K. Below this temperature, the behavior of the magnetic susceptibility indicates a correlated regime without showing any sign of magnetic long-range order or freezing down to 0.08 K. We conclude that Ce$_2$Hf$_2$O$_{7}$ is another candidate to investigate exotic correlated states of quantum matter such as the octupolar quantum spin ice recently argued to exist in the isostructural compounds Ce$_2$Sn$_2$O$_7$ and Ce$_2$Zr$_2$O$_7$.
Submitted 30 March, 2022;
originally announced March 2022.
-
XTREME-S: Evaluating Cross-lingual Speech Representations
Authors:
Alexis Conneau,
Ankur Bapna,
Yu Zhang,
Min Ma,
Patrick von Platen,
Anton Lozhkov,
Colin Cherry,
Ye Jia,
Clara Rivera,
Mihir Kale,
Daan Van Esch,
Vera Axelrod,
Simran Khanuja,
Jonathan H. Clark,
Orhan Firat,
Michael Auli,
Sebastian Ruder,
Jason Riesa,
Melvin Johnson
Abstract:
We introduce XTREME-S, a new benchmark to evaluate universal cross-lingual speech representations in many languages. XTREME-S covers four task families: speech recognition, classification, speech-to-text translation and retrieval. Covering 102 languages from 10+ language families, 3 different domains and 4 task families, XTREME-S aims to simplify multilingual speech representation evaluation, as well as catalyze research in "universal" speech representation learning. This paper describes the new benchmark and establishes the first speech-only and speech-text baselines using XLS-R and mSLAM on all downstream tasks. We motivate the design choices and detail how to use the benchmark. Datasets and fine-tuning scripts are made easily accessible at https://hf.co/datasets/google/xtreme_s.
Submitted 13 April, 2022; v1 submitted 21 March, 2022;
originally announced March 2022.
-
Faint objects in motion: the new frontier of high precision astrometry
Authors:
Fabien Malbet,
Céline Boehm,
Alberto Krone-Martins,
Antonio Amorim,
Guillem Anglada-Escudé,
Alexis Brandeker,
Frédéric Courbin,
Torsten Enßlin,
Antonio Falcão,
Katherine Freese,
Berry Holl,
Lucas Labadie,
Alain Léger,
Gary Mamon,
Barbara Mcarthur,
Alcione Mora,
Mike Shao,
Alessandro Sozzetti,
Douglas Spolyar,
Eva Villaver,
Ummi Abbas,
Conrado Albertus,
João Alves,
Rory Barnes,
Aldo Stefano Bonomo
, et al. (61 additional authors not shown)
Abstract:
Sky survey telescopes and powerful targeted telescopes play complementary roles in astronomy. In order to investigate the nature and characteristics of the motions of very faint objects, a flexibly-pointed instrument capable of high astrometric accuracy is an ideal complement to current astrometric surveys and a unique tool for precision astrophysics. Such a space-based mission will push the frontier of precision astrometry from evidence of Earth-mass habitable worlds around the nearest stars, to distant Milky Way objects, and out to the Local Group of galaxies. As we enter the era of the James Webb Space Telescope and the new ground-based, adaptive-optics-enabled giant telescopes, by obtaining these high precision measurements on key objects that Gaia could not reach, a mission that focuses on high precision astrometry science can consolidate our theoretical understanding of the local Universe, enable extrapolation of physical processes to remote redshifts, and derive a much more consistent picture of cosmological evolution and the likely fate of our cosmos. Already several missions have been proposed to address the science case of faint objects in motion using high precision astrometry missions: NEAT proposed for the ESA M3 opportunity, micro-NEAT for the S1 opportunity, and Theia for the M4 and M5 opportunities. Additional new mission configurations adapted with technological innovations could be envisioned to pursue accurate measurements of these extremely small motions. The goal of this White Paper is to address the fundamental science questions that are at stake when we focus on the motions of faint sky objects and to briefly review instrumentation and mission profiles.
Submitted 16 November, 2021;
originally announced November 2021.
-
ExoMol line lists -- XLIV. IR and UV line list for silicon monoxide (SiO)
Authors:
Sergei N. Yurchenko,
Jonathan Tennyson,
Anna-Maree Syme,
Ahmad Y. Adam,
Victoria H. J. Clark,
Bridgette Cooper,
C. Pria Dobney,
Shaun T. E. Donnelly,
Maire N. Gorman,
Anthony E. Lynas-Gray,
Thomas Meltzer,
Alec Owens,
Qianwei Qu,
Mikhail Semenov,
Wilfrid Somogyi,
Apoorva Upadhyay,
Samuel Wright,
Juan C. Zapata Trujillo
Abstract:
A new silicon monoxide ($^{28}$Si$^{16}$O) line list covering infrared, visible and ultraviolet regions called SiOUVenIR is presented. This line list extends the infrared EBJT ExoMol line list by including vibronic transitions to the $A\,{}^{1}Π$ and $E\,{}^{1}Σ^{+}$ electronic states. Strong perturbations to the $A\,{}^{1}Π$ band system are accurately modelled through the treatment of 6 dark electronic states: $C\,{}^{1}Σ^{-}$, $D\,{}^{1}Δ$, $a\,{}^{3}Σ^{+}$, $b\,{}^{3}Π$, $e\,{}^{3}Σ^{-}$ and $d\,{}^{3}Δ$. Along with the $X\,{}^{1}Σ^{+}$ ground state, these 9 electronic states were used to build a comprehensive spectroscopic model of SiO using a combination of empirical and ab initio curves, including the potential energy (PE), spin-orbit (SO), electronic angular momentum (EAM) and (transition) dipole moment curves. The ab initio PE and coupling curves, computed at the multireference configuration interaction (MRCI) level of theory, were refined by fitting their analytical representations to 2617 experimentally derived SiO energy levels determined from 97 vibronic bands belonging to the $X$-$X$, $E$-$X$ and $A$-$X$ electronic systems through the MARVEL procedure. 112 observed forbidden transitions from the $C$-$X$, $D$-$X$, $e$-$X$, and $d$-$X$ bands were assigned using our predictions, and these could be fed back into the MARVEL procedure. The SiOUVenIR line list was computed using published ab initio transition dipole moments for the $E$-$X$ and $A$-$X$ bands; the line list is suitable for temperatures up to 10,000 K and for wavelengths longer than 140 nm. SiOUVenIR is available from www.exomol.com and the CDS database.
Submitted 8 November, 2021;
originally announced November 2021.
-
Pediatric Otoscopy Video Screening with Shift Contrastive Anomaly Detection
Authors:
Weiyao Wang,
Aniruddha Tamhane,
Christine Santos,
John R. Rzasa,
James H. Clark,
Therese L. Canares,
Mathias Unberath
Abstract:
Ear-related concerns and symptoms represent the leading indication for seeking pediatric healthcare attention. Despite the high incidence of such encounters, the diagnostic process for commonly encountered diseases of the middle and external ear presents a significant challenge. Much of this challenge stems from the lack of cost-effective diagnostic testing, which necessitates that the presence or absence of ear pathology be determined clinically. Research has, however, demonstrated considerable variation among clinicians in their ability to accurately diagnose and consequently manage ear pathology. With recent advances in computer vision and machine learning, there is an increasing interest in helping clinicians to accurately diagnose middle and external ear pathology with computer-aided systems. It has been shown that AI has the capacity to analyse a single clinical image captured during examination of the ear canal and eardrum from which it can determine the likelihood of a pathognomonic pattern for a specific diagnosis being present. The capture of such an image can, however, be challenging, especially for inexperienced clinicians. To help mitigate this technical challenge we have developed and tested a method using video sequences. We present a two-stage method that first identifies valid frames by detecting and extracting eardrum patches from the video sequence, and second performs the proposed shift contrastive anomaly detection to flag the otoscopy video sequences as normal or abnormal. Our method achieves an AUROC of 88.0% on the patient level and also outperforms the average of a group of 25 clinicians in a comparative study, which is the largest of its kind published to date. We conclude that the presented method achieves a promising first step towards automated analysis of otoscopy video.
Submitted 25 October, 2021;
originally announced October 2021.
-
SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training
Authors:
Ankur Bapna,
Yu-an Chung,
Nan Wu,
Anmol Gulati,
Ye Jia,
Jonathan H. Clark,
Melvin Johnson,
Jason Riesa,
Alexis Conneau,
Yu Zhang
Abstract:
Unsupervised pre-training is now the predominant approach for both text and speech understanding. Self-attention models pre-trained on large amounts of unannotated data have been hugely successful when fine-tuned on downstream tasks from a variety of domains and languages. This paper takes the universality of unsupervised language pre-training one step further, by unifying speech and text pre-training within a single model. We build a single encoder with the BERT objective on unlabeled text together with the w2v-BERT objective on unlabeled speech. To further align our model representations across modalities, we leverage alignment losses, specifically Translation Language Modeling (TLM) and Speech Text Matching (STM) that make use of supervised speech-text recognition data. We demonstrate that incorporating both speech and text data during pre-training can significantly improve downstream quality on CoVoST~2 speech translation, by around 1 BLEU compared to single-modality pre-trained models, while retaining close to SotA performance on LibriSpeech and SpeechStew ASR tasks. On four GLUE tasks and text-normalization, we observe evidence of capacity limitations and interference between the two modalities, leading to degraded performance compared to an equivalent text-only model, while still being competitive with BERT. Through extensive empirical analysis we also demonstrate the importance of the choice of objective function for speech pre-training, and the beneficial effect of adding additional supervised signals on the quality of the learned representations.
Submitted 19 October, 2021;
originally announced October 2021.
-
Mixture-of-Partitions: Infusing Large Biomedical Knowledge Graphs into BERT
Authors:
Zaiqiao Meng,
Fangyu Liu,
Thomas Hikaru Clark,
Ehsan Shareghi,
Nigel Collier
Abstract:
Infusing factual knowledge into pre-trained models is fundamental for many knowledge-intensive tasks. In this paper, we propose Mixture-of-Partitions (MoP), an infusion approach that can handle a very large knowledge graph (KG) by partitioning it into smaller sub-graphs and infusing their specific knowledge into various BERT models using lightweight adapters. To leverage the overall factual knowledge for a target task, these sub-graph adapters are further fine-tuned along with the underlying BERT through a mixture layer. We evaluate our MoP with three biomedical BERTs (SciBERT, BioBERT, PubmedBERT) on six downstream tasks (inc. NLI, QA, Classification), and the results show that our MoP consistently enhances the underlying BERTs in task performance, and achieves new SOTA performances on five evaluated datasets.
Submitted 10 September, 2021;
originally announced September 2021.
-
Quantum to Classical Walk Transitions Tuned by Spontaneous Emissions
Authors:
J. H. Clark,
C. Groiseau,
Z. N. Shaw,
S. Dadras,
C. Binegar,
S. Wimberger,
G. S. Summy,
Y. Liu
Abstract:
We have realized a quantum walk in momentum space with a rubidium spinor Bose-Einstein condensate by applying a periodic kicking potential as a walk operator and a resonant microwave pulse as a coin toss operator. The generated quantum walks appear to be stable for up to ten steps and then quickly transition to classical walks due to spontaneous emissions induced by laser beams of the walk operator. We investigate these quantum to classical walk transitions by introducing well controlled spontaneous emissions with an external light source during quantum walks. Our findings demonstrate a scheme to control the robustness of the quantum walks and can also be applied to other cold atom experiments involving spontaneous emissions.
Submitted 20 August, 2021;
originally announced August 2021.
-
Defense Against Adversarial Swarms with Parameter Uncertainty
Authors:
Claire Walton,
Isaac Kaminer,
Qi Gong,
Abram H. Clark,
Theodoros Tsatsanifos
Abstract:
This paper addresses the problem of optimal defense of a High Value Unit against a large-scale swarm attack. We show that the problem can be cast in the framework of uncertain parameter optimal control and derive a consistency result for the dual problem of this framework. We show that the dual can be computed numerically and apply these numerical results to derive optimal defender strategies against a 100 agent swarm attack.
Submitted 9 August, 2021;
originally announced August 2021.
-
Modeling and Control of Large-Scale Adversarial Swarm Engagements
Authors:
Theodoros Tsatsanifos,
Abram H. Clark,
Claire Walton,
Isaac Kaminer,
Qi Gong
Abstract:
We theoretically and numerically study the problem of optimal control of large-scale autonomous systems under explicitly adversarial conditions, including probabilistic destruction of agents during the simulation. Large-scale autonomous systems often include an adversarial component, where different agents or groups of agents explicitly compete with one another. An important component of these systems that is not included in current theory or modeling frameworks is random destruction of agents in time. In this case, the modeling and optimal control framework should consider the attrition of agents as well as their position. We propose and test three numerical modeling schemes, where survival probabilities of all agents are smoothly and continuously decreased in time, based on the relative positions of all agents during the simulation. In particular, we apply these schemes to the case of agents defending a high-value unit from an attacking swarm. We show that these models can be successfully used to model this situation, provided that attrition and spatial dynamics are coupled. Our results have relevance to an entire class of adversarial autonomy situations, where the positions of agents and their survival probabilities are both important.
Submitted 4 August, 2021;
originally announced August 2021.
-
Darcy-Reynolds forces during intrusion into granular-fluid beds
Authors:
Joshua Strader,
Neil Causley,
Joshua A. Dijksman,
Abram H. Clark
Abstract:
We experimentally study intrusion into fluid-saturated granular beds by a free-falling sphere, varying particle size and fluid viscosity. We test our results against Darcy-Reynolds theory, where the deceleration of the sphere is controlled by Reynolds dilatancy and the Darcy flow resistance. We find the observed intruder dynamics are consistent with Darcy-Reynolds theory for varied particle size. We also find that our experimental results for varied viscosity are consistent with Darcy-Reynolds theory, but only for a limited range of the viscosity. For large viscosities, observed forces begin to decrease with increasing viscosity, in contrast with the theoretical prediction.
Submitted 27 July, 2021;
originally announced July 2021.
-
Theoretical rovibronic spectroscopy of the calcium monohydroxide radical (CaOH)
Authors:
Alec Owens,
Victoria H. J. Clark,
Alexander Mitrushchenkov,
Sergei N. Yurchenko,
Jonathan Tennyson
Abstract:
The rovibronic (rotation-vibration-electronic) spectrum of the calcium monohydroxide radical (CaOH) is of interest to studies of exoplanet atmospheres and ultracold molecules. Here, we theoretically investigate the $\tilde{A}\,^2Π$--$\tilde{X}\,^2Σ^+$ band system of CaOH using high-level \textit{ab initio} theory and variational nuclear motion calculations. New potential energy surfaces (PESs) are constructed for the $\tilde{X}\,^2Σ^+$ and $\tilde{A}\,^2Π$ electronic states along with $\tilde{A}$--$\tilde{X}$ transition dipole moment surfaces (DMSs). For the ground $\tilde{X}\,^2Σ^+$ state, a published high-level \textit{ab initio} PES is empirically refined to all available experimental rovibrational energy levels up to $J=15.5$, reproducing the observed term values with a root-mean-square (rms) error of 0.06~cm$^{-1}$. Large-scale multireference configuration interaction (MRCI) calculations using quintuple-zeta quality basis sets are employed to generate the $\tilde{A}\,^2Π$ state PESs and $\tilde{A}$--$\tilde{X}$ DMSs. Variational calculations consider both Renner-Teller and spin-orbit coupling effects, which are essential for a correct description of the spectrum of CaOH. Computed rovibronic energy levels of the $\tilde{A}\,^2Π$ state, line list calculations up to $J=125.5$, and an analysis of Renner-Teller splittings in the $ν_2$ bending mode of CaOH are discussed.
Submitted 23 July, 2021;
originally announced July 2021.
-
The vibrational properties of benzene on an ordered water ice surface
Authors:
Victoria H. J. Clark,
David M. Benoit
Abstract:
We present a hybrid CCSD(T)+PBE-D3 approach to calculating the vibrational signatures for gas phase benzene and benzene adsorbed on an ordered water-ice surface. We compare the results of our method against experimentally recorded spectra and calculations performed using PBE-D3-only approaches (harmonic and anharmonic). Calculations use a proton ordered XIh water-ice surface consisting of 288 water molecules, and results are compared against experimental spectra recorded for an ASW ice surface. We show the importance of including a water ice surface into spectroscopic calculations, owing to the resulting differences in vibrational modes, frequencies and intensities of transitions seen in the IR spectrum. The overall intensity pattern shifts from a dominating $ν_{11}$ band in the gas-phase to several high-intensity carriers for an IR spectrum of adsorbed benzene. When used for adsorbed benzene, the hybrid approach presented here achieves an RMSD for IR active modes of 21~cm$^{-1}$, compared to 72~cm$^{-1}$ and 49~cm$^{-1}$ for the anharmonic and harmonic PBE-D3 approaches, respectively. Our hybrid model for gaseous benzene also achieves the best results when compared to experiment, with an RMSD for IR active modes of 24~cm$^{-1}$, compared to 55~cm$^{-1}$ and 31~cm$^{-1}$ for the anharmonic and harmonic PBE-D3 approaches, respectively. To facilitate assignment, we generate and provide a correspondence graph between the normal modes of the gaseous and adsorbed benzene molecules. Finally, we calculate the frequency shifts, $Δν$, of adsorbed benzene relative to its gas phase to highlight the effects of surface interactions on vibrational bands and evaluate the suitability of our chosen dispersion-corrected density functional theory.
Submitted 15 July, 2021;
originally announced July 2021.
-
Modelling the non-local thermodynamic equilibrium spectra of silylene (SiH2)
Authors:
Victoria H. J. Clark,
Sergei N. Yurchenko
Abstract:
This paper sets out a robust methodology for modelling spectra of polyatomic molecules produced in reactive or dissociative environments, with vibrational populations outside local thermal equilibrium (LTE). The methodology is based on accurate, extensive ro-vibrational line lists containing transitions with high vibrational excitations and relies on detailed ro-vibrational assignments. The developed methodology is applied to model non-LTE IR and visible spectra of silylene (SiH$_2$) produced in the decomposition of disilane (Si$_2$H$_6$), a reaction of technological importance. Two approaches for non-LTE vibrational populations of the product SiH$_2$ are introduced: a simplistic 1D approach based on the harmonic approximation and a full 3D model incorporating accurate vibrational wavefunctions of SiH$_2$ computed variationally with the TROVE (Theoretical ROVibrational Energy) program. We show how their non-LTE spectral signatures can be used to trace different reaction channels of molecular dissociations.
Submitted 28 April, 2021;
originally announced April 2021.
-
CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation
Authors:
Jonathan H. Clark,
Dan Garrette,
Iulia Turc,
John Wieting
Abstract:
Pipelined NLP systems have largely been superseded by end-to-end neural modeling, yet nearly all commonly-used models still require an explicit tokenization step. While recent tokenization approaches based on data-derived subword lexicons are less brittle than manually engineered tokenizers, these techniques are not equally suited to all languages, and the use of any fixed vocabulary may limit a model's ability to adapt. In this paper, we present CANINE, a neural encoder that operates directly on character sequences, without explicit tokenization or vocabulary, and a pre-training strategy that operates either directly on characters or optionally uses subwords as a soft inductive bias. To use its finer-grained input effectively and efficiently, CANINE combines downsampling, which reduces the input sequence length, with a deep transformer stack, which encodes context. CANINE outperforms a comparable mBERT model by 2.8 F1 on TyDi QA, a challenging multilingual benchmark, despite having 28% fewer model parameters.
Submitted 18 May, 2022; v1 submitted 11 March, 2021;
originally announced March 2021.
-
Viscous-like forces control the impact response of shear-thickening dense suspensions
Authors:
Marc-Andre Brassard,
Neil Causley,
Nasser Krizou,
Joshua A. Dijksman,
Abram H. Clark
Abstract:
We experimentally and theoretically study impacts into dense cornstarch and water suspensions. We vary impact speed as well as intruder size, shape, and mass, and we characterize the resulting dynamics using high-speed video and an onboard accelerometer. We numerically solve previously proposed models, most notably the added-mass model as well as a class of viscous-like models. In the viscous-like models, the intruder dynamics are dominated by large, viscous-like forces at the boundary of the jammed front where large shear rates and accompanying large viscosities are present. We find that our experimental data are consistent with this class of models and inconsistent with the added-mass model. Our results strongly suggest that the added-mass model, which is the dominant model for understanding the dynamics of impact into shear-thickening dense suspensions, should be updated to include these viscous-like forces.
Submitted 27 July, 2021; v1 submitted 23 November, 2020;
originally announced November 2020.
-
CapWAP: Captioning with a Purpose
Authors:
Adam Fisch,
Kenton Lee,
Ming-Wei Chang,
Jonathan H. Clark,
Regina Barzilay
Abstract:
The traditional image captioning task uses generic reference captions to provide textual information about images. Different user populations, however, will care about different visual aspects of images. In this paper, we propose a new task, Captioning with a Purpose (CapWAP). Our goal is to develop systems that can be tailored to be useful for the information needs of an intended population, rather than merely provide generic information about an image. In this task, we use question-answer (QA) pairs---a natural expression of information need---from users, instead of reference captions, for both training and post-inference evaluation. We show that it is possible to use reinforcement learning to directly optimize for the intended information need, by rewarding outputs that allow a question answering model to provide correct answers to sampled user questions. We convert several visual question answering datasets into CapWAP datasets, and demonstrate that under a variety of scenarios our purposeful captioning system learns to anticipate and fulfill specific information needs better than its generic counterparts, as measured by QA performance on user questions from unseen images, when using the caption alone as context.
Submitted 9 November, 2020;
originally announced November 2020.
-
Learning to Recognize Dialect Features
Authors:
Dorottya Demszky,
Devyani Sharma,
Jonathan H. Clark,
Vinodkumar Prabhakaran,
Jacob Eisenstein
Abstract:
Building NLP systems that serve everyone requires accounting for dialect differences. But dialects are not monolithic entities: rather, distinctions between and within dialects are captured by the presence, absence, and frequency of dozens of dialect features in speech and text, such as the deletion of the copula in "He {} running". In this paper, we introduce the task of dialect feature detection, and present two multitask learning approaches, both based on pretrained transformers. For most dialects, large-scale annotated corpora for these features are unavailable, making it difficult to train recognizers. We train our models on a small number of minimal pairs, building on how linguists typically define dialect features. Evaluation on a test set of 22 dialect features of Indian English demonstrates that these models learn to recognize many features with high accuracy, and that a few minimal pairs can be as effective for training as thousands of labeled examples. We also demonstrate the downstream applicability of dialect feature detection both as a measure of dialect density and as a dialect classifier.
Submitted 6 May, 2021; v1 submitted 23 October, 2020;
originally announced October 2020.