Skip to main content

Showing 1–50 of 100 results for author: Clark, H

.
  1. arXiv:2406.19117  [pdf, other

    astro-ph.EP astro-ph.GA astro-ph.SR physics.chem-ph

    Hybrid approach predicts a lower binding energy for benzene on water ice

    Authors: Victoria H. J. Clark, David M. Benoit, Marie Van de Sande, Catherine Walsh

    Abstract: In this paper we provide a highly accurate value for the binding energy of benzene to proton-ordered crystalline water ice (XIh), as a model for interstellar ices. We compare our computed value to the latest experimental data available from temperature programmed desorption (TPD) experiments and find that our binding energy value agrees well with data obtained from binding to either crystalline or… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.18753  [pdf, other

    astro-ph.IM astro-ph.HE

    Curved detectors for future X-ray astrophysics missions

    Authors: Eric D. Miller, James A. Gregory, Marshall W. Bautz, Harry R. Clark, Michael Cooper, Kevan Donlon, Richard F. Foster, Catherine E. Grant, Mallory Jensen, Beverly LaMarr, Renee Lambert, Christopher Leitz, Andrew Malonis, Mo Neak, Gregory Prigozhin, Kevin Ryu, Benjamin Schneider, Keith Warner, Douglas J. Young, William W. Zhang

    Abstract: Future X-ray astrophysics missions will survey large areas of the sky with unparalleled sensitivity, enabled by lightweight, high-resolution optics. These optics inherently produce curved focal surfaces with radii as small as 2 m, requiring a large area detector system that closely conforms to the curved focal surface. We have embarked on a project using a curved charge-coupled device (CCD) detect… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 18 pages, 13 figures, submitted to the Proceedings of SPIE, Astronomical Telescopes + Instrumentation 2024

  3. arXiv:2405.13236  [pdf, ps, other

    cond-mat.soft physics.geo-ph

    Granular temperature controls local rheology of vibrated granular flows

    Authors: Mitchell G. Irmer, Emily E. Brodsky, Abram H. Clark

    Abstract: We use numerical simulations to demonstrate a local rheology for sheared, vibrated granular flows. We consider a granular assembly that is subjected to simple shear and harmonic vibration at the boundary. This configuration allows us to isolate the effects of vibration, as parameterized by granular temperature. We find that friction is reduced due to local velocity fluctuations of grains. All data… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  4. arXiv:2405.09605  [pdf, other

    cs.CL cs.AI cs.LG

    Elements of World Knowledge (EWOK): A cognition-inspired framework for evaluating basic world knowledge in language models

    Authors: Anna A. Ivanova, Aalok Sathe, Benjamin Lipkin, Unnathi Kumar, Setayesh Radkani, Thomas H. Clark, Carina Kauf, Jennifer Hu, R. T. Pramod, Gabriel Grand, Vivian Paulun, Maria Ryskina, Ekin Akyürek, Ethan Wilcox, Nafisa Rashid, Leshem Choshen, Roger Levy, Evelina Fedorenko, Joshua Tenenbaum, Jacob Andreas

    Abstract: The ability to build and leverage world models is essential for a general-purpose AI agent. Testing such capabilities is hard, in part because the building blocks of world models are ill-defined. We present Elements of World Knowledge (EWOK), a framework for evaluating world modeling in language models by testing their ability to use knowledge of a concept to match a target text with a plausible/i… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 21 pages (11 main), 7 figures. Authors Anna Ivanova, Aalok Sathe, Benjamin Lipkin contributed equally

  5. arXiv:2405.06614  [pdf, other

    cond-mat.soft physics.geo-ph

    An explicit granular-mechanics approach to marine sediment acoustics

    Authors: Abram H. Clark, Derek R. Olson, Andrew J. Swartz, W. Mason Starnes

    Abstract: Here we theoretically and computationally study the frequency dependence of phase speed and attenuation for marine sediments from the perspective of granular mechanics. We leverage recent theoretical insights from the granular physics community as well as discrete-element method simulations, where the granular material is treated as a packing of discrete objects that interact via pairwise forces.… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: Accepted for publication in the Journal of the Acoustical Society of America

  6. arXiv:2404.01119  [pdf, other

    eess.SP

    Automatic Modulation Classification using a Waveform Signature

    Authors: William H. Clark IV, Joseph M. Ernst, Robert W. McGwier

    Abstract: Cognitive Radios (CRs) build upon Software Defined Radios (SDRs) to allow for autonomous reconfiguration of communication architectures. In recent years, CRs have been identified as an enabler for Dynamic Spectrum Access (DSA) applications in which secondary users opportunistically share licensed spectrum. A major challenge for DSA is accurately characterizing the spectral environment, which requi… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 10 pages, 13 figures, 6 tables, conference, WInnComm16 --fixed

  7. arXiv:2402.17934  [pdf, other

    cs.CL cs.AI

    Multitask Multilingual Model Adaptation with Featurized Low-Rank Mixtures

    Authors: Chu-Cheng Lin, Xinyi Wang, Jonathan H. Clark, Han Lu, Yun Zhu, Chenxi Whitehouse, Hongkun Yu

    Abstract: Adapting pretrained large language models (LLMs) to various downstream tasks in tens or hundreds of human languages is computationally expensive. Parameter-efficient fine-tuning (PEFT) significantly reduces the adaptation cost, by tuning only a small amount of parameters. However, directly applying PEFT methods such as LoRA (Hu et al., 2022) on diverse dataset mixtures could lead to suboptimal per… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  8. Investigating heterogeneous PSMA ligand uptake inside parotid glands

    Authors: Caleb Sample, Carlos Uribe, Arman Rahmim, François Bénard, Jonn Wu, Haley Clark

    Abstract: The purpose was to investigate the spatial heterogeneity of prostate-specific membrane antigen (PSMA) positron emission tomography (PET) uptake within parotid glands. We aim to quantify patterns in well-defined regions to facilitate further investigations. Furthermore, we investigate whether uptake is correlated with computed tomography (CT) texture features. Parotid glands from [18F]DCFPyL PSMA P… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Journal ref: Physica Medica: European Journal of Medical Physics, 2024

  9. arXiv:2401.02394  [pdf, other

    physics.med-ph eess.IV

    Image denoising and model-independent parameterization for improving IVIM MRI

    Authors: Caleb Sample, Jonn Wu, Haley Clark

    Abstract: Variability of IVIM parameters throughout the literature is a long-standing issue, and perfusion-related parameters are difficult to interpret. We demonstrate for improving the analysis of intravoxel incoherent motion imaging (IVIM) magnetic resonance (MR) images, using image denoising and a quantitative approach that does not require imposing specific exponential models. IVIM images were acquired… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  10. PSMA PET/CT as a predictive tool for sub-regional importance estimates in the parotid gland

    Authors: Caleb Sample, Arman Rahmim, François Bénard, Jonn Wu, Haley Clark

    Abstract: Xerostomia and radiation-induced salivary gland dysfunction remain a common side effect for head-and-neck radiotherapy patients, and attempts have been made to quantify the heterogeneous dose response within parotid glands. Here several models of parotid gland subregional importance are compared with prostate specific membrane antigen (PSMA) positron emission tomography (PET) uptake. PSMA ligands… ▽ More

    Submitted 4 January, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

    Comments: 9 Figures, 7 Tables

  11. arXiv:2309.04663  [pdf, other

    cs.CL cs.AI

    FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning

    Authors: Xinyi Wang, John Wieting, Jonathan H. Clark

    Abstract: Learning paradigms for large language models (LLMs) currently tend to fall within either in-context learning (ICL) or full fine-tuning. Each of these comes with their own trade-offs based on available data, model size, compute cost, ease-of-use, and final quality with neither solution performing well across-the-board. In this article, we first describe ICL and fine-tuning paradigms in a way that h… ▽ More

    Submitted 12 September, 2023; v1 submitted 8 September, 2023; originally announced September 2023.

  12. Neural blind deconvolution for deblurring and supersampling PSMA PET

    Authors: Caleb Sample, Arman Rahmim, Carlos Uribe, François Bénard, Jonn Wu, Roberto Fedrigo, Haley Clark

    Abstract: Objective: To simultaneously deblur and supersample prostate specific membrane antigen (PSMA) positron emission tomography (PET) images using neural blind deconvolution. Approach: Blind deconvolution is a method of estimating the hypothetical "deblurred" image along with the blur kernel (related to the point spread function) simultaneously. Traditional \textit{maximum a posteriori} blind deconvolu… ▽ More

    Submitted 2 March, 2024; v1 submitted 1 September, 2023; originally announced September 2023.

    Comments: 10 Figures, 4 Tables, 19 pages

  13. arXiv:2308.07286  [pdf, other

    cs.CL cs.LG

    The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation

    Authors: Patrick Fernandes, Daniel Deutsch, Mara Finkelstein, Parker Riley, André F. T. Martins, Graham Neubig, Ankush Garg, Jonathan H. Clark, Markus Freitag, Orhan Firat

    Abstract: Automatic evaluation of machine translation (MT) is a critical tool driving the rapid iterative development of MT systems. While considerable progress has been made on estimating a single scalar quality score, current metrics lack the informativeness of more detailed schemes that annotate individual errors, such as Multidimensional Quality Metrics (MQM). In this paper, we help fill this gap by pro… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 19 pages

  14. arXiv:2306.03734  [pdf, other

    cs.CL

    A Cross-Linguistic Pressure for Uniform Information Density in Word Order

    Authors: Thomas Hikaru Clark, Clara Meister, Tiago Pimentel, Michael Hahn, Ryan Cotterell, Richard Futrell, Roger Levy

    Abstract: While natural languages differ widely in both canonical word order and word order flexibility, their word orders still follow shared cross-linguistic statistical patterns, often attributed to functional pressures. In the effort to identify these pressures, prior work has compared real and counterfactual word orders. Yet one functional pressure has been overlooked in such investigations: the unifor… ▽ More

    Submitted 9 July, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

  15. arXiv:2305.14332  [pdf, other

    cs.CL

    Evaluating and Modeling Attribution for Cross-Lingual Question Answering

    Authors: Benjamin Muller, John Wieting, Jonathan H. Clark, Tom Kwiatkowski, Sebastian Ruder, Livio Baldini Soares, Roee Aharoni, Jonathan Herzig, Xinyi Wang

    Abstract: Trustworthy answer content is abundant in many high-resource languages and is instantly accessible through question answering systems, yet this content can be hard to access for those that do not speak these languages. The leap forward in cross-lingual modeling quality offered by generative language models offers much promise, yet their raw generations often fall short in factuality. To improve tr… ▽ More

    Submitted 15 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Published as a long paper at EMNLP 2023

  16. XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages

    Authors: Sebastian Ruder, Jonathan H. Clark, Alexander Gutkin, Mihir Kale, Min Ma, Massimo Nicosia, Shruti Rijhwani, Parker Riley, Jean-Michel A. Sarr, Xinyi Wang, John Wieting, Nitish Gupta, Anna Katanova, Christo Kirov, Dana L. Dickinson, Brian Roark, Bidisha Samanta, Connie Tao, David I. Adelani, Vera Axelrod, Isaac Caswell, Colin Cherry, Dan Garrette, Reeve Ingle, Melvin Johnson , et al. (2 additional authors not shown)

    Abstract: Data scarcity is a crucial issue for the development of highly multilingual NLP systems. Yet for many under-represented languages (ULs) -- languages for which NLP re-search is particularly far behind in meeting user needs -- it is feasible to annotate small amounts of data. Motivated by this, we propose XTREME-UP, a benchmark defined by: its focus on the scarce-data scenario rather than zero-shot;… ▽ More

    Submitted 24 May, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

  17. arXiv:2305.11787  [pdf, other

    physics.med-ph

    Improving the modeling of the Agility multi-leaf collimator

    Authors: Mohammad Hussein, Agnes Angerud, Jordi Saez, Evelien Bogaert, Matthieu Lemire, Miriam Barry, Ileana Silvestre Patallo, David Shipley, Catharine H Clark, Victor Hernandez

    Abstract: Robust fine tuning of multi-leaf collimator (MLC) Treatment Planning System (TPS) modeling parameters is crucial for creating an optimal beam model, particularly with the ever-increasing accuracy required for advancing techniques. Challenges arise from balancing the trade-off between multiple parameters and therefore the quality of tuning depends on the experience of the physicist and the procedur… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  18. arXiv:2305.10403  [pdf, other

    cs.CL cs.AI

    PaLM 2 Technical Report

    Authors: Rohan Anil, Andrew M. Dai, Orhan Firat, Melvin Johnson, Dmitry Lepikhin, Alexandre Passos, Siamak Shakeri, Emanuel Taropa, Paige Bailey, Zhifeng Chen, Eric Chu, Jonathan H. Clark, Laurent El Shafey, Yan** Huang, Kathy Meier-Hellstern, Gaurav Mishra, Erica Moreira, Mark Omernick, Kevin Robinson, Sebastian Ruder, Yi Tay, Kefan Xiao, Yuanzhong Xu, Yu**g Zhang, Gustavo Hernandez Abrego , et al. (103 additional authors not shown)

    Abstract: We introduce PaLM 2, a new state-of-the-art language model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. PaLM 2 is a Transformer-based model trained using a mixture of objectives. Through extensive evaluations on English and multilingual language, and reasoning tasks, we demonstrate that PaLM 2 has significantly improved quality on… ▽ More

    Submitted 13 September, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  19. arXiv:2305.06897  [pdf, other

    cs.CL cs.AI cs.IR

    AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages

    Authors: Odunayo Ogundepo, Tajuddeen R. Gwadabe, Clara E. Rivera, Jonathan H. Clark, Sebastian Ruder, David Ifeoluwa Adelani, Bonaventure F. P. Dossou, Abdou Aziz DIOP, Claytone Sikasote, Gilles Hacheme, Happy Buzaaba, Ignatius Ezeani, Rooweither Mabuya, Salomey Osei, Chris Emezue, Albert Njoroge Kahira, Shamsuddeen H. Muhammad, Akintunde Oladipo, Abraham Toluwase Owodunni, Atnafu Lambebo Tonja, Iyanuoluwa Shode, Akari Asai, Tunde Oluwaseyi Ajayi, Clemencia Siro, Steven Arthur , et al. (27 additional authors not shown)

    Abstract: African languages have far less in-language content available digitally, making it challenging for question answering systems to satisfy the information needs of users. Cross-lingual open-retrieval question answering (XOR QA) systems -- those that retrieve answer content from other languages while serving people in their native language -- offer a means of filling this gap. To this end, we create… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

  20. The Earliest Stage of Galactic Star Formation

    Authors: Charles L. Steinhardt, Vadim Rusakov, Thomas H. Clark, Andrei Diaconu, Conor McPartland, John Forbes, Albert Sneppen, John Weaver

    Abstract: Using a recently-developed technique to estimate gas temperatures ($T_\textrm{SF}$) in star-forming regions from large photometric surveys, we propose a diagram, analogous to the Hertzsprung-Russell diagram for individual stars, to probe the evolution of individual galaxies. On this $T_\textrm{SF}$-sSFR (specific star formation rate) diagram, a small fraction of star-forming galaxies appear to be… ▽ More

    Submitted 22 June, 2023; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: ApJL 949, L38

  21. arXiv:2212.10726  [pdf, other

    cs.CL cs.LG

    Beyond Contrastive Learning: A Variational Generative Model for Multilingual Retrieval

    Authors: John Wieting, Jonathan H. Clark, William W. Cohen, Graham Neubig, Taylor Berg-Kirkpatrick

    Abstract: Contrastive learning has been successfully used for retrieval of semantically aligned sentences, but it often requires large batch sizes or careful engineering to work well. In this paper, we instead propose a generative model for learning multilingual text embeddings which can be used to retrieve or score sentence pairs. Our model operates on parallel data in $N$ languages and, through an approxi… ▽ More

    Submitted 4 June, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Published as a long paper at ACL 2023

  22. arXiv:2211.10441  [pdf

    physics.med-ph cs.RO eess.SY math.OC

    Rastreo muscular móvil usando magnetomicrometría -- traducción al español del articulo "Untethered Muscle Tracking Using Magnetomicrometry" por el autor Cameron R. Taylor

    Authors: Cameron R. Taylor, Seong Ho Yeon, William H. Clark, Ellen G. Clarrissimeaux, Mary Kate O'Donnell, Thomas J. Roberts, Hugh M. Herr

    Abstract: Muscle tissue drives nearly all movement in the animal kingdom, providing power, mobility, and dexterity. Technologies for measuring muscle tissue motion, such as sonomicrometry, fluoromicrometry, and ultrasound, have significantly advanced our understanding of biomechanics. Yet, the field lacks the ability to monitor muscle tissue motion for animal behavior outside the lab. Towards addressing thi… ▽ More

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: in Spanish language. Translation of the postprint, with the published version in English appended to the end of the PDF. Shared First Authorship: Cameron R. Taylor and Seong Ho Yeon; Shared Senior and Corresponding Authorship: Thomas J. Roberts and Hugh M. Herr

    Journal ref: Front. Bioeng. Biotechnol. 10:1010275 (2022)

  23. Angular Diameters and Fundamental Parameters of Forty-Four Stars from the Navy Precision Optical Interferometer

    Authors: Ellyn K. Baines, J. Thomas Armstrong, James H. Clark III, Jim Gorney, Donald J. Hutter, Anders M. Jorgensen, Casey Kyte, David Mozurkewich, Ishara Nisley, Jason Sanborn, Henrique R. Schmitt, Gerard T. van Belle

    Abstract: We measured the angular diameters of 44 stars with the Navy Precision Optical Interferometer, obtaining uncertainties on the limb darkened diameter of 2% or less for all but four stars. We then used our diameters with Gaia or Hipparcos parallaxes to calculate each star's physical radius. We gathered information from the literature to determine bolometric flux and luminosity, and combined that with… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 13 pages, 3 figures, 6 tables. arXiv admin note: substantial text overlap with arXiv:1712.08109

    Journal ref: 2021AJ....162..198B

  24. arXiv:2210.11898  [pdf, other

    cond-mat.quant-gas physics.atom-ph

    Detecting Topological phase transitions in a double kicked quantum rotor

    Authors: Nikolai Bolik, Caspar Groiseau, Jerry H. Clark, Gil S. Summy, Yingmei Liu, Sandro Wimberger

    Abstract: We present a concrete theoretical proposal for detecting topological phase transitions in double kicked atom-optics kicked rotors with internal spin-1/2 degree of freedom. The implementation utilizes a kicked Bose-Einstein condensate evolving in one-dimensional momentum space. To reduce influence of atom loss and phase decoherence we aim to keep experimental durations short while maintaining a res… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Journal ref: Phys. Rev. A 106, 043318 (2022)

  25. arXiv:2207.04608  [pdf, ps, other

    cond-mat.soft physics.geo-ph

    Frictional weakening of vibrated granular flows

    Authors: Abram H. Clark, H. John Nasrin, Stephanie E. Taylor, Emily E. Brodsky

    Abstract: We computationally study the frictional properties of sheared granular media subjected to harmonic vibration applied at the boundary. Such vibrations are thought to play an important role in weakening flows, yet the independent effects of amplitude, frequency, and pressure on the process have remained unclear. Based on a dimensional analysis and DEM simulations, we show that, in addition to a prev… ▽ More

    Submitted 1 March, 2023; v1 submitted 11 July, 2022; originally announced July 2022.

  26. arXiv:2207.00758  [pdf, other

    cs.CL

    MIA 2022 Shared Task: Evaluating Cross-lingual Open-Retrieval Question Answering for 16 Diverse Languages

    Authors: Akari Asai, Shayne Longpre, Jungo Kasai, Chia-Hsuan Lee, Rui Zhang, Junjie Hu, Ikuya Yamada, Jonathan H. Clark, Eunsol Choi

    Abstract: We present the results of the Workshop on Multilingual Information Access (MIA) 2022 Shared Task, evaluating cross-lingual open-retrieval question answering (QA) systems in 16 typologically diverse languages. In this task, we adapted two large-scale cross-lingual open-retrieval QA datasets in 14 typologically diverse languages, and newly annotated open-retrieval QA data in 2 underrepresented langu… ▽ More

    Submitted 2 July, 2022; originally announced July 2022.

    Comments: NAACL Workshop on Multilingual Information Access

  27. Implications of a Temperature Dependent IMF III: Mass Growth and Quiescence

    Authors: Charles L. Steinhardt, Albert Sneppen, Hagan Hensley, Adam S. Jermyn, Basel Mostafa, John R. Weaver, Gabriel Brammer, Thomas H. Clark, Iary Davidzon, Andrei C. Diaconu, Bahram Mobasher, Vadim Rusakov, Sune Toft

    Abstract: The stellar initial mass function (IMF) is predicted to depend upon the temperature of gas in star-forming molecular clouds. The introduction of an additional parameter, $T_{IMF}$ , into photometric template fitting, suggest most galaxies obey an IMF top-heavier than the Galactic IMF. The implications of these revised fits on mass functions, quiescence and turnoff are discussed. At all redshifts t… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

    Comments: 11 pages, ApJ, in press

  28. Implications of a Temperature Dependent IMF II: An Updated View of the Star-Forming Main Sequence

    Authors: Charles L. Steinhardt, Albert Sneppen, Basel Mostafa, Hagan Hensley, Adam S. Jermyn, Adrian Lopez, John Weaver, Gabriel Brammer, Thomas H. Clark, Iary Davidzon, Andrei C. Diaconu, Bahram Mobasher, Vadim Rusakov, Sune Toft

    Abstract: The stellar initial mass function (IMF) is predicted to depend upon the temperature of gas in star-forming molecular clouds. The introduction of an additional parameter, $T_{IMF}$ , into photometric template fitting, allows galaxies to be fit with a range of IMFs. Three surprising new features appear: (1) most star-forming galaxies are best fit with a bottom-lighter IMF than the Milky Way; (2) mos… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: 11 pages, ApJ 931, 58

  29. arXiv:2205.07732  [pdf, other

    quant-ph cond-mat.quant-gas

    Light-shift induced behaviors observed in momentum-space quantum walks

    Authors: Nikolai Bolik, Caspar Groiseau, Jerry H. Clark, Alexander Gresch, Siamak Dadras, Gil S. Summy, Yingmei Liu, Sandro Wimberger

    Abstract: Over the last decade there have been many advances in studies of quantum walks (QWs) including a momentum-space QW recently realized in our spinor Bose-Einstein condensate system. This QW possessed behaviors that generally agreed with theoretical predictions; however, it also showed momentum distributions that were not adequately explained by the theory. We present a theoretical model which proves… ▽ More

    Submitted 26 September, 2022; v1 submitted 16 May, 2022; originally announced May 2022.

    Comments: experimental and theoretical paper on discrete-time quantum walks

    Journal ref: Phys. Rev. A 106, 033307 (2022)

  30. arXiv:2205.03703  [pdf, other

    cs.LG eess.SP

    Training from Zero: Radio Frequency Machine Learning Data Quantity Forecasting

    Authors: William H. Clark IV, Alan J. Michaels

    Abstract: The data used during training in any given application space is directly tied to the performance of the system once deployed. While there are many other factors that go into producing high performance models within machine learning, there is no doubt that the data used to train a system provides the foundation from which to build. One of the underlying rule of thumb heuristics used within the mach… ▽ More

    Submitted 14 June, 2024; v1 submitted 7 May, 2022; originally announced May 2022.

    Comments: 20 pages, 8 figures, submitted to MDPI Telecom

  31. arXiv:2203.17213  [pdf, other

    cs.CL

    Analyzing Wrap-Up Effects through an Information-Theoretic Lens

    Authors: Clara Meister, Tiago Pimentel, Thomas Hikaru Clark, Ryan Cotterell, Roger Levy

    Abstract: Numerous analyses of reading time (RT) data have been implemented -- all in an effort to better understand the cognitive processes driving reading comprehension. However, data measured on words at the end of a sentence -- or even at the end of a clause -- is often omitted due to the confounding factors introduced by so-called "wrap-up effects," which manifests as a skewed distribution of RTs for t… ▽ More

    Submitted 5 January, 2024; v1 submitted 31 March, 2022; originally announced March 2022.

    Comments: ACL 2022 (main conference)

  32. arXiv:2203.17189  [pdf, other

    cs.LG cs.CL

    Scaling Up Models and Data with $\texttt{t5x}$ and $\texttt{seqio}$

    Authors: Adam Roberts, Hyung Won Chung, Anselm Levskaya, Gaurav Mishra, James Bradbury, Daniel Andor, Sharan Narang, Brian Lester, Colin Gaffney, Afroz Mohiuddin, Curtis Hawthorne, Aitor Lewkowycz, Alex Salcianu, Marc van Zee, Jacob Austin, Sebastian Goodman, Livio Baldini Soares, Haitang Hu, Sasha Tsvyashchenko, Aakanksha Chowdhery, Jasmijn Bastings, Jannis Bulian, Xavier Garcia, Jianmo Ni, Andrew Chen , et al. (18 additional authors not shown)

    Abstract: Recent neural network-based language models have benefited greatly from scaling up the size of training datasets and the number of parameters in the models themselves. Scaling can be complicated due to various factors including the need to distribute computation on supercomputer clusters (e.g., TPUs), prevent bottlenecks when infeeding data, and ensure reproducible results. In this work, we presen… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

  33. arXiv:2203.16295  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci

    Crystal-field states and defect levels in candidate quantum spin ice Ce$_{2}$Hf$_{2}$O$_{7}$

    Authors: Victor Porée, Elsa Lhotel, Sylvain Petit, Aleksandra Krajewska, Pascal Puphal, Adam H. Clark, Vladimir Pomjakushin, Helen C. Walker, Nicolas Gauthier, Dariusz J. Gawryluk, Romain Sibille

    Abstract: We report the synthesis of powder and single-crystal samples of the cerium pyrohafnate and their characterization using neutron diffraction, thermogravimetry and X-ray absorption spectroscopy. We evaluate the amount of non-magnetic Ce$^{4+}$ defects and use this result to interpret the spectrum of crystal-electric field transitions observed using inelastic neutron scattering. The analysis of these… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

  34. arXiv:2203.10752  [pdf, other

    cs.CL

    XTREME-S: Evaluating Cross-lingual Speech Representations

    Authors: Alexis Conneau, Ankur Bapna, Yu Zhang, Min Ma, Patrick von Platen, Anton Lozhkov, Colin Cherry, Ye Jia, Clara Rivera, Mihir Kale, Daan Van Esch, Vera Axelrod, Simran Khanuja, Jonathan H. Clark, Orhan Firat, Michael Auli, Sebastian Ruder, Jason Riesa, Melvin Johnson

    Abstract: We introduce XTREME-S, a new benchmark to evaluate universal cross-lingual speech representations in many languages. XTREME-S covers four task families: speech recognition, classification, speech-to-text translation and retrieval. Covering 102 languages from 10+ language families, 3 different domains and 4 task families, XTREME-S aims to simplify multilingual speech representation evaluation, as w… ▽ More

    Submitted 13 April, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: Minor fix: language code for Filipino (Tagalog), "tg" -> "tl"

  35. arXiv:2111.08709  [pdf, other

    astro-ph.IM astro-ph.CO astro-ph.EP astro-ph.GA astro-ph.HE

    Faint objects in motion: the new frontier of high precision astrometry

    Authors: Fabien Malbet, Céline Boehm, Alberto Krone-Martins, Antonio Amorim, Guillem Anglada-Escudé, Alexis Brandeker, Frédéric Courbin, Torsten Enßlin, Antonio Falcão, Katherine Freese, Berry Holl, Lucas Labadie, Alain Léger, Gary Mamon, Barbara Mcarthur, Alcione Mora, Mike Shao, Alessandro Sozzetti, Douglas Spolyar, Eva Villaver, Ummi Abbas, Conrado Albertus, João Alves, Rory Barnes, Aldo Stefano Bonomo , et al. (61 additional authors not shown)

    Abstract: Sky survey telescopes and powerful targeted telescopes play complementary roles in astronomy. In order to investigate the nature and characteristics of the motions of very faint objects, a flexibly-pointed instrument capable of high astrometric accuracy is an ideal complement to current astrometric surveys and a unique tool for precision astrophysics. Such a space-based mission will push the front… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:1910.08028, arXiv:1707.01348

    Journal ref: Experimental Astronomy, Springer Link, 2021, 51 (3), pp.845-886

  36. arXiv:2111.04859  [pdf, other

    astro-ph.EP astro-ph.SR physics.chem-ph

    ExoMol line lists -- XLIV. IR and UV line list for silicon monoxide (SiO)

    Authors: Sergei N. Yurchenko, Jonathan Tennyson, Anna-Maree Syme, Ahmad Y. Adam, Victoria H. J. Clark, Bridgette Cooper, C. Pria Dobney, Shaun T. E. Donnelly, Maire N. Gorman, Anthony E. Lynas-Gray, Thomas Meltzer, Alec Owens, Qianwei Qu, Mikhail Semenov, Wilfrid Somogyi, Apoorva Upadhyay, Samuel Wright, Juan C. Zapata Trujillo

    Abstract: A new silicon monoxide ($^{28}$Si$^{16}$O) line list covering infrared, visible and ultraviolet regions called SiOUVenIR is presented. This line list extends the infrared EBJT ExoMol line list by including vibronic transitions to the $A\,{}^{1}Π$ and $E\,{}^{1}Σ^{+}$ electronic states. Strong perturbations to the $A\,{}^{1}Π$ band system are accurately modelled through the treatment of 6 dark elec… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.

  37. arXiv:2110.13254  [pdf, other

    cs.CV cs.LG

    Pediatric Otoscopy Video Screening with Shift Contrastive Anomaly Detection

    Authors: Weiyao Wang, Aniruddha Tamhane, Christine Santos, John R. Rzasa, James H. Clark, Therese L. Canares, Mathias Unberath

    Abstract: Ear related concerns and symptoms represents the leading indication for seeking pediatric healthcare attention. Despite the high incidence of such encounters, the diagnostic process of commonly encountered disease of the middle and external presents significant challenge. Much of this challenge stems from the lack of cost effective diagnostic testing, which necessitating the presence or absence of… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

  38. arXiv:2110.10329  [pdf, other

    cs.CL cs.LG

    SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training

    Authors: Ankur Bapna, Yu-an Chung, Nan Wu, Anmol Gulati, Ye Jia, Jonathan H. Clark, Melvin Johnson, Jason Riesa, Alexis Conneau, Yu Zhang

    Abstract: Unsupervised pre-training is now the predominant approach for both text and speech understanding. Self-attention models pre-trained on large amounts of unannotated data have been hugely successful when fine-tuned on downstream tasks from a variety of domains and languages. This paper takes the universality of unsupervised language pre-training one step further, by unifying speech and text pre-trai… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

  39. arXiv:2109.04810  [pdf, other

    cs.CL

    Mixture-of-Partitions: Infusing Large Biomedical Knowledge Graphs into BERT

    Authors: Zaiqiao Meng, Fangyu Liu, Thomas Hikaru Clark, Ehsan Shareghi, Nigel Collier

    Abstract: Infusing factual knowledge into pre-trained models is fundamental for many knowledge-intensive tasks. In this paper, we proposed Mixture-of-Partitions (MoP), an infusion approach that can handle a very large knowledge graph (KG) by partitioning it into smaller sub-graphs and infusing their specific knowledge into various BERT models using lightweight adapters. To leverage the overall factual knowl… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021 camera-ready version

  40. arXiv:2108.09276  [pdf, ps, other

    quant-ph cond-mat.quant-gas

    Quantum to Classical Walk Transitions Tuned by Spontaneous Emissions

    Authors: J. H. Clark, C. Groiseau, Z. N. Shaw, S. Dadras, C. Binegar, S. Wimberger, G. S. Summy, Y. Liu

    Abstract: We have realized a quantum walk in momentum space with a rubidium spinor Bose-Einstein condensate by applying a periodic kicking potential as a walk operator and a resonant microwave pulse as a coin toss operator. The generated quantum walks appear to be stable for up to ten steps and then quickly transit to classical walks due to spontaneous emissions induced by laser beams of the walk operator.… ▽ More

    Submitted 20 August, 2021; originally announced August 2021.

  41. arXiv:2108.04205  [pdf, other

    math.OC

    Defense Against Adversarial Swarms with Parameter Uncertainty

    Authors: Claire Walton, Isaac Kaminer, Qi Gong, Abram. H. Clark, Theodoros Tsatsanifos

    Abstract: This paper addresses the problem of optimal defense of a High Value Unit against a large-scale swarm attack. We show that the problem can be cast in the framework of uncertain parameter optimal control and derive a consistency result for the dual problem of this framework. We show that the dual can be computed numerically and apply these numerical results to derive optimal defender strategies agai… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

  42. arXiv:2108.02311  [pdf, other

    math.OC

    Modeling and Control of Large-Scale Adversarial Swarm Engagements

    Authors: Theodoros Tsatsanifos, Abram H. Clark, Claire Walton, Isaac Kaminer, Qi Gong

    Abstract: We theoretically and numerically study the problem of optimal control of large-scale autonomous systems under explicitly adversarial conditions, including probabilistic destruction of agents during the simulation. Large-scale autonomous systems often include an adversarial component, where different agents or groups of agents explicitly compete with one another. An important component of these sys… ▽ More

    Submitted 4 August, 2021; originally announced August 2021.

  43. arXiv:2107.12584  [pdf, ps, other

    cond-mat.soft physics.flu-dyn

    Darcy-Reynolds forces during intrusion into granular-fluid beds

    Authors: Joshua Strader, Neil Causley, Joshua A. Dijksman, Abram H. Clark

    Abstract: We experimentally study intrusion into fluid-saturated granular beds by a free-falling sphere, varying particle size and fluid viscosity. We test our results against Darcy-Reynolds theory, where the deceleration of the sphere is controlled by Reynolds dilatancy and the Darcy flow resistance. We find the observed intruder dynamics are consistent with Darcy-Reynolds theory for varied particle size.… ▽ More

    Submitted 27 July, 2021; originally announced July 2021.

  44. arXiv:2107.11166  [pdf, other

    physics.chem-ph

    Theoretical rovibronic spectroscopy of the calcium monohydroxide radical (CaOH)

    Authors: Alec Owens, Victoria H. J. Clark, Alexander Mitrushchenkov, Sergei N. Yurchenko, Jonathan Tennyson

    Abstract: The rovibronic (rotation-vibration-electronic) spectrum of the calcium monohydroxide radical (CaOH) is of interest to studies of exoplanet atmospheres and ultracold molecules. Here, we theoretically investigate the $\tilde{A}\,^2Π$--$\tilde{X}\,^2Σ^+$ band system of CaOH using high-level \textit{ab initio} theory and variational nuclear motion calculations. New potential energy surfaces (PESs) are… ▽ More

    Submitted 23 July, 2021; originally announced July 2021.

    Journal ref: J. Chem. Phys. 154, 234302 (2021)

  45. arXiv:2107.08826  [pdf, other

    cond-mat.mtrl-sci astro-ph.EP physics.chem-ph

    The vibrational properties of benzene on an ordered water ice surface

    Authors: Victoria H. J. Clark, David M. Benoit

    Abstract: We present a hybrid CCSD(T)+PBE-D3 approach to calculating the vibrational signatures for gas phase benzene and benzene adsorbed on an ordered water-ice surface. We compare the results of our method against experimentally recorded spectra and calculations performed using PBE-D3-only approaches (harmonic and anharmonic). Calculations use a proton ordered XIh water-ice surface consisting of 288 wate… ▽ More

    Submitted 15 July, 2021; originally announced July 2021.

  46. arXiv:2104.14027  [pdf, other

    physics.chem-ph

    Modelling the non-local thermodynamic equilibrium spectra of silylene (SiH2)

    Authors: Victoria H. J. Clark, Sergei N. Yurchenko

    Abstract: This paper sets out a robust methodology for modelling spectra of polyatomic molecules produced in reactive or dissociative environments, with vibrational populations outside local thermal equilibrium (LTE). The methodology is based on accurate, extensive ro-vibrational line lists containing transitions with high vibrational excitations and relies on the detailed ro-vibrational assignments. The de… ▽ More

    Submitted 28 April, 2021; originally announced April 2021.

  47. CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation

    Authors: Jonathan H. Clark, Dan Garrette, Iulia Turc, John Wieting

    Abstract: Pipelined NLP systems have largely been superseded by end-to-end neural modeling, yet nearly all commonly-used models still require an explicit tokenization step. While recent tokenization approaches based on data-derived subword lexicons are less brittle than manually engineered tokenizers, these techniques are not equally suited to all languages, and the use of any fixed vocabulary may limit a m… ▽ More

    Submitted 18 May, 2022; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: TACL Final Version

    Journal ref: Transactions of the Association for Computational Linguistics (2022) 10: 73--91

  48. arXiv:2011.11824  [pdf, ps, other

    cond-mat.soft physics.flu-dyn

    Viscous-like forces control the impact response of shear-thickening dense suspensions

    Authors: Marc-Andre Brassard, Neil Causley, Nasser Krizou, Joshua A. Dijksman, Abram H. Clark

    Abstract: We experimentally and theoretically study impacts into dense cornstarch and water suspensions. We vary impact speed as well as intruder size, shape, and mass, and we characterize the resulting dynamics using high-speed video and an onboard accelerometer. We numerically solve previously proposed models, most notably the added-mass model as well as a class of {viscous-like} models. In the {viscous-l… ▽ More

    Submitted 27 July, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

    Journal ref: Journal of Fluid Mechanics, 923, A38 (2021)

  49. arXiv:2011.04264  [pdf, other

    cs.CL cs.CV

    CapWAP: Captioning with a Purpose

    Authors: Adam Fisch, Kenton Lee, Ming-Wei Chang, Jonathan H. Clark, Regina Barzilay

    Abstract: The traditional image captioning task uses generic reference captions to provide textual information about images. Different user populations, however, will care about different visual aspects of images. In this paper, we propose a new task, Captioning with a Purpose (CapWAP). Our goal is to develop systems that can be tailored to be useful for the information needs of an intended population, rath… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: EMNLP 2020

  50. arXiv:2010.12707  [pdf, other

    cs.CL

    Learning to Recognize Dialect Features

    Authors: Dorottya Demszky, Devyani Sharma, Jonathan H. Clark, Vinodkumar Prabhakaran, Jacob Eisenstein

    Abstract: Building NLP systems that serve everyone requires accounting for dialect differences. But dialects are not monolithic entities: rather, distinctions between and within dialects are captured by the presence, absence, and frequency of dozens of dialect features in speech and text, such as the deletion of the copula in "He {} running". In this paper, we introduce the task of dialect feature detection… ▽ More

    Submitted 6 May, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: NAACL camera-ready