Skip to main content

Showing 1–11 of 11 results for author: Allen, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.10725  [pdf, other

    cs.CL cs.IR

    INDUS: Effective and Efficient Language Models for Scientific Applications

    Authors: Bishwaranjan Bhattacharjee, Aashka Trivedi, Masayasu Muraoka, Muthukumaran Ramasubramanian, Takuma Udagawa, Iksha Gurung, Rong Zhang, Bharath Dandala, Rahul Ramachandran, Manil Maskey, Kaylin Bugbee, Mike Little, Elizabeth Fancher, Lauren Sanders, Sylvain Costes, Sergi Blanco-Cuaresma, Kelly Lockhart, Thomas Allen, Felix Grezes, Megan Ansdell, Alberto Accomazzi, Yousef El-Kurdi, Davis Wertheimer, Birgit Pfitzmann, Cesar Berrospi Ramis , et al. (9 additional authors not shown)

    Abstract: Large language models (LLMs) trained on general domain corpora showed remarkable results on natural language processing (NLP) tasks. However, previous research demonstrated LLMs trained using domain-focused corpora perform better on specialized tasks. Inspired by this pivotal insight, we developed INDUS, a comprehensive suite of LLMs tailored for the Earth science, biology, physics, heliophysics,… ▽ More

    Submitted 20 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

  2. arXiv:2312.14211  [pdf, ps, other

    cs.CL astro-ph.IM cs.AI

    Experimenting with Large Language Models and vector embeddings in NASA SciX

    Authors: Sergi Blanco-Cuaresma, Ioana Ciucă, Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Kelly E. Lockhart, Felix Grezes, Thomas Allen, Golnaz Shapurian, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Daniel Chivvis, Fernanda de Macedo Alves, Jean-Claude Paquin, Jennifer Bartlett, Mugdha Polimera, Stephanie Jarmak

    Abstract: Open-source Large Language Models enable projects such as NASA SciX (i.e., NASA ADS) to think out of the box and try alternative approaches for information retrieval and data augmentation, while respecting data copyright and users' privacy. However, when large language models are directly prompted with questions without any context, they are prone to hallucination. At NASA SciX we have developed a… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: To appear in the proceedings of the 33th annual international Astronomical Data Analysis Software & Systems (ADASS XXXIII)

  3. arXiv:2312.07743  [pdf, other

    cs.LG cs.CL cs.DC

    FULL-W2V: Fully Exploiting Data Reuse for W2V on GPU-Accelerated Systems

    Authors: Thomas Randall, Tyler Allen, Rong Ge

    Abstract: Word2Vec remains one of the highly-impactful innovations in the field of Natural Language Processing (NLP) that represents latent grammatical and syntactical information in human text with dense vectors in a low dimension. Word2Vec has high computational cost due to the algorithm's inherent sequentiality, intensive memory accesses, and the large vocabularies it represents. While prior studies have… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 12 pages, 7 figures, 7 tables, the definitive version of this work is published in the Proceedings of the ACM International Conference on Supercomputing 2021, available at https://doi.org/10.1145/3447818.3460373

    ACM Class: I.2.7; D.1.3; G.4

    Journal ref: Proceedings of the ACM International Conference on Supercomputing (2021) 455-466

  4. arXiv:2312.01180  [pdf, other

    cs.CY

    A Comparative Analysis of Text-to-Image Generative AI Models in Scientific Contexts: A Case Study on Nuclear Power

    Authors: Veda Joynt, Jacob Cooper, Naman Bhargava, Katie Vu, O Hwang Kwon, Todd R. Allen, Aditi Verma, Majdi I. Radaideh

    Abstract: In this work, we propose and assess the potential of generative artificial intelligence (AI) to generate public engagement around potential clean energy sources. Such an application could increase energy literacy -- an awareness of low-carbon energy sources among the public therefore leading to increased participation in decision-making about the future of energy systems. We explore the use of gen… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: 26 pages, 11 figures, 9 tables, submitted to review

  5. arXiv:2302.03976  [pdf, other

    cs.CR cs.NI cs.OS

    Parma: Confidential Containers via Attested Execution Policies

    Authors: Matthew A. Johnson, Stavros Volos, Ken Gordon, Sean T. Allen, Christoph M. Wintersteiger, Sylvan Clebsch, John Starks, Manuel Costa

    Abstract: Container-based technologies empower cloud tenants to develop highly portable software and deploy services in the cloud at a rapid pace. Cloud privacy, meanwhile, is important as a large number of container deployments operate on privacy-sensitive data, but challenging due to the increasing frequency and sophistication of attacks. State-of-the-art confidential container-based designs leverage proc… ▽ More

    Submitted 7 March, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: 12 pages, 6 figures, 2 tables

  6. arXiv:2212.00744  [pdf, ps, other

    cs.CL astro-ph.IM

    Improving astroBERT using Semantic Textual Similarity

    Authors: Felix Grezes, Thomas Allen, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Pavlos Protopapas

    Abstract: The NASA Astrophysics Data System (ADS) is an essential tool for researchers that allows them to explore the astronomy and astrophysics scientific literature, but it has yet to exploit recent advances in natural language processing. At ADASS 2021, we introduced astroBERT, a machine learning language model tailored to the text used in astronomy papers in ADS. In this work we: - announce the first… ▽ More

    Submitted 29 November, 2022; originally announced December 2022.

  7. arXiv:2106.02118  [pdf

    eess.IV cs.CV cs.LG

    A Prospective Observational Study to Investigate Performance of a Chest X-ray Artificial Intelligence Diagnostic Support Tool Across 12 U.S. Hospitals

    Authors: Ju Sun, Le Peng, Taihui Li, Dyah Adila, Zach Zaiman, Genevieve B. Melton, Nicholas Ingraham, Eric Murray, Daniel Boley, Sean Switzer, John L. Burns, Kun Huang, Tadashi Allen, Scott D. Steenburg, Judy Wawira Gichoya, Erich Kummerfeld, Christopher Tignanelli

    Abstract: Importance: An artificial intelligence (AI)-based model to predict COVID-19 likelihood from chest x-ray (CXR) findings can serve as an important adjunct to accelerate immediate clinical decision making and improve clinical decision making. Despite significant efforts, many limitations and biases exist in previously developed AI diagnostic models for COVID-19. Utilizing a large set of local and int… ▽ More

    Submitted 6 June, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: Check out the medRxiv version at https://doi.org/10.1101/2021.06.04.21258316 for updates

  8. arXiv:2105.11538  [pdf

    cs.SI physics.soc-ph

    The power of reciprocal knowledge sharing relationships for startup success

    Authors: T. J. Allen, P. Gloor, A. Fronzetti Colladon, S. L. Woerner, O. Raz

    Abstract: Purpose: The purpose of this paper is to examine the innovative capabilities of biotech start-ups in relation to geographic proximity and knowledge sharing interaction in the R&D network of a major high-tech cluster. Design-methodology-approach: This study compares longitudinal informal communication networks of researchers at biotech start-ups with company patent applications in subsequent year… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

    ACM Class: J.4

    Journal ref: Journal of Small Business and Enterprise Development 23(3), 636-651 (2016)

  9. arXiv:2009.12849  [pdf, other

    cs.SE

    A highly scalable Met Office NERC Cloud model

    Authors: Nick Brown, Michèle Weiland, Adrian Hill, Ben Shipway, Chris Maynard, Thomas Allen, Mike Rezny

    Abstract: Large Eddy Simulation is a critical modelling tool for scientists investigating atmospheric flows, turbulence and cloud microphysics. Within the UK, the principal LES model used by the atmospheric research community is the Met Office Large Eddy Model (LEM). The LEM was originally developed in the late 1980s using computational techniques and assumptions of the time, which means that the it does no… ▽ More

    Submitted 27 September, 2020; originally announced September 2020.

  10. arXiv:1301.2313  [pdf

    cs.AI

    Bayesian Error-Bars for Belief Net Inference

    Authors: Tim Van Allen, Russell Greiner, Peter Hooper

    Abstract: A Bayesian Belief Network (BN) is a model of a joint distribution over a setof n variables, with a DAG structure to represent the immediate dependenciesbetween the variables, and a set of parameters (aka CPTables) to represent thelocal conditional probabilities of a node, given each assignment to itsparents. In many situations, these parameters are themselves random variables - this may reflect t… ▽ More

    Submitted 10 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI2001)

    Report number: UAI-P-2001-PG-522-529

  11. arXiv:1105.1383  [pdf, other

    cs.SD math.GN

    Topological Considerations for Tuning and Fingering Stringed Instruments

    Authors: Terry Allen, Camille Goudeseune

    Abstract: We present a formal language for assigning pitches to strings for fingered multi-string instruments, particularly the six-string guitar. Given the instrument's tuning (the strings' open pitches) and the compass of the fingers of the hand stop** the strings, the formalism yields a framework for simultaneously optimizing three things: the map** of pitches to strings, the choice of instrument tun… ▽ More

    Submitted 6 May, 2011; originally announced May 2011.

    Comments: 8 pages, 3 figures

    MSC Class: 14P10 ACM Class: F.4.0; H.5.5; G.2.3