Skip to main content

Showing 1–18 of 18 results for author: Thomson, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.19631  [pdf, other

    cs.AI

    Leveraging Open-Source Large Language Models for encoding Social Determinants of Health using an Intelligent Router

    Authors: Akul Goel, Surya Narayanan Hari, Belinda Waltman, Matt Thomson

    Abstract: Social Determinants of Health (SDOH) play a significant role in patient health outcomes. The Center of Disease Control (CDC) introduced a subset of ICD-10 codes called Z-codes in an attempt to officially recognize and measure SDOH in the health care system. However, these codes are rarely annotated in a patient's Electronic Health Record (EHR), and instead, in many cases, need to be inferred from… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  2. arXiv:2310.19902  [pdf, other

    cs.AI

    Herd: Using multiple, smaller LLMs to match the performances of proprietary, large LLMs via an intelligent composer

    Authors: Surya Narayanan Hari, Matt Thomson

    Abstract: Currently, over a thousand LLMs exist that are multi-purpose and are capable of performing real world tasks, including Q&A, text summarization, content generation, etc. However, accessibility, scale and reliability of free models prevents them from being widely deployed in everyday use cases. To address the first two issues of access and scale, organisations such as HuggingFace have created model… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  3. arXiv:2310.04444  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    What's the Magic Word? A Control Theory of LLM Prompting

    Authors: Aman Bhargava, Cameron Witkowski, Manav Shah, Matt Thomson

    Abstract: Prompt engineering is crucial for deploying LLMs but is poorly understood mathematically. We formalize LLM systems as a class of discrete stochastic dynamical systems to explore prompt engineering through the lens of control theory. We investigate the reachable set of output token sequences $R_y(\mathbf x_0)$ for which there exists a control input sequence $\mathbf u$ for each… ▽ More

    Submitted 3 January, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 23 pages, 8 figures. Under review for ICLR 2024

  4. arXiv:2308.11601  [pdf, other

    cs.LG cs.AI cs.CL cs.MA

    Tryage: Real-time, intelligent Routing of User Prompts to Large Language Models

    Authors: Surya Narayanan Hari, Matt Thomson

    Abstract: The introduction of the transformer architecture and the self-attention mechanism has led to an explosive production of language models trained on specific downstream tasks and data domains. With over 200, 000 models in the Hugging Face ecosystem, users grapple with selecting and optimizing models to suit multifaceted workflows and data domains while addressing computational, security, and recency… ▽ More

    Submitted 23 August, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

  5. arXiv:2308.10913  [pdf, other

    q-bio.NC cs.CV cs.LG eess.IV

    Automated map** of virtual environments with visual predictive coding

    Authors: James Gornet, Matthew Thomson

    Abstract: Humans construct internal cognitive maps of their environment directly from sensory inputs without access to a system of explicit coordinates or distance measurements. While machine learning algorithms like SLAM utilize specialized visual inference procedures to identify visual features and construct spatial maps from visual and odometry data, the general nature of cognitive maps in the brain sugg… ▽ More

    Submitted 17 April, 2024; v1 submitted 20 August, 2023; originally announced August 2023.

  6. arXiv:2211.04020  [pdf, other

    q-bio.QM cs.LG q-bio.GN q-bio.TO

    Generating counterfactual explanations of tumor spatial proteomes to discover effective strategies for enhancing immune infiltration

    Authors: Zitong Jerry Wang, Alexander M. Xu, Aman Bhargava, Matt W. Thomson

    Abstract: The tumor microenvironment (TME) significantly impacts cancer prognosis due to its immune composition. While therapies for altering the immune composition, including immunotherapies, have shown exciting results for treating hematological cancers, they are less effective for immunologically-cold, solid tumors. Spatial omics technologies capture the spatial organization of the TME with unprecedented… ▽ More

    Submitted 13 October, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

  7. arXiv:2205.00334  [pdf, other

    cs.LG cs.AI math.DG

    Engineering flexible machine learning systems by traversing functionally-invariant paths

    Authors: Guruprasad Raghavan, Bahey Tharwat, Surya Narayanan Hari, Dhruvil Satani, Matt Thomson

    Abstract: Transformers have emerged as the state of the art neural network architecture for natural language processing and computer vision. In the foundation model paradigm, large transformer models (BERT, GPT3/4, Bloom, ViT) are pre-trained on self-supervised tasks such as word or image masking, and then, adapted through fine-tuning for downstream user applications including instruction following and Ques… ▽ More

    Submitted 3 September, 2023; v1 submitted 30 April, 2022; originally announced May 2022.

    Comments: 22 pages

  8. arXiv:2110.07069  [pdf, other

    q-bio.QM cs.LG

    CloudPred: Predicting Patient Phenotypes From Single-cell RNA-seq

    Authors: Bryan He, Matthew Thomson, Meena Subramaniam, Richard Perez, Chun Jimmie Ye, James Zou

    Abstract: Single-cell RNA sequencing (scRNA-seq) has the potential to provide powerful, high-resolution signatures to inform disease prognosis and precision medicine. This paper takes an important first step towards this goal by develo** an interpretable machine learning algorithm, CloudPred, to predict individuals' disease phenotypes from their scRNA-seq data. Predicting phenotype from scRNA-seq is chall… ▽ More

    Submitted 13 October, 2021; originally announced October 2021.

    Comments: Preprint of an article published in Pacific Symposium on Biocomputing © 2021 World Scientific Publishing Co., Singapore, http://psb.stanford.edu/

  9. arXiv:2106.08317  [pdf, other

    q-bio.GN cs.LG

    Active feature selection discovers minimal gene sets for classifying cell types and disease states with single-cell mRNA-seq data

    Authors: Xiaoqiao Chen, Sisi Chen, Matt Thomson

    Abstract: Sequencing costs currently prohibit the application of single-cell mRNA-seq to many biological and clinical analyses. Targeted single-cell mRNA-sequencing reduces sequencing costs by profiling reduced gene sets that capture biological information with a minimal number of genes. Here, we introduce an active learning method (ActiveSVM) that identifies minimal but highly-informative gene sets that en… ▽ More

    Submitted 12 February, 2022; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: 37 pages, 7 figures

  10. arXiv:2106.02793  [pdf, other

    cs.LG cs.AI

    Solving hybrid machine learning tasks by traversing weight space geodesics

    Authors: Guruprasad Raghavan, Matt Thomson

    Abstract: Machine learning problems have an intrinsic geometric structure as central objects including a neural network's weight space and the loss function associated with a particular task can be viewed as encoding the intrinsic geometry of a given machine learning problem. Therefore, geometric concepts can be applied to analyze and understand theoretical properties of machine learning strategies as well… ▽ More

    Submitted 5 June, 2021; originally announced June 2021.

    Comments: 11 pages, 7 figures

  11. arXiv:2105.14105  [pdf, other

    cs.LG eess.SY nlin.AO

    Reinforcement Learning reveals fundamental limits on the mixing of active particles

    Authors: Dominik Schildknecht, Anastasia N. Popova, Jack Stellwagen, Matt Thomson

    Abstract: The control of far-from-equilibrium physical systems, including active materials, has emerged as an important area for the application of reinforcement learning (RL) strategies to derive control policies for physical systems. In active materials, non-linear dynamics and long-range interactions between particles prohibit closed-form descriptions of the system's dynamics and prevent explicit solutio… ▽ More

    Submitted 28 May, 2021; originally announced May 2021.

    Journal ref: Soft Matter, 2022

  12. arXiv:2012.09605  [pdf, other

    cs.LG cs.NE math.DG

    Sparsifying networks by traversing Geodesics

    Authors: Guruprasad Raghavan, Matt Thomson

    Abstract: The geometry of weight spaces and functional manifolds of neural networks play an important role towards 'understanding' the intricacies of ML. In this paper, we attempt to solve certain open questions in ML, by viewing them through the lens of geometry, ultimately relating it to the discovery of points or paths of equivalent function in these spaces. We propose a mathematical framework to evaluat… ▽ More

    Submitted 12 December, 2020; originally announced December 2020.

    Comments: 5 pages; Presented work at NeurIPS 2020 Workshop (DiffGeo4DL). arXiv admin note: text overlap with arXiv:2005.11603

  13. arXiv:2006.06902  [pdf, other

    cs.NE cs.LG q-bio.NC

    Self-organization of multi-layer spiking neural networks

    Authors: Guruprasad Raghavan, Cong Lin, Matt Thomson

    Abstract: Living neural networks in our brains autonomously self-organize into large, complex architectures during early development to result in an organized and functional organic computational device. A key mechanism that enables the formation of complex architecture in the develo** brain is the emergence of traveling spatio-temporal waves of neuronal activity across the growing brain. Inspired by this… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: 11 pages, 4 figures

  14. arXiv:2005.11603  [pdf, other

    cs.NE cs.LG

    Geometric algorithms for predicting resilience and recovering damage in neural networks

    Authors: Guruprasad Raghavan, Jiayi Li, Matt Thomson

    Abstract: Biological neural networks have evolved to maintain performance despite significant circuit damage. To survive damage, biological network architectures have both intrinsic resilience to component loss and also activate recovery programs that adjust network weights through plasticity to stabilize performance. Despite the importance of resilience in technology applications, the resilience of artific… ▽ More

    Submitted 2 June, 2020; v1 submitted 23 May, 2020; originally announced May 2020.

    Comments: 10 pages and 4 figures

  15. arXiv:1911.02549  [pdf, other

    cs.LG cs.PF stat.ML

    MLPerf Inference Benchmark

    Authors: Vijay Janapa Reddi, Christine Cheng, David Kanter, Peter Mattson, Guenther Schmuelling, Carole-Jean Wu, Brian Anderson, Maximilien Breughe, Mark Charlebois, William Chou, Ramesh Chukka, Cody Coleman, Sam Davis, Pan Deng, Greg Diamos, Jared Duke, Dave Fick, J. Scott Gardner, Itay Hubara, Sachin Idgunji, Thomas B. Jablin, Jeff Jiao, Tom St. John, Pankaj Kanwar, David Lee , et al. (22 additional authors not shown)

    Abstract: Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded. Over 100 organizations are building ML inference chips, and the systems that incorporate existing models span at least three orders of magnitude in power consumption and five orders of magnitude in performance; they range from embedded devic… ▽ More

    Submitted 9 May, 2020; v1 submitted 6 November, 2019; originally announced November 2019.

    Comments: ISCA 2020

  16. arXiv:1906.01039  [pdf, other

    cs.NE cs.AI nlin.AO q-bio.NC

    Neural networks grown and self-organized by noise

    Authors: Guruprasad Raghavan, Matt Thomson

    Abstract: Living neural networks emerge through a process of growth and self-organization that begins with a single cell and results in a brain, an organized and functional computational device. Artificial neural networks, however, rely on human-designed, hand-programmed architectures for their remarkable performance. Can we develop artificial computational devices that can grow and self-organize without hu… ▽ More

    Submitted 3 June, 2019; originally announced June 2019.

    Comments: 21 pages (including 11 pages of appendix)

  17. arXiv:1808.07269  [pdf, other

    hep-ex cs.CV physics.data-an physics.ins-det

    A Deep Neural Network for Pixel-Level Electromagnetic Particle Identification in the MicroBooNE Liquid Argon Time Projection Chamber

    Authors: MicroBooNE collaboration, C. Adams, M. Alrashed, R. An, J. Anthony, J. Asaadi, A. Ashkenazi, M. Auger, S. Balasubramanian, B. Baller, C. Barnes, G. Barr, M. Bass, F. Bay, A. Bhat, K. Bhattacharya, M. Bishai, A. Blake, T. Bolton, L. Camilleri, D. Caratelli, I. Caro Terrazas, R. Carr, R. Castillo Fernandez, F. Cavanna , et al. (148 additional authors not shown)

    Abstract: We have developed a convolutional neural network (CNN) that can make a pixel-level prediction of objects in image data recorded by a liquid argon time projection chamber (LArTPC) for the first time. We describe the network design, training techniques, and software tools developed to train this network. The goal of this work is to develop a complete deep neural network based data reconstruction cha… ▽ More

    Submitted 22 August, 2018; originally announced August 2018.

    Journal ref: Phys. Rev. D 99, 092001 (2019)

  18. arXiv:1506.05348  [pdf, other

    physics.data-an cs.DC hep-ex physics.ins-det

    The Pandora Software Development Kit for Pattern Recognition

    Authors: J. S. Marshall, M. A. Thomson

    Abstract: The development of automated solutions to pattern recognition problems is important in many areas of scientific research and human endeavour. This paper describes the implementation of the Pandora Software Development Kit, which aids the process of designing, implementing and running pattern recognition algorithms. The Pandora Application Programming Interfaces ensure simple specification of the b… ▽ More

    Submitted 18 September, 2015; v1 submitted 16 June, 2015; originally announced June 2015.

    Comments: Accepted by European Physical Journal C, 4 September 2015