Showing 1–50 of 73 results for author: Thompson, M

  1. arXiv:2312.14211  [pdf, ps, other]

    cs.CL astro-ph.IM cs.AI

    Experimenting with Large Language Models and vector embeddings in NASA SciX

    Authors: Sergi Blanco-Cuaresma, Ioana Ciucă, Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Kelly E. Lockhart, Felix Grezes, Thomas Allen, Golnaz Shapurian, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Daniel Chivvis, Fernanda de Macedo Alves, Jean-Claude Paquin, Jennifer Bartlett, Mugdha Polimera, Stephanie Jarmak

    Abstract: Open-source Large Language Models enable projects such as NASA SciX (i.e., NASA ADS) to think out of the box and try alternative approaches for information retrieval and data augmentation, while respecting data copyright and users' privacy. However, when large language models are directly prompted with questions without any context, they are prone to hallucination. At NASA SciX we have developed a…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: To appear in the proceedings of the 33rd annual international Astronomical Data Analysis Software & Systems (ADASS XXXIII)

  2. arXiv:2312.12399  [pdf]

    cs.HC

    Virtual Reality-Assisted Physiotherapy for Visuospatial Neglect Rehabilitation: A Proof-of-Concept Study

    Authors: Andrew Danso, Patti Nijhuis, Alessandro Ansani, Martin Hartmann, Gulnara Minkkinen, Geoff Luck, Joshua S. Bamford, Sarah Faber, Kat Agres, Solange Glasser, Teppo Särkämö, Rebekah Rousi, Marc R. Thompson

    Abstract: This study explores a VR-based intervention for Visuospatial neglect (VSN), a post-stroke condition. It aims to develop a VR task utilizing interactive visual-audio cues to improve sensory-motor training and assess its impact on VSN patients' engagement and performance. Collaboratively designed with physiotherapists, the VR task uses directional and auditory stimuli to alert and direct patients, t…

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 29 pages, 8 figures, 5 tables

  3. arXiv:2311.18803  [pdf, other]

    cs.CV cs.CL cs.LG

    BioCLIP: A Vision Foundation Model for the Tree of Life

    Authors: Samuel Stevens, Jiaman Wu, Matthew J Thompson, Elizabeth G Campolongo, Chan Hee Song, David Edward Carlyn, Li Dong, Wasila M Dahdul, Charles Stewart, Tanya Berger-Wolf, Wei-Lun Chao, Yu Su

    Abstract: Images of the natural world, collected by a variety of cameras, from drones to individual phones, are increasingly abundant sources of biological information. There is an explosion of computational methods and tools, particularly computer vision, for extracting biologically relevant information from images for science and conservation. Yet most of these are bespoke approaches designed for a specif…

    Submitted 14 May, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: CVPR 2024 (oral) camera-ready version; data released

  4. arXiv:2311.11046  [pdf]

    q-bio.QM cs.LG q-bio.NC

    DenseNet and Support Vector Machine classifications of major depressive disorder using vertex-wise cortical features

    Authors: Vladimir Belov, Tracy Erwin-Grabner, Ling-Li Zeng, Christopher R. K. Ching, Andre Aleman, Alyssa R. Amod, Zeynep Basgoze, Francesco Benedetti, Bianca Besteher, Katharina Brosch, Robin Bülow, Romain Colle, Colm G. Connolly, Emmanuelle Corruble, Baptiste Couvy-Duchesne, Kathryn Cullen, Udo Dannlowski, Christopher G. Davey, Annemiek Dols, Jan Ernsting, Jennifer W. Evans, Lukas Fisch, Paola Fuentes-Claramonte, Ali Saffet Gonul, Ian H. Gotlib, et al. (63 additional authors not shown)

    Abstract: Major depressive disorder (MDD) is a complex psychiatric disorder that affects the lives of hundreds of millions of individuals around the globe. Even today, researchers debate if morphological alterations in the brain are linked to MDD, likely due to the heterogeneity of this disorder. The application of deep learning tools to neuroimaging data, capable of capturing complex non-linear patterns, h…

    Submitted 18 November, 2023; originally announced November 2023.

  5. arXiv:2309.07352  [pdf]

    q-bio.GN cs.LG eess.IV q-bio.QM

    Tackling the dimensions in imaging genetics with CLUB-PLS

    Authors: Andre Altmann, Ana C Lawry Aguila, Neda Jahanshad, Paul M Thompson, Marco Lorenzi

    Abstract: A major challenge in imaging genetics and similar fields is to link high-dimensional data in one domain, e.g., genetic data, to high dimensional data in a second domain, e.g., brain imaging data. The standard approach in the area are mass univariate analyses across genetic factors and imaging phenotypes. That entails executing one genome-wide association study (GWAS) for each pre-defined imaging m…

    Submitted 19 September, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: 12 pages, 4 Figures, 2 Tables

  6. arXiv:2309.04651  [pdf]

    eess.IV cs.AI cs.CV

    Video and Synthetic MRI Pre-training of 3D Vision Architectures for Neuroimage Analysis

    Authors: Nikhil J. Dhinagar, Amit Singh, Saket Ozarkar, Ketaki Buwa, Sophia I. Thomopoulos, Conor Owens-Walton, Emily Laltoo, Yao-Liang Chen, Philip Cook, Corey McMillan, Chih-Chien Tsai, J-J Wang, Yih-Ru Wu, Paul M. Thompson

    Abstract: Transfer learning represents a recent paradigm shift in the way we build artificial intelligence (AI) systems. In contrast to training task-specific models, transfer learning involves pre-training deep learning models on a large corpus of data and minimally fine-tuning them for adaptation to specific tasks. Even so, for 3D medical imaging tasks, we do not know if it is best to pre-train models on…

    Submitted 8 September, 2023; originally announced September 2023.

  7. arXiv:2309.04607  [pdf]

    cs.CL cs.AI

    Linking Symptom Inventories using Semantic Textual Similarity

    Authors: Eamonn Kennedy, Shashank Vadlamani, Hannah M Lindsey, Kelly S Peterson, Kristen Dams OConnor, Kenton Murray, Ronak Agarwal, Houshang H Amiri, Raeda K Andersen, Talin Babikian, David A Baron, Erin D Bigler, Karen Caeyenberghs, Lisa Delano-Wood, Seth G Disner, Ekaterina Dobryakova, Blessen C Eapen, Rachel M Edelstein, Carrie Esopenko, Helen M Genova, Elbert Geuze, Naomi J Goodrich-Hunsaker, Jordan Grafman, Asta K Haberg, Cooper B Hodges, et al. (57 additional authors not shown)

    Abstract: An extensive library of symptom inventories has been developed over time to measure clinical symptoms, but this variety has led to several long standing issues. Most notably, results drawn from different settings and studies are not comparable, which limits reproducibility. Here, we present an artificial intelligence (AI) approach using semantic textual similarity (STS) to link symptoms and scores…

    Submitted 8 September, 2023; originally announced September 2023.

  8. arXiv:2308.07897  [pdf, other]

    cond-mat.mtrl-sci cs.AI

    Probabilistic Phase Labeling and Lattice Refinement for Autonomous Material Research

    Authors: Ming-Chiang Chang, Sebastian Ament, Maximilian Amsler, Duncan R. Sutherland, Lan Zhou, John M. Gregoire, Carla P. Gomes, R. Bruce van Dover, Michael O. Thompson

    Abstract: X-ray diffraction (XRD) is an essential technique to determine a material's crystal structure in high-throughput experimentation, and has recently been incorporated in artificially intelligent agents in autonomous scientific discovery processes. However, rapid, automated and reliable analysis method of XRD data matching the incoming data rate remains a major challenge. To address these issues, we…

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: 13 pages, 6 figures

  9. arXiv:2306.16364  [pdf, ps, other]

    cs.LO cs.DB cs.FL

    Generalized Core Spanner Inexpressibility via Ehrenfeucht-Fraïssé Games for FC

    Authors: Sam M. Thompson, Dominik D. Freydenberger

    Abstract: Despite considerable research on document spanners, little is known about the expressive power of generalized core spanners. In this paper, we use Ehrenfeucht-Fraïssé games to obtain general inexpressibility lemmas for the logic FC (a finite-model variant of the theory of concatenation). Applying these lemmas give inexpressibility results for FC that we lift to generalized core spanners. In partic…

    Submitted 28 June, 2023; originally announced June 2023.

  10. arXiv:2305.16222  [pdf, ps, other]

    eess.IV cs.CV cs.LG q-bio.NC

    Incomplete Multimodal Learning for Complex Brain Disorders Prediction

    Authors: Reza Shirkavand, Liang Zhan, Heng Huang, Li Shen, Paul M. Thompson

    Abstract: Recent advancements in the acquisition of various brain data sources have created new opportunities for integrating multimodal brain data to assist in early detection of complex brain disorders. However, current data integration approaches typically need a complete set of biomedical data modalities, which may not always be feasible, as some modalities are only available in large-scale research coh…

    Submitted 25 May, 2023; originally announced May 2023.

  11. arXiv:2304.00134  [pdf]

    physics.med-ph cs.AI

    A Surface-Based Federated Chow Test Model for Integrating APOE Status, Tau Deposition Measure, and Hippocampal Surface Morphometry

    Authors: Jianfeng Wu, Yi Su, Yanxi Chen, Wenhui Zhu, Eric M. Reiman, Richard J. Caselli, Kewei Chen, Paul M. Thompson, Junwen Wang, Yalin Wang

    Abstract: Background: Alzheimer's Disease (AD) is the most common type of age-related dementia, affecting 6.2 million people aged 65 or older according to CDC data. It is commonly agreed that discovering an effective AD diagnosis biomarker could have enormous public health benefits, potentially preventing or delaying up to 40% of dementia cases. Tau neurofibrillary tangles are the primary driver of downstre…

    Submitted 31 March, 2023; originally announced April 2023.

  12. arXiv:2303.11756  [pdf, other]

    cs.RO cs.LG

    Improving Deep Dynamics Models for Autonomous Vehicles with Multimodal Latent Mapping of Surfaces

    Authors: Johan Vertens, Nicolai Dorka, Tim Welschehold, Michael Thompson, Wolfram Burgard

    Abstract: The safe deployment of autonomous vehicles relies on their ability to effectively react to environmental changes. This can require maneuvering on varying surfaces which is still a difficult problem, especially for slippery terrains. To address this issue we propose a new approach that learns a surface-aware dynamics model by conditioning it on a latent variable vector storing surface information a…

    Submitted 21 March, 2023; originally announced March 2023.

  13. arXiv:2303.08774  [pdf, other]

    cs.CL cs.AI

    GPT-4 Technical Report

    Authors: OpenAI, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-Shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko, et al. (256 additional authors not shown)

    Abstract: We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo…

    Submitted 4 March, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 100 pages; updated authors list; fixed author names and added citation

  14. arXiv:2303.08224  [pdf]

    eess.IV cs.AI cs.CV cs.LG q-bio.QM

    Few-Shot Classification of Autism Spectrum Disorder using Site-Agnostic Meta-Learning and Brain MRI

    Authors: Nikhil J. Dhinagar, Vignesh Santhalingam, Katherine E. Lawrence, Emily Laltoo, Paul M. Thompson

    Abstract: For machine learning applications in medical imaging, the availability of training data is often limited, which hampers the design of radiological classifiers for subtle conditions such as autism spectrum disorder (ASD). Transfer learning is one method to counter this problem of low training data regimes. Here we explore the use of meta-learning for very low data regimes in the context of having p…

    Submitted 14 March, 2023; originally announced March 2023.

  15. arXiv:2303.08216  [pdf]

    eess.IV cs.AI cs.CV cs.LG q-bio.QM

    Efficiently Training Vision Transformers on Structural MRI Scans for Alzheimer's Disease Detection

    Authors: Nikhil J. Dhinagar, Sophia I. Thomopoulos, Emily Laltoo, Paul M. Thompson

    Abstract: Neuroimaging of large populations is valuable to identify factors that promote or resist brain disease, and to assist diagnosis, subtyping, and prognosis. Data-driven models such as convolutional neural networks (CNNs) have increasingly been applied to brain images to perform diagnostic and prognostic tasks by learning robust features. Vision transformers (ViT) - a new class of deep learning archi…

    Submitted 14 March, 2023; originally announced March 2023.

  16. arXiv:2303.01491  [pdf, other]

    eess.IV cs.LG q-bio.QM

    Transferring Models Trained on Natural Images to 3D MRI via Position Encoded Slice Models

    Authors: Umang Gupta, Tamoghna Chattopadhyay, Nikhil Dhinagar, Paul M. Thompson, Greg Ver Steeg, The Alzheimer's Disease Neuroimaging Initiative

    Abstract: Transfer learning has remarkably improved computer vision. These advances also promise improvements in neuroimaging, where training set sizes are often small. However, various difficulties arise in directly applying models pretrained on natural images to radiologic images, such as MRIs. In particular, a mismatch in the input space (2D images vs. 3D MRIs) restricts the direct transfer of models, of…

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: To appear at IEEE International Symposium on Biomedical Imaging 2023 (ISBI 2023). Code is available at https://github.com/umgupta/2d-slice-set-networks

  17. arXiv:2302.13631  [pdf]

    eess.IV cs.AI cs.CV cs.LG q-bio.QM

    Curriculum Based Multi-Task Learning for Parkinson's Disease Detection

    Authors: Nikhil J. Dhinagar, Conor Owens-Walton, Emily Laltoo, Christina P. Boyle, Yao-Liang Chen, Philip Cook, Corey McMillan, Chih-Chien Tsai, J-J Wang, Yih-Ru Wu, Ysbrand van der Werf, Paul M. Thompson

    Abstract: There is great interest in developing radiological classifiers for diagnosis, staging, and predictive modeling in progressive diseases such as Parkinson's disease (PD), a neurodegenerative disease that is difficult to detect in its early stages. Here we leverage severity-based meta-data on the stages of disease to define a curriculum for training a deep convolutional neural network (CNN). Typicall…

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted for publication at the 20th IEEE International Symposium on Biomedical Imaging, ISBI 2023

  18. arXiv:2212.00744  [pdf, ps, other]

    cs.CL astro-ph.IM

    Improving astroBERT using Semantic Textual Similarity

    Authors: Felix Grezes, Thomas Allen, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Pavlos Protopapas

    Abstract: The NASA Astrophysics Data System (ADS) is an essential tool for researchers that allows them to explore the astronomy and astrophysics scientific literature, but it has yet to exploit recent advances in natural language processing. At ADASS 2021, we introduced astroBERT, a machine learning language model tailored to the text used in astronomy papers in ADS. In this work we: - announce the first…

    Submitted 29 November, 2022; originally announced December 2022.

  19. arXiv:2211.05235  [pdf]

    physics.med-ph cs.LG

    Improved Prediction of Beta-Amyloid and Tau Burden Using Hippocampal Surface Multivariate Morphometry Statistics and Sparse Coding

    Authors: Jianfeng Wu, Yi Su, Wenhui Zhu, Negar Jalili Mallak, Natasha Lepore, Eric M. Reiman, Richard J. Caselli, Paul M. Thompson, Kewei Chen, Yalin Wang

    Abstract: Background: Beta-amyloid (A$β$) plaques and tau protein tangles in the brain are the defining 'A' and 'T' hallmarks of Alzheimer's disease (AD), and together with structural atrophy detectable on brain magnetic resonance imaging (MRI) scans as one of the neurodegenerative ('N') biomarkers comprise the ''ATN framework'' of AD. Current methods to detect A$β$/tau pathology include cerebrospinal fluid…

    Submitted 27 October, 2022; originally announced November 2022.

    Comments: 34 pages, 5 figures, 1 table, accepted by the Journal of Alzheimer's Disease

    MSC Class: 65U05

  20. arXiv:2208.01298  [pdf, ps, other]

    cs.LO cs.DB cs.FL

    Conjunctive Queries for Logic-Based Information Extraction

    Authors: Sam M. Thompson

    Abstract: This thesis offers two logic-based approaches to conjunctive queries in the context of information extraction. The first and main approach is the introduction of conjunctive query fragments of the logics FC and FC[REG], denoted as FC-CQ and FC[REG]-CQ respectively. FC is a first-order logic based on word equations, where the semantics are defined by limiting the universe to the factors of some fin…

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: Based on the author's PhD thesis and contains work from two conference publications (arXiv:2104.04758, arXiv:1909.10869) which are joint work with Dominik D. Freydenberger

  21. arXiv:2207.06228  [pdf]

    cs.LG

    Machine Learning Application in Health

    Authors: Ghadah Alshabana, Marjn Sadati, Thao Tran, Michael Thompson, Ashritha Chitimalla

    Abstract: Coronavirus can be transmitted through the air by close proximity to infected persons. Commercial aircraft are a likely way to both transmit the virus among passengers and move the virus between locations. The importance of learning about where and how coronavirus has entered the United States will help further our understanding of the disease. Air travelers can come from countries or areas with a…

    Submitted 9 June, 2022; originally announced July 2022.

  22. arXiv:2205.13326  [pdf, other]

    cs.CV cs.GR

    SHREC 2022: pothole and crack detection in the road pavement using images and RGB-D data

    Authors: Elia Moscoso Thompson, Andrea Ranieri, Silvia Biasotti, Miguel Chicchon, Ivan Sipiran, Minh-Khoi Pham, Thang-Long Nguyen-Ho, Hai-Dang Nguyen, Minh-Triet Tran

    Abstract: This paper describes the methods submitted for evaluation to the SHREC 2022 track on pothole and crack detection in the road pavement. A total of 7 different runs for the semantic segmentation of the road surface are compared, 6 from the participants plus a baseline method. All methods exploit Deep Learning techniques and their performance is tested using the same environment (i.e.: a single Jupyt…

    Submitted 12 July, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

  23. arXiv:2205.05249  [pdf, other]

    cs.LG cs.CR cs.CV cs.DC

    Secure & Private Federated Neuroimaging

    Authors: Dimitris Stripelis, Umang Gupta, Hamza Saleem, Nikhil Dhinagar, Tanmay Ghai, Rafael Chrysovalantis Anastasiou, Armaghan Asghar, Greg Ver Steeg, Srivatsan Ravi, Muhammad Naveed, Paul M. Thompson, Jose Luis Ambite

    Abstract: The amount of biomedical data continues to grow rapidly. However, collecting data from multiple sites for joint analysis remains challenging due to security, privacy, and regulatory concerns. To overcome this challenge, we use Federated Learning, which enables distributed training of neural network models over multiple data sources without sharing data. Each site trains the neural network over its…

    Submitted 28 August, 2023; v1 submitted 10 May, 2022; originally announced May 2022.

    Comments: 18 pages, 13 figures, 2 tables

    ACM Class: I.2; I.5.1; J.3

  24. arXiv:2202.00777  [pdf, ps, other]

    cs.HC astro-ph.IM

    Web accessibility trends and implementation in dynamic web applications

    Authors: Timothy W. Hostetler, Shinyi Chen, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Carolyn S. Grant, Edwin Henneken, Donna M. Thompson, Roman Chyla, Golnaz Shapurian, Matthew R. Templeton, Kelly E. Lockhart, Nemanja Martinovic, Stephen McDonald, Felix Grezes

    Abstract: The NASA Astrophysics Data System (ADS), a critical research service for the astrophysics community, strives to provide the most accessible and inclusive environment for the discovery and exploration of the astronomical literature. Part of this goal involves creating a digital platform that can accommodate everybody, including those with disabilities that would benefit from alternative ways to pre…

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: Submitted to ADASS XXXI (2021)

  25. arXiv:2201.10005  [pdf, other]

    cs.CL cs.LG

    Text and Code Embeddings by Contrastive Pre-Training

    Authors: Arvind Neelakantan, Tao Xu, Raul Puri, Alec Radford, Jesse Michael Han, Jerry Tworek, Qiming Yuan, Nikolas Tezak, Jong Wook Kim, Chris Hallacy, Johannes Heidecke, Pranav Shyam, Boris Power, Tyna Eloundou Nekoul, Girish Sastry, Gretchen Krueger, David Schnurr, Felipe Petroski Such, Kenny Hsu, Madeleine Thompson, Tabarak Khan, Toki Sherbakov, Joanne Jang, Peter Welinder, Lilian Weng

    Abstract: Text embeddings are useful features in many applications such as semantic search and computing text similarity. Previous work typically trains models customized for different use cases, varying in dataset choice, training objective and model architecture. In this work, we show that contrastive pre-training on unsupervised data at scale leads to high quality vector representations of text and code.…

    Submitted 24 January, 2022; originally announced January 2022.

  26. arXiv:2112.00590  [pdf, ps, other]

    cs.CL astro-ph.IM

    Building astroBERT, a language model for Astronomy & Astrophysics

    Authors: Felix Grezes, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Nemanja Martinovic, Shinyi Chen, Chris Tanner, Pavlos Protopapas

    Abstract: The existing search tools for exploring the NASA Astrophysics Data System (ADS) can be quite rich and empowering (e.g., similar and trending operators), but researchers are not yet allowed to fully leverage semantic search. For example, a query for "results from the Planck mission" should be able to distinguish between all the various meanings of Planck (person, mission, constant, institutions and…

    Submitted 1 December, 2021; originally announced December 2021.

  27. arXiv:2110.10709  [pdf]

    physics.med-ph cs.LG eess.IV

    Predicting Tau Accumulation in Cerebral Cortex with Multivariate MRI Morphometry Measurements, Sparse Coding, and Correntropy

    Authors: Jianfeng Wu, Wenhui Zhu, Yi Su, Jie Gui, Natasha Lepore, Eric M. Reiman, Richard J. Caselli, Paul M. Thompson, Kewei Chen, Yalin Wang

    Abstract: Biomarker-assisted diagnosis and intervention in Alzheimer's disease (AD) may be the key to prevention breakthroughs. One of the hallmarks of AD is the accumulation of tau plaques in the human brain. However, current methods to detect tau pathology are either invasive (lumbar puncture) or quite costly and not widely available (Tau PET). In our previous work, structural MRI-based hippocampal multiv…

    Submitted 20 October, 2021; originally announced October 2021.

    Comments: 10 pages, 5 figures, 17th International Symposium on Medical Information Processing and Analysis

  28. arXiv:2108.03437  [pdf, other]

    cs.CR cs.LG

    Secure Neuroimaging Analysis using Federated Learning with Homomorphic Encryption

    Authors: Dimitris Stripelis, Hamza Saleem, Tanmay Ghai, Nikhil Dhinagar, Umang Gupta, Chrysovalantis Anastasiou, Greg Ver Steeg, Srivatsan Ravi, Muhammad Naveed, Paul M. Thompson, Jose Luis Ambite

    Abstract: Federated learning (FL) enables distributed computation of machine learning models over various disparate, remote data sources, without requiring to transfer any individual data to a centralized location. This results in an improved generalizability of models and efficient scaling of computation as more sources and larger datasets are added to the federation. Nevertheless, recent membership attack…

    Submitted 9 November, 2021; v1 submitted 7 August, 2021; originally announced August 2021.

    Comments: 9 pages, 3 figures, 1 algorithm

  29. arXiv:2106.00570  [pdf, other]

    cs.CE

    Robust design optimisation of continuous flow polymerase chain reaction thermal flow systems

    Authors: Yongxing Wang, Hazim A. Hamad, Jochen Voss, Harvey M. Thompson

    Abstract: This paper presents an efficient methodology for the robust optimisation of Continuous Flow Polymerase Chain Reaction (CFPCR) devices. It enables the effects of uncertainties in device geometry, due to manufacturing tolerances, on the competing objectives of minimising the temperature deviations within the CFPCR thermal zones, together with minimising the pressure drop across the device, to be exp…

    Submitted 1 June, 2021; originally announced June 2021.

  30. arXiv:2105.02866  [pdf, other]

    q-bio.QM cs.CR cs.LG eess.IV

    Membership Inference Attacks on Deep Regression Models for Neuroimaging

    Authors: Umang Gupta, Dimitris Stripelis, Pradeep K. Lam, Paul M. Thompson, José Luis Ambite, Greg Ver Steeg

    Abstract: Ensuring the privacy of research participants is vital, even more so in healthcare environments. Deep learning approaches to neuroimaging require large datasets, and this often necessitates sharing data between multiple sites, which is antithetical to the privacy objectives. Federated learning is a commonly proposed solution to this problem. It circumvents the need for data sharing by sharing para…

    Submitted 3 June, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: To appear at Medical Imaging with Deep Learning 2021 (MIDL 2021)

  31. arXiv:2104.08522  [pdf]

    cs.CY

    Quantifying the Need for Attorney Pro Bono Services in Connection with the Social Determinants of Health

    Authors: Yi Mao, Stacey R. Beck, Benjamin Bartek, Beatriz Cabrera, Rachell Calhoun, David Coe, Jakob Cronberg, Suren Nalluri, Bradley Merrill Thompson

    Abstract: The paper estimates the need for additional attorney hours annually to address the legal needs of indigent clients throughout the United States in matters that comprise the so-called social determinants of health (SDoH). The result will inform stakeholders such as policy makers and private donors so they can allocate resources appropriately and design programs to close the so-called justice gap. A…

    Submitted 17 April, 2021; originally announced April 2021.

    Comments: 23 pages

  32. arXiv:2104.04758  [pdf, other]

    cs.DB cs.LO

    Splitting Spanner Atoms: A Tool for Acyclic Core Spanners

    Authors: Dominik D. Freydenberger, Sam M. Thompson

    Abstract: This paper investigates regex CQs with string equalities (SERCQs), a subclass of core spanners. As shown by Freydenberger, Kimelfeld, and Peterfreund (PODS 2018), these queries are intractable, even if restricted to acyclic queries. This previous result defines acyclicity by treating regex formulas as atoms. In contrast to this, we propose an alternative definition by converting SERCQs into FC-CQs…

    Submitted 19 January, 2022; v1 submitted 10 April, 2021; originally announced April 2021.

  33. arXiv:2104.00038  [pdf, other]

    cs.LG cs.HC

    Smartphone Camera Oximetry in an Induced Hypoxemia Study

    Authors: Jason S. Hoffman, Varun Viswanath, Xinyi Ding, Matthew J. Thompson, Eric C. Larson, Shwetak N. Patel, Edward Wang

    Abstract: Hypoxemia, a medical condition that occurs when the blood is not carrying enough oxygen to adequately supply the tissues, is a leading indicator for dangerous complications of respiratory diseases like asthma, COPD, and COVID-19. While purpose-built pulse oximeters can provide accurate blood-oxygen saturation (SpO$_2$) readings that allow for diagnosis of hypoxemia, enabling this capability in unm…

    Submitted 31 March, 2021; originally announced April 2021.

    Comments: 26 pages, 8 figures

  34. arXiv:2102.10503  [pdf, ps, other]

    eess.IV cs.CV

    Predicting Future Cognitive Decline with Hyperbolic Stochastic Coding

    Authors: J. Zhang, Q. Dong, J. Shi, Q. Li, C. M. Stonnington, B. A. Gutman, K. Chen, E. M. Reiman, R. J. Caselli, P. M. Thompson, J. Ye, Y. Wang

    Abstract: Hyperbolic geometry has been successfully applied in modeling brain cortical and subcortical surfaces with general topological structures. However such approaches, similar to other surface based brain morphology analysis methods, usually generate high dimensional features. It limits their statistical power in cognitive decline prediction research, especially in datasets with limited subject number…

    Submitted 20 February, 2021; originally announced February 2021.

  35. arXiv:2102.04438  [pdf, other]

    eess.IV cs.LG q-bio.QM

    Improved Brain Age Estimation with Slice-based Set Networks

    Authors: Umang Gupta, Pradeep K. Lam, Greg Ver Steeg, Paul M. Thompson

    Abstract: Deep Learning for neuroimaging data is a promising but challenging direction. The high dimensionality of 3D MRI scans makes this endeavor compute and data-intensive. Most conventional 3D neuroimaging methods use 3D-CNN-based architectures with a large number of parameters and require more time and data to train. Recently, 2D-slice-based models have received increasing attention as they have fewer…

    Submitted 9 February, 2021; v1 submitted 8 February, 2021; originally announced February 2021.

    Comments: To appear at IEEE International Symposium on Biomedical Imaging 2021 (ISBI 2021). Code is available at https://git.io/JtazG

  36. arXiv:2101.07385  [pdf, other]

    cond-mat.mtrl-sci cs.AI cs.LG cs.MA physics.comp-ph

    Autonomous synthesis of metastable materials

    Authors: Sebastian Ament, Maximilian Amsler, Duncan R. Sutherland, Ming-Chiang Chang, Dan Guevarra, Aine B. Connolly, John M. Gregoire, Michael O. Thompson, Carla P. Gomes, R. Bruce van Dover

    Abstract: Autonomous experimentation enabled by artificial intelligence (AI) offers a new paradigm for accelerating scientific discovery. Non-equilibrium materials synthesis is emblematic of complex, resource-intensive experimentation whose acceleration would be a watershed for materials discovery and development. The mapping of non-equilibrium synthesis phase diagrams has recently been accelerated via high…

    Submitted 19 December, 2021; v1 submitted 18 January, 2021; originally announced January 2021.

    Journal ref: Autonomous materials synthesis via hierarchical active learning of nonequilibrium phase diagrams, Science Advances, Vol 7, Issue 5, 2021

  37. arXiv:2012.00974  [pdf, other]

    cs.CL cs.LG

    Extracting COVID-19 Diagnoses and Symptoms From Clinical Text: A New Annotated Corpus and Neural Event Extraction Framework

    Authors: Kevin Lybarger, Mari Ostendorf, Matthew Thompson, Meliha Yetisgen

    Abstract: Coronavirus disease 2019 (COVID-19) is a global pandemic. Although much has been learned about the novel coronavirus since its emergence, there are many open questions related to tracking its spread, describing symptomology, predicting the severity of infection, and forecasting healthcare utilization. Free-text clinical notes contain critical information for resolving these questions. Data-driven,…

    Submitted 10 March, 2021; v1 submitted 2 December, 2020; originally announced December 2020.

  38. arXiv:2009.05048  [pdf, ps, other]

    cs.SE astro-ph.IM

    Agile methodologies in teams with highly creative and autonomous members

    Authors: Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Golnaz Shapurian, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Kris Bukovi

    Abstract: The Agile manifesto encourages us to value individuals and interactions over processes and tools, while Scrum, the most adopted Agile development methodology, is essentially based on roles, events, artifacts, and the rules that bind them together (i.e., processes). Moreover, it is generally proclaimed that whenever a Scrum project does not succeed, the reason is because Scrum was not implemented c…

    Submitted 10 September, 2020; originally announced September 2020.

    Comments: To appear in the proceedings of the 29th annual international Astronomical Data Analysis Software & Systems (ADASS XXIX)

  39. arXiv:2009.03143  [pdf]

    cs.HC

    Cyber-Human System for Remote Collaborators

    Authors: Srikanth Jonnada, Ram Dantu, Ishan Ranasinghe, Logan Widick, Mark Thompson, Janice A. Hauge

    Abstract: With the increasing ubiquity of technology in our daily lives, the complexity of our environment and the mechanisms required to function have also increased exponentially. Failure of any of the mechanical and digital devices that we rely on can be extremely disruptive. At times, the presence of an expert is needed to analyze, troubleshoot, and fix the problem. The increased demand and rapidly evol…

    Submitted 7 September, 2020; originally announced September 2020.

    Comments: 36 pages, 28 figures

  40. arXiv:2006.00115  [pdf, other]

    q-bio.QM cs.CV cs.LG eess.IV

    Overview of Scanner Invariant Representations

    Authors: Daniel Moyer, Greg Ver Steeg, Paul M. Thompson

    Abstract: Pooled imaging data from multiple sources is subject to bias from each source. Studies that do not correct for these scanner/site biases at best lose statistical power, and at worst leave spurious correlations in their data. Estimation of the bias effects is non-trivial due to the paucity of data with correspondence across sites, so called "traveling phantom" data, which is expensive to collect. N…

    Submitted 29 May, 2020; originally announced June 2020.

    Comments: Accepted as a short paper in MIDL 2020. In accordance with the MIDL 2020 Call for Papers, this short paper is an overview of an already published work arXiv:1904.05375, and was submitted to MIDL in order to allow presentation and discussion at the meeting

    Report number: MIDL/2020/ExtendedAbstract/yqm9RD_XHT

  41. arXiv:1909.10869  [pdf, other]

    cs.LO

    Dynamic Complexity of Document Spanners

    Authors: Dominik D. Freydenberger, Sam M. Thompson

    Abstract: The present paper investigates the dynamic complexity of document spanners, a formal framework for information extraction introduced by Fagin, Kimelfeld, Reiss, and Vansummeren (JACM 2015). We first look at the class of regular spanners and prove that any regular spanner can be maintained in the dynamic complexity class DynPROP. This result follows from work done previously on the dynamic complexi…

    Submitted 9 January, 2020; v1 submitted 24 September, 2019; originally announced September 2019.

  42. arXiv:1908.01901  [pdf, other]

    cs.LG eess.IV stat.ML

    Fully-automated patient-level malaria assessment on field-prepared thin blood film microscopy images, including Supplementary Information

    Authors: Charles B. Delahunt, Mayoore S. Jaiswal, Matthew P. Horning, Samantha Janko, Clay M. Thompson, Sourabh Kulhare, Liming Hu, Travis Ostbye, Grace Yun, Roman Gebrehiwot, Benjamin K. Wilson, Earl Long, Stephane Proux, Dionicia Gamboa, Peter Chiodini, Jane Carter, Mehul Dhorda, David Isaboke, Bernhards Ogutu, Wellington Oyibo, Elizabeth Villasis, Kyaw Myo Tun, Christine Bachman, David Bell, Courosh Mehanian

    Abstract: Malaria is a life-threatening disease affecting millions. Microscopy-based assessment of thin blood films is a standard method to (i) determine malaria species and (ii) quantitate high-parasitemia infections. Full automation of malaria microscopy by machine learning (ML) is a challenging task because field-prepared slides vary widely in quality and presentation, and artifacts often heavily outnumb…

    Submitted 11 September, 2022; v1 submitted 5 August, 2019; originally announced August 2019.

    Comments: 16 pages, 13 figures

    MSC Class: 68T10 ACM Class: I.5.0

  43. arXiv:1907.04223  [pdf, other]

    stat.ML cs.LG

    Characterizing Inter-Layer Functional Mappings of Deep Learning Models

    Authors: Donald Waagen, Katie Rainey, Jamie Gantert, David Gray, Megan King, M. Shane Thompson, Jonathan Barton, Will Waldron, Samantha Livingston, Don Hulsey

    Abstract: Deep learning architectures have demonstrated state-of-the-art performance for object classification and have become ubiquitous in commercial products. These methods are often applied without understanding (a) the difficulty of a classification task given the input data, and (b) how a specific deep learning architecture transforms that data. To answer (a) and (b), we illustrate the utility of a mu…

    Submitted 23 September, 2019; v1 submitted 9 July, 2019; originally announced July 2019.

  44. arXiv:1904.05375  [pdf, other]

    q-bio.QM cs.LG eess.IV stat.AP stat.ML

    Scanner Invariant Representations for Diffusion MRI Harmonization

    Authors: Daniel Moyer, Greg Ver Steeg, Chantal M. W. Tax, Paul M. Thompson

    Abstract: Purpose: In the present work we describe the correction of diffusion-weighted MRI for site and scanner biases using a novel method based on invariant representation. Theory and Methods: Pooled imaging data from multiple sources are subject to variation between the sources. Correcting for these biases has become very important as imaging studies increase in size and multi-site cases become more c…

    Submitted 31 January, 2020; v1 submitted 10 April, 2019; originally announced April 2019.

  45. arXiv:1901.05463  [pdf, ps, other]

    astro-ph.IM cs.DL

    Fundamentals of effective cloud management for the new NASA Astrophysics Data System

    Authors: Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Golnaz Shapurian, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Kris Bukovi, Nathan Rapport

    Abstract: The new NASA Astrophysics Data System (ADS) is designed with a service-oriented architecture (SOA) that consists of multiple customized Apache Solr search engine instances plus a collection of microservices, containerized using Docker, and deployed in Amazon Web Services (AWS). For complex systems, like the ADS, this loosely coupled architecture can lead to a more scalable, reliable and resilient s…

    Submitted 16 January, 2019; originally announced January 2019.

    Comments: To appear in the proceedings of the 28th annual international Astronomical Data Analysis Software & Systems (ADASS XXVIII)

  46. arXiv:1810.08553  [pdf, other]

    stat.ML cs.LG q-bio.NC q-bio.QM

    Federated Learning in Distributed Medical Databases: Meta-Analysis of Large-Scale Subcortical Brain Data

    Authors: Santiago Silva, Boris Gutman, Eduardo Romero, Paul M Thompson, Andre Altmann, Marco Lorenzi

    Abstract: At this moment, databanks worldwide contain brain images of previously unimaginable numbers. Combined with developments in data science, these massive data provide the potential to better understand the genetic underpinnings of brain diseases. However, different datasets, which are stored at different institutions, cannot always be shared directly due to privacy and legal concerns, thus limiting t…

    Submitted 14 March, 2019; v1 submitted 19 October, 2018; originally announced October 2018.

    Comments: Federated learning, distributed databases, PCA, SVD, meta-analysis, brain disease

  47. arXiv:1806.04634  [pdf, other]

    q-bio.QM cs.LG q-bio.TO stat.AP

    Measures of Tractography Convergence

    Authors: Daniel Moyer, Paul M. Thompson, Greg Ver Steeg

    Abstract: In the present work, we use information theory to understand the empirical convergence rate of tractography, a widely-used approach to reconstruct anatomical fiber pathways in the living brain. Based on diffusion MRI data, tractography is the starting point for many methods to study brain connectivity. Of the available methods to perform tractography, most reconstruct a finite set of streamlines,…

    Submitted 12 June, 2018; originally announced June 2018.

    Comments: 11 pages

  48. arXiv:1805.00719  [pdf, other]

    cs.CG

    Description and Retrieval of Geometric Patterns on Surface Meshes using an edge-based LBP approach

    Authors: Elia Moscoso Thompson, Silvia Biasotti

    Abstract: While texture analysis is largely addressed for images, the comparison of the geometric reliefs on surfaces embedded in the 3D space is still an open challenge. Starting from the Local Binary Pattern (LBP) description originally defined for images, we introduce the edge-Local Binary Pattern (edgeLBP) as a local description able to capture the evolution of repeated, geometric patterns on surface me…

    Submitted 2 May, 2018; originally announced May 2018.

  49. arXiv:1804.03979  [pdf, other]

    cs.GR cs.IR

    Experimental similarity assessment for a collection of fragmented artifacts

    Authors: Silvia Biasotti, Elia Moscoso Thompson, Michela Spagnuolo

    Abstract: In the Visual Heritage domain, search engines are expected to support archaeologists and curators to address cross-correlation and searching across multiple collections. Archaeological excavations return artifacts that often are damaged with parts that are fragmented in more pieces or totally missing. The notion of similarity among fragments cannot simply base on the geometric shape but style, mat…

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: Eurographics Workshop on 3D Object Retrieval 2018

    MSC Class: 68T45; 68P20; 68U05 ACM Class: I.3.6; H.3.3

  50. arXiv:1804.03977  [pdf, other]

    cs.GR cs.CV

    Edge-based LBP description of surfaces with colorimetric patterns

    Authors: Elia Moscoso Thompson, Silvia Biasotti

    Abstract: In this paper we target the problem of the retrieval of colour patterns over surfaces. We generalize to surface tessellations the well known Local Binary Pattern (LBP) descriptor for images. The key concept of the LBP is to code the variability of the colour values around each pixel. In the case of a surface tessellation we adopt rings around vertices that are obtained with a sphere-mesh intersect…

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: Eurographics Workshop on 3D Object Retrieval 2018

    MSC Class: 68T45; 68U05 ACM Class: I.3.6; H.3.3