Showing 1–39 of 39 results for author: White, D

Searching in archive cs.

  1. arXiv:2407.01603  [pdf, other]

    cs.LG cs.AI cs.CL physics.chem-ph

    A Review of Large Language Models and Autonomous Agents in Chemistry

    Authors: Mayk Caldas Ramos, Christopher J. Collison, Andrew D. White

    Abstract: Large language models (LLMs) are emerging as a powerful tool in chemistry across multiple domains. In chemistry, LLMs are able to accurately predict properties, design new molecules, optimize synthesis pathways, and accelerate drug and material discovery. A core emerging idea is combining LLMs with chemistry-specific tools like synthesis planners and databases, leading to so-called "agents." This…

    Submitted 26 June, 2024; originally announced July 2024.

  2. arXiv:2406.15509  [pdf, other]

    physics.comp-ph cs.LG physics.flu-dyn

    Machine Learning Visualization Tool for Exploring Parameterized Hydrodynamics

    Authors: C. F. Jekel, D. M. Sterbentz, T. M. Stitt, P. Mocz, R. N. Rieben, D. A. White, J. L. Belof

    Abstract: We are interested in the computational study of shock hydrodynamics, i.e. problems involving compressible solids, liquids, and gases that undergo large deformation. These problems are dynamic and nonlinear and can exhibit complex instabilities. Due to advances in high performance computing it is possible to parameterize a hydrodynamic problem and perform a computational study yielding…

    Submitted 19 June, 2024; originally announced June 2024.

    Report number: LLNL-JRNL-865692

  3. arXiv:2403.01626  [pdf, other]

    cs.CR

    Using LLMs for Tabletop Exercises within the Security Domain

    Authors: Sam Hays, Dr. Jules White

    Abstract: Tabletop exercises are a crucial component of many companies' strategies to test and evaluate their preparedness for security incidents in a realistic way. Traditionally led by external firms specializing in cybersecurity, these exercises can be costly, time-consuming, and may not always align precisely with the client's specific needs. Large Language Models (LLMs) like ChatGPT offer a compelling alter…

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 7 pages, 11 figures

  4. arXiv:2403.01271  [pdf, other]

    cs.CR

    Employing LLMs for Incident Response Planning and Review

    Authors: Sam Hays, Dr. Jules White

    Abstract: Incident Response Planning (IRP) is essential for effective cybersecurity management, requiring detailed documentation (or playbooks) to guide security personnel during incidents. Yet, creating comprehensive IRPs is often hindered by challenges such as complex systems, high turnover rates, and legacy technologies lacking documentation. This paper argues that, despite these obstacles, the developme…

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 10 pages, 11 figures

  5. arXiv:2401.11599  [pdf, ps, other]

    cs.CR

    Reducing Usefulness of Stolen Credentials in SSO Contexts

    Authors: Sam Hays, Michael Sandborn, Dr. Jules White

    Abstract: Approximately 61% of cyber attacks involve adversaries in possession of valid credentials. Attackers acquire credentials through various means, including phishing, dark web data drops, password reuse, etc. Multi-factor authentication (MFA) helps to thwart attacks that use valid credentials, but attackers still commonly breach systems by tricking users into accepting MFA step up requests through te…

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: 8 pages, 5 figures

  6. arXiv:2312.07559  [pdf, other]

    cs.CL cs.AI cs.LG

    PaperQA: Retrieval-Augmented Generative Agent for Scientific Research

    Authors: Jakub Lála, Odhran O'Donoghue, Aleksandar Shtedritski, Sam Cox, Samuel G. Rodriques, Andrew D. White

    Abstract: Large Language Models (LLMs) generalize well across language tasks, but suffer from hallucinations and uninterpretability, making it difficult to assess their accuracy without ground-truth. Retrieval-Augmented Generation (RAG) models have been proposed to reduce hallucinations and provide provenance for how an answer was generated. Applying such models to the scientific literature may enable large…

    Submitted 14 December, 2023; v1 submitted 8 December, 2023; originally announced December 2023.

  7. arXiv:2311.10840  [pdf]

    cs.AI

    Integration and Implementation Strategies for AI Algorithm Deployment with Smart Routing Rules and Workflow Management

    Authors: Barbaros Selnur Erdal, Vikash Gupta, Mutlu Demirer, Kim H. Fair, Richard D. White, Jeff Blair, Barbara Deichert, Laurie Lafleur, Ming Melvin Qin, David Bericat, Brad Genereaux

    Abstract: This paper reviews the challenges hindering the widespread adoption of artificial intelligence (AI) solutions in the healthcare industry, focusing on computer vision applications for medical imaging, and how interoperability and enterprise-grade scalability can be used to address these challenges. The complex nature of healthcare workflows, intricacies in managing large and secure medical imaging…

    Submitted 21 November, 2023; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: 13 pages, 6 figures

    ACM Class: I.2.m

  8. arXiv:2307.16348  [pdf, other]

    cs.LG cs.AI cs.RO

    Rating-based Reinforcement Learning

    Authors: Devin White, Mingkang Wu, Ellen Novoseller, Vernon J. Lawhern, Nicholas Waytowich, Yongcan Cao

    Abstract: This paper develops a novel rating-based reinforcement learning approach that uses human ratings to obtain human guidance in reinforcement learning. Different from the existing preference-based and ranking-based reinforcement learning paradigms, based on human relative preferences over sample pairs, the proposed rating-based reinforcement learning approach is based on human evaluation of individua…

    Submitted 29 January, 2024; v1 submitted 30 July, 2023; originally announced July 2023.

    Comments: This is an extended version of the paper "Rating-based Reinforcement Learning" accepted to the 38th Annual AAAI Conference on Artificial Intelligence

  9. CliniDigest: A Case Study in Large Language Model Based Large-Scale Summarization of Clinical Trial Descriptions

    Authors: Renee D. White, Tristan Peng, Pann Sripitak, Alexander Rosenberg Johansen, Michael Snyder

    Abstract: A clinical trial is a study that evaluates new biomedical interventions. To design new trials, researchers draw inspiration from those current and completed. In 2022, there were on average more than 100 clinical trials submitted to ClinicalTrials.gov every day, with each trial having a mean of approximately 1500 words [1]. This makes it nearly impossible to keep up to date. To mitigate this issue,…

    Submitted 31 July, 2023; v1 submitted 26 July, 2023; originally announced July 2023.

    Comments: 7 pages, 3 figures, 3 tables, conference: ACM GoodIt 23'; Second co-author: Tristan Peng; Citation: White, Peng, et al

  10. arXiv:2307.05318  [pdf, other]

    physics.chem-ph cs.LG

    Predicting small molecules solubilities on endpoint devices using deep ensemble neural networks

    Authors: Mayk Caldas Ramos, Andrew D. White

    Abstract: Aqueous solubility is a valuable yet challenging property to predict. Computing solubility using first-principles methods requires accounting for the competing effects of entropy and enthalpy, resulting in long computations for relatively poor accuracy. Data-driven approaches, such as deep learning, offer improved accuracy and computational efficiency but typically lack uncertainty quantification.…

    Submitted 7 March, 2024; v1 submitted 11 July, 2023; originally announced July 2023.

  11. arXiv:2306.06283  [pdf, other]

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    14 Examples of How LLMs Can Transform Materials Science and Chemistry: A Reflection on a Large Language Model Hackathon

    Authors: Kevin Maik Jablonka, Qianxiang Ai, Alexander Al-Feghali, Shruti Badhwar, Joshua D. Bocarsly, Andres M Bran, Stefan Bringuier, L. Catherine Brinson, Kamal Choudhary, Defne Circi, Sam Cox, Wibe A. de Jong, Matthew L. Evans, Nicolas Gastellu, Jerome Genzling, María Victoria Gil, Ankur K. Gupta, Zhi Hong, Alishba Imran, Sabine Kruschwitz, Anne Labarre, Jakub Lála, Tao Liu, Steven Ma, Sauradeep Majumdar, et al. (28 additional authors not shown)

    Abstract: Large-language models (LLMs) such as GPT-4 caught the interest of many scientists. Recent studies suggested that these models could be useful in chemistry and materials science. To explore these possibilities, we organized a hackathon. This article chronicles the projects built as part of this hackathon. Participants employed LLMs for various applications, including predicting properties of mole…

    Submitted 14 July, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

  12. arXiv:2305.10379  [pdf, other]

    cs.LG cs.NE physics.chem-ph stat.ML

    Active Learning in Symbolic Regression with Physical Constraints

    Authors: Jorge Medina, Andrew D. White

    Abstract: Evolutionary symbolic regression (SR) fits a symbolic equation to data, which gives a concise interpretable model. We explore using SR as a method to propose which data to gather in an active learning setting with physical constraints. SR with active learning proposes which experiments to do next. Active learning is done with query by committee, where the Pareto frontier of equations is the commit…

    Submitted 18 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  13. arXiv:2304.10510  [pdf, other]

    cs.LG cs.CR cs.CY physics.chem-ph

    Censoring chemical data to mitigate dual use risk

    Authors: Quintina L. Campbell, Jonathan Herington, Andrew D. White

    Abstract: The dual use of machine learning applications, where models can be used for both beneficial and malicious purposes, presents a significant challenge. This has recently become a particular concern in chemistry, where chemical datasets containing sensitive labels (e.g. toxicological information) could be used to develop predictive models that identify novel toxins or chemical warfare agents. To miti…

    Submitted 20 April, 2023; originally announced April 2023.

  14. arXiv:2304.05341  [pdf, other]

    physics.chem-ph cs.LG

    Bayesian Optimization of Catalysts With In-context Learning

    Authors: Mayk Caldas Ramos, Shane S. Michtavy, Marc D. Porosoff, Andrew D. White

    Abstract: Large language models (LLMs) are able to do accurate classification with zero or only a few examples (in-context learning). We show a prompting system that enables regression with uncertainty for in-context learning with frozen LLM (GPT-3, GPT-3.5, and GPT-4) models, allowing predictions without features or architecture tuning. By incorporating uncertainty, our approach enables Bayesian optimizati…

    Submitted 11 April, 2023; originally announced April 2023.

  15. arXiv:2302.03620  [pdf, other]

    physics.chem-ph cs.LG

    Recent advances in the Self-Referencing Embedding Strings (SELFIES) library

    Authors: Alston Lo, Robert Pollice, AkshatKumar Nigam, Andrew D. White, Mario Krenn, Alán Aspuru-Guzik

    Abstract: String-based molecular representations play a crucial role in cheminformatics applications, and with the growing success of deep learning in chemistry, have been readily adopted into machine learning pipelines. However, traditional string-based representations such as SMILES are often prone to syntactic and semantic errors when produced by generative models. To address these problems, a novel repr…

    Submitted 7 February, 2023; originally announced February 2023.

    Comments: 11 pages, 2 figures

    Journal ref: Digital Discovery 2, 897 (2023)

  16. arXiv:2212.14177  [pdf, other]

    cs.AI cs.CY eess.IV

    Current State of Community-Driven Radiological AI Deployment in Medical Imaging

    Authors: Vikash Gupta, Barbaros Selnur Erdal, Carolina Ramirez, Ralf Floca, Laurence Jackson, Brad Genereaux, Sidney Bryson, Christopher P Bridge, Jens Kleesiek, Felix Nensa, Rickmer Braren, Khaled Younis, Tobias Penzkofer, Andreas Michael Bucher, Ming Melvin Qin, Gigon Bae, Hyeonhoon Lee, M. Jorge Cardoso, Sebastien Ourselin, Eric Kerfoot, Rahul Choudhury, Richard D. White, Tessa Cook, David Bericat, Matthew Lungren, et al. (2 additional authors not shown)

    Abstract: Artificial Intelligence (AI) has become commonplace to solve routine everyday tasks. Because of the exponential growth in medical imaging data volume and complexity, the workload on radiologists is steadily increasing. We project that the gap between the number of imaging exams and the number of expert radiologist readers required to cover this increase will continue to expand, consequently introd…

    Submitted 8 May, 2023; v1 submitted 29 December, 2022; originally announced December 2022.

    Comments: 21 pages; 5 figures

    MSC Class: eess.IV

  17. arXiv:2208.11477  [pdf, other]

    physics.flu-dyn cs.LG physics.comp-ph

    Using Conservation Laws to Infer Deep Learning Model Accuracy of Richtmyer-Meshkov Instabilities

    Authors: Charles F. Jekel, Dane M. Sterbentz, Sylvie Aubry, Youngsoo Choi, Daniel A. White, Jonathan L. Belof

    Abstract: Richtmyer-Meshkov Instability (RMI) is a complicated phenomenon that occurs when a shockwave passes through a perturbed interface. Over a thousand hydrodynamic simulations were performed to study the formation of RMI for a parameterized high velocity impact. Deep learning was used to learn the temporal mapping of initial geometric perturbations to the full-field hydrodynamic solutions of density a…

    Submitted 18 July, 2022; originally announced August 2022.

    Comments: Presented at ECCOMAS 2022

    Report number: LLNL-CONF-837041

  18. arXiv:2206.05625  [pdf, ps, other]

    cs.AI cs.CV cs.NE

    Exploring the Intersection between Neural Architecture Search and Continual Learning

    Authors: Mohamed Shahawy, Elhadj Benkhelifa, David White

    Abstract: Despite the significant advances achieved in Artificial Neural Networks (ANNs), their design process remains notoriously tedious, depending primarily on intuition, experience and trial-and-error. This human-dependent process is often time-consuming and prone to errors. Furthermore, the models are generally bound to their training contexts, with no considerations to their surrounding environments.…

    Submitted 15 June, 2023; v1 submitted 11 June, 2022; originally announced June 2022.

    MSC Class: 68T07

    ACM Class: I.2.2; D.1.2; I.2.6

  19. arXiv:2203.13938  [pdf, other]

    cs.LG

    Neural Network Layers for Prediction of Positive Definite Elastic Stiffness Tensors

    Authors: Charles F. Jekel, Kenneth E. Swartz, Daniel A. White, Daniel A. Tortorelli, Seth E. Watts

    Abstract: Machine learning models can be used to predict physical quantities like homogenized elasticity stiffness tensors, which must always be symmetric positive definite (SPD) based on conservation arguments. Two datasets of homogenized elasticity tensors of lattice materials are presented as examples, where it is desired to obtain models that map unit cell geometric and material parameters to their homo…

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: 17 pages, 1 figure, 11 tables, submitted to CMAME

    Report number: LLNL-JRNL-832991

  20. arXiv:2203.13718  [pdf, other]

    cs.CV cond-mat.mtrl-sci physics.comp-ph

    Digital Fingerprinting of Microstructures

    Authors: Michael D. White, Alexander Tarakanov, Christopher P. Race, Philip J. Withers, Kody J. H. Law

    Abstract: Finding efficient means of fingerprinting microstructural information is a critical step towards harnessing data-centric machine learning approaches. A statistical framework is systematically developed for compressed characterisation of a population of images, which includes some classical computer vision methods as special cases. The focus is on materials microstructure. The ultimate purpose is t…

    Submitted 22 January, 2024; v1 submitted 25 March, 2022; originally announced March 2022.

  21. arXiv:2202.08238  [pdf]

    eess.IV cs.CV cs.LG

    A multi-reconstruction study of breast density estimation using Deep Learning

    Authors: Vikash Gupta, Mutlu Demirer, Robert W. Maxwell, Richard D. White, Barbaros Selnur Erdal

    Abstract: Breast density estimation is one of the key tasks in recognizing individuals predisposed to breast cancer. It is often challenging because of low contrast and fluctuations in mammograms' fatty tissue background. Most of the time, the breast density is estimated manually where a radiologist assigns one of the four density categories decided by the Breast Imaging and Reporting Data Systems (BI-RADS)…

    Submitted 10 October, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: 4 pages

    ACM Class: I.2.1; J.3; I.4

  22. arXiv:2108.11954  [pdf]

    eess.IV cs.AI

    Cascading Neural Network Methodology for Artificial Intelligence-Assisted Radiographic Detection and Classification of Lead-Less Implanted Electronic Devices within the Chest

    Authors: Mutlu Demirer, Richard D. White, Vikash Gupta, Ronnie A. Sebro, Barbaros S. Erdal

    Abstract: Background & Purpose: Chest X-Ray (CXR) use in pre-MRI safety screening for Lead-Less Implanted Electronic Devices (LLIEDs), easily overlooked or misidentified on a frontal view (often only acquired), is common. Although most LLIED types are "MRI conditional": 1. Some are stringently conditional; 2. Different conditional types have specific patient- or device- management requirements; and 3. Parti…

    Submitted 26 April, 2022; v1 submitted 25 August, 2021; originally announced August 2021.

    Comments: 23 pages, 4 figures

  23. arXiv:2106.15878  [pdf]

    cs.SE cs.FL cs.PL eess.SY

    Towards establishing formal verification and inductive code synthesis in the PLC domain

    Authors: Matthias Weiß, Philipp Marks, Benjamin Maschler, Dustin White, Pascal Kesseli, Michael Weyrich

    Abstract: Nowadays, formal methods are used in various areas for the verification of programs or for code generation from models in order to increase the quality of software and to reduce costs. However, there are still fields in which formal methods have not been widely adopted, despite the large set of possible benefits offered. This is the case for the area of programmable logic controllers (PLC). This a…

    Submitted 30 June, 2021; originally announced June 2021.

    Comments: 8 pages, 6 figures, 1 table. Accepted for publication at IEEE INDIN 2021

  24. arXiv:2012.00391  [pdf]

    cs.NI

    IRIS: A Low Duty Cycle Cross-Layer Protocol for Long-Range Wireless Sensor Networks with Low Power Budget

    Authors: Yi Chu, Paul Mitchell, David Grace, Jonathan Roberts, Dominic White, Tautvydas Mickus

    Abstract: This paper presents a cross-layer protocol (IRIS) designed for long-range pipeline Wireless Sensor Networks with extremely low power budget, typically seen in a range of monitoring applications. IRIS uses ping packets initiated by a base station to travel through the multi-hop network and carry monitoring information. The protocol is able to operate with less than 1% duty cycle, thereby conforming…

    Submitted 1 December, 2020; originally announced December 2020.

  25. Metaheuristics "In the Large"

    Authors: Jerry Swan, Steven Adriaensen, Alexander E. I. Brownlee, Kevin Hammond, Colin G. Johnson, Ahmed Kheiri, Faustyna Krawiec, J. J. Merelo, Leandro L. Minku, Ender Özcan, Gisele L. Pappa, Pablo García-Sánchez, Kenneth Sörensen, Stefan Voß, Markus Wagner, David R. White

    Abstract: Following decades of sustained improvement, metaheuristics are one of the great success stories of optimization research. However, in order for research in metaheuristics to avoid fragmentation and a lack of reproducibility, there is a pressing need for stronger scientific and computational infrastructure to support the development, analysis and comparison of new approaches. We argue that, via pri…

    Submitted 3 June, 2021; v1 submitted 19 November, 2020; originally announced November 2020.

    MSC Class: 68W99

  26. arXiv:2009.13580  [pdf]

    eess.IV cs.AI cs.LG

    Deep Learning-Based Automatic Detection of Poorly Positioned Mammograms to Minimize Patient Return Visits for Repeat Imaging: A Real-World Application

    Authors: Vikash Gupta, Clayton Taylor, Sarah Bonnet, Luciano M. Prevedello, Jeffrey Hawley, Richard D White, Mona G Flores, Barbaros Selnur Erdal

    Abstract: Screening mammograms are a routine imaging exam performed to detect breast cancer in its early stages to reduce morbidity and mortality attributed to this disease. In order to maximize the efficacy of breast cancer screening programs, proper mammographic positioning is paramount. Proper positioning ensures adequate visualization of breast tissue and is necessary for effective breast cancer detecti…

    Submitted 28 September, 2020; originally announced September 2020.

    Comments: 12 pages, 13 figures, pre-print

    ACM Class: I.2.1; J.3; I.4

  27. arXiv:2009.12437  [pdf]

    eess.IV cs.CV

    Democratizing Artificial Intelligence in Healthcare: A Study of Model Development Across Two Institutions Incorporating Transfer Learning

    Authors: Vikash Gupta, Holger Roth, Varun Buch, Marcio A. B. C. Rockenbach, Richard D White, Dong Yang, Olga Laur, Brian Ghoshhajra, Ittai Dayan, Daguang Xu, Mona G. Flores, Barbaros Selnur Erdal

    Abstract: The training of deep learning models typically requires extensive data, which are not readily available as large well-curated medical-image datasets for development of artificial intelligence (AI) models applied in Radiology. Recognizing the potential for transfer learning (TL) to allow a fully trained model from one institution to be fine-tuned by another institution using a much smaller local data…

    Submitted 25 September, 2020; originally announced September 2020.

    Comments: 8 pages, 5 figures, pre-print

    ACM Class: I.2.10

  28. arXiv:2008.04802  [pdf]

    eess.IV cs.CV physics.med-ph

    Artificial Intelligence to Assist in Exclusion of Coronary Atherosclerosis during CCTA Evaluation of Chest-Pain in the Emergency Department: Preparing an Application for Real-World Use

    Authors: Richard D. White, Barbaros S. Erdal, Mutlu Demirer, Vikash Gupta, Matthew T. Bigelow, Engin Dikici, Sema Candemir, Mauricio S. Galizia, Jessica L. Carpenter, Thomas P. O'Donnell, Abdul H. Halabi, Luciano M. Prevedello

    Abstract: Coronary Computed Tomography Angiography (CCTA) evaluation of chest-pain patients in an Emergency Department (ED) is considered appropriate. While a negative CCTA interpretation supports direct patient discharge from an ED, labor-intensive analyses are required, with accuracy in jeopardy from distractions. We describe the development of an Artificial Intelligence (AI) algorithm and workflow for as…

    Submitted 10 August, 2020; originally announced August 2020.

    Comments: 13 pages, 9 figures

    ACM Class: I.5.4; I.5.2; I.2.10

  29. arXiv:2007.04921  [pdf, other]

    q-bio.QM cs.LG stat.ML

    Graph Neural Network Based Coarse-Grained Mapping Prediction

    Authors: Zhiheng Li, Geemi P. Wellawatte, Maghesree Chakraborty, Heta A. Gandhi, Chenliang Xu, Andrew D. White

    Abstract: The selection of coarse-grained (CG) mapping operators is a critical step for CG molecular dynamics (MD) simulation. It is still an open question about what is optimal for this choice and there is a need for theory. The current state-of-the-art method is mapping operators manually selected by experts. In this work, we demonstrate an automated approach by viewing this problem as supervised learning…

    Submitted 19 August, 2021; v1 submitted 24 June, 2020; originally announced July 2020.

  30. arXiv:2002.10034  [pdf, other]

    q-bio.QM cs.LG eess.IV q-bio.NC

    Predicting Rate of Cognitive Decline at Baseline Using a Deep Neural Network with Multidata Analysis

    Authors: Sema Candemir, Xuan V. Nguyen, Luciano M. Prevedello, Matthew T. Bigelow, Richard D. White, Barbaros S. Erdal

    Abstract: Purpose: This study investigates whether a machine-learning-based system can predict the rate of cognitive decline in mildly cognitively impaired patients by processing only the clinical and imaging data collected at the initial visit. Approach: We built a predictive model based on a supervised hybrid neural network utilizing a 3-Dimensional Convolutional Neural Network to perform volume analysi…

    Submitted 5 October, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

  31. Automated Coronary Artery Atherosclerosis Detection and Weakly Supervised Localization on Coronary CT Angiography with a Deep 3-Dimensional Convolutional Neural Network

    Authors: Sema Candemir, Richard D. White, Mutlu Demirer, Vikash Gupta, Matthew T. Bigelow, Luciano M. Prevedello, Barbaros S. Erdal

    Abstract: We propose a fully automated algorithm based on a deep learning framework enabling screening of a coronary computed tomography angiography (CCTA) examination for confident detection of the presence or absence of coronary artery atherosclerosis. The system starts with extracting the coronary arteries and their branches from CCTA datasets and representing them with multi-planar reformatted volumes;…

    Submitted 7 June, 2020; v1 submitted 26 November, 2019; originally announced November 2019.

  32. arXiv:1911.09103  [pdf, other]

    q-bio.BM cs.LG stat.ML

    Investigating Active Learning and Meta-Learning for Iterative Peptide Design

    Authors: Rainier Barrett, Andrew D. White

    Abstract: Often the development of novel functional peptides is not amenable to high throughput or purely computational screening methods. Peptides must be synthesized one at a time in a process that does not generate large amounts of data. One way this method can be improved is by ensuring that each experiment provides the best improvement in both peptide properties and predictive modeling accuracy. Here,…

    Submitted 10 December, 2020; v1 submitted 20 November, 2019; originally announced November 2019.

    Comments: 19 pages, 8 figures, 9 tables

  33. Are Quantitative Features of Lung Nodules Reproducible at Different CT Acquisition and Reconstruction Parameters?

    Authors: Barbaros S. Erdal, Mutlu Demirer, Chiemezie C. Amadi, Gehan F. M. Ibrahim, Thomas P. O'Donnell, Rainer Grimmer, Andreas Wimmer, Kevin J. Little, Vikash Gupta, Matthew T. Bigelow, Luciano M. Prevedello, Richard D. White

    Abstract: Consistency and duplicability in Computed Tomography (CT) output is essential to quantitative imaging for lung cancer detection and monitoring. This study of CT-detected lung nodules investigated the reproducibility of volume-, density-, and texture-based features (outcome variables) over routine ranges of radiation-dose, reconstruction kernel, and slice thickness. CT raw data of 23 nodules were r…

    Submitted 14 August, 2019; originally announced August 2019.

  34. arXiv:1904.01336  [pdf, other]

    cs.NE quant-ph

    Optimising Trotter-Suzuki Decompositions for Quantum Simulation Using Evolutionary Strategies

    Authors: Benjamin D. M. Jones, George O. O'Brien, David R. White, Earl T. Campbell, John A. Clark

    Abstract: One of the most promising applications of near-term quantum computing is the simulation of quantum systems, a classically intractable task. Quantum simulation requires computationally expensive matrix exponentiation; Trotter-Suzuki decomposition of this exponentiation enables efficient simulation to a desired accuracy on a quantum computer. We apply the Covariance Matrix Adaptation Evolutionary St…

    Submitted 23 April, 2019; v1 submitted 2 April, 2019; originally announced April 2019.

    Comments: A version of this paper is to appear in GECCO'19

  35. arXiv:1810.05726  [pdf, other]

    cs.CV cs.LG stat.ML

    DeepWeeds: A Multiclass Weed Species Image Dataset for Deep Learning

    Authors: Alex Olsen, Dmitry A. Konovalov, Bronson Philippa, Peter Ridd, Jake C. Wood, Jamie Johns, Wesley Banks, Benjamin Girgenti, Owen Kenny, James Whinney, Brendan Calvert, Mostafa Rahimi Azghadi, Ronald D. White

    Abstract: Robotic weed control has seen increased research of late with its potential for boosting productivity in agriculture. Majority of works focus on developing robotics for croplands, ignoring the weed management problems facing rangeland stock farmers. Perhaps the greatest obstacle to widespread uptake of robotic weed control is the robust classification of weed species in their natural environment.…

    Submitted 14 February, 2019; v1 submitted 9 October, 2018; originally announced October 2018.

    Comments: 14 pages, 8 figures, 4 tables

    Journal ref: Sci.Rep. 9, 2058 (2019)

  36. arXiv:1401.2651  [pdf, ps, other]

    cs.NE

    An Overview of Schema Theory

    Authors: David White

    Abstract: The purpose of this paper is to give an introduction to the field of Schema Theory written by a mathematician and for mathematicians. In particular, we endeavor to highlight areas of the field which might be of interest to a mathematician, to point out some related open problems, and to suggest some large-scale projects. Schema theory seeks to give a theoretical justification for the efficacy o…

    Submitted 12 January, 2014; originally announced January 2014.

    Comments: 27 pages. Originally written in 2009 and hosted on my website, I've decided to put it on the arXiv as a more permanent home. The paper is primarily expository, so I don't really know where to submit it, but perhaps one day I will find an appropriate journal

    Journal ref: Graduate Journal of Mathematics, Volume 3, Issue 2 (2018), 37-59

  37. arXiv:1310.3808  [pdf]

    cs.DL cs.IR

    Pennants for Descriptors

    Authors: Howard D. White, Philipp Mayr

    Abstract: We present a new technique (called pennants) for displaying the descriptors related to a descriptor across literatures, rather than in a thesaurus. It has definite implications for online searching and browsing. Pennants, named for the flag they resemble, are a form of algorithmic prediction. Their cognitive base is in relevance theory (RT) from linguistic pragmatics (Sperber & Wilson 1995).

    Submitted 14 October, 2013; originally announced October 2013.

    Comments: 3 pages, 1 figure, paper presented at the NKOS workshop at TPDL 2013

  38. arXiv:1308.4915  [pdf, other]

    math.OC cs.LG stat.ML

    Minimal Dirichlet energy partitions for graphs

    Authors: Braxton Osting, Chris D. White, Edouard Oudet

    Abstract: Motivated by a geometric problem, we introduce a new non-convex graph partitioning objective where the optimality criterion is given by the sum of the Dirichlet eigenvalues of the partition components. A relaxed formulation is identified and a novel rearrangement algorithm is proposed, which we show is strictly decreasing and converges in a finite number of iterations to a local minimum of the rel…

    Submitted 20 May, 2014; v1 submitted 22 August, 2013; originally announced August 2013.

    Comments: 17 pages, 6 figures

    Journal ref: SIAM Journal of Scientific Computing 36 (2014), no. 4, pp. A1635-A1651

  39. arXiv:1308.1041  [pdf, ps, other]

    cs.DM

    Traversals of Infinite Graphs with Random Local Orientations

    Authors: David White

    Abstract: We introduce the notion of a "random basic walk" on an infinite graph, give numerous examples, list potential applications, and provide detailed comparisons between the random basic walk and existing generalizations of simple random walks. We define analogues in the setting of random basic walks of the notions of recurrence and transience in the theory of simple random walks, and we study the ques…

    Submitted 5 August, 2013; originally announced August 2013.

    Comments: This is my masters thesis from Wesleyan University. Currently my advisor and I are selecting a journal where we will submit a shorter version. We plan to split this work into two papers: one for the case of infinite graphs and one for the finite case (which is not fully treated here)