Showing 1–50 of 68 results for author: Smith, G

  1. arXiv:2406.08818

    cs.CL cs.CY

    Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination

    Authors: Eve Fleisig, Genevieve Smith, Madeline Bossi, Ishita Rustagi, Xavier Yin, Dan Klein

    Abstract: We present a large-scale study of linguistic bias exhibited by ChatGPT covering ten dialects of English (Standard American English, Standard British English, and eight widely spoken non-"standard" varieties from around the world). We prompted GPT-3.5 Turbo and GPT-4 with text by native speakers of each variety and analyzed the responses via detailed linguistic feature annotation and native speaker…

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2406.08726

    cs.CL

    Standard Language Ideology in AI-Generated Language

    Authors: Genevieve Smith, Eve Fleisig, Madeline Bossi, Ishita Rustagi, Xavier Yin

    Abstract: In this position paper, we explore standard language ideology in language generated by large language models (LLMs). First, we outline how standard language ideology is reflected and reinforced in LLMs. We then present a taxonomy of open problems regarding standard language ideology in AI-generated language with implications for minoritized language communities. We introduce the concept of standar…

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2405.08597

    cs.LG

    Risks and Opportunities of Open-Source Generative AI

    Authors: Francisco Eiras, Aleksandar Petrov, Bertie Vidgen, Christian Schroeder, Fabio Pizzati, Katherine Elkins, Supratik Mukhopadhyay, Adel Bibi, Aaron Purewal, Csaba Botos, Fabro Steibel, Fazel Keshtkar, Fazl Barez, Genevieve Smith, Gianluca Guadagni, Jon Chun, Jordi Cabot, Joseph Imperial, Juan Arturo Nolazco, Lori Landay, Matthew Jackson, Phillip H. S. Torr, Trevor Darrell, Yong Lee, Jakob Foerster

    Abstract: Applications of Generative AI (Gen AI) are expected to revolutionize a number of different areas, ranging from science & medicine to education. The potential for these seismic changes has triggered a lively debate about the potential risks of the technology, and resulted in calls for tighter regulation, in particular from some of the major tech companies who are leading in AI development. This reg…

    Submitted 29 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: Extension of arXiv:2404.17047

  4. ContextQ: Generated Questions to Support Meaningful Parent-Child Dialogue While Co-Reading

    Authors: Griffin Dietz Smith, Siddhartha Prasad, Matt J. Davidson, Leah Findlater, R. Benjamin Shapiro

    Abstract: Much of early literacy education happens at home with caretakers reading books to young children. Prior research demonstrates how having dialogue with children during co-reading can develop critical reading readiness skills, but most adult readers are unsure if and how to lead effective conversations. We present ContextQ, a tablet-based reading application to unobtrusively present auto-generated d…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: ACM Interaction Design and Children (IDC) 2024

  5. arXiv:2404.17047

    cs.LG

    Near to Mid-term Risks and Opportunities of Open-Source Generative AI

    Authors: Francisco Eiras, Aleksandar Petrov, Bertie Vidgen, Christian Schroeder de Witt, Fabio Pizzati, Katherine Elkins, Supratik Mukhopadhyay, Adel Bibi, Botos Csaba, Fabro Steibel, Fazl Barez, Genevieve Smith, Gianluca Guadagni, Jon Chun, Jordi Cabot, Joseph Marvin Imperial, Juan A. Nolazco-Flores, Lori Landay, Matthew Jackson, Paul Röttger, Philip H. S. Torr, Trevor Darrell, Yong Suk Lee, Jakob Foerster

    Abstract: In the next few years, applications of Generative AI are expected to revolutionize a number of different areas, ranging from science & medicine to education. The potential for these seismic changes has triggered a lively debate about potential risks and resulted in calls for tighter regulation, in particular from some of the major tech companies who are leading in AI development. This regulation i…

    Submitted 24 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted to ICML'24 as a position paper

  6. arXiv:2404.07883

    cs.HC cs.AI

    Apprentice Tutor Builder: A Platform For Users to Create and Personalize Intelligent Tutors

    Authors: Glen Smith, Adit Gupta, Christopher MacLellan

    Abstract: Intelligent tutoring systems (ITS) are effective for improving students' learning outcomes. However, their development is often complex, time-consuming, and requires specialized programming and tutor design knowledge, thus hindering their widespread application and personalization. We present the Apprentice Tutor Builder (ATB), a platform that simplifies tutor creation and personalization. Instru…

    Submitted 11 April, 2024; originally announced April 2024.

  7. arXiv:2404.06784

    quant-ph cond-mat.mes-hall cs.AR eess.SY

    Statistical evaluation of 571 GaAs quantum point contact transistors showing the 0.7 anomaly in quantized conductance using millikelvin cryogenic on-chip multiplexing

    Authors: Pengcheng Ma, Kaveh Delfanazari, Reuben K. Puddy, Jiahui Li, Moda Cao, Teng Yi, Jonathan P. Griffiths, Harvey E. Beere, David A. Ritchie, Michael J. Kelly, Charles G. Smith

    Abstract: The mass production and the practical number of cryogenic quantum devices producible in a single chip are limited to the number of electrical contact pads and wiring of the cryostat or dilution refrigerator. It is, therefore, beneficial to contrast the measurements of hundreds of devices fabricated in a single chip in one cooldown process to promote the scalability, integrability, reliability, and…

    Submitted 10 April, 2024; originally announced April 2024.

  8. arXiv:2404.03678

    cs.LG q-bio.PE stat.AP stat.ML

    Machine learning augmented diagnostic testing to identify sources of variability in test performance

    Authors: Christopher J. Banks, Aeron Sanchez, Vicki Stewart, Kate Bowen, Graham Smith, Rowland R. Kao

    Abstract: Diagnostic tests which can detect pre-clinical or sub-clinical infection, are one of the most powerful tools in our armoury of weapons to control infectious diseases. Considerable effort has been therefore paid to improving diagnostic testing for human, plant and animal diseases, including strategies for targeting the use of diagnostic tests towards individuals who are more likely to be infected.…

    Submitted 28 March, 2024; originally announced April 2024.

  9. arXiv:2404.00786

    cs.AR cs.PL

    There and Back Again: A Netlist's Tale with Much Egraphin'

    Authors: Gus Henry Smith, Zachary D. Sisco, Thanawat Techaumnuaiwit, Jingtao Xia, Vishal Canumalla, Andrew Cheung, Zachary Tatlock, Chandrakana Nandi, Jonathan Balkind

    Abstract: EDA toolchains are notoriously unpredictable, incomplete, and error-prone; the generally-accepted remedy has been to re-imagine EDA tasks as compilation problems. However, any compiler framework we apply must be prepared to handle the wide range of EDA tasks, including not only compilation tasks like technology mapping and optimization (the "there" in our title), but also decompilation tasks like…

    Submitted 31 March, 2024; originally announced April 2024.

  10. arXiv:2403.02236

    eess.IV cs.CV

    Interpretable Models for Detecting and Monitoring Elevated Intracranial Pressure

    Authors: Darryl Hannan, Steven C. Nesbit, Ximing Wen, Glen Smith, Qiao Zhang, Alberto Goffi, Vincent Chan, Michael J. Morris, John C. Hunninghake, Nicholas E. Villalobos, Edward Kim, Rosina O. Weber, Christopher J. MacLellan

    Abstract: Detecting elevated intracranial pressure (ICP) is crucial in diagnosing and managing various neurological conditions. These fluctuations in pressure are transmitted to the optic nerve sheath (ONS), resulting in changes to its diameter, which can then be detected using ultrasound imaging devices. However, interpreting sonographic images of the ONS can be challenging. In this work, we propose two sy…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 5 pages, 2 figures, ISBI 2024

  11. FPGA Technology Mapping Using Sketch-Guided Program Synthesis

    Authors: Gus Henry Smith, Ben Kushigian, Vishal Canumalla, Andrew Cheung, Steven Lyubomirsky, Sorawee Porncharoenwase, René Just, Gilbert Louis Bernstein, Zachary Tatlock

    Abstract: FPGA technology mapping is the process of implementing a hardware design expressed in high-level HDL (hardware design language) code using the low-level, architecture-specific primitives of the target FPGA. As FPGAs become increasingly heterogeneous, achieving high performance requires hardware synthesis tools that better support mapping to complex, highly configurable primitives like digital sign…

    Submitted 29 January, 2024; originally announced January 2024.

  12. arXiv:2401.14236

    cs.CV

    Exploring the Unexplored: Understanding the Impact of Layer Adjustments on Image Classification

    Authors: Haixia Liu, Tim Brailsford, James Goulding, Gavin Smith, Larry Bull

    Abstract: This paper investigates how adjustments to deep learning architectures impact model performance in image classification. Small-scale experiments generate initial insights although the trends observed are not consistent with the entire dataset. Filtering operations in the image processing pipeline are crucial, with image filtering before pre-processing yielding better results. The choice and order…

    Submitted 25 January, 2024; originally announced January 2024.

  13. arXiv:2401.00972

    cs.LG cs.CY stat.AP

    Robust Meta-Model for Predicting the Need for Blood Transfusion in Non-traumatic ICU Patients

    Authors: Alireza Rafiei, Ronald Moore, Tilendra Choudhary, Curtis Marshall, Geoffrey Smith, John D. Roback, Ravi M. Patel, Cassandra D. Josephson, Rishikesan Kamaleswaran

    Abstract: Objective: Blood transfusions, crucial in managing anemia and coagulopathy in ICU settings, require accurate prediction for effective resource allocation and patient risk assessment. However, existing clinical decision support systems have primarily targeted a particular patient demographic with unique medical conditions and focused on a single type of blood transfusion. This study aims to develop…

    Submitted 1 January, 2024; originally announced January 2024.

  14. arXiv:2312.17450

    quant-ph cs.IT

    Information Fragility or Robustness Under Quantum Channels

    Authors: Nicholas Laracuente, Graeme Smith

    Abstract: Quantum states naturally decay under noise. Many earlier works have quantified and demonstrated lower bounds on the decay rate, showing exponential decay in a wide variety of contexts. Here we study the converse question: are there uniform upper bounds on the ratio of post-noise to initial information quantities when noise is sufficiently weak? In several scenarios, including classical, we find…

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: 19 pages, 1 figure, presented at Beyond IID 2023

  15. arXiv:2312.12442

    cs.CV cs.AI

    Hierarchical Classification System for Breast Cancer Specimen Report (HCSBC) -- an end-to-end model for characterizing severity and diagnosis

    Authors: Thiago Santos, Harish Kamath, Christopher R. McAdams, Mary S. Newell, Marina Mosunjac, Gabriela Oprea-Ilies, Geoffrey Smith, Constance Lehman, Judy Gichoya, Imon Banerjee, Hari Trivedi

    Abstract: Automated classification of cancer pathology reports can extract information from unstructured reports and categorize each report into structured diagnosis and severity categories. Thus, such system can reduce the burden for populating tumor registries, help registration for clinical trial as well as developing large dataset for deep learning model development using true pathologic ground truth. H…

    Submitted 2 November, 2023; originally announced December 2023.

  16. arXiv:2308.08396

    eess.IV cs.CV

    Prediction of post-radiotherapy recurrence volumes in head and neck squamous cell carcinoma using 3D U-Net segmentation

    Authors: Denis Kutnár, Ivan R Vogelius, Katrin Elisabet Håkansson, Jens Petersen, Jeppe Friborg, Lena Specht, Mogens Bernsdorf, Anita Gothelf, Claus Kristensen, Abraham George Smith

    Abstract: Locoregional recurrences (LRR) are still a frequent site of treatment failure for head and neck squamous cell carcinoma (HNSCC) patients. Identification of high risk subvolumes based on pretreatment imaging is key to biologically targeted radiation therapy. We investigated the extent to which a Convolutional neural network (CNN) is able to predict LRR volumes based on pre-treatment 18F-fluorodeo…

    Submitted 16 August, 2023; originally announced August 2023.

  17. arXiv:2307.10031

    cs.SE

    Start Your EM(otion En)gine: Towards Computational Models of Emotion for Improving the Believability of Video Game Non-Player Characters

    Authors: Geneva M. Smith

    Abstract: Believable Non-Player Characters (NPCs) help motivate player engagement with narrative-driven games. An important aspect of believable characters is their contextually-relevant reactions to changing situations, which emotion often drives in humans. Therefore, giving NPCs "emotion" should enhance their believability. For adoption in industry, it is important to create tool development processes to…

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: 358 pages, 36 figures; See record on McMaster's Institutional Repository at http://hdl.handle.net/11375/28699

    ACM Class: D.2.1; D.2.4; J.4; J.5

  18. arXiv:2305.09580

    cs.PL cs.AR

    Generate Compilers from Hardware Models!

    Authors: Gus Henry Smith, Ben Kushigian, Vishal Canumalla, Andrew Cheung, René Just, Zachary Tatlock

    Abstract: Compiler backends should be automatically generated from hardware design language (HDL) models of the hardware they target. Generating compiler components directly from HDL can provide stronger correctness guarantees, ease development effort, and encourage hardware exploration. Past work has already championed this idea; here we argue that advances in program synthesis make the approach more feasi…

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 3 pages, 2 figures, to be presented at the 2023 PLARCH Workshop at FCRC

  19. Procedural Content Generation via Knowledge Transformation (PCG-KT)

    Authors: Anurag Sarkar, Matthew Guzdial, Sam Snodgrass, Adam Summerville, Tiago Machado, Gillian Smith

    Abstract: We introduce the concept of Procedural Content Generation via Knowledge Transformation (PCG-KT), a new lens and framework for characterizing PCG methods and approaches in which content generation is enabled by the process of knowledge transformation -- transforming knowledge derived from one domain in order to apply it in another. Our work is motivated by a substantial number of recent PCG works t…

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: 15 pages, 14 figures

    Journal ref: Sarkar, Anurag, et al. "Procedural Content Generation via Knowledge Transformation (PCG-KT)." IEEE Transactions on Games (2023)

  20. arXiv:2304.04606

    eess.IV cs.CV

    Localise to segment: crop to improve organ at risk segmentation accuracy

    Authors: Abraham George Smith, Denis Kutnár, Ivan Richter Vogelius, Sune Darkner, Jens Petersen

    Abstract: Increased organ at risk segmentation accuracy is required to reduce cost and complications for patients receiving radiotherapy treatment. Some deep learning methods for the segmentation of organs at risk use a two stage process where a localisation network first crops an image to the relevant region and then a locally specialised network segments the cropped organ of interest. We investigate the a…

    Submitted 10 April, 2023; originally announced April 2023.

  21. arXiv:2302.07648

    q-bio.QM cs.LG

    Atrial Fibrillation Detection Using RR-Intervals for Application in Photoplethysmographs

    Authors: Georgia Smith, Yishi Wang

    Abstract: Atrial Fibrillation is a common form of irregular heart rhythm that can be very dangerous. Our primary goal is to analyze Atrial Fibrillation data within ECGs to develop a model based only on RR-Intervals, or the length between heart-beats, to create a real time classification model for Atrial Fibrillation to be implemented in common heart-rate monitors on the market today. Physionet's MIT-BIH Atr…

    Submitted 13 February, 2023; originally announced February 2023.

  22. arXiv:2212.03282

    cs.CV

    MobilePTX: Sparse Coding for Pneumothorax Detection Given Limited Training Examples

    Authors: Darryl Hannan, Steven C. Nesbit, Ximing Wen, Glen Smith, Qiao Zhang, Alberto Goffi, Vincent Chan, Michael J. Morris, John C. Hunninghake, Nicholas E. Villalobos, Edward Kim, Rosina O. Weber, Christopher J. MacLellan

    Abstract: Point-of-Care Ultrasound (POCUS) refers to clinician-performed and interpreted ultrasonography at the patient's bedside. Interpreting these images requires a high level of expertise, which may not be available during emergencies. In this paper, we support POCUS by developing classifiers that can aid medical professionals by diagnosing whether or not a patient has pneumothorax. We decomposed the ta…

    Submitted 7 December, 2022; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: IAAI 2023 (7 pages)

  23. On the separation of correlation-assisted sum capacities of multiple access channels

    Authors: Akshay Seshadri, Felix Leditzky, Vikesh Siddhu, Graeme Smith

    Abstract: The capacity of a channel characterizes the maximum rate at which information can be transmitted through the channel asymptotically faithfully. For a channel with multiple senders and a single receiver, computing its sum capacity is possible in theory, but challenging in practice because of the nonconvex optimization involved. To address this challenge, we investigate three topics in our study. In…

    Submitted 3 August, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: v3: 70 pages, 3 figures; to appear in IEEE Transactions on Information Theory

    Journal ref: IEEE Transactions on Information Theory, vol. 69, no. 9, pp. 5805-5844 (2023)

  24. arXiv:2205.12159

    cs.AI

    Do it Like the Doctor: How We Can Design a Model That Uses Domain Knowledge to Diagnose Pneumothorax

    Authors: Glen Smith, Qiao Zhang, Christopher MacLellan

    Abstract: Computer-aided diagnosis for medical imaging is a well-studied field that aims to provide real-time decision support systems for physicians. These systems attempt to detect and diagnose a plethora of medical conditions across a variety of image diagnostic technologies including ultrasound, x-ray, MRI, and CT. When designing AI models for these systems, we are often limited by little training data,…

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: 15 pages, Presented at AAAI Spring Symposium on Machine Learning and Knowledge Engineering 2022

  25. arXiv:2205.06885

    cs.CL

    PathologyBERT -- Pre-trained Vs. A New Transformer Language Model for Pathology Domain

    Authors: Thiago Santos, Amara Tariq, Susmita Das, Kavyasree Vayalpati, Geoffrey H. Smith, Hari Trivedi, Imon Banerjee

    Abstract: Pathology text mining is a challenging task given the reporting variability and constant new findings in cancer sub-type definitions. However, successful text mining of a large pathology database can play a critical role to advance 'big data' cancer research like similarity-based treatment selection, case identification, prognostication, surveillance, clinical trial screening, risk stratification,…

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: submitted to "American Medical Informatics Association (AMIA)" 2022 Annual Symposium

  26. arXiv:2203.00218

    cs.AR cs.PL

    Application-Level Validation of Accelerator Designs Using a Formal Software/Hardware Interface

    Authors: Bo-Yuan Huang, Steven Lyubomirsky, Yi Li, Mike He, Gus Henry Smith, Thierry Tambe, Akash Gaonkar, Vishal Canumalla, Andrew Cheung, Gu-Yeon Wei, Aarti Gupta, Zachary Tatlock, Sharad Malik

    Abstract: Ideally, accelerator development should be as easy as software development. Several recent design languages/tools are working toward this goal, but actually testing early designs on real applications end-to-end remains prohibitively difficult due to the costs of building specialized compiler and simulator support. We propose a new first-in-class, mostly automated methodology termed "3LA" to enable…

    Submitted 22 August, 2023; v1 submitted 28 February, 2022; originally announced March 2022.

  27. The platypus of the quantum channel zoo

    Authors: Felix Leditzky, Debbie Leung, Vikesh Siddhu, Graeme Smith, John A. Smolin

    Abstract: Understanding quantum channels and the strange behavior of their capacities is a key objective of quantum information theory. Here we study a remarkably simple, low-dimensional, single-parameter family of quantum channels with exotic quantum information-theoretic features. As the simplest example from this family, we focus on a qutrit-to-qutrit channel that is intuitively obtained by hybridizing t…

    Submitted 13 June, 2023; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: 42 pages, 4 figures. v2: matches published version. See also the companion paper arXiv:2202.08377

    Journal ref: IEEE Transactions on Information Theory 69(6), pp.3825-3849, 2023

  28. Generic nonadditivity of quantum capacity in simple channels

    Authors: Felix Leditzky, Debbie Leung, Vikesh Siddhu, Graeme Smith, John A. Smolin

    Abstract: Determining capacities of quantum channels is a fundamental question in quantum information theory. Despite having rigorous coding theorems quantifying the flow of information across quantum channels, their capacities are poorly understood due to super-additivity effects. Studying these phenomena is important for deepening our understanding of quantum information, yet simple and clean examples of…

    Submitted 13 June, 2023; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: 25 pages, 9 figures. v2: matches published version. See also the companion paper arXiv:2202.08380

    Journal ref: Physical Review Letters 130, 200801 (2023)

  29. arXiv:2202.04073

    eess.IV cs.CV cs.LG

    The EMory BrEast imaging Dataset (EMBED): A Racially Diverse, Granular Dataset of 3.5M Screening and Diagnostic Mammograms

    Authors: Jiwoong J. Jeong, Brianna L. Vey, Ananth Reddy, Thomas Kim, Thiago Santos, Ramon Correa, Raman Dutt, Marina Mosunjac, Gabriela Oprea-Ilies, Geoffrey Smith, Minjae Woo, Christopher R. McAdams, Mary S. Newell, Imon Banerjee, Judy Gichoya, Hari Trivedi

    Abstract: Developing and validating artificial intelligence models in medical imaging requires datasets that are large, granular, and diverse. To date, the majority of publicly available breast imaging datasets lack in one or more of these areas. Models trained on these data may therefore underperform on patient populations or pathologies that have not previously been encountered. The EMory BrEast imaging D…

    Submitted 8 February, 2022; originally announced February 2022.

  30. arXiv:2108.04367

    cs.HC

    An Autonomous Driving System - Dedicated Vehicle for People with ASD and their Caregivers

    Authors: Gandhimathi Padmanaban, Nathaniel Jachim, Hala Shandi, Lilit Avetisyan, Gar-Rett Smith, Howraa Hammoud, Feng Zhou

    Abstract: Automated driving system - dedicated vehicles (ADS-DVs), specially designed for people with various disabilities, can be beneficial to improve their mobility. However, research related to autonomous vehicles (AVs) for people with cognitive disabilities, especially Autism Spectrum Disorder (ASD) is limited. Thus, in this study, we focused on the challenge that we framed: "How might we design an ADS…

    Submitted 9 August, 2021; originally announced August 2021.

  31. arXiv:2106.13186

    cs.CY cs.AI

    CCC/Code 8.7: Applying AI in the Fight Against Modern Slavery

    Authors: Nadya Bliss, Mark Briers, Alice Eckstein, James Goulding, Daniel P. Lopresti, Anjali Mazumder, Gavin Smith

    Abstract: On any given day, tens of millions of people find themselves trapped in instances of modern slavery. The terms "human trafficking," "trafficking in persons," and "modern slavery" are sometimes used interchangeably to refer to both sex trafficking and forced labor. Human trafficking occurs when a trafficker compels someone to provide labor or services through the use of force, fraud, and/or coercio…

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: A Computing Community Consortium (CCC) workshop report, 24 pages

    Report number: ccc2021report_1

  32. arXiv:2106.11942

    cs.CV cs.HC cs.LG

    RootPainter3D: Interactive-machine-learning enables rapid and accurate contouring for radiotherapy

    Authors: Abraham George Smith, Jens Petersen, Cynthia Terrones-Campos, Anne Kiil Berthelsen, Nora Jarrett Forbes, Sune Darkner, Lena Specht, Ivan Richter Vogelius

    Abstract: Organ-at-risk contouring is still a bottleneck in radiotherapy, with many deep learning methods falling short of promised results when evaluated on clinical data. We investigate the accuracy and time-savings resulting from the use of an interactive-machine-learning method for an organ-at-risk contouring task. We compare the method to the Eclipse contouring software and find strong agreement with m…

    Submitted 22 June, 2021; originally announced June 2021.

  33. Pure Tensor Program Rewriting via Access Patterns (Representation Pearl)

    Authors: Gus Henry Smith, Andrew Liu, Steven Lyubomirsky, Scott Davidson, Joseph McMahan, Michael Taylor, Luis Ceze, Zachary Tatlock

    Abstract: Tensor kernels in machine learning (ML) often correspond to pure mathematical expressions, making term rewriting an attractive strategy for optimization and mapping to specialized hardware accelerators. However, existing ML intermediate representations (IRs) tend to either be pure but high-level, making low-level rewrites to hardware targets inexpressible, or low-level but impure…

    Submitted 19 May, 2021; originally announced May 2021.

    Comments: To be published at MAPS 2021

  34. arXiv:2011.02680

    physics.chem-ph cs.LG

    Multi-task learning for electronic structure to predict and explore molecular potential energy surfaces

    Authors: Zhuoran Qiao, Feizhi Ding, Matthew Welborn, Peter J. Bygrave, Daniel G. A. Smith, Animashree Anandkumar, Frederick R. Manby, Thomas F. Miller III

    Abstract: We refine the OrbNet model to accurately predict energy, forces, and other response properties for molecules using a graph neural-network architecture based on features from low-cost approximated quantum operators in the symmetry-adapted atomic orbital basis. The model is end-to-end differentiable due to the derivation of analytic gradients for all electronic structure terms, and is shown to be tr…

    Submitted 1 December, 2020; v1 submitted 5 November, 2020; originally announced November 2020.

    Comments: Accepted for presentation at the Machine Learning for Molecules workshop at NeurIPS 2020

  35. arXiv:2011.00317

    math.CO cs.DM

    Capture times in the Bridge-burning Cops and Robbers game

    Authors: Rebekah Herrman, Peter van Hintum, Stephen G. Z. Smith

    Abstract: In this paper, we consider a variant of the cops and robbers game on a graph, introduced by Kinnersley and Peterson, in which every time the robber uses an edge, it is removed from the graph, known as bridge-burning cops and robbers. In particular, we study the maximum time it takes the cops to capture the robber.

    Submitted 31 October, 2020; originally announced November 2020.

    Comments: 16 pages, 3 figures

    MSC Class: 05C57; 49N75; 91A24; 91A46; 91A05; 05C80

  36. arXiv:2003.00290

    cs.DC cs.PL

    Enumerating Hardware-Software Splits with Program Rewriting

    Authors: Gus Smith, Zachary Tatlock, Luis Ceze

    Abstract: A core problem in hardware-software codesign is in the sheer size of the design space. Without a set ISA to constrain the hardware-software interface, the design space explodes. This work presents a strategy for managing the massive hardware-software design space within the domain of machine learning inference workloads and accelerators. We first propose EngineIR, a new language for representing m…

    Submitted 29 February, 2020; originally announced March 2020.

    Comments: Accepted in the Second Young Architect Workshop, in conjunction with ASPLOS 2020

  37. arXiv:1911.09219

    cs.AI cs.HC

    Integrating Automated Play in Level Co-Creation

    Authors: Andrew Hoyt, Matthew Guzdial, Yalini Kumar, Gillian Smith, Mark O. Riedl

    Abstract: In level co-creation an AI and human work together to create a video game level. One open challenge in level co-creation is how to empower human users to ensure particular qualities of the final level, such as challenge. There has been significant prior research into automated pathing and automated playtesting for video game levels, but not in how to incorporate these into tools. In this demonstra…

    Submitted 20 November, 2019; originally announced November 2019.

    Comments: 2 pages, 2 figures, AIIDE Workshop on Experimental AI in Games

    Journal ref: AIIDE Workshop on Experimental AI in Games 2019

  38. Playing Games with Multiple Access Channels

    Authors: Felix Leditzky, Mohammad A. Alhejji, Joshua Levin, Graeme Smith

    Abstract: Communication networks have multiple users, each sending and receiving messages. A multiple access channel (MAC) models multiple senders transmitting to a single receiver, such as the uplink from many mobile phones to a single base station. The optimal performance of a MAC is quantified by a capacity region of simultaneously achievable communication rates. We study the two-sender classical MAC, th…

    Submitted 31 March, 2020; v1 submitted 5 September, 2019; originally announced September 2019.

    Comments: 25 pages, 7 figures, comments welcome! v2: identical to published version

    Journal ref: Nature Communications 11, 1497 (2020)

  39. A Tight Uniform Continuity Bound for Equivocation

    Authors: Mohammad A. Alhejji, Graeme Smith

    Abstract: We prove a tight uniform continuity bound for the conditional Shannon entropy of discrete finitely supported random variables in terms of total variation distance.

    Submitted 14 July, 2020; v1 submitted 2 September, 2019; originally announced September 2019.

    Comments: 4 pages, streamlined the proof in v2, minor changes + added a clarifying sentence in v3

    Journal ref: IEEE International Symposium on Information Theory (ISIT), Los Angeles, CA, USA, 2020, pp. 2270-2274

  40. arXiv:1902.11050

    cs.CV

    Segmentation of Roots in Soil with U-Net

    Authors: Abraham George Smith, Jens Petersen, Raghavendra Selvan, Camilla Ruø Rasmussen

    Abstract: Plant root research can provide a way to attain stress-tolerant crops that produce greater yield in a diverse array of conditions. Phenotyping roots in soil is often challenging due to the roots being difficult to access and the use of time consuming manual methods. Rhizotrons allow visual inspection of root growth through transparent surfaces. Agronomists currently manually label photographs of r…

    Submitted 18 March, 2019; v1 submitted 28 February, 2019; originally announced February 2019.

  41. Friend, Collaborator, Student, Manager: How Design of an AI-Driven Game Level Editor Affects Creators

    Authors: Matthew Guzdial, Nicholas Liao, Jonathan Chen, Shao-Yu Chen, Shukan Shah, Vishwa Shah, Joshua Reno, Gillian Smith, Mark Riedl

    Abstract: Machine learning advances have afforded an increase in algorithms capable of creating art, music, stories, games, and more. However, it is not yet well-understood how machine learning algorithms might best collaborate with people to support creative expression. To investigate how practicing designers perceive the role of AI in the creative process, we developed a game level design tool for Super M…

    Submitted 18 January, 2019; originally announced January 2019.

    Comments: 13 pages, 3 figures, CHI Conference on Human Factors in Computing Systems

  42. arXiv:1812.00996  [pdf, ps, other]

    cs.LO

    A high-level operational semantics for hardware weak memory models

    Authors: Robert J. Colvin, Graeme Smith

    Abstract: Modern processors deploy a variety of weak memory models, which for efficiency reasons may execute instructions in an order different to that specified by the program text. The consequences of instruction reordering can be complex and subtle, and can impact on ensuring correctness. In this paper we build on extensive work elucidating the semantics of assembler-level languages on hardware architect…

    Submitted 2 December, 2018; originally announced December 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1802.04406

  43. Correctness of Concurrent Objects under Weak Memory Models

    Authors: Graeme Smith, Kirsten Winter, Robert J. Colvin

    Abstract: In this paper we develop a theory for correctness of concurrent objects under weak memory models. Central to our definitions is the concept of observations which determine when effects of operations become visible, and hence determine the semantics of objects, under a given memory model. The resulting notion of correctness, called object refinement, is generic as it is parameterised by the memory…

    Submitted 22 October, 2018; originally announced October 2018.

    Comments: In Proceedings Refine 2018, arXiv:1810.08739. arXiv admin note: text overlap with arXiv:1802.04954

    Journal ref: EPTCS 282, 2018, pp. 53-67

  44. arXiv:1809.09419  [pdf, other]

    cs.AI

    Explainable PCGML via Game Design Patterns

    Authors: Matthew Guzdial, Joshua Reno, Jonathan Chen, Gillian Smith, Mark Riedl

    Abstract: Procedural content generation via Machine Learning (PCGML) is the umbrella term for approaches that generate content for games via machine learning. One of the benefits of PCGML is that, unlike search or grammar-based PCG, it does not require hand authoring of initial content or rules. Instead, PCGML relies on existing content and black box models, which can be difficult to tune or tweak without e…

    Submitted 25 September, 2018; originally announced September 2018.

    Comments: 8 pages, 3 figures, Fifth Experimental AI in Games Workshop

  45. arXiv:1803.04035  [pdf, ps, other]

    cs.DB cs.LG

    Entity Resolution and Federated Learning get a Federated Resolution

    Authors: Richard Nock, Stephen Hardy, Wilko Henecka, Hamish Ivey-Law, Giorgio Patrini, Guillaume Smith, Brian Thorne

    Abstract: Consider two data providers, each maintaining records of different feature sets about common entities. They aim to learn a linear model over the whole set of features. This problem of federated learning over vertically partitioned data includes a crucial upstream issue: entity resolution, i.e. finding the correspondence between the rows of the datasets. It is well known that entity resolution, jus…

    Submitted 20 March, 2018; v1 submitted 11 March, 2018; originally announced March 2018.

    Comments: arXiv admin note: text overlap with arXiv:1711.10677

    ACM Class: I.2.6; C.2.4

  46. arXiv:1802.04954  [pdf, other]

    cs.LO

    A sound and complete definition of linearizability on weak memory models

    Authors: Graeme Smith, Kirsten Winter, Robert J. Colvin

    Abstract: Linearizability is a widely accepted notion of correctness for concurrent objects. Recent research has investigated redefining linearizability for particular hardware weak memory models, in particular for TSO. In this paper, we provide an overview of this research and show that such redefinitions of linearizability are not required: under an interpretation of specification behaviour which abstract…

    Submitted 1 July, 2019; v1 submitted 13 February, 2018; originally announced February 2018.

    Comments: 33 pages, including appendix. arXiv admin note: text overlap with arXiv:1810.09612

  47. A wide-spectrum language for verification of programs on weak memory models

    Authors: Robert J. Colvin, Graeme Smith

    Abstract: Modern processors deploy a variety of weak memory models, which for efficiency reasons may (appear to) execute instructions in an order different to that specified by the program text. The consequences of instruction reordering can be complex and subtle, and can impact on ensuring correctness. Previous work on the semantics of weak memory models has focussed on the behaviour of assembler-level pro…

    Submitted 12 February, 2018; originally announced February 2018.

  48. arXiv:1711.10677  [pdf, other]

    cs.LG

    Private federated learning on vertically partitioned data via entity resolution and additively homomorphic encryption

    Authors: Stephen Hardy, Wilko Henecka, Hamish Ivey-Law, Richard Nock, Giorgio Patrini, Guillaume Smith, Brian Thorne

    Abstract: Consider two data providers, each maintaining private records of different feature sets about common entities. They aim to learn a linear model jointly in a federated setting, namely, data is local and a shared model is trained from locally computed updates. In contrast with most work on distributed learning, in this scenario (i) data is split vertically, i.e. by features, (ii) only one data provi…

    Submitted 28 November, 2017; originally announced November 2017.

  49. arXiv:1705.09701  [pdf, other]

    cs.OS

    SMORE: A Cold Data Object Store for SMR Drives (Extended Version)

    Authors: Peter Macko, Xiongzi Ge, John Haskins Jr., James Kelley, David Slik, Keith A. Smith, Maxim G. Smith

    Abstract: Shingled magnetic recording (SMR) increases the capacity of magnetic hard drives, but it requires that each zone of a disk be written sequentially and erased in bulk. This makes SMR a good fit for workloads dominated by large data objects with limited churn. To explore this possibility, we have developed SMORE, an object storage system designed to reliably and efficiently store large, seldom-chang…

    Submitted 26 May, 2017; originally announced May 2017.

    Comments: 13 pages, 8 figures, full version of 6 page paper published at MSST 2017

  50. Uniform Additivity in Classical and Quantum Information

    Authors: Andrew W. Cross, Ke Li, Graeme Smith

    Abstract: Information theory establishes the fundamental limits on data transmission, storage, and processing. Quantum information theory unites information theoretic ideas with an accurate quantum-mechanical description of reality to give a more accurate and complete theory with new and more powerful possibilities for information processing. The goal of both classical and quantum information theory is to q…

    Submitted 20 January, 2016; originally announced January 2016.

    Comments: 13 pages with 4 figures + 25 page appendix

    Journal ref: Phys. Rev. Lett. 118, 040501 (2017)