
Showing 1–50 of 129 results for author: Martín, J

Searching in archive cs.
  1. arXiv:2406.19561  [pdf, other]

    cs.LG cs.AI

    Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning

    Authors: Bradley Burega, John D. Martin, Luke Kapeluck, Michael Bowling

    Abstract: We study how a Reinforcement Learning (RL) system can remain sample-efficient when learning from an imperfect model of the environment. This is particularly challenging when the learning system is resource-constrained and in continual settings, where the environment dynamics change. To address these challenges, our paper introduces an online, meta-gradient algorithm that tunes a probability with w…

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.11880  [pdf, other]

    cs.CR cs.LG

    Knowledge Return Oriented Prompting (KROP)

    Authors: Jason Martin, Kenneth Yeung

    Abstract: Many Large Language Models (LLMs) and LLM-powered apps deployed today use some form of prompt filter or alignment to protect their integrity. However, these measures aren't foolproof. This paper introduces KROP, a prompt injection technique capable of obfuscating prompt injection attacks, rendering them virtually undetectable to most of these security measures.

    Submitted 11 June, 2024; originally announced June 2024.

  3. arXiv:2406.11036  [pdf, other]

    cs.CL cs.CR

    garak: A Framework for Security Probing Large Language Models

    Authors: Leon Derczynski, Erick Galinkin, Jeffrey Martin, Subho Majumdar, Nanna Inie

    Abstract: As Large Language Models (LLMs) are deployed and integrated into thousands of applications, the need for scalable evaluation of how models respond to adversarial attacks grows rapidly. However, LLM security is a moving target: models produce unpredictable output, are constantly updated, and the potential adversary is highly diverse: anyone with access to the internet and a decent command of natura…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: https://garak.ai

  4. arXiv:2406.00225  [pdf, other]

    cs.ET cond-mat.mes-hall

    Kinematic Model of Magnetic Domain Wall Motion for Fast, High-Accuracy Simulations

    Authors: Kristi Doleh, Leonard Humphrey, Chandler M. Linseisen, Michael D. Kitcher, Joanna M. Martin, Can Cui, Jean Anne C. Incorvia, Felipe Garcia-Sanchez, Naimul Hassan, Alexander J. Edwards, Joseph S. Friedman

    Abstract: Domain wall (DW) devices have garnered recent interest for diverse applications including memory, logic, and neuromorphic primitives; fast, accurate device models are therefore imperative for large-scale system design and verification. Extant DW motion models are sub-optimal for large-scale system design either over-consuming compute resources with physics-heavy equations or oversimplifying the ph…

    Submitted 31 May, 2024; originally announced June 2024.

  5. arXiv:2405.09153  [pdf, other]

    cs.CL cs.LG

    Adapting Abstract Meaning Representation Parsing to the Clinical Narrative -- the SPRING THYME parser

    Authors: Jon Z. Cai, Kristin Wright-Bettner, Martha Palmer, Guergana K. Savova, James H. Martin

    Abstract: This paper is dedicated to the design and evaluation of the first AMR parser tailored for clinical notes. Our objective was to facilitate the precise transformation of the clinical notes into structured AMR expressions, thereby enhancing the interpretability and usability of clinical text data at scale. Leveraging the colon cancer dataset from the Temporal Histories of Your Medical Events (THYME)…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: Accepted to the 6th Clinical NLP Workshop at NAACL, 2024

  6. arXiv:2404.17730  [pdf, other]

    cs.HC cs.CL

    Bridging the Social & Technical Divide in Augmentative and Alternative Communication (AAC) Applications for Autistic Adults

    Authors: Lara J. Martin, Malathy Nagalakshmi

    Abstract: Natural Language Processing (NLP) techniques are being used more frequently to improve high-tech Augmentative and Alternative Communication (AAC), but many of these techniques are integrated without the inclusion of the users' perspectives. As many of these tools are created with children in mind, autistic adults are often neglected in the design of AAC tools to begin with. We conducted in-depth i…

    Submitted 26 April, 2024; originally announced April 2024.

  7. arXiv:2404.08949  [pdf, other]

    cs.CL

    Multimodal Cross-Document Event Coreference Resolution Using Linear Semantic Transfer and Mixed-Modality Ensembles

    Authors: Abhijnan Nath, Huma Jamil, Shafiuddin Rehan Ahmed, George Baker, Rahul Ghosh, James H. Martin, Nathaniel Blanchard, Nikhil Krishnaswamy

    Abstract: Event coreference resolution (ECR) is the task of determining whether distinct mentions of events within a multi-document corpus are actually linked to the same underlying occurrence. Images of the events can help facilitate resolution when language is ambiguous. Here, we propose a multimodal cross-document event coreference resolution method that integrates visual and textual cues with a simple l…

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: To appear at LREC-COLING 2024

  8. arXiv:2404.08656  [pdf, other]

    cs.CL cs.AI

    Linear Cross-document Event Coreference Resolution with X-AMR

    Authors: Shafiuddin Rehan Ahmed, George Arthur Baker, Evi Judge, Michael Regan, Kristin Wright-Bettner, Martha Palmer, James H. Martin

    Abstract: Event Coreference Resolution (ECR) as a pairwise mention classification task is expensive both for automated systems and manual annotations. The task's quadratic difficulty is exacerbated when using Large Language Models (LLMs), making prompt engineering for ECR prohibitively costly. In this work, we propose a graphical representation of events, X-AMR, anchored around individual mentions using a \…

    Submitted 24 March, 2024; originally announced April 2024.

    Comments: LREC-COLING 2024 main conference

  9. arXiv:2403.15407  [pdf, other]

    cs.CL cs.AI

    X-AMR Annotation Tool

    Authors: Shafiuddin Rehan Ahmed, Jon Z. Cai, Martha Palmer, James H. Martin

    Abstract: This paper presents a novel Cross-document Abstract Meaning Representation (X-AMR) annotation tool designed for annotating key corpus-level event semantics. Leveraging machine assistance through the Prodigy Annotation Tool, we enhance the user experience, ensuring ease and efficiency in the annotation process. Through empirical analyses, we demonstrate the effectiveness of our tool in augmenting a…

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: EACL 2024 System Demonstration

  10. DeepSee: Multidimensional Visualizations of Seabed Ecosystems

    Authors: Adam Coscia, Haley M. Sapers, Noah Deutsch, Malika Khurana, John S. Magyar, Sergio A. Parra, Daniel R. Utter, Rebecca L. Wipfler, David W. Caress, Eric J. Martin, Jennifer B. Paduan, Maggie Hendrie, Santiago Lombeyda, Hillary Mushkin, Alex Endert, Scott Davidoff, Victoria J. Orphan

    Abstract: Scientists studying deep ocean microbial ecosystems use limited numbers of sediment samples collected from the seafloor to characterize important life-sustaining biogeochemical cycles in the environment. Yet conducting fieldwork to sample these extreme remote environments is both expensive and time consuming, requiring tools that enable scientists to explore the sampling history of field sites and…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted to CHI 2024. 16 pages, 7 figures, 2 tables. For a demo video, see https://youtu.be/HJ4zbueJ9cs . For a live demo, visit https://www.its.caltech.edu/~datavis/deepsee/ . The source code is available at https://github.com/orphanlab/DeepSee

  11. arXiv:2402.07462  [pdf]

    cs.AI cs.CY cs.LG cs.MA econ.TH

    A Hormetic Approach to the Value-Loading Problem: Preventing the Paperclip Apocalypse?

    Authors: Nathan I. N. Henry, Mangor Pedersen, Matt Williams, Jamin L. B. Martin, Liesje Donkin

    Abstract: The value-loading problem is a significant challenge for researchers aiming to create artificial intelligence (AI) systems that align with human values and preferences. This problem requires a method to define and regulate safe and optimal limits of AI behaviors. In this work, we propose HALO (Hormetic ALignment via Opponent processes), a regulatory paradigm that uses hormetic analysis to regulate…

    Submitted 13 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 24 pages, 7 figures

    MSC Class: 68T01; 68T37; 68T42 ACM Class: I.2.0; I.2.8; I.2.11

  12. arXiv:2401.03306  [pdf, other]

    cs.LG cs.AI cs.RO

    MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning

    Authors: Rafael Rafailov, Kyle Hatch, Victor Kolev, John D. Martin, Mariano Phielipp, Chelsea Finn

    Abstract: We study the problem of offline pre-training and online fine-tuning for reinforcement learning from high-dimensional observations in the context of realistic robot tasks. Recent offline model-free approaches successfully use online fine-tuning to either improve the performance of the agent over the data collection policy or adapt to novel tasks. At the same time, model-based RL algorithms have ach…

    Submitted 6 January, 2024; originally announced January 2024.

    Comments: This is an updated version of a manuscript that originally appeared at CoRL 2023. The project website is here https://sites.google.com/view/mo2o

    Journal ref: Proceedings of The 7th Conference on Robot Learning, PMLR 229:3654-3671, 2023

  13. arXiv:2312.10257  [pdf, other]

    cs.LG physics.geo-ph

    The Physics-Informed Neural Network Gravity Model: Generation III

    Authors: John Martin, Hanspeter Schaub

    Abstract: Scientific machine learning and the advent of the Physics-Informed Neural Network (PINN) show considerable potential in their capacity to identify solutions to complex differential equations. Over the past two years, much work has gone into the development of PINNs capable of solving the gravity field modeling problem -- i.e., learning a differentiable form of the gravitational potential from posi…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 42 pages, 14 figures, submitted to The Journal of Astronautical Sciences

  14. arXiv:2312.09394  [pdf, other]

    cs.RO

    HiER: Highlight Experience Replay for Boosting Off-Policy Reinforcement Learning Agents

    Authors: Dániel Horváth, Jesús Bujalance Martín, Ferenc Gábor Erdős, Zoltán Istenes, Fabien Moutarde

    Abstract: Even though reinforcement-learning-based algorithms achieved superhuman performance in many domains, the field of robotics poses significant challenges as the state and action spaces are continuous, and the reward function is predominantly sparse. Furthermore, on many occasions, the agent is devoid of access to any form of demonstration. Inspired by human learning, in this work, we propose a metho…

    Submitted 9 July, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted for publication in IEEE Access

  15. arXiv:2311.10928  [pdf, other]

    cs.CL cs.AI

    CAMRA: Copilot for AMR Annotation

    Authors: Jon Z. Cai, Shafiuddin Rehan Ahmed, Julia Bonn, Kristin Wright-Bettner, Martha Palmer, James H. Martin

    Abstract: In this paper, we introduce CAMRA (Copilot for AMR Annotations), a cutting-edge web-based tool designed for constructing Abstract Meaning Representation (AMR) from natural language text. CAMRA offers a novel approach to deep lexical semantics annotation such as AMR, treating AMR annotation akin to coding in programming languages. Leveraging the familiarity of programming paradigms, CAMRA encompa…

    Submitted 20 February, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023 System Demonstration

  16. arXiv:2311.01468  [pdf, other]

    cs.CL cs.LG

    Remember what you did so you know what to do next

    Authors: Manuel R. Ciosici, Alex Hedges, Yash Kankanampati, Justin Martin, Marjorie Freedman, Ralph Weischedel

    Abstract: We explore using a moderately sized large language model (GPT-J 6B parameters) to create a plan for a simulated robot to achieve 30 classes of goals in ScienceWorld, a text game simulator for elementary science experiments. Previously published empirical work claimed that large language models (LLMs) are a poor fit (Wang et al., 2022) compared to reinforcement learning. Using the Markov assumption…

    Submitted 30 October, 2023; originally announced November 2023.

    Comments: Identical to EMNLP 2023 Findings

  17. arXiv:2310.07084  [pdf, other]

    cs.LG

    Investigating the Adversarial Robustness of Density Estimation Using the Probability Flow ODE

    Authors: Marius Arvinte, Cory Cornelius, Jason Martin, Nageen Himayat

    Abstract: Beyond their impressive sampling capabilities, score-based diffusion models offer a powerful analysis tool in the form of unbiased density estimation of a query sample under the training data distribution. In this work, we investigate the robustness of density estimation using the probability flow (PF) neural ordinary differential equation (ODE) model against gradient-based likelihood maximization…

    Submitted 10 October, 2023; originally announced October 2023.

  18. arXiv:2309.16744  [pdf]

    cs.LG q-bio.OT

    Predicting Long-term Renal Impairment in Post-COVID-19 Patients with Machine Learning Algorithms

    Authors: Maitham G. Yousif, Hector J. Castro, John Martin, Hayder A. Albaqer, Fadhil G. Al-Amran, Habeeb W. Shubber, Salman Rawaf

    Abstract: The COVID-19 pandemic has had far-reaching implications for global public health. As we continue to grapple with its consequences, it becomes increasingly clear that post-COVID-19 complications are a significant concern. Among these complications, renal impairment has garnered particular attention due to its potential long-term health impacts. This study, conducted with a cohort of 821 post-COVID-…

    Submitted 28 September, 2023; originally announced September 2023.

  19. arXiv:2309.16066  [pdf, other]

    cs.LG

    Label Augmentation Method for Medical Landmark Detection in Hip Radiograph Images

    Authors: Yehyun Suh, Peter Chan, J. Ryan Martin, Daniel Moyer

    Abstract: This work reports the empirical performance of an automated medical landmark detection method for predicting clinical markers in hip radiograph images. Notably, the detection method was trained using a label-only augmentation scheme; our results indicate that this form of augmentation outperforms traditional data augmentation and produces highly sample efficient estimators. We train a generic U-Net-b…

    Submitted 8 December, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

  20. arXiv:2309.09993  [pdf]

    cs.LG q-bio.BM q-bio.NC

    Long-term Neurological Sequelae in Post-COVID-19 Patients: A Machine Learning Approach to Predict Outcomes

    Authors: Hayder A. Albaqer, Kadhum J. Al-Jibouri, John Martin, Fadhil G. Al-Amran, Salman Rawaf, Maitham G. Yousif

    Abstract: The COVID-19 pandemic has brought to light a concerning aspect of long-term neurological complications in post-recovery patients. This study delved into the investigation of such neurological sequelae in a cohort of 500 post-COVID-19 patients, encompassing individuals with varying illness severity. The primary aim was to predict outcomes using a machine learning approach based on diverse clinical…

    Submitted 15 September, 2023; originally announced September 2023.

  21. arXiv:2308.16258  [pdf, other]

    cs.CV

    Robust Principles: Architectural Design Principles for Adversarially Robust CNNs

    Authors: ShengYun Peng, Weilin Xu, Cory Cornelius, Matthew Hull, Kevin Li, Rahul Duggal, Mansi Phute, Jason Martin, Duen Horng Chau

    Abstract: Our research aims to unify existing works' diverging opinions on how architectural components affect the adversarial robustness of CNNs. To accomplish our goal, we synthesize a suite of three generalizable robust architectural design principles: (a) optimal range for depth and width configurations, (b) preferring convolutional over patchify stem stage, and (c) robust residual block design through…

    Submitted 31 August, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: Published at BMVC'23

  22. CALYPSO: LLMs as Dungeon Masters' Assistants

    Authors: Andrew Zhu, Lara J. Martin, Andrew Head, Chris Callison-Burch

    Abstract: The role of a Dungeon Master, or DM, in the game Dungeons & Dragons is to perform multiple tasks simultaneously. The DM must digest information about the game setting and monsters, synthesize scenes to present to other players, and respond to the players' interactions with the scene. Doing all of these tasks while maintaining consistency within the narrative and story world is no small feat of hum…

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 11 pages, 4 figures. AIIDE 2023

    Journal ref: AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE) 2023

  23. arXiv:2307.14628  [pdf, other]

    cs.LG stat.ME

    Rapid and Scalable Bayesian AB Testing

    Authors: Srivas Chennu, Andrew Maher, Christian Pangerl, Subash Prabanantham, Jae Hyeon Bae, Jamie Martin, Bud Goswami

    Abstract: AB testing aids business operators with their decision making, and is considered the gold standard method for learning from data to improve digital user experiences. However, there is usually a gap between the requirements of practitioners, and the constraints imposed by the statistical hypothesis testing methodologies commonly used for analysis of AB tests. These include the lack of statistical p…

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: The 10th IEEE International Conference On Data Science And Advanced Analytics

  24. arXiv:2306.05434  [pdf, other]

    cs.CL

    How Good is the Model in Model-in-the-loop Event Coreference Resolution Annotation?

    Authors: Shafiuddin Rehan Ahmed, Abhijnan Nath, Michael Regan, Adam Pollins, Nikhil Krishnaswamy, James H. Martin

    Abstract: Annotating cross-document event coreference links is a time-consuming and cognitively demanding task that can compromise annotation quality and efficiency. To address this, we propose a model-in-the-loop annotation approach for event coreference resolution, where a machine learning model suggests likely coreferring event pairs only. We evaluate the effectiveness of this approach by first simulating…

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: The 17th Linguistic Annotation Workshop, 2023 (LAW-XVII) short paper. 10 pages, 6 figures, 1 table

  25. arXiv:2305.05672  [pdf, other]

    cs.CL

    $2 * n$ is better than $n^2$: Decomposing Event Coreference Resolution into Two Tractable Problems

    Authors: Shafiuddin Rehan Ahmed, Abhijnan Nath, James H. Martin, Nikhil Krishnaswamy

    Abstract: Event Coreference Resolution (ECR) is the task of linking mentions of the same event either within or across documents. Most mention pairs are not coreferent, yet many that are coreferent can be identified through simple techniques such as lemma matching of the event triggers or the sentences in which they appear. Existing methods for training coreference systems sample from a largely skewed distr…

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: Findings of the Association for Computational Linguistics, ACL 2023. 13 pages, 7 figures, 6 tables

  26. FIREBALL: A Dataset of Dungeons and Dragons Actual-Play with Structured Game State Information

    Authors: Andrew Zhu, Karmanya Aggarwal, Alexander Feng, Lara J. Martin, Chris Callison-Burch

    Abstract: Dungeons & Dragons (D&D) is a tabletop roleplaying game with complex natural language interactions between players and hidden state information. Recent work has shown that large language models (LLMs) that have access to state information can generate higher quality game turns than LLMs that use dialog history alone. However, previous work used game state information that was heuristically created…

    Submitted 25 May, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: 21 pages, 2 figures. Accepted at ACL 2023

    Journal ref: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023, pp. 4171-4193

  27. arXiv:2304.09996  [pdf, other]

    cs.RO

    Robust Route Planning with Distributional Reinforcement Learning in a Stochastic Road Network Environment

    Authors: Xi Lin, Paul Szenher, John D. Martin, Brendan Englot

    Abstract: Route planning is essential to mobile robot navigation problems. In recent years, deep reinforcement learning (DRL) has been applied to learning optimal planning policies in stochastic environments without prior knowledge. However, existing works focus on learning policies that maximize the expected return, the performance of which can vary greatly when the level of stochasticity in the environmen…

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: The 20th International Conference on Ubiquitous Robots (UR 2023)

  28. Human-in-the-Loop Schema Induction

    Authors: Tianyi Zhang, Isaac Tham, Zhaoyi Hou, Jiaxuan Ren, Liyang Zhou, Hainiu Xu, Li Zhang, Lara J. Martin, Rotem Dror, Sha Li, Heng Ji, Martha Palmer, Susan Brown, Reece Suchocki, Chris Callison-Burch

    Abstract: Schema induction builds a graph representation explaining how events unfold in a scenario. Existing approaches have been based on information retrieval (IR) and information extraction (IE), often with limited human curation. We demonstrate a human-in-the-loop schema induction system powered by GPT-3. We first describe the different modules of our system, including prompting to generate schematic el…

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: 10 pages, ACL2023 demo track

  29. arXiv:2302.12944  [pdf, other]

    cs.CL cs.AI

    Dependency Dialogue Acts -- Annotation Scheme and Case Study

    Authors: Jon Z. Cai, Brendan King, Margaret Perkoff, Shiran Dudy, Jie Cao, Marie Grace, Natalia Wojarnik, Ananya Ganesh, James H. Martin, Martha Palmer, Marilyn Walker, Jeffrey Flanigan

    Abstract: In this paper, we introduce Dependency Dialogue Acts (DDA), a novel framework for capturing the structure of speaker-intentions in multi-party dialogues. DDA combines and adapts features from existing dialogue annotation frameworks, and emphasizes the multi-relational response structure of dialogues in addition to the dialogue acts and rhetorical relations. It represents the functional, discourse,…

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: The 13th International Workshop on Spoken Dialogue Systems Technology

    Journal ref: The 13th International Workshop on Spoken Dialogue Systems Technology 2023

  30. Comprehensive and user-analytics-friendly cancer patient database for physicians and researchers

    Authors: Ali Firooz, Avery T. Funkhouser, Julie C. Martin, W. Jeffery Edenfield, Homayoun Valafar, Anna V. Blenda

    Abstract: Nuanced cancer patient care is needed, as the development and clinical course of cancer is multifactorial with influences from the general health status of the patient, germline and neoplastic mutations, co-morbidities, and environment. To effectively tailor an individualized treatment to each patient, such multifactorial data must be presented to providers in an easy-to-access and easy-to-analyze…

    Submitted 1 February, 2023; originally announced February 2023.

    Comments: 7 pages, 12 figures, peer reviewed and accepted in "International Conference on Computational Science and Computational Intelligence (CSCI 22)"

    Journal ref: Proceedings of the 2022 International Conference on Computational Science and Computational Intelligence (CSCI)

  31. Author as Character and Narrator: Deconstructing Personal Narratives from the r/AmITheAsshole Reddit Community

    Authors: Salvatore Giorgi, Ke Zhao, Alexander H. Feng, Lara J. Martin

    Abstract: In the r/AmITheAsshole subreddit, people anonymously share first person narratives that contain some moral dilemma or conflict and ask the community to judge who is at fault (i.e., who is "the asshole"). In general, first person narratives are a unique storytelling domain where the author is the narrator (the person telling the story) but can also be a character (the person living the story) and,…

    Submitted 15 March, 2023; v1 submitted 19 January, 2023; originally announced January 2023.

    Comments: Accepted to the 17th International AAAI Conference on Web and Social Media (ICWSM), 2023

    Journal ref: Proceedings of the International AAAI Conference on Web and Social Media (ICWSM) 2023, 17(1), 233-244

  32. arXiv:2301.03110  [pdf, other]

    cs.CV cs.AI

    RobArch: Designing Robust Architectures against Adversarial Attacks

    Authors: ShengYun Peng, Weilin Xu, Cory Cornelius, Kevin Li, Rahul Duggal, Duen Horng Chau, Jason Martin

    Abstract: Adversarial Training is the most effective approach for improving the robustness of Deep Neural Networks (DNNs). However, compared to the large body of research in optimizing the adversarial training process, there are few investigations into how architecture components affect robustness, and they rarely constrain model capacity. Thus, it is unclear where robustness precisely comes from. In this w…

    Submitted 8 January, 2023; originally announced January 2023.

  33. arXiv:2301.00280  [pdf]

    cs.IR cs.AI

    RECOMED: A Comprehensive Pharmaceutical Recommendation System

    Authors: Mariam Zomorodi, Ismail Ghodsollahee, Jennifer H. Martin, Nicholas J. Talley, Vahid Salari, Pawel Plawiak, Kazem Rahimi, U. Rajendra Acharya

    Abstract: A comprehensive pharmaceutical recommendation system was designed based on the patients and drugs features extracted from Drugs.com and Druglib.com. First, data from these databases were combined, and a dataset of patients and drug information was built. Secondly, the patients and drugs were clustered, and then the recommendation was performed using different ratings provided by patients, and impo…

    Submitted 21 August, 2023; v1 submitted 31 December, 2022; originally announced January 2023.

    Comments: 39 pages, 14 figures, 13 tables

  34. CoRRPUS: Code-based Structured Prompting for Neurosymbolic Story Understanding

    Authors: Yijiang River Dong, Lara J. Martin, Chris Callison-Burch

    Abstract: Story generation and understanding -- as with all NLG/NLU tasks -- has seen a surge in neurosymbolic work. Researchers have recognized that, while large language models (LLMs) have tremendous utility, they can be augmented with symbolic means to be even better and to make up for any flaws that the neural networks might have. However, symbolic methods are extremely costly in terms of the amount of…

    Submitted 8 June, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted to Findings of ACL 2023

    Journal ref: Findings of ACL 2023, pp. 13152-13168

  35. arXiv:2212.10420  [pdf, other]

    cs.AI cs.LG math.ST

    Settling the Reward Hypothesis

    Authors: Michael Bowling, John D. Martin, David Abel, Will Dabney

    Abstract: The reward hypothesis posits that, "all of what we mean by goals and purposes can be well thought of as maximization of the expected value of the cumulative sum of a received scalar signal (reward)." We aim to fully settle this hypothesis. This will not conclude with a simple affirmation or refutation, but rather specify completely the implicit requirements on goals and purposes under which the hy…

    Submitted 16 September, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

  36. arXiv:2212.10283  [pdf, other]

    cond-mat.mtrl-sci cs.LG

    Interpretable models for extrapolation in scientific machine learning

    Authors: Eric S. Muckley, James E. Saal, Bryce Meredig, Christopher S. Roper, John H. Martin

    Abstract: Data-driven models are central to scientific discovery. In efforts to achieve state-of-the-art model accuracy, researchers are employing increasingly complex machine learning algorithms that often outperform simple regressions in interpolative settings (e.g. random k-fold cross-validation) but suffer from poor extrapolation performance, portability, and human interpretability, which limits their p…

    Submitted 16 December, 2022; originally announced December 2022.

    Comments: DISTRIBUTION STATEMENT A (Approved for Public Release, Distribution Unlimited)

  37. arXiv:2212.09821  [pdf, other]

    eess.SY cs.RO

    Reduced Order Model of a Generic Submarine for Maneuvering Near the Surface

    Authors: J. Ezequiel Martin, Maxwell Hammond, Nicholas Rober, Yakin Kim, Venanzio Cichella, Pablo Carrica

    Abstract: A reduced order model of a generic submarine is presented. Computational fluid dynamics (CFD) results are used to create and validate a model that includes depth dependence and the effect of waves on the craft. The model and the procedure to obtain its coefficients are discussed, and examples of the data used to obtain the model coefficients are presented. An example of operation following a compl…

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: Presented at the 34th Symposium on Naval Hydrodynamics, Washington DC, USA, 26 June - 1 July 2022

  38. Dungeons and Dragons as a Dialog Challenge for Artificial Intelligence

    Authors: Chris Callison-Burch, Gaurav Singh Tomar, Lara J. Martin, Daphne Ippolito, Suma Bailis, David Reitter

    Abstract: AI researchers have posited Dungeons and Dragons (D&D) as a challenge problem to test systems on various language-related capabilities. In this paper, we frame D&D specifically as a dialogue system challenge, where the tasks are to both generate the next conversational turn in the game and predict the state of the game given the dialogue history. We create a gameplay dataset consisting of nearly 9…

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted at EMNLP 2022

    Journal ref: Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 9379-9393, Dec. 2022

  39. arXiv:2209.14030  [pdf, other]

    cs.RO cs.CL cs.FL

    Monitoring ROS2: from Requirements to Autonomous Robots

    Authors: Ivan Perez, Anastasia Mavridou, Tom Pressburger, Alexander Will, Patrick J. Martin

    Abstract: Runtime verification (RV) has the potential to enable the safe operation of safety-critical systems that are too complex to formally verify, such as Robot Operating System 2 (ROS2) applications. Writing correct monitors can itself be complex, and errors in the monitoring subsystem threaten the mission as a whole. This paper provides an overview of a formal approach to generating runtime monitors f…

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: In Proceedings FMAS2022 ASYDE2022, arXiv:2209.13181

    ACM Class: D.2.1; D.2.4; I.2.9;

    Journal ref: EPTCS 371, 2022, pp. 208-216

  40. arXiv:2208.03609  [pdf, other]

    eess.IV cs.CV cs.LG

    Continual Learning for Tumor Classification in Histopathology Images

    Authors: Veena Kaustaban, Qinle Ba, Ipshita Bhattacharya, Nahil Sobh, Satarupa Mukherjee, Jim Martin, Mohammad Saleh Miri, Christoph Guetter, Amal Chaturvedi

    Abstract: Recent years have seen great advancements in the development of deep learning models for histopathology image analysis in digital pathology applications, evidenced by the increasingly common deployment of these models in both research and clinical settings. Although such models have shown unprecedented performance in solving fundamental computational tasks in DP applications, they suffer from cata…

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: Accepted by MOVI, a MICCAI2022 workshop: https://sites.google.com/view/movi2022

  41. arXiv:2207.10719  [pdf, other]

    cs.CV cs.AI cs.LG

    Synthetic Dataset Generation for Adversarial Machine Learning Research

    Authors: Xiruo Liu, Shibani Singh, Cory Cornelius, Colin Busho, Mike Tan, Anindya Paul, Jason Martin

    Abstract: Existing adversarial example research focuses on digitally inserted perturbations on top of existing natural image datasets. This construction of adversarial examples is not realistic because it may be difficult, or even impossible, for an attacker to deploy such an attack in the real-world due to sensing and environmental effects. To better understand adversarial examples against cyber-physical s…

    Submitted 21 July, 2022; originally announced July 2022.

    Journal ref: AdvML Frontiers 2022

  42. arXiv:2206.13960  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    Dynamic Memory for Interpretable Sequential Optimisation

    Authors: Srivas Chennu, Andrew Maher, Jamie Martin, Subash Prabanantham

    Abstract: Real-world applications of reinforcement learning for recommendation and experimentation face a practical challenge: the relative reward of different bandit arms can evolve over the lifetime of the learning agent. To deal with these non-stationary cases, the agent must forget some historical knowledge, as it may no longer be relevant to minimise regret. We present a solution to handling non-stati…

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: 2nd International Workshop on Online and Adaptive Recommender Systems, 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2022, Washington DC

  43. arXiv:2206.02848  [pdf]

    cs.CY

    Plagiarism deterrence for introductory programming

    Authors: Simon J. Cohen, Michael J. Martin, Chance A. Shipley, Abhishek Kumar, Andrew R. Cohen

    Abstract: Plagiarism in introductory programming courses is an enormous challenge for both students and institutions. For students, relying on the work of others too early in their academic development can make it impossible to acquire necessary skills for independent success in the future. For institutions, widespread student cheating can dilute the quality of the educational experience being offered. Curr…

    Submitted 6 June, 2022; originally announced June 2022.

  44. arXiv:2205.10736  [pdf, other]

    cs.LG cs.AI stat.ML

    Should Models Be Accurate?

    Authors: Esra'a Saleh, John D. Martin, Anna Koop, Arash Pourzarabi, Michael Bowling

    Abstract: Model-based Reinforcement Learning (MBRL) holds promise for data-efficiency by planning with model-generated experience in addition to learning with experience from the environment. However, in complex or changing environments, models in MBRL will inevitably be imperfect, and their detrimental effects on learning can be difficult to mitigate. In this work, we question whether the objective of thes…

    Submitted 22 May, 2022; originally announced May 2022.

    Comments: The 5th Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2022)

  45. arXiv:2205.01254  [pdf, other]

    cs.SE

    Deep API Learning Revisited

    Authors: James Martin, ** L. C. Guo

    Abstract: Understanding the correct API usage sequences is one of the most important tasks for programmers when they work with unfamiliar libraries. However, programmers often encounter obstacles to finding the appropriate information due to either poor quality of API documentation or ineffective query-based searching strategy. To help solve this issue, researchers have proposed various methods to suggest t…

    Submitted 2 May, 2022; originally announced May 2022.

    Comments: 10 pages, 6 figures. This paper is accepted at ICPC 2022 (the 30th IEEE/ACM International Conference on Program Comprehension)

    ACM Class: D.2.13

  46. arXiv:2204.12421  [pdf, other]

    cs.CL

    Disambiguation of morpho-syntactic features of African American English -- the case of habitual be

    Authors: Harrison Santiago, Joshua Martin, Sarah Moeller, Kevin Tang

    Abstract: Recent research has highlighted that natural language processing (NLP) systems exhibit a bias against African American speakers. The bias errors are often caused by poor representation of linguistic features unique to African American English (AAE), due to the relatively low probability of occurrence of many such features in training data. We present a workflow to overcome such bias in the case of…

    Submitted 26 April, 2022; originally announced April 2022.

  47. Federated Learning Enables Big Data for Rare Cancer Boundary Detection

    Authors: Sarthak Pati, Ujjwal Baid, Brandon Edwards, Micah Sheller, Shih-Han Wang, G Anthony Reina, Patrick Foley, Alexey Gruzdev, Deepthi Karkada, Christos Davatzikos, Chiharu Sako, Satyam Ghodasara, Michel Bilello, Suyash Mohan, Philipp Vollmuth, Gianluca Brugnara, Chandrakanth J Preetha, Felix Sahm, Klaus Maier-Hein, Maximilian Zenk, Martin Bendszus, Wolfgang Wick, Evan Calabrese, Jeffrey Rudie, Javier Villanueva-Meyer, et al. (254 additional authors not shown)

    Abstract: Although machine learning (ML) has shown promise in numerous domains, there are concerns about generalizability to out-of-sample data. This is currently addressed by centrally sharing ample, and importantly diverse, data from multiple sites. However, such centralization is challenging to scale (or even not feasible) due to various limitations. Federated ML (FL) provides an alternative to train acc…

    Submitted 25 April, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: federated learning, deep learning, convolutional neural network, segmentation, brain tumor, glioma, glioblastoma, FeTS, BraTS

  48. arXiv:2204.09652  [pdf, other]

    cs.CL cs.AI cs.LG

    The TalkMoves Dataset: K-12 Mathematics Lesson Transcripts Annotated for Teacher and Student Discursive Moves

    Authors: Abhijit Suresh, Jennifer Jacobs, Charis Harty, Margaret Perkoff, James H. Martin, Tamara Sumner

    Abstract: Transcripts of teaching episodes can be effective tools to understand discourse patterns in classroom instruction. According to most educational experts, sustained classroom discourse is a critical component of equitable, engaging, and rich learning environments for students. This paper describes the TalkMoves dataset, composed of 567 human-annotated K-12 mathematics lesson transcripts (including…

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: 9 pages, 2 figures, Accepted for a Poster + Demo presentation at the 13th International Conference on Language Resources and Evaluation 2022

  49. arXiv:2202.07880  [pdf, other]

    cs.CL

    $\rm{C {\small IS}}^2$: A Simplified Commonsense Inference Evaluation for Story Prose

    Authors: Bryan Li, Lara J. Martin, Chris Callison-Burch

    Abstract: Transformers have been showing near-human performance on a variety of tasks, but they are not without their limitations. We discuss the issue of conflating results of transformers that are instructed to do multiple tasks simultaneously. In particular, we focus on the domain of commonsense reasoning within story prose, which we call contextual commonsense inference (CCI). We look at the GLUCOSE (Mo…

    Submitted 19 October, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: Published at the Workshop on Commonsense Representation and Reasoning (CSRR) @ ACL 2022

  50. arXiv:2202.03246  [pdf]

    cs.AI

    AI-based artistic representation of emotions from EEG signals: a discussion on fairness, inclusion, and aesthetics

    Authors: Piera Riccio, Kristin Bergaust, Boel Christensen-Scheel, Juan-Carlos De Martin, Maria A. Zuluaga, Stefano Nichele

    Abstract: While Artificial Intelligence (AI) technologies are being progressively developed, artists and researchers are investigating their role in artistic practices. In this work, we present an AI-based Brain-Computer Interface (BCI) in which humans and machines interact to express feelings artistically. This system and its production of images give opportunities to reflect on the complexities and range…

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: Accepted to the Politics of the Machines conference 2021 (POM Berlin 2021)