Skip to main content

Showing 1–50 of 85 results for author: Gardner, M

.
  1. arXiv:2406.08375  [pdf, other

    eess.SY cs.CE

    A Parameterized Nonlinear Magnetic Equivalent Circuit for Design and Fast Analysis of Radial Flux Magnetic Gears

    Authors: Danial Kazemikia, Matthew Gardner

    Abstract: Magnetic gears offer advantages over mechanical gears, including contactless power transfer, but require robust analysis tools for optimization and commercialization. This study proposes a rapid and accurate 2D nonlinear magnetic equivalent circuit (MEC) model for radial flux magnetic gears (RFMG). The model, featuring a parameterized gear geometry and adjustable flux tube distribution, accommodat… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2405.13158  [pdf

    cond-mat.mtrl-sci

    Towards establishing best practice in the analysis of hydrogen and deuterium by atom probe tomography

    Authors: Baptiste Gault, Aparna Saksena, Xavier Sauvage, Paul Bagot, Leonardo S. Aota, Jonas Arlt, Lisa T. Belkacemi, Torben Boll, Yi-Sheng Chen, Luke Daly, Milos B. Djukic, James O. Douglas, Maria J. Duarte, Peter J. Felfer, Richard G. Forbes, **g Fu, Hazel M. Gardner, Ryota Gemma, Stephan S. A. Gerstl, Yilun Gong, Guillaume Hachet, Severin Jakob, Benjamin M. Jenkins, Megan E. Jones, Heena Khanchandani , et al. (20 additional authors not shown)

    Abstract: As hydrogen is touted as a key player in the decarbonization of modern society, it is critical to enable quantitative H analysis at high spatial resolution, if possible at the atomic scale. Indeed, H has a known deleterious impact on the mechanical properties (strength, ductility, toughness) of most materials that can hinder their use as part of the infrastructure of a hydrogen-based economy. Enab… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  3. arXiv:2404.06962  [pdf, other

    cs.LG cs.AI

    Advancing Real-time Pandemic Forecasting Using Large Language Models: A COVID-19 Case Study

    Authors: Hongru Du, Jianan Zhao, Yang Zhao, Shaochong Xu, Xihong Lin, Yiran Chen, Lauren M. Gardner, Hao Frank Yang

    Abstract: Forecasting the short-term spread of an ongoing disease outbreak is a formidable challenge due to the complexity of contributing factors, some of which can be characterized through interlinked, multi-modality variables such as epidemiological time series data, viral biology, population demographics, and the intersection of public policy and human behavior. Existing forecasting model frameworks str… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 35 pages, 10 figures

  4. arXiv:2310.01810  [pdf, other

    physics.atom-ph physics.optics quant-ph

    High angular momentum coupling for enhanced Rydberg-atom sensing in the VHF band

    Authors: Nikunjkumar Prajapati, Jakob W. Kunzler, Alexandra B. Artusio-Glimpse, Andrew Rotunno, Samuel Berweger, Matthew T. Simons, Christopher L. Holloway, Chad M. Gardner, Michael S. Mcbeth, Robert A. Younts

    Abstract: Recent advances in Rydberg atom electrometry detail promising applications in radio frequency (RF) communications. Presently, most applications use carrier frequencies greater than 1~GHz where resonant Autler-Townes splitting provides the highest sensitivity. This letter documents a series of experiments with Rydberg atomic sensors to collect and process waveforms from the automated identification… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 9 figure, 8 pages

  5. arXiv:2305.14907  [pdf, other

    cs.CL

    Coverage-based Example Selection for In-Context Learning

    Authors: Shivanshu Gupta, Matt Gardner, Sameer Singh

    Abstract: In-context learning (ICL), the ability of large language models to perform novel tasks by conditioning on a prompt with a few task examples, requires these examples to be informative about the test instance. The standard approach of independently ranking and selecting the most similar examples selects redundant examples while omitting important information. In this work, we show that BERTScore-Rec… ▽ More

    Submitted 6 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted to EMNLP 2023 (Findings) Changelog: Added acknowledgments

  6. arXiv:2305.09572  [pdf, ps, other

    cs.SE stat.CO

    UQpy v4.1: Uncertainty Quantification with Python

    Authors: Dimitrios Tsapetis, Michael D. Shields, Dimitris G. Giovanis, Audrey Olivier, Lukas Novak, Promit Chakroborty, Himanshu Sharma, Mohit Chauhan, Katiana Kontolati, Lohit Vandanapu, Dimitrios Loukrezis, Michael Gardner

    Abstract: This paper presents the latest improvements introduced in Version 4 of the UQpy, Uncertainty Quantification with Python, library. In the latest version, the code was restructured to conform with the latest Python coding conventions, refactored to simplify previous tightly coupled features, and improve its extensibility and modularity. To improve the robustness of UQpy, software engineering best pr… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  7. arXiv:2303.05384  [pdf

    physics.ed-ph

    A scalable approach to undergraduate research in physics

    Authors: Amanda L. Baxter, Rafael F. Lang, Craig Zywicki, Stephanie M. Gardner, Abigail Kopec, Andreas Jung

    Abstract: Course-based undergraduate research experiences (CUREs) increase students' access to research. This lesson plan describes an interdisciplinary CURE developed to be able to involve over 60 students per semester in original research using data from large particle physics experiments and telescopes, although the methods described can easily be adopted by other areas of data science. Students are divi… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: 18 pages of main text, 2 figures, 2 tables, 32 pages of supporting materials

  8. arXiv:2212.04092  [pdf, other

    cs.CL

    Successive Prompting for Decomposing Complex Questions

    Authors: Dheeru Dua, Shivanshu Gupta, Sameer Singh, Matt Gardner

    Abstract: Answering complex questions that require making latent decisions is a challenging task, especially when limited supervision is available. Recent works leverage the capabilities of large language models (LMs) to perform complex question answering in a few-shot setting by demonstrating how to output intermediate rationalizations while solving the complex question in a single pass. We introduce ``Suc… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

  9. arXiv:2211.00295  [pdf, other

    cs.CL cs.AI

    CONDAQA: A Contrastive Reading Comprehension Dataset for Reasoning about Negation

    Authors: Abhilasha Ravichander, Matt Gardner, Ana Marasović

    Abstract: The full power of human language-based communication cannot be realized without negation. All human languages have some form of negation. Despite this, negation remains a challenging phenomenon for current natural language understanding systems. To facilitate the future development of models that can process negation effectively, we present CONDAQA, the first English reading comprehension dataset… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: EMNLP 2022

  10. arXiv:2205.08124  [pdf, other

    cs.CL

    When to Use Multi-Task Learning vs Intermediate Fine-Tuning for Pre-Trained Encoder Transfer Learning

    Authors: Orion Weller, Kevin Seppi, Matt Gardner

    Abstract: Transfer learning (TL) in natural language processing (NLP) has seen a surge of interest in recent years, as pre-trained models have shown an impressive ability to transfer to novel tasks. Three main strategies have emerged for making use of multiple supervised datasets during fine-tuning: training on an intermediate task before training on the target task (STILTs), using multi-task learning (MTL)… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: ACL 2022

  11. arXiv:2204.05991  [pdf, other

    cs.CV cs.CL

    ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension

    Authors: Sanjay Subramanian, William Merrill, Trevor Darrell, Matt Gardner, Sameer Singh, Anna Rohrbach

    Abstract: Training a referring expression comprehension (ReC) model for a new visual domain requires collecting referring expressions, and potentially corresponding bounding boxes, for images in the domain. While large-scale pre-trained models are useful for image classification across domains, it remains unclear if they can be applied in a zero-shot manner to more complex tasks like ReC. We present ReCLIP,… ▽ More

    Submitted 2 May, 2022; v1 submitted 12 April, 2022; originally announced April 2022.

    Comments: ACL 2022

  12. arXiv:2203.12942  [pdf, other

    cs.CL cs.AI cs.CY

    Generating Data to Mitigate Spurious Correlations in Natural Language Inference Datasets

    Authors: Yuxiang Wu, Matt Gardner, Pontus Stenetorp, Pradeep Dasigi

    Abstract: Natural language processing models often exploit spurious correlations between task-independent features and labels in datasets to perform well only within the distributions they are trained on, while not generalising to different task distributions. We propose to tackle this problem by generating a debiased version of a dataset, which can then be used to train a debiased, off-the-shelf model, by… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: Accepted to ACL 2022 main conference

  13. arXiv:2203.08445  [pdf, other

    cs.CL

    Structurally Diverse Sampling for Sample-Efficient Training and Comprehensive Evaluation

    Authors: Shivanshu Gupta, Sameer Singh, Matt Gardner

    Abstract: A growing body of research has demonstrated the inability of NLP models to generalize compositionally and has tried to alleviate it through specialized architectures, training schemes, and data augmentation, among other approaches. In this work, we study a different approach: training on instances with diverse structures. We propose a model-agnostic algorithm for subsampling such sets of instances… ▽ More

    Submitted 1 November, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted at Findings of EMNLP 2022

  14. arXiv:2202.11579  [pdf

    eess.SP

    Identifying Oscillations Injected by Inverter-Based Solar Energy Sources

    Authors: Chen Wang, Luigi Vanfretti, Chetan Mishra, Kevin D. Jones, R. Matthew Gardner

    Abstract: Inverter-based solar energy sources are becoming widely integrated into modern power systems. However, their impacts on the system in the frequency domain are rarely investigated at a higher frequency range than conventional electromechanical oscillations. This paper presents evidence of the emergence of an oscillation mode injected by inverter-based solar energy sources in Dominion Energy's servi… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

    Comments: 5 pages, 14 figures. This paper is accepted and will be published in the Proceedings of the 2022 IEEE PES General Meeting, July 17-21 2022, Denver, CO, USA

  15. arXiv:2202.07206  [pdf, other

    cs.CL cs.LG

    Impact of Pretraining Term Frequencies on Few-Shot Reasoning

    Authors: Yasaman Razeghi, Robert L. Logan IV, Matt Gardner, Sameer Singh

    Abstract: Pretrained Language Models (LMs) have demonstrated ability to perform numerical reasoning by extrapolating from a few examples in few-shot settings. However, the extent to which this extrapolation relies on robust reasoning is unclear. In this paper, we investigate how well these models reason with terms that are less frequent in the pretraining data. In particular, we examine the correlations bet… ▽ More

    Submitted 23 May, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

  16. arXiv:2112.08688  [pdf, other

    cs.CL

    Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks

    Authors: Akari Asai, Matt Gardner, Hannaneh Hajishirzi

    Abstract: Retrieval-augmented generation models have shown state-of-the-art performance across many knowledge-intensive NLP tasks such as open question answering and fact verification. These models are trained to generate the final output given the retrieved passages, which can be irrelevant to the original query, leading to learning spurious cues or answer memorization. This work introduces a method to inc… ▽ More

    Submitted 14 May, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: Published as a conference paper at NAACL 2022 (long). Code available at https://github.com/AkariAsai/evidentiality_qa

  17. arXiv:2109.10613  [pdf, other

    cs.CL

    COVR: A test-bed for Visually Grounded Compositional Generalization with real images

    Authors: Ben Bogin, Shivanshu Gupta, Matt Gardner, Jonathan Berant

    Abstract: While interest in models that generalize at test time to new compositions has risen in recent years, benchmarks in the visually-grounded domain have thus far been restricted to synthetic images. In this work, we propose COVR, a new test-bed for visually-grounded compositional generalization with real images. To create COVR, we use real images annotated with scene graphs, and propose an almost full… ▽ More

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  18. arXiv:2108.08346  [pdf

    eess.SY

    Permanent Magnet Linear Generator Design for Surface Riding Wave Energy Converters

    Authors: Farid Naghavi, Shrikesh Sheshaprasad, Matthew Gardner, Aghamarshana Meduri, HeonYong Kang, Hamid Toliyat

    Abstract: This paper describes the detailed analysis for the design of a linear generator developed for a Surface Riding Wave Energy Converter (SR-WEC), which was designed to improve energy capture over a wider range of sea states. The study starts with an analysis of the power take-off (PTO) control strategy to harness the maximum output power from given sea states. Passive, reactive, and discrete PTO cont… ▽ More

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: To be published in Energy Conversion Congress and Expo 2021

  19. arXiv:2107.12708  [pdf, other

    cs.CL cs.AI

    QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering and Reading Comprehension

    Authors: Anna Rogers, Matt Gardner, Isabelle Augenstein

    Abstract: Alongside huge volumes of research on deep learning models in NLP in the recent years, there has been also much work on benchmark datasets needed to track modeling progress. Question answering and reading comprehension have been particularly prolific in this regard, with over 80 new datasets appearing in the past two years. This study is the largest survey of the field to date. We provide an overv… ▽ More

    Submitted 19 September, 2022; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: Published in ACM Comput. Surv (2022). This version differs from the final version in that section 7 ("Languages") is not in the main paper rather than the supplementary materials

  20. arXiv:2107.07150  [pdf, other

    cs.CL

    Tailor: Generating and Perturbing Text with Semantic Controls

    Authors: Alexis Ross, Tongshuang Wu, Hao Peng, Matthew E. Peters, Matt Gardner

    Abstract: Controlled text perturbation is useful for evaluating and improving model generalizability. However, current techniques rely on training a model for every target perturbation, which is expensive and hard to generalize. We present Tailor, a semantically-controlled text generation system. Tailor builds on a pretrained seq2seq model and produces textual outputs conditioned on control codes derived fr… ▽ More

    Submitted 17 March, 2022; v1 submitted 15 July, 2021; originally announced July 2021.

  21. arXiv:2107.05833  [pdf, other

    cs.CL

    Enforcing Consistency in Weakly Supervised Semantic Parsing

    Authors: Nitish Gupta, Sameer Singh, Matt Gardner

    Abstract: The predominant challenge in weakly supervised semantic parsing is that of spurious programs that evaluate to correct answers for the wrong reasons. Prior work uses elaborate search strategies to mitigate the prevalence of spurious programs; however, they typically consider only one input at a time. In this work we explore the use of consistency between the output programs for related inputs to re… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

    Comments: Published in ACL 2021

  22. arXiv:2105.03011  [pdf, other

    cs.CL

    A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers

    Authors: Pradeep Dasigi, Kyle Lo, Iz Beltagy, Arman Cohan, Noah A. Smith, Matt Gardner

    Abstract: Readers of academic research papers often read with the goal of answering specific questions. Question Answering systems that can answer those questions can make consumption of the content much more efficient. However, building such tools requires data that reflect the difficulty of the task arising from complex reasoning about claims made in multiple parts of a paper. In contrast, existing inform… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: Accepted at NAACL 2021; Project page: https://allenai.org/project/qasper

  23. arXiv:2104.10034  [pdf, other

    cs.CR

    On Generating and Labeling Network Traffic with Realistic, Self-Propagating Malware

    Authors: Molly Buchanan, Jeffrey W. Collyer, Jack W. Davidson, Saikat Dey, Mark Gardner, Jason D. Hiser, Jeffry Lang, Alastair Nottingham, Alina Oprea

    Abstract: Research and development of techniques which detect or remediate malicious network activity require access to diverse, realistic, contemporary data sets containing labeled malicious connections. In the absence of such data, said techniques cannot be meaningfully trained, tested, and evaluated. Synthetically produced data containing fabricated or merged network traffic is of limited value as it is… ▽ More

    Submitted 27 May, 2022; v1 submitted 20 April, 2021; originally announced April 2021.

    Comments: 4+2 pages, 3 figures, 1 table, for AI4CS-SDM21

  24. arXiv:2104.08758  [pdf, other

    cs.CL cs.AI

    Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled Corpus

    Authors: Jesse Dodge, Maarten Sap, Ana Marasović, William Agnew, Gabriel Ilharco, Dirk Groeneveld, Margaret Mitchell, Matt Gardner

    Abstract: Large language models have led to remarkable progress on many NLP tasks, and researchers are turning to ever-larger text corpora to train them. Some of the largest corpora available are made by scra** significant portions of the internet, and are frequently introduced with only minimal documentation. In this work we provide some of the first documentation for the Colossal Clean Crawled Corpus (C… ▽ More

    Submitted 30 September, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021 accepted paper camera ready version

  25. arXiv:2104.08744  [pdf, other

    cs.CL

    Generative Context Pair Selection for Multi-hop Question Answering

    Authors: Dheeru Dua, Cicero Nogueira dos Santos, Patrick Ng, Ben Athiwaratkun, Bing Xiang, Matt Gardner, Sameer Singh

    Abstract: Compositional reasoning tasks like multi-hop question answering, require making latent decisions to get the final answer, given a question. However, crowdsourced datasets often capture only a slice of the underlying task distribution, which can induce unanticipated biases in models performing compositional reasoning. Furthermore, discriminatively trained models exploit such biases to get a better… ▽ More

    Submitted 18 April, 2021; originally announced April 2021.

  26. arXiv:2104.08735  [pdf, other

    cs.CL

    Learning with Instance Bundles for Reading Comprehension

    Authors: Dheeru Dua, Pradeep Dasigi, Sameer Singh, Matt Gardner

    Abstract: When training most modern reading comprehension models, all the questions associated with a context are treated as being independent from each other. However, closely related questions and their corresponding answers are not independent, and leveraging these relationships could provide a strong supervision signal to a model. Drawing on ideas from contrastive estimation, we introduce several new su… ▽ More

    Submitted 18 April, 2021; originally announced April 2021.

  27. arXiv:2104.08646  [pdf, other

    cs.CL

    Competency Problems: On Finding and Removing Artifacts in Language Data

    Authors: Matt Gardner, William Merrill, Jesse Dodge, Matthew E. Peters, Alexis Ross, Sameer Singh, Noah A. Smith

    Abstract: Much recent work in NLP has documented dataset artifacts, bias, and spurious correlations between input features and output labels. However, how to tell which features have "spurious" instead of legitimate correlations is typically left unspecified. In this work we argue that for complex language understanding tasks, all simple feature correlations are spurious, and we formalize this notion into a… ▽ More

    Submitted 28 December, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021. This version fixes an error in Proposition 1 and adds discussion (the EMNLP camera ready version is unfixed) (and v3 adds the acknowledgements that we forgot to put into v2)

  28. Test beam characterization of sensor prototypes for the CMS Barrel MIP Timing Detector

    Authors: R. Abbott, A. Abreu, F. Addesa, M. Alhusseini, T. Anderson, Y. Andreev, A. Apresyan, R. Arcidiacono, M. Arenton, E. Auffray, D. Bastos, L. A. T. Bauerdick, R. Bellan, M. Bellato, A. Benaglia, M. Benettoni, R. Bertoni, M. Besancon, S. Bharthuar, A. Bornheim, E. Brücken, J. N. Butler, C. Campagnari, M. Campana, R. Carlin , et al. (174 additional authors not shown)

    Abstract: The MIP Timing Detector will provide additional timing capabilities for detection of minimum ionizing particles (MIPs) at CMS during the High Luminosity LHC era, improving event reconstruction and pileup rejection. The central portion of the detector, the Barrel Timing Layer (BTL), will be instrumented with LYSO:Ce crystals and Silicon Photomultipliers (SiPMs) providing a time resolution of about… ▽ More

    Submitted 16 July, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Journal ref: Journal of Instrumentation, Volume 16, July 2021

  29. arXiv:2104.01759  [pdf, other

    cs.CL

    Paired Examples as Indirect Supervision in Latent Decision Models

    Authors: Nitish Gupta, Sameer Singh, Matt Gardner, Dan Roth

    Abstract: Compositional, structured models are appealing because they explicitly decompose problems and provide interpretable intermediate outputs that give confidence that the model is not simply latching onto data artifacts. Learning these models is challenging, however, because end-task supervision only provides a weak indirect signal on what values the latent decisions should take. This often results in… ▽ More

    Submitted 4 April, 2021; originally announced April 2021.

  30. arXiv:2103.12235  [pdf, other

    cs.CL

    Mitigating False-Negative Contexts in Multi-document Question Answering with Retrieval Marginalization

    Authors: Ansong Ni, Matt Gardner, Pradeep Dasigi

    Abstract: Question Answering (QA) tasks requiring information from multiple documents often rely on a retrieval model to identify relevant information for reasoning. The retrieval model is typically trained to maximize the likelihood of the labeled supporting evidence. However, when retrieving from large text corpora such as Wikipedia, the correct answer can often be obtained from multiple evidence candidat… ▽ More

    Submitted 8 September, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: Accepted to EMNLP 2021 (main conference)

  31. arXiv:2011.08115  [pdf, other

    cs.CL

    Learning from Task Descriptions

    Authors: Orion Weller, Nicholas Lourie, Matt Gardner, Matthew E. Peters

    Abstract: Typically, machine learning systems solve new tasks by training on thousands of examples. In contrast, humans can solve new tasks by reading some instructions, with perhaps an example or two. To take a step toward closing this gap, we introduce a framework for develo** NLP systems that solve new tasks after reading their descriptions, synthesizing prior work in this area. We instantiate this fra… ▽ More

    Submitted 16 November, 2020; originally announced November 2020.

    Comments: EMNLP 2020

  32. arXiv:2011.07127  [pdf, other

    cs.CL

    IIRC: A Dataset of Incomplete Information Reading Comprehension Questions

    Authors: James Ferguson, Matt Gardner, Hannaneh Hajishirzi, Tushar Khot, Pradeep Dasigi

    Abstract: Humans often have to read multiple documents to address their information needs. However, most existing reading comprehension (RC) tasks only focus on questions for which the contexts provide all the information required to answer them, thus not evaluating a system's performance at identifying a potential lack of sufficient information and locating sources for that information. To fill this gap, w… ▽ More

    Submitted 13 November, 2020; originally announced November 2020.

    Comments: EMNLP 2020

  33. arXiv:2010.06694  [pdf, other

    cs.HC

    Easy, Reproducible and Quality-Controlled Data Collection with Crowdaq

    Authors: Qiang Ning, Hao Wu, Pradeep Dasigi, Dheeru Dua, Matt Gardner, Robert L. Logan IV, Ana Marasovic, Zhen Nie

    Abstract: High-quality and large-scale data are key to success for AI systems. However, large-scale data annotation efforts are often confronted with a set of common challenges: (1) designing a user-friendly annotation interface; (2) training enough annotators efficiently; and (3) reproducibility. To address these problems, we introduce Crowdaq, an open-source platform that standardizes the data collection… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: Accepted to the demo track of EMNLP 2020

  34. arXiv:2010.06000  [pdf, other

    cs.CV cs.CL

    MedICaT: A Dataset of Medical Images, Captions, and Textual References

    Authors: Sanjay Subramanian, Lucy Lu Wang, Sachin Mehta, Ben Bogin, Madeleine van Zuylen, Sravanthi Parasa, Sameer Singh, Matt Gardner, Hannaneh Hajishirzi

    Abstract: Understanding the relationship between figures and text is key to scientific document understanding. Medical figures in particular are quite complex, often consisting of several subfigures (75% of figures in our dataset), with detailed text describing their content. Previous work studying figures in scientific papers focused on classifying figure content rather than understanding how images relate… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

    Comments: EMNLP-Findings 2020

  35. arXiv:2010.05647  [pdf, other

    cs.CL

    Improving Compositional Generalization in Semantic Parsing

    Authors: Inbar Oren, Jonathan Herzig, Nitish Gupta, Matt Gardner, Jonathan Berant

    Abstract: Generalization of models to out-of-distribution (OOD) data has captured tremendous attention recently. Specifically, compositional generalization, i.e., whether a model generalizes to new structures built of components observed during training, has sparked substantial interest. In this work, we investigate compositional generalization in semantic parsing, a natural test-bed for compositional gener… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

  36. MOCHA: A Dataset for Training and Evaluating Generative Reading Comprehension Metrics

    Authors: Anthony Chen, Gabriel Stanovsky, Sameer Singh, Matt Gardner

    Abstract: Posing reading comprehension as a generation problem provides a great deal of flexibility, allowing for open-ended questions with few restrictions on possible answers. However, progress is impeded by existing generation metrics, which rely on token overlap and are agnostic to the nuances of reading comprehension. To address this, we introduce a benchmark for training and evaluating generative read… ▽ More

    Submitted 15 October, 2020; v1 submitted 7 October, 2020; originally announced October 2020.

    Journal ref: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

  37. arXiv:2009.09363  [pdf, other

    cs.CL

    Understanding Mention Detector-Linker Interaction in Neural Coreference Resolution

    Authors: Zhaofeng Wu, Matt Gardner

    Abstract: Despite significant recent progress in coreference resolution, the quality of current state-of-the-art systems still considerably trails behind human-level performance. Using the CoNLL-2012 and PreCo datasets, we dissect the best instantiation of the mainstream end-to-end coreference resolution model that underlies most current best-performing coreference systems, and empirically analyze the behav… ▽ More

    Submitted 8 September, 2021; v1 submitted 20 September, 2020; originally announced September 2020.

    Comments: CRAC @ EMNLP 2021

  38. arXiv:2009.04959  [pdf

    cond-mat.mtrl-sci

    Quantifying the effect of oxygen on micro-mechanical properties of a near-alpha titanium alloy

    Authors: H. M. Gardner, P. Gopon, C. M. Magazzeni, A. Radecka, K. Fox, D. Rugg, J. Wade, D. E. J. Armstrong, M. P. Moody, P. A. J. Bagot

    Abstract: Atom probe tomography (APT), electron probe microanalysis (EPMA) and nanoindentation were used to characterise the oxygen-rich layer on an in-service jet engine compressor disc, manufactured from the titanium alloy TIMETAL 834. Oxygen ingress was quantified and related to changes in mechanical properties through nanoindentation studies. The relationship between oxygen concentration, microstructure… ▽ More

    Submitted 10 September, 2020; originally announced September 2020.

  39. arXiv:2008.12267  [pdf

    cond-mat.mtrl-sci

    Nanoindentation in multi-modal map combinations: A Correlative Approach to Local Mechanical Property Assessment

    Authors: C. M. Magazzeni, H. M. Gardner, I. Howe, P. Gopon, J. C. Waite, D. Rugg, D. E. J. Armstrong, A. J. Wilkinson

    Abstract: A method is presented for the registration and correlation of intrinsic property maps of materials, including data from nanoindentation hardness, Electron Back-Scattered Diffraction (EBSD), Electron Micro-Probe Analysis (EPMA). This highly spatially resolved method allows for the study of micron-scale microstructural features, and has the capability to rapidly extract correlations between multiple… ▽ More

    Submitted 4 January, 2021; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: 33 pages, 15 figures (9 main body) + graphical abstract, code on Github @XPCorrelate

  40. arXiv:2007.00266  [pdf, other

    cs.CL cs.AI cs.LG

    Latent Compositional Representations Improve Systematic Generalization in Grounded Question Answering

    Authors: Ben Bogin, Sanjay Subramanian, Matt Gardner, Jonathan Berant

    Abstract: Answering questions that involve multi-step reasoning requires decomposing them and using the answers of intermediate steps to reach the final answer. However, state-of-the-art models in grounded question answering often do not explicitly perform decomposition, leading to difficulties in generalization to out-of-distribution examples. In this work, we propose a model that computes a representation… ▽ More

    Submitted 10 November, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: Accepted for publication in Transactions of the Association for Computational Linguistics (TACL), 2020. Author's final version

  41. arXiv:2006.07295  [pdf, other

    math.NA

    Continuous data assimilation applied to a velocity-vorticity formulation of the 2D Navier-Stokes equations

    Authors: Matthew Gardner, Adam Larios, Leo G. Rebholz, Duygu Vargun, Camille Zerfas

    Abstract: We study a continuous data assimilation (CDA) algorithm for a velocity-vorticity formulation of the 2D Navier-Stokes equations in two cases: nudging applied to the velocity and vorticity, and nudging applied to the velocity only. We prove that under a typical finite element spatial discretization and backward Euler temporal discretization, application of CDA preserves the unconditional long-time s… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

    Comments: 28 pages, 8 figures, 2 tables

  42. arXiv:2005.00724  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    Obtaining Faithful Interpretations from Compositional Neural Networks

    Authors: Sanjay Subramanian, Ben Bogin, Nitish Gupta, Tomer Wolfson, Sameer Singh, Jonathan Berant, Matt Gardner

    Abstract: Neural module networks (NMNs) are a popular approach for modeling compositionality: they achieve high accuracy when applied to problems in language and vision, while reflecting the compositional structure of the problem in the network architecture. However, prior work implicitly assumed that the structure of the network modules, describing the abstract reasoning process, provides a faithful explan… ▽ More

    Submitted 8 September, 2020; v1 submitted 2 May, 2020; originally announced May 2020.

    Comments: ACL 2020; first three authors contributed equally

  43. arXiv:2005.00242  [pdf, other

    cs.CL

    TORQUE: A Reading Comprehension Dataset of Temporal Ordering Questions

    Authors: Qiang Ning, Hao Wu, Rujun Han, Nanyun Peng, Matt Gardner, Dan Roth

    Abstract: A critical part of reading is being able to understand the temporal relationships between events described in a passage of text, even when those relationships are not explicitly stated. However, current machine reading comprehension benchmarks have practically no questions that test temporal phenomena, so systems trained on these benchmarks have no capacity to answer questions such as "what happen… ▽ More

    Submitted 5 October, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

    Comments: 15 pages (incl. 4 pages in the appendix); accepted to EMNLP 2020

  44. Multi-Step Inference for Reasoning Over Paragraphs

    Authors: Jiangming Liu, Matt Gardner, Shay B. Cohen, Mirella Lapata

    Abstract: Complex reasoning over text requires understanding and chaining together free-form predicates and logical connectives. Prior work has largely tried to do this either symbolically or with black-box transformers. We present a middle ground between these two extremes: a compositional model reminiscent of neural module networks that can perform chained logical reasoning. This model first finds relevan… ▽ More

    Submitted 7 June, 2021; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: accepted by EMNLP 2020

  45. arXiv:2004.02709  [pdf, other

    cs.CL

    Evaluating Models' Local Decision Boundaries via Contrast Sets

    Authors: Matt Gardner, Yoav Artzi, Victoria Basmova, Jonathan Berant, Ben Bogin, Sihao Chen, Pradeep Dasigi, Dheeru Dua, Yanai Elazar, Ananth Gottumukkala, Nitish Gupta, Hanna Hajishirzi, Gabriel Ilharco, Daniel Khashabi, Kevin Lin, Jiangming Liu, Nelson F. Liu, Phoebe Mulcaire, Qiang Ning, Sameer Singh, Noah A. Smith, Sanjay Subramanian, Reut Tsarfaty, Eric Wallace, Ally Zhang , et al. (1 additional authors not shown)

    Abstract: Standard test sets for supervised learning evaluate in-distribution generalization. Unfortunately, when a dataset has systematic gaps (e.g., annotation artifacts), these evaluations are misleading: a model can learn simple decision rules that perform well on the test set but do not capture a dataset's intended capabilities. We propose a new annotation paradigm for NLP that helps to close systemati… ▽ More

    Submitted 1 October, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

  46. arXiv:2001.11770  [pdf, other

    cs.CL

    Break It Down: A Question Understanding Benchmark

    Authors: Tomer Wolfson, Mor Geva, Ankit Gupta, Matt Gardner, Yoav Goldberg, Daniel Deutch, Jonathan Berant

    Abstract: Understanding natural language questions entails the ability to break down a question into the requisite steps for computing its answer. In this work, we introduce a Question Decomposition Meaning Representation (QDMR) for questions. QDMR constitutes the ordered list of steps, expressed through natural language, that are necessary for answering a question. We develop a crowdsourcing pipeline, show… ▽ More

    Submitted 31 January, 2020; originally announced January 2020.

    Comments: Accepted for publication in Transactions of the Association for Computational Linguistics (TACL), 2020. Author's final version

  47. arXiv:2001.07612  [pdf, other

    eess.SY

    Optimal Dispatch of Electrified Autonomous Mobility on Demand Vehicles during Power Outages

    Authors: Colin Sheppard, Laurel N. Dunn, Sangjae Bae, Max Gardner

    Abstract: The era of fully autonomous, electrified taxi fleets is rapidly approaching, and with it the opportunity to innovate myriad on-demand services that extend beyond the realm of human mobility. This project envisions a future where autonomous plug-in electric vehicle (PEV) fleets can be dispatched as both a taxi service and a source of on-demand power serving customers during power outages. We develo… ▽ More

    Submitted 21 January, 2020; originally announced January 2020.

    Journal ref: 2017 IEEE Power and Energy Society General Meeting

  48. arXiv:1912.12598  [pdf, ps, other

    cs.CL

    ORB: An Open Reading Benchmark for Comprehensive Evaluation of Machine Reading Comprehension

    Authors: Dheeru Dua, Ananth Gottumukkala, Alon Talmor, Sameer Singh, Matt Gardner

    Abstract: Reading comprehension is one of the crucial tasks for furthering research in natural language understanding. A lot of diverse reading comprehension datasets have recently been introduced to study various phenomena in natural language, ranging from simple paraphrase matching and entity ty** to entity tracking and understanding the implications of the context. Given the availability of many such d… ▽ More

    Submitted 29 December, 2019; originally announced December 2019.

  49. arXiv:1912.04971  [pdf, other

    cs.CL

    Neural Module Networks for Reasoning over Text

    Authors: Nitish Gupta, Kevin Lin, Dan Roth, Sameer Singh, Matt Gardner

    Abstract: Answering compositional questions that require multiple steps of reasoning against text is challenging, especially when they involve discrete, symbolic operations. Neural module networks (NMNs) learn to parse such questions as executable programs composed of learnable modules, performing well on synthetic visual QA domains. However, we find that it is challenging to learn these models for non-synt… ▽ More

    Submitted 15 February, 2020; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: Published in ICLR 2020 (International Conference on Learning Representations, 2020)

  50. arXiv:1910.08812  [pdf, other

    cs.CV

    Deep Parametric Indoor Lighting Estimation

    Authors: Marc-André Gardner, Yannick Hold-Geoffroy, Kalyan Sunkavalli, Christian Gagné, Jean-François Lalonde

    Abstract: We present a method to estimate lighting from a single image of an indoor scene. Previous work has used an environment map representation that does not account for the localized nature of indoor lighting. Instead, we represent lighting as a set of discrete 3D lights with geometric and photometric parameters. We train a deep neural network to regress these parameters from a single image, on a datas… ▽ More

    Submitted 19 October, 2019; originally announced October 2019.