Skip to main content

Showing 1–50 of 114 results for author: Collins, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16807  [pdf, other

    cs.LG cs.CL cs.CV

    Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation

    Authors: Katherine M. Collins, Najoung Kim, Yonatan Bitton, Verena Rieser, Shayegan Omidshafiei, Yushi Hu, Sherol Chen, Senjuti Dutta, Minsuk Chang, Kimin Lee, Youwei Liang, Georgina Evans, Sahil Singla, Gang Li, Adrian Weller, Junfeng He, Deepak Ramachandran, Krishnamurthy Dj Dvijotham

    Abstract: Human feedback plays a critical role in learning and refining reward models for text-to-image generation, but the optimal form the feedback should take for learning an accurate reward function has not been conclusively established. This paper investigates the effectiveness of fine-grained feedback which captures nuanced distinctions in image quality and prompt-alignment, compared to traditional co… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2406.04302  [pdf, other

    cs.LG

    Representational Alignment Supports Effective Machine Teaching

    Authors: Ilia Sucholutsky, Katherine M. Collins, Maya Malaviya, Nori Jacoby, Weiyang Liu, Theodore R. Sumers, Michalis Korakakis, Umang Bhatt, Mark Ho, Joshua B. Tenenbaum, Brad Love, Zachary A. Pardos, Adrian Weller, Thomas L. Griffiths

    Abstract: A good teacher should not only be knowledgeable; but should be able to communicate in a way that the student understands -- to share the student's representation of the world. In this work, we integrate insights from machine teaching and pragmatic communication with the burgeoning literature on representational alignment to characterize a utility curve defining a relationship between representatio… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Preprint

  3. arXiv:2406.00179  [pdf, other

    cs.CL cs.AI

    Long-Span Question-Answering: Automatic Question Generation and QA-System Ranking via Side-by-Side Evaluation

    Authors: Bernd Bohnet, Kevin Swersky, Rosanne Liu, Pranjal Awasthi, Azade Nova, Javier Snaider, Hanie Sedghi, Aaron T Parisi, Michael Collins, Angeliki Lazaridou, Orhan Firat, Noah Fiedel

    Abstract: We explore the use of long-context capabilities in large language models to create synthetic reading comprehension data from entire books. Previous efforts to construct such datasets relied on crowd-sourcing, but the emergence of transformers with a context size of 1 million or more tokens now enables entirely automatic approaches. Our objective is to test the capabilities of LLMs to analyze, unde… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  4. arXiv:2404.04186  [pdf, other

    cs.RO

    Probabilistically Informed Robot Object Search with Multiple Regions

    Authors: Matthew Collins, Jared J. Beard, Nicholas Ohi, Yu Gu

    Abstract: The increasing use of autonomous robot systems in hazardous environments underscores the need for efficient search and rescue operations. Despite significant advancements, existing literature on object search often falls short in overcoming the difficulty of long planning horizons and dealing with sensor limitations, such as noise. This study introduces a novel approach that formulates the search… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 6 pages, 7 figures. Submitted to the 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems in Abu Dhabi, UAE (Oct 14-18, 2024)

  5. arXiv:2403.01314  [pdf, other

    cs.NI

    Superflows: A New Tool for Forensic Network Flow Analysis

    Authors: Michael Collins, Jyotirmoy V. Deshmukh, Dristi Dinesh, Mukund Raghothaman, Srivatsan Ravi, Yuan Xia

    Abstract: Network security analysts gather data from diverse sources, from high-level summaries of network flow and traffic volumes to low-level details such as service logs from servers and the contents of individual packets. They validate and check this data against traffic patterns and historical indicators of compromise. Based on the results of this analysis, a decision is made to either automatically m… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  6. arXiv:2402.00559  [pdf, other

    cs.CL

    A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains

    Authors: Alon Jacovi, Yonatan Bitton, Bernd Bohnet, Jonathan Herzig, Or Honovich, Michael Tseng, Michael Collins, Roee Aharoni, Mor Geva

    Abstract: Prompting language models to provide step-by-step answers (e.g., "Chain-of-Thought") is the prominent approach for complex reasoning tasks, where more accurate reasoning chains typically improve downstream task performance. Recent literature discusses automatic methods to verify reasoning to evaluate and improve their correctness. However, no fine-grained step-level datasets are available to enabl… ▽ More

    Submitted 21 May, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL 2024

  7. arXiv:2312.08063  [pdf, other

    cs.LG cs.AI cs.CL

    Estimation of Concept Explanations Should be Uncertainty Aware

    Authors: Vihari Piratla, Juyeon Heo, Katherine M. Collins, Sukriti Singh, Adrian Weller

    Abstract: Model explanations can be valuable for interpreting and debugging predictive models. We study a specific kind called Concept Explanations, where the goal is to interpret a model using human-understandable concepts. Although popular for their easy interpretation, concept explanations are known to be noisy. We begin our work by identifying various sources of uncertainty in the estimation pipeline th… ▽ More

    Submitted 5 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  8. arXiv:2310.17022  [pdf, other

    cs.LG cs.AI cs.CL

    Controlled Decoding from Language Models

    Authors: Sidharth Mudgal, Jong Lee, Harish Ganapathy, YaGuang Li, Tao Wang, Yan** Huang, Zhifeng Chen, Heng-Tze Cheng, Michael Collins, Trevor Strohman, Jilin Chen, Alex Beutel, Ahmad Beirami

    Abstract: KL-regularized reinforcement learning (RL) is a popular alignment framework to control the language model responses towards high reward outcomes. We pose a tokenwise RL objective and propose a modular solver for it, called controlled decoding (CD). CD exerts control through a separate prefix scorer module, which is trained to learn a value function for the reward. The prefix scorer is used at infe… ▽ More

    Submitted 3 June, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: ICML 2024

  9. arXiv:2310.13021  [pdf, other

    q-bio.NC cs.AI

    AI for Mathematics: A Cognitive Science Perspective

    Authors: Cedegao E. Zhang, Katherine M. Collins, Adrian Weller, Joshua B. Tenenbaum

    Abstract: Mathematics is one of the most powerful conceptual systems developed and used by the human species. Dreams of automated mathematicians have a storied history in artificial intelligence (AI). Rapid progress in AI, particularly propelled by advances in large language models (LLMs), has sparked renewed, widespread interest in building such systems. In this work, we reflect on these goals from a \text… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

  10. arXiv:2310.13018  [pdf, other

    q-bio.NC cs.AI cs.LG cs.NE

    Getting aligned on representational alignment

    Authors: Ilia Sucholutsky, Lukas Muttenthaler, Adrian Weller, Andi Peng, Andreea Bobu, Been Kim, Bradley C. Love, Erin Grant, Iris Groen, Jascha Achterberg, Joshua B. Tenenbaum, Katherine M. Collins, Katherine L. Hermann, Kerem Oktar, Klaus Greff, Martin N. Hebart, Nori Jacoby, Qiuyi Zhang, Raja Marjieh, Robert Geirhos, Sherol Chen, Simon Kornblith, Sunayana Rane, Talia Konkle, Thomas P. O'Connell , et al. (5 additional authors not shown)

    Abstract: Biological and artificial information processing systems form representations that they can use to categorize, reason, plan, navigate, and make decisions. How can we measure the extent to which the representations formed by these diverse systems agree? Do similarities in representations then translate into similar behavior? How can a system's representations be modified to better match those of an… ▽ More

    Submitted 2 November, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: Working paper, changes to be made in upcoming revisions

  11. arXiv:2309.16928  [pdf, other

    cs.LG cs.AI

    Learning to Receive Help: Intervention-Aware Concept Embedding Models

    Authors: Mateo Espinosa Zarlenga, Katherine M. Collins, Krishnamurthy Dvijotham, Adrian Weller, Zohreh Shams, Mateja Jamnik

    Abstract: Concept Bottleneck Models (CBMs) tackle the opacity of neural architectures by constructing and explaining their predictions using a set of high-level concepts. A special property of these models is that they permit concept interventions, wherein users can correct mispredicted concepts and thus improve the model's performance. Recent work, however, has shown that intervention efficacy can be highl… ▽ More

    Submitted 25 October, 2023; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: Accepted as a spotlight at the Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)

  12. arXiv:2307.15475  [pdf, other

    cs.HC cs.AI cs.LG

    FeedbackLogs: Recording and Incorporating Stakeholder Feedback into Machine Learning Pipelines

    Authors: Matthew Barker, Emma Kallina, Dhananjay Ashok, Katherine M. Collins, Ashley Casovan, Adrian Weller, Ameet Talwalkar, Valerie Chen, Umang Bhatt

    Abstract: Even though machine learning (ML) pipelines affect an increasing array of stakeholders, there is little work on how input from stakeholders is recorded and incorporated. We propose FeedbackLogs, addenda to existing documentation of ML pipelines, to track the input of multiple stakeholders. Each log records important details about the feedback collection process, the feedback itself, and how the fe… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

  13. arXiv:2306.14325  [pdf, other

    cs.AI cs.LG

    The Neuro-Symbolic Inverse Planning Engine (NIPE): Modeling Probabilistic Social Inferences from Linguistic Inputs

    Authors: Lance Ying, Katherine M. Collins, Megan Wei, Cedegao E. Zhang, Tan Zhi-Xuan, Adrian Weller, Joshua B. Tenenbaum, Lionel Wong

    Abstract: Human beings are social creatures. We routinely reason about other agents, and a crucial component of this social reasoning is inferring people's goals as we learn about their actions. In many settings, we can perform intuitive but reliable goal inference from language descriptions of agents, actions, and the background environments. In this paper, we study this process of language driving and inf… ▽ More

    Submitted 27 June, 2023; v1 submitted 25 June, 2023; originally announced June 2023.

    Comments: To appear at ICML Workshop on Theory of Mind in Communicating Agents

  14. arXiv:2306.08424  [pdf, other

    cs.HC cs.AI cs.LG

    Selective Concept Models: Permitting Stakeholder Customisation at Test-Time

    Authors: Matthew Barker, Katherine M. Collins, Krishnamurthy Dvijotham, Adrian Weller, Umang Bhatt

    Abstract: Concept-based models perform prediction using a set of concepts that are interpretable to stakeholders. However, such models often involve a fixed, large number of concepts, which may place a substantial cognitive load on stakeholders. We propose Selective COncept Models (SCOMs) which make predictions using only a subset of concepts and can be customised by stakeholders at test-time according to t… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

  15. arXiv:2306.01694  [pdf, other

    cs.LG cs.HC

    Evaluating Language Models for Mathematics through Interactions

    Authors: Katherine M. Collins, Albert Q. Jiang, Simon Frieder, Lionel Wong, Miri Zilka, Umang Bhatt, Thomas Lukasiewicz, Yuhuai Wu, Joshua B. Tenenbaum, William Hart, Timothy Gowers, Wenda Li, Adrian Weller, Mateja Jamnik

    Abstract: There is much excitement about the opportunity to harness the power of large language models (LLMs) when building problem-solving assistants. However, the standard methodology of evaluating LLMs relies on static pairs of inputs and outputs, and is insufficient for making an informed decision about which LLMs and under which assistive settings can they be sensibly used. Static assessment fails to a… ▽ More

    Submitted 5 November, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

  16. arXiv:2305.14793  [pdf, other

    cs.CL

    Faithful Low-Resource Data-to-Text Generation through Cycle Training

    Authors: Zhuoer Wang, Marcus Collins, Nikhita Vedula, Simone Filice, Shervin Malmasi, Oleg Rokhlenko

    Abstract: Methods to generate text from structured data have advanced significantly in recent years, primarily due to fine-tuning of pre-trained language models on large datasets. However, such models can fail to produce output faithful to the input data, particularly on out-of-domain data. Sufficient annotated data is often not available for specific domains, leading us to seek an unsupervised approach to… ▽ More

    Submitted 11 July, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: 19 pages, 4 figures, ACL 2023

  17. arXiv:2304.13000  [pdf, other

    cs.CV cs.AI eess.IV

    Segment anything, from space?

    Authors: Simiao Ren, Francesco Luzi, Saad Lahrichi, Kaleb Kassaw, Leslie M. Collins, Kyle Bradbury, Jordan M. Malof

    Abstract: Recently, the first foundation model developed specifically for image segmentation tasks was developed, termed the "Segment Anything Model" (SAM). SAM can segment objects in input imagery based on cheap input prompts, such as one (or more) points, a bounding box, or a mask. The authors examined the \textit{zero-shot} image segmentation accuracy of SAM on a large number of vision benchmark tasks an… ▽ More

    Submitted 9 November, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: Work accepted at WACV 2024, this is only a pre-print, please go to WACV website for the official version

  18. arXiv:2304.07787  [pdf, other

    cs.LG

    Harnessing Digital Pathology And Causal Learning To Improve Eosinophilic Esophagitis Dietary Treatment Assignment

    Authors: Eliel Aknin, Ariel Larey, Julie M. Caldwell, Margaret H. Collins, Juan P. Abonia, Seema S. Aceves, Nicoleta C. Arva, Mirna Chehade, Evan S. Dellon, Nirmala Gonsalves, Sandeep K. Gupta, John Leung, Kathryn A. Peterson, Tetsuo Shoda, Jonathan M. Spergel, Marc E. Rothenberg, Yonatan Savir

    Abstract: Eosinophilic esophagitis (EoE) is a chronic, food antigen-driven, allergic inflammatory condition of the esophagus associated with elevated esophageal eosinophils. EoE is a top cause of chronic dysphagia after GERD. Diagnosis of EoE relies on counting eosinophils in histological slides, a manual and time-consuming task that limits the ability to extract complex patient-dependent features. The trea… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

    Comments: 11 pages, 5 figures

  19. arXiv:2304.06701  [pdf, other

    cs.LG cs.AI cs.CY cs.HC

    Learning Personalized Decision Support Policies

    Authors: Umang Bhatt, Valerie Chen, Katherine M. Collins, Parameswaran Kamalaruban, Emma Kallina, Adrian Weller, Ameet Talwalkar

    Abstract: Individual human decision-makers may benefit from different forms of support to improve decision outcomes, but when each form of support will yield better outcomes? In this work, we posit that personalizing access to decision support tools can be an effective mechanism for instantiating the appropriate use of AI assistance. Specifically, we propose the general problem of learning a decision suppor… ▽ More

    Submitted 27 May, 2024; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: 29 pages, 12 figures

  20. arXiv:2303.12872  [pdf, other

    cs.HC cs.AI cs.LG

    Human Uncertainty in Concept-Based AI Systems

    Authors: Katherine M. Collins, Matthew Barker, Mateo Espinosa Zarlenga, Naveen Raman, Umang Bhatt, Mateja Jamnik, Ilia Sucholutsky, Adrian Weller, Krishnamurthy Dvijotham

    Abstract: Placing a human in the loop may abate the risks of deploying AI systems in safety-critical settings (e.g., a clinician working with a medical AI system). However, mitigating risks arising from human error and uncertainty within such human-AI interactions is an important and understudied issue. In this work, we study human uncertainty in the context of concept-based models, a family of AI systems t… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

  21. arXiv:2301.09044  [pdf, other

    cs.LG

    Learning to Reject with a Fixed Predictor: Application to Decontextualization

    Authors: Christopher Mohri, Daniel Andor, Eunsol Choi, Michael Collins

    Abstract: We study the problem of classification with a reject option for a fixed predictor, applicable in natural language processing. We introduce a new problem formulation for this scenario, and an algorithm minimizing a new surrogate loss function. We provide a complete theoretical analysis of the surrogate loss function with a strong $H$-consistency guarantee. For evaluation, we choose the decontextual… ▽ More

    Submitted 31 January, 2023; v1 submitted 21 January, 2023; originally announced January 2023.

  22. arXiv:2212.12824  [pdf, other

    cs.CV

    Meta-Learning for Color-to-Infrared Cross-Modal Style Transfer

    Authors: Evelyn A. Stump, Francesco Luzi, Leslie M. Collins, Jordan M. Malof

    Abstract: Recent object detection models for infrared (IR) imagery are based upon deep neural networks (DNNs) and require large amounts of labeled training imagery. However, publicly-available datasets that can be used for such training are limited in their size and diversity. To address this problem, we explore cross-modal style transfer (CMST) to leverage large and diverse color imagery datasets so that t… ▽ More

    Submitted 24 December, 2022; originally announced December 2022.

  23. arXiv:2212.08037  [pdf, other

    cs.CL

    Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models

    Authors: Bernd Bohnet, Vinh Q. Tran, Pat Verga, Roee Aharoni, Daniel Andor, Livio Baldini Soares, Massimiliano Ciaramita, Jacob Eisenstein, Kuzman Ganchev, Jonathan Herzig, Kai Hui, Tom Kwiatkowski, Ji Ma, Jianmo Ni, Lierni Sestorain Saralegui, Tal Schuster, William W. Cohen, Michael Collins, Dipanjan Das, Donald Metzler, Slav Petrov, Kellie Webster

    Abstract: Large language models (LLMs) have shown impressive results while requiring little or no direct supervision. Further, there is mounting evidence that LLMs may have potential in information-seeking scenarios. We believe the ability of an LLM to attribute the text that it generates is likely to be crucial in this setting. We formulate and study Attributed QA as a key first step in the development of… ▽ More

    Submitted 10 February, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

  24. Mixture Manifold Networks: A Computationally Efficient Baseline for Inverse Modeling

    Authors: Gregory P. Spell, Simiao Ren, Leslie M. Collins, Jordan M. Malof

    Abstract: We propose and show the efficacy of a new method to address generic inverse problems. Inverse modeling is the task whereby one seeks to determine the control parameters of a natural system that produce a given set of observed measurements. Recent work has shown impressive results using deep learning, but we note that there is a trade-off between model performance and computational time. For some a… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: This paper has been accepted to AAAI 2023; this is not the final version

  25. arXiv:2211.12142  [pdf, other

    cs.CL cs.AI

    Coreference Resolution through a seq2seq Transition-Based System

    Authors: Bernd Bohnet, Chris Alberti, Michael Collins

    Abstract: Most recent coreference resolution systems use search algorithms over possible spans to identify mentions and resolve coreference. We instead present a coreference resolution system that uses a text-to-text (seq2seq) paradigm to predict mentions and links jointly. We implement the coreference system as a transition system and use multilingual T5 as an underlying language model. We obtain state-of-… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

  26. arXiv:2211.10844  [pdf, other

    cs.LG cs.CR cs.CV

    Learning to Generate Image Embeddings with User-level Differential Privacy

    Authors: Zheng Xu, Maxwell Collins, Yuxiao Wang, Liviu Panait, Sewoong Oh, Sean Augenstein, Ting Liu, Florian Schroff, H. Brendan McMahan

    Abstract: Small on-device models have been successfully trained with user-level differential privacy (DP) for next word prediction and image classification tasks in the past. However, existing methods can fail when directly applied to learn embedding models using supervised training data with a large class space. To achieve user-level DP for large image-to-embedding feature extractors, we propose DP-FedEmb,… ▽ More

    Submitted 31 March, 2023; v1 submitted 19 November, 2022; originally announced November 2022.

    Comments: CVPR camera ready. Addressed reviewer comments. Switched from add-or-remove-one DP to substitute-one DP

  27. arXiv:2211.09070  [pdf, other

    cs.CL

    Towards Computationally Verifiable Semantic Grounding for Language Models

    Authors: Chris Alberti, Kuzman Ganchev, Michael Collins, Sebastian Gehrmann, Ciprian Chelba

    Abstract: The paper presents an approach to semantic grounding of language models (LMs) that conceptualizes the LM as a conditional model generating text given a desired semantic message formalized as a set of entity-relationship triples. It embeds the LM in an auto-encoder by feeding its output to a semantic parser whose output is in the same representation domain as the input message. Compared to a baseli… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

  28. arXiv:2211.01407  [pdf, other

    cs.LG cs.AI

    On the Informativeness of Supervision Signals

    Authors: Ilia Sucholutsky, Ruairidh M. Battleday, Katherine M. Collins, Raja Marjieh, Joshua C. Peterson, Pulkit Singh, Umang Bhatt, Nori Jacoby, Adrian Weller, Thomas L. Griffiths

    Abstract: Supervised learning typically focuses on learning transferable representations from training examples annotated by humans. While rich annotations (like soft labels) carry more information than sparse annotations (like hard labels), they are also more expensive to collect. For example, while hard labels only provide information about the closest class an object belongs to (e.g., "this is a dog"), s… ▽ More

    Submitted 4 July, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Proceedings of UAI 2023

  29. arXiv:2211.01202  [pdf, other

    cs.LG cs.CV cs.HC

    Human-in-the-Loop Mixup

    Authors: Katherine M. Collins, Umang Bhatt, Weiyang Liu, Vihari Piratla, Ilia Sucholutsky, Bradley Love, Adrian Weller

    Abstract: Aligning model representations to humans has been found to improve robustness and generalization. However, such methods often focus on standard observational data. Synthetic data is proliferating and powering many advances in machine learning; yet, it is not always clear whether synthetic labels are perceptually aligned to humans -- rendering it likely model representations are not human aligned.… ▽ More

    Submitted 30 July, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

  30. arXiv:2210.17525  [pdf, ps, other

    cs.CL

    Query Refinement Prompts for Closed-Book Long-Form Question Answering

    Authors: Reinald Kim Amplayo, Kellie Webster, Michael Collins, Dipanjan Das, Shashi Narayan

    Abstract: Large language models (LLMs) have been shown to perform well in answering questions and in producing long-form texts, both in few-shot closed-book settings. While the former can be validated using well-known evaluation metrics, the latter is difficult to evaluate. We resolve the difficulties to evaluate long-form output by doing both tasks at once -- to do question answering that requires long-for… ▽ More

    Submitted 31 October, 2022; originally announced October 2022.

  31. arXiv:2210.09428  [pdf, other

    cs.CL

    Improving Low-Resource Cross-lingual Parsing with Expected Statistic Regularization

    Authors: Thomas Effland, Michael Collins

    Abstract: We present Expected Statistic Regularization (ESR), a novel regularization technique that utilizes low-order multi-task structural statistics to shape model distributions for semi-supervised learning on low-resource datasets. We study ESR in the context of cross-lingual transfer for syntactic analysis (POS tagging and labeled dependency parsing) and present several classes of low-order statistic f… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: Accepted in TACL 2022, pre-MIT Press publication version

  32. arXiv:2210.02498  [pdf, other

    cs.CL cs.LG

    Honest Students from Untrusted Teachers: Learning an Interpretable Question-Answering Pipeline from a Pretrained Language Model

    Authors: Jacob Eisenstein, Daniel Andor, Bernd Bohnet, Michael Collins, David Mimno

    Abstract: Explainable question answering systems should produce not only accurate answers but also rationales that justify their reasoning and allow humans to check their work. But what sorts of rationales are useful and how can we train systems to produce them? We propose a new style of rationale for open-book question answering, called \emph{markup-and-mask}, which combines aspects of extractive and free-… ▽ More

    Submitted 24 April, 2024; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: added details about a human evaluation

  33. arXiv:2209.08685  [pdf, other

    cs.CV

    Meta-simulation for the Automated Design of Synthetic Overhead Imagery

    Authors: Handi Yu, Simiao Ren, Leslie M. Collins, Jordan M. Malof

    Abstract: The use of synthetic (or simulated) data for training machine learning models has grown rapidly in recent years. Synthetic data can often be generated much faster and more cheaply than its real-world counterpart. One challenge of using synthetic imagery however is scene design: e.g., the choice of content and its features and spatial arrangement. To be effective, this design must not only be reali… ▽ More

    Submitted 26 October, 2022; v1 submitted 18 September, 2022; originally announced September 2022.

  34. arXiv:2207.04044  [pdf, other

    cs.CV

    kMaX-DeepLab: k-means Mask Transformer

    Authors: Qihang Yu, Huiyu Wang, Siyuan Qiao, Maxwell Collins, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh Chen

    Abstract: The rise of transformers in vision tasks not only advances network backbone designs, but also starts a brand-new page to achieve end-to-end image recognition (e.g., object detection and panoptic segmentation). Originated from Natural Language Processing (NLP), transformer architectures, consisting of self-attention and cross-attention, effectively learn long-range interactions between elements in… ▽ More

    Submitted 10 July, 2023; v1 submitted 8 July, 2022; originally announced July 2022.

    Comments: ECCV 2022. arXiv v2: add results on ADE20K. arXiv v3: fix appendix. v4: fix typo. v5: add PyTorch re-implementation. Codes and models are available at TensorFlow: https://github.com/google-research/deeplab2 PyTorch: https://github.com/bytedance/kmax-deeplab

  35. arXiv:2207.00810  [pdf, other

    cs.LG cs.AI cs.CY cs.HC

    Eliciting and Learning with Soft Labels from Every Annotator

    Authors: Katherine M. Collins, Umang Bhatt, Adrian Weller

    Abstract: The labels used to train machine learning (ML) models are of paramount importance. Typically for ML classification tasks, datasets contain hard labels, yet learning using soft labels has been shown to yield benefits for model generalization, robustness, and calibration. Earlier work found success in forming soft labels from multiple annotators' hard labels; however, this approach may not converge… ▽ More

    Submitted 29 August, 2022; v1 submitted 2 July, 2022; originally announced July 2022.

    Comments: Accepted as a Full Paper at the 2022 AAAI Conference on Human Computation and Crowdsourcing

    Journal ref: Proceedings of the AAAI Conference on Human Computation and Crowdsourcing. Vol. 10. 2022

  36. arXiv:2206.12409  [pdf, ps, other

    cs.CE

    A hybrid volume-surface integral equation method for rapid electromagnetic simulations in MRI

    Authors: Ilias I. Giannakopoulos, Georgy D. Guryev, José E. C. Serrallés, Jan Paška, Bei Zhang, Luca Daniel, Jacob K. White, Christopher M. Collins, Riccardo Lattanzi

    Abstract: Objective: We developed a hybrid volume surface integral equation (VSIE) method based on domain decomposition to perform fast and accurate magnetic resonance imaging (MRI) simulations that include both remote and local conductive elements. Methods: We separated the conductive surfaces present in MRI setups into two domains and optimized electromagnetic (EM) modeling for each case. Specifically, in… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: 10 pages, 6 figures

  37. arXiv:2206.08948  [pdf, other

    cs.CV

    CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation

    Authors: Qihang Yu, Huiyu Wang, Dahun Kim, Siyuan Qiao, Maxwell Collins, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh Chen

    Abstract: We propose Clustering Mask Transformer (CMT-DeepLab), a transformer-based framework for panoptic segmentation designed around clustering. It rethinks the existing transformer architectures used in segmentation and detection; CMT-DeepLab considers the object queries as cluster centers, which fill the role of grou** the pixels when applied to segmentation. The clustering is computed with an altern… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: CVPR 2022 Oral

  38. arXiv:2205.13583  [pdf

    cs.AI cs.CV q-bio.QM

    Harnessing Artificial Intelligence to Infer Novel Spatial Biomarkers for the Diagnosis of Eosinophilic Esophagitis

    Authors: Ariel Larey, Eliel Aknin, Nati Daniel, Garrett A. Osswald, Julie M. Caldwell, Mark Rochman, Tanya Wasserman, Margaret H. Collins, Nicoleta C. Arva, Guang-Yu Yang, Marc E. Rothenberg, Yonatan Savir

    Abstract: Eosinophilic esophagitis (EoE) is a chronic allergic inflammatory condition of the esophagus associated with elevated esophageal eosinophils. Second only to gastroesophageal reflux disease, EoE is one of the leading causes of chronic refractory dysphagia in adults and children. EoE diagnosis requires enumerating the density of esophageal eosinophils in esophageal biopsies, a somewhat subjective ta… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: AL, EA, and ND have contributed equally to this work and share first authorship. YS is the corresponding author, e-mail: [email protected]

  39. arXiv:2205.05718  [pdf, other

    cs.CL cs.AI cs.LG cs.SC

    Structured, flexible, and robust: benchmarking and improving large language models towards more human-like behavior in out-of-distribution reasoning tasks

    Authors: Katherine M. Collins, Catherine Wong, Jiahai Feng, Megan Wei, Joshua B. Tenenbaum

    Abstract: Human language offers a powerful window into our thoughts -- we tell stories, give explanations, and express our beliefs and goals through words. Abundant evidence also suggests that language plays a developmental role in structuring our learning. Here, we ask: how much of human-like thinking can be captured by learning statistical patterns in language alone? We first contribute a new challenge be… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: Originally accepted to the 2022 Cognitive Science (CogSci) conference

  40. arXiv:2203.15108  [pdf, other

    cs.CL

    A Well-Composed Text is Half Done! Composition Sampling for Diverse Conditional Generation

    Authors: Shashi Narayan, Gonçalo Simões, Yao Zhao, Joshua Maynez, Dipanjan Das, Michael Collins, Mirella Lapata

    Abstract: We propose Composition Sampling, a simple but effective method to generate diverse outputs for conditional generation of higher quality compared to previous stochastic decoding strategies. It builds on recently proposed plan-based neural generation models (Narayan et al, 2021) that are trained to first create a composition of the output and then generate by conditioning on it and the input. Our ap… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: 21 pages, ACL 2022

  41. arXiv:2112.12870  [pdf, other

    cs.CL

    Measuring Attribution in Natural Language Generation Models

    Authors: Hannah Rashkin, Vitaly Nikolaev, Matthew Lamm, Lora Aroyo, Michael Collins, Dipanjan Das, Slav Petrov, Gaurav Singh Tomar, Iulia Turc, David Reitter

    Abstract: With recent improvements in natural language generation (NLG) models for various applications, it has become imperative to have the means to identify and evaluate whether NLG output is only sharing verifiable information about the external world. In this work, we present a new evaluation framework entitled Attributable to Identified Sources (AIS) for assessing the output of natural language genera… ▽ More

    Submitted 2 August, 2022; v1 submitted 23 December, 2021; originally announced December 2021.

  42. arXiv:2111.06387  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Learning Signal-Agnostic Manifolds of Neural Fields

    Authors: Yilun Du, Katherine M. Collins, Joshua B. Tenenbaum, Vincent Sitzmann

    Abstract: Deep neural networks have been used widely to learn the latent structure of datasets, across modalities such as images, shapes, and audio signals. However, existing models are generally modality-dependent, requiring custom architectures and objectives to process different classes of signals. We leverage neural fields to capture the underlying structure in image, shape, audio and cross-modal audiov… ▽ More

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021, additional results and code at https://yilundu.github.io/gem/

  43. arXiv:2108.07216  [pdf, other

    cs.CL

    Partially Supervised Named Entity Recognition via the Expected Entity Ratio Loss

    Authors: Thomas Effland, Michael Collins

    Abstract: We study learning named entity recognizers in the presence of missing entity annotations. We approach this setting as tagging with latent variables and propose a novel loss, the Expected Entity Ratio, to learn models in the presence of systematically missing tags. We show that our approach is both theoretically sound and empirically useful. Experimentally, we find that it meets or exceeds performa… ▽ More

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: Accepted in TACL 2021, pre-MIT Press publication version

  44. arXiv:2108.05929  [pdf

    cs.SD eess.AS

    Parameter Tuning of Time-Frequency Masking Algorithms for Reverberant Artifact Removal within the Cochlear Implant Stimulus

    Authors: Lidea K. Shahidi, Leslie M. Collins, Boyla O. Mainsah

    Abstract: Cochlear implant users struggle to understand speech in reverberant environments. To restore speech perception, artifacts dominated by reverberant reflections can be removed from the cochlear implant stimulus. Artifacts can be identified and removed by applying a matrix of gain values, a technique referred to as time-frequency masking. Gain values are determined by an oracle algorithm that uses kn… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

    Comments: 5 pages, 4 figures

  45. arXiv:2108.02189  [pdf, other

    cs.CL

    A Biologically Plausible Parser

    Authors: Daniel Mitropolsky, Michael J. Collins, Christos H. Papadimitriou

    Abstract: We describe a parser of English effectuated by biologically plausible neurons and synapses, and implemented through the Assembly Calculus, a recently proposed computational framework for cognitive function. We demonstrate that this device is capable of correctly parsing reasonably nontrivial sentences. While our experiments entail rather simple sentences in English, our results suggest that the pa… ▽ More

    Submitted 4 August, 2021; originally announced August 2021.

  46. arXiv:2107.06393  [pdf, other

    cs.CV cs.AI cs.LG

    Hybrid Memoised Wake-Sleep: Approximate Inference at the Discrete-Continuous Interface

    Authors: Tuan Anh Le, Katherine M. Collins, Luke Hewitt, Kevin Ellis, N. Siddharth, Samuel J. Gershman, Joshua B. Tenenbaum

    Abstract: Modeling complex phenomena typically involves the use of both discrete and continuous variables. Such a setting applies across a wide range of problems, from identifying trends in time-series data to performing effective compositional scene understanding in images. Here, we propose Hybrid Memoised Wake-Sleep (HMWS), an algorithm for effective inference in such hybrid discrete-continuous models. Pr… ▽ More

    Submitted 20 April, 2022; v1 submitted 3 July, 2021; originally announced July 2021.

    Journal ref: ICLR 2022

  47. arXiv:2106.09748  [pdf, other

    cs.CV

    DeepLab2: A TensorFlow Library for Deep Labeling

    Authors: Mark Weber, Huiyu Wang, Siyuan Qiao, Jun Xie, Maxwell D. Collins, Yukun Zhu, Liangzhe Yuan, Dahun Kim, Qihang Yu, Daniel Cremers, Laura Leal-Taixe, Alan L. Yuille, Florian Schroff, Hartwig Adam, Liang-Chieh Chen

    Abstract: DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a state-of-the-art and easy-to-use TensorFlow codebase for general dense pixel prediction problems in computer vision. DeepLab2 includes all our recently developed DeepLab model variants with pretrained checkpoints as well as model training and evaluation code, allowing the community to reproduce and further improve upon the sta… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 4-page technical report. The first three authors contributed equally to this work

  48. arXiv:2105.14135  [pdf

    eess.AS cs.SD

    Phoneme-Based Ratio Mask Estimation for Reverberant Speech Enhancement in Cochlear Implant Processors

    Authors: Kevin M. Chu, Leslie M. Collins, Boyla O. Mainsah

    Abstract: Cochlear implant (CI) users have considerable difficulty in understanding speech in reverberant listening environments. Time-frequency (T-F) masking is a common technique that aims to improve speech intelligibility by multiplying reverberant speech by a matrix of gain values to suppress T-F bins dominated by reverberation. Recently proposed mask estimation algorithms leverage machine learning appr… ▽ More

    Submitted 28 May, 2021; originally announced May 2021.

  49. arXiv:2105.14120  [pdf

    eess.AS cs.SD

    Assessing the intelligibility of vocoded speech using a remote testing framework

    Authors: Kevin M. Chu, Leslie M. Collins, Boyla O. Mainsah

    Abstract: Over the past year, remote speech intelligibility testing has become a popular and necessary alternative to traditional in-person experiments due to the need for physical distancing during the COVID-19 pandemic. A remote framework was developed for conducting speech intelligibility tests with normal hearing listeners. In this study, subjects used their personal computers to complete sentence recog… ▽ More

    Submitted 28 May, 2021; originally announced May 2021.

  50. arXiv:2105.09966  [pdf, other

    astro-ph.EP cs.LG

    On planetary systems as ordered sequences

    Authors: Emily Sandford, David Kip**, Michael Collins

    Abstract: A planetary system consists of a host star and one or more planets, arranged into a particular configuration. Here, we consider what information belongs to the configuration, or ordering, of 4286 Kepler planets in their 3277 planetary systems. First, we train a neural network model to predict the radius and period of a planet based on the properties of its host star and the radii and period of its… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

    Comments: 25 pages, 19 figures, accepted to MNRAS