Skip to main content

Showing 1–11 of 11 results for author: Lelkes, Á D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.04886  [pdf, other

    cs.CL cs.AI cs.LG

    SEMQA: Semi-Extractive Multi-Source Question Answering

    Authors: Tal Schuster, Adam D. Lelkes, Haitian Sun, Jai Gupta, Jonathan Berant, William W. Cohen, Donald Metzler

    Abstract: Recently proposed long-form question answering (QA) systems, supported by large language models (LLMs), have shown promising capabilities. Yet, attributing and verifying their generated abstractive answers can be difficult, and automatically evaluating their accuracy remains an ongoing challenge. In this work, we introduce a new QA task for answering multi-answer questions by summarizing multipl… ▽ More

    Submitted 30 June, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: NAACL 2024

  2. arXiv:2310.18431  [pdf, other

    cs.CL

    SDOH-NLI: a Dataset for Inferring Social Determinants of Health from Clinical Notes

    Authors: Adam D. Lelkes, Eric Loreaux, Tal Schuster, Ming-Jun Chen, Alvin Rajkomar

    Abstract: Social and behavioral determinants of health (SDOH) play a significant role in sha** health outcomes, and extracting these determinants from clinical notes is a first step to help healthcare providers systematically identify opportunities to provide appropriate care and address disparities. Progress on using NLP methods for this task has been hindered by the lack of high-quality publicly availab… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP 2023

  3. arXiv:2305.11841  [pdf, other

    cs.IR cs.CL

    How Does Generative Retrieval Scale to Millions of Passages?

    Authors: Ronak Pradeep, Kai Hui, Jai Gupta, Adam D. Lelkes, Honglei Zhuang, Jimmy Lin, Donald Metzler, Vinh Q. Tran

    Abstract: Popularized by the Differentiable Search Index, the emerging paradigm of generative retrieval re-frames the classic information retrieval problem into a sequence-to-sequence modeling task, forgoing external indices and encoding an entire document corpus within a single Transformer. Although many different approaches have been proposed to improve the effectiveness of generative retrieval, they have… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  4. arXiv:2211.10828  [pdf, other

    cs.LG cs.AI

    Instability in clinical risk stratification models using deep learning

    Authors: Daniel Lopez-Martinez, Alex Yakubovich, Martin Seneviratne, Adam D. Lelkes, Akshit Tyagi, Jonas Kemp, Ethan Steinberg, N. Lance Downing, Ron C. Li, Keith E. Morse, Nigam H. Shah, Ming-Jun Chen

    Abstract: While it has been well known in the ML community that deep learning models suffer from instability, the consequences for healthcare deployments are under characterised. We study the stability of different model architectures trained on electronic health records, using a set of outpatient prediction tasks as a case study. We show that repeated training runs of the same deep learning model on the sa… ▽ More

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: Accepted for publication in Machine Learning for Health (ML4H) 2022

  5. arXiv:2205.10744  [pdf, other

    cs.CL cs.LG

    All Birds with One Stone: Multi-task Text Classification for Efficient Inference with One Forward Pass

    Authors: Jiaxin Huang, Tianqi Liu, Jialu Liu, Adam D. Lelkes, Cong Yu, Jiawei Han

    Abstract: Multi-Task Learning (MTL) models have shown their robustness, effectiveness, and efficiency for transferring learned knowledge across tasks. In real industrial applications such as web content classification, multiple classification tasks are predicted from the same input text such as a web article. However, at the serving time, the existing multitask transformer models such as prompt or adaptor b… ▽ More

    Submitted 22 May, 2022; originally announced May 2022.

  6. arXiv:2106.02278  [pdf, other

    cs.CL

    AgreeSum: Agreement-Oriented Multi-Document Summarization

    Authors: Richard Yuanzhe Pang, Adam D. Lelkes, Vinh Q. Tran, Cong Yu

    Abstract: We aim to renew interest in a particular multi-document summarization (MDS) task which we call AgreeSum: agreement-oriented multi-document summarization. Given a cluster of articles, the goal is to provide abstractive summaries that represent information common and faithful to all input articles. Given the lack of existing datasets, we create a dataset for AgreeSum, and provide annotations on arti… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: Findings of ACL 2021

  7. arXiv:2102.09094  [pdf, other

    cs.CL cs.CY cs.LG

    Quiz-Style Question Generation for News Stories

    Authors: Adam D. Lelkes, Vinh Q. Tran, Cong Yu

    Abstract: A large majority of American adults get at least some of their news from the Internet. Even though many online news products have the goal of informing their users about the news, they lack scalable and reliable tools for measuring how well they are achieving this goal, and therefore have to resort to noisy proxy metrics (e.g., click-through rates or reading time) to track their performance. As… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

  8. arXiv:1601.05764  [pdf, other

    cs.LG cs.CY

    A Confidence-Based Approach for Balancing Fairness and Accuracy

    Authors: Benjamin Fish, Jeremy Kun, Ádám D. Lelkes

    Abstract: We study three classical machine learning algorithms in the context of algorithmic fairness: adaptive boosting, support vector machines, and logistic regression. Our goal is to maintain the high accuracy of these learning algorithms while reducing the degree to which they discriminate against individuals because of their membership in a protected group. Our first contribution is a method for ach… ▽ More

    Submitted 21 January, 2016; originally announced January 2016.

  9. arXiv:1411.3640  [pdf, other

    cs.DM

    Network installation and recovery: approximation lower bounds and faster exact formulations

    Authors: Alexander Gutfraind, Jeremy Kun, Ádám D. Lelkes, Lev Reyzin

    Abstract: We study the Neighbor Aided Network Installation Problem (NANIP) introduced previously which asks for a minimal cost ordering of the vertices of a graph, where the cost of visiting a node is a function of the number of neighbors that have already been visited. This problem has applications in resource management and disaster recovery. In this paper we analyze the computational hardness of NANIP. I… ▽ More

    Submitted 13 November, 2014; originally announced November 2014.

  10. arXiv:1410.0245  [pdf, other

    cs.CC cs.DC

    On the Computational Complexity of MapReduce

    Authors: Benjamin Fish, Jeremy Kun, Ádám Dániel Lelkes, Lev Reyzin, György Turán

    Abstract: In this paper we study MapReduce computations from a complexity-theoretic perspective. First, we formulate a uniform version of the MRC model of Karloff et al. (2010). We then show that the class of regular languages, and moreover all of sublogarithmic space, lies in constant round MRC. This result also applies to the MPC model of Andoni et al. (2014). In addition, we prove that, conditioned on a… ▽ More

    Submitted 6 October, 2015; v1 submitted 1 October, 2014; originally announced October 2014.

  11. arXiv:1406.0017  [pdf, ps, other

    cs.FL cs.DM

    Biclique coverings, rectifier networks and the cost of $\varepsilon$-removal

    Authors: Szabolcs Iván, Ádám Dániel Lelkes, Judit Nagy-György, Balázs Szörényi, György Turán

    Abstract: We relate two complexity notions of bipartite graphs: the minimal weight biclique covering number $\mathrm{Cov}(G)$ and the minimal rectifier network size $\mathrm{Rect}(G)$ of a bipartite graph $G$. We show that there exist graphs with $\mathrm{Cov}(G)\geq \mathrm{Rect}(G)^{3/2-ε}$. As a corollary, we establish that there exist nondeterministic finite automata (NFAs) with $\varepsilon$-transition… ▽ More

    Submitted 30 May, 2014; originally announced June 2014.

    Comments: 12 pages, to appear in proceedings of DCFS 2014: 16th International Conference on Descriptional Complexity of Finite-State Systems

    MSC Class: 68R10 ACM Class: G.2.2; F.1.1