Skip to main content

Showing 1–50 of 64 results for author: Cardie, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.05087  [pdf, other

    cs.IR

    Corpus Poisoning via Approximate Greedy Gradient Descent

    Authors: **yan Su, John X. Morris, Preslav Nakov, Claire Cardie

    Abstract: Dense retrievers are widely used in information retrieval and have also been successfully extended to other knowledge intensive areas such as language models, e.g., Retrieval-Augmented Generation (RAG) systems. Unfortunately, they have recently been shown to be vulnerable to corpus poisoning attacks in which a malicious user injects a small fraction of adversarial passages into the retrieval corpu… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2405.01470  [pdf, other

    cs.CL

    WildChat: 1M ChatGPT Interaction Logs in the Wild

    Authors: Wenting Zhao, Xiang Ren, Jack Hessel, Claire Cardie, Ye** Choi, Yuntian Deng

    Abstract: Chatbots such as GPT-4 and ChatGPT are now serving millions of users. Despite their widespread use, there remains a lack of public datasets showcasing how these tools are used by a population of users in practice. To bridge this gap, we offered free access to ChatGPT for online users in exchange for their affirmative, consensual opt-in to anonymously collect their chat transcripts and request head… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: accepted by ICLR 2024

  3. arXiv:2311.04917  [pdf, other

    cs.CL cs.AI

    Adapting Fake News Detection to the Era of Large Language Models

    Authors: **yan Su, Claire Cardie, Preslav Nakov

    Abstract: In the age of large language models (LLMs) and the widespread adoption of AI-driven content creation, the landscape of information dissemination has witnessed a paradigm shift. With the proliferation of both human-written and machine-generated real and fake news, robustly and effectively discerning the veracity of news articles has become an intricate challenge. While substantial research has been… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: Accept to NAACL 2024 Findings

  4. arXiv:2310.15316  [pdf, other

    cs.CL

    Probing Representations for Document-level Event Extraction

    Authors: Barry Wang, Xinya Du, Claire Cardie

    Abstract: The probing classifiers framework has been employed for interpreting deep neural network models for a variety of natural language processing (NLP) applications. Studies, however, have largely focused on sentencelevel NLP tasks. This work is the first to apply the probing paradigm to representations learned for document-level information extraction (IE). We designed eight embedding probes to analyz… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: To appear in EMNLP 2023 Findings

  5. arXiv:2310.04407  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Policy-Gradient Training of Language Models for Ranking

    Authors: Ge Gao, Jonathan D. Chang, Claire Cardie, Kianté Brantley, Thorsten Joachim

    Abstract: Text retrieval plays a crucial role in incorporating factual knowledge for decision making into language processing pipelines, ranging from chat-based web search to question answering systems. Current state-of-the-art text retrieval models leverage pre-trained large language models (LLMs) to achieve competitive performance, but training LLM-based retrievers via typical contrastive losses requires… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

  6. arXiv:2305.14618  [pdf, other

    cs.CL cs.AI

    Abductive Commonsense Reasoning Exploiting Mutually Exclusive Explanations

    Authors: Wenting Zhao, Justin T. Chiu, Claire Cardie, Alexander M. Rush

    Abstract: Abductive reasoning aims to find plausible explanations for an event. This style of reasoning is critical for commonsense tasks where there are often multiple plausible explanations. Existing approaches for abductive reasoning in natural language processing (NLP) often rely on manually generated annotations for supervision; however, such annotations can be subjective and biased. Instead of using d… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: accepted at ACL'23

  7. arXiv:2305.14237  [pdf, ps, other

    cs.CL cs.AI

    HOP, UNION, GENERATE: Explainable Multi-hop Reasoning without Rationale Supervision

    Authors: Wenting Zhao, Justin T. Chiu, Claire Cardie, Alexander M. Rush

    Abstract: Explainable multi-hop question answering (QA) not only predicts answers but also identifies rationales, i. e. subsets of input sentences used to derive the answers. This problem has been extensively studied under the supervised setting, where both answer and rationale annotations are given. Because rationale annotations are expensive to collect and not always available, recent efforts have been de… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  8. arXiv:2305.02360  [pdf, other

    cs.CV cs.AI

    Fashionpedia-Ads: Do Your Favorite Advertisements Reveal Your Fashion Taste?

    Authors: Mengyun Shi, Claire Cardie, Serge Belongie

    Abstract: Consumers are exposed to advertisements across many different domains on the internet, such as fashion, beauty, car, food, and others. On the other hand, fashion represents second highest e-commerce shop** category. Does consumer digital record behavior on various fashion ad images reveal their fashion taste? Does ads from other domains infer their fashion taste as well? In this paper, we study… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

  9. arXiv:2305.02307  [pdf, other

    cs.CV cs.AI cs.DB

    Fashionpedia-Taste: A Dataset towards Explaining Human Fashion Taste

    Authors: Mengyun Shi, Serge Belongie, Claire Cardie

    Abstract: Existing fashion datasets do not consider the multi-facts that cause a consumer to like or dislike a fashion image. Even two consumers like a same fashion image, they could like this image for total different reasons. In this paper, we study the reason why a consumer like a certain fashion image. Towards this goal, we introduce an interpretability dataset, Fashionpedia-taste, consist of rich annot… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

  10. Automatic Error Analysis for Document-level Information Extraction

    Authors: Aliva Das, Xinya Du, Barry Wang, Kejian Shi, Jiayuan Gu, Thomas Porter, Claire Cardie

    Abstract: Document-level information extraction (IE) tasks have recently begun to be revisited in earnest using the end-to-end neural network techniques that have been successful on their sentence-level IE counterparts. Evaluation of the approaches, however, has been limited in a number of dimensions. In particular, the precision/recall/F1 scores typically reported provide few insights on the range of error… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: Accepted to ACL 2022 Main Conference. First three authors contributed equally to this work

    Journal ref: Automatic Error Analysis for Document-level Information Extraction (Das et al., ACL 2022)

  11. arXiv:2205.02068  [pdf, other

    cs.CL

    Compositional Task-Oriented Parsing as Abstractive Question Answering

    Authors: Wenting Zhao, Konstantine Arkoudas, Weiqi Sun, Claire Cardie

    Abstract: Task-oriented parsing (TOP) aims to convert natural language into machine-readable representations of specific tasks, such as setting an alarm. A popular approach to TOP is to apply seq2seq models to generate linearized parse trees. A more recent line of work argues that pretrained seq2seq models are better at generating outputs that are themselves natural language, so they replace linearized pars… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

    Comments: accepted at NAACL'22

  12. arXiv:2203.12119  [pdf, other

    cs.CV

    Visual Prompt Tuning

    Authors: Menglin Jia, Luming Tang, Bor-Chun Chen, Claire Cardie, Serge Belongie, Bharath Hariharan, Ser-Nam Lim

    Abstract: The current modus operandi in adapting pre-trained models involves updating all the backbone parameters, ie, full fine-tuning. This paper introduces Visual Prompt Tuning (VPT) as an efficient and effective alternative to full fine-tuning for large-scale Transformer models in vision. Taking inspiration from recent advances in efficiently tuning large language models, VPT introduces only a small amo… ▽ More

    Submitted 20 July, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: ECCV2022

  13. arXiv:2112.08459  [pdf, other

    cs.CV

    Rethinking Nearest Neighbors for Visual Classification

    Authors: Menglin Jia, Bor-Chun Chen, Zuxuan Wu, Claire Cardie, Serge Belongie, Ser-Nam Lim

    Abstract: Neural network classifiers have become the de-facto choice for current "pre-train then fine-tune" paradigms of visual classification. In this paper, we investigate k-Nearest-Neighbor (k-NN) classifiers, a classical model-free learning method from the pre-deep learning era, as an augmentation to modern neural network based approaches. As a lazy learning method, k-NN simply aggregates the distance b… ▽ More

    Submitted 17 December, 2021; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: Modified paragraph spacing

  14. arXiv:2109.13449  [pdf, other

    cs.LG cs.CL cs.CV

    When in Doubt: Improving Classification Performance with Alternating Normalization

    Authors: Menglin Jia, Austin Reiter, Ser-Nam Lim, Yoav Artzi, Claire Cardie

    Abstract: We introduce Classification with Alternating Normalization (CAN), a non-parametric post-processing step for classification. CAN improves classification accuracy for challenging examples by re-adjusting their predicted class probability distribution using the predicted class distributions of high-confidence validation examples. CAN is easily applicable to any probabilistic classifier, with minimal… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: Findings of EMNLP 2021

  15. arXiv:2108.13684  [pdf, other

    cs.CL

    Faithful or Extractive? On Mitigating the Faithfulness-Abstractiveness Trade-off in Abstractive Summarization

    Authors: Faisal Ladhak, Esin Durmus, He He, Claire Cardie, Kathleen McKeown

    Abstract: Despite recent progress in abstractive summarization, systems still suffer from faithfulness errors. While prior work has proposed models that improve faithfulness, it is unclear whether the improvement comes from an increased level of extractiveness of the model outputs as one naive way to improve faithfulness is to make summarization models more extractive. In this work, we present a framework f… ▽ More

    Submitted 21 April, 2022; v1 submitted 31 August, 2021; originally announced August 2021.

    Comments: Published in ACL 2022 main conference

  16. arXiv:2104.07767  [pdf, other

    cs.CV cs.LG

    Exploring Visual Engagement Signals for Representation Learning

    Authors: Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge Belongie, Ser-Nam Lim

    Abstract: Visual engagement in social media platforms comprises interactions with photo posts including comments, shares, and likes. In this paper, we leverage such visual engagement clues as supervisory signals for representation learning. However, learning from engagement signals is non-trivial as it is not clear how to bridge the gap between low-level visual information and high-level social interactions… ▽ More

    Submitted 14 August, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: ICCV2021 camera ready

  17. arXiv:2102.01226  [pdf, other

    cs.CL

    Self-Teaching Machines to Read and Comprehend with Large-Scale Multi-Subject Question-Answering Data

    Authors: Dian Yu, Kai Sun, Dong Yu, Claire Cardie

    Abstract: In spite of much recent research in the area, it is still unclear whether subject-area question-answering data is useful for machine reading comprehension (MRC) tasks. In this paper, we investigate this question. We collect a large-scale multi-subject multiple-choice question-answering dataset, ExamQA, and use incomplete and noisy snippets returned by a web search engine as the relevant context fo… ▽ More

    Submitted 6 April, 2021; v1 submitted 1 February, 2021; originally announced February 2021.

  18. arXiv:2011.05558  [pdf, other

    cs.CV cs.SI

    Intentonomy: a Dataset and Study towards Human Intent Understanding

    Authors: Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge Belongie, Ser-Nam Lim

    Abstract: An image is worth a thousand words, conveying information that goes beyond the physical visual content therein. In this paper, we study the intent behind social media images with an aim to analyze how visual information can help the recognition of human intent. Towards this goal, we introduce an intent dataset, Intentonomy, comprising 14K images covering a wide range of everyday scenes. These imag… ▽ More

    Submitted 27 March, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

    Comments: CVPR2021

  19. arXiv:2011.02610  [pdf, other

    cs.CL cs.AI

    Improving Event Duration Prediction via Time-aware Pre-training

    Authors: Zonglin Yang, Xinya Du, Alexander Rush, Claire Cardie

    Abstract: End-to-end models in NLP rarely encode external world knowledge about length of time. We introduce two effective models for duration prediction, which incorporate external knowledge by reading temporal-related news sentences (time-aware pre-training). Specifically, one model predicts the range/unit where the duration value falls in (R-pred); and the other predicts the exact duration value E-pred.… ▽ More

    Submitted 4 November, 2020; originally announced November 2020.

    Comments: to be published in Findings of EMNLP 2020

  20. arXiv:2010.12757  [pdf, other

    cs.CL

    Adding Chit-Chat to Enhance Task-Oriented Dialogues

    Authors: Kai Sun, Seungwhan Moon, Paul Crook, Stephen Roller, Becka Silvert, Bing Liu, Zhiguang Wang, Honglei Liu, Eunjoon Cho, Claire Cardie

    Abstract: Existing dialogue corpora and models are typically designed under two disjoint motives: while task-oriented systems focus on achieving functional goals (e.g., booking hotels), open-domain chatbots aim at making socially engaging conversations. In this work, we propose to integrate both types of systems by Adding Chit-Chat to ENhance Task-ORiented dialogues (ACCENTOR), with the goal of making virtu… ▽ More

    Submitted 1 May, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: To appear in NAACL-HLT 2021

  21. arXiv:2010.03538  [pdf, other

    cs.CL

    Exploring the Role of Argument Structure in Online Debate Persuasion

    Authors: Jialu Li, Esin Durmus, Claire Cardie

    Abstract: Online debate forums provide users a platform to express their opinions on controversial topics while being exposed to opinions from diverse set of viewpoints. Existing work in Natural Language Processing (NLP) has shown that linguistic features extracted from the debate text and features encoding the characteristics of the audience are both critical in persuasion studies. In this paper, we aim to… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: Accepted to EMNLP 2020

  22. arXiv:2010.03093  [pdf, other

    cs.CL

    WikiLingua: A New Benchmark Dataset for Cross-Lingual Abstractive Summarization

    Authors: Faisal Ladhak, Esin Durmus, Claire Cardie, Kathleen McKeown

    Abstract: We introduce WikiLingua, a large-scale, multilingual dataset for the evaluation of crosslingual abstractive summarization systems. We extract article and summary pairs in 18 languages from WikiHow, a high quality, collaborative resource of how-to guides on a diverse set of topics written by human authors. We create gold-standard article-summary alignments across languages by aligning the images th… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: Findings of EMNLP 2020

  23. arXiv:2009.05831  [pdf, other

    cs.CL

    Improving Machine Reading Comprehension with Contextualized Commonsense Knowledge

    Authors: Kai Sun, Dian Yu, Jianshu Chen, Dong Yu, Claire Cardie

    Abstract: In this paper, we aim to extract commonsense knowledge to improve machine reading comprehension. We propose to represent relations implicitly by situating structured knowledge in a context instead of relying on a pre-defined set of relations, and we call it contextualized knowledge. Each piece of contextualized knowledge consists of a pair of interrelated verbal and nonverbal messages extracted fr… ▽ More

    Submitted 18 October, 2020; v1 submitted 12 September, 2020; originally announced September 2020.

  24. arXiv:2008.09249  [pdf, other

    cs.CL

    GRIT: Generative Role-filler Transformers for Document-level Event Entity Extraction

    Authors: Xinya Du, Alexander M. Rush, Claire Cardie

    Abstract: We revisit the classic problem of document-level role-filler entity extraction (REE) for template filling. We argue that sentence-level approaches are ill-suited to the task and introduce a generative transformer-based encoder-decoder framework (GRIT) that is designed to model context at the document level: it can make extraction decisions across sentence boundaries; is implicitly aware of noun ph… ▽ More

    Submitted 28 January, 2021; v1 submitted 20 August, 2020; originally announced August 2020.

    Comments: To appear in EACL 2021; Code is available at https://github.com/xinyadu/grit_doc_event_entity

  25. arXiv:2005.06579  [pdf, other

    cs.CL

    Document-Level Event Role Filler Extraction using Multi-Granularity Contextualized Encoding

    Authors: Xinya Du, Claire Cardie

    Abstract: Few works in the literature of event extraction have gone beyond individual sentences to make extraction decisions. This is problematic when the information needed to recognize an event argument is spread across multiple sentences. We argue that document-level event extraction is a difficult task since it requires a view of a larger context to determine which spans of text correspond to event role… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

    Comments: Accepted to ACL 2020 (long papers), 12 pages

  26. arXiv:2004.13625  [pdf, other

    cs.CL

    Event Extraction by Answering (Almost) Natural Questions

    Authors: Xinya Du, Claire Cardie

    Abstract: The problem of event extraction requires detecting the event trigger and extracting its corresponding arguments. Existing work in event argument extraction typically relies heavily on entity recognition as a preprocessing/concurrent step, causing the well-known problem of error propagation. To avoid this issue, we introduce a new paradigm for event extraction by formulating it as a question answer… ▽ More

    Submitted 4 February, 2021; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: EMNLP 2020

  27. arXiv:2004.12276  [pdf, other

    cs.CV cs.LG eess.IV

    Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset

    Authors: Menglin Jia, Mengyun Shi, Mikhail Sirotenko, Yin Cui, Claire Cardie, Bharath Hariharan, Hartwig Adam, Serge Belongie

    Abstract: In this work we explore the task of instance segmentation with attribute localization, which unifies instance segmentation (detect and segment each object instance) and fine-grained visual attribute categorization (recognize one or multiple attributes). The proposed task requires both localizing an object and describing its properties. To illustrate the various aspects of this task, we focus on th… ▽ More

    Submitted 18 July, 2020; v1 submitted 25 April, 2020; originally announced April 2020.

    Comments: eccv2020

  28. arXiv:2004.08056  [pdf, other

    cs.CL

    Dialogue-Based Relation Extraction

    Authors: Dian Yu, Kai Sun, Claire Cardie, Dong Yu

    Abstract: We present the first human-annotated dialogue-based relation extraction (RE) dataset DialogRE, aiming to support the prediction of relation(s) between two arguments that appear in a dialogue. We further offer DialogRE as a platform for studying cross-sentence RE as most facts span multiple sentences. We argue that speaker-related information plays a critical role in the proposed task, based on an… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: To appear in ACL 2020

  29. The Role of Pragmatic and Discourse Context in Determining Argument Impact

    Authors: Esin Durmus, Faisal Ladhak, Claire Cardie

    Abstract: Research in the social sciences and psychology has shown that the persuasiveness of an argument depends not only the language employed, but also on attributes of the source/communicator, the audience, and the appropriateness and strength of the argument's claims given the pragmatic and discourse context of the argument. Among these characteristics of persuasive arguments, prior work in NLP does no… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

    Comments: EMNLP 2019

  30. Determining Relative Argument Specificity and Stance for Complex Argumentative Structures

    Authors: Esin Durmus, Faisal Ladhak, Claire Cardie

    Abstract: Systems for automatic argument generation and debate require the ability to (1) determine the stance of any claims employed in the argument and (2) assess the specificity of each claim relative to the argument context. Existing work on understanding claim specificity and stance, however, has been limited to the study of argumentative structures that are relatively shallow, most often consisting of… ▽ More

    Submitted 26 June, 2019; originally announced June 2019.

  31. A Corpus for Modeling User and Language Effects in Argumentation on Online Debating

    Authors: Esin Durmus, Claire Cardie

    Abstract: Existing argumentation datasets have succeeded in allowing researchers to develop computational methods for analyzing the content, structure and linguistic features of argumentative text. They have been much less successful in fostering studies of the effect of "user" traits -- characteristics and beliefs of the participants -- on the debate/argument outcome as this type of user information is gen… ▽ More

    Submitted 26 June, 2019; originally announced June 2019.

  32. Exploring the Role of Prior Beliefs for Argument Persuasion

    Authors: Esin Durmus, Claire Cardie

    Abstract: Public debate forums provide a common platform for exchanging opinions on a topic of interest. While recent studies in natural language processing (NLP) have provided empirical evidence that the language of the debaters and their patterns of interaction play a key role in changing the mind of a reader, research in psychology has shown that prior beliefs can affect our interpretation of an argument… ▽ More

    Submitted 26 June, 2019; originally announced June 2019.

    Comments: 11 pages

  33. arXiv:1906.08942  [pdf, other

    cs.CL cs.LG

    Be Consistent! Improving Procedural Text Comprehension using Label Consistency

    Authors: Xinya Du, Bhavana Dalvi Mishra, Niket Tandon, Antoine Bosselut, Wen-tau Yih, Peter Clark, Claire Cardie

    Abstract: Our goal is procedural text comprehension, namely tracking how the properties of entities (e.g., their location) change with time given a procedural text (e.g., a paragraph about photosynthesis, a recipe). This task is challenging as the world is changing throughout the text, and despite recent advances, current systems still struggle with this task. Our approach is to leverage the fact that, for… ▽ More

    Submitted 21 June, 2019; originally announced June 2019.

    Comments: NAACL 2019

  34. arXiv:1906.05275  [pdf, other

    cs.CL cs.LG

    Kee** Notes: Conditional Natural Language Generation with a Scratchpad Mechanism

    Authors: Ryan Y. Benmalek, Madian Khabsa, Suma Desu, Claire Cardie, Michele Banko

    Abstract: We introduce the Scratchpad Mechanism, a novel addition to the sequence-to-sequence (seq2seq) neural network architecture and demonstrate its effectiveness in improving the overall fluency of seq2seq models for natural language generation tasks. By enabling the decoder at each time step to write to all of the encoder output layers, Scratchpad can employ the encoder as a "scratchpad" memory to keep… ▽ More

    Submitted 13 June, 2019; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: Accepted to ACL 2019

  35. arXiv:1904.09679  [pdf, other

    cs.CL

    Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension

    Authors: Kai Sun, Dian Yu, Dong Yu, Claire Cardie

    Abstract: Machine reading comprehension tasks require a machine reader to answer questions relevant to the given document. In this paper, we present the first free-form multiple-Choice Chinese machine reading Comprehension dataset (C^3), containing 13,369 documents (dialogues or more formally written mixed-genre texts) and their associated 19,577 multiple-choice free-form questions collected from Chinese-as… ▽ More

    Submitted 17 December, 2019; v1 submitted 21 April, 2019; originally announced April 2019.

    Comments: To appear in TACL

  36. arXiv:1902.00993  [pdf, other

    cs.CL

    Improving Question Answering with External Knowledge

    Authors: Xiaoman Pan, Kai Sun, Dian Yu, Jianshu Chen, Heng Ji, Claire Cardie, Dong Yu

    Abstract: We focus on multiple-choice question answering (QA) tasks in subject areas such as science, where we require both broad background knowledge and the facts from the given subject-area reference corpus. In this work, we explore simple yet effective methods for exploiting two sources of external knowledge for subject-area QA. The first enriches the original subject-area reference corpus with relevant… ▽ More

    Submitted 1 October, 2019; v1 submitted 3 February, 2019; originally announced February 2019.

    Comments: Accepted to MRQA (2019)

  37. arXiv:1902.00164  [pdf, other

    cs.CL

    DREAM: A Challenge Dataset and Models for Dialogue-Based Reading Comprehension

    Authors: Kai Sun, Dian Yu, Jianshu Chen, Dong Yu, Ye** Choi, Claire Cardie

    Abstract: We present DREAM, the first dialogue-based multiple-choice reading comprehension dataset. Collected from English-as-a-foreign-language examinations designed by human experts to evaluate the comprehension level of Chinese learners of English, our dataset contains 10,197 multiple-choice questions for 6,444 dialogues. In contrast to existing reading comprehension datasets, DREAM is the first to focus… ▽ More

    Submitted 31 January, 2019; originally announced February 2019.

    Comments: To appear in TACL

  38. arXiv:1810.13441  [pdf, other

    cs.CL

    Improving Machine Reading Comprehension with General Reading Strategies

    Authors: Kai Sun, Dian Yu, Dong Yu, Claire Cardie

    Abstract: Reading strategies have been shown to improve comprehension levels, especially for readers lacking adequate prior knowledge. Just as the process of knowledge accumulation is time-consuming for human readers, it is resource-demanding to impart rich general domain knowledge into a deep language model via pre-training. Inspired by reading strategies identified in cognitive science, and given limited… ▽ More

    Submitted 22 March, 2019; v1 submitted 31 October, 2018; originally announced October 2018.

    Comments: To appear in NAACL-HLT 2019

  39. arXiv:1810.03552  [pdf, other

    cs.CL cs.LG

    Multi-Source Cross-Lingual Model Transfer: Learning What to Share

    Authors: Xilun Chen, Ahmed Hassan Awadallah, Hany Hassan, Wei Wang, Claire Cardie

    Abstract: Modern NLP applications have enjoyed a great boost utilizing neural networks models. Such deep neural models, however, are not applicable to most human languages due to the lack of annotated training data for various NLP tasks. Cross-lingual transfer learning (CLTL) is a viable method for building NLP models for a low-resource target language by leveraging labeled data from other (source) language… ▽ More

    Submitted 5 June, 2019; v1 submitted 8 October, 2018; originally announced October 2018.

    Comments: ACL 2019

  40. arXiv:1809.00653  [pdf, other

    cs.CL cs.LG stat.ML

    Towards Dynamic Computation Graphs via Sparse Latent Structure

    Authors: Vlad Niculae, André F. T. Martins, Claire Cardie

    Abstract: Deep NLP models benefit from underlying structures in the data---e.g., parse trees---typically extracted using off-the-shelf parsers. Recent attempts to jointly learn the latent structure encounter a tradeoff: either make factorization assumptions that limit expressiveness, or sacrifice end-to-end differentiability. Using the recently proposed SparseMAP inference, which retrieves a sparse distribu… ▽ More

    Submitted 3 September, 2018; originally announced September 2018.

    Comments: EMNLP 2018; 9 pages (incl. appendix)

    MSC Class: 68T50 ACM Class: I.2.6; I.2.7

  41. arXiv:1808.08933  [pdf, other

    cs.CL

    Unsupervised Multilingual Word Embeddings

    Authors: Xilun Chen, Claire Cardie

    Abstract: Multilingual Word Embeddings (MWEs) represent words from multiple languages in a single distributional vector space. Unsupervised MWE (UMWE) methods acquire multilingual embeddings without cross-lingual supervision, which is a significant advantage over traditional supervised approaches and opens many new possibilities for low-resource languages. Prior art for learning UMWEs, however, merely relie… ▽ More

    Submitted 6 September, 2018; v1 submitted 27 August, 2018; originally announced August 2018.

    Comments: EMNLP 2018

  42. arXiv:1806.06183  [pdf, other

    cs.CV

    The Neural Painter: Multi-Turn Image Generation

    Authors: Ryan Y. Benmalek, Claire Cardie, Serge Belongie, Xiadong He, Jianfeng Gao

    Abstract: In this work we combine two research threads from Vision/ Graphics and Natural Language Processing to formulate an image generation task conditioned on attributes in a multi-turn setting. By multiturn, we mean the image is generated in a series of steps of user-specified conditioning information. Our proposed approach is practically useful and offers insights into neural interpretability. We intro… ▽ More

    Submitted 16 June, 2018; originally announced June 2018.

  43. arXiv:1805.05942  [pdf, other

    cs.CL

    Harvesting Paragraph-Level Question-Answer Pairs from Wikipedia

    Authors: Xinya Du, Claire Cardie

    Abstract: We study the task of generating from Wikipedia articles question-answer pairs that cover content beyond a single sentence. We propose a neural network approach that incorporates coreference knowledge via a novel gating mechanism. Compared to models that only take into account sentence-level information (Heilman and Smith, 2010; Du et al., 2017; Zhou et al., 2017), we find that the linguistic knowl… ▽ More

    Submitted 15 May, 2018; originally announced May 2018.

    Comments: Accepted to ACL 2018 (long paper)

  44. arXiv:1802.05694  [pdf, other

    cs.CL cs.LG stat.ML

    Multinomial Adversarial Networks for Multi-Domain Text Classification

    Authors: Xilun Chen, Claire Cardie

    Abstract: Many text classification tasks are known to be highly domain-dependent. Unfortunately, the availability of training data can vary drastically across domains. Worse still, for some domains there may not be any annotated data at all. In this work, we propose a multinomial adversarial network (MAN) to tackle the text classification problem in this real-world multidomain setting (MDTC). We provide the… ▽ More

    Submitted 15 February, 2018; originally announced February 2018.

    Comments: NAACL 2018

  45. arXiv:1802.04223  [pdf, other

    stat.ML cs.CL cs.LG

    SparseMAP: Differentiable Sparse Structured Inference

    Authors: Vlad Niculae, André F. T. Martins, Mathieu Blondel, Claire Cardie

    Abstract: Structured prediction requires searching over a combinatorial number of structures. To tackle it, we introduce SparseMAP: a new method for sparse structured inference, and its natural loss function. SparseMAP automatically selects only a few global structures: it is situated between MAP inference, which picks a single structure, and marginal inference, which assigns probability mass to all structu… ▽ More

    Submitted 20 June, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

    Comments: Published in ICML 2018. 14 pages, including appendix

    MSC Class: 68T50 ACM Class: I.2.6; I.2.6

  46. arXiv:1705.00106  [pdf, other

    cs.CL cs.AI

    Learning to Ask: Neural Question Generation for Reading Comprehension

    Authors: Xinya Du, Junru Shao, Claire Cardie

    Abstract: We study automatic question generation for sentences from text passages in reading comprehension. We introduce an attention-based sequence learning model for the task and investigate the effect of encoding sentence- vs. paragraph-level information. In contrast to all previous work, our model does not rely on hand-crafted rules or a sophisticated NLP pipeline; it is instead trainable end-to-end via… ▽ More

    Submitted 28 April, 2017; originally announced May 2017.

    Comments: Accepted to ACL 2017, 11 pages

  47. arXiv:1704.06869  [pdf, other

    cs.CL

    Argument Mining with Structured SVMs and RNNs

    Authors: Vlad Niculae, Joonsuk Park, Claire Cardie

    Abstract: We propose a novel factor graph model for argument mining, designed for settings in which the argumentative relations in a document do not necessarily form a tree structure. (This is the case in over 20% of the web comments dataset we release.) Our model jointly learns elementary unit type classification and argumentative relation prediction. Moreover, our model supports SVM and RNN parametrizatio… ▽ More

    Submitted 22 April, 2017; originally announced April 2017.

    Comments: Accepted for publication at ACL 2017. 11 pages, 5 figures. Code at https://github.com/vene/marseille and data at http://joonsuk.org/

    MSC Class: 68T50 ACM Class: I.2.7

  48. arXiv:1606.07965  [pdf, ps, other

    cs.CL

    Summarizing Decisions in Spoken Meetings

    Authors: Lu Wang, Claire Cardie

    Abstract: This paper addresses the problem of summarizing decisions in spoken meetings: our goal is to produce a concise {\it decision abstract} for each meeting decision. We explore and compare token-level and dialogue act-level automatic summarization methods using both unsupervised and supervised learning frameworks. In the supervised summarization setting, and given true clusterings of decision-related… ▽ More

    Submitted 25 June, 2016; originally announced June 2016.

    Comments: ACL Workshop on Automatic Summarization for Different Genres, Media, and Languages, 2011

  49. arXiv:1606.07849  [pdf, other

    cs.CL

    Focused Meeting Summarization via Unsupervised Relation Extraction

    Authors: Lu Wang, Claire Cardie

    Abstract: We present a novel unsupervised framework for focused meeting summarization that views the problem as an instance of relation extraction. We adapt an existing in-domain relation learner (Chen et al., 2011) by exploiting a set of task-specific constraints and features. We evaluate the approach on a decision summarization task and show that it outperforms unsupervised utterance-level extractive summ… ▽ More

    Submitted 24 June, 2016; originally announced June 2016.

    Comments: SIGDIAL 2012

  50. arXiv:1606.07829  [pdf, other

    cs.CL

    Unsupervised Topic Modeling Approaches to Decision Summarization in Spoken Meetings

    Authors: Lu Wang, Claire Cardie

    Abstract: We present a token-level decision summarization framework that utilizes the latent topic structures of utterances to identify "summary-worthy" words. Concretely, a series of unsupervised topic models is explored and experimental results show that fine-grained topic models, which discover topics at the utterance-level rather than the document-level, can better identify the gist of the decision-maki… ▽ More

    Submitted 24 June, 2016; originally announced June 2016.

    Comments: SIGDIAL 2012