Skip to main content

Showing 1–19 of 19 results for author: Hsu, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.05365  [pdf, other

    cs.CL cs.AI cs.LG

    CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation

    Authors: I-Hung Hsu, Zifeng Wang, Long T. Le, Lesly Miculicich, Nanyun Peng, Chen-Yu Lee, Tomas Pfister

    Abstract: Grounded generation aims to equip language models (LMs) with the ability to produce more credible and accountable responses by accurately citing verifiable sources. However, existing methods, by either feeding LMs with raw or preprocessed materials, remain prone to errors. To address this, we introduce CaLM, a novel verification framework. CaLM leverages the insight that a robust grounded response… ▽ More

    Submitted 24 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

    Comments: ACL 2024 Camera Ready Version

  2. arXiv:2403.15097  [pdf, other

    cs.CL cs.AI

    Argument-Aware Approach To Event Linking

    Authors: I-Hung Hsu, Zihan Xue, Nilay Pochh, Sahil Bansal, Premkumar Natarajan, Jayanth Srinivasa, Nanyun Peng

    Abstract: Event linking connects event mentions in text with relevant nodes in a knowledge base (KB). Prior research in event linking has mainly borrowed methods from entity linking, overlooking the distinct features of events. Compared to the extensively explored entity linking task, events have more complex structures and can be more effectively distinguished by examining their associated arguments. Moreo… ▽ More

    Submitted 6 June, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

    Comments: Paper accepted by ACL-findings 2024

  3. arXiv:2311.09562  [pdf, other

    cs.CL

    TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction

    Authors: Kuan-Hao Huang, I-Hung Hsu, Tanmay Parekh, Zhiyu Xie, Zixuan Zhang, Premkumar Natarajan, Kai-Wei Chang, Nanyun Peng, Heng Ji

    Abstract: Event extraction has gained considerable interest due to its wide-ranging applications. However, recent studies draw attention to evaluation issues, suggesting that reported scores may not accurately reflect the true performance. In this work, we identify and address evaluation challenges, including inconsistency due to varying data assumptions or preprocessing steps, the insufficiency of current… ▽ More

    Submitted 6 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: Paper accepted by ACL 2024 Findings

  4. arXiv:2309.08943  [pdf, other

    cs.CL

    Contextual Label Projection for Cross-Lingual Structured Prediction

    Authors: Tanmay Parekh, I-Hung Hsu, Kuan-Hao Huang, Kai-Wei Chang, Nanyun Peng

    Abstract: Label projection, which involves obtaining translated labels and texts jointly, is essential for leveraging machine translation to facilitate cross-lingual transfer in structured prediction tasks. Prior research exploring label projection often compromise translation accuracy by favoring simplified label translation or relying solely on word-level alignments. In this paper, we introduce a novel la… ▽ More

    Submitted 14 April, 2024; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: Accepted at NAACL 2024

  5. arXiv:2305.16734  [pdf, other

    cs.CL cs.AI

    AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model

    Authors: I-Hung Hsu, Zhiyu Xie, Kuan-Hao Huang, Prem Natarajan, Nanyun Peng

    Abstract: Event argument extraction (EAE) identifies event arguments and their specific roles for a given event. Recent advancement in generation-based EAE models has shown great performance and generalizability over classification-based models. However, existing generation-based EAE models mostly focus on problem re-formulation and prompt design, without incorporating additional information that has been s… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Paper accepted by ACL2023 as a main conference paper. The first two authors contribute equally. Code can be publicly accessible at https://github.com/PlusLabNLP/AMPERE

  6. arXiv:2305.16724  [pdf, other

    cs.CL cs.AI

    Code-Switched Text Synthesis in Unseen Language Pairs

    Authors: I-Hung Hsu, Avik Ray, Shubham Garg, Nanyun Peng, **g Huang

    Abstract: Existing efforts on text synthesis for code-switching mostly require training on code-switched texts in the target language pairs, limiting the deployment of the models to cases lacking code-switched data. In this work, we study the problem of synthesizing code-switched texts for language pairs absent from the training data. We introduce GLOSS, a model built on top of a pre-trained multilingual ma… ▽ More

    Submitted 7 July, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Paper accepted by ACL2023 as a Finding paper

  7. arXiv:2305.16585  [pdf, other

    cs.CL

    ParaAMR: A Large-Scale Syntactically Diverse Paraphrase Dataset by AMR Back-Translation

    Authors: Kuan-Hao Huang, Varun Iyer, I-Hung Hsu, Anoop Kumar, Kai-Wei Chang, Aram Galstyan

    Abstract: Paraphrase generation is a long-standing task in natural language processing (NLP). Supervised paraphrase generation models, which rely on human-annotated paraphrase pairs, are cost-inefficient and hard to scale up. On the other hand, automatically annotated paraphrase pairs (e.g., by machine back-translation), usually suffer from the lack of syntactic diversity -- the generated paraphrase sentenc… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  8. arXiv:2212.10786  [pdf, other

    cs.CL cs.IR cs.LG

    Multi-hop Evidence Retrieval for Cross-document Relation Extraction

    Authors: Keming Lu, I-Hung Hsu, Wenxuan Zhou, Mingyu Derek Ma, Muhao Chen

    Abstract: Relation Extraction (RE) has been extended to cross-document scenarios because many relations are not simply described in a single document. This inevitably brings the challenge of efficient open-space evidence retrieval to support the inference of cross-document relations, along with the challenge of multi-hop reasoning on top of entities and evidence scattered in an open set of documents. To com… ▽ More

    Submitted 4 June, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

    Comments: ACL 2023 (Findings)

  9. arXiv:2205.12585  [pdf, other

    cs.CL cs.AI

    TAGPRIME: A Unified Framework for Relational Structure Extraction

    Authors: I-Hung Hsu, Kuan-Hao Huang, Shuning Zhang, Wenxin Cheng, Premkumar Natarajan, Kai-Wei Chang, Nanyun Peng

    Abstract: Many tasks in natural language processing require the extraction of relationship information for a given condition, such as event argument extraction, relation extraction, and task-oriented semantic parsing. Recent works usually propose sophisticated models for each task independently and pay less attention to the commonality of these tasks and to have a unified framework for all the tasks. In thi… ▽ More

    Submitted 26 May, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Paper accepted by ACL2023 as a main conference paper. The first two authors contribute equally

  10. arXiv:2205.12505  [pdf, other

    cs.CL

    GENEVA: Benchmarking Generalizability for Event Argument Extraction with Hundreds of Event Types and Argument Roles

    Authors: Tanmay Parekh, I-Hung Hsu, Kuan-Hao Huang, Kai-Wei Chang, Nanyun Peng

    Abstract: Recent works in Event Argument Extraction (EAE) have focused on improving model generalizability to cater to new events and domains. However, standard benchmarking datasets like ACE and ERE cover less than 40 event types and 25 entity-centric argument roles. Limited diversity and coverage hinder these datasets from adequately evaluating the generalizability of EAE models. In this paper, we first c… ▽ More

    Submitted 1 June, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted at ACL 2023 main conference

  11. arXiv:2205.09837  [pdf, other

    cs.CL cs.LG

    Summarization as Indirect Supervision for Relation Extraction

    Authors: Keming Lu, I-Hung Hsu, Wenxuan Zhou, Mingyu Derek Ma, Muhao Chen

    Abstract: Relation extraction (RE) models have been challenged by their reliance on training data with expensive annotations. Considering that summarization tasks aim at acquiring concise expressions of synoptical information from the longer context, these tasks naturally align with the objective of RE, i.e., extracting a kind of synoptical information that describes the relation of entity mentions. We pres… ▽ More

    Submitted 21 October, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: Accepted by EMNLP 2022

  12. arXiv:2203.08308  [pdf, other

    cs.CL

    Multilingual Generative Language Models for Zero-Shot Cross-Lingual Event Argument Extraction

    Authors: Kuan-Hao Huang, I-Hung Hsu, Premkumar Natarajan, Kai-Wei Chang, Nanyun Peng

    Abstract: We present a study on leveraging multilingual pre-trained generative language models for zero-shot cross-lingual event argument extraction (EAE). By formulating EAE as a language generation task, our method effectively encodes event structures and captures the dependencies between arguments. We design language-agnostic templates to represent the event argument structures, which are compatible with… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: ACL 2022. Our code is available at https://github.com/PlusLabNLP/X-Gear

  13. arXiv:2108.12724  [pdf, other

    cs.CL cs.AI

    DEGREE: A Data-Efficient Generation-Based Event Extraction Model

    Authors: I-Hung Hsu, Kuan-Hao Huang, Elizabeth Boschee, Scott Miller, Prem Natarajan, Kai-Wei Chang, Nanyun Peng

    Abstract: Event extraction requires high-quality expert human annotations, which are usually expensive. Therefore, learning a data-efficient event extraction model that can be trained with only a few labeled examples has become a crucial challenge. In this paper, we focus on low-resource end-to-end event extraction and propose DEGREE, a data-efficient model that formulates event extraction as a conditional… ▽ More

    Submitted 3 May, 2022; v1 submitted 28 August, 2021; originally announced August 2021.

    Comments: Paper accepted by NAACL 2022. The first two authors contribute equally. Our code and models can be found at https://github.com/PlusLabNLP/DEGREE

  14. arXiv:2104.08350  [pdf, other

    cs.CL

    ESTER: A Machine Reading Comprehension Dataset for Event Semantic Relation Reasoning

    Authors: Rujun Han, I-Hung Hsu, Jiao Sun, Julia Baylon, Qiang Ning, Dan Roth, Nanyun Peng

    Abstract: Understanding how events are semantically related to each other is the essence of reading comprehension. Recent event-centric reading comprehension datasets focus mostly on event arguments or temporal relations. While these tasks partially evaluate machines' ability of narrative understanding, human-like reading comprehension requires the capability to process event-based information beyond argume… ▽ More

    Submitted 10 September, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: This paper has been accepted my EMNLP'21 main conference

  15. arXiv:2101.00124  [pdf, other

    cs.CL cs.AI cs.LG

    Discourse-level Relation Extraction via Graph Pooling

    Authors: I-Hung Hsu, Xiao Guo, Premkumar Natarajan, Nanyun Peng

    Abstract: The ability to capture complex linguistic structures and long-term dependencies among words in the passage is essential for discourse-level relation extraction (DRE) tasks. Graph neural networks (GNNs), one of the methods to encode dependency graphs, have been shown effective in prior works for DRE. However, relatively little attention has been paid to receptive fields of GNNs, which can be crucia… ▽ More

    Submitted 12 November, 2021; v1 submitted 31 December, 2020; originally announced January 2021.

    Comments: 12 pages, page 10-12 appendix

  16. arXiv:2005.06684  [pdf, other

    eess.IV cs.CV cs.LG

    W-Cell-Net: Multi-frame Interpolation of Cellular Microscopy Videos

    Authors: Rohit Saha, Abenezer Teklemariam, Ian Hsu, Alan M. Moses

    Abstract: Deep Neural Networks are increasingly used in video frame interpolation tasks such as frame rate changes as well as generating fake face videos. Our project aims to apply recent advances in Deep video interpolation to increase the temporal resolution of fluorescent microscopy time-lapse movies. To our knowledge, there is no previous work that uses Convolutional Neural Networks (CNN) to generate fr… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

  17. arXiv:1909.10094  [pdf, other

    cs.CL

    Deep Structured Neural Network for Event Temporal Relation Extraction

    Authors: Rujun Han, I-Hung Hsu, Mu Yang, Aram Galstyan, Ralph Weischedel, Nanyun Peng

    Abstract: We propose a novel deep structured learning framework for event temporal relation extraction. The model consists of 1) a recurrent neural network (RNN) to learn scoring functions for pair-wise relations, and 2) a structured support vector machine (SSVM) to make joint predictions. The neural network automatically learns representations that account for long-term contexts to provide robust features… ▽ More

    Submitted 24 September, 2019; v1 submitted 22 September, 2019; originally announced September 2019.

    Comments: This paper will be published in CoNLL 2019

  18. arXiv:1907.03233  [pdf, other

    cs.CL cs.SD eess.AS

    NIESR: Nuisance Invariant End-to-end Speech Recognition

    Authors: I-Hung Hsu, Ayush Jaiswal, Premkumar Natarajan

    Abstract: Deep neural network models for speech recognition have achieved great success recently, but they can learn incorrect associations between the target and nuisance factors of speech (e.g., speaker identities, background noise, etc.), which can lead to overfitting. While several methods have been proposed to tackle this problem, existing methods incorporate additional information about nuisance facto… ▽ More

    Submitted 7 July, 2019; originally announced July 2019.

    Comments: To appear in Proceedings of Interspeech 2019

  19. arXiv:1709.07862  [pdf, other

    cs.CL

    Mitigating the Impact of Speech Recognition Errors on Chatbot using Sequence-to-Sequence Model

    Authors: Pin-Jung Chen, I-Hung Hsu, Yi-Yao Huang, Hung-Yi Lee

    Abstract: We apply sequence-to-sequence model to mitigate the impact of speech recognition errors on open domain end-to-end dialog generation. We cast the task as a domain adaptation problem where ASR transcriptions and original text are in two different domains. In this paper, our proposed model includes two individual encoders for each domain data and make their hidden states similar to ensure the decoder… ▽ More

    Submitted 2 December, 2017; v1 submitted 22 September, 2017; originally announced September 2017.

    Comments: Accepted at ASRU 2017