Skip to main content

Showing 1–15 of 15 results for author: Htut, P M

Searching in archive cs. Search in all archives.
.
  1. RAMP: Retrieval and Attribute-Marking Enhanced Prompting for Attribute-Controlled Translation

    Authors: Gabriele Sarti, Phu Mon Htut, Xing Niu, Benjamin Hsu, Anna Currey, Georgiana Dinu, Maria Nadejde

    Abstract: Attribute-controlled translation (ACT) is a subtask of machine translation that involves controlling stylistic or linguistic attributes (like formality and gender) of translation outputs. While ACT has garnered attention in recent years due to its usefulness in real-world applications, progress in the task is currently limited by dataset availability, since most prior approaches rely on supervised… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023

    Journal ref: Proceedings of ACL (2023) 1476-1490

  2. arXiv:2212.10003  [pdf, other

    cs.CL

    (QA)$^2$: Question Answering with Questionable Assumptions

    Authors: Najoung Kim, Phu Mon Htut, Samuel R. Bowman, Jackson Petty

    Abstract: Naturally occurring information-seeking questions often contain questionable assumptions -- assumptions that are false or unverifiable. Questions containing questionable assumptions are challenging because they require a distinct answer strategy that deviates from typical answers for information-seeking questions. For instance, the question "When did Marie Curie discover Uranium?" cannot be answer… ▽ More

    Submitted 29 August, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: ACL 2023 camera-ready

  3. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  4. arXiv:2110.08193  [pdf, other

    cs.CL

    BBQ: A Hand-Built Bias Benchmark for Question Answering

    Authors: Alicia Parrish, Angelica Chen, Nikita Nangia, Vishakh Padmakumar, Jason Phang, Jana Thompson, Phu Mon Htut, Samuel R. Bowman

    Abstract: It is well documented that NLP models learn social biases, but little work has been done on how these biases manifest in model outputs for applied tasks like question answering (QA). We introduce the Bias Benchmark for QA (BBQ), a dataset of question sets constructed by the authors that highlight attested social biases against people belonging to protected classes along nine social dimensions rele… ▽ More

    Submitted 15 March, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Accepted to ACL 2022 Findings. 20 pages, 10 figures

  5. arXiv:2106.00840  [pdf, other

    cs.CL

    Comparing Test Sets with Item Response Theory

    Authors: Clara Vania, Phu Mon Htut, William Huang, Dhara Mungra, Richard Yuanzhe Pang, Jason Phang, Haokun Liu, Kyunghyun Cho, Samuel R. Bowman

    Abstract: Recent years have seen numerous NLP datasets introduced to evaluate the performance of fine-tuned models on natural language understanding tasks. Recent results from large pretrained models, though, show that many of these datasets are largely saturated and unlikely to be able to detect further progress. What kind of datasets are still effective at discriminating among strong models, and what kind… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: ACL 2021

  6. arXiv:2005.13013  [pdf, other

    cs.CL

    English Intermediate-Task Training Improves Zero-Shot Cross-Lingual Transfer Too

    Authors: Jason Phang, Iacer Calixto, Phu Mon Htut, Yada Pruksachatkun, Haokun Liu, Clara Vania, Katharina Kann, Samuel R. Bowman

    Abstract: Intermediate-task training---fine-tuning a pretrained model on an intermediate task before fine-tuning again on the target task---often improves model performance substantially on language understanding tasks in monolingual English settings. We investigate whether English intermediate-task training is still helpful on non-English target tasks. Using nine intermediate language-understanding tasks,… ▽ More

    Submitted 30 September, 2020; v1 submitted 26 May, 2020; originally announced May 2020.

  7. arXiv:2005.00628  [pdf, other

    cs.CL

    Intermediate-Task Transfer Learning with Pretrained Models for Natural Language Understanding: When and Why Does It Work?

    Authors: Yada Pruksachatkun, Jason Phang, Haokun Liu, Phu Mon Htut, Xiaoyi Zhang, Richard Yuanzhe Pang, Clara Vania, Katharina Kann, Samuel R. Bowman

    Abstract: While pretrained models such as BERT have shown large gains across natural language understanding tasks, their performance can be improved by further training the model on a data-rich intermediate task, before fine-tuning it on a target task. However, it is still poorly understood when and why intermediate-task training is beneficial for a given target task. To investigate this, we perform a large… ▽ More

    Submitted 9 May, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

    Comments: ACL 2020

  8. arXiv:2003.02249  [pdf, other

    cs.CL

    jiant: A Software Toolkit for Research on General-Purpose Text Understanding Models

    Authors: Yada Pruksachatkun, Phil Yeres, Haokun Liu, Jason Phang, Phu Mon Htut, Alex Wang, Ian Tenney, Samuel R. Bowman

    Abstract: We introduce jiant, an open source toolkit for conducting multitask and transfer learning experiments on English NLU tasks. jiant enables modular and configuration-driven experimentation with state-of-the-art models and implements a broad set of tasks for probing, transfer learning, and multitask training experiments. jiant implements over 50 NLU tasks, including all GLUE and SuperGLUE benchmark t… ▽ More

    Submitted 13 May, 2020; v1 submitted 4 March, 2020; originally announced March 2020.

  9. arXiv:1911.12246  [pdf, other

    cs.CL

    Do Attention Heads in BERT Track Syntactic Dependencies?

    Authors: Phu Mon Htut, Jason Phang, Shikha Bordia, Samuel R. Bowman

    Abstract: We investigate the extent to which individual attention heads in pretrained transformer language models, such as BERT and RoBERTa, implicitly capture syntactic dependency relations. We employ two methods---taking the maximum attention weight and computing the maximum spanning tree---to extract implicit dependency relations from the attention weights of each layer/head, and compare them to the grou… ▽ More

    Submitted 27 November, 2019; originally announced November 2019.

  10. arXiv:1910.01727  [pdf, other

    cs.LG stat.ML

    Generalized Inner Loop Meta-Learning

    Authors: Edward Grefenstette, Brandon Amos, Denis Yarats, Phu Mon Htut, Artem Molchanov, Franziska Meier, Douwe Kiela, Kyunghyun Cho, Soumith Chintala

    Abstract: Many (but not all) approaches self-qualifying as "meta-learning" in deep learning and reinforcement learning fit a common pattern of approximating the solution to a nested optimization problem. In this paper, we give a formalization of this shared pattern, which we call GIMLI, prove its general requirements, and derive a general-purpose algorithm for implementing similar approaches. Based on this… ▽ More

    Submitted 7 October, 2019; v1 submitted 3 October, 2019; originally announced October 2019.

    Comments: 17 pages, 3 figures, 1 algorithm

  11. arXiv:1909.10056  [pdf, other

    cs.CL

    Inducing Constituency Trees through Neural Machine Translation

    Authors: Phu Mon Htut, Kyunghyun Cho, Samuel R. Bowman

    Abstract: Latent tree learning(LTL) methods learn to parse sentences using only indirect supervision from a downstream task. Recent advances in latent tree learning have made it possible to recover moderately high quality tree structures by training with language modeling or auto-encoding objectives. In this work, we explore the hypothesis that decoding in machine translation, as a conditional language mode… ▽ More

    Submitted 22 September, 2019; originally announced September 2019.

  12. arXiv:1909.02597  [pdf, other

    cs.CL

    Investigating BERT's Knowledge of Language: Five Analysis Methods with NPIs

    Authors: Alex Warstadt, Yu Cao, Ioana Grosu, Wei Peng, Hagen Blix, Yining Nie, Anna Alsop, Shikha Bordia, Haokun Liu, Alicia Parrish, Sheng-Fu Wang, Jason Phang, Anhad Mohananey, Phu Mon Htut, Paloma Jeretič, Samuel R. Bowman

    Abstract: Though state-of-the-art sentence representation models can perform tasks requiring significant knowledge of grammar, it is an open question how best to evaluate their grammatical knowledge. We explore five experimental methods inspired by prior work evaluating pretrained sentence representation models. We use a single linguistic phenomenon, negative polarity item (NPI) licensing in English, as a c… ▽ More

    Submitted 19 September, 2019; v1 submitted 5 September, 2019; originally announced September 2019.

    Comments: Accepted to EMNLP 2019; Added link to code+dataset

  13. arXiv:1907.08889  [pdf, other

    cs.CL

    The Unbearable Weight of Generating Artificial Errors for Grammatical Error Correction

    Authors: Phu Mon Htut, Joel Tetreault

    Abstract: In recent years, sequence-to-sequence models have been very effective for end-to-end grammatical error correction (GEC). As creating human-annotated parallel corpus for GEC is expensive and time-consuming, there has been work on artificial corpus generation with the aim of creating sentences that contain realistic grammatical errors from grammatically correct sentences. In this paper, we investiga… ▽ More

    Submitted 20 July, 2019; originally announced July 2019.

    Comments: To appear at ACL-BEA workshop 2019

  14. arXiv:1808.10000  [pdf, ps, other

    cs.CL

    Grammar Induction with Neural Language Models: An Unusual Replication

    Authors: Phu Mon Htut, Kyunghyun Cho, Samuel R. Bowman

    Abstract: A substantial thread of recent work on latent tree learning has attempted to develop neural network models with parse-valued latent variables and train them on non-parsing tasks, in the hope of having them discover interpretable tree structure. In a recent paper, Shen et al. (2018) introduce such a model and report near-state-of-the-art results on the target task of language modeling, and the firs… ▽ More

    Submitted 29 August, 2018; originally announced August 2018.

    Comments: To appear in Proceedings of EMNLP 2018 (short paper)

  15. arXiv:1804.04264  [pdf, other

    cs.CL cs.IR

    Training a Ranking Function for Open-Domain Question Answering

    Authors: Phu Mon Htut, Samuel R. Bowman, Kyunghyun Cho

    Abstract: In recent years, there have been amazing advances in deep learning methods for machine reading. In machine reading, the machine reader has to extract the answer from the given ground truth paragraph. Recently, the state-of-the-art machine reading models achieve human level performance in SQuAD which is a reading comprehension-style question answering (QA) task. The success of machine reading has i… ▽ More

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: To appear at NAACL-SRW 2018