Skip to main content

Showing 1–8 of 8 results for author: Uppaal, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.13967  [pdf, other

    cs.CL

    DeTox: Toxic Subspace Projection for Model Editing

    Authors: Rheeya Uppaal, Apratim Dey, Yiting He, Yiqiao Zhong, Junjie Hu

    Abstract: Recent alignment algorithms such as direct preference optimization (DPO) have been developed to improve the safety of large language models (LLMs) by training these models to match human behaviors exemplified by preference data. However, these methods are both computationally intensive and lacking in controllability and transparency, making them prone to jailbreaking and inhibiting their widesprea… ▽ More

    Submitted 28 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: Preprint

  2. arXiv:2401.17514  [pdf, other

    cs.CL

    How Useful is Continued Pre-Training for Generative Unsupervised Domain Adaptation?

    Authors: Rheeya Uppaal, Yixuan Li, Junjie Hu

    Abstract: Recent breakthroughs in scale have enabled the emergence of powerful generative language models, and the ability to fine-tune these models on various tasks by casting them into prompts or instructions. In this landscape, the problem of Unsupervised Domain Adaptation (UDA), or the problem of leveraging knowledge from a labeled source domain to an unlabeled target domain, has been left behind, with… ▽ More

    Submitted 1 April, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

  3. arXiv:2311.09661  [pdf, other

    cs.CL

    Evolving Domain Adaptation of Pretrained Language Models for Text Classification

    Authors: Yun-Shiuan Chuang, Yi Wu, Dhruv Gupta, Rheeya Uppaal, Ananya Kumar, Luhang Sun, Makesh Narsimhan Sreedhar, Sijia Yang, Timothy T. Rogers, Junjie Hu

    Abstract: Adapting pre-trained language models (PLMs) for time-series text classification amidst evolving domain shifts (EDS) is critical for maintaining accuracy in applications like stance detection. This study benchmarks the effectiveness of evolving domain adaptation (EDA) strategies, notably self-training, domain-adversarial training, and domain-adaptive pretraining, with a focus on an incremental self… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  4. arXiv:2305.13282  [pdf, other

    cs.CL cs.LG

    Is Fine-tuning Needed? Pre-trained Language Models Are Near Perfect for Out-of-Domain Detection

    Authors: Rheeya Uppaal, Junjie Hu, Yixuan Li

    Abstract: Out-of-distribution (OOD) detection is a critical task for reliable predictions over text. Fine-tuning with pre-trained language models has been a de facto procedure to derive OOD detectors with respect to in-distribution (ID) data. Despite its common use, the understanding of the role of fine-tuning and its necessity for OOD detection is largely unexplored. In this paper, we raise the question: i… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023

  5. arXiv:2103.00751  [pdf, other

    cs.CL

    Long Document Summarization in a Low Resource Setting using Pretrained Language Models

    Authors: Ahsaas Bajaj, Pavitra Dangati, Kalpesh Krishna, Pradhiksha Ashok Kumar, Rheeya Uppaal, Bradford Windsor, Eliot Brenner, Dominic Dotterrer, Rajarshi Das, Andrew McCallum

    Abstract: Abstractive summarization is the task of compressing a long document into a coherent short document while retaining salient information. Modern abstractive summarization methods are based on deep neural networks which often require large training datasets. Since collecting summarization datasets is an expensive and time-consuming task, practical industrial settings are usually low-resource. In thi… ▽ More

    Submitted 28 February, 2021; originally announced March 2021.

  6. arXiv:1911.07335  [pdf, other

    cs.CL cs.LG stat.ML

    Using Error Decay Prediction to Overcome Practical Issues of Deep Active Learning for Named Entity Recognition

    Authors: Haw-Shiuan Chang, Shankar Vembu, Sunil Mohan, Rheeya Uppaal, Andrew McCallum

    Abstract: Existing deep active learning algorithms achieve impressive sampling efficiency on natural language processing tasks. However, they exhibit several weaknesses in practice, including (a) inability to use uncertainty sampling with black-box models, (b) lack of robustness to labeling noise, and (c) lack of transparency. In response, we propose a transparent batch active sampling framework by estimati… ▽ More

    Submitted 20 July, 2020; v1 submitted 17 November, 2019; originally announced November 2019.

    Comments: This is a pre-print of an article published in Springer Machine Learning journal. The final authenticated version is available online at: https://doi.org/10.1007/s10994-020-05897-1

  7. arXiv:1909.06718  [pdf, other

    cs.LG cs.CV stat.ML

    LRS-DAG: Low Resource Supervised Domain Adaptation with Generalization Across Domains

    Authors: Rheeya Uppaal

    Abstract: Current state of the art methods in Domain Adaptation follow adversarial approaches, making training a challenge. Existing non-adversarial methods learn map**s between the source and target domains, to achieve reasonable performance. However, even these methods do not focus on a key aspect: maintaining performance on the source domain, even after optimizing over the target domain. Additionally,… ▽ More

    Submitted 14 November, 2019; v1 submitted 14 September, 2019; originally announced September 2019.

    Comments: 10 pages, 3 figures. Accepted to NewInML Workshop at NeurIPS, 2019

  8. arXiv:1905.00125  [pdf, other

    cs.LG eess.SP stat.ML

    Multi-resolution Networks For Flexible Irregular Time Series Modeling (Multi-FIT)

    Authors: Bhanu Pratap Singh, Iman Deznabi, Bharath Narasimhan, Bryon Kucharski, Rheeya Uppaal, Akhila Josyula, Madalina Fiterau

    Abstract: Missing values, irregularly collected samples, and multi-resolution signals commonly occur in multivariate time series data, making predictive tasks difficult. These challenges are especially prevalent in the healthcare domain, where patients' vital signs and electronic records are collected at different frequencies and have occasionally missing information due to the imperfections in equipment or… ▽ More

    Submitted 30 April, 2019; originally announced May 2019.