Showing 1–8 of 8 results for author: Meister, N

  1. arXiv:2311.12233

    cs.CL

    Unifying Corroborative and Contributive Attributions in Large Language Models

    Authors: Theodora Worledge, Judy Hanwen Shen, Nicole Meister, Caleb Winston, Carlos Guestrin

    Abstract: As businesses, products, and services spring up around large language models, the trustworthiness of these models hinges on the verifiability of their outputs. However, methods for explaining language model outputs largely fall across two distinct fields of study which both use the term "attribution" to refer to entirely separate techniques: citation generation and training data attribution. In ma…

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: NeurIPS ATTRIB Workshop 2023

  2. arXiv:2310.17623

    cs.CL cs.LG

    Proving Test Set Contamination in Black Box Language Models

    Authors: Yonatan Oren, Nicole Meister, Niladri Chatterji, Faisal Ladhak, Tatsunori B. Hashimoto

    Abstract: Large language models are trained on vast amounts of internet data, prompting concerns and speculation that they have memorized public benchmarks. Going from speculation to proof of contamination is challenging, as the pretraining data used by proprietary models are often not publicly accessible. We show that it is possible to provide provable guarantees of test set contamination in language model…

    Submitted 23 November, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

  3. arXiv:2209.04951

    cs.CL

    Improving Keyphrase Extraction with Data Augmentation and Information Filtering

    Authors: Amir Pouran Ben Veyseh, Nicole Meister, Franck Dernoncourt, Thien Huu Nguyen

    Abstract: Keyphrase extraction is one of the essential tasks for document understanding in NLP. While the majority of the prior works are dedicated to the formal setting, e.g., books, news or web-blogs, informal texts such as video transcripts are less explored. To address this limitation, in this work we present a novel corpus and method for keyphrase extraction from the transcripts of the videos streamed…

    Submitted 11 September, 2022; originally announced September 2022.

  4. arXiv:2206.09191

    cs.CV

    Gender Artifacts in Visual Datasets

    Authors: Nicole Meister, Dora Zhao, Angelina Wang, Vikram V. Ramaswamy, Ruth Fong, Olga Russakovsky

    Abstract: Gender biases are known to exist within large-scale visual datasets and can be reflected or even amplified in downstream models. Many prior works have proposed methods for mitigating gender biases, often by attempting to remove gender expression information from images. To understand the feasibility and practicality of these approaches, we investigate what $\textit{gender artifacts}$ exist within…

    Submitted 17 September, 2023; v1 submitted 18 June, 2022; originally announced June 2022.

    Comments: ICCV 2023

  5. arXiv:2206.07690

    cs.CV cs.LG

    ELUDE: Generating interpretable explanations via a decomposition into labelled and unlabelled features

    Authors: Vikram V. Ramaswamy, Sunnie S. Y. Kim, Nicole Meister, Ruth Fong, Olga Russakovsky

    Abstract: Deep learning models have achieved remarkable success in different areas of machine learning over the past decade; however, the size and complexity of these models make them difficult to understand. In an effort to make them more interpretable, several recent works focus on explaining parts of a deep neural network through human-interpretable, semantic attributes. However, it may be impossible to…

    Submitted 16 June, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

  6. arXiv:2202.09694

    cs.CL

    MACRONYM: A Large-Scale Dataset for Multilingual and Multi-Domain Acronym Extraction

    Authors: Amir Pouran Ben Veyseh, Nicole Meister, Seunghyun Yoon, Rajiv Jain, Franck Dernoncourt, Thien Huu Nguyen

    Abstract: Acronym extraction is the task of identifying acronyms and their expanded forms in texts that is necessary for various NLP applications. Despite major progress for this task in recent years, one limitation of existing AE research is that they are limited to the English language and certain domains (i.e., scientific and biomedical). As such, challenges of AE in other languages and domains is mainly…

    Submitted 19 February, 2022; originally announced February 2022.

  7. arXiv:2112.03184

    cs.CV

    HIVE: Evaluating the Human Interpretability of Visual Explanations

    Authors: Sunnie S. Y. Kim, Nicole Meister, Vikram V. Ramaswamy, Ruth Fong, Olga Russakovsky

    Abstract: As AI technology is increasingly applied to high-impact, high-risk domains, there have been a number of new methods aimed at making AI models more human interpretable. Despite the recent growth of interpretability work, there is a lack of systematic evaluation of proposed techniques. In this work, we introduce HIVE (Human Interpretability of Visual Explanations), a novel human evaluation framework…

    Submitted 21 July, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: ECCV 2022. Code and supplementary material are at https://princetonvisualai.github.io/HIVE

  8. [Re] Don't Judge an Object by Its Context: Learning to Overcome Contextual Bias

    Authors: Sunnie S. Y. Kim, Sharon Zhang, Nicole Meister, Olga Russakovsky

    Abstract: Singh et al. (2020) point out the dangers of contextual bias in visual recognition datasets. They propose two methods, CAM-based and feature-split, that better recognize an object or attribute in the absence of its typical context while maintaining competitive within-context accuracy. To verify their performance, we attempted to reproduce all 12 tables in the original paper, including those in the…

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: ML Reproducibility Challenge 2020. Accepted for publication in the ReScience C journal.