Showing 1–12 of 12 results for author: Wettig, A

  1. arXiv:2406.16778

    cs.CL

    Finding Transformer Circuits with Edge Pruning

    Authors: Adithya Bhaskar, Alexander Wettig, Dan Friedman, Danqi Chen

    Abstract: The path to interpreting a language model often proceeds via analysis of circuits -- sparse computational subgraphs of the model that capture specific aspects of its behavior. Recent work has automated the task of discovering circuits. Yet, these methods have practical limitations, as they rely either on inefficient search algorithms or inaccurate approximations. In this paper, we frame automated…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: We release our code and data publicly at https://github.com/princeton-nlp/Edge-Pruning

  2. arXiv:2405.15793

    cs.SE cs.AI cs.CL cs.HC cs.LG

    SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

    Authors: John Yang, Carlos E. Jimenez, Alexander Wettig, Kilian Lieret, Shunyu Yao, Karthik Narasimhan, Ofir Press

    Abstract: Language model (LM) agents are increasingly being used to automate complicated tasks in digital environments. Just as humans benefit from powerful software applications, such as integrated development environments, for complex tasks like software engineering, we posit that LM agents represent a new category of end users with their own needs and abilities, and would benefit from specially-built int…

    Submitted 30 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: Code, data, and demo available at https://swe-agent.com

  3. arXiv:2402.11111

    cs.CL

    Language Models as Science Tutors

    Authors: Alexis Chevalier, Jiayi Geng, Alexander Wettig, Howard Chen, Sebastian Mizera, Toni Annala, Max Jameson Aragon, Arturo Rodríguez Fanlo, Simon Frieder, Simon Machado, Akshara Prabhakar, Ellie Thieu, Jiachen T. Wang, Zirui Wang, Xindi Wu, Mengzhou Xia, Wenhan Jia, Jiatong Yu, Jun-Jie Zhu, Zhiyong Jason Ren, Sanjeev Arora, Danqi Chen

    Abstract: NLP has recently made exciting progress toward training language models (LMs) with strong scientific problem-solving skills. However, model development has not focused on real-life use-cases of LMs for science, including applications in education that require processing long scientific documents. To address this, we introduce TutorEval and TutorChat. TutorEval is a diverse question-answering bench…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 8 pages without bibliography and appendix, 26 pages total

  4. arXiv:2402.09739

    cs.CL cs.LG

    QuRating: Selecting High-Quality Data for Training Language Models

    Authors: Alexander Wettig, Aatmik Gupta, Saumya Malik, Danqi Chen

    Abstract: Selecting high-quality pre-training data is important for creating capable language models, but existing methods rely on simple heuristics. We introduce QuRating, a method for selecting pre-training data that can capture human intuitions about data quality. In this paper, we investigate four qualities - writing style, required expertise, facts & trivia, and educational value - and find that LLMs a…

    Submitted 13 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted at ICML 2024. The results for top-k selection have been corrected. The code, models and data are available at https://github.com/princeton-nlp/QuRating. arXiv admin note: text overlap with arXiv:2002.04059, arXiv:hep-th/9607006, arXiv:2107.06981, arXiv:2008.09340 by other authors

  5. arXiv:2310.19156

    cs.CL cs.IR

    Poisoning Retrieval Corpora by Injecting Adversarial Passages

    Authors: Zexuan Zhong, Ziqing Huang, Alexander Wettig, Danqi Chen

    Abstract: Dense retrievers have achieved state-of-the-art performance in various information retrieval tasks, but to what extent can they be safely deployed in real-world applications? In this work, we propose a novel attack for dense retrieval systems in which a malicious user generates a small number of adversarial passages by perturbing discrete tokens to maximize similarity with a provided set of traini…

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023. Our code is available at https://github.com/princeton-nlp/corpus-poisoning

  6. arXiv:2310.06770

    cs.CL cs.AI cs.SE

    SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

    Authors: Carlos E. Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, Karthik Narasimhan

    Abstract: Language models have outpaced our ability to evaluate them effectively, but for their future development it is essential to study the frontier of their capabilities. We find real-world software engineering to be a rich, sustainable, and challenging testbed for evaluating the next generation of language models. To this end, we introduce SWE-bench, an evaluation framework consisting of $2,294$ softw…

    Submitted 5 April, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Data, code, and leaderboard are available at https://www.swebench.com ICLR 2024, https://openreview.net/forum?id=VTF8yNQM66

  7. arXiv:2306.01128

    cs.LG cs.CL

    Learning Transformer Programs

    Authors: Dan Friedman, Alexander Wettig, Danqi Chen

    Abstract: Recent research in mechanistic interpretability has attempted to reverse-engineer Transformer models by carefully inspecting network weights and activations. However, these approaches require considerable manual effort and still fall short of providing complete, faithful descriptions of the underlying algorithms. In this work, we introduce a procedure for training Transformers that are mechanistic…

    Submitted 30 October, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 (oral). Our code is available at https://github.com/princeton-nlp/TransformerPrograms

  8. arXiv:2305.14788

    cs.CL

    Adapting Language Models to Compress Contexts

    Authors: Alexis Chevalier, Alexander Wettig, Anirudh Ajith, Danqi Chen

    Abstract: Transformer-based language models (LMs) are powerful and widely-applicable tools, but their usefulness is constrained by a finite context window and the expensive computational cost of processing long text documents. We propose to adapt pre-trained LMs into AutoCompressors. These language models are capable of compressing long contexts into compact summary vectors, which are then accessible to the…

    Submitted 4 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted to EMNLP 2023; added results for Llama-2-7B model

  9. arXiv:2210.11560

    cs.CL

    Finding Dataset Shortcuts with Grammar Induction

    Authors: Dan Friedman, Alexander Wettig, Danqi Chen

    Abstract: Many NLP datasets have been found to contain shortcuts: simple decision rules that achieve surprisingly high accuracy. However, it is difficult to discover shortcuts automatically. Prior work on automatic shortcut detection has focused on enumerating features like unigrams or bigrams, which can find only low-level shortcuts, or relied on post-hoc model interpretability methods like saliency maps,…

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022. Our code is publicly available at https://github.com/princeton-nlp/ShortcutGrammar

  10. arXiv:2210.05643

    cs.LG cs.CL

    A Kernel-Based View of Language Model Fine-Tuning

    Authors: Sadhika Malladi, Alexander Wettig, Dingli Yu, Danqi Chen, Sanjeev Arora

    Abstract: It has become standard to solve NLP tasks by fine-tuning pre-trained language models (LMs), especially in low-data settings. There is minimal theoretical understanding of empirical success, e.g., why fine-tuning a model with $10^8$ or more parameters on a couple dozen training points does not result in overfitting. We investigate whether the Neural Tangent Kernel (NTK) - which originated as a mode…

    Submitted 6 June, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted at ICML 2023. Code and pre-computed kernels are publicly available at https://github.com/princeton-nlp/LM-Kernel-FT

  11. arXiv:2202.08005

    cs.CL cs.LG

    Should You Mask 15% in Masked Language Modeling?

    Authors: Alexander Wettig, Tianyu Gao, Zexuan Zhong, Danqi Chen

    Abstract: Masked language models (MLMs) conventionally mask 15% of tokens due to the belief that more masking would leave insufficient context to learn good representations; this masking rate has been widely used, regardless of model sizes or masking strategies. In this work, we revisit this important choice of MLM pre-training. We first establish that 15% is not universally optimal, and larger models shoul…

    Submitted 10 February, 2023; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: Accepted to EACL 2023. The code and pre-trained models are available at https://github.com/princeton-nlp/DinkyTrain

  12. arXiv:2109.08133

    cs.CL cs.IR

    Phrase Retrieval Learns Passage Retrieval, Too

    Authors: Jinhyuk Lee, Alexander Wettig, Danqi Chen

    Abstract: Dense retrieval methods have shown great promise over sparse retrieval methods in a range of NLP problems. Among them, dense phrase retrieval - the most fine-grained retrieval unit - is appealing because phrases can be directly used as the output for question answering and slot filling tasks. In this work, we follow the intuition that retrieving phrases naturally entails retrieving larger text blocks…

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021. Code available at https://github.com/princeton-nlp/DensePhrases