Skip to main content

Showing 1–25 of 25 results for author: Flanigan, J

.
  1. arXiv:2406.07933  [pdf, other

    cs.CL cs.AI cs.LG

    Large Language Model Unlearning via Embedding-Corrupted Prompts

    Authors: Chris Yuhao Liu, Yaxuan Wang, Jeffrey Flanigan, Yang Liu

    Abstract: Large language models (LLMs) have advanced to encompass extensive knowledge across diverse domains. Yet controlling what a large language model should not know is important for ensuring alignment and thus safe use. However, accurately and efficiently unlearning knowledge from an LLM remains challenging due to the potential collateral damage caused by the fuzzy boundary between retention and forget… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 55 pages, 4 figures, 66 tables

  2. arXiv:2404.15219  [pdf, other

    cs.CL

    The Power of the Noisy Channel: Unsupervised End-to-End Task-Oriented Dialogue with LLMs

    Authors: Brendan King, Jeffrey Flanigan

    Abstract: Training task-oriented dialogue systems typically requires turn-level annotations for interacting with their APIs: e.g. a dialogue state and the system actions taken at each step. These annotations can be costly to produce, error-prone, and require both domain and annotation expertise. With advances in LLMs, we hypothesize unlabelled data and a schema definition are sufficient for building a worki… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 16 Pages, 7 Figures

  3. arXiv:2404.10297  [pdf, other

    cs.CL cs.AI

    Future Language Modeling from Temporal Document History

    Authors: Changmao Li, Jeffrey Flanigan

    Abstract: Predicting the future is of great interest across many aspects of human activity. Businesses are interested in future trends, traders are interested in future stock prices, and companies are highly interested in future technological breakthroughs. While there are many automated systems for predicting future numerical data, such as weather, stock prices, and demand for products, there is relatively… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Accepted by ICLR 2024

    ACM Class: I.2.7

  4. arXiv:2312.16337  [pdf, other

    cs.CL

    Task Contamination: Language Models May Not Be Few-Shot Anymore

    Authors: Changmao Li, Jeffrey Flanigan

    Abstract: Large language models (LLMs) offer impressive performance in various zero-shot and few-shot tasks. However, their success in zero-shot and few-shot settings may be affected by task contamination, a potential limitation that has not been thoroughly examined. This paper investigates how zero-shot and few-shot performance of LLMs has changed chronologically over time. Utilizing GPT-3 series models an… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI 2024

    ACM Class: I.2.7

  5. arXiv:2312.03951  [pdf, other

    cs.LG stat.ML

    Understanding the Role of Optimization in Double Descent

    Authors: Chris Yuhao Liu, Jeffrey Flanigan

    Abstract: The phenomenon of model-wise double descent, where the test error peaks and then reduces as the model size increases, is an interesting topic that has attracted the attention of researchers due to the striking observed gap between theory and practice \citep{Belkin2018ReconcilingMM}. Additionally, while double descent has been observed in various tasks and architectures, the peak of double descent… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: NeurIPS Workshop 2023 Optimization for Machine Learning

  6. arXiv:2310.07957  [pdf, other

    cs.CL cs.AI

    A New Approach Towards Autoformalization

    Authors: Nilay Patel, Rahul Saha, Jeffrey Flanigan

    Abstract: Verifying mathematical proofs is difficult, but can be automated with the assistance of a computer. Autoformalization is the task of automatically translating natural language mathematics into a formal language that can be verified by a program. This is a challenging task, and especially for higher-level mathematics found in research papers. Research paper mathematics requires large amounts of bac… ▽ More

    Submitted 19 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: Under review at MATHAI 2023 @ NeurIPS 2023

  7. arXiv:2309.13748  [pdf, other

    cs.CL

    Does the "most sinfully decadent cake ever" taste good? Answering Yes/No Questions from Figurative Contexts

    Authors: Geetanjali Rakshit, Jeffrey Flanigan

    Abstract: Figurative language is commonplace in natural language, and while making communication memorable and creative, can be difficult to understand. In this work, we investigate the robustness of Question Answering (QA) models on figurative text. Yes/no questions, in particular, are a useful probe of figurative language understanding capabilities of large language models. We propose FigurativeQA, a set… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

    Comments: Accepted at RANLP 2023

  8. arXiv:2307.01453  [pdf, other

    cs.CL

    Diverse Retrieval-Augmented In-Context Learning for Dialogue State Tracking

    Authors: Brendan King, Jeffrey Flanigan

    Abstract: There has been significant interest in zero and few-shot learning for dialogue state tracking (DST) due to the high cost of collecting and annotating task-oriented dialogues. Recent work has demonstrated that in-context learning requires very little data and zero parameter updates, and even outperforms trained methods in the few-shot setting (Hu et al. 2022). We propose RefPyDST, which advances th… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: 14 pages, 2 figures, to appear in Findings of the ACL 2023

  9. arXiv:2302.12944  [pdf, other

    cs.CL cs.AI

    Dependency Dialogue Acts -- Annotation Scheme and Case Study

    Authors: Jon Z. Cai, Brendan King, Margaret Perkoff, Shiran Dudy, Jie Cao, Marie Grace, Natalia Wojarnik, Ananya Ganesh, James H. Martin, Martha Palmer, Marilyn Walker, Jeffrey Flanigan

    Abstract: In this paper, we introduce Dependency Dialogue Acts (DDA), a novel framework for capturing the structure of speaker-intentions in multi-party dialogues. DDA combines and adapts features from existing dialogue annotation frameworks, and emphasizes the multi-relational response structure of dialogues in addition to the dialogue acts and rhetorical relations. It represents the functional, discourse,… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: The 13th International Workshop on Spoken Dialogue Systems Technology

    Journal ref: The 13th International Workshop on Spoken Dialogue Systems Technology 2023

  10. arXiv:2212.08565  [pdf, other

    cs.CL

    Automatic Identification of Motivation for Code-Switching in Speech Transcripts

    Authors: Ritu Belani, Jeffrey Flanigan

    Abstract: Code-switching, or switching between languages, occurs for many reasons and has important linguistic, sociological, and cultural implications. Multilingual speakers code-switch for a variety of purposes, such as expressing emotions, borrowing terms, making jokes, introducing a new topic, etc. The reason for code-switching may be quite useful for analysis, but is not readily apparent. To remedy thi… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

  11. arXiv:2207.06960  [pdf, other

    cs.CL cs.AI cs.LG

    Forming Trees with Treeformers

    Authors: Nilay Patel, Jeffrey Flanigan

    Abstract: Human language is known to exhibit a nested, hierarchical structure, allowing us to form complex sentences out of smaller pieces. However, many state-of-the-art neural networks models such as Transformers have no explicit hierarchical structure in its architecture -- that is, they don't have an inductive bias toward hierarchical structure. Additionally, Transformers are known to perform poorly on… ▽ More

    Submitted 10 July, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: Accepted to RANLP 2023

  12. arXiv:2112.08513  [pdf, other

    cs.CL

    DocAMR: Multi-Sentence AMR Representation and Evaluation

    Authors: Tahira Naseem, Austin Blodgett, Sadhana Kumaravel, Tim O'Gorman, Young-Suk Lee, Jeffrey Flanigan, Ramón Fernandez Astudillo, Radu Florian, Salim Roukos, Nathan Schneider

    Abstract: Despite extensive research on parsing of English sentences into Abstraction Meaning Representation (AMR) graphs, which are compared to gold graphs via the Smatch metric, full-document parsing into a unified graph representation lacks well-defined representation and evaluation. Taking advantage of a super-sentential level of coreference annotation from previous work, we introduce a simple algorithm… ▽ More

    Submitted 6 May, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    MSC Class: I.2.7

  13. arXiv:2105.10023  [pdf, other

    cs.CL

    ASQ: Automatically Generating Question-Answer Pairs using AMRs

    Authors: Geetanjali Rakshit, Jeffrey Flanigan

    Abstract: We introduce ASQ, a tool to automatically mine questions and answers from a sentence using the Abstract Meaning Representation (AMR). Previous work has used question-answer pairs to specify the predicate-argument structure of a sentence using natural language, which does not require linguistic expertise or training, and created datasets such as QA-SRL and QAMR, for which the question-answer pair a… ▽ More

    Submitted 20 August, 2021; v1 submitted 20 May, 2021; originally announced May 2021.

  14. arXiv:2011.10683  [pdf, other

    cs.CL

    Athena: Constructing Dialogues Dynamically with Discourse Constraints

    Authors: Vrindavan Harrison, Juraj Juraska, Wen Cui, Lena Reed, Kevin K. Bowden, Jiaqi Wu, Brian Schwarzmann, Abteen Ebrahimi, Rishi Rajasekaran, Nikhil Varghese, Max Wechsler-Azen, Steve Whittaker, Jeffrey Flanigan, Marilyn Walker

    Abstract: This report describes Athena, a dialogue system for spoken conversation on popular topics and current events. We develop a flexible topic-agnostic approach to dialogue management that dynamically configures dialogue based on general principles of entity and topic coherence. Athena's dialogue manager uses a contract-based method where discourse constraints are dispatched to clusters of response gen… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

    Comments: 3rd Proceedings of Alexa Prize (Alexa Prize 2019)

  15. arXiv:1905.06939  [pdf, other

    cs.CL cs.LG

    The Materials Science Procedural Text Corpus: Annotating Materials Synthesis Procedures with Shallow Semantic Structures

    Authors: Sheshera Mysore, Zach Jensen, Edward Kim, Kevin Huang, Haw-Shiuan Chang, Emma Strubell, Jeffrey Flanigan, Andrew McCallum, Elsa Olivetti

    Abstract: Materials science literature contains millions of materials synthesis procedures described in unstructured natural language text. Large-scale analysis of these synthesis procedures would facilitate deeper scientific understanding of materials synthesis and enable automated synthesis planning. Such analysis requires extracting structured representations of synthesis procedures from the raw text as… ▽ More

    Submitted 13 July, 2019; v1 submitted 16 May, 2019; originally announced May 2019.

    Comments: Accepted as a long paper at the Linguistic Annotation Workshop (LAW) at ACL 2019

  16. The Green Bank North Celestial Cap Pulsar Survey. IV: Four New Timing Solutions

    Authors: R. J. Aloisi, A. Cruz, L. Daniels, N. Meyers, R. Roekle, A. Schuett, J. K. Swiggum, M. E. DeCesar, D. L. Kaplan, R. S. Lynch, K. Stovall, Lina Levin, A. M. Archibald, S. Banaszak, C. M. Biwer, J. Boyles, P. Chawla, L. P. Dartez, B. Cui, D. F. Day, A. J. Ford, J. Flanigan, E. Fonseca, J. W. T. Hessels, J. Hinojosa , et al. (18 additional authors not shown)

    Abstract: We present timing solutions for four pulsars discovered in the Green Bank Northern Celestial Cap (GBNCC) survey. All four pulsars are isolated with spin periods between 0.26$\,$s and 1.84$\,$s. PSR J0038$-$2501 has a 0.26$\,$s period and a period derivative of ${7.6} \times {10}^{-19}\,{\rm s\,s}^{-1}$, which is unusually low for isolated pulsars with similar periods. This low period derivative ma… ▽ More

    Submitted 8 March, 2019; originally announced March 2019.

    Comments: 7 pages, 5 figures

  17. arXiv:1805.10399  [pdf, other

    cs.CL

    Toward Abstractive Summarization Using Semantic Representations

    Authors: Fei Liu, Jeffrey Flanigan, Sam Thomson, Norman Sadeh, Noah A. Smith

    Abstract: We present a novel abstractive summarization framework that draws on the recent development of a treebank for the Abstract Meaning Representation (AMR). In this framework, the source text is parsed to a set of AMR graphs, the graphs are transformed into a summary graph, and then text is generated from the summary graph. We focus on the graph-to-graph transformation that reduces the source semantic… ▽ More

    Submitted 25 May, 2018; originally announced May 2018.

    Comments: 10 pages

  18. The Green Bank North Celestial Cap Pulsar Survey III: 45 New Pulsar Timing Solutions

    Authors: Ryan S. Lynch, Joseph K. Swiggum, Vlad I. Kondratiev, David L. Kaplan, Kevin Stovall, Emmanuel Fonseca, Mallory S. E. Roberts, Lina Levin, Megan E. DeCesar, Bingyi Cui, S. Bradley Cenko, Pradip Gatkine, Anne M. Archibald, Shawn Banaszak, Christopher M. Biwer, Jason Boyles, Pragya Chawla, Louis P. Dartez, David Day, Anthony J. Ford, Joseph Flanigan, Jason W. T. Hessels, Jesus Hinojosa, Fredrick A. Jenet, Chen Karako-Argaman , et al. (15 additional authors not shown)

    Abstract: We provide timing solutions for 45 radio pulsars discovered by the Robert C. Byrd Green Bank Telescope. These pulsars were found in the Green Bank North Celestial Cap pulsar survey, an all-GBT-sky survey being carried out at a frequency of 350 MHz. We include pulsar timing data from the Green Bank Telescope and Low Frequency Array. Our sample includes five fully recycled millisecond pulsars (MSPs,… ▽ More

    Submitted 13 May, 2018; originally announced May 2018.

    Comments: Accepted for publication in The Astrophysical Journal; 30 pages, 9 figures, 8 tables

    Journal ref: Astrophysical Journal, 2018, 859, 93

  19. The Green Bank Northern Celestial Cap Pulsar Survey II: The Discovery and Timing of Ten Pulsars

    Authors: A. M. Kawash, M. A. McLaughlin, D. L. Kaplan, M. E. DeCesar, L. Levin, D. R. Lorimer, R. S. Lynch, K. Stovall, J. K. Swiggum, E. Fonseca, A. M. Archibald, S. Banaszak, C. M. Biwer, J. Boyles, B. Cui, L. P. Dartez, D. Day, S. Ernst, A. J. Ford, J. Flanigan, S. A. Heatherly, J. W. T. Hessels, J. Hinojosa, F. A. Jenet, C. Karako-Argaman , et al. (19 additional authors not shown)

    Abstract: We present timing solutions for ten pulsars discovered in 350 MHz searches with the Green Bank Telescope. Nine of these were discovered in the Green Bank Northern Celestial Cap survey and one was discovered by students in the Pulsar Search Collaboratory program in analysis of drift-scan data. Following discovery and confirmation with the Green Bank Telescope, timing has yielded phase-connected sol… ▽ More

    Submitted 9 March, 2018; originally announced March 2018.

    Comments: 9 pages, 5 figures

  20. Search for transient gravitational waves in coincidence with short duration radio transients during 2007-2013

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, others, :, B. P. Abbott, R. Abbott, T. D. Abbott, M. R. Abernathy, F. Acernese, K. Ackley, C. Adams, T. Adams, P. Addesso, R. X. Adhikari, V. B. Adya, C. Affeldt, M. Agathos, K. Agatsuma, N. Aggarwal, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, B. Allen, A. Allocca , et al. (977 additional authors not shown)

    Abstract: We present an archival search for transient gravitational-wave bursts in coincidence with 27 single pulse triggers from Green Bank Telescope pulsar surveys, using the LIGO, Virgo and GEO interferometer network. We also discuss a check for gravitational-wave signals in coincidence with Parkes Fast Radio Bursts using similar methods. Data analyzed in these searches were collected between 2007 and 20… ▽ More

    Submitted 21 June, 2016; v1 submitted 5 May, 2016; originally announced May 2016.

    Report number: LIGO-P1400154

    Journal ref: Phys. Rev. D 93, 122008 (2016)

  21. The Physics of the B Factories

    Authors: A. J. Bevan, B. Golob, Th. Mannel, S. Prell, B. D. Yabsley, K. Abe, H. Aihara, F. Anulli, N. Arnaud, T. Aushev, M. Beneke, J. Beringer, F. Bianchi, I. I. Bigi, M. Bona, N. Brambilla, J. B rodzicka, P. Chang, M. J. Charles, C. H. Cheng, H. -Y. Cheng, R. Chistov, P. Colangelo, J. P. Coleman, A. Drutskoy , et al. (2009 additional authors not shown)

    Abstract: This work is on the Physics of the B Factories. Part A of this book contains a brief description of the SLAC and KEK B Factories as well as their detectors, BaBar and Belle, and data taking related issues. Part B discusses tools and methods used by the experiments in order to obtain results. The results themselves can be found in Part C. Please note that version 3 on the archive is the auxiliary… ▽ More

    Submitted 31 October, 2015; v1 submitted 24 June, 2014; originally announced June 2014.

    Comments: 928 pages, version 3 (arXiv:1406.6311v3) corresponds to the alpha, beta, gamma version of the book, the other versions use the phi1, phi2, phi3 notation

    Report number: SLAC-PUB-15968, KEK Preprint 2014-3

    Journal ref: Eur. Phys. J. C74 (2014) 3026

  22. arXiv:1406.5214  [pdf, other

    astro-ph.HE astro-ph.GA

    The Green Bank Northern Celestial Cap Pulsar Survey - I: Survey Description, Data Analysis, and Initial Results

    Authors: K. Stovall, R. S. Lynch, S. M. Ransom, A. M. Archibald, S. Banaszak, C. M. Biwer, J. Boyles, L. P. Dartez, D. Day, A. J. Ford, J. Flanigan, A. Garcia, J. W. T. Hessels, J. Hinojosa, F. A. Jenet, D. L. Kaplan, C. Karako-Argaman, V. M. Kaspi, V. I. Kondratiev, S. Leake, D. R. Lorimer, G. Lunsford, J. G. Martinez, A. Mata, M. A. McLaughlin , et al. (7 additional authors not shown)

    Abstract: We describe an ongoing search for pulsars and dispersed pulses of radio emission, such as those from rotating radio transients (RRATs) and fast radio bursts (FRBs), at 350 MHz using the Green Bank Telescope. With the Green Bank Ultimate Pulsar Processing Instrument, we record 100 MHz of bandwidth divided into 4,096 channels every 81.92 $μs$. This survey will cover the entire sky visible to the Gre… ▽ More

    Submitted 19 June, 2014; originally announced June 2014.

    Comments: 18 pages, 10 figures, 5 tables, accepted by ApJ

  23. Searching for pulsars using image pattern recognition

    Authors: W. W. Zhu, A. Berndsen, E. C. Madsen, M. Tan, I. H. Stairs, A. Brazier, P. Lazarus, R. Lynch, P. Scholz, K. Stovall, S. M. Ransom, S. Banaszak, C. M. Biwer, S. Cohen, L. P. Dartez, J. Flanigan, G. Lunsford, J. G. Martinez, A. Mata, M. Rohr, A. Walker, B. Allen, N. D. R. Bhat, S. Bogdanov, F. Camilo , et al. (19 additional authors not shown)

    Abstract: In this paper, we present a novel artificial intelligence (AI) program that identifies pulsars from recent surveys using image pattern recognition with deep neural nets---the PICS (Pulsar Image-based Classification System) AI. The AI mimics human experts and distinguishes pulsars from noise and interferences by looking for patterns from candidate. The information from each pulsar candidate is synt… ▽ More

    Submitted 17 December, 2013; v1 submitted 3 September, 2013; originally announced September 2013.

    Comments: 29 pages, 9 figures, two tables, accepted by ApJ

    Journal ref: ApJ 781 (2014) 117

  24. arXiv:1305.0447  [pdf, ps, other

    astro-ph.IM astro-ph.GA

    PEACE: Pulsar Evaluation Algorithm for Candidate Extraction -- A software package for post-analysis processing of pulsar survey candidates

    Authors: K. J. Lee, K. Stovall, F. A. Jenet, J. Martinez, L. P. Dartez, A. Mata, G. Lunsford, S. Cohen, C. . M. Biwer, M. Rohr, J. Flanigan, A. Walker, S. Banaszak, B. Allen, E. D. Barr, N. D. R. Bhat, S. Bogdanov, A. Brazier, F. Camilo, D. J. Champion, S. Chatterjee, J. Cordes, F. Crawford, J. Deneva, G. Desvignes , et al. (19 additional authors not shown)

    Abstract: Modern radio pulsar surveys produce a large volume of prospective candidates, the majority of which are polluted by human-created radio frequency interference or other forms of noise. Typically, large numbers of candidates need to be visually inspected in order to determine if they are real pulsars. This process can be labor intensive. In this paper, we introduce an algorithm called PEACE (Pulsar… ▽ More

    Submitted 2 May, 2013; originally announced May 2013.

    Comments: 7 pages, 4 figures, accepted by MNRAS

  25. The hunt for new pulsars with the Green Bank Telescope

    Authors: Ryan S. Lynch, Anne M. Archibald, Shawn Banaszak, Alison Becker, Aaron Berndsen, Chris Biwer, Jason Boyles, Rogerio F. Cardoso, Angus Cherry, Louis P. Dartez, David Day, Courtney R. Epstein, Joe Flanigan, Anthony Ford, Alejandro Garcia, Jason W. T. Hessels, Fredrick A. Jenet, David L. Kaplan, Chen Karako-Argaman, Victoria M. Kaspi, Vladislav I. Kondratiev, Duncan R. Lorimer, Grady Lunsford, Jose Martinez, Maura A. McLaughlin , et al. (11 additional authors not shown)

    Abstract: The Green Bank Telescope (GBT) is the largest fully steerable radio telescope in the world and is one of our greatest tools for discovering and studying radio pulsars. Over the last decade, the GBT has successfully found over 100 new pulsars through large-area surveys. Here I discuss the two most recent---the GBT 350 MHz Drift-scan survey and the Green Bank North Celestial Cap survey. The primary… ▽ More

    Submitted 21 March, 2013; originally announced March 2013.

    Comments: Proceedings of IAUS 291 "Neutron Stars and Pulsars: Challenges and Opportunities after 80 years", J. van Leeuwen (ed.); 6 pages, 2 figures