Showing 1–14 of 14 results for author: Benton, A

  1. arXiv:2404.09221  [pdf, other]

    cs.CL cs.AI cs.LG

    Exploring and Improving Drafts in Blockwise Parallel Decoding

    Authors: Taehyeon Kim, Ananda Theertha Suresh, Kishore Papineni, Michael Riley, Sanjiv Kumar, Adrian Benton

    Abstract: Despite the remarkable strides made by autoregressive language models, their potential is often hampered by the slow inference speeds inherent in sequential token generation. Blockwise parallel decoding (BPD) was proposed by Stern et al. as a method to improve inference speed of language models by simultaneously predicting multiple future tokens, termed block drafts, which are subsequently verifie…

    Submitted 5 June, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

  2. arXiv:2310.13167  [pdf, other]

    cs.HC

    Visualizing Causality in Mixed Reality for Manual Task Learning: An Exploratory Study

    Authors: Rahul Jain, Jingyu Shi, Andrew Benton, Moiz Rasheed, Hyungjun Doh, Subramanian Chidambaram, Karthik Ramani

    Abstract: Mixed Reality (MR) is gaining prominence in manual task skill learning due to its in-situ, embodied, and immersive experience. To teach manual tasks, current methodologies break the task into hierarchies (tasks into subtasks) and visualize the current subtask and future in terms of causality. Existing psychology literature also shows that humans learn tasks by breaking them into hierarchies. In or…

    Submitted 31 January, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

  3. arXiv:2309.14894  [pdf, other]

    cs.RO cs.LG eess.SY

    Verifiable Learned Behaviors via Motion Primitive Composition: Applications to Scooping of Granular Media

    Authors: Andrew Benton, Eugen Solowjow, Prithvi Akella

    Abstract: A robotic behavior model that can reliably generate behaviors from natural language inputs in real time would substantially expedite the adoption of industrial robots due to enhanced system flexibility. To facilitate these efforts, we construct a framework in which learned behaviors, created by a natural language abstractor, are verifiable by construction. Leveraging recent advancements in motion…

    Submitted 26 September, 2023; originally announced September 2023.

  4. arXiv:2301.10371  [pdf, other]

    cs.CL cs.AI cs.LG

    Weakly Supervised Headline Dependency Parsing

    Authors: Adrian Benton, Tianze Shi, Ozan İrsoy, Igor Malioutov

    Abstract: English news headlines form a register with unique syntactic properties that have been documented in linguistics literature since the 1930s. However, headlines have received surprisingly little attention from the NLP syntactic parsing community. We aim to bridge this gap by providing the first news headline corpus of Universal Dependencies annotated syntactic dependency trees, which enables us to…

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: Findings of EMNLP 2022

    ACM Class: I.2.7

    Journal ref: In Proceedings of Findings of EMNLP 2022

  5. arXiv:2205.11505  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    What Makes Data-to-Text Generation Hard for Pretrained Language Models?

    Authors: Moniba Keymanesh, Adrian Benton, Mark Dredze

    Abstract: Expressing natural language descriptions of structured facts or relations -- data-to-text generation (D2T) -- increases the accessibility of structured knowledge repositories. Previous work shows that pre-trained language models (PLMs) perform remarkably well on this task after fine-tuning on a significant amount of task-specific training data. On the other hand, while auto-regressive PLMs can gene…

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: 15 pages, 5 figures

  6. arXiv:2109.07488  [pdf, other]

    cs.CL cs.LG

    Comparing Euclidean and Hyperbolic Embeddings on the WordNet Nouns Hypernymy Graph

    Authors: Sameer Bansal, Adrian Benton

    Abstract: Nickel and Kiela (2017) present a new method for embedding tree nodes in the Poincare ball, and suggest that these hyperbolic embeddings are far more effective than Euclidean embeddings at embedding nodes in large, hierarchically structured graphs like the WordNet nouns hypernymy tree. This is especially true in low dimensions (Nickel and Kiela, 2017, Table 1). In this work, we seek to reproduce t…

    Submitted 15 September, 2021; originally announced September 2021.

    ACM Class: I.2.7

  7. arXiv:2109.07483  [pdf, other]

    cs.CL cs.AI

    Cross-Register Projection for Headline Part of Speech Tagging

    Authors: Adrian Benton, Hanyang Li, Igor Malioutov

    Abstract: Part of speech (POS) tagging is a familiar NLP task. State of the art taggers routinely achieve token-level accuracies of over 97% on news body text, evidence that the problem is well understood. However, the register of English news headlines, "headlinese", is very different from the register of long-form text, causing POS tagging models to underperform on headlines. In this work, we automaticall…

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

    ACM Class: I.2.7

  8. arXiv:2104.13936  [pdf, other]

    cs.CL cs.AI cs.LG

    Diversity-Aware Batch Active Learning for Dependency Parsing

    Authors: Tianze Shi, Adrian Benton, Igor Malioutov, Ozan İrsoy

    Abstract: While the predictive performance of modern statistical dependency parsers relies heavily on the availability of expensive expert-annotated treebank data, not all annotations contribute equally to the training of the parsers. In this paper, we attempt to reduce the number of labeled examples needed to train a strong dependency parser using batch active learning (AL). In particular, we investigate w…

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: NAACL 2021

    ACM Class: I.2.7

    Journal ref: In Proceedings of NAACL 2021

  9. arXiv:2012.15332  [pdf, other]

    cs.CL stat.ML

    Corrected CBOW Performs as well as Skip-gram

    Authors: Ozan İrsoy, Adrian Benton, Karl Stratos

    Abstract: Mikolov et al. (2013a) observed that continuous bag-of-words (CBOW) word embeddings tend to underperform Skip-gram (SG) embeddings, and this finding has been reported in subsequent works. We find that these observations are driven not by fundamental differences in their training objectives, but more likely by faulty negative sampling CBOW implementations in popular libraries such as the official i…

    Submitted 9 November, 2021; v1 submitted 30 December, 2020; originally announced December 2020.

    Comments: Presented at WINR at EMNLP 2021, added discussion about FastText, more discussion about findings, additional results on C4 data, wording changes

  10. arXiv:1812.00436  [pdf, other]

    cs.LG cs.CL

    Learning Representations of Social Media Users

    Authors: Adrian Benton

    Abstract: User representations are routinely used in recommendation systems by platform developers, targeted advertisements by marketers, and by public policy researchers to gauge public opinion across demographic groups. Computer scientists consider the problem of inferring user representations more abstractly; how does one extract a stable user representation - effective for many downstream tasks - from a…

    Submitted 2 December, 2018; originally announced December 2018.

    Comments: PhD thesis

  11. arXiv:1712.03538  [pdf, other]

    cs.CL

    Multi-Task Learning for Mental Health using Social Media Text

    Authors: Adrian Benton, Margaret Mitchell, Dirk Hovy

    Abstract: We introduce initial groundwork for estimating suicide risk and mental health in a deep learning framework. By modeling multiple conditions, the system learns to make predictions about suicide risk and mental health at a low false positive rate. Conditions are modeled as tasks in a multi-task learning (MTL) framework, with gender prediction as an additional auxiliary task. We demonstrate the effec…

    Submitted 10 December, 2017; originally announced December 2017.

    ACM Class: I.2.7

    Journal ref: Proceedings of the 15th Conference of the EACL (2017) 152-162

  12. arXiv:1702.02519  [pdf, other]

    cs.LG cs.AI stat.ML

    Deep Generalized Canonical Correlation Analysis

    Authors: Adrian Benton, Huda Khayrallah, Biman Gujral, Dee Ann Reisinger, Sheng Zhang, Raman Arora

    Abstract: We present Deep Generalized Canonical Correlation Analysis (DGCCA) -- a method for learning nonlinear transformations of arbitrarily many views of data, such that the resulting transformations are maximally informative of each other. While methods for nonlinear two-view representation learning (Deep CCA, (Andrew et al., 2013)) and linear many-view representation learning (Generalized CCA (Horst, 1…

    Submitted 14 June, 2017; v1 submitted 8 February, 2017; originally announced February 2017.

    Comments: 14 pages, 6 figures

  13. arXiv:1610.02060  [pdf, other]

    cs.SI cs.CY

    After Sandy Hook Elementary: A Year in the Gun Control Debate on Twitter

    Authors: Adrian Benton, Braden Hancock, Glen Coppersmith, John W. Ayers, Mark Dredze

    Abstract: The mass shooting at Sandy Hook elementary school on December 14, 2012 catalyzed a year of active debate and legislation on gun control in the United States. Social media hosted an active public discussion where people expressed their support and opposition to a variety of issues surrounding gun legislation. In this paper, we show how a content-based analysis of Twitter data can provide insights a…

    Submitted 6 October, 2016; originally announced October 2016.

    Comments: Presented at the Data For Good Exchange 2016

  14. arXiv:0801.4019  [pdf, other]

    cs.CG

    A Class of Convex Polyhedra with Few Edge Unfoldings

    Authors: Alex Benton, Joseph O'Rourke

    Abstract: We construct a sequence of convex polyhedra on n vertices with the property that, as n -> infinity, the fraction of its edge unfoldings that avoid overlap approaches 0, and so the fraction that overlap approaches 1. Nevertheless, each does have (several) nonoverlapping edge unfoldings.

    Submitted 25 January, 2008; originally announced January 2008.

    Comments: 12 pages, 9 figures

    Report number: Smith Computer Science 088

    ACM Class: F.2.2