Skip to main content

Showing 1–8 of 8 results for author: Seppi, K

.
  1. arXiv:2205.08124  [pdf, other

    cs.CL

    When to Use Multi-Task Learning vs Intermediate Fine-Tuning for Pre-Trained Encoder Transfer Learning

    Authors: Orion Weller, Kevin Seppi, Matt Gardner

    Abstract: Transfer learning (TL) in natural language processing (NLP) has seen a surge of interest in recent years, as pre-trained models have shown an impressive ability to transfer to novel tasks. Three main strategies have emerged for making use of multiple supervised datasets during fine-tuning: training on an intermediate task before training on the target task (STILTs), using multi-task learning (MTL)… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: ACL 2022

  2. arXiv:2104.03848  [pdf, other

    cs.CL

    Exploring the Relationship Between Algorithm Performance, Vocabulary, and Run-Time in Text Classification

    Authors: Wilson Fearn, Orion Weller, Kevin Seppi

    Abstract: Text classification is a significant branch of natural language processing, and has many applications including document classification and sentiment analysis. Unsurprisingly, those who do text classification are concerned with the run-time of their algorithms, many of which depend on the size of the corpus' vocabulary due to their bag-of-words representation. Although many studies have examined t… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: Accepted to NAACL 2021

  3. arXiv:1909.00252  [pdf, other

    cs.CL cs.LG

    Humor Detection: A Transformer Gets the Last Laugh

    Authors: Orion Weller, Kevin Seppi

    Abstract: Much previous work has been done in attempting to identify humor in text. In this paper we extend that capability by proposing a new task: assessing whether or not a joke is humorous. We present a novel way of approaching this problem by building a model that learns to identify humorous jokes based on ratings gleaned from Reddit pages, consisting of almost 16,000 labeled instances. Using these rat… ▽ More

    Submitted 31 August, 2019; originally announced September 2019.

    Comments: Accepted to EMNLP 2019

  4. arXiv:1905.13126  [pdf, other

    cs.IR cs.CL cs.LG stat.ML

    Automatic Evaluation of Local Topic Quality

    Authors: Jeffrey Lund, Piper Armstrong, Wilson Fearn, Stephen Cowley, Courtni Byun, Jordan Boyd-Graber, Kevin Seppi

    Abstract: Topic models are typically evaluated with respect to the global topic distributions that they generate, using metrics such as coherence, but without regard to local (token-level) topic assignments. Token-level assignments are important for downstream tasks such as classification. Even recent models, which aim to improve the quality of these token-level topic assignments, have been evaluated only w… ▽ More

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: 8 pages 4 figures 3 tables

  5. arXiv:1905.09864  [pdf, ps, other

    cs.CL cs.HC cs.IR cs.LG

    Why Didn't You Listen to Me? Comparing User Control of Human-in-the-Loop Topic Models

    Authors: Varun Kumar, Alison Smith-Renner, Leah Findlater, Kevin Seppi, Jordan Boyd-Graber

    Abstract: To address the lack of comparative evaluation of Human-in-the-Loop Topic Modeling (HLTM) systems, we implement and evaluate three contrasting HLTM modeling approaches using simulation experiments. These approaches extend previously proposed frameworks, including constraints and informed prior-based methods. Users should have a sense of control in HLTM systems, so we propose a control metric to mea… ▽ More

    Submitted 3 June, 2019; v1 submitted 23 May, 2019; originally announced May 2019.

    Comments: In proceedings of ACL 2019

  6. arXiv:1905.07508  [pdf, other

    cs.CL cs.IR cs.LG stat.ML

    Cross-referencing using Fine-grained Topic Modeling

    Authors: Jeffrey Lund, Piper Armstrong, Wilson Fearn, Stephen Cowley, Emily Hales, Kevin Seppi

    Abstract: Cross-referencing, which links passages of text to other related passages, can be a valuable study aid for facilitating comprehension of a text. However, cross-referencing requires first, a comprehensive thematic knowledge of the entire corpus, and second, a focused search through the corpus specifically to find such useful connections. Due to this, cross-reference resources are prohibitively expe… ▽ More

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: 6 figures 1 table 8 pages

  7. arXiv:1810.09942  [pdf, other

    cs.LG stat.ML

    Preprocessor Selection for Machine Learning Pipelines

    Authors: Brandon Schoenfeld, Christophe Giraud-Carrier, Mason Poggemann, Jarom Christensen, Kevin Seppi

    Abstract: Much of the work in metalearning has focused on classifier selection, combined more recently with hyperparameter optimization, with little concern for data preprocessing. Yet, it is generally well accepted that machine learning applications require not only model building, but also data preprocessing. In other words, practical solutions consist of pipelines of machine learning operators rather tha… ▽ More

    Submitted 23 October, 2018; originally announced October 2018.

    Comments: Accepted at the ICML 2018 AutoML Workshop

  8. arXiv:1709.06067  [pdf

    cs.HC

    Sculpt, Deploy, Repeat: Fast Prototy** of Interactive Physical Objects

    Authors: Michael Jones, Kevin Seppi

    Abstract: Building a deployable PhysiComp that merges form and function typically involves a significant investment of time and skill in digital electronics, 3D modeling and mechanical design. We aim to help designers quickly create prototypes by removing technical barriers in that process. Other methods for constructing PhysiComp prototypes either lack fidelity in representing shape and function or are con… ▽ More

    Submitted 18 September, 2017; originally announced September 2017.