Skip to main content

Showing 1–5 of 5 results for author: Matero, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.01980  [pdf, other

    cs.CL

    SOCIALITE-LLAMA: An Instruction-Tuned Model for Social Scientific Tasks

    Authors: Gourab Dey, Adithya V Ganesan, Yash Kumar Lal, Manal Shah, Shreyashee Sinha, Matthew Matero, Salvatore Giorgi, Vivek Kulkarni, H. Andrew Schwartz

    Abstract: Social science NLP tasks, such as emotion or humor detection, are required to capture the semantics along with the implicit pragmatics from text, often with limited amounts of training data. Instruction tuning has been shown to improve the many capabilities of large language models (LLMs) such as commonsense reasoning, reading comprehension, and computer programming. However, little is known about… ▽ More

    Submitted 14 March, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Short paper accepted to EACL 2024. 4 pgs, 2 tables

  2. Human Language Modeling

    Authors: Nikita Soni, Matthew Matero, Niranjan Balasubramanian, H. Andrew Schwartz

    Abstract: Natural language is generated by people, yet traditional language modeling views words or documents as if generated independently. Here, we propose human language modeling (HuLM), a hierarchical extension to the language modeling problem whereby a human-level exists to connect sequences of documents (e.g. social media messages) and capture the notion that human language is moderated by changing hu… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

  3. arXiv:2112.13795  [pdf, other

    cs.CL cs.LG

    Evaluating Contextual Embeddings and their Extraction Layers for Depression Assessment

    Authors: Matthew Matero, Albert Hung, H. Andrew Schwartz

    Abstract: Recent works have demonstrated ability to assess aspects of mental health from personal discourse. At the same time, pre-trained contextual word embedding models have grown to dominate much of NLP but little is known empirically on how to best apply them for mental health assessment. Using degree of depression as a case study, we do an empirical analysis on which off-the-shelf language model, indi… ▽ More

    Submitted 28 April, 2022; v1 submitted 27 December, 2021; originally announced December 2021.

  4. arXiv:2109.08113  [pdf, other

    cs.CL

    MeLT: Message-Level Transformer with Masked Document Representations as Pre-Training for Stance Detection

    Authors: Matthew Matero, Nikita Soni, Niranjan Balasubramanian, H. Andrew Schwartz

    Abstract: Much of natural language processing is focused on leveraging large capacity language models, typically trained over single messages with a task of predicting one or more tokens. However, modeling human language at higher-levels of context (i.e., sequences of messages) is under-explored. In stance detection and other social media tasks where the goal is to predict an attribute of a message, we have… ▽ More

    Submitted 1 November, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

  5. Empirical Evaluation of Pre-trained Transformers for Human-Level NLP: The Role of Sample Size and Dimensionality

    Authors: Adithya V Ganesan, Matthew Matero, Aravind Reddy Ravula, Huy Vu, H. Andrew Schwartz

    Abstract: In human-level NLP tasks, such as predicting mental health, personality, or demographics, the number of observations is often smaller than the standard 768+ hidden state sizes of each layer within modern transformer-based language models, limiting the ability to effectively leverage transformers. Here, we provide a systematic study on the role of dimension reduction methods (principal components a… ▽ More

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT)