Showing 1–26 of 26 results for author: Schwartz, H A

  1. arXiv:2402.01980  [pdf, other]

    cs.CL

    SOCIALITE-LLAMA: An Instruction-Tuned Model for Social Scientific Tasks

    Authors: Gourab Dey, Adithya V Ganesan, Yash Kumar Lal, Manal Shah, Shreyashee Sinha, Matthew Matero, Salvatore Giorgi, Vivek Kulkarni, H. Andrew Schwartz

    Abstract: Social science NLP tasks, such as emotion or humor detection, are required to capture the semantics along with the implicit pragmatics from text, often with limited amounts of training data. Instruction tuning has been shown to improve the many capabilities of large language models (LLMs) such as commonsense reasoning, reading comprehension, and computer programming. However, little is known about…

    Submitted 14 March, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Short paper accepted to EACL 2024. 4 pgs, 2 tables

  2. arXiv:2401.12492  [pdf, other]

    cs.CL cs.AI cs.LG

    Comparing Pre-trained Human Language Models: Is it Better with Human Context as Groups, Individual Traits, or Both?

    Authors: Nikita Soni, Niranjan Balasubramanian, H. Andrew Schwartz, Dirk Hovy

    Abstract: Incorporating human context into language models is the next frontier for human-centered natural language processing. Currently, two pre-training methods exist: group-wise attributes (e.g., over-45-year-olds) or individual traits. Group attributes are coarse -- not all 45-year-olds write the same way -- while modeling individual traits allows for a more personalized representation, but requires mo…

    Submitted 26 March, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  3. arXiv:2312.07751  [pdf, other]

    cs.CL cs.AI cs.LG

    Large Human Language Models: A Need and the Challenges

    Authors: Nikita Soni, H. Andrew Schwartz, João Sedoc, Niranjan Balasubramanian

    Abstract: As research in human-centered NLP advances, there is a growing recognition of the importance of incorporating human and social factors into NLP models. At the same time, our NLP systems have become heavily reliant on LLMs, most of which do not model authors. To build NLP systems that can truly understand human language, we must better integrate human contexts into LLMs. This brings to the fore a r…

    Submitted 9 May, 2024; v1 submitted 8 November, 2023; originally announced December 2023.

  4. arXiv:2311.06467  [pdf, other]

    cs.CL cs.AI

    ALBA: Adaptive Language-based Assessments for Mental Health

    Authors: Vasudha Varadarajan, Sverker Sikström, Oscar N. E. Kjell, H. Andrew Schwartz

    Abstract: Mental health issues differ widely among individuals, with varied signs and symptoms. Recently, language-based assessments have shown promise in capturing this diversity, but they require a substantial sample of words per person for accuracy. This work introduces the task of Adaptive Language-Based Assessment ALBA, which involves adaptively ordering questions while also scoring an individual's lat…

    Submitted 16 May, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

  5. arXiv:2306.01183  [pdf, other]

    cs.CL

    Systematic Evaluation of GPT-3 for Zero-Shot Personality Estimation

    Authors: Adithya V Ganesan, Yash Kumar Lal, August Håkan Nilsson, H. Andrew Schwartz

    Abstract: Very large language models (LLMs) perform extremely well on a spectrum of NLP tasks in a zero-shot setting. However, little is known about their performance on human-level NLP problems which rely on understanding psychological concepts, such as assessing personality traits. In this work, we investigate the zero-shot ability of GPT-3 to estimate the Big 5 personality traits from users' social media…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Short Paper (5 pages), Accepted to (WASSA) 13th Workshop on Computational Approaches to Subjectivity, Sentiment & Social Media Analysis at ACL 2023

    MSC Class: 68T50 ACM Class: J.4; I.2; I.7

  6. arXiv:2305.14757  [pdf, other]

    cs.CL

    Psychological Metrics for Dialog System Evaluation

    Authors: Salvatore Giorgi, Shreya Havaldar, Farhan Ahmed, Zuhaib Akhtar, Shalaka Vaidya, Gary Pan, Lyle H. Ungar, H. Andrew Schwartz, Joao Sedoc

    Abstract: We present metrics for evaluating dialog systems through a psychologically-grounded "human" lens in which conversational agents express a diversity of both states (e.g., emotion) and traits (e.g., personality), just as people do. We present five interpretable metrics from established psychology that are fundamental to human communication and relationships: emotional entropy, linguistic style and e…

    Submitted 15 September, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  7. arXiv:2305.02459  [pdf, other]

    cs.CL cs.LG

    Transfer and Active Learning for Dissonance Detection: Addressing the Rare-Class Challenge

    Authors: Vasudha Varadarajan, Swanie Juhng, Syeda Mahwish, Xiaoran Liu, Jonah Luby, Christian Luhmann, H. Andrew Schwartz

    Abstract: While transformer-based systems have enabled greater accuracies with fewer training examples, data acquisition obstacles still persist for rare-class tasks -- when the class label is very infrequent (e.g. < 5% of samples). Active learning has in general been proposed to alleviate such challenges, but choice of selection strategy, the criteria by which rare-class examples are chosen, has not been s…

    Submitted 4 May, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

  8. arXiv:2302.12952  [pdf]

    cs.CL

    Robust language-based mental health assessments in time and space through social media

    Authors: Siddharth Mangalik, Johannes C. Eichstaedt, Salvatore Giorgi, Jihu Mun, Farhan Ahmed, Gilvir Gill, Adithya V. Ganesan, Shashanka Subrahmanya, Nikita Soni, Sean A. P. Clouston, H. Andrew Schwartz

    Abstract: Compared to physical health, population mental health measurement in the U.S. is very coarse-grained. Currently, in the largest population surveys, such as those carried out by the Centers for Disease Control or Gallup, mental health is only broadly captured through "mentally unhealthy days" or "sadness", and limited to relatively infrequent state or metropolitan estimates. Through the large scale…

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: 9 pages, 7 figures, pre-print

    ACM Class: J.4; I.2.7

  9. Human Language Modeling

    Authors: Nikita Soni, Matthew Matero, Niranjan Balasubramanian, H. Andrew Schwartz

    Abstract: Natural language is generated by people, yet traditional language modeling views words or documents as if generated independently. Here, we propose human language modeling (HuLM), a hierarchical extension to the language modeling problem whereby a human-level exists to connect sequences of documents (e.g. social media messages) and capture the notion that human language is moderated by changing hu…

    Submitted 10 May, 2022; originally announced May 2022.

  10. arXiv:2112.13795  [pdf, other]

    cs.CL cs.LG

    Evaluating Contextual Embeddings and their Extraction Layers for Depression Assessment

    Authors: Matthew Matero, Albert Hung, H. Andrew Schwartz

    Abstract: Recent works have demonstrated ability to assess aspects of mental health from personal discourse. At the same time, pre-trained contextual word embedding models have grown to dominate much of NLP but little is known empirically on how to best apply them for mental health assessment. Using degree of depression as a case study, we do an empirical analysis on which off-the-shelf language model, indi…

    Submitted 28 April, 2022; v1 submitted 27 December, 2021; originally announced December 2021.

  11. arXiv:2109.08113  [pdf, other]

    cs.CL

    MeLT: Message-Level Transformer with Masked Document Representations as Pre-Training for Stance Detection

    Authors: Matthew Matero, Nikita Soni, Niranjan Balasubramanian, H. Andrew Schwartz

    Abstract: Much of natural language processing is focused on leveraging large capacity language models, typically trained over single messages with a task of predicting one or more tokens. However, modeling human language at higher-levels of context (i.e., sequences of messages) is under-explored. In stance detection and other social media tasks where the goal is to predict an attribute of a message, we have…

    Submitted 1 November, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

  12. arXiv:2106.01335  [pdf, other]

    cs.CL

    On the Distribution, Sparsity, and Inference-time Quantization of Attention Values in Transformers

    Authors: Tianchu Ji, Shraddhan Jain, Michael Ferdman, Peter Milder, H. Andrew Schwartz, Niranjan Balasubramanian

    Abstract: How much information do NLP tasks really need from a transformer's attention mechanism at application-time (inference)? From recent work, we know that there is sparsity in transformers and that the floating-points within its computation can be discretized to fewer values with minimal loss to task accuracies. However, this requires retraining or even creating entirely new models, both of which can…

    Submitted 2 June, 2021; originally announced June 2021.

  13. Empirical Evaluation of Pre-trained Transformers for Human-Level NLP: The Role of Sample Size and Dimensionality

    Authors: Adithya V Ganesan, Matthew Matero, Aravind Reddy Ravula, Huy Vu, H. Andrew Schwartz

    Abstract: In human-level NLP tasks, such as predicting mental health, personality, or demographics, the number of observations is often smaller than the standard 768+ hidden state sizes of each layer within modern transformer-based language models, limiting the ability to effectively leverage transformers. Here, we provide a systematic study on the role of dimension reduction methods (principal components a…

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT)

  14. arXiv:2105.01306  [pdf, other]

    cs.CL

    Discourse Relation Embeddings: Representing the Relations between Discourse Segments in Social Media

    Authors: Youngseo Son, Vasudha Varadarajan, H Andrew Schwartz

    Abstract: Discourse relations are typically modeled as a discrete class that characterizes the relation between segments of text (e.g. causal explanations, expansions). However, such predefined discrete classes limits the universe of potential relationships and their nuanced differences. Analogous to contextual word embeddings, we propose representing discourse relations as points in high dimensional contin…

    Submitted 28 February, 2023; v1 submitted 4 May, 2021; originally announced May 2021.

    Comments: Published in EMNLP 2022 UM-IoS

  15. arXiv:2011.06457  [pdf]

    cs.CL

    World Trade Center responders in their own words: Predicting PTSD symptom trajectories with AI-based language analyses of interviews

    Authors: Youngseo Son, Sean A. P. Clouston, Roman Kotov, Johannes C. Eichstaedt, Evelyn J. Bromet, Benjamin J. Luft, H Andrew Schwartz

    Abstract: Background: Oral histories from 9/11 responders to the World Trade Center (WTC) attacks provide rich narratives about distress and resilience. Artificial Intelligence (AI) models promise to detect psychopathology in natural language, but they have been evaluated primarily in non-clinical settings using social media. This study sought to test the ability of AI-based language assessments to predict…

    Submitted 12 November, 2020; originally announced November 2020.

    Comments: 20 pages, 2 figures

  16. arXiv:2011.03983  [pdf, other]

    cs.CL cs.HC cs.SI

    Detecting Emerging Symptoms of COVID-19 using Context-based Twitter Embeddings

    Authors: Roshan Santosh, H. Andrew Schwartz, Johannes C. Eichstaedt, Lyle H. Ungar, Sharath C. Guntuku

    Abstract: In this paper, we present an iterative graph-based approach for the detection of symptoms of COVID-19, the pathology of which seems to be evolving. More generally, the method can be applied to finding context-specific words and texts (e.g. symptom mentions) in large imbalanced corpora (e.g. all tweets mentioning #COVID-19). Given the novelty of COVID-19, we also test if the proposed approach gener…

    Submitted 8 November, 2020; originally announced November 2020.

    Comments: In proceedings of EMNLP 2020 (Empirical Methods in NLP) workshop on COVID-19

  17. arXiv:2004.06303  [pdf, other]

    cs.CL cs.CY cs.SI

    Quantifying Community Characteristics of Maternal Mortality Using Social Media

    Authors: Rediet Abebe, Salvatore Giorgi, Anna Tedijanto, Anneke Buffone, H. Andrew Schwartz

    Abstract: While most mortality rates have decreased in the US, maternal mortality has increased and is among the highest of any OECD nation. Extensive public health research is ongoing to better understand the characteristics of communities with relatively high or low rates. In this work, we explore the role that social media language can play in providing insights into such community characteristics. Analy…

    Submitted 14 April, 2020; originally announced April 2020.

    Comments: In Proceedings of The Web Conference 2020 (WWW '20)

  18. Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview

    Authors: Deven Shah, H. Andrew Schwartz, Dirk Hovy

    Abstract: An increasing number of works in natural language processing have addressed the effect of bias on the predicted outcomes, introducing mitigation techniques that act on different parts of the standard NLP pipeline (data and models). However, these works have been conducted in isolation, without a unifying framework to organize efforts within the field. This leads to repetitive approaches, and puts…

    Submitted 12 September, 2020; v1 submitted 9 November, 2019; originally announced December 2019.

    Comments: 9 pages excluding references, 1 figure, 3 pages for appendix

    Journal ref: Association for Computational Linguistics. (2020) 5248--5264

  19. arXiv:1911.03855  [pdf, other]

    cs.SI cs.CL cs.CY

    Correcting Sociodemographic Selection Biases for Population Prediction from Social Media

    Authors: Salvatore Giorgi, Veronica Lynn, Keshav Gupta, Farhan Ahmed, Sandra Matz, Lyle Ungar, H. Andrew Schwartz

    Abstract: Social media is increasingly used for large-scale population predictions, such as estimating community health statistics. However, social media users are not typically a representative sample of the intended population -- a "selection bias". Within the social sciences, such a bias is typically addressed with restratification techniques, where observations are reweighted according to how under- or…

    Submitted 7 June, 2022; v1 submitted 10 November, 2019; originally announced November 2019.

    Comments: Published at the 16th International AAAI Conference on Web and Social Media (ICWSM) 2022

  20. arXiv:1810.10949  [pdf, other]

    cs.CL

    Learning Emotion from 100 Observations: Unexpected Robustness of Deep Learning under Strong Data Limitations

    Authors: Sven Buechel, João Sedoc, H. Andrew Schwartz, Lyle Ungar

    Abstract: One of the major downsides of Deep Learning is its supposed need for vast amounts of training data. As such, these techniques appear ill-suited for NLP areas where annotated data is limited, such as less-resourced languages or emotion analysis, with its many nuanced and hard-to-acquire annotation formats. We conduct a questionnaire study indicating that indeed the vast majority of researchers in e…

    Submitted 7 December, 2020; v1 submitted 25 October, 2018; originally announced October 2018.

    Comments: Published at PEOPLES 2020

  21. arXiv:1809.01202  [pdf, other]

    cs.CL

    Causal Explanation Analysis on Social Media

    Authors: Youngseo Son, Nipun Bayas, H. Andrew Schwartz

    Abstract: Understanding causal explanations - reasons given for happenings in one's life - has been found to be an important psychological factor linked to physical and mental health. Causal explanations are often studied through manual identification of phrases over limited samples of personal writing. Automatic identification of causal explanations in social media, while challenging in relying on contextu…

    Submitted 18 October, 2018; v1 submitted 4 September, 2018; originally announced September 2018.

    Comments: To appear in EMNLP 2018; 10 pages

  22. arXiv:1808.09600  [pdf, ps, other]

    cs.SI cs.CY

    The Remarkable Benefit of User-Level Aggregation for Lexical-based Population-Level Predictions

    Authors: Salvatore Giorgi, Daniel Preotiuc-Pietro, Anneke Buffone, Daniel Rieman, Lyle H. Ungar, H. Andrew Schwartz

    Abstract: Nowcasting based on social media text promises to provide unobtrusive and near real-time predictions of community-level outcomes. These outcomes are typically regarding people, but the data is often aggregated without regard to users in the Twitter populations of each community. This paper describes a simple yet effective method for building community-level models using Twitter language aggregated…

    Submitted 28 August, 2018; originally announced August 2018.

    Comments: To appear in the proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP)

  23. arXiv:1808.09479  [pdf, other]

    cs.CL

    Residualized Factor Adaptation for Community Social Media Prediction Tasks

    Authors: Mohammadzaman Zamani, H. Andrew Schwartz, Veronica E. Lynn, Salvatore Giorgi, Niranjan Balasubramanian

    Abstract: Predictive models over social media language have shown promise in capturing community outcomes, but approaches thus far largely neglect the socio-demographic context (e.g. age, education rates, race) of the community from which the language originates. For example, it may be inaccurate to assume people in Mobile, Alabama, where the population is relatively older, will use words the same way as th…

    Submitted 28 August, 2018; originally announced August 2018.

    Comments: Conference on Empirical Methods in Natural Language Processing (EMNLP 2018)

    Journal ref: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 3560-3569, 2018

  24. Predicting Human Trustfulness from Facebook Language

    Authors: Mohammadzaman Zamani, Anneke Buffone, H. Andrew Schwartz

    Abstract: Trustfulness -- one's general tendency to have confidence in unknown people or situations -- predicts many important real-world outcomes such as mental health and likelihood to cooperate with others such as clinicians. While data-driven measures of interpersonal trust have previously been introduced, here, we develop the first language-based assessment of the personality trait of trustfulness by f…

    Submitted 16 August, 2018; originally announced August 2018.

    Comments: CLPsych2018

    Journal ref: In Proceedings of the Fifth Workshop on Computational Linguistics and Clinical Psychology: From Keyboard to Clinic, pages 174-181, 2018

  25. arXiv:1806.05740  [pdf, other]

    cs.CY cs.AI cs.CL

    Using Search Queries to Understand Health Information Needs in Africa

    Authors: Rediet Abebe, Shawndra Hill, Jennifer Wortman Vaughan, Peter M. Small, H. Andrew Schwartz

    Abstract: The lack of comprehensive, high-quality health data in developing nations creates a roadblock for combating the impacts of disease. One key challenge is understanding the health information needs of people in these nations. Without understanding people's everyday needs, concerns, and misconceptions, health organizations and policymakers lack the ability to effectively target education and programm…

    Submitted 17 April, 2019; v1 submitted 14 June, 2018; originally announced June 2018.

    Comments: Extended version of an ICWSM 2019 paper

  26. Latent Human Traits in the Language of Social Media: An Open-Vocabulary Approach

    Authors: Vivek Kulkarni, Margaret L. Kern, David Stillwell, Michal Kosinski, Sandra Matz, Lyle Ungar, Steven Skiena, H. Andrew Schwartz

    Abstract: Over the past century, personality theory and research has successfully identified core sets of characteristics that consistently describe and explain fundamental differences in the way people think, feel and behave. Such characteristics were derived through theory, dictionary analyses, and survey research using explicit self-reports. The availability of social media data spanning millions of user…

    Submitted 22 May, 2017; originally announced May 2017.

    Comments: In submission to PLOS One