Showing 1–8 of 8 results for author: Harrigian, K

  1. arXiv:2403.01628

    cs.LG

    Recent Advances, Applications, and Open Challenges in Machine Learning for Health: Reflections from Research Roundtables at ML4H 2023 Symposium

    Authors: Hyewon Jeong, Sarah Jabbour, Yuzhe Yang, Rahul Thapta, Hussein Mozannar, William Jongwon Han, Nikita Mehandru, Michael Wornow, Vladislav Lialin, Xin Liu, Alejandro Lozano, Jiacheng Zhu, Rafal Dariusz Kocielnik, Keith Harrigian, Haoran Zhang, Edward Lee, Milos Vukadinovic, Aparna Balagopalan, Vincent Jeanselme, Katherine Matton, Ilker Demirel, Jason Fries, Parisa Rashidi, Brett Beaulieu-Jones, Xuhai Orson Xu, et al. (18 additional authors not shown)

    Abstract: The third ML4H symposium was held in person on December 10, 2023, in New Orleans, Louisiana, USA. The symposium included research roundtable sessions to foster discussions between participants and senior researchers on timely and relevant topics for the ML4H community. Encouraged by the successful virtual roundtables in the previous year, we organized eleven in-person roundtables and four vir…

    Submitted 5 April, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: ML4H 2023, Research Roundtables

  2. arXiv:2311.08687

    cs.CL cs.AI cs.LG

    An Eye on Clinical BERT: Investigating Language Model Generalization for Diabetic Eye Disease Phenotyping

    Authors: Keith Harrigian, Tina Tang, Anthony Gonzales, Cindy X. Cai, Mark Dredze

    Abstract: Diabetic eye disease is a major cause of blindness worldwide. The ability to monitor relevant clinical trajectories and detect lapses in care is critical to managing the disease and preventing blindness. Alas, much of the information necessary to support these goals is found only in the free text of the electronic medical record. To fill this information gap, we introduce a system for extracting e…

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 24 pages

  3. arXiv:2206.11160

    cs.CL cs.CY

    The Problem of Semantic Shift in Longitudinal Monitoring of Social Media: A Case Study on Mental Health During the COVID-19 Pandemic

    Authors: Keith Harrigian, Mark Dredze

    Abstract: Social media allows researchers to track societal and cultural changes over time based on language analysis tools. Many of these tools rely on statistical algorithms which need to be tuned to specific types of language. Recent studies have shown the absence of appropriate tuning, specifically in the presence of semantic shift, can hinder robustness of the underlying methods. However, little is kno…

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: Accepted to the 14th International ACM Conference on Web Science in 2022 (WebSci '22)

  4. arXiv:2206.11155

    cs.LG cs.CL cs.CY

    Then and Now: Quantifying the Longitudinal Validity of Self-Disclosed Depression Diagnoses

    Authors: Keith Harrigian, Mark Dredze

    Abstract: Self-disclosed mental health diagnoses, which serve as ground truth annotations of mental health status in the absence of clinical measures, underpin the conclusions behind most computational studies of mental health language from the last decade. However, psychiatric conditions are dynamic; a prior depression diagnosis may no longer be indicative of an individual's mental health, either due to tr…

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: Accepted to the Eighth Workshop on Computational Linguistics and Clinical Psychology (CLPsych) at NAACL

  5. arXiv:2103.10550

    cs.CL

    Gender and Racial Fairness in Depression Research using Social Media

    Authors: Carlos Aguirre, Keith Harrigian, Mark Dredze

    Abstract: Multiple studies have demonstrated that behavior on internet-based social media platforms can be indicative of an individual's mental health status. The widespread availability of such data has spurred interest in mental health research from a computational lens. While previous research has raised concerns about possible biases in models produced from this data, no study has quantified how these b…

    Submitted 18 March, 2021; originally announced March 2021.

    Comments: Accepted to EACL 2021

  6. arXiv:2011.05233

    cs.CL

    On the State of Social Media Data for Mental Health Research

    Authors: Keith Harrigian, Carlos Aguirre, Mark Dredze

    Abstract: Data-driven methods for mental health treatment and surveillance have become a major focus in computational science research in the last decade. However, progress in the domain, in terms of both medical understanding and system performance, remains bounded by the availability of adequate data. Prior systematic reviews have not necessarily made it possible to measure the degree to which data-relate…

    Submitted 25 April, 2021; v1 submitted 10 November, 2020; originally announced November 2020.

    Comments: Originally submitted to ICWSM in January 2020. v1 updated November 2020. v2 updated April 2021, to appear at CLPsych 2021. Supplementary material at https://github.com/kharrigian/mental-health-datasets

  7. arXiv:1810.03067

    cs.IR cs.CL

    Geocoding Without Geotags: A Text-based Approach for reddit

    Authors: Keith Harrigian

    Abstract: In this paper, we introduce the first geolocation inference approach for reddit, a social media platform where user pseudonymity has thus far made supervised demographic inference difficult to implement and validate. In particular, we design a text-based heuristic schema to generate ground truth location labels for reddit users in the absence of explicitly geotagged data. After evaluating the accu…

    Submitted 6 October, 2018; originally announced October 2018.

    Comments: Accepted to the EMNLP Workshop on Noisy User-generated Text (W-NUT). Brussels, Belgium. November 1, 2018

  8. arXiv:1809.08711

    cs.IR cs.CL

    Recognizing Film Entities in Podcasts

    Authors: Ahmet Salih Gundogdu, Arjun Sanghvi, Keith Harrigian

    Abstract: In this paper, we propose a Named Entity Recognition (NER) system to identify film titles in podcast audio. Taking inspiration from NER systems for noisy text in social media, we implement a two-stage approach that is robust to computer transcription errors and does not require significant computational expense to accommodate new film titles/releases. Evaluating on a diverse set of podcasts, we de…

    Submitted 23 September, 2018; originally announced September 2018.

    Comments: 4 pages, 1 figure. To appear in Proceedings of 2018 KDD Workshop on Machine Learning and Data Mining for Podcasts, August 2018, London, UK