Skip to main content

Showing 1–4 of 4 results for author: Scheuerman, M K

.
  1. arXiv:2406.06407  [pdf, other

    cs.LG cs.CY

    A Taxonomy of Challenges to Curating Fair Datasets

    Authors: Dora Zhao, Morgan Klaus Scheuerman, Pooja Chitre, Jerone T. A. Andrews, Georgia Panagiotidou, Shawn Walker, Kathleen H. Pine, Alice Xiang

    Abstract: Despite extensive efforts to create fairer machine learning (ML) datasets, there remains a limited understanding of the practical aspects of dataset curation. Drawing from interviews with 30 ML dataset curators, we present a comprehensive taxonomy of the challenges and trade-offs encountered throughout the dataset curation lifecycle. Our findings underscore overarching issues within the broader fa… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  2. A Framework of Severity for Harmful Content Online

    Authors: Morgan Klaus Scheuerman, Jialun Aaron Jiang, Casey Fiesler, Jed R. Brubaker

    Abstract: The proliferation of harmful content on online social media platforms has necessitated empirical understandings of experiences of harm online and the development of practices for harm mitigation. Both understandings of harm and approaches to mitigating that harm, often through content moderation, have implicitly embedded frameworks of prioritization - what forms of harm should be researched, how p… ▽ More

    Submitted 17 September, 2021; v1 submitted 9 August, 2021; originally announced August 2021.

    Comments: CSCW 2021; 33 pages

    Journal ref: Proc. ACM Hum.-Comput. Interact.5, CSCW2, Article 368 (October 2021), 33 pages

  3. arXiv:2108.04308  [pdf, other

    cs.CV cs.HC

    Do Datasets Have Politics? Disciplinary Values in Computer Vision Dataset Development

    Authors: Morgan Klaus Scheuerman, Emily Denton, Alex Hanna

    Abstract: Data is a crucial component of machine learning. The field is reliant on data to train, validate, and test models. With increased technical capabilities, machine learning research has boomed in both academic and industry settings, and one major focus has been on computer vision. Computer vision is a popular domain of machine learning increasingly pertinent to real-world applications, from facial r… ▽ More

    Submitted 16 September, 2021; v1 submitted 9 August, 2021; originally announced August 2021.

    Comments: CSCW 2021; 37 pages

    Journal ref: Proc. ACM Hum.-Comput. Interact.5, CSCW2, Article 317(October 2021), 37 pages

  4. arXiv:2007.07399  [pdf, ps, other

    cs.CY

    Bringing the People Back In: Contesting Benchmark Machine Learning Datasets

    Authors: Remi Denton, Alex Hanna, Razvan Amironesei, Andrew Smart, Hilary Nicole, Morgan Klaus Scheuerman

    Abstract: In response to algorithmic unfairness embedded in sociotechnical systems, significant attention has been focused on the contents of machine learning datasets which have revealed biases towards white, cisgender, male, and Western data subjects. In contrast, comparatively less attention has been paid to the histories, values, and norms embedded in such datasets. In this work, we outline a research… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.