-
The Future of Research on Social Technologies: CCC Workshop Visioning Report
Authors:
Motahhare Eslami,
Eric Gilbert,
Sarita Schoenebeck,
Eric P. S. Baumer,
Eshwar Chandrasekharan,
Michelle De Mooy,
Karrie Karahalios,
David Karger,
Tressie McMillan Cottom,
Andrés Monroy-Hernández,
Loren Terveen,
John Wihbey
Abstract:
Social technologies are the systems, interfaces, features, infrastructures, and architectures that allow people to interact with each other online. These technologies dramatically shape the fabric of our everyday lives, from the information we consume to the people we interact with to the foundations of our culture and politics. While the benefits of social technologies are well documented, the ha…
▽ More
Social technologies are the systems, interfaces, features, infrastructures, and architectures that allow people to interact with each other online. These technologies dramatically shape the fabric of our everyday lives, from the information we consume to the people we interact with to the foundations of our culture and politics. While the benefits of social technologies are well documented, the harms, too, have cast a long shadow. To address widespread problems like harassment, disinformation, information access, and mental health concerns, we need to rethink the foundations of how social technologies are designed, sustained, and governed.
This report is based on discussions at the Computing Community Consortium Workshop, The Future of Research on Social Technologies, that was held November 2-3, 2023 in Washington, DC. The visioning workshop came together to focus on two questions. What should we know about social technologies, and what is needed to get there? The workshop brought together over 50 information and computer scientists, social scientists, communication and journalism scholars, and policy experts. We used a discussion format, with one day of guiding topics and a second day using an unconference model where participants created discussion topics. The interdisciplinary group of attendees discussed gaps in existing scholarship and the methods, resources, access, and collective effort needed to address those gaps. We also discussed approaches for translating scholarship for various audiences including citizens, funders, educators, industry professionals, and policymakers.
This report presents a synthesis of major themes during our discussions. The themes presented are not a summary of what we know already, they are an exploration of what we do not know enough about, and what we should spend more effort and investment on in the coming years.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Creator Hearts: Investigating the Impact Positive Signals from YouTube Creators in Sha** Comment Section Behavior
Authors:
Frederick Choi,
Charlotte Lambert,
Vinay Koshy,
Sowmya Pratipati,
Tue Do,
Eshwar Chandrasekharan
Abstract:
Much of the research in online moderation focuses on punitive actions. However, emerging research has shown that positive reinforcement is effective at encouraging desirable behavior on online platforms. We extend this research by studying the "creator heart" feature on YouTube, quantifying their primary effects on comments that receive hearts and on videos where hearts have been given. We find th…
▽ More
Much of the research in online moderation focuses on punitive actions. However, emerging research has shown that positive reinforcement is effective at encouraging desirable behavior on online platforms. We extend this research by studying the "creator heart" feature on YouTube, quantifying their primary effects on comments that receive hearts and on videos where hearts have been given. We find that creator hearts increased the visibility of comments, and increased the amount of positive engagement they received from other users. We also find that the presence of a creator hearted comment soon after a video is published can incentivize viewers to comment, increasing the total engagement with the video over time. We discuss the potential for creators to use hearts to shape behavior in their communities by highlighting, rewarding, and incentivizing desirable behaviors from users. We discuss avenues for extending our study to understanding positive signals from moderators on other platforms.
△ Less
Submitted 4 April, 2024;
originally announced April 2024.
-
Harmonizing the Cacophony with MIC: An Affordance-aware Framework for Platform Moderation
Authors:
Tanvi Bajpai,
Drshika Asher,
Anwesa Goswami,
Eshwar Chandrasekharan
Abstract:
Social platforms, and the online communities that use them, are evolving at a rapid pace. As a result, research and development regarding how to moderate online communities is being out-paced. In this paper, we present a novel framework that will allow moderation researchers and practitioners to not only keep-up with the diverse landscape of available platforms and affordances, but also comprehens…
▽ More
Social platforms, and the online communities that use them, are evolving at a rapid pace. As a result, research and development regarding how to moderate online communities is being out-paced. In this paper, we present a novel framework that will allow moderation researchers and practitioners to not only keep-up with the diverse landscape of available platforms and affordances, but also comprehensively represent and analyze moderation on these platforms. The MIC framework represents a social platform's moderation ecosystem using a base-set of 12 platform-level affordances, along with a notion of the inter-affordance relationships that can exist between them. These affordances fall into the three categories -- Members, Infrastructure, and Content -- that are derived from Grimmelmann's taxonomy of moderation, a framework that is already widely accepted and used by the moderation research community. To show how MIC serves as an insightful augmentation of Grimmelmann's lens, we begin by describing how its components have already been shown to impact Grimmelmann's techniques for moderation. Then, we demonstrate the advantages of using an affordance-aware framework like MIC by analyzing several social platforms over the course of two case studies. First, we analyze individual platforms using MIC and demonstrate how MIC can be used to examine the effects of platform changes on the moderation ecosystem and identify potential new challenges in moderation. Next, use MIC to systematically compare three platforms and propose potential moderation mechanisms that each can adapt. Moderation researchers and stakeholders can use such comparisons to uncover where platforms can emulate established, successful and better-studied platforms, as well as learn from the pitfalls other platforms have encountered.
△ Less
Submitted 23 June, 2022; v1 submitted 19 July, 2021;
originally announced July 2021.
-
Conversations Gone Alright: Quantifying and Predicting Prosocial Outcomes in Online Conversations
Authors:
Jiajun Bao,
Junjie Wu,
Yiming Zhang,
Eshwar Chandrasekharan,
David Jurgens
Abstract:
Online conversations can go in many directions: some turn out poorly due to antisocial behavior, while others turn out positively to the benefit of all. Research on improving online spaces has focused primarily on detecting and reducing antisocial behavior. Yet we know little about positive outcomes in online conversations and how to increase them-is a prosocial outcome simply the lack of antisoci…
▽ More
Online conversations can go in many directions: some turn out poorly due to antisocial behavior, while others turn out positively to the benefit of all. Research on improving online spaces has focused primarily on detecting and reducing antisocial behavior. Yet we know little about positive outcomes in online conversations and how to increase them-is a prosocial outcome simply the lack of antisocial behavior or something more? Here, we examine how conversational features lead to prosocial outcomes within online discussions. We introduce a series of new theory-inspired metrics to define prosocial outcomes such as mentoring and esteem enhancement. Using a corpus of 26M Reddit conversations, we show that these outcomes can be forecasted from the initial comment of an online conversation, with the best model providing a relative 24% improvement over human forecasting performance at ranking conversations for predicted outcome. Our results indicate that platforms can use these early cues in their algorithmic ranking of early conversations to prioritize better outcomes.
△ Less
Submitted 16 February, 2021;
originally announced February 2021.
-
Quarantined! Examining the Effects of a Community-Wide Moderation Intervention on Reddit
Authors:
Eshwar Chandrasekharan,
Shagun Jhaver,
Amy Bruckman,
Eric Gilbert
Abstract:
Should social media platforms override a community's self-policing when it repeatedly break rules? What actions can they consider? In light of this debate, platforms have begun experimenting with softer alternatives to outright bans. We examine one such intervention called quarantining, that impedes direct access to and promotion of controversial communities. Specifically, we present two case stud…
▽ More
Should social media platforms override a community's self-policing when it repeatedly break rules? What actions can they consider? In light of this debate, platforms have begun experimenting with softer alternatives to outright bans. We examine one such intervention called quarantining, that impedes direct access to and promotion of controversial communities. Specifically, we present two case studies of what happened when Reddit quarantined the influential communities r/TheRedPill (TRP) and r/The_Donald (TD). Using over 85M Reddit posts, we apply causal inference methods to examine the quarantine's effects on TRP and TD. We find that the quarantine made it more difficult to recruit new members: new user influx to TRP and TD decreased by 79.5% and 58%, respectively. Despite quarantining, existing users' misogyny and racism levels remained unaffected. We conclude by reflecting on the effectiveness of this design friction in limiting the influence of toxic communities and discuss broader implications for content moderation.
△ Less
Submitted 5 October, 2021; v1 submitted 24 September, 2020;
originally announced September 2020.
-
A Just and Comprehensive Strategy for Using NLP to Address Online Abuse
Authors:
David Jurgens,
Eshwar Chandrasekharan,
Libby Hemphill
Abstract:
Online abusive behavior affects millions and the NLP community has attempted to mitigate this problem by develo** technologies to detect abuse. However, current methods have largely focused on a narrow definition of abuse to detriment of victims who seek both validation and solutions. In this position paper, we argue that the community needs to make three substantive changes: (1) expanding our s…
▽ More
Online abusive behavior affects millions and the NLP community has attempted to mitigate this problem by develo** technologies to detect abuse. However, current methods have largely focused on a narrow definition of abuse to detriment of victims who seek both validation and solutions. In this position paper, we argue that the community needs to make three substantive changes: (1) expanding our scope of problems to tackle both more subtle and more serious forms of abuse, (2) develo** proactive technologies that counter or inhibit abuse before it harms, and (3) reframing our effort within a framework of justice to promote healthy communities.
△ Less
Submitted 6 June, 2019; v1 submitted 4 June, 2019;
originally announced June 2019.
-
Hybrid Approaches to Detect Comments Violating Macro Norms on Reddit
Authors:
Eshwar Chandrasekharan,
Eric Gilbert
Abstract:
In this dataset paper, we present a three-stage process to collect Reddit comments that are removed comments by moderators of several subreddits, for violating subreddit rules and guidelines. Other than the fact that these comments were flagged by moderators for violating community norms, we do not have any other information regarding the nature of the violations. Through this procedure, we collec…
▽ More
In this dataset paper, we present a three-stage process to collect Reddit comments that are removed comments by moderators of several subreddits, for violating subreddit rules and guidelines. Other than the fact that these comments were flagged by moderators for violating community norms, we do not have any other information regarding the nature of the violations. Through this procedure, we collect over 2M comments removed by moderators of 100 different Reddit communities, and publicly release the data. Working with this dataset of removed comments, we identify 8 macro norms---norms that are widely enforced on most parts of Reddit. We extract these macro norms by employing a hybrid approach---classification, topic modeling, and open-coding---on comments identified to be norm violations within at least 85 out of the 100 study subreddits. Finally, we label over 40K Reddit comments removed by moderators according to the specific type of macro norm being violated, and make this dataset publicly available. By breaking down a collection of removed comments into more granular types of macro norm violation, our dataset can be used to train more nuanced machine learning classifiers for online moderation.
△ Less
Submitted 16 July, 2019; v1 submitted 7 April, 2019;
originally announced April 2019.
-
Still out there: Modeling and Identifying Russian Troll Accounts on Twitter
Authors:
Jane Im,
Eshwar Chandrasekharan,
Jackson Sargent,
Paige Lighthammer,
Taylor Denby,
Ankit Bhargava,
Libby Hemphill,
David Jurgens,
Eric Gilbert
Abstract:
There is evidence that Russia's Internet Research Agency attempted to interfere with the 2016 U.S. election by running fake accounts on Twitter - often referred to as "Russian trolls". In this work, we: 1) develop machine learning models that predict whether a Twitter account is a Russian troll within a set of 170K control accounts; and, 2) demonstrate that it is possible to use this model to find…
▽ More
There is evidence that Russia's Internet Research Agency attempted to interfere with the 2016 U.S. election by running fake accounts on Twitter - often referred to as "Russian trolls". In this work, we: 1) develop machine learning models that predict whether a Twitter account is a Russian troll within a set of 170K control accounts; and, 2) demonstrate that it is possible to use this model to find active accounts on Twitter still likely acting on behalf of the Russian state. Using both behavioral and linguistic features, we show that it is possible to distinguish between a troll and a non-troll with a precision of 78.5% and an AUC of 98.9%, under cross-validation. Applying the model to out-of-sample accounts still active today, we find that up to 2.6% of top journalists' mentions are occupied by Russian trolls. These findings imply that the Russian trolls are very likely still active today. Additional analysis shows that they are not merely software-controlled bots, and manage their online identities in various complex ways. Finally, we argue that if it is possible to discover these accounts using externally - accessible data, then the platforms - with access to a variety of private internal signals - should succeed at similar or better rates.
△ Less
Submitted 30 January, 2019;
originally announced January 2019.