-
SCIMAP: A Python Toolkit for Integrated Spatial Analysis of Multiplexed Imaging Data
Authors:
Ajit J. Nirmal,
Peter K. Sorger
Abstract:
Multiplexed imaging data are revolutionizing our understanding of the composition and organization of tissues and tumors. A critical aspect of such tissue profiling is quantifying the spatial relationships among cells at different scales, from the interaction of neighboring cells to recurrent communities of cells of multiple types. This often involves statistical analysis of 10^7 or more cells in which up to 100 biomolecules (commonly proteins) have been measured. While software tools currently cater to the analysis of spatial transcriptomics data, there remains a need for toolkits explicitly tailored to the complexities of multiplexed imaging data, including the need to seamlessly integrate image visualization with data analysis and exploration. We introduce SCIMAP, a Python package specifically crafted to address these challenges. With SCIMAP, users can efficiently preprocess, analyze, and visualize large datasets, facilitating the exploration of spatial relationships and their statistical significance. SCIMAP's modular design enables the integration of new algorithms, enhancing its capabilities for spatial analysis.
Submitted 3 May, 2024;
originally announced May 2024.
-
Towards Interpretable Hate Speech Detection using Large Language Model-extracted Rationales
Authors:
Ayushi Nirmal,
Amrita Bhattacharjee,
Paras Sheth,
Huan Liu
Abstract:
Although social media platforms are a prominent arena for users to engage in interpersonal discussions and express opinions, the facade and anonymity offered by social media may allow users to spew hate speech and offensive content. Given the massive scale of such platforms, there arises a need to automatically identify and flag instances of hate speech. Although several hate speech detection methods exist, most of these black-box methods are not interpretable or explainable by design. To address the lack of interpretability, in this paper, we propose to use state-of-the-art Large Language Models (LLMs) to extract features in the form of rationales from the input text, to train a base hate speech classifier, thereby enabling faithful interpretability by design. Our framework effectively combines the textual understanding capabilities of LLMs and the discriminative power of state-of-the-art hate speech classifiers to make these classifiers faithfully interpretable. Our comprehensive evaluation on a variety of English-language social media hate speech datasets demonstrates: (1) the goodness of the LLM-extracted rationales, and (2) the surprising retention of detector performance even after training to ensure interpretability. All code and data will be made available at https://github.com/AmritaBh/shield.
Submitted 7 May, 2024; v1 submitted 18 March, 2024;
originally announced March 2024.
-
Disinformation Detection: An Evolving Challenge in the Age of LLMs
Authors:
Bohan Jiang,
Zhen Tan,
Ayushi Nirmal,
Huan Liu
Abstract:
The advent of generative Large Language Models (LLMs) such as ChatGPT has catalyzed transformative advancements across multiple domains. However, alongside these advancements, they have also introduced potential threats. One critical concern is the misuse of LLMs by disinformation spreaders, who leverage these models to generate highly persuasive yet misleading content that challenges disinformation detection systems. This work aims to address this issue by answering three research questions: (1) To what extent can current disinformation detection techniques reliably detect LLM-generated disinformation? (2) If traditional techniques prove less effective, can LLMs themselves be exploited to serve as a robust defense against advanced disinformation? and (3) Should both these strategies falter, what novel approaches can be proposed to counter this burgeoning threat effectively? A holistic exploration of the formation and detection of disinformation is conducted to foster this line of research.
Submitted 25 September, 2023;
originally announced September 2023.
-
User Migration across Multiple Social Media Platforms
Authors:
Ujun Jeong,
Ayushi Nirmal,
Kritshekhar Jha,
Susan Xu Tang,
H. Russell Bernard,
Huan Liu
Abstract:
After Twitter's ownership change and policy shifts, many users reconsidered their go-to social media outlets, and platforms like Mastodon, Bluesky, and Threads became attractive alternatives in the battle for users. Based on data from over 14,000 users who migrated to these platforms within the first eight weeks after the launch of Threads, our study examines: (1) distinguishing attributes of Twitter users who migrated, compared to non-migrants; (2) temporal migration patterns and associated challenges for sustainable migration faced by each platform; and (3) how these new platforms are perceived in relation to Twitter. Our research proceeds in three stages. First, we examine migration from a broad perspective, not just one-to-one migration. Second, we leverage behavioral analysis to pinpoint the distinct migration pattern of each platform. Last, we employ a Large Language Model (LLM) to discern stances towards each platform and correlate them with platform usage. This in-depth analysis illuminates migration patterns amid competition across social media platforms.
Submitted 10 January, 2024; v1 submitted 22 September, 2023;
originally announced September 2023.
-
SocioHub: An Interactive Tool for Cross-Platform Social Media Data Collection
Authors:
Ayushi Nirmal,
Bohan Jiang,
Huan Liu
Abstract:
Social media is inherently about connecting and interacting with others. Different social media platforms have unique characteristics and user bases. Moreover, people use different platforms for various social and entertainment purposes. Analyzing cross-platform user behavior can provide insights into the preferences and expectations of users on each platform. By understanding how users behave and interact across platforms, we can build an understanding of content consumption patterns, enhance communication and social interactions, and tailor platform-specific strategies. We can further gather insights into how users navigate and engage with their platforms on different devices. In this work, we develop a tool, SocioHub, which enables users to gather data on multiple social media platforms in one place. This tool can help researchers gain insights into different data attributes for users across social media platforms such as Twitter, Instagram, and Mastodon. Keywords: Social Media Platforms, Twitter, Instagram, Mastodon.
Submitted 5 September, 2023;
originally announced September 2023.