Skip to main content

Showing 1–50 of 84 results for author: Shu, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14043  [pdf, other

    cs.IR cs.CL

    Taxonomy-Guided Zero-Shot Recommendations with LLMs

    Authors: Yueqing Liang, Liangwei Yang, Chen Wang, Xiongxiao Xu, Philip S. Yu, Kai Shu

    Abstract: With the emergence of large language models (LLMs) and their ability to perform a variety of tasks, their application in recommender systems (RecSys) has shown promise. However, we are facing significant challenges when deploying LLMs into RecSys, such as limited prompt length, unstructured item information, and un-constrained generation of recommendations, leading to sub-optimal performance. To a… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2405.18776  [pdf, other

    cs.CR cs.CL cs.LG

    LMO-DP: Optimizing the Randomization Mechanism for Differentially Private Fine-Tuning (Large) Language Models

    Authors: Qin Yang, Meisam Mohammad, Han Wang, Ali Payani, Ashish Kundu, Kai Shu, Yan Yan, Yuan Hong

    Abstract: Differentially Private Stochastic Gradient Descent (DP-SGD) and its variants have been proposed to ensure rigorous privacy for fine-tuning large-scale pre-trained language models. However, they rely heavily on the Gaussian mechanism, which may overly perturb the gradients and degrade the accuracy, especially in stronger privacy regimes (e.g., the privacy budget $ε< 3$). To address such limitations… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 18 pages, 15 figures

  3. arXiv:2404.14757  [pdf, other

    cs.LG cs.AI

    Integrating Mamba and Transformer for Long-Short Range Time Series Forecasting

    Authors: Xiongxiao Xu, Yueqing Liang, Baixiang Huang, Zhiling Lan, Kai Shu

    Abstract: Time series forecasting is an important problem and plays a key role in a variety of applications including weather forecasting, stock market, and scientific simulations. Although transformers have proven to be effective in capturing dependency, its quadratic complexity of attention mechanism prevents its further adoption in long-range time series forecasting, thus limiting them attend to short-ra… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  4. arXiv:2403.09747  [pdf, other

    cs.CL cs.AI

    Re-Search for The Truth: Multi-round Retrieval-augmented Large Language Models are Strong Fake News Detectors

    Authors: Guanghua Li, Wensheng Lu, Wei Zhang, Defu Lian, Kezhong Lu, Rui Mao, Kai Shu, Hao Liao

    Abstract: The proliferation of fake news has had far-reaching implications on politics, the economy, and society at large. While Fake news detection methods have been employed to mitigate this issue, they primarily depend on two essential elements: the quality and relevance of the evidence, and the effectiveness of the verdict prediction mechanism. Traditional methods, which often source information from st… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  5. arXiv:2403.08213  [pdf, other

    cs.CL

    Can Large Language Models Identify Authorship?

    Authors: Baixiang Huang, Canyu Chen, Kai Shu

    Abstract: The ability to accurately identify authorship is crucial for verifying content authenticity and mitigating misinformation. Large Language Models (LLMs) have demonstrated exceptional capacity for reasoning and problem-solving. However, their potential in authorship analysis, encompassing authorship verification and attribution, remains underexplored. This paper conducts a comprehensive evaluation o… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 10 pages, 7 figures

  6. arXiv:2402.04559  [pdf, other

    cs.AI cs.CL cs.HC

    Can Large Language Model Agents Simulate Human Trust Behaviors?

    Authors: Chengxing Xie, Canyu Chen, Feiran Jia, Ziyu Ye, Kai Shu, Adel Bibi, Ziniu Hu, Philip Torr, Bernard Ghanem, Guohao Li

    Abstract: Large Language Model (LLM) agents have been increasingly adopted as simulation tools to model humans in applications such as social science. However, one fundamental question remains: can LLM agents really simulate human behaviors? In this paper, we focus on one of the most critical behaviors in human interactions, trust, and aim to investigate whether or not LLM agents can simulate human trust be… ▽ More

    Submitted 10 March, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: The first two authors contributed equally. Project website: https://www.camel-ai.org/research/agent-trust

  7. arXiv:2401.15496  [pdf, other

    cs.CL cs.AI cs.LG

    Baichuan2-Sum: Instruction Finetune Baichuan2-7B Model for Dialogue Summarization

    Authors: Jianfei Xiao, Yancan Chen, Yimin Ou, Hanyi Yu, Kai Shu, Yiyong Xiao

    Abstract: Large language models (LLMs) like Llama, Baichuan and Bloom models show remarkable ability with instruction fine-tuning in many natural language tasks. Nevertheless, for the dialogue summarization task, which aims to generate summaries for different roles in dialogue, most of the state-of-the-art methods conduct on small models (e.g Bart and Bert). Existing methods try to add task specified optimi… ▽ More

    Submitted 3 April, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

  8. arXiv:2401.09090  [pdf, other

    cs.CY

    Understanding the concerns and choices of public when using large language models for healthcare

    Authors: Yunpeng Xiao, Kyrie Zhixuan Zhou, Yueqing Liang, Kai Shu

    Abstract: Large language models (LLMs) have shown their potential in biomedical fields. However, how the public uses them for healthcare purposes such as medical Q\&A, self-diagnosis, and daily healthcare information seeking is under-investigated. In this paper, we adopt a mixed-methods approach, including surveys (N=167) and interviews (N=17) to investigate how and why the public uses LLMs for healthcare.… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: 22 pages

    ACM Class: J.4; K.4.2

  9. arXiv:2401.05561  [pdf, other

    cs.CL

    TrustLLM: Trustworthiness in Large Language Models

    Authors: Lichao Sun, Yue Huang, Haoran Wang, Siyuan Wu, Qihui Zhang, Yuan Li, Chujie Gao, Yixin Huang, Wenhan Lyu, Yixuan Zhang, Xiner Li, Zhengliang Liu, Yixin Liu, Yijue Wang, Zhikun Zhang, Bertie Vidgen, Bhavya Kailkhura, Caiming Xiong, Chaowei Xiao, Chunyuan Li, Eric Xing, Furong Huang, Hao Liu, Heng Ji, Hongyi Wang , et al. (45 additional authors not shown)

    Abstract: Large language models (LLMs), exemplified by ChatGPT, have gained considerable attention for their excellent natural language processing capabilities. Nonetheless, these LLMs present many challenges, particularly in the realm of trustworthiness. Therefore, ensuring the trustworthiness of LLMs emerges as an important topic. This paper introduces TrustLLM, a comprehensive study of trustworthiness in… ▽ More

    Submitted 17 March, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: This work is still under work and we welcome your contribution

  10. arXiv:2311.11473   

    cs.LG cs.AI

    CSGNN: Conquering Noisy Node labels via Dynamic Class-wise Selection

    Authors: Yifan Li, Zhen Tan, Kai Shu, Zongsheng Cao, Yu Kong, Huan Liu

    Abstract: Graph Neural Networks (GNNs) have emerged as a powerful tool for representation learning on graphs, but they often suffer from overfitting and label noise issues, especially when the data is scarce or imbalanced. Different from the paradigm of previous methods that rely on single-node confidence, in this paper, we introduce a novel Class-wise Selection for Graph Neural Networks, dubbed CSGNN, whic… ▽ More

    Submitted 14 December, 2023; v1 submitted 19 November, 2023; originally announced November 2023.

    Comments: For the privacy issue

  11. arXiv:2311.09433  [pdf, other

    cs.CR cs.AI cs.CL

    Backdoor Activation Attack: Attack Large Language Models using Activation Steering for Safety-Alignment

    Authors: Haoran Wang, Kai Shu

    Abstract: To ensure AI safety, instruction-tuned Large Language Models (LLMs) are specifically trained to ensure alignment, which refers to making models behave in accordance with human intentions. While these models have demonstrated commendable results on various safety benchmarks, the vulnerability of their safety alignment has not been extensively studied. This is particularly troubling given the potent… ▽ More

    Submitted 24 November, 2023; v1 submitted 15 November, 2023; originally announced November 2023.

  12. arXiv:2311.09428  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Beyond Detection: Unveiling Fairness Vulnerabilities in Abusive Language Models

    Authors: Yueqing Liang, Lu Cheng, Ali Payani, Kai Shu

    Abstract: This work investigates the potential of undermining both fairness and detection performance in abusive language detection. In a dynamic and complex digital world, it is crucial to investigate the vulnerabilities of these detection models to adversarial fairness attacks to improve their fairness robustness. We propose a simple yet effective framework FABLE that leverages backdoor attacks as they al… ▽ More

    Submitted 5 December, 2023; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: Under review

  13. arXiv:2311.05656  [pdf, other

    cs.CY

    Combating Misinformation in the Age of LLMs: Opportunities and Challenges

    Authors: Canyu Chen, Kai Shu

    Abstract: Misinformation such as fake news and rumors is a serious threat on information ecosystems and public trust. The emergence of Large Language Models (LLMs) has great potential to reshape the landscape of combating misinformation. Generally, LLMs can be a double-edged sword in the fight. On the one hand, LLMs bring promising opportunities for combating misinformation due to their profound world knowl… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 9 pages for the main paper, 35 pages including 656 references, more resources on "LLMs Meet Misinformation" are on the website: https://llm-misinformation.github.io/

  14. arXiv:2310.10429  [pdf, other

    cs.CL cs.CY cs.SI

    Exploiting User Comments for Early Detection of Fake News Prior to Users' Commenting

    Authors: Qiong Nan, Qiang Sheng, Juan Cao, Yongchun Zhu, Danding Wang, Guang Yang, **tao Li, Kai Shu

    Abstract: Both accuracy and timeliness are key factors in detecting fake news on social media. However, most existing methods encounter an accuracy-timeliness dilemma: Content-only methods guarantee timeliness but perform moderately because of limited available information, while social context-based ones generally perform better but inevitably lead to latency because of social context accumulation needs. T… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 14 pages, 8 figures, 8 tables

  15. arXiv:2310.05253  [pdf, other

    cs.CL cs.AI cs.LG

    Explainable Claim Verification via Knowledge-Grounded Reasoning with Large Language Models

    Authors: Haoran Wang, Kai Shu

    Abstract: Claim verification plays a crucial role in combating misinformation. While existing works on claim verification have shown promising results, a crucial piece of the puzzle that remains unsolved is to understand how to verify claims without relying on human-annotated data, which is expensive to create at a large scale. Additionally, it is important for models to provide comprehensive explanations t… ▽ More

    Submitted 19 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP 2023

  16. arXiv:2309.13788  [pdf, other

    cs.CL cs.AI cs.CR cs.HC cs.LG

    Can LLM-Generated Misinformation Be Detected?

    Authors: Canyu Chen, Kai Shu

    Abstract: The advent of Large Language Models (LLMs) has made a transformative impact. However, the potential that LLMs such as ChatGPT can be exploited to generate misinformation has posed a serious concern to online safety and public trust. A fundamental research question is: will LLM-generated misinformation cause more harm than human-written misinformation? We propose to tackle this question from the pe… ▽ More

    Submitted 23 April, 2024; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: Accepted to Proceedings of ICLR 2024. 9 pages for main paper, 40 pages including appendix. The code, results, dataset for this paper and more resources on "LLMs Meet Misinformation" have been released on the project website: https://llm-misinformation.github.io/

  17. arXiv:2309.12363  [pdf, other

    cs.CY cs.AI

    Investigating Online Financial Misinformation and Its Consequences: A Computational Perspective

    Authors: Aman Rangapur, Haoran Wang, Kai Shu

    Abstract: The rapid dissemination of information through digital platforms has revolutionized the way we access and consume news and information, particularly in the realm of finance. However, this digital age has also given rise to an alarming proliferation of financial misinformation, which can have detrimental effects on individuals, markets, and the overall economy. This research paper aims to provide a… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: 32 pages, 2 figures

    ACM Class: A.1; I.m

  18. arXiv:2309.08793  [pdf, other

    cs.AI cs.CE cs.LG

    Fin-Fact: A Benchmark Dataset for Multimodal Financial Fact Checking and Explanation Generation

    Authors: Aman Rangapur, Haoran Wang, Ling Jian, Kai Shu

    Abstract: Fact-checking in financial domain is under explored, and there is a shortage of quality dataset in this domain. In this paper, we propose Fin-Fact, a benchmark dataset for multimodal fact-checking within the financial domain. Notably, it includes professional fact-checker annotations and justifications, providing expertise and credibility. With its multimodal nature encompassing both textual and v… ▽ More

    Submitted 1 May, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: 8 pages, 4 figures, 4 tables

    ACM Class: I.2; E.m

  19. arXiv:2306.15231  [pdf, other

    cs.CL

    Emulating Reader Behaviors for Fake News Detection

    Authors: Junwei Yin, Min Gao, Kai Shu, Zehua Zhao, Yinqiu Huang, Jia Wang

    Abstract: The wide dissemination of fake news has affected our lives in many aspects, making fake news detection important and attracting increasing attention. Existing approaches make substantial contributions in this field by modeling news from a single-modal or multi-modal perspective. However, these modal-based methods can result in sub-optimal outcomes as they ignore reader behaviors in news consumptio… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: 12 pages

  20. MUSER: A MUlti-Step Evidence Retrieval Enhancement Framework for Fake News Detection

    Authors: Hao Liao, Jiaohao Peng, Zhanyi Huang, Wei Zhang, Guanghua Li, Kai Shu, Xing Xie

    Abstract: The ease of spreading false information online enables individuals with malicious intent to manipulate public opinion and destabilize social stability. Recently, fake news detection based on evidence retrieval has gained popularity in an effort to identify fake news reliably and reduce its impact. Evidence retrieval-based methods can improve the reliability of fake news detection by computing the… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: 12 pages, 5 figures, accepted by KDD '23, ADS track

    Journal ref: KDD 2023

  21. arXiv:2306.08256  [pdf, other

    eess.SP cs.LG

    Data Augmentation for Seizure Prediction with Generative Diffusion Model

    Authors: Kai Shu, Yuchang Zhao, Le Wu, Ai** Liu, Ruobing Qian, Xun Chen

    Abstract: Objective: Seizure prediction is of great importance to improve the life of patients. The focal point is to distinguish preictal states from interictal ones. With the development of machine learning, seizure prediction methods have achieved significant progress. However, the severe imbalance problem between preictal and interictal data still poses a great challenge, restricting the performance of… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: 12 pages, 6 figures

  22. arXiv:2305.19552  [pdf, other

    cs.CY

    Investigating Gender Euphoria and Dysphoria on TikTok: Characterization and Comparison

    Authors: SJ Dillon, Yueqing Liang, H. Russell Bernard, Kai Shu

    Abstract: With the emergence of short video-sharing platforms, engagement with social media sites devoted to opinion and knowledge dissemination has rapidly increased. Among the short video platforms, TikTok is one of the most popular globally and has become the platform of choice for transgender and nonbinary individuals, who have formed a large community to mobilize personal experience and exchange inform… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  23. arXiv:2305.10668  [pdf, other

    cs.LG cs.AI cs.CR cs.SI

    MetaGAD: Learning to Meta Transfer for Few-shot Graph Anomaly Detection

    Authors: Xiongxiao Xu, Kaize Ding, Canyu Chen, Kai Shu

    Abstract: Graph anomaly detection has long been an important problem in various domains pertaining to information security such as financial fraud, social spam, network intrusion, etc. The majority of existing methods are performed in an unsupervised manner, as labeled anomalies in a large scale are often too expensive to acquire. However, the identified anomalies may turn out to be data noises or uninteres… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  24. arXiv:2304.08640  [pdf, other

    cs.LG cs.SI

    TAP: A Comprehensive Data Repository for Traffic Accident Prediction in Road Networks

    Authors: Baixiang Huang, Bryan Hooi, Kai Shu

    Abstract: Road safety is a major global public health concern. Effective traffic crash prediction can play a critical role in reducing road traffic accidents. However, Existing machine learning approaches tend to focus on predicting traffic accidents in isolation, without considering the potential relationships between different accident locations within road networks. To incorporate graph structure informa… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 10 pages, 5 figures

  25. arXiv:2302.07363  [pdf, other

    cs.SI

    Attacking Fake News Detectors via Manipulating News Social Engagement

    Authors: Haoran Wang, Yingtong Dou, Canyu Chen, Lichao Sun, Philip S. Yu, Kai Shu

    Abstract: Social media is one of the main sources for news consumption, especially among the younger generation. With the increasing popularity of news consumption on various social media platforms, there has been a surge of misinformation which includes false information or unfounded claims. As various text- and social context-based fake news detectors are proposed to detect misinformation on social media,… ▽ More

    Submitted 27 April, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: ACM Web Conference 2023 (WWW'23)

  26. arXiv:2212.12621  [pdf, other

    cs.SI cs.LG

    Nothing Stands Alone: Relational Fake News Detection with Hypergraph Neural Networks

    Authors: Ujun Jeong, Kaize Ding, Lu Cheng, Ruocheng Guo, Kai Shu, Huan Liu

    Abstract: Nowadays, fake news easily propagates through online social networks and becomes a grand threat to individuals and society. Assessing the authenticity of news is challenging due to its elaborately fabricated contents, making it difficult to obtain large-scale annotations for fake news data. Due to such data scarcity issues, detecting fake news tends to fail and overfit in the supervised setting. R… ▽ More

    Submitted 23 December, 2022; originally announced December 2022.

    Comments: Accepted in IEEE Big Data 22

  27. arXiv:2211.05289  [pdf, ps, other

    cs.SI cs.AI cs.CY

    Combating Health Misinformation in Social Media: Characterization, Detection, Intervention, and Open Issues

    Authors: Canyu Chen, Haoran Wang, Matthew Shapiro, Yunyu Xiao, Fei Wang, Kai Shu

    Abstract: Social media has been one of the main information consumption sources for the public, allowing people to seek and spread information more quickly and easily. However, the rise of various social media platforms also enables the proliferation of online misinformation. In particular, misinformation in the health domain has significant impacts on our society such as the COVID-19 infodemic. Therefore,… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: 31 pages, 241 references

  28. arXiv:2211.03721  [pdf, other

    cs.CL cs.AI

    Streaming, fast and accurate on-device Inverse Text Normalization for Automatic Speech Recognition

    Authors: Yashesh Gaur, Nick Kibre, Jian Xue, Kangyuan Shu, Yuhui Wang, Issac Alphanso, **yu Li, Yifan Gong

    Abstract: Automatic Speech Recognition (ASR) systems typically yield output in lexical form. However, humans prefer a written form output. To bridge this gap, ASR systems usually employ Inverse Text Normalization (ITN). In previous works, Weighted Finite State Transducers (WFST) have been employed to do ITN. WFSTs are nicely suited to this task but their size and run-time costs can make deployment on embe… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: 8 pages. 6 page paper 2 page references

  29. arXiv:2207.08336  [pdf, other

    cs.LG cs.AI cs.CR cs.CY

    When Fairness Meets Privacy: Fair Classification with Semi-Private Sensitive Attributes

    Authors: Canyu Chen, Yueqing Liang, Xiongxiao Xu, Shangyu Xie, Ashish Kundu, Ali Payani, Yuan Hong, Kai Shu

    Abstract: Machine learning models have demonstrated promising performance in many areas. However, the concerns that they can be biased against specific demographic groups hinder their adoption in high-stake applications. Thus, it is essential to ensure fairness in machine learning models. Most previous efforts require direct access to sensitive attributes for mitigating bias. Nonetheless, it is often infeas… ▽ More

    Submitted 29 May, 2023; v1 submitted 17 July, 2022; originally announced July 2022.

  30. Memory-Guided Multi-View Multi-Domain Fake News Detection

    Authors: Yongchun Zhu, Qiang Sheng, Juan Cao, Qiong Nan, Kai Shu, Minghui Wu, **dong Wang, Fuzhen Zhuang

    Abstract: The wide spread of fake news is increasingly threatening both individuals and society. Great efforts have been made for automatic fake news detection on a single domain (e.g., politics). However, correlations exist commonly across multiple news domains, and thus it is promising to simultaneously detect fake news of multiple domains. Based on our analysis, we pose two challenges in multi-domain fak… ▽ More

    Submitted 26 June, 2022; originally announced June 2022.

    Comments: Accepted by IEEE Transactions on Knowledge and Data Engineering (TKDE)

  31. arXiv:2206.10071  [pdf, other

    cs.LG cs.SI

    BOND: Benchmarking Unsupervised Outlier Node Detection on Static Attributed Graphs

    Authors: Kay Liu, Yingtong Dou, Yue Zhao, Xueying Ding, Xiyang Hu, Ruitong Zhang, Kaize Ding, Canyu Chen, Hao Peng, Kai Shu, Lichao Sun, Jundong Li, George H. Chen, Zhihao Jia, Philip S. Yu

    Abstract: Detecting which nodes in graphs are outliers is a relatively new machine learning task with numerous applications. Despite the proliferation of algorithms developed in recent years for this task, there has been no standard comprehensive setting for performance evaluation. Consequently, it has been difficult to understand which methods work well and when under a broad range of settings. To bridge t… ▽ More

    Submitted 15 October, 2022; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022. Benchmark available at https://github.com/pygod-team/pygod/tree/main/benchmark

  32. Fair Classification via Domain Adaptation: A Dual Adversarial Learning Approach

    Authors: Yueqing Liang, Canyu Chen, Tian Tian, Kai Shu

    Abstract: Modern machine learning (ML) models are becoming increasingly popular and are widely used in decision-making systems. However, studies have shown critical issues of ML discrimination and unfairness, which hinder their adoption on high-stake applications. Recent research on fair classifiers has drawn significant attention to develo** effective algorithms to achieve fairness and good classificatio… ▽ More

    Submitted 30 May, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

  33. arXiv:2205.12401  [pdf, other

    cs.LG cs.AI

    Reward Uncertainty for Exploration in Preference-based Reinforcement Learning

    Authors: Xinran Liang, Katherine Shu, Kimin Lee, Pieter Abbeel

    Abstract: Conveying complex objectives to reinforcement learning (RL) agents often requires meticulous reward engineering. Preference-based RL methods are able to learn a more flexible reward model based on human preferences by actively incorporating human feedback, i.e. teacher's preferences between two clips of behaviors. However, poor feedback-efficiency still remains a problem in current preference-base… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: ICLR 2022. Last two authors advised equally

  34. arXiv:2205.09229  [pdf, other

    cs.CL cs.AI

    PromptDA: Label-guided Data Augmentation for Prompt-based Few-shot Learners

    Authors: Canyu Chen, Kai Shu

    Abstract: Recent advances in large pre-trained language models (PLMs) lead to impressive gains in natural language understanding (NLU) tasks with task-specific fine-tuning. However, directly fine-tuning PLMs heavily relies on sufficient labeled training instances, which are usually hard to obtain. Prompt-based tuning on PLMs has shown to be powerful for various downstream few-shot tasks. Existing works stud… ▽ More

    Submitted 22 March, 2023; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: Accepted to Proceedings of EACL 2023 main conference. Code is available at https://github.com/canyuchen/PromptDA

  35. Characterizing Multi-Domain False News and Underlying User Effects on Chinese Weibo

    Authors: Qiang Sheng, Juan Cao, H. Russell Bernard, Kai Shu, **tao Li, Huan Liu

    Abstract: False news that spreads on social media has proliferated over the past years and has led to multi-aspect threats in the real world. While there are studies of false news on specific domains (like politics or health care), little work is found comparing false news across domains. In this article, we investigate false news across nine domains on Weibo, the largest Twitter-like social media platform… ▽ More

    Submitted 21 June, 2022; v1 submitted 6 May, 2022; originally announced May 2022.

    Comments: 22 pages, 5 figures, and 12 tables. Accepted by Information Processing and Management (IP&M)

  36. Domain Adaptive Fake News Detection via Reinforcement Learning

    Authors: Ahmadreza Mosallanezhad, Mansooreh Karami, Kai Shu, Michelle V. Mancenido, Huan Liu

    Abstract: With social media being a major force in information consumption, accelerated propagation of fake news has presented new challenges for platforms to distinguish between legitimate and fake news. Effective fake news detection is a non-trivial task due to the diverse nature of news domains and expensive annotation costs. In this work, we address the limitations of existing automated fake news detect… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: Accepted to The ACM Web Conference (WWW) 2022

  37. "This is Fake! Shared it by Mistake": Assessing the Intent of Fake News Spreaders

    Authors: Xinyi Zhou, Kai Shu, Vir V. Phoha, Huan Liu, Reza Zafarani

    Abstract: Individuals can be misled by fake news and spread it unintentionally without knowing it is false. This phenomenon has been frequently observed but has not been investigated. Our aim in this work is to assess the intent of fake news spreaders. To distinguish between intentional versus unintentional spreading, we study the psychological explanations of unintentional spreading. With this foundation,… ▽ More

    Submitted 13 February, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: Accepted to The ACM Web Conference (WWW) 2022

  38. arXiv:2110.05644  [pdf, ps, other

    cs.DM

    On the computational equivalence of co-NP refutations of a matrix being a P-matrix

    Authors: Spencer Gordon, Kevin Shu

    Abstract: A P-matrix is a square matrix $X$ such that all principal submatrices of $X$ have positive determinant. Such matrices appear naturally in instances of the linear complementarity problem, where these are precisely the matrices for which the corresponding linear complementarity problem has a unique solution for any input vector. Testing whether or not a square matrix is a P-matrix is co-NP compl… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

  39. arXiv:2110.00911  [pdf, other

    cs.LG cs.AI

    Enhancing Model Robustness and Fairness with Causality: A Regularization Approach

    Authors: Zhao Wang, Kai Shu, Aron Culotta

    Abstract: Recent work has raised concerns on the risk of spurious correlations and unintended biases in statistical machine learning models that threaten model robustness and fairness. In this paper, we propose a simple and intuitive regularization approach to integrate causal knowledge during model training and build a robust and fair model by emphasizing causal features and de-emphasizing spurious feature… ▽ More

    Submitted 2 October, 2021; originally announced October 2021.

  40. A Survey of COVID-19 Misinformation: Datasets, Detection Techniques and Open Issues

    Authors: A. R. Sana Ullah, Anupam Das, Anik Das, Muhammad Ashad Kabir, Kai Shu

    Abstract: Misinformation during pandemic situations like COVID-19 is growing rapidly on social media and other platforms. This expeditious growth of misinformation creates adverse effects on the people living in the society. Researchers are trying their best to mitigate this problem using different approaches based on Machine Learning (ML), Deep Learning (DL), and Natural Language Processing (NLP). This sur… ▽ More

    Submitted 24 October, 2021; v1 submitted 2 October, 2021; originally announced October 2021.

    Comments: 43 pages, 6 figures

    Journal ref: Social Network Analysis and Mining, 2022

  41. arXiv:2108.12603  [pdf, other

    cs.CL cs.LG

    WALNUT: A Benchmark on Semi-weakly Supervised Learning for Natural Language Understanding

    Authors: Guoqing Zheng, Giannis Karamanolakis, Kai Shu, Ahmed Hassan Awadallah

    Abstract: Building machine learning models for natural language understanding (NLU) tasks relies heavily on labeled data. Weak supervision has been proven valuable when large amount of labeled data is unavailable or expensive to obtain. Existing works studying weak supervision for NLU either mostly focus on a specific task or simulate weak supervision signals from ground-truth labels. It is thus hard to com… ▽ More

    Submitted 22 May, 2022; v1 submitted 28 August, 2021; originally announced August 2021.

    Comments: Accepted to NAACL 2022 (Long Paper)

  42. arXiv:2106.07564  [pdf

    cs.CV cs.LG eess.IV

    An optimized Capsule-LSTM model for facial expression recognition with video sequences

    Authors: Siwei Liu, Yuanpeng Long, Gao Xu, Lijia Yang, Shimei Xu, Xiaoming Yao, Kunxian Shu

    Abstract: To overcome the limitations of convolutional neural network in the process of facial expression recognition, a facial expression recognition model Capsule-LSTM based on video frame sequence is proposed. This model is composed of three networks includingcapsule encoders, capsule decoders and LSTM network. The capsule encoder extracts the spatial information of facial expressions in video frames. Ca… ▽ More

    Submitted 27 May, 2021; originally announced June 2021.

    Comments: 14pages,4 figurews

  43. arXiv:2106.07563  [pdf

    cs.CV cs.LG eess.IV

    BPLF: A Bi-Parallel Linear Flow Model for Facial Expression Generation from Emotion Set Images

    Authors: Gao Xu, Yuanpeng Long, Siwei Liu, Lijia Yang, Shimei Xu, Xiaoming Yao, Kunxian Shu

    Abstract: The flow-based generative model is a deep learning generative model, which obtains the ability to generate data by explicitly learning the data distribution. Theoretically its ability to restore data is stronger than other generative models. However, its implementation has many limitations, including limited model design, too many model parameters and tedious calculation. In this paper, a bi-paral… ▽ More

    Submitted 27 May, 2021; originally announced June 2021.

    Comments: 20 pages, 10 figures

  44. arXiv:2106.04716  [pdf, other

    cs.LG

    Labeled Data Generation with Inexact Supervision

    Authors: Enyan Dai, Kai Shu, Yiwei Sun, Suhang Wang

    Abstract: The recent advanced deep learning techniques have shown the promising results in various domains such as computer vision and natural language processing. The success of deep neural networks in supervised learning heavily relies on a large amount of labeled data. However, obtaining labeled data with target labels is often challenging due to various reasons such as cost of labeling and privacy issue… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

  45. Towards Fair Classifiers Without Sensitive Attributes: Exploring Biases in Related Features

    Authors: Tianxiang Zhao, Enyan Dai, Kai Shu, Suhang Wang

    Abstract: Despite the rapid development and great success of machine learning models, extensive studies have exposed their disadvantage of inheriting latent discrimination and societal bias from the training data. This phenomenon hinders their adoption on high-stake applications. Thus, many efforts have been taken for develo** fair machine learning models. Most of them require that sensitive attributes ar… ▽ More

    Submitted 28 December, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

    Journal ref: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining (WSDM 2022)

  46. arXiv:2104.12259  [pdf, other

    cs.SI cs.CL

    User Preference-aware Fake News Detection

    Authors: Yingtong Dou, Kai Shu, Congying Xia, Philip S. Yu, Lichao Sun

    Abstract: Disinformation and fake news have posed detrimental effects on individuals and society in recent years, attracting broad attention to fake news detection. The majority of existing fake news detection algorithms focus on mining news content and/or the surrounding exogenous context for discovering deceptive signals; while the endogenous preference of a user when he/she decides to spread a piece of f… ▽ More

    Submitted 25 April, 2021; originally announced April 2021.

    Comments: Accepted by SIGIR'21. Code is available at https://github.com/safe-graph/GNN-FakeNews

  47. arXiv:2103.02834  [pdf, ps, other

    cs.IT math.PR

    Causal Channels

    Authors: Kevin Shu

    Abstract: We consider causal models with two observed variables and one latent variables, each variable being discrete, with the goal of characterizing the possible distributions on outcomes that can result from controlling one of the observed variables. We optimize linear functions over the space of all possible interventional distributions, which allows us find properties of the interventional distributio… ▽ More

    Submitted 4 March, 2021; originally announced March 2021.

  48. arXiv:2012.04778  [pdf, other

    cs.CL

    Fact-Enhanced Synthetic News Generation

    Authors: Kai Shu, Yichuan Li, Kaize Ding, Huan Liu

    Abstract: The advanced text generation methods have witnessed great success in text summarization, language translation, and synthetic news generation. However, these techniques can be abused to generate disinformation and fake news. To better understand the potential threats of synthetic news, we develop a new generation method FactGen to generate high-quality news content. The existing text generation met… ▽ More

    Submitted 12 December, 2020; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: AAAI 2021 Preprint Version

  49. arXiv:2011.04088  [pdf, other

    cs.SI cs.CY

    MM-COVID: A Multilingual and Multimodal Data Repository for Combating COVID-19 Disinformation

    Authors: Yichuan Li, Bohan Jiang, Kai Shu, Huan Liu

    Abstract: The COVID-19 epidemic is considered as the global health crisis of the whole society and the greatest challenge mankind faced since World War Two. Unfortunately, the fake news about COVID-19 is spreading as fast as the virus itself. The incorrect health measurements, anxiety, and hate speeches will have bad consequences on people's physical health, as well as their mental health in the whole world… ▽ More

    Submitted 23 November, 2020; v1 submitted 8 November, 2020; originally announced November 2020.

  50. arXiv:2011.01579  [pdf, other

    cs.SI

    Fake News Detection through Graph Comment Advanced Learning

    Authors: Hao Liao, Qixin Liu, Kai Shu, Xing xie

    Abstract: Disinformation has long been regarded as a severe social problem, where fake news is one of the most representative issues. What is worse, today's highly developed social media makes fake news widely spread at incredible speed, bringing in substantial harm to various aspects of human life. Yet, the popularity of social media also provides opportunities to better detect fake news. Unlike convention… ▽ More

    Submitted 29 January, 2021; v1 submitted 3 November, 2020; originally announced November 2020.