Skip to main content

Showing 1–50 of 445 results for author: Yang, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19414  [pdf, other

    q-fin.ST cs.LG q-fin.PR stat.AP stat.ML stat.OT

    Stock Volume Forecasting with Advanced Information by Conditional Variational Auto-Encoder

    Authors: Parley R Yang, Alexander Y Shestopaloff

    Abstract: We demonstrate the use of Conditional Variational Encoder (CVAE) to improve the forecasts of daily stock volume time series in both short and long term forecasting tasks, with the use of advanced information of input variables such as rebalancing dates. CVAE generates non-linear time series as out-of-sample forecasts, which have better accuracy and closer fit of correlation to the actual data, com… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2406.17624  [pdf, other

    cs.CL cs.AI

    Self-assessment, Exhibition, and Recognition: a Review of Personality in Large Language Models

    Authors: Zhiyuan Wen, Yu Yang, Jiannong Cao, Haoming Sun, Ruosong Yang, Shuaiqi Liu

    Abstract: As large language models (LLMs) appear to behave increasingly human-like in text-based interactions, more and more researchers become interested in investigating personality in LLMs. However, the diversity of psychological personality research and the rapid development of LLMs have led to a broad yet fragmented landscape of studies in this interdisciplinary field. Extensive studies across differen… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.17274  [pdf, other

    cs.CL cs.LG

    Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization?

    Authors: Jianfeng He, Runing Yang, Linlin Yu, Changbin Li, Ruoxi Jia, Feng Chen, Ming **, Chang-Tien Lu

    Abstract: Text summarization, a key natural language generation (NLG) task, is vital in various domains. However, the high cost of inaccurate summaries in risk-critical applications, particularly those involving human-in-the-loop decision-making, raises concerns about the reliability of uncertainty estimation on text summarization (UE-TS) evaluation methods. This concern stems from the dependency of uncerta… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 63 pages, 41 figures, 11 tables

  4. arXiv:2406.13369  [pdf, other

    cs.LG cs.SI

    Effective Edge-wise Representation Learning in Edge-Attributed Bipartite Graphs

    Authors: Hewen Wang, Renchi Yang, Xiaokui Xiao

    Abstract: Graph representation learning (GRL) is to encode graph elements into informative vector representations, which can be used in downstream tasks for analyzing graph-structured data and has seen extensive applications in various domains. However, the majority of extant studies on GRL are geared towards generating node representations, which cannot be readily employed to perform edge-based analytics t… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 11 pages. Full version of the research paper accepted to KDD 2024

  5. arXiv:2406.12556  [pdf, other

    cs.NI

    Towards Deep Application-Network Integration: Architectures, Progress and Opportunities

    Authors: Berta Serracanta, Kai Gao, Jordi Ros-Giralt, Alberto Rodriguez-Natal, Luis M. Contreras, Richard Yang, Albert Cabellos

    Abstract: With the rise of a new generation of applications (e.g., virtual and augmented reality, artificial intelligence, etc) demanding stringent performance requirements, the need for networking solutions and architectures that can enable a higher Quality of Experience (QoE) is becoming increasingly important. While jointly optimizing application and network may increase the applications' QoE and simul… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  6. arXiv:2406.12449  [pdf

    cs.AI

    Retrieval-Augmented Generation for Generative Artificial Intelligence in Medicine

    Authors: Rui Yang, Yilin Ning, Emilia Keppo, Mingxuan Liu, Chuan Hong, Danielle S Bitterman, Jasmine Chiat Ling Ong, Daniel Shu Wei Ting, Nan Liu

    Abstract: Generative artificial intelligence (AI) has brought revolutionary innovations in various fields, including medicine. However, it also exhibits limitations. In response, retrieval-augmented generation (RAG) provides a potential solution, enabling models to generate more accurate contents by leveraging the retrieval of external knowledge. With the rapid advancement of generative AI, RAG can pave the… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  7. arXiv:2406.12367  [pdf, other

    cs.CV cs.LG cs.MM

    Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines

    Authors: Honglei Zhang, Jukka I. Ahonen, Nam Le, Ruiying Yang, Francesco Cricri

    Abstract: This paper investigates the efficacy of jointly optimizing content-specific post-processing filters to adapt a human oriented video/image codec into a codec suitable for machine vision tasks. By observing that artifacts produced by video/image codecs are content-dependent, we propose a novel training strategy based on competitive learning principles. This strategy assigns training samples to filte… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted to be preseneted in ICIP 2024

  8. arXiv:2406.12053  [pdf, other

    cs.CL

    InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States

    Authors: Mohammad Beigi, Ying Shen, Runing Yang, Zihao Lin, Qifan Wang, Ankith Mohan, Jianfeng He, Ming **, Chang-Tien Lu, Lifu Huang

    Abstract: Despite their vast capabilities, Large Language Models (LLMs) often struggle with generating reliable outputs, frequently producing high-confidence inaccuracies known as hallucinations. Addressing this challenge, our research introduces InternalInspector, a novel framework designed to enhance confidence estimation in LLMs by leveraging contrastive learning on internal states including attention st… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 8 pages

  9. arXiv:2406.10216  [pdf, other

    cs.CL cs.AI

    Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs

    Authors: Rui Yang, Ruomeng Ding, Yong Lin, Huan Zhang, Tong Zhang

    Abstract: Reward models trained on human preference data have been proven to be effective for aligning Large Language Models (LLMs) with human intent within the reinforcement learning from human feedback (RLHF) framework. However, the generalization capabilities of current reward models to unseen prompts and responses are limited. This limitation can lead to an unexpected phenomenon known as reward over-opt… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 21 pages

  10. arXiv:2406.07801  [pdf, other

    cs.CL cs.SD eess.AS

    PolySpeech: Exploring Unified Multitask Speech Models for Competitiveness with Single-task Models

    Authors: Runyan Yang, Huibao Yang, Xiqing Zhang, Tiantian Ye, Ying Liu, Yingying Gao, Shilei Zhang, Chao Deng, Junlan Feng

    Abstract: Recently, there have been attempts to integrate various speech processing tasks into a unified model. However, few previous works directly demonstrated that joint optimization of diverse tasks in multitask speech models has positive influence on the performance of individual tasks. In this paper we present a multitask speech model -- PolySpeech, which supports speech recognition, speech synthesis,… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 5 pages, 2 figures

  11. arXiv:2406.05482  [pdf, other

    cs.LG

    Efficient Topology-aware Data Augmentation for High-Degree Graph Neural Networks

    Authors: Yurui Lai, Xiaoyang Lin, Renchi Yang, Hongtao Wang

    Abstract: In recent years, graph neural networks (GNNs) have emerged as a potent tool for learning on graph-structured data and won fruitful successes in varied fields. The majority of GNNs follow the message-passing paradigm, where representations of each node are learned by recursively aggregating features of its neighbors. However, this mechanism brings severe over-smoothing and efficiency issues over hi… ▽ More

    Submitted 17 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

    Comments: This is the technical report for the paper accepted to KDD 2024. 16 pages

  12. arXiv:2406.04784  [pdf, other

    cs.CL cs.AI

    SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals

    Authors: Ruihan Yang, Jiangjie Chen, Yikai Zhang, Siyu Yuan, Aili Chen, Kyle Richardson, Yanghua Xiao, Deqing Yang

    Abstract: Language agents powered by large language models (LLMs) are increasingly valuable as decision-making tools in domains such as gaming and programming. However, these agents often face challenges in achieving high-level goals without detailed instructions and in adapting to environments where feedback is delayed. In this paper, we present SelfGoal, a novel automatic approach designed to enhance agen… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Preprint

  13. arXiv:2406.02222  [pdf, other

    cs.SE

    Towards an Extensible Model-Based Digital Twin Framework for Space Launch Vehicles

    Authors: Ran Wei, Ruizhe Yang, Shijun Liu, Chongsheng Fan, Rong Zhou, Zekun Wu, Haochi Wang, Yifan Cai, Zhe Jiang

    Abstract: The concept of Digital Twin (DT) is increasingly applied to systems on different levels of abstraction across domains, to support monitoring, analysis, diagnosis, decision making and automated control. Whilst the interest in applying DT is growing, the definition of DT is unclear, neither is there a clear pathway to develop DT to fully realise its capacities. In this paper, we revise the concept o… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  14. arXiv:2406.02143  [pdf, other

    cs.CL

    Reinforcement Tuning for Detecting Stances and Debunking Rumors Jointly with Large Language Models

    Authors: Ruichao Yang, Wei Gao, **g Ma, Hongzhan Lin, Bo Wang

    Abstract: Learning multi-task models for jointly detecting stance and verifying rumors poses challenges due to the need for training data of stance at post level and rumor veracity at claim level, which are difficult to obtain. To address this issue, we leverage large language models (LLMs) as the foundation annotators for the joint stance detection (SD) and rumor verification (RV) tasks, dubbed as JSDRV. W… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: ACL 2024 (Findings)

  15. arXiv:2406.02038  [pdf, other

    cs.CV

    Leveraging Predicate and Triplet Learning for Scene Graph Generation

    Authors: Jiankai Li, Yunhong Wang, Xiefan Guo, Ruijie Yang, Weixin Li

    Abstract: Scene Graph Generation (SGG) aims to identify entities and predict the relationship triplets \textit{\textless subject, predicate, object\textgreater } in visual scenes. Given the prevalence of large visual variations of subject-object pairs even in the same predicate, it can be quite challenging to model and refine predicate representations directly across such pairs, which is however a common st… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: CVPR 2024

  16. arXiv:2406.01584  [pdf, other

    cs.CV

    SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model

    Authors: An-Chieh Cheng, Hongxu Yin, Yang Fu, Qiushan Guo, Ruihan Yang, Jan Kautz, Xiaolong Wang, Sifei Liu

    Abstract: Vision Language Models (VLMs) have demonstrated remarkable performance in 2D vision and language tasks. However, their ability to reason about spatial arrangements remains limited. In this work, we introduce Spatial Region GPT (SpatialRGPT) to enhance VLMs' spatial perception and reasoning capabilities. SpatialRGPT advances VLMs' spatial understanding through two key innovations: (1) a data curati… ▽ More

    Submitted 18 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Project Page: https://www.anjiecheng.me/SpatialRGPT

  17. arXiv:2406.01069  [pdf, other

    cs.CV

    UniQA: Unified Vision-Language Pre-training for Image Quality and Aesthetic Assessment

    Authors: Hantao Zhou, Longxiang Tang, Rui Yang, Guanyi Qin, Yan Zhang, Runze Hu, Xiu Li

    Abstract: Image Quality Assessment (IQA) and Image Aesthetic Assessment (IAA) aim to simulate human subjective perception of image visual quality and aesthetic appeal. Existing methods typically address these tasks independently due to distinct learning objectives. However, they neglect the underlying interconnectedness of both tasks, which hinders the learning of task-agnostic shared representations for hu… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  18. arXiv:2405.18959  [pdf, other

    cs.CV cs.MM

    Transcending Fusion: A Multi-Scale Alignment Method for Remote Sensing Image-Text Retrieval

    Authors: Rui Yang, Shuang Wang, Ying** Han, Yuanheng Li, Dong Zhao, Dou Quan, Yanhe Guo, Licheng Jiao

    Abstract: Remote Sensing Image-Text Retrieval (RSITR) is pivotal for knowledge services and data mining in the remote sensing (RS) domain. Considering the multi-scale representations in image content and text vocabulary can enable the models to learn richer representations and enhance retrieval. Current multi-scale RSITR approaches typically align multi-scale fused image features with text features, but ove… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 16 pages, 9 figures

  19. arXiv:2405.18525  [pdf, other

    cs.CV

    REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment

    Authors: Haonan Han, Rui Yang, Huan Liao, Jiankai Xing, Zunnan Xu, Xiaoming Yu, Junwei Zha, Xiu Li, Wanhua Li

    Abstract: Traditional image-to-3D models often struggle with scenes containing multiple objects due to biases and occlusion complexities. To address this challenge, we present REPARO, a novel approach for compositional 3D asset generation from single images. REPARO employs a two-step process: first, it extracts individual objects from the scene and reconstructs their 3D meshes using off-the-shelf image-to-3… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  20. arXiv:2405.17673  [pdf, other

    cs.CV cs.LG stat.ML

    Fast Samplers for Inverse Problems in Iterative Refinement Models

    Authors: Kushagra Pandey, Ruihan Yang, Stephan Mandt

    Abstract: Constructing fast samplers for unconditional diffusion and flow-matching models has received much attention recently; however, existing methods for solving inverse problems, such as super-resolution, inpainting, or deblurring, still require hundreds to thousands of iterative steps to obtain high-quality results. We propose a plug-and-play framework for constructing efficient samplers for inverse p… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  21. arXiv:2405.16850  [pdf, other

    eess.IV cs.CV cs.LG

    UniCompress: Enhancing Multi-Data Medical Image Compression with Knowledge Distillation

    Authors: Runzhao Yang, Yinda Chen, Zhihong Zhang, Xiaoyu Liu, Zongren Li, Kunlun He, Zhiwei Xiong, **li Suo, Qionghai Dai

    Abstract: In the field of medical image compression, Implicit Neural Representation (INR) networks have shown remarkable versatility due to their flexible compression ratios, yet they are constrained by a one-to-one fitting approach that results in lengthy encoding times. Our novel method, ``\textbf{UniCompress}'', innovatively extends the compression capabilities of INR by being the first to compress multi… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  22. arXiv:2405.16726  [pdf, other

    cs.LG

    Exploring Edge Probability Graph Models Beyond Edge Independency: Concepts, Analyses, and Algorithms

    Authors: Fanchen Bu, Ruochen Yang, Paul Bogdan, Kijung Shin

    Abstract: Desirable random graph models (RGMs) should (i) be tractable so that we can compute and control graph statistics, and (ii) generate realistic structures such as high clustering (i.e., high subgraph densities). A popular category of RGMs (e.g., Erdos-Renyi and stochastic Kronecker) outputs edge probabilities, and we need to realize (i.e., sample from) the edge probabilities to generate graphs. Typi… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  23. arXiv:2405.16376  [pdf, other

    cs.CL cs.GT

    STRIDE: A Tool-Assisted LLM Agent Framework for Strategic and Interactive Decision-Making

    Authors: Chuanhao Li, Runhan Yang, Tiankai Li, Milad Bafarassat, Kourosh Sharifi, Dirk Bergemann, Zhuoran Yang

    Abstract: Large Language Models (LLMs) like GPT-4 have revolutionized natural language processing, showing remarkable linguistic proficiency and reasoning capabilities. However, their application in strategic multi-agent decision-making environments is hampered by significant limitations including poor mathematical reasoning, difficulty in following instructions, and a tendency to generate incorrect informa… ▽ More

    Submitted 27 May, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

    Comments: 39 pages, 4 figures

  24. arXiv:2405.16030  [pdf, other

    cs.LG

    Constrained Ensemble Exploration for Unsupervised Skill Discovery

    Authors: Chenjia Bai, Rushuai Yang, Qiaosheng Zhang, Kang Xu, Yi Chen, Ting Xiao, Xuelong Li

    Abstract: Unsupervised Reinforcement Learning (RL) provides a promising paradigm for learning useful behaviors via reward-free per-training. Existing methods for unsupervised RL mainly conduct empowerment-driven skill discovery or entropy-based exploration. However, empowerment often leads to static skills, and pure exploration only maximizes the state coverage rather than learning useful behaviors. In this… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML 2024

  25. arXiv:2405.15385  [pdf, other

    cs.CV physics.med-ph

    CPT-Interp: Continuous sPatial and Temporal Motion Modeling for 4D Medical Image Interpolation

    Authors: Xia Li, Runzhao Yang, Xiangtai Li, Antony Lomax, Ye Zhang, Joachim Buhmann

    Abstract: Motion information from 4D medical imaging offers critical insights into dynamic changes in patient anatomy for clinical assessments and radiotherapy planning and, thereby, enhances the capabilities of 3D image analysis. However, inherent physical and technical constraints of imaging hardware often necessitate a compromise between temporal resolution and image quality. Frame interpolation emerges… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  26. arXiv:2405.11922  [pdf, other

    cs.SI cs.LG

    Effective Clustering on Large Attributed Bipartite Graphs

    Authors: Renchi Yang, Yidu Wu, Xiaoyang Lin, Qichen Wang, Tsz Nam Chan, Jieming Shi

    Abstract: Attributed bipartite graphs (ABGs) are an expressive data model for describing the interactions between two sets of heterogeneous nodes that are associated with rich attributes, such as customer-product purchase networks and author-paper authorship graphs. Partitioning the target node set in such graphs into k disjoint clusters (referred to as k-ABGC) finds widespread use in various domains, inclu… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: The technical report for the paper was accepted to KDD 2024. 14 pages

  27. arXiv:2405.11921  [pdf, other

    cs.CV

    MirrorGaussian: Reflecting 3D Gaussians for Reconstructing Mirror Reflections

    Authors: Jiayue Liu, Xiao Tang, Freeman Cheng, Roy Yang, Zhihao Li, Jianzhuang Liu, Yi Huang, Jiaqi Lin, Shiyong Liu, Xiaofei Wu, Songcen Xu, Chun Yuan

    Abstract: 3D Gaussian Splatting showcases notable advancements in photo-realistic and real-time novel view synthesis. However, it faces challenges in modeling mirror reflections, which exhibit substantial appearance variations from different viewpoints. To tackle this problem, we present MirrorGaussian, the first method for mirror scene reconstruction with real-time rendering based on 3D Gaussian Splatting.… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  28. arXiv:2405.11754  [pdf, other

    cs.CV

    Versatile Teacher: A Class-aware Teacher-student Framework for Cross-domain Adaptation

    Authors: Runou Yang, Tian Tian, **wen Tian

    Abstract: Addressing the challenge of domain shift between datasets is vital in maintaining model performance. In the context of cross-domain object detection, the teacher-student framework, a widely-used semi-supervised model, has shown significant accuracy improvements. However, existing methods often overlook class differences, treating all classes equally, resulting in suboptimal results. Furthermore, t… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  29. arXiv:2405.11225  [pdf, other

    cs.SI cs.AI

    SeBot: Structural Entropy Guided Multi-View Contrastive Learning for Social Bot Detection

    Authors: Yingguang Yang, Qi Wu, Buyun He, Hao Peng, Renyu Yang, Zhifeng Hao, Yong Liao

    Abstract: Recent advancements in social bot detection have been driven by the adoption of Graph Neural Networks. The social graph, constructed from social network interactions, contains benign and bot accounts that influence each other. However, previous graph-based detection methods that follow the transductive message-passing paradigm may not fully utilize hidden graph information and are vulnerable to ad… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: KDD 2024

  30. arXiv:2405.09357  [pdf, ps, other

    cs.SI physics.soc-ph

    A universal optimization framework based on cycle ranking for influence maximization in complex networks

    Authors: Wenfeng Shi, Tianlong Fan, Shuqi Xu, Rongmei Yang, Linyuan Lü

    Abstract: Influence maximization aims to identify a set of influential individuals, referred to as influencers, as information sources to maximize the spread of information within networks, constituting a vital combinatorial optimization problem with extensive practical applications and sustained interdisciplinary interest. Diverse approaches have been devised to efficiently address this issue, one of which… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  31. arXiv:2405.07800  [pdf, other

    cs.LG

    Data Imputation by Pursuing Better Classification: A Supervised Kernel-Based Method

    Authors: Ruikai Yang, Fan He, Mingzhen He, Kaijie Wang, Xiaolin Huang

    Abstract: Data imputation, the process of filling in missing feature elements for incomplete data sets, plays a crucial role in data-driven learning. A fundamental belief is that data imputation is helpful for learning performance, and it follows that the pursuit of better classification can guide the data imputation process. While some works consider using label information to assist in this task, their si… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  32. arXiv:2405.07791  [pdf, ps, other

    cs.LG cs.DC stat.ML

    Decentralized Kernel Ridge Regression Based on Data-dependent Random Feature

    Authors: Ruikai Yang, Fan He, Mingzhen He, Jie Yang, Xiaolin Huang

    Abstract: Random feature (RF) has been widely used for node consistency in decentralized kernel ridge regression (KRR). Currently, the consistency is guaranteed by imposing constraints on coefficients of features, necessitating that the random features on different nodes are identical. However, in many applications, data on different nodes varies significantly on the number or distribution, which calls for… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  33. arXiv:2405.04100  [pdf, other

    cs.CV cs.LG

    ESP: Extro-Spective Prediction for Long-term Behavior Reasoning in Emergency Scenarios

    Authors: Dingrui Wang, Zheyuan Lai, Yuda Li, Yi Wu, Yuexin Ma, Johannes Betz, Ruigang Yang, Wei Li

    Abstract: Emergent-scene safety is the key milestone for fully autonomous driving, and reliable on-time prediction is essential to maintain safety in emergency scenarios. However, these emergency scenarios are long-tailed and hard to collect, which restricts the system from getting reliable predictions. In this paper, we build a new dataset, which aims at the long-term prediction with the inconspicuous stat… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted by ICRA 2024 as Oral Presentation

  34. arXiv:2405.03371  [pdf, other

    cs.CL

    Explainable Fake News Detection With Large Language Model via Defense Among Competing Wisdom

    Authors: Bo Wang, **g Ma, Hongzhan Lin, Zhiwei Yang, Ruichao Yang, Yuan Tian, Yi Chang

    Abstract: Most fake news detection methods learn latent feature representations based on neural networks, which makes them black boxes to classify a piece of news without giving any justification. Existing explainable systems generate veracity justifications from investigative journalism, which suffer from debunking delayed and low efficiency. Recent studies simply assume that the justification is equivalen… ▽ More

    Submitted 20 June, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: 12 pages, WWW'2024

  35. arXiv:2405.01927  [pdf, other

    cs.LG

    SlotGAT: Slot-based Message Passing for Heterogeneous Graph Neural Network

    Authors: Ziang Zhou, Jieming Shi, Renchi Yang, Yuanhang Zou, Qing Li

    Abstract: Heterogeneous graphs are ubiquitous to model complex data. There are urgent needs on powerful heterogeneous graph neural networks to effectively support important applications. We identify a potential semantic mixing issue in existing message passing processes, where the representations of the neighbors of a node $v$ are forced to be transformed to the feature space of $v$ for aggregation, though… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: Published as a conference paper at ICML 2023

  36. arXiv:2405.01656  [pdf, other

    cs.CV cs.LG

    S4: Self-Supervised Sensing Across the Spectrum

    Authors: Jayanth Shenoy, Xingjian Davis Zhang, Shlok Mehrotra, Bill Tao, Rem Yang, Han Zhao, Deepak Vasisht

    Abstract: Satellite image time series (SITS) segmentation is crucial for many applications like environmental monitoring, land cover map** and agricultural crop type classification. However, training models for SITS segmentation remains a challenging task due to the lack of abundant training data, which requires fine grained annotation. We propose S4 a new self-supervised pre-training approach that signif… ▽ More

    Submitted 27 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  37. arXiv:2405.00676  [pdf, other

    cs.CV

    Spectrally Pruned Gaussian Fields with Neural Compensation

    Authors: Runyi Yang, Zhenxin Zhu, Zhou Jiang, Baijun Ye, Xiaoxue Chen, Yifei Zhang, Yuantao Chen, Jian Zhao, Hao Zhao

    Abstract: Recently, 3D Gaussian Splatting, as a novel 3D representation, has garnered attention for its fast rendering speed and high rendering quality. However, this comes with high memory consumption, e.g., a well-trained Gaussian field may utilize three million Gaussian primitives and over 700 MB of memory. We credit this high memory footprint to the lack of consideration for the relationship between pri… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Code: https://github.com/RunyiYang/SUNDAE Project page: https://runyiyang.github.io/projects/SUNDAE/

  38. arXiv:2404.18231  [pdf, other

    cs.CL cs.AI

    From Persona to Personalization: A Survey on Role-Playing Language Agents

    Authors: Jiangjie Chen, Xintao Wang, Rui Xu, Siyu Yuan, Yikai Zhang, Wei Shi, Jian Xie, Shuang Li, Ruihan Yang, Tinghui Zhu, Aili Chen, Nianqi Li, Lida Chen, Caiyu Hu, Siye Wu, Scott Ren, Ziquan Fu, Yanghua Xiao

    Abstract: Recent advancements in large language models (LLMs) have significantly boosted the rise of Role-Playing Language Agents (RPLAs), i.e., specialized AI systems designed to simulate assigned personas. By harnessing multiple advanced abilities of LLMs, including in-context learning, instruction following, and social intelligence, RPLAs achieve a remarkable sense of human likeness and vivid role-playin… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: Preprint

  39. arXiv:2404.15070  [pdf, other

    cs.SI cs.AI

    BotDGT: Dynamicity-aware Social Bot Detection with Dynamic Graph Transformers

    Authors: Buyun He, Yingguang Yang, Qi Wu, Hao Liu, Renyu Yang, Hao Peng, Xiang Wang, Yong Liao, Pengyuan Zhou

    Abstract: Detecting social bots has evolved into a pivotal yet intricate task, aimed at combating the dissemination of misinformation and preserving the authenticity of online interactions. While earlier graph-based approaches, which leverage topological structure of social networks, yielded notable outcomes, they overlooked the inherent dynamicity of social networks -- In reality, they largely depicted the… ▽ More

    Submitted 24 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: IJCAI 2024

  40. arXiv:2404.12635  [pdf, other

    cs.CV cs.CR cs.LG

    AED-PADA:Improving Generalizability of Adversarial Example Detection via Principal Adversarial Domain Adaptation

    Authors: Heqi Peng, Yunhong Wang, Ruijie Yang, Beichen Li, Rui Wang, Yuanfang Guo

    Abstract: Adversarial example detection, which can be conveniently applied in many scenarios, is important in the area of adversarial defense. Unfortunately, existing detection methods suffer from poor generalization performance, because their training process usually relies on the examples generated from a single known adversarial attack and there exists a large discrepancy between the training and unseen… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  41. arXiv:2404.10494  [pdf, other

    cs.HC cs.LG

    BDAN: Mitigating Temporal Difference Across Electrodes in Cross-Subject Motor Imagery Classification via Generative Bridging Domain

    Authors: Zhige Chen, Rui Yang, Mengjie Huang, Chengxuan Qin, Zidong Wang

    Abstract: Because of "the non-repeatability of the experiment settings and conditions" and "the variability of brain patterns among subjects", the data distributions across sessions and electrodes are different in cross-subject motor imagery (MI) studies, eventually reducing the performance of the classification model. Systematically summarised based on the existing studies, a novel temporal-electrode data… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  42. arXiv:2404.10207  [pdf, other

    stat.ML cs.LG

    HELLINGER-UCB: A novel algorithm for stochastic multi-armed bandit problem and cold start problem in recommender system

    Authors: Ruibo Yang, Jiazhou Wang, Andrew Mullhaupt

    Abstract: In this paper, we study the stochastic multi-armed bandit problem, where the reward is driven by an unknown random variable. We propose a new variant of the Upper Confidence Bound (UCB) algorithm called Hellinger-UCB, which leverages the squared Hellinger distance to build the upper confidence bound. We prove that the Hellinger-UCB reaches the theoretical lower bound. We also show that the Helling… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  43. arXiv:2404.09127  [pdf, other

    cs.CL

    Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation

    Authors: Ruixin Yang, Dheeraj Rajagopal, Shirley Anugrah Hayati, Bin Hu, Dongyeop Kang

    Abstract: Uncertainty estimation is a significant issue for current large language models (LLMs) that are generally poorly calibrated and over-confident, especially with reinforcement learning from human feedback (RLHF). Unlike humans, whose decisions and confidences not only stem from intrinsic beliefs but can also be adjusted through daily observations, existing calibration methods for LLMs focus on estim… ▽ More

    Submitted 10 May, 2024; v1 submitted 13 April, 2024; originally announced April 2024.

    Comments: Accepted at ICLR 2024 Workshop on Reliable and Responsible Foundation Models

  44. arXiv:2404.07364  [pdf

    cs.HC

    Fabricating Paper Circuits with Subtractive Processing

    Authors: Ruhan Yang, Krithik Ranjan, Ellen Yi-Luen Do

    Abstract: This paper introduces a new method of paper circuit fabrication that overcomes design barriers and increases flexibility in circuit design. Conventional circuit boards rely on thin traces, which limits the complexity and accuracy when applied to paper circuits. To address this issue, we propose a method that uses large conductive zones in paper circuits and performs subtractive processing during t… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: ACM CHI 2023 Workshop 02: Beyond Prototy** Boards: Future Paradigms for Electronics Toolkits

  45. arXiv:2404.07360  [pdf

    cs.HC

    Enhancing Accessibility in Soft Robotics: Exploring Magnet-Embedded Paper-Based Interactions

    Authors: Ruhan Yang, Ellen Yi-Luen Do

    Abstract: This paper explores the implementation of embedded magnets to enhance paper-based interactions. The integration of magnets in paper-based interactions simplifies the fabrication process, making it more accessible for building soft robotics systems. We discuss various interaction patterns achievable through this approach and highlight their potential applications.

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: ACM DIS 2023 Workshop 02: Soft Robotics and Programmable Materials for Human-Computer Interaction

  46. arXiv:2404.07229  [pdf, other

    cs.CL cs.AI

    Personality-affected Emotion Generation in Dialog Systems

    Authors: Zhiyuan Wen, Jiannong Cao, Jiaxing Shen, Ruosong Yang, Shuaiqi Liu, Maosong Sun

    Abstract: Generating appropriate emotions for responses is essential for dialog systems to provide human-like interaction in various application scenarios. Most previous dialog systems tried to achieve this goal by learning empathetic manners from anonymous conversational data. However, emotional responses generated by those methods may be inconsistent, which will decrease user engagement and service qualit… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted by ACM Transactions on Information Systems

  47. arXiv:2404.06737  [pdf, other

    cs.LG cs.CR

    Disguised Copyright Infringement of Latent Diffusion Models

    Authors: Yiwei Lu, Matthew Y. R. Yang, Zuoqiu Liu, Gautam Kamath, Yaoliang Yu

    Abstract: Copyright infringement may occur when a generative model produces samples substantially similar to some copyrighted data that it had access to during the training phase. The notion of access usually refers to including copyrighted samples directly in the training dataset, which one may inspect to identify an infringement. We argue that such visual auditing largely overlooks a concealed copyright i… ▽ More

    Submitted 3 June, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: Accepted to ICML 2024

  48. arXiv:2404.02589  [pdf, other

    cs.CL cs.AI

    Affective-NLI: Towards Accurate and Interpretable Personality Recognition in Conversation

    Authors: Zhiyuan Wen, Jiannong Cao, Yu Yang, Ruosong Yang, Shuaiqi Liu

    Abstract: Personality Recognition in Conversation (PRC) aims to identify the personality traits of speakers through textual dialogue content. It is essential for providing personalized services in various applications of Human-Computer Interaction (HCI), such as AI-based mental therapy and companion robots for the elderly. Most recent studies analyze the dialog content for personality classification yet ove… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted by IEEE PerCom 2024

  49. arXiv:2404.00876  [pdf, other

    cs.CV

    MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction

    Authors: Xiaolu Liu, Song Wang, Wentong Li, Ruizi Yang, Junbo Chen, Jianke Zhu

    Abstract: Currently, high-definition (HD) map construction leans towards a lightweight online generation tendency, which aims to preserve timely and reliable road scene information. However, map elements contain strong shape priors. Subtle and sparse annotations make current detection-based frameworks ambiguous in locating relevant feature scopes and cause the loss of detailed structures in prediction. To a… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: 18 pages, 11 figures, accepted by CVPR 2024

  50. arXiv:2403.19893  [pdf, other

    cs.CV

    PLoc: A New Evaluation Criterion Based on Physical Location for Autonomous Driving Datasets

    Authors: Ruining Yang, Yuqi Peng

    Abstract: Autonomous driving has garnered significant attention as a key research area within artificial intelligence. In the context of autonomous driving scenarios, the varying physical locations of objects correspond to different levels of danger. However, conventional evaluation criteria for automatic driving object detection often overlook the crucial aspect of an object's physical location, leading to… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.