Showing 1–50 of 221 results for author: Zhu, R

  1. arXiv:2405.07664  [pdf]

    cs.AI

    Geospatial Knowledge Graphs

    Authors: Rui Zhu

    Abstract: Geospatial knowledge graphs have emerged as a novel paradigm for representing and reasoning over geospatial information. In this framework, entities such as places, people, events, and observations are depicted as nodes, while their relationships are represented as edges. This graph-based data format lays the foundation for creating a "FAIR" (Findable, Accessible, Interoperable, and Reusable) envi…

    Submitted 13 May, 2024; originally announced May 2024.

  2. arXiv:2405.06510  [pdf, other]

    cs.AI

    UniDM: A Unified Framework for Data Manipulation with Large Language Models

    Authors: Yichen Qian, Yongyi He, Rong Zhu, Jintao Huang, Zhijian Ma, Haibin Wang, Yaohua Wang, Xiuyu Sun, Defu Lian, Bolin Ding, Jingren Zhou

    Abstract: Designing effective data manipulation methods is a long-standing problem in data lakes. Traditional methods, which rely on rules or machine learning models, require extensive human effort on training data collection and model tuning. Recent methods apply Large Language Models (LLMs) to resolve multiple data manipulation tasks. They exhibit bright benefits in terms of performance but still requir…

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: MLSys24

  3. arXiv:2405.04435  [pdf, other]

    cs.CL cs.DS

    Fast Exact Retrieval for Nearest-neighbor Lookup (FERN)

    Authors: Richard Zhu

    Abstract: Exact nearest neighbor search is a computationally intensive process, and even its simpler sibling -- vector retrieval -- can be computationally complex. This is exacerbated when retrieving vectors which have high dimension $d$ relative to the number of vectors, $N$, in the database. Exact nearest neighbor retrieval has been generally acknowledged to be an $O(Nd)$ problem with no sub-linear solutio…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: NAACL 2024 SRW

  4. arXiv:2404.16205  [pdf, other]

    cs.CV cs.MM

    AIS 2024 Challenge on Video Quality Assessment of User-Generated Content: Methods and Results

    Authors: Marcos V. Conde, Saman Zadtootaghaj, Nabajeet Barman, Radu Timofte, Chenlong He, Qi Zheng, Ruoxi Zhu, Zhengzhong Tu, Haiqiang Wang, Xiangguang Chen, Wenhui Meng, Xiang Pan, Huiying Shi, Han Zhu, Xiaozhong Xu, Lei Sun, Zhenzhong Chen, Shan Liu, Zicheng Zhang, Haoning Wu, Yingjie Zhou, Chunyi Li, Xiaohong Liu, Weisi Lin, Guangtao Zhai, et al. (11 additional authors not shown)

    Abstract: This paper reviews the AIS 2024 Video Quality Assessment (VQA) Challenge, focused on User-Generated Content (UGC). The aim of this challenge is to gather deep learning-based methods capable of estimating the perceptual quality of UGC videos. The user-generated videos from the YouTube UGC Dataset include diverse content (sports, games, lyrics, anime, etc.), quality and resolutions. The proposed met…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 Workshop -- AI for Streaming (AIS) Video Quality Assessment Challenge

  5. arXiv:2404.16064  [pdf]

    cs.HC cs.LG cs.LO

    Transparent AI: Developing an Explainable Interface for Predicting Postoperative Complications

    Authors: Yuanfang Ren, Chirayu Tripathi, Ziyuan Guan, Ruilin Zhu, Victoria Hougha, Yingbo Ma, Zhenhong Hu, Jeremy Balch, Tyler J. Loftus, Parisa Rashidi, Benjamin Shickel, Tezcan Ozrazgat-Baslanti, Azra Bihorac

    Abstract: Given the sheer volume of surgical procedures and the significant rate of postoperative fatalities, assessing and managing surgical complications has become a critical public health concern. Existing artificial intelligence (AI) tools for risk surveillance and diagnosis often lack adequate interpretability, fairness, and reproducibility. To address this, we proposed an Explainable AI (XAI) framewo…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 32 pages, 7 figures, 4 supplement figures and 1 supplement table

  6. arXiv:2404.07471  [pdf, other]

    cs.SE cs.AI cs.CL

    Structure-aware Fine-tuning for Code Pre-trained Models

    Authors: Jiayi Wu, Renyu Zhu, Nuo Chen, Qiushi Sun, Xiang Li, Ming Gao

    Abstract: Over the past few years, we have witnessed remarkable advancements in Code Pre-trained Models (CodePTMs). These models achieved excellent representation capabilities by designing structure-based pre-training tasks for code. However, how to enhance the absorption of structural knowledge when fine-tuning CodePTMs still remains a significant challenge. To fill this gap, in this paper, we present Stru…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Accepted by COLING 2024

  7. arXiv:2404.06892  [pdf, other]

    cs.CV

    SparseAD: Sparse Query-Centric Paradigm for Efficient End-to-End Autonomous Driving

    Authors: Diankun Zhang, Guoan Wang, Runwen Zhu, Jianbo Zhao, Xiwu Chen, Siyu Zhang, Jiahao Gong, Qibin Zhou, Wenyuan Zhang, Ningzi Wang, Feiyang Tan, Hangning Zhou, Ziyao Xu, Haotian Yao, Chi Zhang, Xiaojun Liu, Xiaoguang Di, Bin Li

    Abstract: End-to-End paradigms use a unified framework to implement multiple tasks in an autonomous driving system. Despite simplicity and clarity, the performance of end-to-end autonomous driving methods on sub-tasks is still far behind the single-task methods. Meanwhile, the widely used dense BEV features in previous end-to-end methods make it costly to extend to more modalities or tasks. In this paper, we p…

    Submitted 10 April, 2024; originally announced April 2024.

  8. arXiv:2404.05892  [pdf, other]

    cs.CL cs.AI

    Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

    Authors: Bo Peng, Daniel Goldstein, Quentin Anthony, Alon Albalak, Eric Alcaide, Stella Biderman, Eugene Cheah, Xingjian Du, Teddy Ferdinan, Haowen Hou, Przemysław Kazienko, Kranthi Kiran GV, Jan Kocoń, Bartłomiej Koptyra, Satyapriya Krishna, Ronald McClelland Jr., Niklas Muennighoff, Fares Obeid, Atsushi Saito, Guangyu Song, Haoqin Tu, Stanisław Woźniak, Ruichong Zhang, Bingchen Zhao, Qihang Zhao, et al. (3 additional authors not shown)

    Abstract: We present Eagle (RWKV-5) and Finch (RWKV-6), sequence models improving upon the RWKV (RWKV-4) architecture. Our architectural design advancements include multi-headed matrix-valued states and a dynamic recurrence mechanism that improve expressivity while maintaining the inference efficiency characteristics of RNNs. We introduce a new multilingual corpus with 1.12 trillion tokens and a fast tokeni…

    Submitted 10 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  9. arXiv:2404.05014  [pdf, other]

    cs.CV

    MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

    Authors: Shenghai Yuan, Jinfa Huang, Yujun Shi, Yongqi Xu, Ruijie Zhu, Bin Lin, Xinhua Cheng, Li Yuan, Jiebo Luo

    Abstract: Recent advances in Text-to-Video generation (T2V) have achieved remarkable success in synthesizing high-quality general videos from textual descriptions. A largely overlooked problem in T2V is that existing models have not adequately encoded physical knowledge of the real world, thus generated videos tend to have limited motion and poor variations. In this paper, we propose MagicTime, a m…

    Submitted 7 April, 2024; originally announced April 2024.

  10. arXiv:2404.02476  [pdf, other]

    math.OC cs.AI cs.LG

    Deep Reinforcement Learning for Traveling Purchaser Problems

    Authors: Haofeng Yuan, Rongping Zhu, Wanlu Yang, Shiji Song, Keyou You, Yuli Zhang

    Abstract: The traveling purchaser problem (TPP) is an important combinatorial optimization problem with broad applications. Due to the coupling between routing and purchasing, existing works on TPPs commonly address route construction and purchase planning simultaneously, which, however, leads to exact methods with high computational cost and heuristics with sophisticated design but limited performance. In…

    Submitted 11 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  11. arXiv:2403.17004  [pdf, other]

    cs.CV cs.MM

    SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer

    Authors: Rui Zhu, Yingwei Pan, Yehao Li, Ting Yao, Zhenglong Sun, Tao Mei, Chang Wen Chen

    Abstract: Diffusion Transformer (DiT) has emerged as the new trend of generative diffusion models on image generation. In view of extremely slow convergence in typical DiT, recent breakthroughs have been driven by mask strategy that significantly improves the training efficiency of DiT with additional intra-image contextual learning. Despite this progress, mask strategy still suffers from two inherent limit…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  12. arXiv:2403.15603  [pdf, other]

    cs.CV cs.AI

    Forward Learning for Gradient-based Black-box Saliency Map Generation

    Authors: Zeliang Zhang, Mingqian Feng, Jinyang Jiang, Rongyi Zhu, Yijie Peng, Chenliang Xu

    Abstract: Gradient-based saliency maps are widely used to explain deep neural network decisions. However, as models become deeper and more black-box, such as in closed-source APIs like ChatGPT, computing gradients becomes challenging, hindering conventional explanation methods. In this work, we introduce a novel unified framework for estimating gradients in black-box settings and generating saliency maps to…

    Submitted 22 March, 2024; originally announced March 2024.

  13. arXiv:2403.14734  [pdf, other]

    cs.SE cs.AI cs.CL cs.PL

    A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond

    Authors: Qiushi Sun, Zhirui Chen, Fangzhi Xu, Kanzhi Cheng, Chang Ma, Zhangyue Yin, Jianing Wang, Chengcheng Han, Renyu Zhu, Shuai Yuan, Qipeng Guo, Xipeng Qiu, Pengcheng Yin, Xiaoli Li, Fei Yuan, Lingpeng Kong, Xiang Li, Zhiyong Wu

    Abstract: Neural Code Intelligence -- leveraging deep learning to understand, generate, and optimize code -- holds immense potential for transformative impacts on the whole society. Bridging the gap between Natural Language and Programming Language, this domain has drawn significant attention from researchers in both research communities over the past few years. This survey presents a systematic and chronol…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 64 pages, 6 figures, 10 tables, 688 references

  14. arXiv:2403.14638  [pdf, other]

    cs.CY cs.LG

    Personalized Programming Guidance based on Deep Programming Learning Style Capturing

    Authors: Yingfan Liu, Renyu Zhu, Ming Gao

    Abstract: With the rapid development of big data and AI technology, programming is in high demand and has become an essential skill for students. Meanwhile, researchers also focus on boosting the online judging system's guidance ability to reduce students' dropout rates. Previous studies mainly targeted at enhancing learner engagement on online platforms by providing personalized recommendations. However, t…

    Submitted 20 February, 2024; originally announced March 2024.

    Comments: 18th International Conference on Computer Science & Education

  15. arXiv:2403.11436  [pdf, ps, other]

    cs.IT

    Deep Holes of Twisted Reed-Solomon Codes

    Authors: Weijun Fang, Jingke Xu, Ruiqi Zhu

    Abstract: The deep holes of a linear code are the vectors that achieve the maximum error distance to the code. There has been extensive research on the topic of deep holes in Reed-Solomon codes. As a generalization of Reed-Solomon codes, we investigate the problem of deep holes of a class of twisted Reed-Solomon codes in this paper. The covering radius and a standard class of deep holes of twisted Reed-Solo…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  16. arXiv:2403.08826  [pdf, other]

    cs.HC cs.LG

    A Dataset for the Validation of Truth Inference Algorithms Suitable for Online Deployment

    Authors: Fei Wang, Haoyu Liu, Haoyang Bi, Xiangzhuang Shen, Renyu Zhu, Runze Wu, Minmin Lin, Tangjie Lv, Changjie Fan, Qi Liu, Zhenya Huang, Enhong Chen

    Abstract: For the purpose of efficient and cost-effective large-scale data labeling, crowdsourcing is increasingly being utilized. To guarantee the quality of data labeling, multiple annotations need to be collected for each data sample, and truth inference algorithms have been developed to accurately infer the true labels. Despite previous studies having released public datasets to evaluate the efficacy of…

    Submitted 10 March, 2024; originally announced March 2024.

  17. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Machel Reid, Nikolay Savinov, Denis Teplyashin, Dmitry Lepikhin, Timothy Lillicrap, Jean-baptiste Alayrac, Radu Soricut, Angeliki Lazaridou, Orhan Firat, Julian Schrittwieser, Ioannis Antonoglou, Rohan Anil, Sebastian Borgeaud, Andrew Dai, Katie Millican, Ethan Dyer, Mia Glaese, Thibault Sottiaux, Benjamin Lee, Fabio Viola, Malcolm Reynolds, Yuanzhong Xu, James Molloy, et al. (683 additional authors not shown)

    Abstract: In this report, we present the latest model of the Gemini family, Gemini 1.5 Pro, a highly compute-efficient multimodal mixture-of-experts model capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. Gemini 1.5 Pro achieves near-perfect recall on long-context retrieval tasks across modalit…

    Submitted 25 April, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  18. arXiv:2403.02571  [pdf, other]

    cs.LG cs.CV

    DPAdapter: Improving Differentially Private Deep Learning through Noise Tolerance Pre-training

    Authors: Zihao Wang, Rui Zhu, Dongruo Zhou, Zhikun Zhang, John Mitchell, Haixu Tang, XiaoFeng Wang

    Abstract: Recent developments have underscored the critical role of differential privacy (DP) in safeguarding individual data for training machine learning models. However, integrating DP oftentimes incurs significant model performance degradation due to the perturbation introduced into the training process, presenting a formidable challenge in the differentially private machine learning (DPML) f…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: To appear in the 33rd USENIX Security Symposium, August 2024, Philadelphia Marriott Downtown, PA, USA

  19. arXiv:2403.02018  [pdf, other]

    cs.RO cs.AI

    Cross Domain Policy Transfer with Effect Cycle-Consistency

    Authors: Ruiqi Zhu, Tianhong Dai, Oya Celiktutan

    Abstract: Training a robotic policy from scratch using deep reinforcement learning methods can be prohibitively expensive due to sample inefficiency. To address this challenge, transferring policies trained in the source domain to the target domain becomes an attractive paradigm. Previous research has typically focused on domains with similar state and action spaces but differing in other aspects. In this p…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted to International Conference on Robotics and Automation (ICRA), 2024

  20. arXiv:2402.18742  [pdf, other]

    cs.CV

    Comparing Importance Sampling Based Methods for Mitigating the Effect of Class Imbalance

    Authors: Indu Panigrahi, Richard Zhu

    Abstract: Most state-of-the-art computer vision models heavily depend on data. However, many datasets exhibit extreme class imbalance which has been shown to negatively impact model performance. Among the training-time and data-generation solutions that have been explored, one subset that leverages existing data is importance sampling. A good deal of this work focuses primarily on the CIFAR-10 and CIFAR-100…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: Preprint

  21. arXiv:2402.01509  [pdf, other]

    eess.IV cs.CV cs.LG

    Advancing Brain Tumor Inpainting with Generative Models

    Authors: Ruizhi Zhu, Xinru Zhang, Haowen Pang, Chundan Xu, Chuyang Ye

    Abstract: Synthesizing healthy brain scans from diseased brain scans offers a potential solution to address the limitations of general-purpose algorithms, such as tissue segmentation and brain extraction algorithms, which may not effectively handle diseased images. We consider this a 3D inpainting task and investigate the adaptation of 2D inpainting methods to meet the requirements of 3D magnetic resonance…

    Submitted 2 February, 2024; originally announced February 2024.

  22. arXiv:2401.08734  [pdf, other]

    cs.CV cs.LG

    Bag of Tricks to Boost Adversarial Transferability

    Authors: Zeliang Zhang, Rongyi Zhu, Wei Yao, Xiaosen Wang, Chenliang Xu

    Abstract: Deep neural networks are widely known to be vulnerable to adversarial examples. However, vanilla adversarial examples generated under the white-box setting often exhibit low transferability across different models. Since adversarial transferability poses more severe threats to practical applications, various approaches have been proposed for better transferability, including gradient-based, input…

    Submitted 16 January, 2024; originally announced January 2024.

  23. ModuleGuard: Understanding and Detecting Module Conflicts in Python Ecosystem

    Authors: Ruofan Zhu, Xingyu Wang, Chengwei Liu, Zhengzi Xu, Wenbo Shen, Rui Chang, Yang Liu

    Abstract: Python has become one of the most popular programming languages for software development due to its simplicity, readability, and versatility. As the Python ecosystem grows, developers face increasing challenges in avoiding module conflicts, which occur when different packages have the same namespace modules. Unfortunately, existing work has neither investigated the module conflict comprehensively…

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: The paper was accepted by ICSE24

    MSC Class: 65-04 ACM Class: D.2; K.6.3

  24. arXiv:2312.17493  [pdf, other]

    cs.LG cs.CR

    Differentially Private Low-Rank Adaptation of Large Language Model Using Federated Learning

    Authors: Xiao-Yang Liu, Rongyi Zhu, Daochen Zha, Jiechao Gao, Shan Zhong, Meikang Qiu

    Abstract: The surge in interest and application of large language models (LLMs) has sparked a drive to fine-tune these models to suit specific applications, such as finance and medical science. However, concerns regarding data privacy have emerged, especially when multiple stakeholders aim to collaboratively enhance LLMs using sensitive data. In this scenario, federated learning becomes a natural choice, al…

    Submitted 29 December, 2023; originally announced December 2023.

    Comments: 20 pages, 1 figure, 22 tables

  25. arXiv:2312.17432  [pdf, other]

    cs.CV cs.CL

    Video Understanding with Large Language Models: A Survey

    Authors: Yunlong Tang, Jing Bi, Siting Xu, Luchuan Song, Susan Liang, Teng Wang, Daoan Zhang, Jie An, Jingyang Lin, Rongyi Zhu, Ali Vosoughi, Chao Huang, Zeliang Zhang, Feng Zheng, Jianguo Zhang, Ping Luo, Jiebo Luo, Chenliang Xu

    Abstract: With the burgeoning growth of online video platforms and the escalating volume of video content, the demand for proficient video understanding tools has intensified markedly. Given the remarkable capabilities of Large Language Models (LLMs) in language and multimodal tasks, this survey provides a detailed overview of the recent advancements in video understanding harnessing the power of LLMs (Vid-…

    Submitted 3 January, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

  26. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1320 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 2 April, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  27. arXiv:2312.09527  [pdf, other]

    cs.CV cs.GR

    TIFace: Improving Facial Reconstruction through Tensorial Radiance Fields and Implicit Surfaces

    Authors: Ruijie Zhu, Jiahao Chang, Ziyang Song, Jiahuan Yu, Tianzhu Zhang

    Abstract: This report describes the solution that secured the first place in the "View Synthesis Challenge for Human Heads (VSCHH)" at the ICCV 2023 workshop. Given the sparse view images of human heads, the objective of this challenge is to synthesize images from novel viewpoints. Due to the complexity of textures on the face and the impact of lighting, the baseline method TensoRF yields results with signi…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 1st place solution in the View Synthesis Challenge for Human Heads (VSCHH) at the ICCV 2023 workshop

  28. arXiv:2312.06397  [pdf, other]

    cs.DB cs.IR

    MUST: An Effective and Scalable Framework for Multimodal Search of Target Modality

    Authors: Mengzhao Wang, Xiangyu Ke, Xiaoliang Xu, Lu Chen, Yunjun Gao, Pinpin Huang, Runkai Zhu

    Abstract: We investigate the problem of multimodal search of target modality, where the task involves enhancing a query in a specific target modality by integrating information from auxiliary modalities. The goal is to retrieve relevant objects whose contents in the target modality match the specified multimodal query. The paper first introduces two baseline approaches that integrate techniques from the Dat…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: This paper has been accepted by ICDE 2024

  29. arXiv:2312.01213  [pdf, other]

    cs.AR cs.AI

    Recent Advances in Scalable Energy-Efficient and Trustworthy Spiking Neural networks: from Algorithms to Technology

    Authors: Souvik Kundu, Rui-Jie Zhu, Akhilesh Jaiswal, Peter A. Beerel

    Abstract: Neuromorphic computing and, in particular, spiking neural networks (SNNs) have become an attractive alternative to deep neural networks for a broad range of signal processing applications, processing static and/or temporal inputs from different sensory modalities, including audio and vision sensors. In this paper, we start with a description of recent advances in algorithmic and optimization innov…

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: 5 pages, 3 figures

  30. arXiv:2311.18725  [pdf, other]

    stat.ME cs.LG stat.AP stat.ML

    AI in Pharma for Personalized Sequential Decision-Making: Methods, Applications and Opportunities

    Authors: Yuhan Li, Hongtao Zhang, Keaven Anderson, Songzi Li, Ruoqing Zhu

    Abstract: In the pharmaceutical industry, the use of artificial intelligence (AI) has seen consistent growth over the past decade. This rise is attributed to major advancements in statistical machine learning methodologies, computational capabilities and the increased availability of large datasets. AI techniques are applied throughout different stages of drug development, ranging from drug discovery to pos…

    Submitted 30 November, 2023; originally announced November 2023.

  31. arXiv:2311.18251  [pdf, other]

    cs.HC

    Can Large Language Models Be Good Companions? An LLM-Based Eyewear System with Conversational Common Ground

    Authors: Zhenyu Xu, Hailin Xu, Zhouyang Lu, Yingying Zhao, Rui Zhu, Yujiang Wang, Mingzhi Dong, Yuhu Chang, Qin Lv, Robert P. Dick, Fan Yang, Tun Lu, Ning Gu, Li Shang

    Abstract: Developing chatbots as personal companions has long been a goal of artificial intelligence researchers. Recent advances in Large Language Models (LLMs) have delivered a practical solution for endowing chatbots with anthropomorphic language capabilities. However, it takes more than LLMs to enable chatbots that can act as companions. Humans use their understanding of individual personalities to driv…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: 36 pages, 25 figures, Under review at ACM IMWUT

  32. arXiv:2311.14709  [pdf, other]

    cs.HC cs.LG

    Towards Long-term Annotators: A Supervised Label Aggregation Baseline

    Authors: Haoyu Liu, Fei Wang, Minmin Lin, Runze Wu, Renyu Zhu, Shiwei Zhao, Kai Wang, Tangjie Lv, Changjie Fan

    Abstract: Relying on crowdsourced workers, data crowdsourcing platforms are able to efficiently provide vast amounts of labeled data. Due to the variability in the annotation quality of crowd workers, modern techniques resort to redundant annotations and subsequent label aggregation to infer true labels. However, these methods require model updating during the inference, posing challenges in real-world impl…

    Submitted 15 November, 2023; originally announced November 2023.

  33. arXiv:2311.09919  [pdf, other]

    cs.CV cs.AI

    DSR-Diff: Depth Map Super-Resolution with Diffusion Model

    Authors: Yuan Shi, Bin Xia, Rui Zhu, Qingmin Liao, Wenming Yang

    Abstract: Color-guided depth map super-resolution (CDSR) improves the spatial resolution of a low-quality depth map with the corresponding high-quality color map, benefiting various applications such as 3D reconstruction, virtual reality, and augmented reality. While conventional CDSR methods typically rely on convolutional neural networks or transformers, diffusion models (DMs) have demonstrated notable eff…

    Submitted 16 November, 2023; originally announced November 2023.

  34. arXiv:2311.06570  [pdf, other]

    cs.CV

    OR Residual Connection Achieving Comparable Accuracy to ADD Residual Connection in Deep Residual Spiking Neural Networks

    Authors: Yimeng Shan, Xuerui Qiu, Rui-jie Zhu, Ruike Li, Meng Wang, Haicheng Qu

    Abstract: Spiking Neural Networks (SNNs) have garnered substantial attention in brain-like computing for their biological fidelity and the capacity to execute energy-efficient spike-driven operations. As the demand for heightened performance in SNNs surges, the trend towards training deeper networks becomes imperative, while residual learning stands as a pivotal method for training deep neural networks. In…

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: 16 pages, 8 figures and 11 tables

  35. arXiv:2310.20427  [pdf, other]

    eess.IV cs.CV cs.LG

    Assessing and Enhancing Robustness of Deep Learning Models with Corruption Emulation in Digital Pathology

    Authors: Peixiang Huang, Songtao Zhang, Yulu Gan, Rui Xu, Rongqi Zhu, Wenkang Qin, Limei Guo, Shan Jiang, Lin Luo

    Abstract: Deep learning in digital pathology brings intelligence and automation as substantial enhancements to pathological analysis, the gold standard of clinical diagnosis. However, multiple steps from tissue preparation to slide imaging introduce various image corruptions, making it difficult for deep neural network (DNN) models to achieve stable diagnostic results for clinical use. In order to assess an…

    Submitted 31 October, 2023; originally announced October 2023.

  36. arXiv:2310.19300  [pdf, other]

    stat.ML cs.LG

    Stage-Aware Learning for Dynamic Treatments

    Authors: Hanwen Ye, Wenzhuo Zhou, Ruoqing Zhu, Annie Qu

    Abstract: Recent advances in dynamic treatment regimes (DTRs) provide powerful optimal treatment searching algorithms, which are tailored to individuals' specific needs and able to maximize their expected clinical benefits. However, existing algorithms could suffer from insufficient sample size under optimal treatments, especially for chronic diseases involving long stages of decision-making. To address the…

    Submitted 30 October, 2023; originally announced October 2023.

  37. arXiv:2310.15469  [pdf, other]

    cs.CR cs.CL

    The Janus Interface: How Fine-Tuning in Large Language Models Amplifies the Privacy Risks

    Authors: Xiaoyi Chen, Siyuan Tang, Rui Zhu, Shijun Yan, Lei Jin, Zihao Wang, Liya Su, Zhikun Zhang, XiaoFeng Wang, Haixu Tang

    Abstract: The rapid advancements of large language models (LLMs) have raised public concerns about the privacy leakage of personally identifiable information (PII) within their extensive training datasets. Recent studies have demonstrated that an adversary could extract highly sensitive privacy data from the training data of LLMs with carefully designed prompts. However, these attacks suffer from the model'…

    Submitted 12 May, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  38. arXiv:2310.14751  [pdf, other]

    cs.LG

    Efficient and Interpretable Bandit Algorithms

    Authors: Subhojyoti Mukherjee, Ruihao Zhu, Branislav Kveton

    Abstract: Motivated by the importance of explainability in modern machine learning, we design bandit algorithms that are efficient and interpretable. A bandit algorithm is interpretable if it explores with the objective of reducing uncertainty in the unknown model parameter. To quantify the interpretability, we introduce a novel metric of model error, which compares the rate reduction of the mean reward est…

    Submitted 8 February, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  39. arXiv:2310.14576  [pdf, other]

    cs.LG cs.CV

    Tensor Decomposition Based Attention Module for Spiking Neural Networks

    Authors: Haoyu Deng, Ruijie Zhu, Xuerui Qiu, Yule Duan, Malu Zhang, Liangjian Deng

    Abstract: The attention mechanism has been proven to be an effective way to improve spiking neural network (SNN). However, based on the fact that the current SNN input data flow is split into tensors to process on GPUs, none of the previous works consider the properties of tensors to implement an attention module. This inspires us to rethink current SNN from the perspective of tensor-relevant theories. Usin…

    Submitted 10 April, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted by Knowledge-Based Systems

  40. arXiv:2310.08044  [pdf, other]

    cs.CV

    EC-Depth: Exploring the consistency of self-supervised monocular depth estimation in challenging scenes

    Authors: Ziyang Song, Ruijie Zhu, Chuxin Wang, Jiacheng Deng, Jianfeng He, Tianzhu Zhang

    Abstract: Self-supervised monocular depth estimation holds significant importance in the fields of autonomous driving and robotics. However, existing methods are typically trained and tested on standard datasets, overlooking the impact of various adverse conditions prevalent in real-world applications, such as rainy days. As a result, it is commonly observed that these methods struggle to handle these chall…

    Submitted 18 March, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Project page: https://ruijiezhu94.github.io/ECDepth_page

  41. arXiv:2310.07253  [pdf]

    cs.LG cs.AI q-bio.QM

    ADMEOOD: Out-of-Distribution Benchmark for Drug Property Prediction

    Authors: Shuoying Wei, Xinlong Wen, Lida Zhu, Songquan Li, Rongbo Zhu

    Abstract: Obtaining accurate and valid information for drug molecules is a crucial and challenging task. However, chemical knowledge and information have been accumulated over the past 100 years from various regions, laboratories, and experimental purposes. Little has been explored in terms of the out-of-distribution (OOD) problem with noise and inconsistency, which may lead to weak robustness and unsatisfi…

    Submitted 11 October, 2023; originally announced October 2023.

  42. ALECE: An Attention-based Learned Cardinality Estimator for SPJ Queries on Dynamic Workloads (Extended)

    Authors: Pengfei Li, Wenqing Wei, Rong Zhu, Bolin Ding, Jingren Zhou, Hua Lu

    Abstract: For efficient query processing, DBMS query optimizers have for decades relied on delicate cardinality estimation methods. In this work, we propose an Attention-based LEarned Cardinality Estimator (ALECE for short) for SPJ queries. The core idea is to discover the implicit relationships between queries and underlying dynamic data using attention mechanisms in ALECE's two modules that are built on t…

    Submitted 23 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: VLDB 2024

    Journal ref: PVLDB, 17(2): 197 - 210, 2023

  43. arXiv:2309.15237  [pdf]

    cs.CY cs.AI cs.ET cs.HC

    User Experience Design Professionals' Perceptions of Generative Artificial Intelligence

    Authors: Jie Li, Hancheng Cao, Laura Lin, Youyang Hou, Ruihao Zhu, Abdallah El Ali

    Abstract: Among creative professionals, Generative Artificial Intelligence (GenAI) has sparked excitement over its capabilities and fear over unanticipated consequences. How does GenAI impact User Experience Design (UXD) practice, and are fears warranted? We interviewed 20 UX Designers, with diverse experience and across companies (startups to large enterprises). We probed them to characterize their practic…

    Submitted 16 February, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: accepted to CHI 2024

  44. arXiv:2309.13278  [pdf, other]

    stat.ML cs.LG

    Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework

    Authors: Wenzhuo Zhou, Yuhan Li, Ruoqing Zhu, Annie Qu

    Abstract: We study high-confidence off-policy evaluation in the context of infinite-horizon Markov decision processes, where the objective is to establish a confidence interval (CI) for the target policy value using only offline data pre-collected from unknown behavior policies. This task faces two primary challenges: providing a comprehensive and rigorous error quantification in CI estimation, and addressi…

    Submitted 1 October, 2023; v1 submitted 23 September, 2023; originally announced September 2023.

  45. arXiv:2309.12955  [pdf, other]

    cs.CR cs.CV

    On Data Fabrication in Collaborative Vehicular Perception: Attacks and Countermeasures

    Authors: Qingzhao Zhang, Shuowei Jin, Ruiyang Zhu, Jiachen Sun, Xumiao Zhang, Qi Alfred Chen, Z. Morley Mao

    Abstract: Collaborative perception, which greatly enhances the sensing capability of connected and autonomous vehicles (CAVs) by incorporating data from external resources, also brings forth potential security risks. CAVs' driving decisions rely on remote untrusted data, making them susceptible to attacks carried out by malicious participants in the collaborative perception system. However, security analysi…

    Submitted 3 October, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: 18 pages, 24 figures, accepted by Usenix Security 2024

  46. arXiv:2309.12295  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    Learning to Drive Anywhere

    Authors: Ruizhao Zhu, Peng Huang, Eshed Ohn-Bar, Venkatesh Saligrama

    Abstract: Human drivers can seamlessly adapt their driving decisions across geographical locations with diverse conditions and rules of the road, e.g., left vs. right-hand traffic. In contrast, existing models for autonomous driving have been thus far only deployed within restricted operational domains, i.e., without accounting for varying driving behaviors across locations or model scalability. In this wor…

    Submitted 25 September, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: Conference on Robot Learning (CoRL) 2023

  47. arXiv:2309.03214  [pdf, other]

    cs.AR cs.SE

    Test Primitive:A Straightforward Method To Decouple March

    Authors: Yindong Xiao, Shanshan Lu, Ensheng Wang, Ruiqi Zhu, Zhijian Dai

    Abstract: The academic community has made outstanding achievements in researching the March algorithm. However, the current fault modeling method, which centers on fault primitives, cannot be directly applied to analyzing the March algorithm. This paper proposes a new test primitive. The test primitives, which decouple the cell states from sensitization and detection operations, describe the common features…

    Submitted 29 August, 2023; originally announced September 2023.

  48. arXiv:2309.02190  [pdf, other]

    cs.CV cs.AI

    Exchanging-based Multimodal Fusion with Transformer

    Authors: Renyu Zhu, Chengcheng Han, Yong Qian, Qiushi Sun, Xiang Li, Ming Gao, Xuezhi Cao, Yunsen Xian

    Abstract: We study the problem of multimodal fusion in this paper. Recent exchanging-based methods have been proposed for vision-vision fusion, which aim to exchange embeddings learned from one modality to the other. However, most of them project inputs of multimodalities into different low-dimensional spaces and cannot be applied to the sequential input data. To solve these issues, in this paper, we propos…

    Submitted 5 September, 2023; originally announced September 2023.

  49. arXiv:2309.01071  [pdf, other]

    cs.CL

    Business Process Text Sketch Automation Generation Using Large Language Model

    Authors: Rui Zhu, Quanzhou Hu, Wenxin Li, Honghao Xiao, Chaogang Wang, Zixin Zhou

    Abstract: Business Process Management (BPM) is gaining increasing attention as it has the potential to cut costs while boosting output and quality. Business process document generation is a crucial stage in BPM. However, due to a shortage of datasets, data-driven deep learning techniques struggle to deliver the expected results. We propose an approach to transform Conditional Process Trees (CPTs) into Busin…

    Submitted 3 September, 2023; originally announced September 2023.

    Comments: 10 pages, 7 figures

  50. arXiv:2308.12315  [pdf, other]

    cs.LG cs.AI

    Trustworthy Representation Learning Across Domains

    Authors: Ronghang Zhu, Dongliang Guo, Daiqing Qi, Zhixuan Chu, Xiang Yu, Sheng Li

    Abstract: As AI systems have obtained significant performance to be deployed widely in our daily live and human society, people both enjoy the benefits brought by these technologies and suffer many social issues induced by these systems. To make AI systems good enough and trustworthy, plenty of researches have been done to build guidelines for trustworthy AI systems. Machine learning is one of the most impo…

    Submitted 29 August, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: 38 pages, 15 figures

    ACM Class: A.1