Skip to main content

Showing 1–50 of 321 results for author: Dai, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18914  [pdf, other

    eess.SY cs.RO

    Verification and Synthesis of Compatible Control Lyapunov and Control Barrier Functions

    Authors: Hongkai Dai, Chuanrui Jiang, Hongchao Zhang, Andrew Clark

    Abstract: Safety and stability are essential properties of control systems. Control Barrier Functions (CBFs) and Control Lyapunov Functions (CLFs) have been proposed to ensure safety and stability respectively. However, previous approaches typically verify and synthesize the CBFs and CLFs separately, satisfying their respective constraints, without proving that the CBFs and CLFs are compatible with each oth… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.13094  [pdf, other

    cs.CL cs.AI cs.LG

    Exploring and Benchmarking the Planning Capabilities of Large Language Models

    Authors: Bernd Bohnet, Azade Nova, Aaron T Parisi, Kevin Swersky, Katayoon Goshvadi, Hanjun Dai, Dale Schuurmans, Noah Fiedel, Hanie Sedghi

    Abstract: We seek to elevate the planning capabilities of Large Language Models (LLMs)investigating four main directions. First, we construct a comprehensive benchmark suite encompassing both classical planning domains and natural language scenarios. This suite includes algorithms to generate instances with varying levels of difficulty, allowing for rigorous and systematic evaluation of LLM performance. Sec… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2406.02135  [pdf, other

    cs.IR cs.CL

    Robust Interaction-based Relevance Modeling for Online E-Commerce and LLM-based Retrieval

    Authors: Ben Chen, Huangyu Dai, Xiang Ma, Wen Jiang, Wei Ning

    Abstract: Semantic relevance calculation is crucial for e-commerce search engines, as it ensures that the items selected closely align with customer intent. Inadequate attention to this aspect can detrimentally affect user experience and engagement. Traditional text-matching techniques are prevalent but often fail to capture the nuances of search intent accurately, so neural networks now have become a prefe… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted by ECML-PKDD'24 as Outstanding Paper. 8 pages, 2 figures, 7 tables

  4. arXiv:2406.02066  [pdf, other

    cs.LG q-bio.BM

    Preference Optimization for Molecule Synthesis with Conditional Residual Energy-based Models

    Authors: Songtao Liu, Hanjun Dai, Yue Zhao, Peng Liu

    Abstract: Molecule synthesis through machine learning is one of the fundamental problems in drug discovery. Current data-driven strategies employ one-step retrosynthesis models and search algorithms to predict synthetic routes in a top-bottom manner. Despite their effective performance, these strategies face limitations in the molecule synthetic route generation due to a greedy selection of the next molecul… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted by ICML 2024(Oral)

  5. arXiv:2405.19320  [pdf, other

    cs.LG cs.AI stat.ML

    Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF

    Authors: Shicong Cen, **cheng Mei, Katayoon Goshvadi, Hanjun Dai, Tong Yang, Sherry Yang, Dale Schuurmans, Yuejie Chi, Bo Dai

    Abstract: Reinforcement learning from human feedback (RLHF) has demonstrated great promise in aligning large language models (LLMs) with human preference. Depending on the availability of preference data, both online and offline RLHF are active areas of investigation. A key bottleneck is understanding how to incorporate uncertainty estimation in the reward function learned from the preference data for RLHF,… ▽ More

    Submitted 4 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  6. arXiv:2405.15908  [pdf, other

    cs.AI cs.CR cs.LG

    Knowledge-Informed Auto-Penetration Testing Based on Reinforcement Learning with Reward Machine

    Authors: Yuanliang Li, Hanzheng Dai, Jun Yan

    Abstract: Automated penetration testing (AutoPT) based on reinforcement learning (RL) has proven its ability to improve the efficiency of vulnerability identification in information systems. However, RL-based PT encounters several challenges, including poor sampling efficiency, intricate reward specification, and limited interpretability. To address these issues, we propose a knowledge-informed AutoPT frame… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  7. arXiv:2405.14030  [pdf, other

    cs.CV cs.CL

    Refining Skewed Perceptions in Vision-Language Models through Visual Representations

    Authors: Haocheng Dai, Sarang Joshi

    Abstract: Large vision-language models (VLMs), such as CLIP, have become foundational, demonstrating remarkable success across a variety of downstream tasks. Despite their advantages, these models, akin to other foundational systems, inherit biases from the disproportionate distribution of real-world data, leading to misconceptions about the actual environment. Prevalent datasets like ImageNet are often rid… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 18 pages, 7 figures

  8. arXiv:2405.02520  [pdf, other

    cs.DC

    TurboFFT: A High-Performance Fast Fourier Transform with Fault Tolerance on GPU

    Authors: Shixun Wu, Yujia Zhai, **yang Liu, Jiajun Huang, Zizhe Jian, Huangliang Dai, Sheng Di, Zizhong Chen, Franck Cappello

    Abstract: The Fast Fourier Transform (FFT), as a core computation in a wide range of scientific applications, is increasingly threatened by reliability issues. In this paper, we introduce TurboFFT, a high-performance FFT implementation equipped with a two-sided checksum scheme that detects and corrects silent data corruptions at computing units efficiently. The proposed two-sided checksum addresses the erro… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  9. arXiv:2404.07956  [pdf, other

    cs.LG cs.AI cs.RO eess.SY math.OC

    Lyapunov-stable Neural Control for State and Output Feedback: A Novel Formulation

    Authors: Lujie Yang, Hongkai Dai, Zhouxing Shi, Cho-Jui Hsieh, Russ Tedrake, Huan Zhang

    Abstract: Learning-based neural network (NN) control policies have shown impressive empirical performance in a wide range of tasks in robotics and control. However, formal (Lyapunov) stability guarantees over the region-of-attraction (ROA) for NN controllers with nonlinear dynamical systems are challenging to obtain, and most existing approaches rely on expensive solvers such as sums-of-squares (SOS), mixed… ▽ More

    Submitted 4 June, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: Paper accepted by ICML 2024

  10. arXiv:2404.00898  [pdf, other

    cs.LG

    CAAP: Class-Dependent Automatic Data Augmentation Based On Adaptive Policies For Time Series

    Authors: Tien-Yu Chang, Hao Dai, Vincent S. Tseng

    Abstract: Data Augmentation is a common technique used to enhance the performance of deep learning models by expanding the training dataset. Automatic Data Augmentation (ADA) methods are getting popular because of their capacity to generate policies for various datasets. However, existing ADA methods primarily focused on overall performance improvement, neglecting the problem of class-dependent bias that le… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  11. arXiv:2403.19886  [pdf, other

    cs.RO

    BundledSLAM: An Accurate Visual SLAM System Using Multiple Cameras

    Authors: Han Song, Cong Liu, Huafeng Dai

    Abstract: Multi-camera SLAM systems offer a plethora of advantages, primarily stemming from their capacity to amalgamate information from a broader field of view, thereby resulting in heightened robustness and improved localization accuracy. In this research, we present a significant extension and refinement of the state-of-the-art stereo SLAM system, known as ORB-SLAM2, with the objective of attaining even… ▽ More

    Submitted 1 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  12. arXiv:2403.15500  [pdf, other

    q-bio.QM cs.LG q-bio.MN

    Gene Regulatory Network Inference in the Presence of Dropouts: a Causal View

    Authors: Haoyue Dai, Ignavier Ng, Gongxu Luo, Peter Spirtes, Petar Stojanov, Kun Zhang

    Abstract: Gene regulatory network inference (GRNI) is a challenging problem, particularly owing to the presence of zeros in single-cell RNA sequencing data: some are biological zeros representing no gene expression, while some others are technical zeros arising from the sequencing procedure (aka dropouts), which may bias GRNI by distorting the joint distribution of the measured gene expressions. Existing ap… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Appears at ICLR 2024 (oral)

  13. arXiv:2403.14843  [pdf, other

    cs.LG cs.AI

    Local Causal Discovery with Linear non-Gaussian Cyclic Models

    Authors: Haoyue Dai, Ignavier Ng, Yujia Zheng, Zhengqing Gao, Kun Zhang

    Abstract: Local causal discovery is of great practical significance, as there are often situations where the discovery of the global causal structure is unnecessary, and the interest lies solely on a single target variable. Most existing local methods utilize conditional independence relations, providing only a partially directed graph, and assume acyclicity for the ground-truth structure, even though real-… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Appears at AISTATS 2024

  14. arXiv:2403.12368  [pdf, other

    cs.CL cs.AI

    Characteristic AI Agents via Large Language Models

    Authors: Xi Wang, Hongliang Dai, Shen Gao, Piji Li

    Abstract: The advancement of Large Language Models (LLMs) has led to significant enhancements in the performance of chatbot systems. Many researchers have dedicated their efforts to the development of bringing characteristics to chatbots. While there have been commercial products for develo** role-driven chatbots using LLMs, it is worth noting that academic research in this area remains relatively scarce.… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: COLING 2024,The benchmark is available at: https://github.com/nuaa-nlp/Character100

  15. arXiv:2403.09171  [pdf, other

    cs.LG cs.AI

    ADEdgeDrop: Adversarial Edge Drop** for Robust Graph Neural Networks

    Authors: Zhaoliang Chen, Zhihao Wu, Ylli Sadikaj, Claudia Plant, Hong-Ning Dai, Shi** Wang, Wenzhong Guo

    Abstract: Although Graph Neural Networks (GNNs) have exhibited the powerful ability to gather graph-structured information from neighborhood nodes via various message-passing mechanisms, the performance of GNNs is limited by poor generalization and fragile robustness caused by noisy and redundant graph data. As a prominent solution, Graph Augmentation Learning (GAL) has recently received increasing attentio… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  16. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  17. arXiv:2403.03689  [pdf, other

    cs.CL cs.AI

    General2Specialized LLMs Translation for E-commerce

    Authors: Kaidi Chen, Ben Chen, Dehong Gao, Huangyu Dai, Wen Jiang, Wei Ning, Shanqing Yu, Libin Yang, Xiaoyan Cai

    Abstract: Existing Neural Machine Translation (NMT) models mainly handle translation in the general domain, while overlooking domains with special writing formulas, such as e-commerce and legal documents. Taking e-commerce as an example, the texts usually include amounts of domain-related words and have more grammar problems, which leads to inferior performances of current NMT methods. To address these prob… ▽ More

    Submitted 6 April, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: 4 pages, 1 figure, WWW2024 accepted

  18. arXiv:2403.00396  [pdf, other

    cs.CV cs.AI

    GLFNET: Global-Local (frequency) Filter Networks for efficient medical image segmentation

    Authors: Athanasios Tragakis, Qianying Liu, Chaitanya Kaul, Swalpa Kumar Roy, Hang Dai, Fani Deligianni, Roderick Murray-Smith, Daniele Faccio

    Abstract: We propose a novel transformer-style architecture called Global-Local Filter Network (GLFNet) for medical image segmentation and demonstrate its state-of-the-art performance. We replace the self-attention mechanism with a combination of global-local filter blocks to optimize model efficiency. The global filters extract features from the whole feature map whereas the local filters are being adaptiv… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  19. arXiv:2402.19007  [pdf, other

    cs.CV cs.RO

    DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments

    Authors: Ji Ma, Hongming Dai, Yao Mu, Pengying Wu, Hao Wang, Xiaowei Chi, Yang Fei, Shanghang Zhang, Chang Liu

    Abstract: Zero-Shot Object Navigation (ZSON) requires agents to autonomously locate and approach unseen objects in unfamiliar environments and has emerged as a particularly challenging task within the domain of Embodied AI. Existing datasets for develo** ZSON algorithms lack consideration of dynamic obstacles, object attribute diversity, and scene texts, thus exhibiting noticeable discrepancy from real-wo… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  20. arXiv:2402.13815  [pdf, other

    cs.SE cs.CR

    An Empirical Study on Oculus Virtual Reality Applications: Security and Privacy Perspectives

    Authors: Hanyang Guo, Hong-Ning Dai, Xiapu Luo, Zibin Zheng, Gengyang Xu, Fengliang He

    Abstract: Although Virtual Reality (VR) has accelerated its prevalent adoption in emerging metaverse applications, it is not a fundamentally new technology. On one hand, most VR operating systems (OS) are based on off-the-shelf mobile OS. As a result, VR apps also inherit privacy and security deficiencies from conventional mobile apps. On the other hand, in contrast to conventional mobile apps, VR apps can… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted by ICSE 2024

  21. arXiv:2402.10816  [pdf, other

    cs.LG cs.CR cs.DC eess.SP

    TernaryVote: Differentially Private, Communication Efficient, and Byzantine Resilient Distributed Optimization on Heterogeneous Data

    Authors: Richeng **, Yujie Gu, Kai Yue, Xiaofan He, Zhaoyang Zhang, Huaiyu Dai

    Abstract: Distributed training of deep neural networks faces three critical challenges: privacy preservation, communication efficiency, and robustness to fault and adversarial behaviors. Although significant research efforts have been devoted to addressing these challenges independently, their synthesis remains less explored. In this paper, we propose TernaryVote, which combines a ternary compressor and the… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  22. arXiv:2402.08703  [pdf, other

    q-bio.BM cs.AI cs.LG

    A Survey of Generative AI for de novo Drug Design: New Frontiers in Molecule and Protein Generation

    Authors: Xiangru Tang, Howard Dai, Elizabeth Knight, Fang Wu, Yunyang Li, Tianxiao Li, Mark Gerstein

    Abstract: Artificial intelligence (AI)-driven methods can vastly improve the historically costly drug design process, with various generative models already in widespread use. Generative models for de novo drug design, in particular, focus on the creation of novel biological compounds entirely from scratch, representing a promising future direction. Rapid development in the field, combined with the inherent… ▽ More

    Submitted 26 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  23. arXiv:2402.08539  [pdf

    cs.LG stat.AP

    Intelligent Diagnosis of Alzheimer's Disease Based on Machine Learning

    Authors: Mingyang Li, Hongyu Liu, Yixuan Li, Zejun Wang, Yuan Yuan, Honglin Dai

    Abstract: This study is based on the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset and aims to explore early detection and disease progression in Alzheimer's disease (AD). We employ innovative data preprocessing strategies, including the use of the random forest algorithm to fill missing data and the handling of outliers and invalid data, thereby fully mining and utilizing these limited data re… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  24. arXiv:2402.06330  [pdf, other

    cs.LG

    Continual Learning on Graphs: A Survey

    Authors: Zonggui Tian, Du Zhang, Hong-Ning Dai

    Abstract: Recently, continual graph learning has been increasingly adopted for diverse graph-structured data processing tasks in non-stationary environments. Despite its promising learning capability, current studies on continual graph learning mainly focus on mitigating the catastrophic forgetting problem while ignoring continuous performance improvement. To bridge this gap, this article aims to provide a… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  25. arXiv:2402.02698  [pdf, other

    cs.LG cs.AI math.OC

    Beyond Expectations: Learning with Stochastic Dominance Made Practical

    Authors: Shicong Cen, **cheng Mei, Hanjun Dai, Dale Schuurmans, Yuejie Chi, Bo Dai

    Abstract: Stochastic dominance models risk-averse preferences for decision making with uncertain outcomes, which naturally captures the intrinsic structure of the underlying uncertainty, in contrast to simply resorting to the expectations. Despite theoretically appealing, the application of stochastic dominance in machine learning has been scarce, due to the following challenges: $\textbf{i)}$, the original… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  26. arXiv:2401.14630  [pdf, other

    cs.CL cs.AI

    An Empirical Investigation of Domain Adaptation Ability for Chinese Spelling Check Models

    Authors: Xi Wang, Ruoqing Zhao, Hongliang Dai, Piji Li

    Abstract: Chinese Spelling Check (CSC) is a meaningful task in the area of Natural Language Processing (NLP) which aims at detecting spelling errors in Chinese texts and then correcting these errors. However, CSC models are based on pretrained language models, which are trained on a general corpus. Consequently, their performance may drop when confronted with downstream tasks involving domain-specific terms… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: ICASSP2024

  27. arXiv:2401.11641  [pdf, other

    cs.CL

    Revolutionizing Finance with LLMs: An Overview of Applications and Insights

    Authors: Huaqin Zhao, Zhengliang Liu, Zihao Wu, Yiwei Li, Tianze Yang, Peng Shu, Shaochen Xu, Haixing Dai, Lin Zhao, Gengchen Mai, Ninghao Liu, Tianming Liu

    Abstract: In recent years, Large Language Models (LLMs) like ChatGPT have seen considerable advancements and have been applied in diverse fields. Built on the Transformer architecture, these models are trained on extensive datasets, enabling them to understand and generate human language effectively. In the financial domain, the deployment of LLMs is gaining momentum. These models are being utilized for aut… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

  28. arXiv:2401.10519   

    eess.SY cs.RO

    A Wind-Aware Path Planning Method for UAV-Asisted Bridge Inspection

    Authors: Jian Xu, Hua Dai

    Abstract: In response to the gap in considering wind conditions in the bridge inspection using unmanned aerial vehicle (UAV) , this paper proposes a path planning method for UAVs that takes into account the influence of wind, based on the simulated annealing algorithm. The algorithm considers the wind factors, including the influence of different wind speeds and directions at the same time on the path plann… ▽ More

    Submitted 22 March, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: After carefully analysis, there is a bit design flaws in Algorithm 1. The experimental work of the paper is not comprehensive,which lacks an evaluation of the algorithm's running time

  29. arXiv:2401.06994  [pdf, other

    cs.CV

    UniVision: A Unified Framework for Vision-Centric 3D Perception

    Authors: Yu Hong, Qian Liu, Huayuan Cheng, Danjiao Ma, Hang Dai, Yu Wang, Guangzhi Cao, Yong Ding

    Abstract: The past few years have witnessed the rapid development of vision-centric 3D perception in autonomous driving. Although the 3D perception models share many structural and conceptual similarities, there still exist gaps in their feature representations, data formats, and objectives, posing challenges for unified and efficient 3D perception framework design. In this paper, we present UniVision, a si… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

  30. arXiv:2401.06224  [pdf, other

    eess.IV cs.CV cs.LG

    Leveraging Frequency Domain Learning in 3D Vessel Segmentation

    Authors: Xinyuan Wang, Chengwei Pan, Hongming Dai, Gangming Zhao, **peng Li, Xiao Zhang, Yizhou Yu

    Abstract: Coronary microvascular disease constitutes a substantial risk to human health. Employing computer-aided analysis and diagnostic systems, medical professionals can intervene early in disease progression, with 3D vessel segmentation serving as a crucial component. Nevertheless, conventional U-Net architectures tend to yield incoherent and imprecise segmentation outcomes, particularly for small vesse… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  31. arXiv:2401.05414  [pdf, other

    q-fin.ST cs.LG stat.ME

    On the Three Demons in Causality in Finance: Time Resolution, Nonstationarity, and Latent Factors

    Authors: Xinshuai Dong, Haoyue Dai, Yewen Fan, Songyao **, Sathyamoorthy Rajendran, Kun Zhang

    Abstract: Financial data is generally time series in essence and thus suffers from three fundamental issues: the mismatch in time resolution, the time-varying property of the distribution - nonstationarity, and causal factors that are important but unknown/unobserved. In this paper, we follow a causal perspective to systematically look into these three demons in finance. Specifically, we reexamine these iss… ▽ More

    Submitted 12 January, 2024; v1 submitted 28 December, 2023; originally announced January 2024.

  32. arXiv:2401.04334  [pdf, other

    cs.RO cs.AI

    Large Language Models for Robotics: Opportunities, Challenges, and Perspectives

    Authors: Jiaqi Wang, Zihao Wu, Yiwei Li, Hanqi Jiang, Peng Shu, Enze Shi, Huawen Hu, Chong Ma, Yiheng Liu, Xuhui Wang, Yincheng Yao, Xuan Liu, Huaqin Zhao, Zhengliang Liu, Haixing Dai, Lin Zhao, Bao Ge, Xiang Li, Tianming Liu, Shu Zhang

    Abstract: Large language models (LLMs) have undergone significant expansion and have been increasingly integrated across various domains. Notably, in the realm of robot task planning, LLMs harness their advanced reasoning and language comprehension capabilities to formulate precise and efficient action plans based on natural language instructions. However, for embodied tasks, where robots interact with comp… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  33. arXiv:2401.03494  [pdf

    cs.LG cs.CE physics.app-ph

    Pre-insertion resistors temperature prediction based on improved WOA-SVR

    Authors: Honghe Dai, Site Mo, Haoxin Wang, Nan Yin, Songhai Fan, Bixiong Li

    Abstract: The pre-insertion resistors (PIR) within high-voltage circuit breakers are critical components and warm up by generating Joule heat when an electric current flows through them. Elevated temperature can lead to temporary closure failure and, in severe cases, the rupture of PIR. To accurately predict the temperature of PIR, this study combines finite element simulation techniques with Support Vector… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

  34. arXiv:2312.15869  [pdf, other

    cs.CL cs.AI

    Medical Report Generation based on Segment-Enhanced Contrastive Representation Learning

    Authors: Ruoqing Zhao, Xi Wang, Hongliang Dai, Pan Gao, Piji Li

    Abstract: Automated radiology report generation has the potential to improve radiology reporting and alleviate the workload of radiologists. However, the medical report generation task poses unique challenges due to the limited availability of medical data and the presence of data bias. To maximize the utility of available data and reduce data bias, we propose MSCL (Medical image Segmentation with Contrasti… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: NLPCC 2023

  35. arXiv:2312.15740  [pdf, other

    cs.NI cs.CV cs.LG

    BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge

    Authors: Lin Sun, Weijun Wang, Tingting Yuan, Liang Mi, Haipeng Dai, Yunxin Liu, Xiaoming Fu

    Abstract: High-definition (HD) cameras for surveillance and road traffic have experienced tremendous growth, demanding intensive computation resources for real-time analytics. Recently, offloading frames from the front-end device to the back-end edge server has shown great promise. In multi-stream competitive environments, efficient bandwidth management and proper scheduling are crucial to ensure both high… ▽ More

    Submitted 4 February, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

    Comments: Accepted by 2024 IEEE INFOCOM

  36. arXiv:2312.11882  [pdf, other

    cs.CL cs.AI cs.LG

    ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models Inference

    Authors: Ziqian Zeng, Yihuai Hong, Hongliang Dai, Hui** Zhuang, Cen Chen

    Abstract: Early Exiting is one of the most popular methods to achieve efficient inference. Current early exiting methods adopt the (weighted) sum of the cross entropy loss of all internal classifiers during training, imposing all these classifiers to predict all instances correctly. However, during inference, as long as one internal classifier predicts an instance correctly, it can accelerate without losing… ▽ More

    Submitted 7 April, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted in AAAI24

  37. arXiv:2312.10935  [pdf, other

    cs.DC cs.AI cs.LG

    AEDFL: Efficient Asynchronous Decentralized Federated Learning with Heterogeneous Devices

    Authors: Ji Liu, Tianshi Che, Yang Zhou, Ruoming **, Huaiyu Dai, De**g Dou, Patrick Valduriez

    Abstract: Federated Learning (FL) has achieved significant achievements recently, enabling collaborative model training on distributed data over edge devices. Iterative gradient or model exchanges between devices and the centralized server in the standard FL paradigm suffer from severe efficiency bottlenecks on the server. While enabling collaborative training without a central server, existing decentralize… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: To appear in SDM 2024, 15 pages

  38. arXiv:2312.08034  [pdf, other

    eess.IV cs.CR cs.CV cs.LG

    Individualized Deepfake Detection Exploiting Traces Due to Double Neural-Network Operations

    Authors: Mushfiqur Rahman, Runze Liu, Chau-Wai Wong, Huaiyu Dai

    Abstract: In today's digital landscape, journalists urgently require tools to verify the authenticity of facial images and videos depicting specific public figures before incorporating them into news stories. Existing deepfake detectors are not optimized for this detection task when an image is associated with a specific and identifiable individual. This study focuses on the deepfake detection of facial ima… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  39. arXiv:2312.06220  [pdf, other

    cs.LG cs.AI

    Dance of Channel and Sequence: An Efficient Attention-Based Approach for Multivariate Time Series Forecasting

    Authors: Haoxin Wang, Yipeng Mo, Nan Yin, Honghe Dai, Bixiong Li, Songhai Fan, Site Mo

    Abstract: In recent developments, predictive models for multivariate time series analysis have exhibited commendable performance through the adoption of the prevalent principle of channel independence. Nevertheless, it is imperative to acknowledge the intricate interplay among channels, which fundamentally influences the outcomes of multivariate predictions. Consequently, the notion of channel independence,… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  40. arXiv:2312.06188  [pdf, other

    cs.CL

    From Ultra-Fine to Fine: Fine-tuning Ultra-Fine Entity Ty** Models to Fine-grained

    Authors: Hongliang Dai, Ziqian Zeng

    Abstract: For the task of fine-grained entity ty** (FET), due to the use of a large number of entity types, it is usually considered too costly to manually annotating a training dataset that contains an ample number of examples for each type. A common way to address this problem is to use distantly annotated training data that contains incorrect labels. However, the performance of models trained solely wi… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: ACL 2023

  41. arXiv:2312.05770  [pdf, other

    cs.DC

    FedASMU: Efficient Asynchronous Federated Learning with Dynamic Staleness-aware Model Update

    Authors: Ji Liu, Juncheng Jia, Tianshi Che, Chao Huo, Jiaxiang Ren, Yang Zhou, Huaiyu Dai, De**g Dou

    Abstract: As a promising approach to deal with distributed data, Federated Learning (FL) achieves major advancements in recent years. FL enables collaborative model training by exploiting the raw data dispersed in multiple edge devices. However, the data is generally non-independent and identically distributed, i.e., statistical heterogeneity, and the edge devices significantly differ in terms of both compu… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 18 pages, to appear in AAAI 2024

  42. arXiv:2312.05256  [pdf, other

    eess.IV cs.AI

    Holistic Evaluation of GPT-4V for Biomedical Imaging

    Authors: Zhengliang Liu, Hanqi Jiang, Tianyang Zhong, Zihao Wu, Chong Ma, Yiwei Li, Xiaowei Yu, Yutong Zhang, Yi Pan, Peng Shu, Yanjun Lyu, Lu Zhang, Junjie Yao, Peixin Dong, Chao Cao, Zhenxiang Xiao, Jiaqi Wang, Huan Zhao, Shaochen Xu, Yaonai Wei, **gyuan Chen, Haixing Dai, Peilong Wang, Hao He, Zewei Wang , et al. (25 additional authors not shown)

    Abstract: In this paper, we present a large-scale evaluation probing GPT-4V's capabilities and limitations for biomedical image analysis. GPT-4V represents a breakthrough in artificial general intelligence (AGI) for computer vision, with applications in the biomedical domain. We assess GPT-4V's performance across 16 medical imaging categories, including radiology, oncology, ophthalmology, pathology, and mor… ▽ More

    Submitted 10 November, 2023; originally announced December 2023.

  43. arXiv:2311.17401  [pdf, ps, other

    cs.LG cs.AI

    Gene-MOE: A sparsely gated prognosis and classification framework exploiting pan-cancer genomic information

    Authors: Xiangyu Meng, Xue Li, Qing Yang, Huanhuan Dai, Lian Qiao, Hongzhen Ding, Long Hao, Xun Wang

    Abstract: Benefiting from the advancements in deep learning, various genomic analytical techniques, such as survival analysis, classification of tumors and their subtypes, and exploration of specific pathways, have significantly enhanced our understanding of the biological mechanisms driving cancer. However, the overfitting issue, arising from the limited number of patient samples, poses a challenge in impr… ▽ More

    Submitted 18 December, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

  44. arXiv:2311.11249  [pdf, other

    cs.LG cs.AI cs.CR

    Open Set Dandelion Network for IoT Intrusion Detection

    Authors: Jiashu Wu, Hao Dai, Kenneth B. Kent, Jerome Yen, Chengzhong Xu, Yang Wang

    Abstract: As IoT devices become widely, it is crucial to protect them from malicious intrusions. However, the data scarcity of IoT limits the applicability of traditional intrusion detection methods, which are highly data-dependent. To address this, in this paper we propose the Open-Set Dandelion Network (OSDN) based on unsupervised heterogeneous domain adaptation in an open-set manner. The OSDN model perfo… ▽ More

    Submitted 7 January, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

    Comments: Accepted by ACM Transactions on Internet Technology

  45. arXiv:2311.02883  [pdf, other

    cs.CL

    SQLPrompt: In-Context Text-to-SQL with Minimal Labeled Data

    Authors: Ruoxi Sun, Sercan Ö. Arik, Rajarishi Sinha, Hootan Nakhost, Hanjun Dai, Pengcheng Yin, Tomas Pfister

    Abstract: Text-to-SQL aims to automate the process of generating SQL queries on a database from natural language text. In this work, we propose "SQLPrompt", tailored to improve the few-shot prompting capabilities of Text-to-SQL for Large Language Models (LLMs). Our methods include innovative prompt design, execution-based consistency decoding strategy which selects the SQL with the most consistent execution… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  46. arXiv:2311.00693  [pdf, other

    cs.AI

    On Task-personalized Multimodal Few-shot Learning for Visually-rich Document Entity Retrieval

    Authors: Jiayi Chen, Hanjun Dai, Bo Dai, Aidong Zhang, Wei Wei

    Abstract: Visually-rich document entity retrieval (VDER), which extracts key information (e.g. date, address) from document images like invoices and receipts, has become an important topic in industrial NLP applications. The emergence of new document types at a constant pace, each with its unique entity types, presents a unique challenge: many documents contain unseen entity types that occur only a couple o… ▽ More

    Submitted 8 December, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: Paper published at Findings of the Association for Computational Linguistics: EMNLP, 2023

  47. arXiv:2310.15080  [pdf, other

    cs.LG cs.CL cs.DC

    Federated Learning of Large Language Models with Parameter-Efficient Prompt Tuning and Adaptive Optimization

    Authors: Tianshi Che, Ji Liu, Yang Zhou, Jiaxiang Ren, Jiwen Zhou, Victor S. Sheng, Huaiyu Dai, De**g Dou

    Abstract: Federated learning (FL) is a promising paradigm to enable collaborative model training with decentralized data. However, the training process of Large Language Models (LLMs) generally incurs the update of significant parameters, which limits the applicability of FL techniques to tackle the LLMs in real scenarios. Prompt tuning can significantly reduce the number of parameters to update, but it eit… ▽ More

    Submitted 11 February, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 18 pages, accepted by EMNLP 2023

  48. arXiv:2310.07064  [pdf, other

    cs.AI cs.CL

    Large Language Models can Learn Rules

    Authors: Zhaocheng Zhu, Yuan Xue, Xinyun Chen, Denny Zhou, Jian Tang, Dale Schuurmans, Hanjun Dai

    Abstract: When prompted with a few examples and intermediate steps, large language models (LLMs) have demonstrated impressive performance in various reasoning tasks. However, prompting methods that rely on implicit knowledge in an LLM often generate incorrect answers when the implicit knowledge is wrong or inconsistent with the task. To tackle this problem, we present Hypotheses-to-Theories (HtT), a framewo… ▽ More

    Submitted 24 April, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

  49. arXiv:2310.05242  [pdf, other

    cs.CL cs.AI

    ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data

    Authors: Tianyang Zhong, Wei Zhao, Yutong Zhang, Yi Pan, Peixin Dong, Zuowei Jiang, Xiaoyan Kui, Youlan Shang, Li Yang, Yaonai Wei, Longtao Yang, Hao Chen, Huan Zhao, Yuxiao Liu, Ning Zhu, Yiwei Li, Yisong Wang, Jiaqi Yao, Jiaqi Wang, Ying Zeng, Lei He, Chao Zheng, Zhixue Zhang, Ming Li, Zhengliang Liu , et al. (17 additional authors not shown)

    Abstract: Radiology report generation, as a key step in medical image analysis, is critical to the quantitative analysis of clinically informed decision-making levels. However, complex and diverse radiology reports with cross-source heterogeneity pose a huge generalizability challenge to the current methods under massive data volume, mainly because the style and normativity of radiology reports are obviousl… ▽ More

    Submitted 9 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

  50. arXiv:2309.13446  [pdf, other

    cs.CV

    Video Timeline Modeling For News Story Understanding

    Authors: Meng Liu, Mingda Zhang, Jialu Liu, Hanjun Dai, Ming-Hsuan Yang, Shuiwang Ji, Zheyun Feng, Boqing Gong

    Abstract: In this paper, we present a novel problem, namely video timeline modeling. Our objective is to create a video-associated timeline from a set of videos related to a specific topic, thereby facilitating the content and structure understanding of the story being told. This problem has significant potential in various real-world applications, for instance, news story summarization. To bootstrap resear… ▽ More

    Submitted 27 October, 2023; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: Accepted as a spotlight by NeurIPS 2023, Track on Datasets and Benchmarks