Skip to main content

Showing 1–50 of 103 results for author: Cao, D

Searching in archive cs. Search in all archives.
.
  1. MuGSI: Distilling GNNs with Multi-Granularity Structural Information for Graph Classification

    Authors: Tianjun Yao, Jiaqi Sun, Defu Cao, Kun Zhang, Guangyi Chen

    Abstract: Recent works have introduced GNN-to-MLP knowledge distillation (KD) frameworks to combine both GNN's superior performance and MLP's fast inference speed. However, existing KD frameworks are primarily designed for node classification within single graphs, leaving their applicability to graph classification largely unexplored. Two main challenges arise when extending KD for node classification to gr… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 12 pages, 4 figures. Accepted by TheWebConf2024

    ACM Class: I.2.6

  2. arXiv:2406.05033  [pdf, other

    cs.LG math.OC

    Gradient Descent on Logistic Regression with Non-Separable Data and Large Step Sizes

    Authors: Si Yi Meng, Antonio Orvieto, Daniel Yiming Cao, Christopher De Sa

    Abstract: We study gradient descent (GD) dynamics on logistic regression problems with large, constant step sizes. For linearly-separable data, it is known that GD converges to the minimizer with arbitrarily large step sizes, a property which no longer holds when the problem is not separable. In fact, the behaviour can be much more complex -- a sequence of period-doubling bifurcations begins at the critical… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2405.21056  [pdf, other

    cs.RO cs.AI cs.CV

    An Organic Weed Control Prototype using Directed Energy and Deep Learning

    Authors: Deng Cao, Hongbo Zhang, Rajveer Dhillon

    Abstract: Organic weed control is a vital to improve crop yield with a sustainable approach. In this work, a directed energy weed control robot prototype specifically designed for organic farms is proposed. The robot uses a novel distributed array robot (DAR) unit for weed treatment. Soybean and corn databases are built to train deep learning neural nets to perform weed recognition. The initial deep learnin… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  4. arXiv:2404.09403  [pdf, other

    cs.LG

    Neuro-Inspired Information-Theoretic Hierarchical Perception for Multimodal Learning

    Authors: Xiongye Xiao, Gengshuo Liu, Gaurav Gupta, Defu Cao, Shixuan Li, Yaxing Li, Tianqing Fang, Mingxi Cheng, Paul Bogdan

    Abstract: Integrating and processing information from various sources or modalities are critical for obtaining a comprehensive and accurate perception of the real world in autonomous systems and cyber-physical systems. Drawing inspiration from neuroscience, we develop the Information-Theoretic Hierarchical Perception (ITHP) model, which utilizes the concept of information bottleneck. Different from most tra… ▽ More

    Submitted 22 April, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: Accepted by ICLR 2024. Camera Ready Version

  5. arXiv:2403.14409  [pdf, other

    cs.CL cs.AI

    Locating and Mitigating Gender Bias in Large Language Models

    Authors: Yuchen Cai, Ding Cao, Rongxi Guo, Yaqin Wen, Guiquan Liu, Enhong Chen

    Abstract: Large language models(LLM) are pre-trained on extensive corpora to learn facts and human cognition which contain human preferences. However, this process can inadvertently lead to these models acquiring biases and stereotypes prevalent in society. Prior research has typically tackled the issue of bias through a one-dimensional perspective, concentrating either on locating or mitigating it. This li… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 23 pages, 5 figures

  6. arXiv:2403.14381  [pdf, other

    cs.CL cs.AI

    Editing Knowledge Representation of Language Model via Rephrased Prefix Prompts

    Authors: Yuchen Cai, Ding Cao, Rongxi Guo, Yaqin Wen, Guiquan Liu, Enhong Chen

    Abstract: Neural language models (LMs) have been extensively trained on vast corpora to store factual knowledge about various aspects of the world described in texts. Current technologies typically employ knowledge editing methods or specific prompts to modify LM outputs. However, existing knowledge editing methods are costly and inefficient, struggling to produce appropriate text. Additionally, prompt engi… ▽ More

    Submitted 11 May, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 19pages,3figures

  7. arXiv:2403.06419  [pdf, other

    cs.LG

    Causal Multi-Label Feature Selection in Federated Setting

    Authors: Yukun Song, Dayuan Cao, Jiali Miao, Shuai Yang, Kui Yu

    Abstract: Multi-label feature selection serves as an effective mean for dealing with high-dimensional multi-label data. To achieve satisfactory performance, existing methods for multi-label feature selection often require the centralization of substantial data from multiple sources. However, in Federated setting, centralizing data from all sources and merging them into a single dataset is not feasible. To t… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  8. arXiv:2402.18920  [pdf, other

    cs.CV cs.AI cs.CG

    Spectral Meets Spatial: Harmonising 3D Shape Matching and Interpolation

    Authors: Dongliang Cao, Marvin Eisenberger, Nafie El Amrani, Daniel Cremers, Florian Bernard

    Abstract: Although 3D shape matching and interpolation are highly interrelated, they are often studied separately and applied sequentially to relate different 3D shapes, thus resulting in sub-optimal performance. In this work we present a unified framework to predict both point-wise correspondences and shape interpolation between 3D shapes. To this end, we combine the deep functional map framework with clas… ▽ More

    Submitted 27 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: accepted by CVPR2024

  9. arXiv:2402.09099  [pdf, other

    cs.AI

    Exploring Neuron Interactions and Emergence in LLMs: From the Multifractal Analysis Perspective

    Authors: Xiongye Xiao, Chenyu Zhou, Heng **, Defu Cao, Yaxing Li, Yizhuo Zhou, Shixuan Li, Paul Bogdan

    Abstract: Prior studies on the emergence in large models have primarily focused on how the functional capabilities of large language models (LLMs) scale with model size. Our research, however, transcends this traditional paradigm, aiming to deepen our understanding of the emergence within LLMs by placing a special emphasis not just on the model size but more significantly on the complex behavior of neuron i… ▽ More

    Submitted 21 March, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  10. arXiv:2402.06073  [pdf

    cs.CL cs.SD eess.AS

    LightCAM: A Fast and Light Implementation of Context-Aware Masking based D-TDNN for Speaker Verification

    Authors: Di Cao, Xianchen Wang, Junfeng Zhou, Jiakai Zhang, Yan**g Lei, Wenpeng Chen

    Abstract: Traditional Time Delay Neural Networks (TDNN) have achieved state-of-the-art performance at the cost of high computational complexity and slower inference speed, making them difficult to implement in an industrial environment. The Densely Connected Time Delay Neural Network (D-TDNN) with Context Aware Masking (CAM) module has proven to be an efficient structure to reduce complexity while maintaini… ▽ More

    Submitted 12 February, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  11. arXiv:2402.05359  [pdf, other

    cs.AI cs.CL cs.LG

    Prompting with Divide-and-Conquer Program Makes Large Language Models Discerning to Hallucination and Deception

    Authors: Yizhou Zhang, Lun Du, Defu Cao, Qiang Fu, Yan Liu

    Abstract: Foundation models, such as Large language Models (LLMs), have attracted significant amount of interest due to their large number of applications. However, when handling tasks involving repetitive sub-tasks and/or deceptive contents, such as arithmetic calculation and article-level fake news detection, simple instructional prompts suffer from inaccurate responses. Existing works show that more comp… ▽ More

    Submitted 24 June, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: Preprint

  12. arXiv:2401.01918  [pdf, other

    cs.CV

    Distilling Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection

    Authors: Haowen Zheng, Dong Cao, **tao Xu, Rui Ai, Weihao Gu, Yang Yang, Yanyan Liang

    Abstract: Striking a balance between precision and efficiency presents a prominent challenge in the bird's-eye-view (BEV) 3D object detection. Although previous camera-based BEV methods achieved remarkable performance by incorporating long-term temporal information, most of them still face the problem of low efficiency. One potential solution is knowledge distillation. Existing distillation methods only foc… ▽ More

    Submitted 8 January, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

  13. arXiv:2312.16797  [pdf, other

    cs.CV

    Multi-Prompts Learning with Cross-Modal Alignment for Attribute-based Person Re-Identification

    Authors: Ya**g Zhai, Yawen Zeng, Zhiyong Huang, Zheng Qin, Xin **, Da Cao

    Abstract: The fine-grained attribute descriptions can significantly supplement the valuable semantic information for person image, which is vital to the success of person re-identification (ReID) task. However, current ReID algorithms typically failed to effectively leverage the rich contextual information available, primarily due to their reliance on simplistic and coarse utilization of image attributes. R… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: AAAI 2024

  14. arXiv:2311.12799  [pdf, other

    cs.CV cs.AI

    A Fine-Grained Image Description Generation Method Based on Joint Objectives

    Authors: Yifan Zhang, Chunzhen Lin, Donglin Cao, Dazhen Lin

    Abstract: The goal of fine-grained image description generation techniques is to learn detailed information from images and simulate human-like descriptions that provide coherent and comprehensive textual details about the image content. Currently, most of these methods face two main challenges: description repetition and omission. Moreover, the existing evaluation metrics cannot clearly reflect the perform… ▽ More

    Submitted 1 September, 2023; originally announced November 2023.

  15. arXiv:2311.06278  [pdf

    q-fin.ST cs.AI cs.LG

    Boosting Stock Price Prediction with Anticipated Macro Policy Changes

    Authors: Md Sabbirul Haque, Md Shahedul Amin, Jonayet Miah, Duc Minh Cao, Ashiqul Haque Ahmed

    Abstract: Prediction of stock prices plays a significant role in aiding the decision-making of investors. Considering its importance, a growing literature has emerged trying to forecast stock prices with improved accuracy. In this study, we introduce an innovative approach for forecasting stock prices with greater accuracy. We incorporate external economic environment-related information along with stock pr… ▽ More

    Submitted 27 October, 2023; originally announced November 2023.

    Journal ref: Journal of Mathematics and Statistics Studies, 4(3), 29-34 (2023)

  16. arXiv:2311.00517  [pdf

    cs.LG cs.CV cs.HC

    Improving Cardiovascular Disease Prediction Through Comparative Analysis of Machine Learning Models: A Case Study on Myocardial Infarction

    Authors: Jonayet Miah, Duc M Ca, Md Abu Sayed, Ehsanur Rashid Lipu, Fuad Mahmud, S M Yasir Arafat

    Abstract: Cardiovascular disease remains a leading cause of mortality in the contemporary world. Its association with smoking, elevated blood pressure, and cholesterol levels underscores the significance of these risk factors. This study addresses the challenge of predicting myocardial illness, a formidable task in medical research. Accurate predictions are pivotal for refining healthcare strategies. This i… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Journal ref: 2023 15th International Conference on Innovations in Information Technology (IIT) - Track 2: Artificial Intelligence in Data Science

  17. arXiv:2310.18237   

    cs.CV cs.HC

    Generative AI Model for Artistic Style Transfer Using Convolutional Neural Networks

    Authors: Jonayet Miah, Duc M Cao, Md Abu Sayed, Md. Sabbirul Haque

    Abstract: Artistic style transfer, a captivating application of generative artificial intelligence, involves fusing the content of one image with the artistic style of another to create unique visual compositions. This paper presents a comprehensive overview of a novel technique for style transfer using Convolutional Neural Networks (CNNs). By leveraging deep image representations learned by CNNs, we demons… ▽ More

    Submitted 30 October, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: Incorrectly Input

  18. arXiv:2310.17729  [pdf

    cs.LG cs.AI cs.CV

    Improving Traffic Density Forecasting in Intelligent Transportation Systems Using Gated Graph Neural Networks

    Authors: Razib Hayat Khan, Jonayet Miah, S M Yasir Arafat, M M Mahbubul Syeed, Duc M Ca

    Abstract: This study delves into the application of graph neural networks in the realm of traffic forecasting, a crucial facet of intelligent transportation systems. Accurate traffic predictions are vital for functions like trip planning, traffic control, and vehicle routing in such systems. Three prominent GNN architectures Graph Convolutional Networks (Graph Sample and Aggregation) and Gated Graph Neural… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

  19. arXiv:2310.17720  [pdf

    eess.IV cs.CV cs.LG

    Advancing Brain Tumor Detection: A Thorough Investigation of CNNs, Clustering, and SoftMax Classification in the Analysis of MRI Images

    Authors: Jonayet Miah, Duc M Cao, Md Abu Sayed3, Md Siam Taluckder, Md Sabbirul Haque, Fuad Mahmud

    Abstract: Brain tumors pose a significant global health challenge due to their high prevalence and mortality rates across all age groups. Detecting brain tumors at an early stage is crucial for effective treatment and patient outcomes. This study presents a comprehensive investigation into the use of Convolutional Neural Networks (CNNs) for brain tumor detection using Magnetic Resonance Imaging (MRI) images… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Journal ref: JOIV : International Journal on Informatics Visualization, JOIV : Int. J. Inform. Visualization ISSN / E-ISSN 2549-9610 / 2549-9904, 2023

  20. arXiv:2310.11420  [pdf, other

    cs.CV cs.CG

    Revisiting Map Relations for Unsupervised Non-Rigid Shape Matching

    Authors: Dongliang Cao, Paul Roetzer, Florian Bernard

    Abstract: We propose a novel unsupervised learning approach for non-rigid 3D shape matching. Our approach improves upon recent state-of-the art deep functional map methods and can be applied to a broad range of different challenging scenarios. Previous deep functional map methods mainly focus on feature extraction and aim exclusively at obtaining more expressive features for functional map computation. Howe… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: 3DV 2024

  21. arXiv:2310.08230  [pdf, other

    cs.CV

    Fast Discrete Optimisation for Geometrically Consistent 3D Shape Matching

    Authors: Paul Roetzer, Ahmed Abbas, Dongliang Cao, Florian Bernard, Paul Swoboda

    Abstract: In this work we propose to combine the advantages of learning-based and combinatorial formalisms for 3D shape matching. While learning-based shape matching solutions lead to state-of-the-art matching performance, they do not ensure geometric consistency, so that obtained matchings are locally unsmooth. On the contrary, axiomatic methods allow to take geometric consistency into account by explicitl… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: Paul Roetzer and Ahmed Abbas contributed equally

  22. arXiv:2310.07259  [pdf, other

    cs.CV cs.AI

    Uncovering Hidden Connections: Iterative Search and Reasoning for Video-grounded Dialog

    Authors: Haoyu Zhang, Meng Liu, Yaowei Wang, Da Cao, Weili Guan, Liqiang Nie

    Abstract: In contrast to conventional visual question answering, video-grounded dialog necessitates a profound understanding of both dialog history and video content for accurate response generation. Despite commendable progress made by existing approaches, they still face the challenges of incrementally understanding complex dialog history and assimilating video information. In response to these challenges… ▽ More

    Submitted 22 May, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  23. arXiv:2310.04948  [pdf, other

    cs.LG cs.CL

    TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting

    Authors: Defu Cao, Furong Jia, Sercan O Arik, Tomas Pfister, Yixiang Zheng, Wen Ye, Yan Liu

    Abstract: The past decade has witnessed significant advances in time series modeling with deep learning. While achieving state-of-the-art results, the best-performing architectures vary highly across applications and domains. Meanwhile, for natural language processing, the Generative Pre-trained Transformer (GPT) has demonstrated impressive performance via training one general-purpose model across various t… ▽ More

    Submitted 2 April, 2024; v1 submitted 7 October, 2023; originally announced October 2023.

    Comments: Accepted by ICLR 2024. Camera Ready Version

  24. arXiv:2309.15877   

    cs.LG cs.AI

    Neuro-Inspired Hierarchical Multimodal Learning

    Authors: Xiongye Xiao, Gengshuo Liu, Gaurav Gupta, Defu Cao, Shixuan Li, Yaxing Li, Tianqing Fang, Mingxi Cheng, Paul Bogdan

    Abstract: Integrating and processing information from various sources or modalities are critical for obtaining a comprehensive and accurate perception of the real world. Drawing inspiration from neuroscience, we develop the Information-Theoretic Hierarchical Perception (ITHP) model, which utilizes the concept of information bottleneck. Distinct from most traditional fusion models that aim to incorporate all… ▽ More

    Submitted 23 April, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: I am requesting the withdrawal of this submission due to an inadvertent duplication. The paper was submitted twice under different IDs, which was not intentional. The other submission (arXiv:2404.09403) contains the most updated and comprehensive version of the paper, and I would like to retain that as the sole version on the platform

  25. arXiv:2308.03865  [pdf, other

    eess.IV cs.CV

    DefCor-Net: Physics-Aware Ultrasound Deformation Correction

    Authors: Zhongliang Jiang, Yue Zhou, Dongliang Cao, Nassir Navab

    Abstract: The recovery of morphologically accurate anatomical images from deformed ones is challenging in ultrasound (US) image acquisition, but crucial to accurate and consistent diagnosis, particularly in the emerging field of computer-assisted diagnosis. This article presents a novel anatomy-aware deformation correction approach based on a coarse-to-fine, multi-scale deep neural network (DefCor-Net). To… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: Accepted by MedIA. code is available

  26. arXiv:2306.05257  [pdf, other

    cs.LG q-bio.QM

    Comprehensive evaluation of deep and graph learning on drug-drug interactions prediction

    Authors: Xuan Lin, Lichang Dai, Yafang Zhou, Zu-Guo Yu, Wen Zhang, Jian-Yu Shi, Dong-Sheng Cao, Li Zeng, Haowen Chen, Bosheng Song, Philip S. Yu, Xiangxiang Zeng

    Abstract: Recent advances and achievements of artificial intelligence (AI) as well as deep and graph learning models have established their usefulness in biomedical applications, especially in drug-drug interactions (DDIs). DDIs refer to a change in the effect of one drug to the presence of another drug in the human body, which plays an essential role in drug discovery and clinical research. DDIs prediction… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Accepted by Briefings in Bioinformatics

  27. Milestones in Autonomous Driving and Intelligent Vehicles Part II: Perception and Planning

    Authors: Long Chen, Siyu Teng, Bai Li, Xiaoxiang Na, Yuchen Li, Zixuan Li, **jun Wang, Dongpu Cao, Nanning Zheng, Fei-Yue Wang

    Abstract: Growing interest in autonomous driving (AD) and intelligent vehicles (IVs) is fueled by their promise for enhanced safety, efficiency, and economic benefits. While previous surveys have captured progress in this field, a comprehensive and forward-looking summary is needed. Our work fills this gap through three distinct articles. The first part, a "Survey of Surveys" (SoS), outlines the history, su… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: 17pages, 6figures. IEEE Transactions on Systems, Man, and Cybernetics: Systems. arXiv admin note: text overlap with arXiv:2303.09824

  28. arXiv:2305.11239  [pdf, other

    cs.AI cs.RO eess.SY

    Milestones in Autonomous Driving and Intelligent Vehicles Part I: Control, Computing System Design, Communication, HD Map, Testing, and Human Behaviors

    Authors: Long Chen, Yuchen Li, Chao Huang, Yang Xing, Daxin Tian, Li Li, Zhongxu Hu, Siyu Teng, Chen Lv, **jun Wang, Dongpu Cao, Nanning Zheng, Fei-Yue Wang

    Abstract: Interest in autonomous driving (AD) and intelligent vehicles (IVs) is growing at a rapid pace due to the convenience, safety, and economic benefits. Although a number of surveys have reviewed research achievements in this field, they are still limited in specific tasks and lack systematic summaries and research directions in the future. Our work is divided into 3 independent articles and the first… ▽ More

    Submitted 26 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: 18 pages, 4 figures, 3 tables, in IEEE Trans. Syst. Man Cybern. Syst

  29. arXiv:2304.14419  [pdf, other

    cs.CV cs.AI

    Unsupervised Learning of Robust Spectral Shape Matching

    Authors: Dongliang Cao, Paul Roetzer, Florian Bernard

    Abstract: We propose a novel learning-based approach for robust 3D shape matching. Our method builds upon deep functional maps and can be trained in a fully unsupervised manner. Previous deep functional map methods mainly focus on predicting optimised functional maps alone, and then rely on off-the-shelf post-processing to obtain accurate point-wise maps during inference. However, this two-stage procedure f… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Journal ref: ACM Transactions on Graphics 2023

  30. Visualising Personal Data Flows: Insights from a Case Study of Booking.com

    Authors: Haiyue Yuan, Matthew Boakes, Xiao Ma, Dongmei Cao, Shujun Li

    Abstract: Commercial organisations are holding and processing an ever-increasing amount of personal data. Policies and laws are continually changing to require these companies to be more transparent regarding the collection, storage, processing and sharing of this data. This paper reports our work of taking Booking.com as a case study to visualise personal data flows extracted from their privacy policy. By… ▽ More

    Submitted 17 July, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: This is the full edition of a paper published in Intelligent Information Systems: CAiSE Forum 2023, Zaragoza, Spain, June 12-16, 2023, Proceedings, Lecture Notes in Business Information Processing (LNBIP), Volume 477, pp. 52-60, 2023, Springer Nature, https://link.springer.com/book/10.1007/978-3-031-34674-3

    Journal ref: Lecture Notes in Business Information Processing (LNBIP), 2023

  31. arXiv:2304.07633  [pdf, other

    cs.CL cs.LG

    Interpretable Detection of Out-of-Context Misinformation with Neural-Symbolic-Enhanced Large Multimodal Model

    Authors: Yizhou Zhang, Loc Trinh, Defu Cao, Zijun Cui, Yan Liu

    Abstract: Recent years have witnessed the sustained evolution of misinformation that aims at manipulating public opinions. Unlike traditional rumors or fake news editors who mainly rely on generated and/or counterfeited images, text and videos, current misinformation creators now more tend to use out-of-context multimedia contents (e.g. mismatched images and captions) to deceive the public and fake news det… ▽ More

    Submitted 5 April, 2024; v1 submitted 15 April, 2023; originally announced April 2023.

    Comments: 9 Pages, 3 Figures

  32. arXiv:2304.04332  [pdf, other

    cs.PL

    Better Together: Unifying Datalog and Equality Saturation

    Authors: Yihong Zhang, Yisu Remy Wang, Oliver Flatt, David Cao, Philip Zucker, Eli Rosenthal, Zachary Tatlock, Max Willsey

    Abstract: We present egglog, a fixpoint reasoning system that unifies Datalog and equality saturation (EqSat). Like Datalog, it supports efficient incremental execution, cooperating analyses, and lattice-based reasoning. Like EqSat, it supports term rewriting, efficient congruence closure, and extraction of optimized terms. We identify two recent applications--a unification-based pointer analysis in Datal… ▽ More

    Submitted 15 May, 2023; v1 submitted 9 April, 2023; originally announced April 2023.

    Comments: PLDI 2023

  33. arXiv:2304.00592  [pdf, other

    cs.CL

    PK-Chat: Pointer Network Guided Knowledge Driven Generative Dialogue Model

    Authors: Cheng Deng, Bo Tong, Luoyi Fu, Jiaxin Ding, Dexing Cao, Xinbing Wang, Chenghu Zhou

    Abstract: In the research of end-to-end dialogue systems, using real-world knowledge to generate natural, fluent, and human-like utterances with correct answers is crucial. However, domain-specific conversational dialogue systems may be incoherent and introduce erroneous external information to answer questions due to the out-of-vocabulary issue or the wrong knowledge from the parameters of the neural netwo… ▽ More

    Submitted 2 April, 2023; originally announced April 2023.

    ACM Class: I.2.7; F.4.1

  34. Milestones in Autonomous Driving and Intelligent Vehicles: Survey of Surveys

    Authors: Long Chen, Yuchen Li, Chao Huang, Bai Li, Yang Xing, Daxin Tian, Li Li, Zhongxu Hu, Xiaoxiang Na, Zixuan Li, Siyu Teng, Chen Lv, **jun Wang, Dongpu Cao, Nanning Zheng, Fei-Yue Wang

    Abstract: Interest in autonomous driving (AD) and intelligent vehicles (IVs) is growing at a rapid pace due to the convenience, safety, and economic benefits. Although a number of surveys have reviewed research achievements in this field, they are still limited in specific tasks, lack of systematic summary and research directions in the future. Here we propose a Survey of Surveys (SoS) for total technologie… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: 13 pages, 3 tables, 0 figure

    Journal ref: IEEE Transactions on Intelligent Vehicles, vol. 8, no. 2, pp. 1046-1056, Feb. 2023

  35. arXiv:2303.10971  [pdf, other

    cs.CV cs.AI cs.CG

    Self-Supervised Learning for Multimodal Non-Rigid 3D Shape Matching

    Authors: Dongliang Cao, Florian Bernard

    Abstract: The matching of 3D shapes has been extensively studied for shapes represented as surface meshes, as well as for shapes represented as point clouds. While point clouds are a common representation of raw real-world 3D data (e.g. from laser scanners), meshes encode rich and expressive topological information, but their creation typically requires some form of (often manual) curation. In turn, methods… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: accepted by CVPR 2023

  36. arXiv:2303.02320  [pdf, other

    cs.LG

    Estimating Treatment Effects from Irregular Time Series Observations with Hidden Confounders

    Authors: Defu Cao, James Enouen, Yu**g Wang, Xiangchen Song, Chuizheng Meng, Hao Niu, Yan Liu

    Abstract: Causal analysis for time series data, in particular estimating individualized treatment effect (ITE), is a key task in many real-world applications, such as finance, retail, healthcare, etc. Real-world time series can include large-scale, irregular, and intermittent time series observations, raising significant challenges to existing work attempting to estimate treatment effects. Specifically, the… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: Accepted by AAAI 2023

  37. arXiv:2303.02304  [pdf, other

    cs.LG

    Coupled Multiwavelet Neural Operator Learning for Coupled Partial Differential Equations

    Authors: Xiongye Xiao, Defu Cao, Ruochen Yang, Gaurav Gupta, Gengshuo Liu, Chenzhong Yin, Radu Balan, Paul Bogdan

    Abstract: Coupled partial differential equations (PDEs) are key tasks in modeling the complex dynamics of many physical processes. Recently, neural operators have shown the ability to solve PDEs by learning the integral kernel directly in Fourier/Wavelet space, so the difficulty for solving the coupled PDEs depends on dealing with the coupled map**s between the functions. Towards this end, we propose a \t… ▽ More

    Submitted 8 December, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: Accepted to ICLR 2023

  38. arXiv:2302.09446  [pdf, other

    cs.LG stat.ME

    Estimating Treatment Effects in Continuous Time with Hidden Confounders

    Authors: Defu Cao, James Enouen, Yan Liu

    Abstract: Estimating treatment effects plays a crucial role in causal inference, having many real-world applications like policy analysis and decision making. Nevertheless, estimating treatment effects in the longitudinal setting in the presence of hidden confounders remains an extremely challenging problem. Recently, there is a growing body of work attempting to obtain unbiased ITE estimates from time-dyna… ▽ More

    Submitted 20 February, 2023; v1 submitted 18 February, 2023; originally announced February 2023.

    Comments: 7 pages. First presentation was at ICML 2022 workshop Continuous time methods for machine learning

  39. arXiv:2302.07622  [pdf

    cs.RO math.OC

    Embodied Footprints: A Safety-guaranteed Collision Avoidance Model for Numerical Optimization-based Trajectory Planning

    Authors: Bai Li, Youmin Zhang, Tantan Zhang, Tankut Acarman, Yakun Ouyang, Li Li, Hairong Dong, Dongpu Cao

    Abstract: Optimization-based methods are commonly applied in autonomous driving trajectory planners, which transform the continuous-time trajectory planning problem into a finite nonlinear program with constraints imposed at finite collocation points. However, potential violations between adjacent collocation points can occur. To address this issue thoroughly, we propose a safety-guaranteed collision-avoida… ▽ More

    Submitted 14 September, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: 15 pages, 16 figures

  40. babble: Learning Better Abstractions with E-Graphs and Anti-Unification

    Authors: David Cao, Rose Kunkel, Chandrakana Nandi, Max Willsey, Zachary Tatlock, Nadia Polikarpova

    Abstract: Library learning compresses a given corpus of programs by extracting common structure from the corpus into reusable library functions. Prior work on library learning suffers from two limitations that prevent it from scaling to larger, more complex inputs. First, it explores too many candidate library functions that are not useful for compression. Second, it is not robust to syntactic variation in… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: POPL 2023

  41. arXiv:2211.11513  [pdf, other

    q-fin.ST cs.AI cs.LG

    DSLOB: A Synthetic Limit Order Book Dataset for Benchmarking Forecasting Algorithms under Distributional Shift

    Authors: Defu Cao, Yousef El-Laham, Loc Trinh, Svitlana Vyetrenko, Yan Liu

    Abstract: In electronic trading markets, limit order books (LOBs) provide information about pending buy/sell orders at various price levels for a given security. Recently, there has been a growing interest in using LOB data for resolving downstream machine learning tasks (e.g., forecasting). However, dealing with out-of-distribution (OOD) LOB data is challenging since distributional shifts are unlabeled in… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: 11 pages, 5 figures, already accepted by NeurIPS 2022 Distribution Shifts Workshop

  42. arXiv:2210.07518  [pdf, other

    cs.LG cs.AI cs.SI

    Counterfactual Neural Temporal Point Process for Estimating Causal Influence of Misinformation on Social Media

    Authors: Yizhou Zhang, Defu Cao, Yan Liu

    Abstract: Recent years have witnessed the rise of misinformation campaigns that spread specific narratives on social media to manipulate public opinions on different areas, such as politics and healthcare. Consequently, an effective and efficient automatic methodology to estimate the influence of the misinformation on user beliefs and activities is needed. However, existing works on misinformation impact es… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: 19 pages, 8 figures, already accepted by NeurIPS 2022

  43. arXiv:2210.06006  [pdf, other

    cs.CV

    BEV-LaneDet: a Simple and Effective 3D Lane Detection Baseline

    Authors: Ruihao Wang, Jian Qin, Kaiying Li, Yaochen Li, Dong Cao, **tao Xu

    Abstract: 3D lane detection which plays a crucial role in vehicle routing, has recently been a rapidly develo** topic in autonomous driving. Previous works struggle with practicality due to their complicated spatial transformations and inflexible representations of 3D lanes. Faced with the issues, our work proposes an efficient and robust monocular 3D lane detection called BEV-LaneDet with three main cont… ▽ More

    Submitted 11 March, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted by CVPR2023

  44. arXiv:2210.05084  [pdf, other

    cs.IT

    Covert Communication Gains from Adversary's Uncertainty of Phase Angles

    Authors: Sen Qiao, Daming Cao, Qiaosheng Zhang, Yinfei Xu, Guangjie Liu

    Abstract: This work investigates the phase gain of intelligent reflecting surface (IRS) covert communication over complex-valued additive white Gaussian noise (AWGN) channels. The transmitter Alice intends to transmit covert messages to the legitimate receiver Bob via reflecting the broadcast signals from a radio frequency (RF) source, while rendering the adversary Willie's detector arbitrarily close to ine… ▽ More

    Submitted 6 May, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

  45. arXiv:2207.09610  [pdf, other

    cs.CV cs.AI cs.CG

    Unsupervised Deep Multi-Shape Matching

    Authors: Dongliang Cao, Florian Bernard

    Abstract: 3D shape matching is a long-standing problem in computer vision and computer graphics. While deep neural networks were shown to lead to state-of-the-art results in shape matching, existing learning-based approaches are limited in the context of multi-shape matching: (i) either they focus on matching pairs of shapes only and thus suffer from cycle-inconsistent multi-matchings, or (ii) they require… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: to be published in ECCV2022

  46. arXiv:2207.07896  [pdf, other

    cs.CV

    Cross Vision-RF Gait Re-identification with Low-cost RGB-D Cameras and mmWave Radars

    Authors: Dongjiang Cao, Ruofeng Liu, Hao Li, Shuai Wang, Wenchao Jiang, Chris Xiaoxuan Lu

    Abstract: Human identification is a key requirement for many applications in everyday life, such as personalized services, automatic surveillance, continuous authentication, and contact tracing during pandemics, etc. This work studies the problem of cross-modal human re-identification (ReID), in response to the regular human movements across camera-allowed regions (e.g., streets) and camera-restricted regio… ▽ More

    Submitted 16 July, 2022; originally announced July 2022.

    Comments: 24 pages, 20 figures, accepted to IMWUT

  47. arXiv:2206.07257  [pdf, other

    astro-ph.SR astro-ph.IM cs.LG

    Investigation of stellar magnetic activity using variational autoencoder based on low-resolution spectroscopic survey

    Authors: Yue Xiang, Shenghong Gu, Dongtao Cao

    Abstract: We apply the variational autoencoder (VAE) to the LAMOST-K2 low-resolution spectra to detect the magnetic activity of the stars in the K2 field. After the training on the spectra of the selected inactive stars, the VAE model can efficiently generate the synthetic reference templates needed by the spectral subtraction procedure, without knowing any stellar parameters. Then we detect the peculiar sp… ▽ More

    Submitted 6 July, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: 13 pages, 19 figures, accepted for publication in MNRAS. Table 1 is available on Zenodo at https://doi.org/10.5281/zenodo.6802956 and the code can be found on GitHub at https://github.com/xylib/vae-for-spectroscopic-survey

  48. arXiv:2203.16797  [pdf, ps, other

    cs.LG stat.ML

    When Physics Meets Machine Learning: A Survey of Physics-Informed Machine Learning

    Authors: Chuizheng Meng, Sungyong Seo, Defu Cao, Sam Griesemer, Yan Liu

    Abstract: Physics-informed machine learning (PIML), referring to the combination of prior knowledge of physics, which is the high level abstraction of natural phenomenons and human behaviours in the long history, with data-driven machine learning models, has emerged as an effective way to mitigate the shortage of training data, to increase models' generalizability and to ensure the physical plausibility of… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

  49. Type-Directed Program Synthesis for RESTful APIs

    Authors: Zheng Guo, David Cao, Davin Tjong, Jean Yang, Cole Schlesinger, Nadia Polikarpova

    Abstract: With the rise of software-as-a-service and microservice architectures, RESTful APIs are now ubiquitous in mobile and web applications. A service can have tens or hundreds of API methods, making it a challenge for programmers to find the right combination of methods to solve their task. We present APIphany, a component-based synthesizer for programs that compose calls to RESTful APIs. The main in… ▽ More

    Submitted 5 April, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

  50. BARGAIN-MATCH: A Game Theoretical Approach for Resource Allocation and Task Offloading in Vehicular Edge Computing Networks

    Authors: Zemin Sun, Geng Sun, Yanheng Liu, Jian Wang, Dongpu Cao

    Abstract: Vehicular edge computing (VEC) is emerging as a promising architecture of vehicular networks (VNs) by deploying the cloud computing resources at the edge of the VNs. This work aims to optimize resource allocation and task offloading in VEC networks. Specifically, we formulate a game theoretical resource allocation and task offloading problem (GTRATOP) that aims to maximize the system performance b… ▽ More

    Submitted 23 December, 2023; v1 submitted 26 March, 2022; originally announced March 2022.