Skip to main content

Showing 1–50 of 57 results for author: Du, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01245  [pdf, other

    cs.AI cs.CY

    SINKT: A Structure-Aware Inductive Knowledge Tracing Model with Large Language Model

    Authors: Lingyue Fu, Hao Guan, Kounianhua Du, Jianghao Lin, Wei Xia, Weinan Zhang, Ruiming Tang, Yasheng Wang, Yong Yu

    Abstract: Knowledge Tracing (KT) aims to determine whether students will respond correctly to the next question, which is a crucial task in intelligent tutoring systems (ITS). In educational KT scenarios, transductive ID-based methods often face severe data sparsity and cold start problems, where interactions between individual students and questions are sparse, and new questions and concepts consistently a… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.18825  [pdf, other

    cs.IR

    ELCoRec: Enhance Language Understanding with Co-Propagation of Numerical and Categorical Features for Recommendation

    Authors: Jizheng Chen, Kounianhua Du, Jianghao Lin, Bo Chen, Ruiming Tang, Weinan Zhang

    Abstract: Large language models have been flourishing in the natural language processing (NLP) domain, and their potential for recommendation has been paid much attention to. Despite the intelligence shown by the recommendation-oriented finetuned models, LLMs struggle to fully understand the user behavior patterns due to their innate weakness in interpreting numerical features and the overhead for long cont… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  3. arXiv:2406.00012  [pdf, other

    cs.IR cs.AI

    Extracting Essential and Disentangled Knowledge for Recommendation Enhancement

    Authors: Kounianhua Du, Jizheng Chen, Jianghao Lin, Menghui Zhu, Bo Chen, Shuai Li, Ruiming Tang

    Abstract: Recommender models play a vital role in various industrial scenarios, while often faced with the catastrophic forgetting problem caused by the fast shifting data distribution, e.g., the evolving user interests, click signals fluctuation during sales promotions, etc. To alleviate this problem, a common approach is to reuse knowledge from the historical data. However, preserving the vast and fast-ac… ▽ More

    Submitted 20 May, 2024; originally announced June 2024.

  4. arXiv:2406.00011  [pdf, other

    cs.IR cs.AI

    DisCo: Towards Harmonious Disentanglement and Collaboration between Tabular and Semantic Space for Recommendation

    Authors: Kounianhua Du, Jizheng Chen, Jianghao Lin, Yunjia Xi, Hangyu Wang, Xinyi Dai, Bo Chen, Ruiming Tang, Weinan Zhang

    Abstract: Recommender systems play important roles in various applications such as e-commerce, social media, etc. Conventional recommendation methods usually model the collaborative signals within the tabular representation space. Despite the personalization modeling and the efficiency, the latent semantic dependencies are omitted. Methods that introduce semantics into recommendation then emerge, injecting… ▽ More

    Submitted 4 June, 2024; v1 submitted 20 May, 2024; originally announced June 2024.

  5. arXiv:2405.16444  [pdf, other

    cs.LG

    CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion

    Authors: Jiayi Yao, Hanchen Li, Yuhan Liu, Siddhant Ray, Yihua Cheng, Qizheng Zhang, Kuntai Du, Shan Lu, Junchen Jiang

    Abstract: Large language models (LLMs) often incorporate multiple text chunks in their inputs to provide the necessary contexts. To speed up the prefill of the long LLM inputs, one can pre-compute the KV cache of a text and re-use the KV cache when the context is reused as the prefix of another LLM input. However, the reused text chunks are not always the input prefix, and when they are not, their precomput… ▽ More

    Submitted 3 June, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

  6. arXiv:2405.12442  [pdf, other

    cs.IR cs.AI

    Learning Structure and Knowledge Aware Representation with Large Language Models for Concept Recommendation

    Authors: Qingyao Li, Wei Xia, Kounianhua Du, Qiji Zhang, Weinan Zhang, Ruiming Tang, Yong Yu

    Abstract: Concept recommendation aims to suggest the next concept for learners to study based on their knowledge states and the human knowledge system. While knowledge states can be predicted using knowledge tracing models, previous approaches have not effectively integrated the human knowledge system into the process of designing these educational models. In the era of rapidly evolving Large Language Model… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 11 pages, 8 figures

  7. arXiv:2405.06902   

    cs.LG stat.ML

    Causal Inference from Slowly Varying Nonstationary Processes

    Authors: Kang Du, Yu Xiang

    Abstract: Causal inference from observational data following the restricted structural causal models (SCM) framework hinges largely on the asymmetry between cause and effect from the data generating mechanisms, such as non-Gaussianity or non-linearity. This methodology can be adapted to stationary time series, yet inferring causal relationships from nonstationary time series remains a challenging task. In t… ▽ More

    Submitted 29 May, 2024; v1 submitted 11 May, 2024; originally announced May 2024.

    Comments: This work was intended as a replacement of arXiv:2012.13025 and any subsequent updates will appear there

  8. arXiv:2405.02355  [pdf, other

    cs.SE cs.AI

    CodeGRAG: Extracting Composed Syntax Graphs for Retrieval Augmented Cross-Lingual Code Generation

    Authors: Kounianhua Du, Renting Rui, Huacan Chai, Lingyue Fu, Wei Xia, Yasheng Wang, Ruiming Tang, Yong Yu, Weinan Zhang

    Abstract: Utilizing large language models to generate codes has shown promising meaning in software development revolution. Despite the intelligence shown by the general large language models, their specificity in code generation can still be improved due to the syntactic gap and mismatched vocabulary existing among natural language and different programming languages. In addition, programming languages are… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  9. arXiv:2404.15245  [pdf, other

    stat.ME cs.LG

    Mining Invariance from Nonlinear Multi-Environment Data: Binary Classification

    Authors: Austin Goddard, Kang Du, Yu Xiang

    Abstract: Making predictions in an unseen environment given data from multiple training environments is a challenging task. We approach this problem from an invariance perspective, focusing on binary classification to shed light on general nonlinear data generation mechanisms. We identify a unique form of invariance that exists solely in a binary setting that allows us to train models invariant over environ… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: Accepted to the 2024 International Symposium on Information Theory (ISIT)

  10. arXiv:2404.04633  [pdf, other

    cs.CL

    Context versus Prior Knowledge in Language Models

    Authors: Kevin Du, Vésteinn Snæbjarnarson, Niklas Stoehr, Jennifer C. White, Aaron Schein, Ryan Cotterell

    Abstract: To answer a question, language models often need to integrate prior knowledge learned during pretraining and new information presented in context. We hypothesize that models perform this integration in a predictable way across different questions and contexts: models will rely more on prior knowledge for questions about entities (e.g., persons, places, etc.) that they are more familiar with due to… ▽ More

    Submitted 16 June, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

    Comments: Long paper accepted at ACL 2024

  11. arXiv:2404.00633  [pdf, other

    cs.CV

    IPT-V2: Efficient Image Processing Transformer using Hierarchical Attentions

    Authors: Zhijun Tu, Kunpeng Du, Hanting Chen, Hailing Wang, Wei Li, Jie Hu, Yunhe Wang

    Abstract: Recent advances have demonstrated the powerful capability of transformer architecture in image restoration. However, our analysis indicates that existing transformerbased methods can not establish both exact global and local dependencies simultaneously, which are much critical to restore the details and missing content of degraded images. To this end, we present an efficient image processing trans… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  12. arXiv:2403.16520  [pdf, other

    cs.CV

    CMViM: Contrastive Masked Vim Autoencoder for 3D Multi-modal Representation Learning for AD classification

    Authors: Guangqian Yang, Kangrui Du, Zhihan Yang, Ye Du, Yong** Zheng, Shujun Wang

    Abstract: Alzheimer's disease (AD) is an incurable neurodegenerative condition leading to cognitive and functional deterioration. Given the lack of a cure, prompt and precise AD diagnosis is vital, a complex process dependent on multiple factors and multi-modal data. While successful efforts have been made to integrate multi-modal representation learning into medical datasets, scant attention has been given… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 11 pages, 1 figure

  13. arXiv:2403.12559  [pdf, other

    cs.CV cs.LG

    Confidence Self-Calibration for Multi-Label Class-Incremental Learning

    Authors: Kaile Du, Yifan Zhou, Fan Lyu, Yuyang Li, Chen Lu, Guangcan Liu

    Abstract: The partial label challenge in Multi-Label Class-Incremental Learning (MLCIL) arises when only the new classes are labeled during training, while past and future labels remain unavailable. This issue leads to a proliferation of false-positive errors due to erroneously high confidence multi-label predictions, exacerbating catastrophic forgetting within the disjoint label space. In this paper, we ai… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  14. arXiv:2403.11434  [pdf, other

    cs.NI cs.DC

    Earth+: on-board satellite imagery compression leveraging historical earth observations

    Authors: Kuntai Du, Yihua Cheng, Peder Olsen, Shadi Noghabi, Ranveer Chandra, Junchen Jiang

    Abstract: With the increasing deployment of earth observation satellite constellations, the downlink (satellite-to-ground) capacity often limits the freshness, quality, and coverage of the imagery data available to applications on the ground. To overcome the downlink limitation, we present Earth+, a new satellite imagery compression system that, instead of compressing each image individually, pinpoints and… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  15. arXiv:2402.08182  [pdf, other

    cs.LG stat.ML

    Variational Continual Test-Time Adaptation

    Authors: Fan Lyu, Kaile Du, Yuyang Li, Hanyu Zhao, Zhang Zhang, Guangcan Liu, Liang Wang

    Abstract: The prior drift is crucial in Continual Test-Time Adaptation (CTTA) methods that only use unlabeled test data, as it can cause significant error propagation. In this paper, we introduce VCoTTA, a variational Bayesian approach to measure uncertainties in CTTA. At the source stage, we transform a pre-trained deterministic model into a Bayesian Neural Network (BNN) via a variational warm-up strategy,… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  16. arXiv:2402.06884  [pdf, other

    stat.ML cs.LG

    Low-Rank Approximation of Structural Redundancy for Self-Supervised Learning

    Authors: Kang Du, Yu Xiang

    Abstract: We study the data-generating mechanism for reconstructive SSL to shed light on its effectiveness. With an infinite amount of labeled samples, we provide a sufficient and necessary condition for perfect linear approximation. The condition reveals a full-rank component that preserves the label classes of Y, along with a redundant component. Motivated by the condition, we propose to approximate the r… ▽ More

    Submitted 27 May, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: Accepted to the 3rd Conference on Causal Learning and Reasoning (CLeaR)

  17. arXiv:2402.02547  [pdf

    cs.AI cs.CL

    Integration of cognitive tasks into artificial general intelligence test for large models

    Authors: Youzhi Qu, Chen Wei, Penghui Du, Wenxin Che, Chi Zhang, Wanli Ouyang, Yatao Bian, Feiyang Xu, Bin Hu, Kai Du, Haiyan Wu, Jia Liu, Quanying Liu

    Abstract: During the evolution of large models, performance evaluation is necessarily performed to assess their capabilities and ensure safety before practical application. However, current model evaluations mainly rely on specific tasks and datasets, lacking a united framework for assessing the multidimensional intelligence of large models. In this perspective, we advocate for a comprehensive framework of… ▽ More

    Submitted 5 March, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

  18. Eloquent: A More Robust Transmission Scheme for LLM Token Streaming

    Authors: Hanchen Li, Yuhan Liu, Yihua Cheng, Siddhant Ray, Kuntai Du, Junchen Jiang

    Abstract: To render each generated token in real-time for users, the Large Language Model (LLM) server generates tokens one by one and streams each token (or group of a few tokens) through the network to the user right after generation, which we refer to as LLM token streaming. However, under unstable network conditions, the LLM token streaming experience could suffer greatly from stalls since one packet lo… ▽ More

    Submitted 16 June, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: In SIGCOMM Workshop on Networks for AI Computing (NAIC '24)

  19. arXiv:2401.08221  [pdf, other

    cs.LG cs.AI

    Towards Causal Relationship in Indefinite Data: Baseline Model and New Datasets

    Authors: Hang Chen, Xinyu Yang, Keqing Du

    Abstract: Integrating deep learning and causal discovery has encouraged us to spot that learning causal structures and representations in dialogue and video is full of challenges. We defined These data forms as "Indefinite Data", characterized by multi-structure data and multi-value representations. Unlike existing adaptable data forms, Indefinite Data still faces gaps in datasets and methods. To address th… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: If you are interested in the two new datasets, pls contact us by email

  20. arXiv:2311.18567  [pdf, other

    cs.CL

    Grammatical Gender's Influence on Distributional Semantics: A Causal Perspective

    Authors: Karolina Stańczak, Kevin Du, Adina Williams, Isabelle Augenstein, Ryan Cotterell

    Abstract: How much meaning influences gender assignment across languages is an active area of research in modern linguistics and cognitive science. We can view current approaches as aiming to determine where gender assignment falls on a spectrum, from being fully arbitrarily determined to being largely semantically determined. For the latter case, there is a formulation of the neo-Whorfian hypothesis, which… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  21. arXiv:2311.12401   

    cs.CV cs.MM

    CASR: Refining Action Segmentation via Marginalizing Frame-levle Causal Relationships

    Authors: Keqing Du, Xinyu Yang, Hang Chen

    Abstract: Integrating deep learning and causal discovery has increased the interpretability of Temporal Action Segmentation (TAS) tasks. However, frame-level causal relationships exist many complicated noises outside the segment-level, making it infeasible to directly express macro action semantics. Thus, we propose Causal Abstraction Segmentation Refiner (CASR), which can refine TAS results from various mo… ▽ More

    Submitted 26 January, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: We found that the paper needs to be modified in the model and all experiments must be re-run, so we request to withdraw the current version

  22. arXiv:2311.00923  [pdf, other

    cs.LG stat.ME

    A Review and Roadmap of Deep Causal Model from Different Causal Structures and Representations

    Authors: Hang Chen, Keqing Du, Chenguang Li, Xinyu Yang

    Abstract: The fusion of causal models with deep learning introducing increasingly intricate data sets, such as the causal associations within images or between textual components, has surfaced as a focal research area. Nonetheless, the broadening of original causal concepts and theories to such complex, non-statistical data has been met with serious challenges. In response, our study proposes redefinitions… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: under review

  23. arXiv:2310.18634  [pdf, other

    cs.LG

    SSL Framework for Causal Inconsistency between Structures and Representations

    Authors: Hang Chen, Xinyu Yang, Keqing Du

    Abstract: The cross-pollination of deep learning and causal discovery has catalyzed a burgeoning field of research seeking to elucidate causal relationships within non-statistical data forms like images, videos, and text. Such data, often being named `indefinite data', exhibit unique challenges-inconsistency between causal structure and representation, which are not common in conventional data forms. To tac… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

  24. arXiv:2310.07240  [pdf, other

    cs.NI cs.LG

    CacheGen: KV Cache Compression and Streaming for Fast Language Model Serving

    Authors: Yuhan Liu, Hanchen Li, Yihua Cheng, Siddhant Ray, Yuyang Huang, Qizheng Zhang, Kuntai Du, Jiayi Yao, Shan Lu, Ganesh Ananthanarayanan, Michael Maire, Henry Hoffmann, Ari Holtzman, Junchen Jiang

    Abstract: As large language models (LLMs) take on complex tasks, their inputs are supplemented with longer contexts that incorporate domain knowledge or user-specific information. Yet using long contexts poses a challenge for responsive LLM systems, as nothing can be generated until the whole context is processed by the LLM. . CacheGen is a fast context-loading module for LLM systems. First, CacheGen uses… ▽ More

    Submitted 30 April, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  25. arXiv:2310.04685  [pdf, other

    cs.SE cs.AI cs.NI

    Automatic and Efficient Customization of Neural Networks for ML Applications

    Authors: Yuhan Liu, Chengcheng Wan, Kuntai Du, Henry Hoffmann, Junchen Jiang, Shan Lu, Michael Maire

    Abstract: ML APIs have greatly relieved application developers of the burden to design and train their own neural network models -- classifying objects in an image can now be as simple as one line of Python code to call an API. However, these APIs offer the same pre-trained models regardless of how their output is used by different applications. This can be suboptimal as not all ML inference errors can caus… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

  26. arXiv:2310.02422  [pdf, other

    cs.LG cs.AI cs.DC cs.MM cs.NI

    OneAdapt: Fast Adaptation for Deep Learning Applications via Backpropagation

    Authors: Kuntai Du, Yuhan Liu, Yitian Hao, Qizheng Zhang, Haodong Wang, Yuyang Huang, Ganesh Ananthanarayanan, Junchen Jiang

    Abstract: Deep learning inference on streaming media data, such as object detection in video or LiDAR feeds and text extraction from audio waves, is now ubiquitous. To achieve high inference accuracy, these applications typically require significant network bandwidth to gather high-fidelity data and extensive GPU resources to run deep neural networks (DNNs). While the high demand for network bandwidth and G… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: SoCC' 23

  27. arXiv:2309.06118  [pdf, other

    cs.CV

    CHITNet: A Complementary to Harmonious Information Transfer Network for Infrared and Visible Image Fusion

    Authors: Yafei Zhang, Keying Du, Huafeng Li, Zhengtao Yu, Yu Liu

    Abstract: Current infrared and visible image fusion (IVIF) methods go to great lengths to excavate complementary features and design complex fusion strategies, which is extremely challenging. To this end, we rethink the IVIF outside the box, proposing a complementary to harmonious information transfer network (CHITNet). It reasonably transfers complementary information into harmonious one, which integrates… ▽ More

    Submitted 25 October, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

  28. arXiv:2309.05793  [pdf, other

    cs.CV cs.AI

    PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models

    Authors: Li Chen, Mengyi Zhao, Yiheng Liu, Mingxu Ding, Yangyang Song, Shizun Wang, Xu Wang, Hao Yang, **g Liu, Kang Du, Min Zheng

    Abstract: Personalized text-to-image generation has emerged as a powerful and sought-after tool, empowering users to create customized images based on their specific concepts and prompts. However, existing approaches to personalization encounter multiple challenges, including long tuning times, large storage requirements, the necessity for multiple input images per identity, and limitations in preserving id… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  29. arXiv:2309.01940  [pdf, other

    cs.CL cs.AI

    CodeApex: A Bilingual Programming Evaluation Benchmark for Large Language Models

    Authors: Lingyue Fu, Huacan Chai, Shuang Luo, Kounianhua Du, Weiming Zhang, Longteng Fan, Jiayi Lei, Renting Rui, Jianghao Lin, Yuchen Fang, Yifan Liu, **gkuan Wang, Siyuan Qi, Kangning Zhang, Weinan Zhang, Yong Yu

    Abstract: With the emergence of Large Language Models (LLMs), there has been a significant improvement in the programming capabilities of models, attracting growing attention from researchers. Evaluating the programming capabilities of LLMs is crucial as it reflects the multifaceted abilities of LLMs, and it has numerous downstream applications. In this paper, we propose CodeApex, a bilingual benchmark data… ▽ More

    Submitted 11 March, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: 33pages

  30. arXiv:2308.11131  [pdf, other

    cs.IR cs.AI

    ReLLa: Retrieval-enhanced Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation

    Authors: Jianghao Lin, Rong Shan, Chenxu Zhu, Kounianhua Du, Bo Chen, Shigang Quan, Ruiming Tang, Yong Yu, Weinan Zhang

    Abstract: With large language models (LLMs) achieving remarkable breakthroughs in natural language processing (NLP) domains, LLM-enhanced recommender systems have received much attention and have been actively explored currently. In this paper, we focus on adapting and empowering a pure large language model for zero-shot and few-shot recommendation tasks. First and foremost, we identify and formulate the li… ▽ More

    Submitted 26 June, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted by WWW 2024. Full and More Readable Version

  31. arXiv:2308.03610  [pdf, other

    cs.CV

    AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose

    Authors: Huichao Zhang, Bowen Chen, Hao Yang, Liao Qu, Xu Wang, Li Chen, Chao Long, Feida Zhu, Kang Du, Min Zheng

    Abstract: Creating expressive, diverse and high-quality 3D avatars from highly customized text descriptions and pose guidance is a challenging task, due to the intricacy of modeling and texturing in 3D that ensure details and various styles (realistic, fictional, etc). We present AvatarVerse, a stable pipeline for generating expressive high-quality 3D avatars from nothing but text descriptions and pose guid… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  32. arXiv:2308.00391  [pdf, other

    cs.LG cs.AI

    Counterfactual Graph Transformer for Traffic Flow Prediction

    Authors: Ying Yang, Kai Du, Xingyuan Dai, Jianwu Fang

    Abstract: Traffic flow prediction (TFP) is a fundamental problem of the Intelligent Transportation System (ITS), as it models the latent spatial-temporal dependency of traffic flow for potential congestion prediction. Recent graph-based models with multiple kinds of attention mechanisms have achieved promising performance. However, existing methods for traffic flow prediction tend to inherit the bias patter… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: accepted by ITSC 2023

  33. arXiv:2307.03056  [pdf, other

    cs.LG cs.AI cs.CL

    Generalizing Backpropagation for Gradient-Based Interpretability

    Authors: Kevin Du, Lucas Torroba Hennigen, Niklas Stoehr, Alexander Warstadt, Ryan Cotterell

    Abstract: Many popular feature-attribution methods for interpreting deep neural networks rely on computing the gradients of a model's output with respect to its inputs. While these methods can indicate which input features may be important for the model's prediction, they reveal little about the inner workings of the model itself. In this paper, we observe that the gradient computation of a model is a speci… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: Long paper accepted at ACL 2023

  34. arXiv:2304.05959  [pdf, other

    cs.RO cs.AI

    UAV Obstacle Avoidance by Human-in-the-Loop Reinforcement in Arbitrary 3D Environment

    Authors: Xuyang Li, Jianwu Fang, Kai Du, Kuizhi Mei, Jianru Xue

    Abstract: This paper focuses on the continuous control of the unmanned aerial vehicle (UAV) based on a deep reinforcement learning method for a large-scale 3D complex environment. The purpose is to make the UAV reach any target point from a certain starting point, and the flying height and speed are variable during navigation. In this work, we propose a deep reinforcement learning (DRL)-based method combine… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: accepted in CCC2023

  35. arXiv:2303.14653  [pdf, other

    cs.CV

    SDTracker: Synthetic Data Based Multi-Object Tracking

    Authors: Yingda Guan, Zhengyang Feng, Huiying Chang, Kuo Du, Tingting Li, Min Wang

    Abstract: We present SDTracker, a method that harnesses the potential of synthetic data for multi-object tracking of real-world scenes in a domain generalization and semi-supervised fashion. First, we use the ImageNet dataset as an auxiliary to randomize the style of synthetic data. With out-of-domain data, we further enforce pyramid consistency loss across different "stylized" images from the same sample t… ▽ More

    Submitted 26 March, 2023; originally announced March 2023.

    Comments: cvpr2022 workshop

  36. arXiv:2301.05975  [pdf, ps, other

    stat.ME cs.LG

    Generalized Invariant Matching Property via LASSO

    Authors: Kang Du, Yu Xiang

    Abstract: Learning under distribution shifts is a challenging task. One principled approach is to exploit the invariance principle via the structural causal models. However, the invariance principle is violated when the response is intervened, making it a difficult setting. In a recent work, the invariant matching property has been developed to shed light on this scenario and shows promising performance. In… ▽ More

    Submitted 11 March, 2023; v1 submitted 14 January, 2023; originally announced January 2023.

    Comments: Accepted to the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023)

  37. arXiv:2211.14763  [pdf, other

    cs.CV cs.AI

    Multi-Label Continual Learning using Augmented Graph Convolutional Network

    Authors: Kaile Du, Fan Lyu, Linyan Li, Fuyuan Hu, Wei Feng, Fenglei Xu, Xuefeng Xi, Han**g Cheng

    Abstract: Multi-Label Continual Learning (MLCL) builds a class-incremental framework in a sequential multi-label image recognition data stream. The critical challenges of MLCL are the construction of label relationships on past-missing and future-missing partial labels of training data and the catastrophic forgetting on old classes, resulting in poor generalization. To solve the problems, the study proposes… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

  38. arXiv:2211.09622  [pdf, other

    cs.AI

    AlphaSnake: Policy Iteration on a Nondeterministic NP-hard Markov Decision Process

    Authors: Kevin Du, Ian Gemp, Yi Wu, Yingying Wu

    Abstract: Reinforcement learning has recently been used to approach well-known NP-hard combinatorial problems in graph theory. Among these problems, Hamiltonian cycle problems are exceptionally difficult to analyze, even when restricted to individual instances of structurally complex graphs. In this paper, we use Monte Carlo Tree Search (MCTS), the search algorithm behind many state-of-the-art reinforcement… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

  39. arXiv:2209.06367  [pdf, other

    cs.LG cs.AI stat.ME

    A Review and Roadmap of Deep Learning Causal Discovery in Different Variable Paradigms

    Authors: Hang Chen, Keqing Du, Xinyu Yang, Chenguang Li

    Abstract: Understanding causality helps to structure interventions to achieve specific goals and enables predictions under interventions. With the growing importance of learning causal relationships, causal discovery tasks have transitioned from using traditional methods to infer potential causal structures from observational data to the field of pattern recognition involved in deep learning. The rapid accu… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 26 pages,10 figures. arXiv admin note: text overlap with arXiv:2012.07138, arXiv:1605.08179, arXiv:2203.14237 by other authors

  40. arXiv:2208.10027  [pdf, other

    stat.ME cs.LG

    Learning Invariant Representations under General Interventions on the Response

    Authors: Kang Du, Yu Xiang

    Abstract: It has become increasingly common nowadays to collect observations of feature and response pairs from different environments. As a consequence, one has to apply learned predictors to data with a different distribution due to distribution shifts. One principled approach is to adopt the structural causal models to describe training and test models, following the invariance principle which says that… ▽ More

    Submitted 30 October, 2023; v1 submitted 21 August, 2022; originally announced August 2022.

    Comments: Accepted to the IEEE Journal on Selected Areas in Information Theory. Special Issue: Causality: Fundamental Limits and Applications

  41. arXiv:2207.07840  [pdf, other

    cs.LG cs.AI

    Class-Incremental Lifelong Learning in Multi-Label Classification

    Authors: Kaile Du, Linyan Li, Fan Lyu, Fuyuan Hu, Fenglei Xu

    Abstract: Existing class-incremental lifelong learning studies only the data is with single-label, which limits its adaptation to multi-label data. This paper studies Lifelong Multi-Label (LML) classification, which builds an online class-incremental classifier in a sequential multi-label classification data stream. Training on the data with Partial Labels in LML classification may result in more serious Ca… ▽ More

    Submitted 16 July, 2022; originally announced July 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2203.05534

  42. arXiv:2206.06587  [pdf, other

    cs.LG cs.AI

    Learning Enhanced Representations for Tabular Data via Neighborhood Propagation

    Authors: Kounianhua Du, Weinan Zhang, Ruiwen Zhou, Yangkun Wang, Xilong Zhao, Jiarui **, Quan Gan, Zheng Zhang, David Wipf

    Abstract: Prediction over tabular data is an essential and fundamental problem in many important downstream tasks. However, existing methods either take a data instance of the table independently as input or do not fully utilize the multi-rows features and labels to directly change and enhance the target data representations. In this paper, we propose to 1) construct a hypergraph from relevant data instance… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

  43. arXiv:2205.09162  [pdf, ps, other

    stat.ME cs.LG

    An Invariant Matching Property for Distribution Generalization under Intervened Response

    Authors: Kang Du, Yu Xiang

    Abstract: The task of distribution generalization concerns making reliable prediction of a response in unseen environments. The structural causal models are shown to be useful to model distribution changes through intervention. Motivated by the fundamental invariance principle, it is often assumed that the conditional distribution of the response given its predictors remains the same across environments. Ho… ▽ More

    Submitted 10 June, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: Accepted to the European Signal Processing Conference (EUSIPCO) 2022

  44. arXiv:2204.12534  [pdf, other

    cs.NI cs.CV cs.MM

    AccMPEG: Optimizing Video Encoding for Video Analytics

    Authors: Kuntai Du, Qizheng Zhang, Anton Arapin, Haodong Wang, Zhengxu Xia, Junchen Jiang

    Abstract: With more videos being recorded by edge sensors (cameras) and analyzed by computer-vision deep neural nets (DNNs), a new breed of video streaming systems has emerged, with the goal to compress and stream videos to remote servers in real time while preserving enough information to allow highly accurate inference by the server-side DNNs. An ideal design of the video streaming system should simultane… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: Accepted by MLSys 2022

  45. arXiv:2203.15320  [pdf, other

    cs.CV

    Dressing in the Wild by Watching Dance Videos

    Authors: Xin Dong, Fuwei Zhao, Zhenyu Xie, Xi** Zhang, Daniel K. Du, Min Zheng, Xiang Long, Xiaodan Liang, Jianchao Yang

    Abstract: While significant progress has been made in garment transfer, one of the most applicable directions of human-centric image generation, existing works overlook the in-the-wild imagery, presenting severe garment-person misalignment as well as noticeable degradation in fine texture details. This paper, therefore, attends to virtual try-on in real-world scenes and brings essential improvements in auth… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted at CVPR2022, Project: https://awesome-wflow.github.io

  46. arXiv:2203.05534  [pdf, other

    cs.CV cs.AI

    AGCN: Augmented Graph Convolutional Network for Lifelong Multi-label Image Recognition

    Authors: Kaile Du, Fan Lyu, Fuyuan Hu, Linyan Li, Wei Feng, Fenglei Xu, Qiming Fu

    Abstract: The Lifelong Multi-Label (LML) image recognition builds an online class-incremental classifier in a sequential multi-label image recognition data stream. The key challenges of LML image recognition are the construction of label relationships on Partial Labels of training data and the Catastrophic Forgetting on old classes, resulting in poor generalization. To solve the problems, the study proposes… ▽ More

    Submitted 10 March, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

    Comments: Accpted in ICME 2022

  47. RumorLens: Interactive Analysis and Validation of Suspected Rumors on Social Media

    Authors: Ran Wang, Kehan Du, Qianhe Chen, Yifei Zhao, Mojie Tang, Hongxi Tao, Shipan Wang, Yiyao Li, Yong Wang

    Abstract: With the development of social media, various rumors can be easily spread on the Internet and such rumors can have serious negative effects on society. Thus, it has become a critical task for social media platforms to deal with suspected rumors. However, due to the lack of effective tools, it is often difficult for platform administrators to analyze and validate rumors from a large volume of infor… ▽ More

    Submitted 6 March, 2022; originally announced March 2022.

  48. arXiv:2112.07057  [pdf, other

    cs.NE cs.LG

    NEORL: NeuroEvolution Optimization with Reinforcement Learning

    Authors: Majdi I. Radaideh, Katelin Du, Paul Seurin, Devin Seyler, Xubo Gu, Haijia Wang, Koroush Shirvan

    Abstract: We present an open-source Python framework for NeuroEvolution Optimization with Reinforcement Learning (NEORL) developed at the Massachusetts Institute of Technology. NEORL offers a global optimization interface of state-of-the-art algorithms in the field of evolutionary computation, neural networks through reinforcement learning, and hybrid neuroevolution algorithms. NEORL features diverse set of… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: 23 pages, 6 figures, 7 tables

  49. arXiv:2012.13025  [pdf, ps, other

    stat.ME cs.IT cs.LG

    Causal Inference from Slowly Varying Nonstationary Processes

    Authors: Kang Du, Yu Xiang

    Abstract: Causal inference from observational data following the restricted structural causal model (SCM) framework hinges largely on the asymmetry between cause and effect from the data generating mechanisms, such as non-Gaussianity or nonlinearity. This methodology can be adapted to stationary time series, yet inferring causal relationships from nonstationary time series remains a challenging task. In thi… ▽ More

    Submitted 3 September, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

  50. arXiv:2011.12683  [pdf, other

    cs.IR

    GraphHINGE: Learning Interaction Models of Structured Neighborhood on Heterogeneous Information Network

    Authors: Jiarui **, Kounianhua Du, Weinan Zhang, Jiarui Qin, Yuchen Fang, Yong Yu, Zheng Zhang, Alexander J. Smola

    Abstract: Heterogeneous information network (HIN) has been widely used to characterize entities of various types and their complex relations. Recent attempts either rely on explicit path reachability to leverage path-based semantic relatedness or graph neighborhood to learn heterogeneous network representations before predictions. These weakly coupled manners overlook the rich interactions among neighbor no… ▽ More

    Submitted 30 June, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: TOIS (Special Issue on Graph Technologies for User Modeling and Recommendation). arXiv admin note: text overlap with arXiv:2007.00216