Skip to main content

Showing 1–50 of 63 results for author: Due, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.09087  [pdf, other

    cs.LG cs.CV

    On the Role of Discrete Tokenization in Visual Representation Learning

    Authors: Tianqi Du, Yifei Wang, Yisen Wang

    Abstract: In the realm of self-supervised learning (SSL), masked image modeling (MIM) has gained popularity alongside contrastive learning methods. MIM involves reconstructing masked regions of input images using their unmasked portions. A notable subset of MIM methodologies employs discrete tokens as the reconstruction target, but the theoretical underpinnings of this choice remain underexplored. In this p… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: ICLR 2024 Spotlight

  2. arXiv:2407.00935  [pdf, other

    cs.LG cs.CL

    Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining

    Authors: Qi Zhang, Tianqi Du, Haotian Huang, Yifei Wang, Yisen Wang

    Abstract: In recent years, the rise of generative self-supervised learning (SSL) paradigms has exhibited impressive performance across visual, language, and multi-modal domains. While the varied designs of generative SSL objectives lead to distinct properties in downstream tasks, a theoretical understanding of these differences remains largely unexplored. In this paper, we establish the first theoretical co… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  3. arXiv:2406.17972  [pdf, other

    cs.LG cs.CL econ.EM

    LABOR-LLM: Language-Based Occupational Representations with Large Language Models

    Authors: Tianyu Du, Ayush Kanodia, Herman Brunborg, Keyon Vafa, Susan Athey

    Abstract: Many empirical studies of labor market questions rely on estimating relatively simple predictive models using small, carefully constructed longitudinal survey datasets based on hand-engineered features. Large Language Models (LLMs), trained on massive datasets, encode vast quantities of world knowledge and can be used for the next job prediction problem. However, while an off-the-shelf LLM produce… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  4. arXiv:2406.11087  [pdf, other

    cs.CR cs.AI cs.CL cs.LG

    MemDPT: Differential Privacy for Memory Efficient Language Models

    Authors: Yanming Liu, Xinyue Peng, Jiannan Cao, Yuwei Zhang, Chen Ma, Songhang Deng, Mengchen Fu, Xuhong Zhang, Sheng Cheng, Xun Wang, Jianwei Yin, Tianyu Du

    Abstract: Large language models have consistently demonstrated remarkable performance across a wide spectrum of applications. Nonetheless, the deployment of these models can inadvertently expose user privacy to potential risks. The substantial memory demands of these models during training represent a significant resource consumption challenge. The sheer size of these models imposes a considerable burden on… ▽ More

    Submitted 20 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: 12 pages first version

  5. arXiv:2406.03807  [pdf, other

    cs.AI cs.CL cs.RO

    Tool-Planner: Dynamic Solution Tree Planning for Large Language Model with Tool Clustering

    Authors: Yanming Liu, Xinyue Peng, Yuwei Zhang, Jiannan Cao, Xuhong Zhang, Sheng Cheng, Xun Wang, Jianwei Yin, Tianyu Du

    Abstract: Large language models (LLMs) have demonstrated exceptional reasoning capabilities, enabling them to solve various complex problems. Recently, this ability has been applied to the paradigm of tool learning. Tool learning involves providing examples of tool usage and their corresponding functions, allowing LLMs to formulate plans and demonstrate the process of invoking and executing each tool. LLMs… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 46pages first version

  6. arXiv:2405.15432  [pdf, other

    eess.SP cs.NI

    Throughput Requirements for RAN Functional Splits in 3D-Networks

    Authors: MohammadAmin Vakilifard, Tim Düe, Mohammad Rihan, Maik Röper, Dirk Wübben, Carsten Bockelmann, Armin Dekorsy

    Abstract: The rapid growth of non-terrestrial communication necessitates its integration with existing terrestrial networks, as highlighted in 3GPP Releases 16 and 17. This paper analyses the concept of functional splits in 3D-Networks. To manage this complex structure effectively, the adoption of a Radio Access Network (RAN) architecture with Functional Split (FS) offers advantages in flexibility, scalabil… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: submitted to Globecom2024 SELECTED AREAS IN COMMUNICATIONS SATELLITE AND SPACE COMMUNICATIONS

  7. arXiv:2405.14903  [pdf, other

    physics.flu-dyn cs.AI cs.GR

    Neural Fluidic System Design and Control with Differentiable Simulation

    Authors: Yifei Li, Yuchen Sun, **chuan Ma, Eftychios Sifakis, Tao Du, Bo Zhu, Wojciech Matusik

    Abstract: We present a novel framework to explore neural control and design of complex fluidic systems with dynamic solid boundaries. Our system features a fast differentiable Navier-Stokes solver with solid-fluid interface handling, a low-dimensional differentiable parametric geometry representation, a control-shape co-design algorithm, and gym-like simulation environments to facilitate various fluidic con… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  8. arXiv:2405.05846  [pdf, other

    cs.CR cs.CV

    Could It Be Generated? Towards Practical Analysis of Memorization in Text-To-Image Diffusion Models

    Authors: Zhe Ma, Xuhong Zhang, Qingming Li, Tianyu Du, Wenzhi Chen, Zonghui Wang, Shouling Ji

    Abstract: The past few years have witnessed substantial advancement in text-guided image generation powered by diffusion models. However, it was shown that text-to-image diffusion models are vulnerable to training image memorization, raising concerns on copyright infringement and privacy invasion. In this work, we perform practical analysis of memorization in text-to-image diffusion models. Targeting a set… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  9. arXiv:2405.04753  [pdf, other

    cs.CR cs.AI

    AttacKG+:Boosting Attack Knowledge Graph Construction with Large Language Models

    Authors: Yongheng Zhang, Tingwen Du, Yunshan Ma, Xiang Wang, Yi Xie, Guozheng Yang, Yuliang Lu, Ee-Chien Chang

    Abstract: Attack knowledge graph construction seeks to convert textual cyber threat intelligence (CTI) reports into structured representations, portraying the evolutionary traces of cyber attacks. Even though previous research has proposed various methods to construct attack knowledge graphs, they generally suffer from limited generalization capability to diverse knowledge types as well as requirement of ex… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 20 pages, 5 figures

  10. arXiv:2404.11121  [pdf, other

    cs.CR cs.AI

    TransLinkGuard: Safeguarding Transformer Models Against Model Stealing in Edge Deployment

    Authors: Qinfeng Li, Zhiqiang Shen, Zhenghan Qin, Yangfan Xie, Xuhong Zhang, Tianyu Du, Jianwei Yin

    Abstract: Proprietary large language models (LLMs) have been widely applied in various scenarios. Additionally, deploying LLMs on edge devices is trending for efficiency and privacy reasons. However, edge deployment of proprietary LLMs introduces new security challenges: edge-deployed models are exposed as white-box accessible to users, enabling adversaries to conduct effective model stealing (MS) attacks.… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.07152 by other authors

  11. arXiv:2404.07988  [pdf, other

    cs.RO cs.CV cs.GR

    QuasiSim: Parameterized Quasi-Physical Simulators for Dexterous Manipulations Transfer

    Authors: Xueyi Liu, Kangbo Lyu, Jieqiong Zhang, Tao Du, Li Yi

    Abstract: We explore the dexterous manipulation transfer problem by designing simulators. The task wishes to transfer human manipulations to dexterous robot hand simulations and is inherently difficult due to its intricate, highly-constrained, and discontinuous dynamics and the need to control a dexterous hand with a DoF to accurately replicate human manipulations. Previous approaches that optimize in high-… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Project website: https://meowuu7.github.io/QuasiSim/ Code: https://github.com/Meowuu7/QuasiSim Hugging Face Demo: https://huggingface.co/spaces/xymeow7/quasi-physical-sims

  12. arXiv:2404.07424  [pdf, other

    cs.CV

    CopilotCAD: Empowering Radiologists with Report Completion Models and Quantitative Evidence from Medical Image Foundation Models

    Authors: Sheng Wang, Tianming Du, Katherine Fischer, Gregory E Tasian, Justin Ziemba, Joanie M Garratt, Hersh Sagreiya, Yong Fan

    Abstract: Computer-aided diagnosis systems hold great promise to aid radiologists and clinicians in radiological clinical practice and enhance diagnostic accuracy and efficiency. However, the conventional systems primarily focus on delivering diagnostic results through text report generation or medical image classification, positioning them as standalone decision-makers rather than helpers and ignoring radi… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  13. arXiv:2403.06932  [pdf, other

    cs.CL

    ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis

    Authors: Yanming Liu, Xinyue Peng, Tianyu Du, Jianwei Yin, Weihao Liu, Xuhong Zhang

    Abstract: Large language models (LLMs) have achieved commendable accomplishments in various natural language processing tasks. However, LLMs still encounter significant challenges when dealing with complex scenarios involving multiple entities. These challenges arise from the presence of implicit relationships that demand multi-step reasoning. In this paper, we propose a novel approach ERA-CoT, which aids L… ▽ More

    Submitted 6 June, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 15 pages, second version of ERA-CoT

  14. arXiv:2403.06840  [pdf, other

    cs.CL cs.AI

    RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback

    Authors: Yanming Liu, Xinyue Peng, Xuhong Zhang, Weihao Liu, Jianwei Yin, Jiannan Cao, Tianyu Du

    Abstract: Large language models (LLMs) demonstrate exceptional performance in numerous tasks but still heavily rely on knowledge stored in their parameters. Moreover, updating this knowledge incurs high training costs. Retrieval-augmented generation (RAG) methods address this issue by integrating external knowledge. The model can answer questions it couldn't previously by retrieving knowledge relevant to th… ▽ More

    Submitted 6 June, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 20 pages, multiple figures. Providing second version RA-ISF

  15. arXiv:2403.03561  [pdf, ps, other

    cs.CV

    HMD-Poser: On-Device Real-time Human Motion Tracking from Scalable Sparse Observations

    Authors: Peng Dai, Yang Zhang, Tao Liu, Zhen Fan, Tianyuan Du, Zhuo Su, Xiaozheng Zheng, Zeming Li

    Abstract: It is especially challenging to achieve real-time human motion tracking on a standalone VR Head-Mounted Display (HMD) such as Meta Quest and PICO. In this paper, we propose HMD-Poser, the first unified approach to recover full-body motions using scalable sparse observations from HMD and body-worn IMUs. In particular, it can support a variety of input scenarios, such as HMD, HMD+2IMUs, HMD+3IMUs, e… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: CVPR2024 Accepted

  16. arXiv:2403.02360  [pdf, other

    cs.LG cs.AI

    Towards Optimal Customized Architecture for Heterogeneous Federated Learning with Contrastive Cloud-Edge Model Decoupling

    Authors: Xingyan Chen, Tian Du, Mu Wang, Tiancheng Gu, Yu Zhao, Gang Kou, Changqiao Xu, Dapeng Oliver Wu

    Abstract: Federated learning, as a promising distributed learning paradigm, enables collaborative training of a global model across multiple network edge clients without the need for central data collecting. However, the heterogeneity of edge data distribution drags the model towards the local minima, which can be distant from the global optimum. Such heterogeneity often leads to slow convergence and substa… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  17. arXiv:2402.00744  [pdf, other

    cs.SD cs.CL eess.AS

    BATON: Aligning Text-to-Audio Model with Human Preference Feedback

    Authors: Huan Liao, Haonan Han, Kai Yang, Tianjiao Du, Rui Yang, Zunnan Xu, Qinmei Xu, **gquan Liu, Jiasheng Lu, Xiu Li

    Abstract: With the development of AI-Generated Content (AIGC), text-to-audio models are gaining widespread attention. However, it is challenging for these models to generate audio aligned with human preference due to the inherent information density of natural language and limited model understanding ability. To alleviate this issue, we formulate the BATON, a framework designed to enhance the alignment betw… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  18. arXiv:2310.17317  [pdf, other

    cs.NI eess.SY

    RAN Functional Split Options for Integrated Terrestrial and Non-Terrestrial 6G Networks

    Authors: Mohamed Rihan, Tim Due, MohammadAmin Vakilifard, Dirk Wubben, Armin Dekorsy

    Abstract: Leveraging non-terrestrial platforms in 6G networks holds immense significance as it opens up opportunities to expand network coverage, enhance connectivity, and support a wide range of innovative applications, including global-scale Internet of Things and ultra-high-definition content delivery. To accomplish the seamless integration between terrestrial and non-terrestrial networks, substantial ch… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

  19. arXiv:2310.04655  [pdf, other

    cs.CR cs.CV

    VLATTACK: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models

    Authors: Ziyi Yin, Muchao Ye, Tianrong Zhang, Tianyu Du, **guo Zhu, Han Liu, **ghui Chen, Ting Wang, Fenglong Ma

    Abstract: Vision-Language (VL) pre-trained models have shown their superiority on many multimodal tasks. However, the adversarial robustness of such models has not been fully explored. Existing approaches mainly focus on exploring the adversarial robustness under the white-box setting, which is unrealistic. In this paper, we aim to investigate a new yet practical task to craft image and text perturbations u… ▽ More

    Submitted 5 February, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023, 21 pages

  20. arXiv:2310.02428  [pdf, other

    cs.LG cond-mat.mtrl-sci

    EGraFFBench: Evaluation of Equivariant Graph Neural Network Force Fields for Atomistic Simulations

    Authors: Vaibhav Bihani, Utkarsh Pratiush, Sajid Mannan, Tao Du, Zhimin Chen, Santiago Miret, Matthieu Micoulaut, Morten M Smedskjaer, Sayan Ranu, N M Anoop Krishnan

    Abstract: Equivariant graph neural networks force fields (EGraFFs) have shown great promise in modelling complex interactions in atomic systems by exploiting the graphs' inherent symmetries. Recent works have led to a surge in the development of novel architectures that incorporate equivariance-based inductive biases alongside architectural innovations like graph transformers and message passing to model at… ▽ More

    Submitted 24 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

  21. arXiv:2309.13793  [pdf, other

    cs.LG

    ReMasker: Imputing Tabular Data with Masked Autoencoding

    Authors: Tianyu Du, Luca Melis, Ting Wang

    Abstract: We present ReMasker, a new method of imputing missing values in tabular data by extending the masked autoencoding framework. Compared with prior work, ReMasker is both simple -- besides the missing values (i.e., naturally masked), we randomly ``re-mask'' another set of values, optimize the autoencoder by reconstructing this re-masked set, and apply the trained model to predict the missing values;… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

  22. arXiv:2309.13256  [pdf, other

    cs.LG cs.AI

    Defending Pre-trained Language Models as Few-shot Learners against Backdoor Attacks

    Authors: Zhaohan Xi, Tianyu Du, Changjiang Li, Ren Pang, Shouling Ji, **ghui Chen, Fenglong Ma, Ting Wang

    Abstract: Pre-trained language models (PLMs) have demonstrated remarkable performance as few-shot learners. However, their security risks under such settings are largely unexplored. In this work, we conduct a pilot study showing that PLMs as few-shot learners are highly vulnerable to backdoor attacks while existing defenses are inadequate due to the unique challenges of few-shot scenarios. To address such c… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: Accepted by NeurIPS'23

  23. arXiv:2308.10521  [pdf, other

    cs.CV

    PHE-SICH-CT-IDS: A Benchmark CT Image Dataset for Evaluation Semantic Segmentation, Object Detection and Radiomic Feature Extraction of Perihematomal Edema in Spontaneous Intracerebral Hemorrhage

    Authors: Deguo Ma, Chen Li, Lin Qiao, Tianming Du, Dechao Tang, Zhiyu Ma, Marcin Grzegorzek Hongzan, Hongzan Sun

    Abstract: Intracerebral hemorrhage is one of the diseases with the highest mortality and poorest prognosis worldwide. Spontaneous intracerebral hemorrhage (SICH) typically presents acutely, prompt and expedited radiological examination is crucial for diagnosis, localization, and quantification of the hemorrhage. Early detection and accurate segmentation of perihematomal edema (PHE) play a critical role in g… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  24. arXiv:2308.08313  [pdf, other

    eess.IV cs.CV

    ECPC-IDS:A benchmark endometrail cancer PET/CT image dataset for evaluation of semantic segmentation and detection of hypermetabolic regions

    Authors: Dechao Tang, Tianming Du, Deguo Ma, Zhiyu Ma, Hongzan Sun, Marcin Grzegorzek, Huiyan Jiang, Chen Li

    Abstract: Endometrial cancer is one of the most common tumors in the female reproductive system and is the third most common gynecological malignancy that causes death after ovarian and cervical cancer. Early diagnosis can significantly improve the 5-year survival rate of patients. With the development of artificial intelligence, computer-assisted diagnosis plays an increasingly important role in improving… ▽ More

    Submitted 11 October, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: 14 pages,6 figures

  25. arXiv:2308.08172  [pdf, other

    eess.IV cs.CV cs.LG

    AATCT-IDS: A Benchmark Abdominal Adipose Tissue CT Image Dataset for Image Denoising, Semantic Segmentation, and Radiomics Evaluation

    Authors: Zhiyu Ma, Chen Li, Tianming Du, Le Zhang, Dechao Tang, Deguo Ma, Shanchuan Huang, Yan Liu, Yihao Sun, Zhihao Chen, ** Yuan, Qianqing Nie, Marcin Grzegorzek, Hongzan Sun

    Abstract: Methods: In this study, a benchmark \emph{Abdominal Adipose Tissue CT Image Dataset} (AATTCT-IDS) containing 300 subjects is prepared and published. AATTCT-IDS publics 13,732 raw CT slices, and the researchers individually annotate the subcutaneous and visceral adipose tissue regions of 3,213 of those slices that have the same slice distance to validate denoising methods, train semantic segmentati… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: 17 pages, 7 figures

  26. arXiv:2306.04984  [pdf, other

    cs.CR cs.LG

    G$^2$uardFL: Safeguarding Federated Learning Against Backdoor Attacks through Attributed Client Graph Clustering

    Authors: Hao Yu, Chuan Ma, Meng Liu, Tianyu Du, Ming Ding, Tao Xiang, Shouling Ji, Xinwang Liu

    Abstract: Federated Learning (FL) offers collaborative model training without data sharing but is vulnerable to backdoor attacks, where poisoned model weights lead to compromised system integrity. Existing countermeasures, primarily based on anomaly detection, are prone to erroneous rejections of normal weights while accepting poisoned ones, largely due to shortcomings in quantifying similarities among clie… ▽ More

    Submitted 7 December, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: 19 pages, 7 figures

  27. arXiv:2305.02383  [pdf, other

    cs.CR cs.AI

    On the Security Risks of Knowledge Graph Reasoning

    Authors: Zhaohan Xi, Tianyu Du, Changjiang Li, Ren Pang, Shouling Ji, Xiapu Luo, Xusheng Xiao, Fenglong Ma, Ting Wang

    Abstract: Knowledge graph reasoning (KGR) -- answering complex logical queries over large knowledge graphs -- represents an important artificial intelligence task, entailing a range of applications (e.g., cyber threat hunting). However, despite its surging popularity, the potential security risks of KGR are largely unexplored, which is concerning, given the increasing use of such capability in security-crit… ▽ More

    Submitted 22 June, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: In proceedings of USENIX Security'23. Codes: https://github.com/HarrialX/security-risk-KG-reasoning

  28. arXiv:2304.14369  [pdf, other

    cs.LG cs.GR

    Learning Neural Constitutive Laws From Motion Observations for Generalizable PDE Dynamics

    Authors: **chuan Ma, Peter Yichen Chen, Bolei Deng, Joshua B. Tenenbaum, Tao Du, Chuang Gan, Wojciech Matusik

    Abstract: We propose a hybrid neural network (NN) and PDE approach for learning generalizable PDE dynamics from motion observations. Many NN approaches learn an end-to-end model that implicitly models both the governing PDE and constitutive models (or material models). Without explicit PDE knowledge, these approaches cannot guarantee physical correctness and have limited generalizability. We argue that the… ▽ More

    Submitted 15 June, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: Homepage: https://sites.google.com/view/nclaw

  29. arXiv:2304.07980  [pdf, other

    cs.LG cs.CR

    RNN-Guard: Certified Robustness Against Multi-frame Attacks for Recurrent Neural Networks

    Authors: Yunruo Zhang, Tianyu Du, Shouling Ji, Peng Tang, Shanqing Guo

    Abstract: It is well-known that recurrent neural networks (RNNs), although widely used, are vulnerable to adversarial attacks including one-frame attacks and multi-frame attacks. Though a few certified defenses exist to provide guaranteed robustness against one-frame attacks, we prove that defending against multi-frame attacks remains a challenging problem due to their enormous perturbation space. In this p… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

    Comments: 13 pages, 7 figures, 6 tables

  30. arXiv:2304.03223  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    DexDeform: Dexterous Deformable Object Manipulation with Human Demonstrations and Differentiable Physics

    Authors: Sizhe Li, Zhiao Huang, Tao Chen, Tao Du, Hao Su, Joshua B. Tenenbaum, Chuang Gan

    Abstract: In this work, we aim to learn dexterous manipulation of deformable objects using multi-fingered hands. Reinforcement learning approaches for dexterous rigid object manipulation would struggle in this setting due to the complexity of physics interaction with deformable objects. At the same time, previous trajectory optimization approaches with differentiable physics for deformable manipulation woul… ▽ More

    Submitted 27 March, 2023; originally announced April 2023.

    Comments: ICLR 2023. Project page: https://sites.google.com/view/dexdeform

  31. arXiv:2304.01906  [pdf, other

    cs.LG cs.MS econ.EM

    Torch-Choice: A PyTorch Package for Large-Scale Choice Modelling with Python

    Authors: Tianyu Du, Ayush Kanodia, Susan Athey

    Abstract: The $\texttt{torch-choice}$ is an open-source library for flexible, fast choice modeling with Python and PyTorch. $\texttt{torch-choice}$ provides a $\texttt{ChoiceDataset}$ data structure to manage databases flexibly and memory-efficiently. The paper demonstrates constructing a $\texttt{ChoiceDataset}$ from databases of various formats and functionalities of $\texttt{ChoiceDataset}$. The package… ▽ More

    Submitted 14 July, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

  32. arXiv:2303.06562  [pdf, other

    cs.LG cs.CV stat.ML

    ContraNorm: A Contrastive Learning Perspective on Oversmoothing and Beyond

    Authors: Xiaojun Guo, Yifei Wang, Tianqi Du, Yisen Wang

    Abstract: Oversmoothing is a common phenomenon in a wide range of Graph Neural Networks (GNNs) and Transformers, where performance worsens as the number of layers increases. Instead of characterizing oversmoothing from the view of complete collapse in which representations converge to a single point, we dive into a more general perspective of dimensional collapse in which representations lie in a narrow con… ▽ More

    Submitted 2 May, 2023; v1 submitted 11 March, 2023; originally announced March 2023.

    Comments: ICLR 2023

  33. arXiv:2303.04435  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    A Message Passing Perspective on Learning Dynamics of Contrastive Learning

    Authors: Yifei Wang, Qi Zhang, Tianqi Du, Jiansheng Yang, Zhouchen Lin, Yisen Wang

    Abstract: In recent years, contrastive learning achieves impressive results on self-supervised visual representation learning, but there still lacks a rigorous understanding of its learning dynamics. In this paper, we show that if we cast a contrastive objective equivalently into the feature space, then its learning dynamics admits an interpretable form. Specifically, we show that its gradient descent corre… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

    Comments: ICLR 2023

  34. arXiv:2303.01092  [pdf, ps, other

    cs.LG cs.AI cs.CV

    ArCL: Enhancing Contrastive Learning with Augmentation-Robust Representations

    Authors: Xuyang Zhao, Tianqi Du, Yisen Wang, Jun Yao, Weiran Huang

    Abstract: Self-Supervised Learning (SSL) is a paradigm that leverages unlabeled data for model training. Empirical studies show that SSL can achieve promising performance in distribution shift scenarios, where the downstream and training distributions differ. However, the theoretical understanding of its transferability remains limited. In this paper, we develop a theoretical framework to analyze the transf… ▽ More

    Submitted 12 December, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: Accepted by ICLR 2023

  35. arXiv:2211.05982  [pdf, other

    cs.IT eess.SP

    Multi-domain Cooperative SLAM: The Enabler for Integrated Sensing and Communications

    Authors: Jie Yang, Chao-Kai Wen, Xi Yang, **g Xu, Tao Du, Shi **

    Abstract: Simultaneous localization and map** (SLAM) provides user tracking and environmental map** capabilities, enabling communication systems to gain situational awareness. Advanced communication networks with ultra-wideband, multiple antennas, and a large number of connections present opportunities for deep integration of sensing and communications. First, the development of integrated sensing and c… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: Accepted by the IEEE Wireless Communications Magazine

  36. arXiv:2210.07346  [pdf, other

    cs.CR cs.CV cs.LG

    An Embarrassingly Simple Backdoor Attack on Self-supervised Learning

    Authors: Changjiang Li, Ren Pang, Zhaohan Xi, Tianyu Du, Shouling Ji, Yuan Yao, Ting Wang

    Abstract: As a new paradigm in machine learning, self-supervised learning (SSL) is capable of learning high-quality representations of complex data without relying on labels. In addition to eliminating the need for labeled data, research has found that SSL improves the adversarial robustness over supervised learning since lacking labels makes it more challenging for adversaries to manipulate model predictio… ▽ More

    Submitted 13 August, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: The 2023 International Conference on Computer Vision (ICCV '23)

  37. arXiv:2209.13702  [pdf, other

    cs.AI cs.LG

    Reasoning over Multi-view Knowledge Graphs

    Authors: Zhaohan Xi, Ren Pang, Changjiang Li, Tianyu Du, Shouling Ji, Fenglong Ma, Ting Wang

    Abstract: Recently, knowledge representation learning (KRL) is emerging as the state-of-the-art approach to process queries over knowledge graphs (KGs), wherein KG entities and the query are embedded into a latent space such that entities that answer the query are embedded close to the query. Yet, despite the intensive research on KRL, most existing studies either focus on homogenous KGs or assume KG comple… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

  38. Fluidic Topology Optimization with an Anisotropic Mixture Model

    Authors: Yifei Li, Tao Du, Sangeetha Grama Srinivasan, Kui Wu, Bo Zhu, Eftychios Sifakis, Wojciech Matusik

    Abstract: Fluidic devices are crucial components in many industrial applications involving fluid mechanics. Computational design of a high-performance fluidic system faces multifaceted challenges regarding its geometric representation and physical accuracy. We present a novel topology optimization method to design fluidic devices in a Stokes flow context. Our approach is featured by its capability in accomm… ▽ More

    Submitted 24 September, 2022; v1 submitted 21 September, 2022; originally announced September 2022.

    Comments: Accepted by SIGGRAPH Asia 2022. For low resolution paper see https://people.csail.mit.edu/liyifei/publication/anisotropic-stokes-fluidic-device/

    Journal ref: ACM Transactions on Graphics (TOG), 2022

  39. arXiv:2208.07239  [pdf, other

    cs.LG cs.AI cs.SI

    ROLAND: Graph Learning Framework for Dynamic Graphs

    Authors: Jiaxuan You, Tianyu Du, Jure Leskovec

    Abstract: Graph Neural Networks (GNNs) have been successfully applied to many real-world static graphs. However, the success of static graphs has not fully translated to dynamic graphs due to the limitations in model design, evaluation settings, and training strategies. Concretely, existing dynamic GNNs do not incorporate state-of-the-art designs from static GNNs, which limits their performance. Current eva… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: Published in SIGKDD 2022 (Research Track)

  40. arXiv:2205.05678  [pdf, other

    cs.CV cs.AI cs.GR cs.LG cs.RO

    RISP: Rendering-Invariant State Predictor with Differentiable Simulation and Rendering for Cross-Domain Parameter Estimation

    Authors: **chuan Ma, Tao Du, Joshua B. Tenenbaum, Wojciech Matusik, Chuang Gan

    Abstract: This work considers identifying parameters characterizing a physical system's dynamic motion directly from a video whose rendering configurations are inaccessible. Existing solutions require massive training data or lack generalizability to unknown rendering configurations. We propose a novel approach that marries domain randomization and differentiable rendering gradients to address this problem.… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: ICLR Oral. Project page: http://risp.csail.mit.edu

  41. arXiv:2205.02835  [pdf, other

    cs.RO cs.CV cs.GR cs.LG

    Contact Points Discovery for Soft-Body Manipulations with Differentiable Physics

    Authors: Sizhe Li, Zhiao Huang, Tao Du, Hao Su, Joshua B. Tenenbaum, Chuang Gan

    Abstract: Differentiable physics has recently been shown as a powerful tool for solving soft-body manipulation tasks. However, the differentiable physics solver often gets stuck when the initial contact points of the end effectors are sub-optimal or when performing multi-stage tasks that require contact point switching, which often leads to local minima. To address this challenge, we propose a contact point… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

    Comments: ICLR 2022 Spotlight. Project page: http://cpdeform.csail.mit.edu

  42. arXiv:2204.12584  [pdf, other

    cs.RO cs.LG physics.flu-dyn

    Fast Aquatic Swimmer Optimization with Differentiable Projective Dynamics and Neural Network Hydrodynamic Models

    Authors: Elvis Nava, John Z. Zhang, Mike Y. Michelis, Tao Du, **chuan Ma, Benjamin F. Grewe, Wojciech Matusik, Robert K. Katzschmann

    Abstract: Aquatic locomotion is a classic fluid-structure interaction (FSI) problem of interest to biologists and engineers. Solving the fully coupled FSI equations for incompressible Navier-Stokes and finite elasticity is computationally expensive. Optimizing robotic swimmer design within such a system generally involves cumbersome, gradient-free procedures on top of the already costly simulation. To addre… ▽ More

    Submitted 22 June, 2022; v1 submitted 30 March, 2022; originally announced April 2022.

    Comments: ICML 2022

  43. arXiv:2204.05231  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    Towards Generalizable Semantic Product Search by Text Similarity Pre-training on Search Click Logs

    Authors: Zheng Liu, Wei Zhang, Yan Chen, Weiyi Sun, Tianchuan Du, Benjamin Schroeder

    Abstract: Recently, semantic search has been successfully applied to e-commerce product search and the learned semantic space(s) for query and product encoding are expected to generalize to unseen queries or products. Yet, whether generalization can conveniently emerge has not been thoroughly studied in the domain thus far. In this paper, we examine several general-domain and domain-specific pre-trained Rob… ▽ More

    Submitted 28 April, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: 9 pages. ACL 2022 ECNLP Workshop

    ACM Class: F.2.2; I.2.7

    Journal ref: ECNLP 2022

  44. arXiv:2202.08370  [pdf, other

    cs.LG econ.EM

    CAREER: A Foundation Model for Labor Sequence Data

    Authors: Keyon Vafa, Emil Palikot, Tianyu Du, Ayush Kanodia, Susan Athey, David M. Blei

    Abstract: Labor economists regularly analyze employment data by fitting predictive models to small, carefully constructed longitudinal survey datasets. Although machine learning methods offer promise for such problems, these survey datasets are too small to take advantage of them. In recent years large datasets of online resumes have also become available, providing data about the career trajectories of mil… ▽ More

    Submitted 29 February, 2024; v1 submitted 16 February, 2022; originally announced February 2022.

  45. arXiv:2202.02545  [pdf

    cs.SD cs.CL eess.AS

    Optimization of a Real-Time Wavelet-Based Algorithm for Improving Speech Intelligibility

    Authors: Tianqu Kang, Anh-Dung Dinh, Binghong Wang, Tianyuan Du, Yijia Chen, Kevin Chau

    Abstract: The optimization of a wavelet-based algorithm to improve speech intelligibility along with the full data set and results are reported. The discrete-time speech signal is split into frequency sub-bands via a multi-level discrete wavelet transform. Various gains are applied to the sub-band signals before they are recombined to form a modified version of the speech. The sub-band gains are adjusted wh… ▽ More

    Submitted 21 July, 2022; v1 submitted 5 February, 2022; originally announced February 2022.

    Comments: 16 pages, 7 figures, 4 tables

  46. arXiv:2112.13214  [pdf, other

    cs.LG cs.AI

    NeuronFair: Interpretable White-Box Fairness Testing through Biased Neuron Identification

    Authors: Haibin Zheng, Zhiqing Chen, Tianyu Du, Xuhong Zhang, Yao Cheng, Shouling Ji, **gyi Wang, Yue Yu, **yin Chen

    Abstract: Deep neural networks (DNNs) have demonstrated their outperformance in various domains. However, it raises a social concern whether DNNs can produce reliable and fair decisions especially when they are applied to sensitive domains involving valuable resource allocation, such as education, loan, and employment. It is crucial to conduct fairness testing before DNNs are reliably deployed to such sensi… ▽ More

    Submitted 25 December, 2021; originally announced December 2021.

  47. arXiv:2110.15358  [pdf, other

    cs.CV cs.AI cs.LG

    Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language

    Authors: Mingyu Ding, Zhenfang Chen, Tao Du, ** Luo, Joshua B. Tenenbaum, Chuang Gan

    Abstract: In this work, we propose a unified framework, called Visual Reasoning with Differ-entiable Physics (VRDP), that can jointly learn visual concepts and infer physics models of objects and their interactions from videos and language. This is achieved by seamlessly integrating three components: a visual perception module, a concept learner, and a differentiable physics engine. The visual perception mo… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021. Project page: http://vrdp.csail.mit.edu/

  48. Sim2Real for Soft Robotic Fish via Differentiable Simulation

    Authors: John Z. Zhang, Yu Zhang, **chuan Ma, Elvis Nava, Tao Du, Philip Arm, Wojciech Matusik, Robert K. Katzschmann

    Abstract: Accurate simulation of soft mechanisms under dynamic actuation is critical for the design of soft robots. We address this gap with our differentiable simulation tool by learning the material parameters of our soft robotic fish. On the example of a soft robotic fish, we demonstrate an experimentally-verified, fast optimization pipeline for learning the material parameters from quasi-static data via… ▽ More

    Submitted 8 November, 2022; v1 submitted 30 September, 2021; originally announced September 2021.

    Comments: Published at IROS 2022. 8 pages, 9 figures

  49. arXiv:2106.05306  [pdf, other

    cs.GR cs.LG

    DiffCloth: Differentiable Cloth Simulation with Dry Frictional Contact

    Authors: Yifei Li, Tao Du, Kui Wu, Jie Xu, Wojciech Matusik

    Abstract: Cloth simulation has wide applications in computer animation, garment design, and robot-assisted dressing. This work presents a differentiable cloth simulator whose additional gradient information facilitates cloth-related applications. Our differentiable simulator extends a state-of-the-art cloth simulator based on Projective Dynamics (PD) and with dry frictional contact. We draw inspiration from… ▽ More

    Submitted 17 July, 2022; v1 submitted 9 June, 2021; originally announced June 2021.

    Journal ref: ACM Transactions on Graphics (TOG), 2022

  50. arXiv:2104.11783  [pdf, other

    cs.IR econ.GN q-fin.GN

    Form 10-Q Itemization

    Authors: Yanci Zhang, Tianming Du, Yujie Sun, Lawrence Donohue, Rui Dai

    Abstract: The quarterly financial statement, or Form 10-Q, is one of the most frequently required filings for US public companies to disclose financial and other important business information. Due to the massive volume of 10-Q filings and the enormous variations in the reporting format, it has been a long-standing challenge to retrieve item-specific information from 10-Q filings that lack machine-readable… ▽ More

    Submitted 19 October, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

    Comments: 6 pages, 3 figures, 3 tables, http://review10q.ddns.net/