Showing 1–50 of 68 results for author: Fan, T

Searching in archive cs.
  1. arXiv:2406.12403  [pdf, other]

    cs.CL cs.AI

    PDSS: A Privacy-Preserving Framework for Step-by-Step Distillation of Large Language Models

    Authors: Tao Fan, Yan Kang, Weijing Chen, Hanlin Gu, Yuanfeng Song, Lixin Fan, Kai Chen, Qiang Yang

    Abstract: In the context of real-world applications, leveraging large language models (LLMs) for domain-specific tasks often faces two major challenges: domain-specific knowledge privacy and constrained resources. To address these issues, we propose PDSS, a privacy-preserving framework for step-by-step distillation of LLMs. PDSS works on a server-client architecture, wherein client transmits perturbed promp…

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2406.02224  [pdf, other]

    cs.CL cs.AI

    FedMKT: Federated Mutual Knowledge Transfer for Large and Small Language Models

    Authors: Tao Fan, Guoqiang Ma, Yan Kang, Hanlin Gu, Yuanfeng Song, Lixin Fan, Kai Chen, Qiang Yang

    Abstract: Recent research in federated large language models (LLMs) has primarily focused on enabling clients to fine-tune their locally deployed homogeneous LLMs collaboratively or on transferring knowledge from server-based LLMs to small language models (SLMs) at downstream clients. However, a significant gap remains in the simultaneous mutual enhancement of both the server's LLM and clients' SLMs. To bri…

    Submitted 18 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  3. arXiv:2405.09357  [pdf, ps, other]

    cs.SI physics.soc-ph

    A universal optimization framework based on cycle ranking for influence maximization in complex networks

    Authors: Wenfeng Shi, Tianlong Fan, Shuqi Xu, Rongmei Yang, Linyuan Lü

    Abstract: Influence maximization aims to identify a set of influential individuals, referred to as influencers, as information sources to maximize the spread of information within networks, constituting a vital combinatorial optimization problem with extensive practical applications and sustained interdisciplinary interest. Diverse approaches have been devised to efficiently address this issue, one of which…

    Submitted 15 May, 2024; originally announced May 2024.

  4. arXiv:2404.11862  [pdf]

    cs.SI cs.DS cs.IR physics.data-an

    A Fast Maximum Clique Algorithm Based on Network Decomposition for Large Sparse Networks

    Authors: Tianlong Fan, Wenjun Jiang, Yi-Cheng Zhang, Linyuan Lü

    Abstract: Finding maximum cliques in large networks is a challenging combinatorial problem with many real-world applications. We present a fast algorithm to achieve the exact solution for the maximum clique problem in large sparse networks based on efficient graph decomposition. A bunch of effective techniques is being used to greatly prune the graph and a novel concept called Complete-Upper-Bound-Induced S…

    Submitted 18 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: 12 pages, 2 figures, 1 table

    MSC Class: 05C82 (Primary); 05C80; 91D30; 68P20 (Secondary) ACM Class: H.3.3; F.2.2; J.2

  5. arXiv:2403.01400  [pdf, other]

    cs.LG cs.AI

    Decoupling Weighing and Selecting for Integrating Multiple Graph Pre-training Tasks

    Authors: Tianyu Fan, Lirong Wu, Yufei Huang, Haitao Lin, Cheng Tan, Zhangyang Gao, Stan Z. Li

    Abstract: Recent years have witnessed the great success of graph pre-training for graph representation learning. With hundreds of graph pre-training tasks proposed, integrating knowledge acquired from multiple pre-training tasks has become a popular research topic. In this paper, we identify two important collaborative processes for this topic: (1) select: how to select an optimal task combination from a gi…

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: Published as a conference paper at ICLR 2024

  6. arXiv:2403.00027  [pdf, ps, other]

    cs.SI cs.LG cs.NI

    A Quick Framework for Evaluating Worst Robustness of Complex Networks

    Authors: Wenjun Jiang, Peiyan Li, Tianlong Fan, Ting Li, Chuan-fu Zhang, Tao Zhang, Zong-fu Luo

    Abstract: Robustness is pivotal for comprehending, designing, optimizing, and rehabilitating networks, with simulation attacks being the prevailing evaluation method. Simulation attacks are often time-consuming or even impractical, however, a more crucial yet persistently overlooked drawback is that any attack strategy merely provides a potential paradigm of disintegration. The key concern is: in the worst-…

    Submitted 28 February, 2024; originally announced March 2024.

    Comments: 30 pages, 8 figures, 4 tables, journal

    MSC Class: 68T07 (Primary) 90B25; 05C80; 05C82; 90B15; 90B18 (Secondary) ACM Class: I.2.6; G.2.2; J.4; F.2.2

  7. arXiv:2312.13583  [pdf, other]

    cs.LG cs.AI stat.ML

    Fine-tuning Graph Neural Networks by Preserving Graph Generative Patterns

    Authors: Yifei Sun, Qi Zhu, Yang Yang, Chunping Wang, Tianyu Fan, Jiajun Zhu, Lei Chen

    Abstract: Recently, the paradigm of pre-training and fine-tuning graph neural networks has been intensively studied and applied in a wide range of graph mining tasks. Its success is generally attributed to the structural consistency between pre-training and downstream datasets, which, however, does not hold in many real-world scenarios. Existing works have shown that the structural divergence between pre-tr…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024

  8. arXiv:2312.13469  [pdf, other]

    cs.RO cs.CV cs.LG

    Neural feels with neural fields: Visuo-tactile perception for in-hand manipulation

    Authors: Sudharshan Suresh, Haozhi Qi, Tingfan Wu, Taosha Fan, Luis Pineda, Mike Lambeta, Jitendra Malik, Mrinal Kalakrishnan, Roberto Calandra, Michael Kaess, Joseph Ortiz, Mustafa Mukadam

    Abstract: To achieve human-level dexterity, robots must infer spatial awareness from multimodal sensing to reason over contact interactions. During in-hand manipulation of novel objects, such spatial awareness involves estimating the object's pose and shape. The status quo for in-hand perception primarily employs vision, and restricts to tracking a priori known objects. Moreover, visual occlusion of objects…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 43 pages, 20 figures, 1 table; https://suddhu.github.io/neural-feels/

  9. arXiv:2311.17431  [pdf, other]

    cs.LG cs.AI

    Grounding Foundation Models through Federated Transfer Learning: A General Framework

    Authors: Yan Kang, Tao Fan, Hanlin Gu, Xiaojin Zhang, Lixin Fan, Qiang Yang

    Abstract: Foundation Models (FMs) such as GPT-4 encoded with vast knowledge and powerful emergent abilities have achieved remarkable success in various natural language processing and computer vision tasks. Grounding FMs by adapting them to domain-specific tasks or augmenting them with domain-specific knowledge enables us to exploit the full potential of FMs. However, grounding FMs faces several challenges,…

    Submitted 29 March, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: In progress

  10. arXiv:2311.00684  [pdf, other]

    cs.CL cs.LG

    Attention Alignment and Flexible Positional Embeddings Improve Transformer Length Extrapolation

    Authors: Ta-Chung Chi, Ting-Han Fan, Alexander I. Rudnicky

    Abstract: An ideal length-extrapolatable Transformer language model can handle sequences longer than the training length without any fine-tuning. Such long-context utilization capability relies heavily on a flexible positional embedding design. Upon investigating the flexibility of existing large pre-trained Transformer language models, we find that the T5 family deserves a closer look, as its positional em…

    Submitted 15 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

  11. arXiv:2310.10049  [pdf, other]

    cs.LG cs.AI

    FATE-LLM: A Industrial Grade Federated Learning Framework for Large Language Models

    Authors: Tao Fan, Yan Kang, Guoqiang Ma, Weijing Chen, Wenbin Wei, Lixin Fan, Qiang Yang

    Abstract: Large Language Models (LLMs), such as ChatGPT, LLaMA, GLM, and PaLM, have exhibited remarkable performances across various tasks in recent years. However, LLMs face two main challenges in real-world applications. One challenge is that training LLMs consumes vast computing resources, preventing LLMs from being adopted by small and medium-sized enterprises with limited computing resources. Another i…

    Submitted 16 October, 2023; originally announced October 2023.

  12. arXiv:2309.07412  [pdf, other]

    cs.CL cs.LG

    Advancing Regular Language Reasoning in Linear Recurrent Neural Networks

    Authors: Ting-Han Fan, Ta-Chung Chi, Alexander I. Rudnicky

    Abstract: In recent studies, linear recurrent neural networks (LRNNs) have achieved Transformer-level performance in natural language and long-range modeling, while offering rapid parallel training and constant inference cost. With the resurgence of interest in LRNNs, we study whether they can learn the hidden rules in training sequences, such as the grammatical structures of regular language. We theoretica…

    Submitted 9 April, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: Accepted at the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024). The first two authors contributed equally to this work

  13. arXiv:2308.08012  [pdf, ps, other]

    cs.CV cs.LG cs.NI cs.SI

    Comprehensive Analysis of Network Robustness Evaluation Based on Convolutional Neural Networks with Spatial Pyramid Pooling

    Authors: Wenjun Jiang, Tianlong Fan, Changhao Li, Chuanfu Zhang, Tao Zhang, Zong-fu Luo

    Abstract: Connectivity robustness, a crucial aspect for understanding, optimizing, and repairing complex networks, has traditionally been evaluated through time-consuming and often impractical simulations. Fortunately, machine learning provides a new avenue for addressing this challenge. However, several key issues remain unresolved, including the performance in more general edge removal scenarios, capturin…

    Submitted 28 May, 2024; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: 25 pages, 8 figures, 7 tables, journal

    MSC Class: 68T07 (Primary) 90B25; 05C80; 05C82; 90B15; 90B18 (Secondary) ACM Class: I.2.6; G.2.2; J.4; F.2.2

  14. arXiv:2305.13571  [pdf, other]

    cs.CL

    Latent Positional Information is in the Self-Attention Variance of Transformer Language Models Without Positional Embeddings

    Authors: Ta-Chung Chi, Ting-Han Fan, Li-Wei Chen, Alexander I. Rudnicky, Peter J. Ramadge

    Abstract: The use of positional embeddings in transformer language models is widely accepted. However, recent research has called into question the necessity of such embeddings. We further extend this inquiry by demonstrating that a randomly initialized and frozen transformer language model, devoid of positional embeddings, inherently encodes strong positional information through the shrinkage of self-atten…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL 2023

  15. arXiv:2305.10758  [pdf, other]

    cs.LG

    Extracting Low-/High- Frequency Knowledge from Graph Neural Networks and Injecting it into MLPs: An Effective GNN-to-MLP Distillation Framework

    Authors: Lirong Wu, Haitao Lin, Yufei Huang, Tianyu Fan, Stan Z. Li

    Abstract: Recent years have witnessed the great success of Graph Neural Networks (GNNs) in handling graph-related tasks. However, MLPs remain the primary workhorse for practical industrial applications due to their desirable inference efficiency and scalability. To reduce their gaps, one can directly distill knowledge from a well-designed teacher GNN to a student MLP, which is termed as GNN-to-MLP distillat…

    Submitted 4 June, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  16. arXiv:2305.07026  [pdf, other]

    cs.CV cs.RO math.OC

    Decentralization and Acceleration Enables Large-Scale Bundle Adjustment

    Authors: Taosha Fan, Joseph Ortiz, Ming Hsiao, Maurizio Monge, Jing Dong, Todd Murphey, Mustafa Mukadam

    Abstract: Scaling to arbitrarily large bundle adjustment problems requires data and compute to be distributed across multiple devices. Centralized methods in prior works are only able to solve small or medium size problems due to overhead in computation and communication. In this paper, we present a fully decentralized method that alleviates computation and communication bottlenecks to solve arbitrarily lar…

    Submitted 8 August, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: Robotics: Science and Systems (RSS), 2023

  17. arXiv:2305.05356  [pdf, other]

    cs.CV cs.MM eess.IV

    Learning Dynamic Point Cloud Compression via Hierarchical Inter-frame Block Matching

    Authors: Shuting Xia, Tingyu Fan, Yiling Xu, Jenq-Neng Hwang, Zhu Li

    Abstract: 3D dynamic point cloud (DPC) compression relies on mining its temporal context, which faces significant challenges due to DPC's sparsity and non-uniform structure. Existing methods are limited in capturing sufficient temporal dependencies. Therefore, this paper proposes a learning-based DPC compression framework via hierarchical block-matching-based inter-prediction module to compensate and compre…

    Submitted 16 May, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 9 pages for the main body, 3 pages for the supplemental after References

  18. arXiv:2305.03796  [pdf, other]

    cs.CL

    Transformer Working Memory Enables Regular Language Reasoning and Natural Language Length Extrapolation

    Authors: Ta-Chung Chi, Ting-Han Fan, Alexander I. Rudnicky, Peter J. Ramadge

    Abstract: Unlike recurrent models, conventional wisdom has it that Transformers cannot perfectly model regular languages. Inspired by the notion of working memory, we propose a new Transformer variant named RegularGPT. With its novel combination of Weight-Sharing, Adaptive-Depth, and Sliding-Dilated-Attention, RegularGPT constructs working memory along the depth dimension, thereby enabling efficient and suc…

    Submitted 5 May, 2023; originally announced May 2023.

  19. arXiv:2304.14356  [pdf, other]

    cs.RO

    S$^2$MAT: Simultaneous and Self-Reinforced Mapping and Tracking in Dynamic Urban Scenarios

    Authors: Tingxiang Fan, Bowen Shen, Yinqiang Zhang, Chuye Zhang, Lei Yang, Hua Chen, Wei Zhang, Jia Pan

    Abstract: Despite the increasing prevalence of robots in daily life, their navigation capabilities are still limited to environments with prior knowledge, such as a global map. To fully unlock the potential of robots, it is crucial to enable them to navigate in large-scale unknown and changing unstructured scenarios. This requires the robot to construct an accurate static map in real-time as it explores, wh…

    Submitted 20 November, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: homepage: https://sites.google.com/view/smat-nav

  20. arXiv:2303.13277  [pdf, other]

    cs.CV

    SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field

    Authors: Chong Bao, Yinda Zhang, Bangbang Yang, Tianxing Fan, Zesong Yang, Hujun Bao, Guofeng Zhang, Zhaopeng Cui

    Abstract: Despite the great success in 2D editing using user-friendly tools, such as Photoshop, semantic strokes, or even text prompts, similar capabilities in 3D areas are still limited, either relying on 3D modeling skills or allowing editing within only a few categories. In this paper, we present a novel semantic-driven NeRF editing approach, which enables users to edit a neural radiance field with a sin…

    Submitted 25 March, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023. Project Page: https://zju3dv.github.io/sine/

  21. arXiv:2212.11589  [pdf, other]

    cs.SE

    Simulation-based Testing of Simulink Models with Test Sequence and Test Assessment Blocks

    Authors: Federico Formica, Tony Fan, Akshay Rajhans, Vera Pantelic, Mark Lawford, Claudio Menghi

    Abstract: Simulation-based software testing supports engineers in finding faults in Simulink models. It typically relies on search algorithms that iteratively generate test inputs used to exercise models in simulation to detect design errors. While simulation-based software testing techniques are effective in many practical scenarios, they are typically not fully integrated within the Simulink environment a…

    Submitted 22 December, 2022; originally announced December 2022.

  22. arXiv:2212.10356  [pdf, other]

    cs.CL

    Dissecting Transformer Length Extrapolation via the Lens of Receptive Field Analysis

    Authors: Ta-Chung Chi, Ting-Han Fan, Alexander I. Rudnicky, Peter J. Ramadge

    Abstract: Length extrapolation permits training a transformer language model on short sequences that preserves perplexities when tested on substantially longer sequences. A relative positional embedding design, ALiBi, has had the widest usage to date. We dissect ALiBi via the lens of receptive field analysis empowered by a novel cumulative normalized gradient tool. The concept of receptive field further all…

    Submitted 23 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted by ACL 2023

  23. Probabilistic partition of unity networks for high-dimensional regression problems

    Authors: Tiffany Fan, Nathaniel Trask, Marta D'Elia, Eric Darve

    Abstract: We explore the probabilistic partition of unity network (PPOU-Net) model in the context of high-dimensional regression problems and propose a general framework focusing on adaptive dimensionality reduction. With the proposed framework, the target function is approximated by a mixture of experts model on a low-dimensional manifold, where each cluster is associated with a local fixed-degree polynomi…

    Submitted 11 June, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

  24. arXiv:2210.02099  [pdf, other]

    cs.LG

    Automated Graph Self-supervised Learning via Multi-teacher Knowledge Distillation

    Authors: Lirong Wu, Yufei Huang, Haitao Lin, Zicheng Liu, Tianyu Fan, Stan Z. Li

    Abstract: Self-supervised learning on graphs has recently achieved remarkable success in graph representation learning. With hundreds of self-supervised pretext tasks proposed over the past few years, the research community has greatly developed, and the key is no longer to design more powerful but complex pretext tasks, but to make more effective use of those already on hand. This paper studies the problem…

    Submitted 5 October, 2022; originally announced October 2022.

  25. arXiv:2209.12512  [pdf, other]

    cs.CV eess.IV

    Multiscale Latent-Guided Entropy Model for LiDAR Point Cloud Compression

    Authors: Tingyu Fan, Linyao Gao, Yiling Xu, Dong Wang, Zhu Li

    Abstract: The non-uniform distribution and extremely sparse nature of the LiDAR point cloud (LPC) bring significant challenges to its high-efficient compression. This paper proposes a novel end-to-end, fully-factorized deep framework that encodes the original LPC into an octree structure and hierarchically decomposes the octree entropy model in layers. The proposed framework utilizes a hierarchical latent v…

    Submitted 14 February, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

  26. arXiv:2207.13979   

    cs.CL cs.AI

    Knowing Where and What: Unified Word Block Pretraining for Document Understanding

    Authors: Song Tao, Zijian Wang, Tiantian Fan, Canjie Luo, Can Huang

    Abstract: Due to the complex layouts of documents, it is challenging to extract information for documents. Most previous studies develop multimodal pre-trained models in a self-supervised way. In this paper, we focus on the embedding learning of word blocks containing text and layout information, and propose UTel, a language model with Unified TExt and Layout pre-training. Specifically, we propose two pre-t…

    Submitted 29 July, 2022; v1 submitted 28 July, 2022; originally announced July 2022.

    Comments: incomplete experiments

  27. arXiv:2207.11467  [pdf, other]

    cs.CV cs.AI

    CompNVS: Novel View Synthesis with Scene Completion

    Authors: Zuoyue Li, Tianxing Fan, Zhenqiang Li, Zhaopeng Cui, Yoichi Sato, Marc Pollefeys, Martin R. Oswald

    Abstract: We introduce a scalable framework for novel view synthesis from RGB-D images with largely incomplete scene coverage. While generative neural approaches have demonstrated spectacular results on 2D images, they have not yet achieved similar photorealistic results in combination with scene completion where a spatial 3D scene understanding is essential. To this end, we propose a generative pipeline pe…

    Submitted 23 July, 2022; originally announced July 2022.

    Comments: ECCV 2022

  28. Accelerating Vertical Federated Learning

    Authors: Dongqi Cai, Tao Fan, Yan Kang, Lixin Fan, Mengwei Xu, Shangguang Wang, Qiang Yang

    Abstract: Privacy, security and data governance constraints rule out a brute force process in the integration of cross-silo data, which inherits the development of the Internet of Things. Federated learning is proposed to ensure that all parties can collaboratively complete the training task while the data is not out of the local. Vertical federated learning is a specialization of federated learning for dis…

    Submitted 21 January, 2024; v1 submitted 23 July, 2022; originally announced July 2022.

  29. arXiv:2207.11016  [pdf, other]

    cs.SE

    Search-based Software Testing Driven by Automatically Generated and Manually Defined Fitness Functions

    Authors: Federico Formica, Tony Fan, Claudio Menghi

    Abstract: Search-based software testing (SBST) typically relies on fitness functions to guide the search exploration toward software failures. There are two main techniques to define fitness functions: (a) automated fitness function computation from the specification of the system requirements, and (b) manual fitness function design. Both techniques have advantages. The former uses information from the syst…

    Submitted 7 September, 2023; v1 submitted 22 July, 2022; originally announced July 2022.

  30. arXiv:2207.09442  [pdf, other]

    cs.RO cs.CV cs.LG math.OC

    Theseus: A Library for Differentiable Nonlinear Optimization

    Authors: Luis Pineda, Taosha Fan, Maurizio Monge, Shobha Venkataraman, Paloma Sodhi, Ricky T. Q. Chen, Joseph Ortiz, Daniel DeTone, Austin Wang, Stuart Anderson, Jing Dong, Brandon Amos, Mustafa Mukadam

    Abstract: We present Theseus, an efficient application-agnostic open source library for differentiable nonlinear least squares (DNLS) optimization built on PyTorch, providing a common framework for end-to-end structured learning in robotics and vision. Existing DNLS implementations are application specific and do not always incorporate many ingredients important for efficiency. Theseus is application-agnost…

    Submitted 18 January, 2023; v1 submitted 19 July, 2022; originally announced July 2022.

    Comments: Advances in Neural Information Processing Systems (NeurIPS), 2022

  31. arXiv:2206.15102  [pdf, other]

    cs.RO

    DynamicFilter: an Online Dynamic Objects Removal Framework for Highly Dynamic Environments

    Authors: Tingxiang Fan, Bowen Shen, Hua Chen, Wei Zhang, Jia Pan

    Abstract: Emergence of massive dynamic objects will diversify spatial structures when robots navigate in urban environments. Therefore, the online removal of dynamic objects is critical. In this paper, we introduce a novel online removal framework for highly dynamic urban environments. The framework consists of the scan-to-map front-end and the map-to-map back-end modules. Both the front- and back-ends deep…

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: ICRA 2022

  32. arXiv:2206.07235  [pdf, other]

    cs.LG cs.AI

    Training Discrete Deep Generative Models via Gapped Straight-Through Estimator

    Authors: Ting-Han Fan, Ta-Chung Chi, Alexander I. Rudnicky, Peter J. Ramadge

    Abstract: While deep generative models have succeeded in image processing, natural language processing, and reinforcement learning, training that involves discrete random variables remains challenging due to the high variance of its gradient estimation process. Monte Carlo is a common solution used in most variance reduction approaches. However, this involves time-consuming resampling and multiple function…

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: Accepted at the International Conference on Machine Learning (ICML) 2022. The first two authors contributed equally

  33. arXiv:2205.09921  [pdf, other]

    cs.CL cs.LG

    KERPLE: Kernelized Relative Positional Embedding for Length Extrapolation

    Authors: Ta-Chung Chi, Ting-Han Fan, Peter J. Ramadge, Alexander I. Rudnicky

    Abstract: Relative positional embeddings (RPE) have received considerable attention since RPEs effectively model the relative distance among tokens and enable length extrapolation. We propose KERPLE, a framework that generalizes relative position embedding for extrapolation by kernelizing positional differences. We achieve this goal using conditionally positive definite (CPD) kernels, a class of functions k…

    Submitted 13 October, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: Accepted at the 36th Conference on Neural Information Processing Systems (NeurIPS 2022). The first two authors contributed equally to this work

  34. arXiv:2205.01135  [pdf, other]

    cs.CV cs.LG eess.IV

    D-DPCC: Deep Dynamic Point Cloud Compression via 3D Motion Prediction

    Authors: Tingyu Fan, Linyao Gao, Yiling Xu, Zhu Li, Dong Wang

    Abstract: The non-uniformly distributed nature of the 3D dynamic point cloud (DPC) brings significant challenges to its high-efficient inter-frame compression. This paper proposes a novel 3D sparse convolution-based Deep Dynamic Point Cloud Compression (D-DPCC) network to compensate and compress the DPC geometry with 3D motion estimation and motion compensation in the feature space. In the proposed D-DPCC n…

    Submitted 2 May, 2022; originally announced May 2022.

  35. arXiv:2202.03846  [pdf]

    cs.RO

    The Soft Compiler: A Web-Based Tool for the Design of Modular Pneumatic Circuits for Soft Robots

    Authors: Lauryn Whiteside, Savita V. Kendre, Tian Y. Fan, Jovanna A. Tracz, Gus T. Teran, Thomas C. Underwood, Mohammed E. Sayed, Haihui J. Jiang, Adam A. Stokes, Daniel J. Preston, George M. Whitesides, Markus P. Nemitz

    Abstract: Developing soft circuits from individual soft logic gates poses a unique challenge: with increasing numbers of logic gates, the design and implementation of circuits leads to inefficiencies due to mathematically unoptimized circuits and wiring mistakes during assembly. It is therefore practically important to introduce design tools that support the development of soft circuits. We developed a web-…

    Submitted 8 February, 2022; originally announced February 2022.

    Comments: Accepted manuscript (journal): Robotics and Automation Letter, 2022

  36. arXiv:2111.15113  [pdf, other]

    cs.CV

    LatentHuman: Shape-and-Pose Disentangled Latent Representation for Human Bodies

    Authors: Sandro Lombardi, Bangbang Yang, Tianxing Fan, Hujun Bao, Guofeng Zhang, Marc Pollefeys, Zhaopeng Cui

    Abstract: 3D representation and reconstruction of human bodies have been studied for a long time in computer vision. Traditional methods rely mostly on parametric statistical linear models, limiting the space of possible bodies to linear combinations. It is only recently that some approaches try to leverage neural implicit representations for human body modeling, and while demonstrating impressive results,…

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: Accepted to 3DV 2021. Project Page: https://latenthuman.github.io/

  37. arXiv:2110.12352  [pdf, other]

    cs.RO cs.AI

    DiffSRL: Learning Dynamical State Representation for Deformable Object Manipulation with Differentiable Simulator

    Authors: Sirui Chen, Yunhao Liu, Jialong Li, Shang Wen Yao, Tingxiang Fan, Jia Pan

    Abstract: Dynamic state representation learning is an important task in robot learning. Latent space that can capture dynamics related information has wide application in areas such as accelerating model free reinforcement learning, closing the simulation to reality gap, as well as reducing the motion planning complexity. However, current dynamic state representation learning methods scale poorly on complex…

    Submitted 25 July, 2022; v1 submitted 24 October, 2021; originally announced October 2021.

    Comments: 8 pages 9 figures

    Journal ref: IEEE Robotics and Automation Letters, 2022

  38. arXiv:2110.10927  [pdf, other]

    cs.LG cs.AI

    SecureBoost+: Large Scale and High-Performance Vertical Federated Gradient Boosting Decision Tree

    Authors: Tao Fan, Weijing Chen, Guoqiang Ma, Yan Kang, Lixin Fan, Qiang Yang

    Abstract: Gradient boosting decision tree (GBDT) is an ensemble machine learning algorithm, which is widely used in industry, due to its good performance and easy interpretation. Due to the problem of data isolation and the requirement of privacy, many works try to use vertical federated learning to train machine learning models collaboratively with privacy guarantees between different data owners. SecureBo…

    Submitted 18 June, 2024; v1 submitted 21 October, 2021; originally announced October 2021.

  39. arXiv:2110.02421  [pdf, other]

    cs.LG cs.AI

    Explaining Off-Policy Actor-Critic From A Bias-Variance Perspective

    Authors: Ting-Han Fan, Peter J. Ramadge

    Abstract: Off-policy Actor-Critic algorithms have demonstrated phenomenal experimental performance but still require better explanations. To this end, we show its policy evaluation error on the distribution of transitions decomposes into: a Bellman error, a bias from policy mismatch, and a variance term from sampling. By comparing the magnitude of bias and variance, we explain the success of the Emphasizing…

    Submitted 5 October, 2021; originally announced October 2021.

  40. arXiv:2109.08512  [pdf, other]

    cs.LG cs.AI

    Soft Actor-Critic With Integer Actions

    Authors: Ting-Han Fan, Yubo Wang

    Abstract: Reinforcement learning is well-studied under discrete actions. Integer actions setting is popular in the industry yet still challenging due to its high dimensionality. To this end, we study reinforcement learning under integer actions by incorporating the Soft Actor-Critic (SAC) algorithm with an integer reparameterization. Our key observation for integer actions is that their discrete structure c…

    Submitted 14 March, 2022; v1 submitted 17 September, 2021; originally announced September 2021.

    Comments: The 2022 American Control Conference (ACC)

  41. arXiv:2109.03970  [pdf, other]

    cs.LG cs.AI

    PowerGym: A Reinforcement Learning Environment for Volt-Var Control in Power Distribution Systems

    Authors: Ting-Han Fan, Xian Yeow Lee, Yubo Wang

    Abstract: We introduce PowerGym, an open-source reinforcement learning environment for Volt-Var control in power distribution systems. Following OpenAI Gym APIs, PowerGym targets minimizing power loss and voltage violations under physical networked constraints. PowerGym provides four distribution systems (13Bus, 34Bus, 123Bus, and 8500Node) based on IEEE benchmark systems and design variants for various con…

    Submitted 14 March, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: The 4th Annual Learning for Dynamics & Control Conference (L4DC) 2022

  42. arXiv:2108.00083  [pdf, other]

    cs.RO math.OC

    Majorization Minimization Methods for Distributed Pose Graph Optimization

    Authors: Taosha Fan, Todd Murphey

    Abstract: We consider the problem of distributed pose graph optimization (PGO) that has important applications in multi-robot simultaneous localization and mapping (SLAM). We propose the majorization minimization (MM) method for distributed PGO ($\mathsf{MM-PGO}$) that applies to a broad class of robust loss kernels. The $\mathsf{MM-PGO}$ method is guaranteed to converge to first-order critical points under…

    Submitted 23 January, 2023; v1 submitted 30 July, 2021; originally announced August 2021.

    Comments: 33 pages

  43. arXiv:2105.13965  [pdf, other]

    cs.CV cs.RO

    Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation

    Authors: Taosha Fan, Kalyan Vasudev Alwala, Donglai Xiang, Weipeng Xu, Todd Murphey, Mustafa Mukadam

    Abstract: We propose a novel sparse constrained formulation and from it derive a real-time optimization method for 3D human pose and shape estimation. Our optimization method, SCOPE (Sparse Constrained Optimization for 3D human Pose and shapE estimation), is orders of magnitude faster (avg. 4 ms convergence) than existing optimization methods, while being mathematically equivalent to their dense unconstrain…

    Submitted 4 October, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: 21 pages, including appendix

  44. arXiv:2104.11674  [pdf, other]

    cs.LG q-bio.QM

    Genetic Constrained Graph Variational Autoencoder for COVID-19 Drug Discovery

    Authors: Tianyue Cheng, Tianchi Fan, Landi Wang

    Abstract: In the past several months, COVID-19 has spread over the globe and caused severe damage to the people and the society. In the context of this severe situation, an effective drug discovery method to generate potential drugs is extremely meaningful. In this paper, we provide a methodology of discovering potential drugs for the treatment of Severe Acute Respiratory Syndrome Corona-Virus 2 (commonly k…

    Submitted 23 April, 2021; originally announced April 2021.

  45. arXiv:2103.03505  [pdf]

    econ.EM cs.LG

    Prediction of financial time series using LSTM and data denoising methods

    Authors: Qi Tang, Tongmei Fan, Ruchen Shi, Jingyan Huang, Yidan Ma

    Abstract: In order to further overcome the difficulties of the existing models in dealing with the non-stationary and nonlinear characteristics of high-frequency financial time series data, especially its weak generalization ability, this paper proposes an ensemble method based on data denoising methods, including the wavelet transform (WT) and singular spectrum analysis (SSA), and long-term short-term memo…

    Submitted 5 March, 2021; originally announced March 2021.

  46. arXiv:2012.10099  [pdf, other]

    cs.RO

    Crowd-Driven Mapping, Localization and Planning

    Authors: Tingxiang Fan, Dawei Wang, Wenxi Liu, Jia Pan

    Abstract: Navigation in dense crowds is a well-known open problem in robotics with many challenges in mapping, localization, and planning. Traditional solutions consider dense pedestrians as passive/active moving obstacles that are the cause of all troubles: they negatively affect the sensing of static scene landmarks and must be actively avoided for safety. In this paper, we provide a new perspective: the…

    Submitted 3 January, 2021; v1 submitted 18 December, 2020; originally announced December 2020.

    Comments: Accepted to ISER 2020

  47. arXiv:2012.02709  [pdf, other]

    math.OC cs.RO

    Generalized Proximal Methods for Pose Graph Optimization

    Authors: Taosha Fan, Todd Murphey

    Abstract: In this paper, we generalize proximal methods that were originally designed for convex optimization on normed vector space to non-convex pose graph optimization (PGO) on special Euclidean groups, and show that our proposed generalized proximal methods for PGO converge to first-order critical points. Furthermore, we propose methods that significantly accelerate the rates of convergence almost witho…

    Submitted 4 May, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

    Comments: 29 pages

    Journal ref: International Symposium on Robotics Research (ISRR), 2019

  48. arXiv:2009.08586  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    A Contraction Approach to Model-based Reinforcement Learning

    Authors: Ting-Han Fan, Peter J. Ramadge

    Abstract: Despite its experimental success, Model-based Reinforcement Learning still lacks a complete theoretical understanding. To this end, we analyze the error in the cumulative reward using a contraction approach. We consider both stochastic and deterministic state transitions for continuous (non-discrete) state and action spaces. This approach doesn't require strong assumptions and can recover the typi…

    Submitted 25 February, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: The 24th International Conference on Artificial Intelligence and Statistics (AISTATS) 2021

  49. arXiv:2008.08889  [pdf, other]

    cs.RO

    Autonomous Social Distancing in Urban Environments using a Quadruped Robot

    Authors: Tingxiang Fan, Zhiming Chen, Xuan Zhao, Jing Liang, Cong Shen, Dinesh Manocha, Jia Pan, Wei Zhang

    Abstract: COVID-19 pandemic has become a global challenge faced by people all over the world. Social distancing has been proved to be an effective practice to reduce the spread of COVID-19. Against this backdrop, we propose that the surveillance robots can not only monitor but also promote social distancing. Robots can be flexibly deployed and they can take precautionary actions to remind people of practici…

    Submitted 20 August, 2020; originally announced August 2020.

  50. arXiv:2007.13393  [pdf, other]

    cs.CV

    Ladybird: Quasi-Monte Carlo Sampling for Deep Implicit Field Based 3D Reconstruction with Symmetry

    Authors: Yifan Xu, Tianqi Fan, Yi Yuan, Gurprit Singh

    Abstract: Deep implicit field regression methods are effective for 3D reconstruction from single-view images. However, the impact of different sampling patterns on the reconstruction quality is not well-understood. In this work, we first study the effect of point set discrepancy on the network training. Based on Farthest Point Sampling algorithm, we propose a sampling scheme that theoretically encourages be…

    Submitted 27 July, 2020; originally announced July 2020.

    Comments: European Conference on Computer Vision 2020 (ECCV 2020)