Skip to main content

Showing 1–50 of 433 results for author: Tian, X

.
  1. arXiv:2406.18585  [pdf, other

    cs.CV cs.AI

    Flexible ViG: Learning the Self-Saliency for Flexible Object Recognition

    Authors: Lin Zuo, Kunshan Yang, Xianlong Tian, Kunbin He, Yongqi Ding, Mengmeng **g

    Abstract: Existing computer vision methods mainly focus on the recognition of rigid objects, whereas the recognition of flexible objects remains unexplored. Recognizing flexible objects poses significant challenges due to their inherently diverse shapes and sizes, translucent attributes, ambiguous boundaries, and subtle inter-class differences. In this paper, we claim that these problems primarily arise fro… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: under review

  2. arXiv:2406.14264  [pdf, other

    eess.IV cs.CV

    Zero-Shot Image Denoising for High-Resolution Electron Microscopy

    Authors: Xuanyu Tian, Zhuoya Dong, Xiyue Lin, Yue Gao, Hongjiang Wei, Yanhang Ma, **gyi Yu, Yuyao Zhang

    Abstract: High-resolution electron microscopy (HREM) imaging technique is a powerful tool for directly visualizing a broad range of materials in real-space. However, it faces challenges in denoising due to ultra-low signal-to-noise ratio (SNR) and scarce data availability. In this work, we propose Noise2SR, a zero-shot self-supervised learning (ZS-SSL) denoising framework for HREM. Within our framework, we… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 12 pages, 12 figures

  3. arXiv:2406.13340  [pdf, other

    cs.CL cs.SD eess.AS

    SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words

    Authors: Junyi Ao, Yuancheng Wang, Xiaohai Tian, Dekun Chen, Jun Zhang, Lu Lu, Yuxuan Wang, Haizhou Li, Zhizheng Wu

    Abstract: Speech encompasses a wealth of information, including but not limited to content, paralinguistic, and environmental information. This comprehensive nature of speech significantly impacts communication and is crucial for human-computer interaction. Chat-Oriented Large Language Models (LLMs), known for their general-purpose assistance capabilities, have evolved to handle multi-modal inputs, includin… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  4. arXiv:2406.11890  [pdf, other

    cs.LG cs.AI cs.CL

    Unraveling the Mechanics of Learning-Based Demonstration Selection for In-Context Learning

    Authors: Hui Liu, Wenya Wang, Hao Sun, Chris Xing Tian, Chenqi Kong, Xin Dong, Haoliang Li

    Abstract: Large Language Models (LLMs) have demonstrated impressive in-context learning (ICL) capabilities from few-shot demonstration exemplars. While recent learning-based demonstration selection methods have proven beneficial to ICL by choosing more useful exemplars, their underlying mechanisms are opaque, hindering efforts to address limitations such as high training costs and poor generalization across… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  5. arXiv:2406.08782  [pdf, other

    eess.IV cs.CV

    Hybrid Spatial-spectral Neural Network for Hyperspectral Image Denoising

    Authors: Hao Liang, Chengjie, Kun Li, Xin Tian

    Abstract: Hyperspectral image (HSI) denoising is an essential procedure for HSI applications. Unfortunately, the existing Transformer-based methods mainly focus on non-local modeling, neglecting the importance of locality in image denoising. Moreover, deep learning methods employ complex spectral learning mechanisms, thus introducing large computation costs. To address these problems, we propose a hybrid… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  6. arXiv:2406.08343  [pdf, other

    cs.AR cs.AI cs.ET cs.NE

    Continuous-Time Digital Twin with Analogue Memristive Neural Ordinary Differential Equation Solver

    Authors: Hegan Chen, Jichang Yang, Jia Chen, Songqi Wang, Shaocong Wang, Dingchen Wang, Xinyu Tian, Yifei Yu, Xi Chen, Yinan Lin, Yangu He, Xiaoshan Wu, Yi Li, Xinyuan Zhang, Ning Lin, Meng Xu, Yi Li, Xumeng Zhang, Zhongrui Wang, Han Wang, Dashan Shang, Qi Liu, Kwang-Ting Cheng, Ming Liu

    Abstract: Digital twins, the cornerstone of Industry 4.0, replicate real-world entities through computer models, revolutionising fields such as manufacturing management and industrial automation. Recent advances in machine learning provide data-driven methods for develo** digital twins using discrete-time data and finite-depth models on digital computers. However, this approach fails to capture the underl… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 14 pages, 4 figures

  7. arXiv:2405.18955  [pdf, other

    cs.CV

    RGB-T Object Detection via Group Shuffled Multi-receptive Attention and Multi-modal Supervision

    Authors: **zhong Wang, Xuetao Tian, Shun Dai, Tao Zhuo, Haorui Zeng, Hongjuan Liu, Jiaqi Liu, Xiuwei Zhang, Yanning Zhang

    Abstract: Multispectral object detection, utilizing both visible (RGB) and thermal infrared (T) modals, has garnered significant attention for its robust performance across diverse weather and lighting conditions. However, effectively exploiting the complementarity between RGB-T modals while maintaining efficiency remains a critical challenge. In this paper, a very simple Group Shuffled Multi-receptive Atte… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  8. arXiv:2405.18203  [pdf, other

    cs.CL

    IAPT: Instruction-Aware Prompt Tuning for Large Language Models

    Authors: Wei Zhu, Aaron Xuxiang Tian, Congrui Yin, Yuan Ni, Xiaoling Wang, Guotong Xie

    Abstract: Soft prompt tuning is a widely studied parameter-efficient fine-tuning method. However, it has a clear drawback: many soft tokens must be inserted into the input sequences to guarantee downstream performance. As a result, soft prompt tuning is less considered than Low-rank adaptation (LoRA) in the large language modeling (LLM) era. In this work, we propose a novel prompt tuning method, Instruction… ▽ More

    Submitted 7 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted by ACL-2024

  9. arXiv:2405.16837  [pdf, ps, other

    stat.ML cs.LG

    Enhancing Accuracy in Generative Models via Knowledge Transfer

    Authors: Xinyu Tian, Xiaotong Shen

    Abstract: This paper investigates the accuracy of generative models and the impact of knowledge transfer on their generation precision. Specifically, we examine a generative model for a target task, fine-tuned using a pre-trained model from a source task. Building on the "Shared Embedding" concept, which bridges the source and target tasks, we introduce a novel framework for transfer learning under distribu… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  10. arXiv:2405.13350  [pdf, other

    cs.CL cs.LG

    Efficacy of ByT5 in Multilingual Translation of Biblical Texts for Underrepresented Languages

    Authors: Corinne Aars, Lauren Adams, Xiaokan Tian, Zhaoyu Wang, Colton Wismer, Jason Wu, Pablo Rivas, Korn Sooksatra, Matthew Fendt

    Abstract: This study presents the development and evaluation of a ByT5-based multilingual translation model tailored for translating the Bible into underrepresented languages. Utilizing the comprehensive Johns Hopkins University Bible Corpus, we trained the model to capture the intricate nuances of character-based and morphologically rich languages. Our results, measured by the BLEU score and supplemented w… ▽ More

    Submitted 30 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: LXAI Workshop at the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024)

    ACM Class: I.2.7

  11. arXiv:2405.11485  [pdf

    cond-mat.mtrl-sci

    Evidence for Multiferroicity in Single-Layer CuCrSe$_2$

    Authors: Zhenyu Sun, Yueqi Su, Aomiao Zhi, Zhicheng Gao, Xu Han, Kang Wu, Lihong Bao, Yuan Huang, Youguo Shi, Xuedong Bai, Peng Cheng, Lan Chen, Kehui Wu, Xuezeng Tian, Changzheng Wu, Baojie Feng

    Abstract: Multiferroic materials, which simultaneously exhibit ferroelectricity and magnetism, have attracted substantial attention due to their fascinating physical properties and potential technological applications. With the trends towards device miniaturization, there is an increasing demand for the persistence of multiferroicity in single-layer materials at elevated temperatures. Here, we report high-t… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Journal ref: Nature Communications 15, 4252 (2024)

  12. arXiv:2405.05702  [pdf, other

    cs.RO

    NGM-SLAM: Gaussian Splatting SLAM with Radiance Field Submap

    Authors: Mingrui Li, **gwei Huang, Lei Sun, Aaron Xuxiang Tian, Tianchen Deng, Hongyu Wang

    Abstract: SLAM systems based on Gaussian Splatting have garnered attention due to their capabilities for rapid real-time rendering and high-fidelity map**. However, current Gaussian Splatting SLAM systems usually struggle with large scene representation and lack effective loop closure detection. To address these issues, we introduce NGM-SLAM, the first 3DGS based SLAM system that utilizes neural radiance… ▽ More

    Submitted 28 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: 9pages, 4 figures

  13. arXiv:2405.02807  [pdf

    cs.LG cs.AI cs.CV

    Kinematic analysis of structural mechanics based on convolutional neural network

    Authors: Leye Zhang, Xiangxiang Tian, Hongjun Zhang

    Abstract: Attempt to use convolutional neural network to achieve kinematic analysis of plane bar structure. Through 3dsMax animation software and OpenCV module, self-build image dataset of geometrically stable system and geometrically unstable system. we construct and train convolutional neural network model based on the TensorFlow and Keras deep learning platform framework. The model achieves 100% accuracy… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: 9 pages, 13 figures

  14. arXiv:2405.01189  [pdf, other

    cs.LG cs.AI

    Gradient-Congruity Guided Federated Sparse Training

    Authors: Chris Xing Tian, Yibing Liu, Haoliang Li, Ray C. C. Cheung, Shiqi Wang

    Abstract: Edge computing allows artificial intelligence and machine learning models to be deployed on edge devices, where they can learn from local data and collaborate to form a global model. Federated learning (FL) is a distributed machine learning technique that facilitates this process while preserving data privacy. However, FL also faces challenges such as high computational and communication costs reg… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  15. arXiv:2404.17890  [pdf, other

    eess.IV cs.AI cs.CV

    DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction

    Authors: Chenhe Du, Xiyue Lin, Qing Wu, Xuanyu Tian, Ying Su, Zhe Luo, Hongjiang Wei, S. Kevin Zhou, **gyi Yu, Yuyao Zhang

    Abstract: Limited-angle and sparse-view computed tomography (LACT and SVCT) are crucial for expanding the scope of X-ray CT applications. However, they face challenges due to incomplete data acquisition, resulting in diverse artifacts in the reconstructed CT images. Emerging implicit neural representation (INR) techniques, such as NeRF, NeAT, and NeRP, have shown promise in under-determined CT imaging recon… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: 15 pages, 10 figures

    ACM Class: I.2.10; I.4.5

  16. arXiv:2404.12826  [pdf, ps, other

    math.FA

    Semi-harmonious and harmonious quasi-projection pairs on Hilbert $C^*$-modules

    Authors: Xiaoyi Tian, Qingxiang Xu, Chunhong Fu

    Abstract: For each adjointable idempotent $Q$ on a Hilbert $C^*$-module $H$, a specific projection $m(Q)$ called the matched projection of $Q$ was introduced recently due to the characterization of the minimum value among all the distances from projections to $Q$. Inspired by the relationship between $m(Q)$ and $Q$, another term called the quasi-projection pair $(P,Q)$ was also introduced recently, where… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2305.12984

    MSC Class: 46L08; 47A05

  17. arXiv:2404.12759  [pdf, other

    cs.LG

    decoupleQ: Towards 2-bit Post-Training Uniform Quantization via decoupling Parameters into Integer and Floating Points

    Authors: Yi Guo, Fanliu Kong, Xiaoyang Li, Hui Li, Wei Chen, Xiaogang Tian, **** Cai, Yang Zhang, Shouda Liu

    Abstract: Quantization emerges as one of the most promising compression technologies for deploying efficient large models for various real time application in recent years. Considering that the storage and IO of weights take up the vast majority of the overhead inside a large model, weight only quantization can lead to large gains. However, existing quantization schemes suffer from significant accuracy degr… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: quantization for deep models

  18. arXiv:2404.09079  [pdf, ps, other

    math.AP

    Compactness results for a Dirichlet energy of nonlocal gradient with applications

    Authors: Zhaolong Han, Tadele Mengesha, Xiaochuan Tian

    Abstract: We prove two compactness results for function spaces with finite Dirichlet energy of half-space nonlocal gradients. In each of these results, we provide sufficient conditions on a sequence of kernel functions that guarantee the asymptotic compact embedding of the associated nonlocal function spaces into the class of square-integrable functions. Moreover, we will demonstrate that the sequence of no… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

  19. arXiv:2404.03523  [pdf

    cs.CE

    Integrating Generative AI into Financial Market Prediction for Improved Decision Making

    Authors: Chang Che, Zengyi Huang, Chen Li, Haotian Zheng, Xinyu Tian

    Abstract: This study provides an in-depth analysis of the model architecture and key technologies of generative artificial intelligence, combined with specific application cases, and uses conditional generative adversarial networks ( cGAN ) and time series analysis methods to simulate and predict dynamic changes in financial markets. The research results show that the cGAN model can effectively capture the… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  20. arXiv:2404.03433  [pdf, ps, other

    math.FA

    The operator distances from projections to an idempotent

    Authors: Xiaofeng Zhang, Xiaoyi Tian, Qingxiang Xu

    Abstract: The main purpose of this paper is to give a full characterization of the operator distances from projections to an idempotent, which includes the minimum value, the maximum value and the intermediate values. Let $H$ be a Hilbert space and $\mathbb{B}(H)$ be the set of bounded linear operators on $H$. Given an arbitrary idempotent $Q\in \mathbb{B}(H)$, it is proved that… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    MSC Class: 47A05

  21. arXiv:2404.02082  [pdf, other

    cs.CV

    WcDT: World-centric Diffusion Transformer for Traffic Scene Generation

    Authors: Chen Yang, Aaron Xuxiang Tian, Dong Chen, Tianyu Shi, Arsalan Heydarian

    Abstract: In this paper, we introduce a novel approach for autonomous driving trajectory generation by harnessing the complementary strengths of diffusion probabilistic models (a.k.a., diffusion models) and transformers. Our proposed framework, termed the "World-Centric Diffusion Transformer" (WcDT), optimizes the entire trajectory generation process, from feature extraction to model inference. To enhance t… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 12 pages, 6 figures

  22. arXiv:2404.00730  [pdf, other

    physics.atom-ph quant-ph

    Long-range dipole-dipole exchange-induced atomic grating

    Authors: Xuan-Qian Bao, Xue-Dong Tian, Dong-Xiao Li, Yi-Mou Liu

    Abstract: We propose a theoretical scheme for dipole exchange-induced grating (DEIG) based on a hybrid system consisting of ultra-cold Rubidium ($^{87}$Rb) atomic ensemble and movable Rydberg spin atoms. The optical response of the grating appears as a superposition of three- and four-level configurations, similar to the cooperative optical nonlinear effect caused by the dipole blockade effect. However, suc… ▽ More

    Submitted 2 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  23. arXiv:2403.16187  [pdf, other

    cs.CL

    ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language Models

    Authors: Zequan Liu, Jiawen Lyn, Wei Zhu, Xing Tian, Yvette Graham

    Abstract: Parameter-efficient fine-tuning (PEFT) is widely studied for its effectiveness and efficiency in the era of large language models. Low-rank adaptation (LoRA) has demonstrated commendable performance as a popular and representative method. However, it is implemented with a fixed intrinsic rank that might not be the ideal setting for the downstream tasks. Recognizing the need for more flexible downs… ▽ More

    Submitted 15 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted by NAACL-2024

  24. arXiv:2403.14057  [pdf

    cond-mat.supr-con cond-mat.str-el

    Exploring Fermi Surface Nesting and the Nature of Heavy Quasiparticles in the Spin-Triplet Superconductor Candidate CeRh$_2$As$_2$

    Authors: Bo Chen, Hao Liu, Qi-Yi Wu, Chen Zhang, Xue-Qing Ye, Yin-Zou Zhao, Jiao-Jiao Song, Xin-Yi Tian, Ba-Lei Tan, Zheng-Tai Liu, Mao Ye, Zhen-Hua Chen, Yao-Bo Huang, Da-Wei Shen, Ya-Hua Yuan, Jun He, Yu-Xia Duan, Jian-Qiao Meng

    Abstract: In this study, we investigate the electronic structure of a spin-triplet superconductor candidate CeRh$_2$As$_2$ using high-resolution angle-resolved photoemission spectroscopy and density functional theory calculations. Notably, Fermi surface nesting hints at connections to magnetic excitation or quadrupole density wave phenomena, elucidating the superconducting mechanisms. Measured band structur… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 6 pages, 3 figures

  25. arXiv:2403.11405  [pdf, other

    eess.SP

    A Deep Learning Method for Beat-Level Risk Analysis and Interpretation of Atrial Fibrillation Patients during Sinus Rhythm

    Authors: Jun Lei, Yuxi Zhou, Xue Tian, Qinghao Zhao, Qi Zhang, Shijia Geng, Qingbo Wu, Shenda Hong

    Abstract: Atrial Fibrillation (AF) is a common cardiac arrhythmia. Many AF patients experience complications such as stroke and other cardiovascular issues. Early detection of AF is crucial. Existing algorithms can only distinguish ``AF rhythm in AF patients'' from ``sinus rhythm in normal individuals'' . However, AF patients do not always exhibit AF rhythm, posing a challenge for diagnosis when the AF rhyt… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  26. arXiv:2403.09996  [pdf, other

    cs.CV

    MEDPNet: Achieving High-Precision Adaptive Registration for Complex Die Castings

    Authors: Yu Du, Yu Song, Ce Guo, Xiao**g Tian, Dong Liu, Ming Cong

    Abstract: Due to their complex spatial structure and diverse geometric features, achieving high-precision and robust point cloud registration for complex Die Castings has been a significant challenge in the die-casting industry. Existing point cloud registration methods primarily optimize network models using well-established high-quality datasets, often neglecting practical application in real scenarios. T… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  27. arXiv:2403.09412  [pdf, other

    cs.CV cs.AI cs.RO

    OpenGraph: Open-Vocabulary Hierarchical 3D Graph Representation in Large-Scale Outdoor Environments

    Authors: Yinan Deng, Jiahui Wang, **gyu Zhao, Xinyu Tian, Guangyan Chen, Yi Yang, Yufeng Yue

    Abstract: Environment representations endowed with sophisticated semantics are pivotal for facilitating seamless interaction between robots and humans, enabling them to effectively carry out various tasks. Open-vocabulary maps, powered by Visual-Language models (VLMs), possess inherent advantages, including zero-shot learning and support for open-set classes. However, existing open-vocabulary maps are prima… ▽ More

    Submitted 28 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  28. arXiv:2403.05801  [pdf, other

    cs.AI

    Enhancing Multi-Hop Knowledge Graph Reasoning through Reward Sha** Techniques

    Authors: Chen Li, Haotian Zheng, Yi** Sun, Cangqing Wang, Liqiang Yu, Che Chang, Xinyu Tian, Bo Liu

    Abstract: In the realm of computational knowledge representation, Knowledge Graph Reasoning (KG-R) stands at the forefront of facilitating sophisticated inferential capabilities across multifarious domains. The quintessence of this research elucidates the employment of reinforcement learning (RL) strategies, notably the REINFORCE algorithm, to navigate the intricacies inherent in multi-hop KG-R. This invest… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: This paper has been accepted by the 2024 5th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT 2024)

  29. arXiv:2402.14430  [pdf, other

    cs.LG

    Robust Training of Federated Models with Extremely Label Deficiency

    Authors: Yonggang Zhang, Zhiqin Yang, Xinmei Tian, Nannan Wang, Tongliang Liu, Bo Han

    Abstract: Federated semi-supervised learning (FSSL) has emerged as a powerful paradigm for collaboratively training machine learning models using distributed data with label deficiency. Advanced FSSL methods predominantly focus on training a single model on each client. However, this approach could lead to a discrepancy between the objective functions of labeled and unlabeled data, resulting in gradient con… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: ICLR 2024, 22 pages

  30. arXiv:2402.14155  [pdf, other

    cs.CL cs.AI

    Can Similarity-Based Domain-Ordering Reduce Catastrophic Forgetting for Intent Recognition?

    Authors: Amogh Mannekote, Xiaoyi Tian, Kristy Elizabeth Boyer, Bonnie J. Dorr

    Abstract: Task-oriented dialogue systems are expected to handle a constantly expanding set of intents and domains even after they have been deployed to support more and more functionalities. To live up to this expectation, it becomes critical to mitigate the catastrophic forgetting problem (CF) that occurs in continual learning (CL) settings for a task such as intent recognition. While existing dialogue sys… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  31. arXiv:2402.12289  [pdf, other

    cs.CV

    DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models

    Authors: Xiaoyu Tian, Junru Gu, Bailin Li, Yicheng Liu, Yang Wang, Zhiyong Zhao, Kun Zhan, Peng Jia, Xianpeng Lang, Hang Zhao

    Abstract: A primary hurdle of autonomous driving in urban environments is understanding complex and long-tail scenarios, such as challenging road conditions and delicate human behaviors. We introduce DriveVLM, an autonomous driving system leveraging Vision-Language Models (VLMs) for enhanced scene understanding and planning capabilities. DriveVLM integrates a unique combination of reasoning modules for scen… ▽ More

    Submitted 25 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Project Page: https://tsinghua-mars-lab.github.io/DriveVLM/

  32. arXiv:2402.11778  [pdf, other

    cs.LG cs.AI

    Towards Theoretical Understandings of Self-Consuming Generative Models

    Authors: Shi Fu, Sen Zhang, Yingjie Wang, Xinmei Tian, Dacheng Tao

    Abstract: This paper tackles the emerging challenge of training generative models within a self-consuming loop, wherein successive generations of models are recursively trained on mixtures of real and synthetic data from previous generations. We construct a theoretical framework to rigorously evaluate how this training procedure impacts the data distributions learned by future models, including parametric a… ▽ More

    Submitted 24 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: Accepted at ICML 2024

  33. arXiv:2402.07749  [pdf, ps, other

    math.NA

    Asymptotically compatible schemes for nonlinear variational models via Gamma-convergence and applications to nonlocal problems

    Authors: Qiang Du, James M. Scott, Xiaochuan Tian

    Abstract: We present a study on asymptotically compatible Galerkin discretizations for a class of parametrized nonlinear variational problems. The abstract analytical framework is based on variational convergence, or Gamma-convergence. We demonstrate the broad applicability of the theoretical framework by develo** asymptotically compatible finite element discretizations of some representative nonlinear no… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  34. arXiv:2402.07011  [pdf, other

    cs.LG cs.AI cs.DC

    FedImpro: Measuring and Improving Client Update in Federated Learning

    Authors: Zhenheng Tang, Yonggang Zhang, Shaohuai Shi, Xinmei Tian, Tongliang Liu, Bo Han, Xiaowen Chu

    Abstract: Federated Learning (FL) models often experience client drift caused by heterogeneous data, where the distribution of data differs across clients. To address this issue, advanced research primarily focuses on manipulating the existing gradients to achieve more consistent client models. In this paper, we present an alternative perspective on client drift and aim to mitigate it by generating improved… ▽ More

    Submitted 14 March, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

  35. arXiv:2401.12264  [pdf, other

    eess.AS cs.MM cs.SD eess.IV

    CoAVT: A Cognition-Inspired Unified Audio-Visual-Text Pre-Training Model for Multimodal Processing

    Authors: Xianghu Yue, Xiaohai Tian, Lu Lu, Malu Zhang, Zhizheng Wu, Haizhou Li

    Abstract: There has been a long-standing quest for a unified audio-visual-text model to enable various multimodal understanding tasks, which mimics the listening, seeing and reading process of human beings. Humans tends to represent knowledge using two separate systems: one for representing verbal (textual) information and one for representing non-verbal (visual and auditory) information. These two systems… ▽ More

    Submitted 21 February, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

  36. arXiv:2401.04129  [pdf, ps, other

    math.AP

    Gradient stability of Caffarelli-Kohn-Nirenberg inequality involving weighted p-Laplace

    Authors: Shengbing Deng, Xingliang Tian

    Abstract: The best constant and extremal functions are well known of the following Caffarelli-Kohn-Nirenberg inequality \[ \int_{\mathbb{R}^N}|\nabla u|^p\frac{\mathrm{d}x}{|x|^μ}\geq \mathcal{S} \left(\int_{\mathbb{R}^N}|u|^r\frac{\mathrm{d}x}{|x|^s} \right)^{\frac{p}{r}}, \quad \mbox{for all}\quad u\in C^\infty_c(\mathbb{R}^N), \] where $1<p<p+μ<N$, $\fracμ{p}\leq \frac{s}{r}<\fracμ{p}+1$,… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 38 pages. Any suggestions and comments are welcome! arXiv admin note: text overlap with arXiv:2308.04111

  37. arXiv:2401.02777  [pdf, other

    cs.CL cs.AI

    From LLM to Conversational Agent: A Memory Enhanced Architecture with Fine-Tuning of Large Language Models

    Authors: Na Liu, Liangyu Chen, Xiaoyu Tian, Wei Zou, Kaijiang Chen, Ming Cui

    Abstract: This paper introduces RAISE (Reasoning and Acting through Scratchpad and Examples), an advanced architecture enhancing the integration of Large Language Models (LLMs) like GPT-4 into conversational agents. RAISE, an enhancement of the ReAct framework, incorporates a dual-component memory system, mirroring human short-term and long-term memory, to maintain context and continuity in conversations. I… ▽ More

    Submitted 30 January, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

  38. arXiv:2401.02034  [pdf, other

    cs.CL

    Text2MDT: Extracting Medical Decision Trees from Medical Texts

    Authors: Wei Zhu, Wenfeng Li, Xing Tian, Pengfei Wang, Xiaoling Wang, ** Chen, Yuanbin Wu, Yuan Ni, Guotong Xie

    Abstract: Knowledge of the medical decision process, which can be modeled as medical decision trees (MDTs), is critical to build clinical decision support systems. However, the current MDT construction methods rely heavily on time-consuming and laborious manual annotation. In this work, we propose a novel task, Text2MDT, to explore the automatic extraction of MDTs from medical texts such as medical guidelin… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  39. arXiv:2401.00464  [pdf, ps, other

    math.AP

    A note on the $L^p$-Sobolev inequality

    Authors: Shengbing Deng, Xingliang Tian

    Abstract: The usual Sobolev inequality in $\mathbb{R}^N$, asserts that $\|\nabla u\|_{L^p(\mathbb{R}^N)} \geq \mathcal{S}\|u\|_{L^{p^*}(\mathbb{R}^N)}$ for $1<p<N$ and $p^*=\frac{pN}{N-p}$, with $\mathcal{S}$ being the sharp constant. This note is concerned, instead, with function restricted to bounded domain $Ω\subset \mathbb{R}^N$. Based on the recent work of Figalli and Zhang [Duke Math. J., 2022], a rem… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

  40. arXiv:2312.17461  [pdf, ps, other

    math.NA

    Gaussian radial basis functions collocation for fractional PDEs: methodology and error analysis

    Authors: Xiaochuan Tian, Yixuan Wu, Yanzhi Zhang

    Abstract: The paper introduces a new meshfree pseudospectral method based on Gaussian radial basis functions (RBFs) collocation to solve fractional Poisson equations. Hypergeometric functions are used to represent the fractional Laplacian of Gaussian RBFs, enabling an efficient computation of stiffness matrix entries. Unlike existing RBF-based methods, our approach ensures a Toeplitz structure in the stiffn… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  41. arXiv:2312.13305  [pdf, other

    cs.CV

    DVIS++: Improved Decoupled Framework for Universal Video Segmentation

    Authors: Tao Zhang, Xingye Tian, Yikang Zhou, Shun** Ji, Xuebo Wang, Xin Tao, Yuan Zhang, Pengfei Wan, Zhongyuan Wang, Yu Wu

    Abstract: We present the \textbf{D}ecoupled \textbf{VI}deo \textbf{S}egmentation (DVIS) framework, a novel approach for the challenging task of universal video segmentation, including video instance segmentation (VIS), video semantic segmentation (VSS), and video panoptic segmentation (VPS). Unlike previous methods that model video segmentation in an end-to-end manner, our approach decouples video segmentat… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  42. arXiv:2312.11413  [pdf, other

    cs.LG cs.AI

    DeRDaVa: Deletion-Robust Data Valuation for Machine Learning

    Authors: Xiao Tian, Rachael Hwee Ling Sim, Jue Fan, Bryan Kian Hsiang Low

    Abstract: Data valuation is concerned with determining a fair valuation of data from data sources to compensate them or to identify training examples that are the most or least useful for predictions. With the rising interest in personal data ownership and data protection regulations, model owners will likely have to fulfil more data deletion requests. This raises issues that have not been addressed by exis… ▽ More

    Submitted 21 January, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  43. arXiv:2312.10315  [pdf, ps, other

    math.NA

    A neural network kernel decomposition for learning multiple steady states in parameterized dynamical systems

    Authors: Yimeng Zhang, Alexander Cloninger, Bo Li, Xiaochuan Tian

    Abstract: We develop a machine learning approach to identifying parameters with steady-state solutions, locating such solutions, and determining their linear stability for systems of ordinary differential equations and dynamical systems with parameters. Our approach begins with the construction of target functions that can be used to identify parameters with steady-state solution and the linear stability of… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  44. arXiv:2312.07514  [pdf, other

    cs.RO

    Integrated and Lightweight Design of Electro-hydraulic Ankle Prosthesis

    Authors: Yi Wei, Xingjian Wang, Xinyu Tian, Shao** Wang, Rujun Jia

    Abstract: For lower limb amputees, an active ankle joint prosthesis can provide basic mobility functions. This study focuses on an ankle joint prosthesis system based on the principle of electric-hydraulic actuation. By analyzing the characteristics of human gait cycles and the mechanics of ankle joint movement, a lightweight and integrated ankle joint prosthesis is designed, considering the requirements fo… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 8 pages, 21 figures, conference

  45. arXiv:2312.07257  [pdf, ps, other

    math.FA math.OA

    The generalized polar decomposition, the weak complementarity and the parallel sum for adjointable operators on Hilbert $C^*$-modules

    Authors: Xiaofeng Zhang, Xiaoyi Tian, Qingxiang Xu

    Abstract: This paper deals mainly with some aspects of the adjointable operators on Hilbert $C^*$-modules. A new tool called the generalized polar decomposition for each adjointable operator is introduced and clarified. As an application, the general theory of the weakly complementable operators is set up in the framework of Hilbert $C^*$-modules. It is proved that there exists an operator equation which ha… ▽ More

    Submitted 24 April, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: This version is accepted for publication in Banach Journal of Mathematical Analysis

    MSC Class: 46L08; 47A05

  46. arXiv:2312.04877  [pdf, other

    cs.CL cs.DB

    Generating Explanations to Understand and Repair Embedding-based Entity Alignment

    Authors: Xiaobin Tian, Zequn Sun, Wei Hu

    Abstract: Entity alignment (EA) seeks identical entities in different knowledge graphs, which is a long-standing task in the database research. Recent work leverages deep learning to embed entities in vector space and align them via nearest neighbor search. Although embedding-based EA has gained marked success in recent years, it lacks explanations for alignment decisions. In this paper, we present the firs… ▽ More

    Submitted 21 March, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: Accepted in the 40th IEEE International Conference on Data Engineering (ICDE 2024)

  47. arXiv:2312.01598  [pdf, other

    cs.CV

    Good Questions Help Zero-Shot Image Reasoning

    Authors: Kaiwen Yang, Tao Shen, Xinmei Tian, Xiubo Geng, Chongyang Tao, Dacheng Tao, Tianyi Zhou

    Abstract: Aligning the recent large language models (LLMs) with computer vision models leads to large vision-language models (LVLMs), which have paved the way for zero-shot image reasoning tasks. However, LVLMs are usually trained on short high-level captions only referring to sparse focus regions in images. Such a ``tunnel vision'' limits LVLMs to exploring other relevant contexts in complex scenes. To add… ▽ More

    Submitted 8 December, 2023; v1 submitted 3 December, 2023; originally announced December 2023.

  48. The Frobenious distances from projections to an idempotent matrix

    Authors: Xiaoyi Tian, Qingxiang Xu, Chunhong Fu

    Abstract: For each pair of matrices $A$ and $B$ with the same order, let $\|A-B\|_F$ denote their Frobenius distance. This paper deals mainly with the Frobenius distances from projections to an idempotent matrix. For every idempotent $Q\in \mathbb{C}^{n\times n}$, a projection $m(Q)$ called the matched projection can be induced. It is proved that $m(Q)$ is the unique projection whose Frobenius distance away… ▽ More

    Submitted 17 December, 2023; v1 submitted 2 December, 2023; originally announced December 2023.

    Journal ref: Linear Algebra Appl. 688 (2024), 21--43

  49. arXiv:2312.01165  [pdf, other

    math.DS math.OC

    Data-driven optimal control with neural network modeling of gradient flows

    Authors: Xu** Tian, Baskar Ganapathysubramanian, Hailiang Liu

    Abstract: Extracting physical laws from observation data is a central challenge in many diverse areas of science and engineering. We propose Optimal Control Neural Networks (OCN) to learn the laws of vector fields in dynamical systems, with no assumption on their analytical form, given data consisting of sampled trajectories. The OCN framework consists of a neural network representation and an optimal contr… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: 28 pages, 8 figures

    MSC Class: 93C15; 49K15

  50. arXiv:2311.16494  [pdf, other

    cs.CV

    ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models

    Authors: Xinyu Tian, Shu Zou, Zhaoyuan Yang, **g Zhang

    Abstract: Although soft prompt tuning is effective in efficiently adapting Vision-Language (V&L) models for downstream tasks, it shows limitations in dealing with distribution shifts. We address this issue with Attribute-Guided Prompt Tuning (ArGue), making three key contributions. 1) In contrast to the conventional approach of directly appending soft prompts preceding class names, we align the model with p… ▽ More

    Submitted 12 March, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted to CVPR2024