Skip to main content

Showing 1–50 of 186 results for author: Tao, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12839  [pdf, other

    cs.LG math.DS math.OC math.PR stat.ML

    Evaluating the design space of diffusion-based generative models

    Authors: Yuqing Wang, Ye He, Molei Tao

    Abstract: Most existing theoretical investigations of the accuracy of diffusion models, albeit significant, assume the score function has been approximated to a certain accuracy, and then use this a priori bound to control the error of generation. This article instead provides a first quantitative understanding of the whole generation process, i.e., both training and sampling. More precisely, it conducts a… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Comments are welcome

  2. arXiv:2406.10556  [pdf, other

    cs.IT cs.AI

    Multi-User Semantic Fusion for Semantic Communications over Degraded Broadcast Channels

    Authors: Tong Wu, Zhiyong Chen, Meixia Tao, Bin Xia, Wenjun Zhang

    Abstract: Degraded broadcast channels (DBC) are a typical multiuser communication scenario, Semantic communications over DBC still lack in-depth research. In this paper, we design a semantic communications approach based on multi-user semantic fusion for wireless image transmission over DBC. In the proposed method, the transmitter extracts semantic features for two users separately. It then effectively fuse… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: accepted by China Communications

  3. arXiv:2406.07915  [pdf, ps, other

    cs.IT eess.SP

    Aggregation Design for Personalized Federated Multi-Modal Learning over Wireless Networks

    Authors: Benshun Yin, Zhiyong Chen, Meixia Tao

    Abstract: Federated Multi-Modal Learning (FMML) is an emerging field that integrates information from different modalities in federated learning to improve the learning performance. In this letter, we develop a parameter scheduling scheme to improve personalized performance and communication efficiency in personalized FMML, considering the non-independent and nonidentically distributed (non-IID) data along… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: accepted by IEEE Communications Letters

  4. arXiv:2406.01937  [pdf, other

    cs.IT eess.SP

    Cramér-Rao Bound Analysis and Beamforming Design for Integrated Sensing and Communication with Extended Targets

    Authors: Yiqiu Wang, Meixia Tao, Shu Sun

    Abstract: This paper studies an integrated sensing and communication (ISAC) system, where a multi-antenna base station transmits beamformed signals for joint downlink multi-user communication and radar sensing of an extended target (ET). By considering echo signals as reflections from valid elements on the ET contour, a set of novel Cramér-Rao bounds (CRBs) is derived for parameter estimation of the ET, inc… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Submitted to IEEE Transactions on Wireless Communications. arXiv admin note: text overlap with arXiv:2312.10641

  5. arXiv:2405.21050  [pdf, other

    cs.CV cs.LG

    Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models

    Authors: Xinxi Zhang, Song Wen, Ligong Han, Felix Juefei-Xu, Akash Srivastava, Junzhou Huang, Hao Wang, Molei Tao, Dimitris N. Metaxas

    Abstract: Adapting large-scale pre-trained generative models in a parameter-efficient manner is gaining traction. Traditional methods like low rank adaptation achieve parameter efficiency by imposing constraints but may not be optimal for tasks requiring high representation capacity. We propose a novel spectrum-aware adaptation framework for generative models. Our method adjusts both singular values and the… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  6. arXiv:2405.20390  [pdf, other

    cs.LG math.NA math.OC stat.ML

    Quantitative Convergences of Lie Group Momentum Optimizers

    Authors: Lingkai Kong, Molei Tao

    Abstract: Explicit, momentum-based dynamics that optimize functions defined on Lie groups can be constructed via variational optimization and momentum trivialization. Structure preserving time discretizations can then turn this dynamics into optimization algorithms. This article investigates two types of discretization, Lie Heavy-Ball, which is a known splitting scheme, and Lie NAG-SC, which is newly propos… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  7. arXiv:2405.16381  [pdf, other

    cs.LG cs.AI stat.ML

    Trivialized Momentum Facilitates Diffusion Generative Modeling on Lie Groups

    Authors: Yuchen Zhu, Tianrong Chen, Lingkai Kong, Evangelos A. Theodorou, Molei Tao

    Abstract: The generative modeling of data on manifold is an important task, for which diffusion models in flat spaces typically need nontrivial adaptations. This article demonstrates how a technique called `trivialization' can transfer the effectiveness of diffusion models in Euclidean spaces to Lie groups. In particular, an auxiliary momentum variable was algorithmically introduced to help transport the po… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  8. arXiv:2405.06105  [pdf, ps, other

    cs.CL

    Can Perplexity Reflect Large Language Model's Ability in Long Text Understanding?

    Authors: Yutong Hu, Quzhe Huang, Mingxu Tao, Chen Zhang, Yansong Feng

    Abstract: Recent studies have shown that Large Language Models (LLMs) have the potential to process extremely long text. Many works only evaluate LLMs' long-text processing ability on the language modeling task, with perplexity (PPL) as the evaluation metric. However, in our study, we find that there is no correlation between PPL and LLMs' long-text understanding ability. Besides, PPL may only reflect the m… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  9. arXiv:2405.03131  [pdf, other

    cs.IT cs.AI cs.LG

    WDMoE: Wireless Distributed Large Language Models with Mixture of Experts

    Authors: Nan Xue, Ya** Sun, Zhiyong Chen, Meixia Tao, Xiaodong Xu, Liang Qian, Shuguang Cui, ** Zhang

    Abstract: Large Language Models (LLMs) have achieved significant success in various natural language processing tasks, but how wireless communications can support LLMs has not been extensively studied. In this paper, we propose a wireless distributed LLMs paradigm based on Mixture of Experts (MoE), named WDMoE, deploying LLMs collaboratively across edge servers of base station (BS) and mobile devices in the… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: submitted to IEEE conference

  10. arXiv:2405.03125  [pdf, other

    cs.IT

    MambaJSCC: Deep Joint Source-Channel Coding with Visual State Space Model

    Authors: Tong Wu, Zhiyong Chen, Meixia Tao, Xiaodong Xu, Wenjun Zhang, ** Zhang

    Abstract: Lightweight and efficient deep joint source-channel coding (JSCC) is a key technology for semantic communications. In this paper, we design a novel JSCC scheme named MambaJSCC, which utilizes a visual state space model with channel adaptation (VSSM-CA) block as its backbone for transmitting images over wireless channels. The VSSM-CA block utilizes VSSM to integrate two-dimensional images with the… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: submitted to IEEE conference

  11. arXiv:2404.06336  [pdf, other

    quant-ph cs.LG stat.ML

    Quantum State Generation with Structure-Preserving Diffusion Model

    Authors: Yuchen Zhu, Tianrong Chen, Evangelos A. Theodorou, Xie Chen, Molei Tao

    Abstract: This article considers the generative modeling of the (mixed) states of quantum systems, and an approach based on denoising diffusion model is proposed. The key contribution is an algorithmic innovation that respects the physical nature of quantum states. More precisely, the commonly used density matrix representation of mixed-state has to be complex-valued Hermitian, positive semi-definite, and t… ▽ More

    Submitted 25 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  12. arXiv:2404.05979  [pdf, other

    cs.CV

    StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion

    Authors: Ming Tao, Bing-Kun Bao, Hao Tang, Yaowei Wang, Changsheng Xu

    Abstract: Story visualization aims to generate a series of realistic and coherent images based on a storyline. Current models adopt a frame-by-frame architecture by transforming the pre-trained text-to-image model into an auto-regressive manner. Although these models have shown notable progress, there are still three flaws. 1) The unidirectional generation of auto-regressive manner restricts the usability i… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 17 pages

  13. arXiv:2404.01663  [pdf, other

    cs.CL

    CMAT: A Multi-Agent Collaboration Tuning Framework for Enhancing Small Language Models

    Authors: Xuechen Liang, Meiling Tao, Tianyu Shi, Yiting Xie

    Abstract: Open large language models (LLMs) have significantly advanced the field of natural language processing, showcasing impressive performance across various tasks.Despite the significant advancements in LLMs, their effective operation still relies heavily on human input to accurately guide the dialogue flow, with agent tuning being a crucial optimization technique that involves human adjustments to th… ▽ More

    Submitted 4 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  14. arXiv:2403.12012  [pdf, other

    math.ST cs.LG math.NA math.PR stat.ML

    Convergence of Kinetic Langevin Monte Carlo on Lie groups

    Authors: Lingkai Kong, Molei Tao

    Abstract: Explicit, momentum-based dynamics for optimizing functions defined on Lie groups was recently constructed, based on techniques such as variational optimization and left trivialization. We appropriately add tractable noise to the optimization dynamics to turn it into a sampling dynamics, leveraging the advantageous feature that the trivialized momentum variable is Euclidean despite that the potenti… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  15. arXiv:2403.07652  [pdf, other

    cs.LG cs.CL

    Harder Tasks Need More Experts: Dynamic Routing in MoE Models

    Authors: Quzhe Huang, Zhenwei An, Nan Zhuang, Mingxu Tao, Chen Zhang, Yang **, Kun Xu, Kun Xu, Liwei Chen, Songfang Huang, Yansong Feng

    Abstract: In this paper, we introduce a novel dynamic expert selection framework for Mixture of Experts (MoE) models, aiming to enhance computational efficiency and model performance by adjusting the number of activated experts based on input difficulty. Unlike traditional MoE approaches that rely on fixed Top-K routing, which activates a predetermined number of experts regardless of the input's complexity,… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  16. arXiv:2402.17886  [pdf, other

    stat.ML cs.LG math.PR math.ST stat.ME

    Zeroth-Order Sampling Methods for Non-Log-Concave Distributions: Alleviating Metastability by Denoising Diffusion

    Authors: Ye He, Kevin Rojas, Molei Tao

    Abstract: This paper considers the problem of sampling from non-logconcave distribution, based on queries of its unnormalized density. It first describes a framework, Diffusion Monte Carlo (DMC), based on the simulation of a denoising diffusion process with its score function approximated by a generic Monte Carlo estimator. DMC is an oracle-based meta-algorithm, where its oracle is the assumed access to sam… ▽ More

    Submitted 26 May, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  17. arXiv:2402.17304  [pdf, ps, other

    cs.CL cs.AI

    Probing Multimodal Large Language Models for Global and Local Semantic Representations

    Authors: Mingxu Tao, Quzhe Huang, Kun Xu, Liwei Chen, Yansong Feng, Dongyan Zhao

    Abstract: The advancement of Multimodal Large Language Models (MLLMs) has greatly accelerated the development of applications in understanding integrated texts and images. Recent works leverage image-caption datasets to train MLLMs, achieving state-of-the-art performance on image-to-text tasks. However, there are few studies exploring which layers of MLLMs make the most effort to the global image informatio… ▽ More

    Submitted 26 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted by LREC-COLING 2024 as a short paper (Camera Ready)

  18. arXiv:2402.16313  [pdf, other

    cs.CL cs.AI

    Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question Answering

    Authors: Mingxu Tao, Dongyan Zhao, Yansong Feng

    Abstract: Open-ended question answering requires models to find appropriate evidence to form well-reasoned, comprehensive and helpful answers. In practical applications, models also need to engage in extended discussions on potential scenarios closely relevant to the question. With augmentation of retrieval module, open-source Large Language Models (LLMs) can produce coherent answers often with different fo… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Under review

  19. arXiv:2402.10062  [pdf, other

    cs.LG stat.ML

    Optimal Parameter and Neuron Pruning for Out-of-Distribution Detection

    Authors: Chao Chen, Zhihang Fu, Kai Liu, Ze Chen, Mingyuan Tao, Jie** Ye

    Abstract: For a machine learning model deployed in real world scenarios, the ability of detecting out-of-distribution (OOD) samples is indispensable and challenging. Most existing OOD detection methods focused on exploring advanced training skills or training-free tricks to prevent the model from yielding overconfident confidence score for unknown samples. The training-based methods require expensive traini… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted by NeurIPS 2023. 19 pages

    Journal ref: NeurIPS 2023

  20. arXiv:2402.03744  [pdf, other

    cs.CL

    INSIDE: LLMs' Internal States Retain the Power of Hallucination Detection

    Authors: Chao Chen, Kai Liu, Ze Chen, Yi Gu, Yue Wu, Mingyuan Tao, Zhihang Fu, Jie** Ye

    Abstract: Knowledge hallucination have raised widespread concerns for the security and reliability of deployed LLMs. Previous efforts in detecting hallucinations have been employed at logit-level uncertainty estimation or language-level self-consistency evaluation, where the semantic information is inevitably lost during the token-decoding procedure. Thus, we propose to explore the dense semantic informatio… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Accepted by ICLR-2024

  21. arXiv:2401.15344  [pdf, other

    cs.IT eess.SP

    IRS Aided Millimeter-Wave Sensing and Communication: Beam Scanning, Beam Splitting, and Performance Analysis

    Authors: Renwang Li, Xiaodan Shao, Shu Sun, Meixia Tao, Rui Zhang

    Abstract: Integrated sensing and communication (ISAC) has attracted growing interests for enabling the future 6G wireless networks, due to its capability of sharing spectrum and hardware resources between communication and sensing systems. However, existing works on ISAC usually need to modify the communication protocol to cater for the new sensing performance requirement, which may be difficult to implemen… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: submitted to IEEE TWC

  22. arXiv:2401.09432  [pdf, other

    cs.CL cs.AI cs.LG

    RoleCraft-GLM: Advancing Personalized Role-Playing in Large Language Models

    Authors: Meiling Tao, Xuechen Liang, Tianyu Shi, Lei Yu, Yiting Xie

    Abstract: This study presents RoleCraft-GLM, an innovative framework aimed at enhancing personalized role-playing with Large Language Models (LLMs). RoleCraft-GLM addresses the key issue of lacking personalized interactions in conversational AI, and offers a solution with detailed and emotionally nuanced character portrayals. We contribute a unique conversational dataset that shifts from conventional celebr… ▽ More

    Submitted 4 April, 2024; v1 submitted 17 December, 2023; originally announced January 2024.

  23. arXiv:2401.06144  [pdf, other

    cs.CV cs.LG

    DFU: scale-robust diffusion model for zero-shot super-resolution image generation

    Authors: Alex Havrilla, Kevin Rojas, Wen**g Liao, Molei Tao

    Abstract: Diffusion generative models have achieved remarkable success in generating images with a fixed resolution. However, existing models have limited ability to generalize to different resolutions when training data at those resolutions are not available. Leveraging techniques from operator learning, we present a novel deep-learning architecture, Dual-FNO UNet (DFU), which approximates the score operat… ▽ More

    Submitted 22 January, 2024; v1 submitted 30 November, 2023; originally announced January 2024.

  24. arXiv:2401.01564  [pdf, other

    cs.IT eess.SP

    Deep Learning Based Superposition Coded Modulation for Hierarchical Semantic Communications over Broadcast Channels

    Authors: Yufei Bo, Shuo Shao, Meixia tao

    Abstract: We consider multi-user semantic communications over broadcast channels. While most existing works consider that each receiver requires either the same or independent semantic information, this paper explores the scenario where the semantic information desired by different receivers is different but correlated. In particular, we investigate semantic communications over Gaussian broadcast channels w… ▽ More

    Submitted 12 June, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

  25. arXiv:2312.17428  [pdf, other

    cs.CV

    ChangeNet: Multi-Temporal Asymmetric Change Detection Dataset

    Authors: Deyi Ji, Siqi Gao, Mingyuan Tao, Hongtao Lu, Feng Zhao

    Abstract: Change Detection (CD) has been attracting extensive interests with the availability of bi-temporal datasets. However, due to the huge cost of multi-temporal images acquisition and labeling, existing change detection datasets are small in quantity, short in temporal, and low in practicability. Therefore, a large-scale practical-oriented dataset covering wide temporal phases is urgently needed to fa… ▽ More

    Submitted 11 April, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: Accepted to ICASSP 2024 Oral/Lecture

  26. arXiv:2312.10641  [pdf, other

    cs.IT eess.SP

    Beamforming Design for Integrated Sensing and Communication with Extended Target

    Authors: Yiqiu Wang, Meixia Tao, Shu Sun

    Abstract: This paper studies transmit beamforming design in an integrated sensing and communication (ISAC) system, where a base station sends symbols to perform downlink multi-user communication and sense an extended target simultaneously. We first model the extended target contour with truncated Fourier series. By considering echo signals as reflections from the valid elements on the target contour, a nove… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: 8 pages, 3 figures, published to 8th Workshop on Integrated Sensing and Communications for Internet of Things in IEEE Global Communications Conference 2023

  27. arXiv:2312.05786  [pdf, other

    eess.SP cs.IT

    Deep Learning for Joint Design of Pilot, Channel Feedback, and Hybrid Beamforming in FDD Massive MIMO-OFDM Systems

    Authors: Junyi Yang, Weifeng Zhu, Shu Sun, Xiaofeng Li, Xingqin Lin, Meixia Tao

    Abstract: This letter considers the transceiver design in frequency division duplex (FDD) massive multiple-input multiple-output (MIMO) orthogonal frequency division multiplexing (OFDM) systems for high-quality data transmission. We propose a novel deep learning based framework where the procedures of pilot design, channel feedback, and hybrid beamforming are realized by carefully crafted deep neural networ… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 5 pages, 4 figures, acccpted by IEEE Communication Letters

  28. arXiv:2311.08348  [pdf, other

    cs.CL

    MC$^2$: Towards Transparent and Culturally-Aware NLP for Minority Languages in China

    Authors: Chen Zhang, Mingxu Tao, Quzhe Huang, Jiuheng Lin, Zhibin Chen, Yansong Feng

    Abstract: Current large language models demonstrate deficiencies in understanding low-resource languages, particularly the minority languages in China. This limitation stems from the scarcity of available pre-training data. To address this accessibility challenge, we present MC$^2$, a Multilingual Corpus of Minority Languages in China, which is the largest open-source corpus of its kind so far. MC$^2$ inclu… ▽ More

    Submitted 13 June, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: ACL 2024 https://github.com/luciusssss/mc2_corpus

  29. arXiv:2311.06500  [pdf, other

    cs.IT

    Knowledge Distillation and Training Balance for Heterogeneous Decentralized Multi-Modal Learning over Wireless Networks

    Authors: Benshun Yin, Zhiyong Chen, Meixia Tao

    Abstract: Decentralized learning is widely employed for collaboratively training models using distributed data over wireless networks. Existing decentralized learning methods primarily focus on training single-modal networks. For the decentralized multi-modal learning (DMML), the modality heterogeneity and the non-independent and non-identically distributed (non-IID) data across devices make it difficult fo… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: submitted to IEEE Trans. on Mobile Computing

  30. arXiv:2310.17087  [pdf, other

    cs.LG math.DS math.OC stat.ML

    Good regularity creates large learning rate implicit biases: edge of stability, balancing, and catapult

    Authors: Yuqing Wang, Zhenghao Xu, Tuo Zhao, Molei Tao

    Abstract: Large learning rates, when applied to gradient descent for nonconvex optimization, yield various implicit biases including the edge of stability (Cohen et al., 2021), balancing (Wang et al., 2022), and catapult (Lewkowycz et al., 2020). These phenomena cannot be well explained by classical optimization theory. Though significant theoretical progress has been made in understanding these implicit bi… ▽ More

    Submitted 11 December, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

  31. arXiv:2310.08233  [pdf, other

    cs.RO cs.AI

    The Impact of Time Step Frequency on the Realism of Robotic Manipulation Simulation for Objects of Different Scales

    Authors: Minh Q. Ta, Holly Dinkel, Hameed Abdul-Rashid, Yangfei Dai, Jessica Myers, Tan Chen, Junyi Geng, Timothy Bretl

    Abstract: This work evaluates the impact of time step frequency and component scale on robotic manipulation simulation accuracy. Increasing the time step frequency for small-scale objects is shown to improve simulation accuracy. This simulation, demonstrating pre-assembly part picking for two object geometries, serves as a starting point for discussing how to improve Sim2Real transfer in robotic assembly pr… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: 3 pages, 3 figures, Best Poster Finalist at the 2023 Robotics and AI in Future Factory Workshop at the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Video presentation [https://www.youtube.com/watch?v=JOXrBpMmI0A]. Robotics and AI in Future Factory workshop [https://sites.google.com/view/robot-ai-future-factory/]

  32. arXiv:2310.08095  [pdf, other

    cs.IT eess.SP

    Multi-Satellite Cooperative Networks: Joint Hybrid Beamforming and User Scheduling Design

    Authors: Xuan Zhang, Shu Sun, Meixia Tao, Qin Huang, Xiaohu Tang

    Abstract: In this paper, we consider a cooperative communication network where multiple low-Earth-orbit (LEO) satellites provide services to multiple ground users (GUs) cooperatively at the same time and on the same frequency. The multi-satellite cooperation has great potential in extending communication coverage and increasing spectral efficiency. Considering that the on-board radio-frequency circuit resou… ▽ More

    Submitted 27 December, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: 14 pages, 13 figures. arXiv admin note: substantial text overlap with arXiv:2301.03888

  33. arXiv:2310.06690  [pdf, other

    cs.IT eess.SP

    Joint Coding-Modulation for Digital Semantic Communications via Variational Autoencoder

    Authors: Yufei Bo, Yiheng Duan, Shuo Shao, Meixia Tao

    Abstract: Semantic communications have emerged as a new paradigm for improving communication efficiency by transmitting the semantic information of a source message that is most relevant to a desired task at the receiver. Most existing approaches typically utilize neural networks (NNs) to design end-to-end semantic communication systems, where NN-based semantic encoders output continuously distributed signa… ▽ More

    Submitted 29 January, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

  34. arXiv:2310.06283  [pdf, other

    cs.CV

    Towards More Efficient Depression Risk Recognition via Gait

    Authors: Min Ren, Muchan Tao, Xuecai Hu, Xiaotong Liu, Qiong Li, Yongzhen Huang

    Abstract: Depression, a highly prevalent mental illness, affects over 280 million individuals worldwide. Early detection and timely intervention are crucial for promoting remission, preventing relapse, and alleviating the emotional and financial burdens associated with depression. However, patients with depression often go undiagnosed in the primary care setting. Unlike many physiological illnesses, depress… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  35. arXiv:2310.01236  [pdf, other

    stat.ML cs.CV cs.LG

    Mirror Diffusion Models for Constrained and Watermarked Generation

    Authors: Guan-Horng Liu, Tianrong Chen, Evangelos A. Theodorou, Molei Tao

    Abstract: Modern successes of diffusion models in learning complex, high-dimensional data distributions are attributed, in part, to their capability to construct diffusion processes with analytic transition kernels and score functions. The tractability results in a simulation-free framework with stable regression losses, from which reversed, generative processes can be learned at scale. However, when data i… ▽ More

    Submitted 29 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: submitted to NeurIPS on 5/18 but did not arxiv per NeurIPS policy, accepted on 9/22

  36. arXiv:2309.14155  [pdf, other

    math.OC cs.LG

    Extragradient Type Methods for Riemannian Variational Inequality Problems

    Authors: Zihao Hu, Guanghui Wang, Xi Wang, Andre Wibisono, Jacob Abernethy, Molei Tao

    Abstract: Riemannian convex optimization and minimax optimization have recently drawn considerable attention. Their appeal lies in their capacity to adeptly manage the non-convexity of the objective function as well as constraints inherent in the feasible set in the Euclidean sense. In this work, we delve into monotone Riemannian Variational Inequality Problems (RVIPs), which encompass both Riemannian conve… ▽ More

    Submitted 1 June, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Published in Proceedings of The 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024)

  37. arXiv:2309.08895  [pdf, other

    cs.IT eess.SP

    CDDM: Channel Denoising Diffusion Models for Wireless Semantic Communications

    Authors: Tong Wu, Zhiyong Chen, Dazhi He, Liang Qian, Yin Xu, Meixia Tao, Wenjun Zhang

    Abstract: Diffusion models (DM) can gradually learn to remove noise, which have been widely used in artificial intelligence generated content (AIGC) in recent years. The property of DM for eliminating noise leads us to wonder whether DM can be applied to wireless communications to help the receiver mitigate the channel noise. To address this, we propose channel denoising diffusion models (CDDM) for semantic… ▽ More

    Submitted 16 September, 2023; originally announced September 2023.

    Comments: submitted to IEEE Transactions on Wireless Communications. arXiv admin note: substantial text overlap with arXiv:2305.09161

  38. arXiv:2308.12198  [pdf, other

    eess.SP cs.IT

    Hierarchical Beam Alignment for Millimeter-Wave Communication Systems: A Deep Learning Approach

    Authors: Junyi Yang, Weifeng Zhu, Meixia Tao, Shu Sun

    Abstract: Fast and precise beam alignment is crucial for high-quality data transmission in millimeter-wave (mmWave) communication systems, where large-scale antenna arrays are utilized to overcome the severe propagation loss. To tackle the challenging problem, we propose a novel deep learning-based hierarchical beam alignment method for both multiple-input single-output (MISO) and multiple-input multiple-ou… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: 15 pages, 16 figures, to appear in Transactions on Wireless Communications. arXiv admin note: text overlap with arXiv:2209.03643

  39. arXiv:2307.04440  [pdf, ps, other

    cs.IT eess.SP

    Time-Frequency-Space Transmit Design and Receiver Processing with Dynamic Subarray for Terahertz Integrated Sensing and Communication

    Authors: Yongzhi Wu, Chong Han, Meixia Tao

    Abstract: Terahertz (THz) integrated sensing and communication (ISAC) enables simultaneous data transmission with Terabit-per-second (Tbps) rate and millimeter-level accurate sensing. To realize such a blueprint, ultra-massive antenna arrays with directional beamforming are used to compensate for severe path loss in the THz band. In this paper, the time-frequency-space transmit design is investigated for TH… ▽ More

    Submitted 25 March, 2024; v1 submitted 10 July, 2023; originally announced July 2023.

  40. arXiv:2307.00228  [pdf, other

    cs.LG

    InferTurbo: A Scalable System for Boosting Full-graph Inference of Graph Neural Network over Huge Graphs

    Authors: Dalong Zhang, Xianzheng Song, Zhiyang Hu, Yang Li, Miao Tao, Binbin Hu, Lin Wang, Zhiqiang Zhang, Jun Zhou

    Abstract: GNN inference is a non-trivial task, especially in industrial scenarios with giant graphs, given three main challenges, i.e., scalability tailored for full-graph inference on huge graphs, inconsistency caused by stochastic acceleration strategies (e.g., sampling), and the serious redundant computation issue. To address the above challenges, we propose a scalable system named InferTurbo to boost th… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: Accepted by ICDE 2023

  41. arXiv:2307.00200  [pdf, other

    cs.IT eess.SP

    Beam Scanning for Integrated Sensing and Communication in IRS-aided mmWave Systems

    Authors: Renwang Li, Xiaodan Shao, Shu Sun, Meixia Tao, Rui Zhang

    Abstract: This paper investigates an intelligent reflecting surface (IRS) aided millimeter-wave integrated sensing and communication (ISAC) system. Specifically, based on the passive beam scanning in the downlink, the IRS finds the optimal beam for reflecting the signals from the base station to a communication user. Meanwhile, the IRS estimates the angle of a nearby target based on its echo signal received… ▽ More

    Submitted 30 June, 2023; originally announced July 2023.

    Comments: Accepted by IEEE SPAWC

  42. arXiv:2305.15062  [pdf, other

    cs.CL cs.AI

    Lawyer LLaMA Technical Report

    Authors: Quzhe Huang, Mingxu Tao, Chen Zhang, Zhenwei An, Cong Jiang, Zhibin Chen, Zirui Wu, Yansong Feng

    Abstract: Large Language Models (LLMs), like LLaMA, have exhibited remarkable performance across various tasks. Nevertheless, when deployed to specific domains such as law or medicine, the models still confront the challenge of a deficiency in domain-specific knowledge and an inadequate capability to leverage that knowledge to resolve domain-related problems. In this paper, we propose a new framework to ada… ▽ More

    Submitted 13 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  43. arXiv:2305.10899  [pdf, other

    cs.CV

    Ultra-High Resolution Segmentation with Ultra-Rich Context: A Novel Benchmark

    Authors: Deyi Ji, Feng Zhao, Hongtao Lu, Mingyuan Tao, Jie** Ye

    Abstract: With the increasing interest and rapid development of methods for Ultra-High Resolution (UHR) segmentation, a large-scale benchmark covering a wide range of scenes with full fine-grained dense annotations is urgently needed to facilitate the field. To this end, the URUR dataset is introduced, in the meaning of Ultra-High Resolution dataset with Ultra-Rich Context. As the name suggests, URUR contai… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted to CVPR 2023

  44. arXiv:2305.09165  [pdf, other

    cs.IT eess.SP

    Fusion-Based Multi-User Semantic Communications for Wireless Image Transmission over Degraded Broadcast Channels

    Authors: Tong Wu, Zhiyong Chen, Meixia Tao, Bin Xia, Wenjun Zhang

    Abstract: Degraded broadcast channels (DBC) are a typical multi-user communications scenario. There exist classic transmission methods, such as superposition coding with successive interference cancellation, to achieve the DBC capacity region. However, semantic communications method over DBC remains lack of in-depth research. To address this, we design a fusion-based multi-user semantic communications syste… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  45. arXiv:2305.09161  [pdf, other

    cs.IT eess.SP

    CDDM: Channel Denoising Diffusion Models for Wireless Communications

    Authors: Tong Wu, Zhiyong Chen, Dazhi He, Liang Qian, Yin Xu, Meixia Tao, Wenjun Zhang

    Abstract: Diffusion models (DM) can gradually learn to remove noise, which have been widely used in artificial intelligence generated content (AIGC) in recent years. The property of DM for removing noise leads us to wonder whether DM can be applied to wireless communications to help the receiver eliminate the channel noise. To address this, we propose channel denoising diffusion models (CDDM) for wireless c… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  46. arXiv:2305.06279  [pdf, other

    cs.IT cs.LG eess.SP

    Vertical Federated Learning over Cloud-RAN: Convergence Analysis and System Optimization

    Authors: Yuanming Shi, Shuhao Xia, Yong Zhou, Yijie Mao, Chunxiao Jiang, Meixia Tao

    Abstract: Vertical federated learning (FL) is a collaborative machine learning framework that enables devices to learn a global model from the feature-partition datasets without sharing local raw data. However, as the number of the local intermediate outputs is proportional to the training samples, it is critical to develop communication-efficient techniques for wireless vertical FL to support high-dimensio… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: 32 pages, 7 figures

  47. A Frustratingly Easy Improvement for Position Embeddings via Random Padding

    Authors: Mingxu Tao, Yansong Feng, Dongyan Zhao

    Abstract: Position embeddings, encoding the positional relationships among tokens in text sequences, make great contributions to modeling local context features in Transformer-based pre-trained language models. However, in Extractive Question Answering, position embeddings trained with instances of varied context lengths may not perform well as we expect. Since the embeddings of rear positions are updated f… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

  48. arXiv:2305.03944  [pdf, other

    cs.CV

    Structural and Statistical Texture Knowledge Distillation for Semantic Segmentation

    Authors: Deyi Ji, Haoran Wang, Mingyuan Tao, Jianqiang Huang, Xian-Sheng Hua, Hongtao Lu

    Abstract: Existing knowledge distillation works for semantic segmentation mainly focus on transferring high-level contextual knowledge from teacher to student. However, low-level texture knowledge is also of vital importance for characterizing the local structural pattern and global statistical property, such as boundary, smoothness, regularity and color contrast, which may not be well addressed by high-lev… ▽ More

    Submitted 5 July, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: Accepted to CVPR 2022

  49. arXiv:2304.09727  [pdf, other

    eess.SP cs.IT

    Cooperative Multi-Cell Massive Access with Temporally Correlated Activity

    Authors: Weifeng Zhu, Meixia Tao, Xiaojun Yuan, Fan Xu, Yunfeng Guan

    Abstract: This paper investigates the problem of activity detection and channel estimation in cooperative multi-cell massive access systems with temporally correlated activity, where all access points (APs) are connected to a central unit via fronthaul links. We propose to perform user-centric AP cooperation for computation burden alleviation and introduce a generalized sliding-window detection strategy for… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: 16 pages, 17 figures, minor revision

  50. arXiv:2303.15474  [pdf

    cs.LG cs.AR cs.NE eess.SY

    A Heterogeneous Parallel Non-von Neumann Architecture System for Accurate and Efficient Machine Learning Molecular Dynamics

    Authors: Zhuoying Zhao, Ziling Tan, **hui Mo, Xiaonan Wang, Dan Zhao, Xin Zhang, Ming Tao, Jie Liu

    Abstract: This paper proposes a special-purpose system to achieve high-accuracy and high-efficiency machine learning (ML) molecular dynamics (MD) calculations. The system consists of field programmable gate array (FPGA) and application specific integrated circuit (ASIC) working in heterogeneous parallelization. To be specific, a multiplication-less neural network (NN) is deployed on the non-von Neumann (NvN… ▽ More

    Submitted 26 March, 2023; originally announced March 2023.