Showing 1–50 of 1,410 results for author: Yang, Q

  1. arXiv:2407.00478  [pdf, other]

    cs.LG cs.AI

    Knowledge-Aware Parsimony Learning: A Perspective from Relational Graphs

    Authors: Quanming Yao, Yongqi Zhang, Yaqing Wang, Nan Yin, James Kwok, Qiang Yang

    Abstract: The scaling law, a strategy that involves the brute-force scaling of the training dataset and learnable parameters, has become a prevalent approach for developing stronger learning models. In this paper, we examine its rationale in terms of learning from relational graphs. We demonstrate that directly adhering to such a scaling law does not necessarily yield stronger models due to architectural in…

    Submitted 29 June, 2024; originally announced July 2024.

  2. arXiv:2406.19703  [pdf, other]

    cs.CV

    Vision Transformer with Key-select Routing Attention for Single Image Dehazing

    Authors: Lihan Tong, Weijia Li, Qingxia Yang, Liyuan Chen, Peng Chen

    Abstract: We present Ksformer, utilizing Multi-scale Key-select Routing Attention (MKRA) for intelligent selection of key areas through multi-channel, multi-scale windows with a top-k operator, and Lightweight Frequency Processing Module (LFPM) to enhance high-frequency features, outperforming other dehazing methods in tests.

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 5 pages, 4 figures, IEICE Trans. Information and Systems

    Report number: Vol.E107-D, No.11, pp.-, Nov. 2024; MSC Class: 68U10 (Primary); ACM Class: I.4

  3. arXiv:2406.18862  [pdf, other]

    cs.SD eess.AS

    Streaming Decoder-Only Automatic Speech Recognition with Discrete Speech Units: A Pilot Study

    Authors: Peikun Chen, Sining Sun, Changhao Shan, Qing Yang, Lei Xie

    Abstract: Unified speech-text models like SpeechGPT, VioLA, and AudioPaLM have shown impressive performance across various speech-related tasks, especially in Automatic Speech Recognition (ASR). These models typically adopt a unified method to model discrete speech and text tokens, followed by training a decoder-only transformer. However, they are all designed for non-streaming ASR tasks, where the entire s…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted for Interspeech 2024

  4. arXiv:2406.17404  [pdf, other]

    cs.CL cs.LG

    Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training

    Authors: Yixuan Wang, Xianzhen Luo, Fuxuan Wei, Yijun Liu, Qingfu Zhu, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che

    Abstract: Existing speculative decoding methods typically require additional model structure and training processes to assist the model for draft token generation. This makes the migration of acceleration methods to the new model more costly and more demanding on device memory. To address this problem, we propose the Make Some Noise (MSN) training framework as a replacement for the supervised fine-tuning st…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 11 pages, 6 figures

  5. arXiv:2406.16520  [pdf]

    cond-mat.str-el cond-mat.mtrl-sci cond-mat.supr-con

    Gigantic-oxidative atomically layered epitaxy for designed complex oxides

    Authors: Guangdi Zhou, Haoliang Huang, Fengzhe Wang, Heng Wang, Qishuo Yang, Zihao Nie, Wei Lv, Cui Ding, Yueying Li, Danfeng Li, Yujie Sun, Junhao Lin, Guang-Ming Zhang, Qi-Kun Xue, Zhuoyu Chen

    Abstract: In designing material functionality within the intricate realm of transition metal oxides, lattice structure and d-orbital occupancy are two principal determinants of the correlated physical properties, such as superconductivity. However, the modulation of these two factors is inherently limited by the need to balance thermodynamic stability, kinetic mobility, and synthesis precision, particularly…

    Submitted 24 June, 2024; originally announced June 2024.

  6. arXiv:2406.16442  [pdf, other]

    cs.CV

    EmoLLM: Multimodal Emotional Understanding Meets Large Language Models

    Authors: Qu Yang, Mang Ye, Bo Du

    Abstract: Multi-modal large language models (MLLMs) have achieved remarkable performance on objective multimodal perception tasks, but their ability to interpret subjective, emotionally nuanced multimodal content remains largely unexplored. Thus, it impedes their ability to effectively understand and react to the intricate emotions expressed by humans through multimodal media. To bridge this gap, we introdu…

    Submitted 29 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: 9 pages

  7. arXiv:2406.16278  [pdf, ps, other]

    math.AP

    Sharp fractional Sobolev and related inequalities on H-type groups

    Authors: Yaojun Wang, Qiaohua Yang

    Abstract: We determine the sharp constants for the fractional Sobolev inequalities associated with the conformally invariant fractional powers $\mathcal{L}_{s}(0<s<1)$ of the sublaplacian on H-type groups. From these inequalities we derive a sharp log-Sobolev inequality by considering a limiting case and a sharp Sobolev trace inequality. The latter extends to this context the result of Frank, González, Monti…

    Submitted 27 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  8. arXiv:2406.16271  [pdf, other]

    cs.CV

    Feature-prompting GBMSeg: One-Shot Reference Guided Training-Free Prompt Engineering for Glomerular Basement Membrane Segmentation

    Authors: Xueyu Liu, Guangze Shi, Rui Wang, Yexin Lai, Jianan Zhang, Lele Sun, Quan Yang, Yongfei Wu, Ming Li, Weixia Han, Wen Zheng

    Abstract: Assessment of the glomerular basement membrane (GBM) in transmission electron microscopy (TEM) is crucial for diagnosing chronic kidney disease (CKD). The lack of domain-independent automatic segmentation tools for the GBM necessitates an AI-based solution to automate the process. In this study, we introduce GBMSeg, a training-free framework designed to automatically segment the GBM in TEM images…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Accepted for MICCAI2024

  9. arXiv:2406.13007  [pdf, other]

    cs.CV

    NTIRE 2024 Challenge on Night Photography Rendering

    Authors: Egor Ershov, Artyom Panshin, Oleg Karasev, Sergey Korchagin, Shepelev Lev, Alexandr Startsev, Daniil Vladimirov, Ekaterina Zaychenkova, Nikola Banić, Dmitrii Iarchuk, Maria Efimova, Radu Timofte, Arseniy Terekhin, Shuwei Yue, Yuyang Liu, Minchen Wei, Lu Xu, Chao Zhang, Yasi Wang, Furkan Kınlı, Doğa Yılmaz, Barış Özcan, Furkan Kıraç, Shuai Liu, Jingyuan Xiao, et al. (25 additional authors not shown)

    Abstract: This paper presents a review of the NTIRE 2024 challenge on night photography rendering. The goal of the challenge was to find solutions that process raw camera images taken in nighttime conditions, and thereby produce photo-quality output images in the standard RGB (sRGB) space. Unlike the previous year's competition, the challenge images were collected with a mobile phone and the speed of algo…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 10 pages, 10 figures

  10. arXiv:2406.12726  [pdf, other]

    cs.SD cs.AI eess.AS

    ED-sKWS: Early-Decision Spiking Neural Networks for Rapid,and Energy-Efficient Keyword Spotting

    Authors: Zeyang Song, Qianhui Liu, Qu Yang, Yizhou Peng, Haizhou Li

    Abstract: Keyword Spotting (KWS) is essential in edge computing requiring rapid and energy-efficient responses. Spiking Neural Networks (SNNs) are well-suited for KWS for their efficiency and temporal capacity for speech. To further reduce the latency and energy consumption, this study introduces ED-sKWS, an SNN-based KWS model with an early-decision mechanism that can stop speech processing and output the…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted by INTERSPEECH2024

  11. arXiv:2406.12403  [pdf, other]

    cs.CL cs.AI

    PDSS: A Privacy-Preserving Framework for Step-by-Step Distillation of Large Language Models

    Authors: Tao Fan, Yan Kang, Weijing Chen, Hanlin Gu, Yuanfeng Song, Lixin Fan, Kai Chen, Qiang Yang

    Abstract: In the context of real-world applications, leveraging large language models (LLMs) for domain-specific tasks often faces two major challenges: domain-specific knowledge privacy and constrained resources. To address these issues, we propose PDSS, a privacy-preserving framework for step-by-step distillation of LLMs. PDSS works on a server-client architecture, wherein client transmits perturbed promp…

    Submitted 18 June, 2024; originally announced June 2024.

  12. arXiv:2406.12254  [pdf, other]

    eess.IV cs.CV

    Enhancing Single-Slice Segmentation with 3D-to-2D Unpaired Scan Distillation

    Authors: Xin Yu, Qi Yang, Han Liu, Ho Hin Lee, Yucheng Tang, Lucas W. Remedios, Michael Kim, Shunxing Bao, Ann Xenobia Moore, Luigi Ferrucci, Bennett A. Landman

    Abstract: 2D single-slice abdominal computed tomography (CT) enables the assessment of body habitus and organ health with low radiation exposure. However, single-slice data necessitates the use of 2D networks for segmentation, but these networks often struggle to capture contextual information effectively. Consequently, even when trained on identical datasets, 3D networks typically achieve superior segmenta…

    Submitted 18 June, 2024; originally announced June 2024.

  13. arXiv:2406.11967  [pdf, other]

    cond-mat.mes-hall

    Elf autoencoder: unsupervised exploration of flat-band materials using electronic band structure fingerprints

    Authors: Henry Kelbrick Pentz, Thomas Warford, Ivan Timokhin, Qian Yang, Anupam Bhattacharya, Artem Mishchenko

    Abstract: Two-dimensional materials with flat electronic bands are promising for realizing exotic quantum phenomena such as unconventional superconductivity and nontrivial topology, but exploring their vast chemical space remains challenging. Here, we introduce an unsupervised convolutional autoencoder agent (elf) that operates on electronic band structure images and is capable of mapping band features and…

    Submitted 17 June, 2024; originally announced June 2024.

  14. arXiv:2406.11158  [pdf, other]

    eess.SY

    Dynamic Modeling and Control for an Offshore Semisubmersible Floating Wind Turbine

    Authors: Yingjie Gong, Qinmin Yang, Hua Geng, Wenchao Meng, Lin Wang

    Abstract: Floating wind turbines (FWTs) hold significant potential for the exploitation of offshore renewable energy resources. Nevertheless, prior to the construction of FWTs, it is imperative to tackle several critical challenges, especially the issue of performance degradation under combined wind and wave loads. This study begins with the development of a simplified nonlinear dynamical model for a sem…

    Submitted 16 June, 2024; originally announced June 2024.

  15. arXiv:2406.10715  [pdf, other]

    physics.optics quant-ph

    Chip-scale generation of 60-mode continuous-variable cluster states

    Authors: Ze Wang, Kangkang Li, Yue Wang, Xin Zhou, Yinke Cheng, Boxuan Jing, Fengxiao Sun, Jincheng Li, Zhilin Li, Qihuang Gong, Qiongyi He, Bei-Bei Li, Qi-Fan Yang

    Abstract: Increasing the number of entangled entities is crucial for achieving exponential computational speedups and secure quantum networks. Despite recent progress in generating large-scale entanglement through continuous-variable (CV) cluster states, translating these technologies to photonic chips has been hindered by decoherence, limiting the number of entangled entities to 8. Here, we demonstrate 60-…

    Submitted 15 June, 2024; originally announced June 2024.

  16. HiFGL: A Hierarchical Framework for Cross-silo Cross-device Federated Graph Learning

    Authors: Zhuoning Guo, Duanyi Yao, Qiang Yang, Hao Liu

    Abstract: Federated Graph Learning (FGL) has emerged as a promising way to learn high-quality representations from distributed graph data with privacy preservation. Although considerable efforts have been made for FGL under either the cross-device or the cross-silo paradigm, how to effectively capture graph knowledge in a more complicated cross-silo cross-device environment remains an under-explored problem. However…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: Accepted by SIGKDD 2024

  17. arXiv:2406.10606  [pdf, other]

    eess.SP

    Semantic Communication for Edge Intelligence Enabled Autonomous Driving System

    Authors: Yunqi Feng, Hesheng Shen, Zhendong Shan, Qianqian Yang, Xiufang Shi

    Abstract: Expected to provide higher transportation efficiency and security, autonomous driving has attracted substantial attention from both industry and academia. Meanwhile, the emergence of edge intelligence has further introduced significant advancements to this field. However, the crucial demands of ultra-reliable and low-latency communications (URLLC) among the vehicles and edge servers have hindered…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: This paper has been submitted to IEEE Network Magazine and is undergoing major revisions

  18. arXiv:2406.10540  [pdf, other]

    cs.AI cs.NE cs.RO

    Generating and Evolving Reward Functions for Highway Driving with Large Language Models

    Authors: Xu Han, Qiannan Yang, Xianda Chen, Xiaowen Chu, Meixin Zhu

    Abstract: Reinforcement Learning (RL) plays a crucial role in advancing autonomous driving technologies by maximizing reward functions to achieve the optimal policy. However, crafting these reward functions has been a complex, manual process in many practices. To reduce this complexity, we introduce a novel framework that integrates Large Language Models (LLMs) with RL to improve reward function design in a…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 7 pages, 6 figures

  19. arXiv:2406.10469  [pdf, other]

    eess.IV cs.CV cs.MM

    Object-Attribute-Relation Representation based Video Semantic Communication

    Authors: Qiyuan Du, Yiping Duan, Qianqian Yang, Xiaoming Tao, Mérouane Debbah

    Abstract: With the rapid growth of multimedia data volume, there is an increasing need for efficient video transmission in applications such as virtual reality and future video streaming services. Semantic communication is emerging as a vital technique for ensuring efficient and reliable transmission in low-bandwidth, high-noise settings. However, most current approaches focus on joint source-channel coding…

    Submitted 14 June, 2024; originally announced June 2024.

  20. arXiv:2406.10277  [pdf]

    physics.class-ph physics.optics

    Tellegen responses in metamaterials

    Authors: Qingdong Yang, Xinhua Wen, Zhongfu Li, Oubo You, Shuang Zhang

    Abstract: Tellegen medium has long been a topic of debate, with its existence being contested over several decades. It was first proposed by Tellegen in 1948 and is characterized by a real-valued cross coupling between electric and magnetic responses, distinguishing it from the well-known chiral medium that has imaginary coupling coefficients. Significantly, Tellegen responses are closely linked to axion dy…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 19 pages, 4 figures

  21. HiFAST : An HI Data Calibration and Imaging Pipeline for FAST II. Flux Density Calibration

    Authors: Ziming Liu, Jie Wang, Yingjie Jing, Zhi-Yu Zhang, Chen Xu, Tiantian Liang, Qingze Chen, Ningyu Tang, Qingliang Yang

    Abstract: Accurate flux density calibration is essential for precise analysis and interpretation of observations across different observation modes and instruments. In this research, we first introduce the flux calibration model incorporated in the HiFAST pipeline, designed for processing HI 21-cm spectra. Furthermore, we investigate different calibration techniques and assess the dependence of the gain param…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 14 pages, 15 figures, accepted by RAA

  22. arXiv:2406.05811  [pdf, other]

    math.ST

    CLT for Generalized Linear Spectral Statistics of High-dimensional Sample Covariance Matrices and Applications

    Authors: Yanlin Hu, Qing Yang, Xiao Han

    Abstract: In this paper, we introduce the $\mathbf{G}$eneralized $\mathbf{L}$inear $\mathbf{S}$pectral $\mathbf{S}$tatistics (GLSS) of a high-dimensional sample covariance matrix $\mathbf{S}_n$, denoted as $\operatorname{tr}f(\mathbf{S}_n)\mathbf{B}_n$, which effectively captures distinct spectral properties of $\mathbf{S}_n$ by involving an ancillary matrix $\mathbf{B}_n$ and a test function $f$. The joint…

    Submitted 9 June, 2024; originally announced June 2024.

  23. arXiv:2406.04601  [pdf, other]

    cs.LG

    Enhancing Size Generalization in Graph Neural Networks through Disentangled Representation Learning

    Authors: Zheng Huang, Qihui Yang, Dawei Zhou, Yujun Yan

    Abstract: Although most graph neural networks (GNNs) can operate on graphs of any size, their classification performance often declines on graphs larger than those encountered during training. Existing methods insufficiently address the removal of size information from graph representations, resulting in sub-optimal performance and reliance on backbone models. In response, we propose DISGEN, a novel and mod…

    Submitted 11 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  24. arXiv:2406.04323  [pdf, other]

    cs.LG cs.AI cs.CV

    ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories

    Authors: Qianlan Yang, Yu-Xiong Wang

    Abstract: Training autonomous agents with sparse rewards is a long-standing problem in online reinforcement learning (RL), due to low data efficiency. Prior work overcomes this challenge by extracting useful knowledge from offline data, often accomplished through the learning of action distribution from offline data and utilizing the learned distribution to facilitate online RL. However, since the offline d…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: ICML 2024 Accepted

  25. arXiv:2406.04025  [pdf]

    cs.CL

    The syntax-semantics interface in a child's path: A study of 3- to 11-year-olds' elicited production of Mandarin recursive relative clauses

    Authors: Caimei Yang, Qihang Yang, Xingzhi Su, Chenxi Fu, Xiaoyi Wang, Ying Yan, Zaijiang Man

    Abstract: There have been apparently conflicting claims over the syntax-semantics relationship in child acquisition. However, few of them have assessed the child's path toward the acquisition of recursive relative clauses (RRCs). The authors of the current paper conducted experiments to investigate 3- to 11-year-olds' most-structured elicited production of eight Mandarin RRCs in a 4 (syntactic types)*2 (semantic…

    Submitted 6 June, 2024; originally announced June 2024.

  26. arXiv:2406.03868  [pdf, other]

    cs.DC

    PALM: A Efficient Performance Simulator for Tiled Accelerators with Large-scale Model Training

    Authors: Jiahao Fang, Huizheng Wang, Qize Yang, Dehao Kong, Xu Dai, Jinyi Deng, Yang Hu, Shouyi Yin

    Abstract: Deep learning (DL) models are piquing high interest and scaling at an unprecedented rate. To this end, a handful of tiled accelerators have been proposed to support such large-scale training tasks. However, these accelerators often incorporate numerous cores or tiles even extending to wafer-scale, substantial on-chip bandwidth, and distributed memory systems. This results in an exceedingly complex…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 11 pages

  27. arXiv:2406.02916  [pdf, other]

    cs.RO

    Real-time Motion Planning for autonomous vehicles in dynamic environments

    Authors: Mohammad Dehghani Tezerjani, Dominic Carrillo, Deyuan Qu, Sudip Dhakal, Amir Mirzaeinia, Qing Yang

    Abstract: Recent advancements in self-driving car technologies have enabled them to navigate autonomously through various environments. However, one of the critical challenges in autonomous vehicle operation is trajectory planning, especially in dynamic environments with moving obstacles. This research aims to tackle this challenge by proposing a robust algorithm tailored for autonomous cars operating in dy…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 8 pages

  28. arXiv:2406.02852  [pdf]

    cond-mat.mtrl-sci

    Isolated anions induced high ionic conductivity

    Authors: Qifan Yang, Jing Xu, Yuqi Wang, Xiao Fu, Ruijuan Xiao, Hong Li

    Abstract: One of the key materials in solid-state lithium batteries is fast ion conductors. However, the Li+ ion transport in inorganic crystals involves complex factors, making it a mystery to find and design ion conductors with low migration barriers. In this work, a distinctive structural characteristic involving isolated anions has been discovered to enhance high ionic conductivity in crystals. It is an…

    Submitted 4 June, 2024; originally announced June 2024.

  29. arXiv:2406.02224  [pdf, other]

    cs.CL cs.AI

    FedMKT: Federated Mutual Knowledge Transfer for Large and Small Language Models

    Authors: Tao Fan, Guoqiang Ma, Yan Kang, Hanlin Gu, Yuanfeng Song, Lixin Fan, Kai Chen, Qiang Yang

    Abstract: Recent research in federated large language models (LLMs) has primarily focused on enabling clients to fine-tune their locally deployed homogeneous LLMs collaboratively or on transferring knowledge from server-based LLMs to small language models (SLMs) at downstream clients. However, a significant gap remains in the simultaneous mutual enhancement of both the server's LLM and clients' SLMs. To bri…

    Submitted 18 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  30. arXiv:2406.01956  [pdf, other]

    cs.CV

    Enhance Image-to-Image Generation with LLaVA Prompt and Negative Prompt

    Authors: Zhicheng Ding, Panfeng Li, Qikai Yang, Siyang Li

    Abstract: This paper presents a novel approach to enhance image-to-image generation by leveraging the multimodal capabilities of the Large Language and Vision Assistant (LLaVA). We propose a framework where LLaVA analyzes input images and generates textual descriptions, hereinafter LLaVA-generated prompts. These prompts, along with the original image, are fed into the image-to-image generation pipeline. Thi…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted by 2024 5th International Conference on Information Science, Parallel and Distributed Systems

  31. arXiv:2406.01422  [pdf, other]

    cs.SE cs.CL

    How to Understand Whole Software Repository?

    Authors: Yingwei Ma, Qingping Yang, Rongyu Cao, Binhua Li, Fei Huang, Yongbin Li

    Abstract: Recently, Large Language Model (LLM) based agents have significantly advanced the development of Automatic Software Engineering (ASE). Although their effectiveness has been verified, the designs of the existing methods mainly focus on the local information of codes, e.g., issues, classes, and functions, leading to limitations in capturing the global context and interdependencies within the software system. From th…

    Submitted 3 June, 2024; originally announced June 2024.

  32. arXiv:2406.01085  [pdf, other]

    cs.CR cs.AI

    FedAdOb: Privacy-Preserving Federated Deep Learning with Adaptive Obfuscation

    Authors: Hanlin Gu, Jiahuan Luo, Yan Kang, Yuan Yao, Gongxi Zhu, Bowen Li, Lixin Fan, Qiang Yang

    Abstract: Federated learning (FL) has emerged as a collaborative approach that allows multiple clients to jointly learn a machine learning model without sharing their private data. The concern about privacy leakage, albeit demonstrated under specific conditions, has triggered numerous follow-up studies in designing powerful attacking methods and effective defending mechanisms aiming to thwart these attacki…

    Submitted 3 June, 2024; originally announced June 2024.

  33. arXiv:2405.20681  [pdf, other]

    cs.CR cs.AI

    No Free Lunch Theorem for Privacy-Preserving LLM Inference

    Authors: Xiaojin Zhang, Yulin Fei, Yan Kang, Wei Chen, Lixin Fan, Hai Jin, Qiang Yang

    Abstract: Individuals and businesses have benefited significantly from Large Language Models (LLMs) including PaLM, Gemini and ChatGPT in various ways. For example, LLMs enhance productivity, reduce costs, and enable us to focus on more valuable tasks. Furthermore, LLMs possess the capacity to sift through extensive datasets, uncover underlying patterns, and furnish critical insights that propel the fron…

    Submitted 31 May, 2024; originally announced May 2024.

  34. arXiv:2405.18802  [pdf, other]

    cs.CR cs.AI

    Enhancing Security and Privacy in Federated Learning using Update Digests and Voting-Based Defense

    Authors: Wenjie Li, Kai Fan, Jingyuan Zhang, Hui Li, Wei Yang Bryan Lim, Qiang Yang

    Abstract: Federated Learning (FL) is a promising privacy-preserving machine learning paradigm that allows data owners to collaboratively train models while keeping their data localized. Despite its potential, FL faces challenges related to the trustworthiness of both clients and servers, especially in the presence of curious or malicious adversaries. In this paper, we introduce a novel framework named \unde…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 14 pages

  35. arXiv:2405.18776  [pdf, other]

    cs.CR cs.CL cs.LG

    LMO-DP: Optimizing the Randomization Mechanism for Differentially Private Fine-Tuning (Large) Language Models

    Authors: Qin Yang, Meisam Mohammad, Han Wang, Ali Payani, Ashish Kundu, Kai Shu, Yan Yan, Yuan Hong

    Abstract: Differentially Private Stochastic Gradient Descent (DP-SGD) and its variants have been proposed to ensure rigorous privacy for fine-tuning large-scale pre-trained language models. However, they rely heavily on the Gaussian mechanism, which may overly perturb the gradients and degrade the accuracy, especially in stronger privacy regimes (e.g., the privacy budget $ε< 3$). To address such limitations…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 18 pages, 15 figures

  36. arXiv:2405.17660  [pdf, other]

    cs.CV

    LoReTrack: Efficient and Accurate Low-Resolution Transformer Tracking

    Authors: Shaohua Dong, Yunhe Feng, Qing Yang, Yuewei Lin, Heng Fan

    Abstract: High-performance Transformer trackers have shown excellent results, yet they often bear a heavy computational load. Observing that a smaller input can immediately and conveniently reduce computations without changing the model, an easy solution is to adopt the low-resolution input for efficient Transformer tracking. Albeit faster, this significantly hurts tracking accuracy due to information loss in low re…

    Submitted 27 May, 2024; originally announced May 2024.

  37. arXiv:2405.17522  [pdf, other]

    cs.LG cs.DC

    Efficient Model Compression for Hierarchical Federated Learning

    Authors: Xi Zhu, Songcan Yu, Junbo Wang, Qinglin Yang

    Abstract: Federated learning (FL), as an emerging collaborative learning paradigm, has garnered significant attention due to its capacity to preserve privacy within distributed learning systems. In these systems, clients collaboratively train a unified neural network model using their local datasets and share model parameters rather than raw data, enhancing privacy. Predominantly, FL systems are designed fo…

    Submitted 27 May, 2024; originally announced May 2024.

  38. arXiv:2405.17221  [pdf, other]

    cs.AI cs.AR

    Efficient Orchestrated AI Workflows Execution on Scale-out Spatial Architecture

    Authors: Jinyi Deng, Xinru Tang, Zhiheng Yue, Guangyang Lu, Qize Yang, Jiahao Zhang, Jinxi Li, Chao Li, Shaojun Wei, Yang Hu, Shouyi Yin

    Abstract: Given the increasing complexity of AI applications, traditional spatial architectures frequently fall short. Our analysis identifies a pattern of interconnected, multi-faceted tasks encompassing both AI and general computational processes. In response, we have conceptualized "Orchestrated AI Workflows," an approach that integrates various tasks with logic-driven decisions into dynamic, sophisticat…

    Submitted 21 May, 2024; originally announced May 2024.

  39. arXiv:2405.16944  [pdf]

    cond-mat.mes-hall cond-mat.str-el

    Even- and Odd-denominator Fractional Quantum Anomalous Hall Effect in Graphene Moire Superlattices

    Authors: Jian Xie, Zihao Huo, Xin Lu, Zuo Feng, Zaizhe Zhang, Wenxuan Wang, Qiu Yang, Kenji Watanabe, Takashi Taniguchi, Kaihui Liu, Zhida Song, X. C. Xie, Jianpeng Liu, Xiaobo Lu

    Abstract: Fractional quantum anomalous Hall effect (FQAHE), a transport effect with fractionally quantized Hall plateau emerging under zero magnetic field, provides a radically new opportunity to engineer topological quantum electronics. By construction of topological flat band with moire engineering, intrinsic FQAHE has been observed in twisted MoTe2 system and rhombohedral pentalayer graphene/hBN moire su…

    Submitted 27 May, 2024; originally announced May 2024.

  40. arXiv:2405.16138  [pdf, ps, other]

    cond-mat.supr-con cond-mat.str-el

    Anomalous isotope Effect in d-wave superconductors on the square lattice

    Authors: Gan Sun, Qing-Geng Yang, Da Wang, Qiang-Hua Wang

    Abstract: Isotope effect with a large coefficient $α=-\partial \ln T_c/\partial \ln M$ is usually taken as evidence of phonon-mediated superconductors in the Bardeen-Cooper-Schrieffer (BCS) theory. However, in cuprates which are now widely believed to be strong correlation induced d-wave superconductors, $α$ is experimentally observed to be quite small at optimal doping, but keeps growing up with decreas…

    Submitted 25 May, 2024; originally announced May 2024.

    Journal ref: Phys. Rev. B 109, L180508 (2024)

  41. arXiv:2405.15474  [pdf, other]

    cs.LG cs.DC

    Unlearning during Learning: An Efficient Federated Machine Unlearning Method

    Authors: Hanlin Gu, Gongxi Zhu, Jie Zhang, Xinyuan Zhao, Yuxing Han, Lixin Fan, Qiang Yang

    Abstract: In recent years, Federated Learning (FL) has garnered significant attention as a distributed machine learning paradigm. To facilitate the implementation of the right to be forgotten, the concept of federated machine unlearning (FMU) has also emerged. However, current FMU approaches often involve additional time-consuming steps and may not offer comprehensive unlearning capabilities, which renders…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted by IJCAI 2024

  42. arXiv:2405.14488  [pdf, other]

    cs.CL

    MoGU: A Framework for Enhancing Safety of Open-Sourced LLMs While Preserving Their Usability

    Authors: Yanrui Du, Sendong Zhao, Danyang Zhao, Ming Ma, Yuhan Chen, Liangyu Huo, Qing Yang, Dongliang Xu, Bing Qin

    Abstract: Large Language Models (LLMs) are increasingly deployed in various applications. As their usage grows, concerns regarding their safety are rising, especially in maintaining harmless responses when faced with malicious instructions. Many defense strategies have been developed to enhance the safety of LLMs. However, our research finds that existing defense strategies lead LLMs to predominantly adopt…

    Submitted 23 May, 2024; originally announced May 2024.

  43. arXiv:2405.14212  [pdf, other]

    cs.CR cs.CL

    Federated Domain-Specific Knowledge Transfer on Large Language Models Using Synthetic Data

    Authors: Haoran Li, Xinyuan Zhao, Dadi Guo, Hanlin Gu, Ziqian Zeng, Yuxing Han, Yangqiu Song, Lixin Fan, Qiang Yang

    Abstract: As large language models (LLMs) demonstrate unparalleled performance and generalization ability, LLMs are widely used and integrated into various applications. When it comes to sensitive domains, as commonly described in federated learning scenarios, directly using external LLMs on private data is strictly prohibited by stringent data security and privacy regulations. For local clients, the utiliz…

    Submitted 23 May, 2024; originally announced May 2024.

  44. arXiv:2405.13519  [pdf]

    physics.flu-dyn

    Multi-fidelity topology optimization of flow boiling heat transfer in microchannels

    Authors: Yi Yuan, Li Chen, Qirui Yang, Lingran Gu, Wen-Quan Tao

    Abstract: Topology optimization (TO) is a powerful method to design innovative structures with improved heat transfer performance. In the present study, a multi-fidelity TO method with a delicately defined objective function is developed for flow boiling heat transfer in microchannels. Low-fidelity TO is conducted for the reduced-order process of single-phase laminar convective heat transfer, which generate…

    Submitted 22 May, 2024; originally announced May 2024.

  45. arXiv:2405.13483  [pdf, other]

    cs.IT

    Distributed Indirect Source Coding with Decoder Side Information

    Authors: Jiancheng Tang, Qianqian Yang, Deniz Gündüz

    Abstract: This paper studies a variant of the rate-distortion problem motivated by task-oriented semantic communication and distributed learning problems, where $M$ correlated sources are independently encoded for a central decoder. The decoder has access to a correlated side information in addition to the messages received from the encoders, and aims to recover a latent random variable correlated with the…

    Submitted 22 May, 2024; originally announced May 2024.

  46. arXiv:2405.11493  [pdf, other]

    cs.CV cs.IT eess.SP

    Point Cloud Compression with Implicit Neural Representations: A Unified Framework

    Authors: Hongning Ruan, Yulin Shao, Qianqian Yang, Liang Zhao, Dusit Niyato

    Abstract: Point clouds have become increasingly vital across various applications thanks to their ability to realistically depict 3D objects and scenes. Nevertheless, effectively compressing unstructured, high-precision point cloud data remains a significant challenge. In this paper, we present a pioneering point cloud compression framework capable of handling both geometry and attribute components. Unlike…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: 6 Pages, 6 Figures, submitted to IEEE ICCC

  47. arXiv:2405.10762  [pdf]

    q-fin.RM cs.AI cs.LG

    Research on Credit Risk Early Warning Model of Commercial Banks Based on Neural Network Algorithm

    Authors: Yu Cheng, Qin Yang, Liyang Wang, Ao Xiang, Jingyu Zhang

    Abstract: In the realm of globalized financial markets, commercial banks are confronted with an escalating magnitude of credit risk, thereby imposing heightened requisites upon the security of bank assets and financial stability. This study harnesses advanced neural network techniques, notably the Backpropagation (BP) neural network, to pioneer a novel model for preempting credit risk in commercial banks. T…

    Submitted 30 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

  48. arXiv:2405.09853  [pdf, other]

    hep-th cond-mat.str-el

    Chiral symmetry breaking in the pseudo-quantum electrodynamics with non-Abelian four-fermion interactions

    Authors: Qiao Yang, Yu-Biao Wu, Wu-Ming Liu

    Abstract: In the context of 2+1 dimensional Dirac materials, we consider electromagnetic interactions alongside a type of spin-dependent Hubbard interaction. The former is described by PQED theory, while the latter corresponds to an effective theory represented by the $SU(N_c)$ Thirring model. Employing Hubbard-Stratonovich transformation and large N expansion in the model yields a non-local $SU(N_c)$ Yang-…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 9 pages, 4 figures

  49. arXiv:2405.09234  [pdf, other]

    eess.IV

    Enhancing Image Privacy in Semantic Communication over Wiretap Channels leveraging Differential Privacy

    Authors: Weixuan Chen, Shunpu Tang, Qianqian Yang

    Abstract: Semantic communication (SemCom) enhances transmission efficiency by sending only task-relevant information compared to traditional methods. However, transmitting semantic-rich data over insecure or public channels poses security and privacy risks. This paper addresses the privacy problem of transmitting images over wiretap channels and proposes a novel SemCom approach ensuring privacy through a di…

    Submitted 15 May, 2024; originally announced May 2024.

  50. arXiv:2405.07639  [pdf]

    astro-ph.EP physics.geo-ph

    Unveiling the Magmatic Architecture Beneath Oceanus Procellarum: Insights from GRAIL Mission Data

    Authors: Meixia Geng, Qingjie Yang, Chaouki Kasmi, J. Kim Welford, Alexander L. Peace

    Abstract: The Oceanus Procellarum region, characterized by its vast basaltic plains and pronounced volcanic activity, serves as a focal point for understanding the volcanic history of the Moon. Leveraging the Gravity Recovery and Interior Laboratory (GRAIL) mission data, we imaged the magmatic structures beneath the Oceanus Procellarum region. Our 3D density models uncover pronounced linear magmatic structu…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 30 pages, 6 figures, and 1 table