Search | arXiv e-print repository

Asynchronous and Segmented Bidirectional Encoding for NMT

Authors: **gpu Yang, Zehua Han, Mengyu Xiang, Helin Wang, Yuxiao Huang, Miao Fang

Abstract: With the rapid advancement of Neural Machine Translation (NMT), enhancing translation efficiency and quality has become a focal point of research. Despite the commendable performance of general models such as the Transformer in various aspects, they still fall short in processing long sentences and fully leveraging bidirectional contextual information. This paper introduces an improved model based… ▽ More With the rapid advancement of Neural Machine Translation (NMT), enhancing translation efficiency and quality has become a focal point of research. Despite the commendable performance of general models such as the Transformer in various aspects, they still fall short in processing long sentences and fully leveraging bidirectional contextual information. This paper introduces an improved model based on the Transformer, implementing an asynchronous and segmented bidirectional decoding strategy aimed at elevating translation efficiency and accuracy. Compared to traditional unidirectional translations from left-to-right or right-to-left, our method demonstrates heightened efficiency and improved translation quality, particularly in handling long sentences. Experimental results on the IWSLT2017 dataset confirm the effectiveness of our approach in accelerating translation and increasing accuracy, especially surpassing traditional unidirectional strategies in long sentence translation. Furthermore, this study analyzes the impact of sentence length on decoding outcomes and explores the model's performance in various scenarios. The findings of this research not only provide an effective encoding strategy for the NMT field but also pave new avenues and directions for future studies. △ Less

Submitted 19 February, 2024; originally announced February 2024.

arXiv:2402.14594 [pdf, other]

Improving Assessment of Tutoring Practices using Retrieval-Augmented Generation

Authors: Zifei FeiFei Han, Jionghao Lin, Ashish Gurung, Danielle R. Thomas, Eason Chen, Conrad Borchers, Shivang Gupta, Kenneth R. Koedinger

Abstract: One-on-one tutoring is an effective instructional method for enhancing learning, yet its efficacy hinges on tutor competencies. Novice math tutors often prioritize content-specific guidance, neglecting aspects such as social-emotional learning. Social-emotional learning promotes equity and inclusion and nurturing relationships with students, which is crucial for holistic student development. Asses… ▽ More One-on-one tutoring is an effective instructional method for enhancing learning, yet its efficacy hinges on tutor competencies. Novice math tutors often prioritize content-specific guidance, neglecting aspects such as social-emotional learning. Social-emotional learning promotes equity and inclusion and nurturing relationships with students, which is crucial for holistic student development. Assessing the competencies of tutors accurately and efficiently can drive the development of tailored tutor training programs. However, evaluating novice tutor ability during real-time tutoring remains challenging as it typically requires experts-in-the-loop. To address this challenge, this preliminary study aims to harness Generative Pre-trained Transformers (GPT), such as GPT-3.5 and GPT-4 models, to automatically assess tutors' ability of using social-emotional tutoring strategies. Moreover, this study also reports on the financial dimensions and considerations of employing these models in real-time and at scale for automated assessment. The current study examined four prompting strategies: two basic Zero-shot prompt strategies, Tree of Thought prompt, and Retrieval-Augmented Generator (RAG) based prompt. The results indicate that the RAG prompt demonstrated more accurate performance (assessed by the level of hallucination and correctness in the generated assessment texts) and lower financial costs than the other strategies evaluated. These findings inform the development of personalized tutor training interventions to enhance the the educational effectiveness of tutored learning. △ Less

Submitted 4 February, 2024; originally announced February 2024.

Comments: 11 page Workshop paper, AAAI2024 Workshop on AI for Education - Bridging Innovation and Responsibility, Large Language Model, Personalized Tutor Training, Automatic Assessment

arXiv:2402.13073 [pdf, other]

Towards Intelligent Communications: Large Model Empowered Semantic Communications

Authors: Huiqiang Xie, Zhi** Qin, Xiaoming Tao, Zhu Han

Abstract: Deep learning enabled semantic communications have shown great potential to significantly improve transmission efficiency and alleviate spectrum scarcity, by effectively exchanging the semantics behind the data. Recently, the emergence of large models, boasting billions of parameters, has unveiled remarkable human-like intelligence, offering a promising avenue for advancing semantic communication… ▽ More Deep learning enabled semantic communications have shown great potential to significantly improve transmission efficiency and alleviate spectrum scarcity, by effectively exchanging the semantics behind the data. Recently, the emergence of large models, boasting billions of parameters, has unveiled remarkable human-like intelligence, offering a promising avenue for advancing semantic communication by enhancing semantic understanding and contextual understanding. This article systematically investigates the large model-empowered semantic communication systems from potential applications to system design. First, we propose a new semantic communication architecture that seamlessly integrates large models into semantic communication through the introduction of a memory module. Then, the typical applications are illustrated to show the benefits of the new architecture. Besides, we discuss the key designs in implementing the new semantic communication systems from module design to system training. Finally, the potential research directions are identified to boost the large model-empowered semantic communications. △ Less

Submitted 19 March, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

Comments: 7 pages, 6 figures

arXiv:2402.11518 [pdf, other]

doi 10.1145/3637528.3671965

Large Language Model-driven Meta-structure Discovery in Heterogeneous Information Network

Authors: Lin Chen, Fengli Xu, Nian Li, Zhenyu Han, Meng Wang, Yong Li, Pan Hui

Abstract: Heterogeneous information networks (HIN) have gained increasing popularity in recent years for capturing complex relations between diverse types of nodes. Meta-structures are proposed as a useful tool to identify the important patterns in HINs, but hand-crafted meta-structures pose significant challenges for scaling up, drawing wide research attention towards develo** automatic search algorithms… ▽ More Heterogeneous information networks (HIN) have gained increasing popularity in recent years for capturing complex relations between diverse types of nodes. Meta-structures are proposed as a useful tool to identify the important patterns in HINs, but hand-crafted meta-structures pose significant challenges for scaling up, drawing wide research attention towards develo** automatic search algorithms. Previous efforts primarily focused on searching for meta-structures with good empirical performance, overlooking the importance of human comprehensibility and generalizability. To address this challenge, we draw inspiration from the emergent reasoning abilities of large language models (LLMs). We propose ReStruct, a meta-structure search framework that integrates LLM reasoning into the evolutionary procedure. ReStruct uses a grammar translator to encode the meta-structures into natural language sentences, and leverages the reasoning power of LLMs to evaluate their semantic feasibility. Besides, ReStruct also employs performance-oriented evolutionary operations. These two competing forces allow ReStruct to jointly optimize the semantic explainability and empirical performance of meta-structures. Furthermore, ReStruct contains a differential LLM explainer to generate and refine natural language explanations for the discovered meta-structures by reasoning through the search history. Experiments on eight representative HIN datasets demonstrate that ReStruct achieves state-of-the-art performance in both recommendation and node classification tasks. Moreover, a survey study involving 73 graduate students shows that the discovered meta-structures and generated explanations by ReStruct are substantially more comprehensible. Our code and questionnaire are available at https://github.com/LinChen-65/ReStruct. △ Less

Submitted 22 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

arXiv:2402.10787 [pdf, other]

EdgeQAT: Entropy and Distribution Guided Quantization-Aware Training for the Acceleration of Lightweight LLMs on the Edge

Authors: Xuan Shen, Zhenglun Kong, Changdi Yang, Zhaoyang Han, Lei Lu, Peiyan Dong, Cheng Lyu, Chih-hsiang Li, Xuehang Guo, Zhihao Shu, Wei Niu, Miriam Leeser, Pu Zhao, Yanzhi Wang

Abstract: Despite the remarkable strides of Large Language Models (LLMs) in various fields, the wide applications of LLMs on edge devices are limited due to their massive parameters and computations. To address this, quantization is commonly adopted to generate lightweight LLMs with efficient computations and fast inference. However, Post-Training Quantization (PTQ) methods dramatically degrade in quality w… ▽ More Despite the remarkable strides of Large Language Models (LLMs) in various fields, the wide applications of LLMs on edge devices are limited due to their massive parameters and computations. To address this, quantization is commonly adopted to generate lightweight LLMs with efficient computations and fast inference. However, Post-Training Quantization (PTQ) methods dramatically degrade in quality when quantizing weights, activations, and KV cache together to below 8 bits. Besides, many Quantization-Aware Training (QAT) works quantize model weights, leaving the activations untouched, which do not fully exploit the potential of quantization for inference acceleration on the edge. In this paper, we propose EdgeQAT, the Entropy and Distribution Guided QAT for the optimization of lightweight LLMs to achieve inference acceleration on Edge devices. We first identify that the performance drop of quantization primarily stems from the information distortion in quantized attention maps, demonstrated by the different distributions in quantized query and key of the self-attention mechanism. Then, the entropy and distribution guided QAT is proposed to mitigate the information distortion. Moreover, we design a token importance-aware adaptive method to dynamically quantize the tokens with different bit widths for further optimization and acceleration. Our extensive experiments verify the substantial improvements with our framework across various datasets. Furthermore, we achieve an on-device speedup of up to 2.37x compared with its FP16 counterparts across multiple edge devices, signaling a groundbreaking advancement. △ Less

Submitted 16 February, 2024; originally announced February 2024.

Comments: Preprint

arXiv:2402.09736 [pdf, other]

Federated Analytics-Empowered Frequent Pattern Mining for Decentralized Web 3.0 Applications

Authors: Zibo Wang, Yifei Zhu, Dan Wang, Zhu Han

Abstract: The emerging Web 3.0 paradigm aims to decentralize existing web services, enabling desirable properties such as transparency, incentives, and privacy preservation. However, current Web 3.0 applications supported by blockchain infrastructure still cannot support complex data analytics tasks in a scalable and privacy-preserving way. This paper introduces the emerging federated analytics (FA) paradig… ▽ More The emerging Web 3.0 paradigm aims to decentralize existing web services, enabling desirable properties such as transparency, incentives, and privacy preservation. However, current Web 3.0 applications supported by blockchain infrastructure still cannot support complex data analytics tasks in a scalable and privacy-preserving way. This paper introduces the emerging federated analytics (FA) paradigm into the realm of Web 3.0 services, enabling data to stay local while still contributing to complex web analytics tasks in a privacy-preserving way. We propose FedWeb, a tailored FA design for important frequent pattern mining tasks in Web 3.0. FedWeb remarkably reduces the number of required participating data owners to support privacy-preserving Web 3.0 data analytics based on a novel distributed differential privacy technique. The correctness of mining results is guaranteed by a theoretically rigid candidate filtering scheme based on Hoeffding's inequality and Chebychev's inequality. Two response budget saving solutions are proposed to further reduce participating data owners. Experiments on three representative Web 3.0 scenarios show that FedWeb can improve data utility by ~25.3% and reduce the participating data owners by ~98.4%. △ Less

Submitted 15 February, 2024; originally announced February 2024.

Comments: Accepted by IEEE International Conference on Computer Communications (INFOCOM'24)

arXiv:2402.09637 [pdf, other]

Orthogonal Time Frequency Space for Integrated Sensing and Communication: A Survey

Authors: Eyad Shtaiwi, Ahmed Abdelhadi, Husheng Li, Zhu Han, H. Vincent Poor

Abstract: Sixth-generation (6G) wireless communication systems, as stated in the European 6G flagship project Hexa-X, are anticipated to feature the integration of intelligence, communication, sensing, positioning, and computation. An important aspect of this integration is integrated sensing and communication (ISAC), in which the same waveform is used for both systems both sensing and communication, to add… ▽ More Sixth-generation (6G) wireless communication systems, as stated in the European 6G flagship project Hexa-X, are anticipated to feature the integration of intelligence, communication, sensing, positioning, and computation. An important aspect of this integration is integrated sensing and communication (ISAC), in which the same waveform is used for both systems both sensing and communication, to address the challenge of spectrum scarcity. Recently, the orthogonal time frequency space (OTFS) waveform has been proposed to address OFDM's limitations due to the high Doppler spread in some future wireless communication systems. In this paper, we review existing OTFS waveforms for ISAC systems and provide some insights into future research. Firstly, we introduce the basic principles and a system model of OTFS and provide a foundational understanding of this innovative technology's core concepts and architecture. Subsequently, we present an overview of OTFS-based ISAC system frameworks. We provide a comprehensive review of recent research developments and the current state of the art in the field of OTFS-assisted ISAC systems to gain a thorough understanding of the current landscape and advancements. Furthermore, we perform a thorough comparison between OTFS-enabled ISAC operations and traditional OFDM, highlighting the distinctive advantages of OTFS, especially in high Doppler spread scenarios. Subsequently, we address the primary challenges facing OTFS-based ISAC systems, identifying potential limitations and drawbacks. Then, finally, we suggest future research directions, aiming to inspire further innovation in the 6G wireless communication landscape. △ Less

Submitted 14 February, 2024; originally announced February 2024.

arXiv:2402.08384 [pdf, other]

Selective Learning: Towards Robust Calibration with Dynamic Regularization

Authors: Zongbo Han, Yifeng Yang, Changqing Zhang, Linjun Zhang, Joey Tianyi Zhou, Qinghua Hu, Huaxiu Yao

Abstract: Miscalibration in deep learning refers to there is a discrepancy between the predicted confidence and performance. This problem usually arises due to the overfitting problem, which is characterized by learning everything presented in the training set, resulting in overconfident predictions during testing. Existing methods typically address overfitting and mitigate the miscalibration by adding a ma… ▽ More Miscalibration in deep learning refers to there is a discrepancy between the predicted confidence and performance. This problem usually arises due to the overfitting problem, which is characterized by learning everything presented in the training set, resulting in overconfident predictions during testing. Existing methods typically address overfitting and mitigate the miscalibration by adding a maximum-entropy regularizer to the objective function. The objective can be understood as seeking a model that fits the ground-truth labels by increasing the confidence while also maximizing the entropy of predicted probabilities by decreasing the confidence. However, previous methods lack clear guidance on confidence adjustment, leading to conflicting objectives (increasing but also decreasing confidence). Therefore, we introduce a method called Dynamic Regularization (DReg), which aims to learn what should be learned during training thereby circumventing the confidence adjusting trade-off. At a high level, DReg aims to obtain a more reliable model capable of acknowledging what it knows and does not know. Specifically, DReg effectively fits the labels for in-distribution samples (samples that should be learned) while applying regularization dynamically to samples beyond model capabilities (e.g., outliers), thereby obtaining a robust calibrated model especially on the samples beyond model capabilities. Both theoretical and empirical analyses sufficiently demonstrate the superiority of DReg compared with previous methods. △ Less

Submitted 13 February, 2024; originally announced February 2024.

arXiv:2402.06824 [pdf]

Beating bandwidth limits for large aperture broadband nano-optics

Authors: Johannes E. Fröch, Praneeth K. Chakravarthula, Jipeng Sun, Ethan Tseng, Shane Colburn, Alan Zhan, Forrest Miller, Anna Wirth-Singh, Quentin A. A. Tanguy, Zheyi Han, Karl F. Böhringer, Felix Heide, Arka Majumdar

Abstract: Flat optics have been proposed as an attractive approach for the implementation of new imaging and sensing modalities to replace and augment refractive optics. However, chromatic aberrations impose fundamental limitations on diffractive flat optics. As such, true broadband high-quality imaging has thus far been out of reach for low f-number, large aperture, flat optics. In this work, we overcome t… ▽ More Flat optics have been proposed as an attractive approach for the implementation of new imaging and sensing modalities to replace and augment refractive optics. However, chromatic aberrations impose fundamental limitations on diffractive flat optics. As such, true broadband high-quality imaging has thus far been out of reach for low f-number, large aperture, flat optics. In this work, we overcome these intrinsic fundamental limitations, achieving broadband imaging in the visible wavelength range with a flat meta-optic, co-designed with computational reconstruction. We derive the necessary conditions for a broadband, 1 cm aperture, f/2 flat optic, with a diagonal field of view of 30° and an average system MTF contrast of 30% or larger for a spatial frequency of 100 lp/mm in the visible band (> 50 % for 70 lp/mm and below). Finally, we use a coaxial, dual-aperture system to train the broadband imaging meta-optic with a learned reconstruction method operating on pair-wise captured imaging data. Fundamentally, our work challenges the entrenched belief of the inability of capturing high-quality, full-color images using a single large aperture meta-optic. △ Less

Submitted 9 February, 2024; originally announced February 2024.

arXiv:2402.01347 [pdf, ps, other]

doi 10.1103/PhysRevB.109.224508

Quantum Griffiths singularity in three-dimensional MoTiN superconducting films

Authors: Zi-Xiao Wang, Tian-Yu **g, Zi-Yan Han, Kuang-Hong Gao, Song-Ci Li, Zhi-Qing Li

Abstract: Quantum Griffiths singularity (QGS) has been experimentally observed in a range of two-dimensional (2D) superconducting systems. Although it is theoretically suggested that the QGS also exists in three-dimensional (3D) superconductors, there is almost no experimental support to the theoretical prediction. In the present paper, we observe the occurrence of QGS in a series of $\sim$80-nm-thick Mo… ▽ More Quantum Griffiths singularity (QGS) has been experimentally observed in a range of two-dimensional (2D) superconducting systems. Although it is theoretically suggested that the QGS also exists in three-dimensional (3D) superconductors, there is almost no experimental support to the theoretical prediction. In the present paper, we observe the occurrence of QGS in a series of $\sim$80-nm-thick Mo$_{0.8}$Ti$_{0.2}$N$_x$ ($0.84 \lesssim x \lesssim 1.12$) superconducting films near the field-driven superconductor-metal transition (SMT). These films have a NaCl structure and are 3D with respect to the superconductivity. For each film, the low-temperature magnetoresistance isotherms, measured at magnetic fields being perpendicular or parallel to the film plane, do not cross at a single point but at a clear wide region. The dynamical critical exponents $zν_{\perp}$ (for perpendicular field) and $zν_{\parallel}$ (for parallel field) obtained by analyzing the related magnetoresistance isotherms increase with decreasing temperature and tend to diverge as $T\rightarrow 0$ K. In addition, the effective resistivity data for the perpendicular and parallel field in the vicinity of the SMTs both obey an activated scaling based on the random transverse-field Ising model. We also fabricate a $\sim$80-nm-thick (Mo$_{0.8}$Ti$_{0.2}$)$_2$N$_{1.06}$ superconducting film with face-centered cubic structure at low nitrogen partial pressure. It is found that the low-temperature magnetoresistance isotherms for the perpendicular (parallel) field cross at a single point and the resistivity data for the perpendicular (parallel) field in the vicinity of the field-induced SMT obey the power-law scaling deduced from the dirty-boson model. Our results provide unambigous experimental evidence for the existence of QGS in 3D superconductors. △ Less

Submitted 2 February, 2024; originally announced February 2024.

Comments: 11 pages and 9 Figures

Journal ref: Physical Review B 109, 224508 (2024)

arXiv:2402.01345 [pdf, other]

Skip \n: A Simple Method to Reduce Hallucination in Large Vision-Language Models

Authors: Zongbo Han, Zechen Bai, Haiyang Mei, Qianli Xu, Changqing Zhang, Mike Zheng Shou

Abstract: Recent advancements in large vision-language models (LVLMs) have demonstrated impressive capability in visual information understanding with human language. Despite these advances, LVLMs still face challenges with multimodal hallucination, such as generating text descriptions of objects that are not present in the visual information. However, the underlying fundamental reasons of multimodal halluc… ▽ More Recent advancements in large vision-language models (LVLMs) have demonstrated impressive capability in visual information understanding with human language. Despite these advances, LVLMs still face challenges with multimodal hallucination, such as generating text descriptions of objects that are not present in the visual information. However, the underlying fundamental reasons of multimodal hallucinations remain poorly explored. In this paper, we propose a new perspective, suggesting that the inherent biases in LVLMs might be a key factor in hallucinations. Specifically, we systematically identify a semantic shift bias related to paragraph breaks (\n\n), where the content before and after '\n\n' in the training data frequently exhibit significant semantic changes. This pattern leads the model to infer that the contents following '\n\n' should be obviously different from the preceding contents with less hallucinatory descriptions, thereby increasing the probability of hallucinatory descriptions subsequent to the '\n\n'. We have validated this hypothesis on multiple publicly available LVLMs. Besides, we find that deliberately inserting '\n\n' at the generated description can induce more hallucinations. A simple method is proposed to effectively mitigate the hallucination of LVLMs by skip** the output of '\n'. △ Less

Submitted 7 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

arXiv:2402.01113 [pdf, other]

Single-modulated-pulse two-qubit gates for Rydberg atoms with noncyclic geometric control

Authors: Zi-Yuan Chen, Jia-Hao Liang, Zhao-Xin Fu, Hong-Zhi Liu, Ze-Rui He, 1 Meng Wang, Zhi-Wei Han, Jia-Yi Huang, Qing-Xian Lv, Yan-Xiong Du

Abstract: Arrays of neutral atoms have emerged as promising platforms for quantum computing. Realization of high-fidelity two-qubit gates with robustness is currently a significant important task for large-scale operations. In this paper, we present a convenient approach for implementing a two-qubit controlled-phase gate using Rydberg blockade. We achieve the noncyclic geometric control with a single modula… ▽ More Arrays of neutral atoms have emerged as promising platforms for quantum computing. Realization of high-fidelity two-qubit gates with robustness is currently a significant important task for large-scale operations. In this paper, we present a convenient approach for implementing a two-qubit controlled-phase gate using Rydberg blockade. We achieve the noncyclic geometric control with a single modulated pulse. As compared with the control scheme by cyclic evolution that determined by dynamical parameters, the robustness of the proposal against systematic errors will be remarkably improved due to the geometric characteristic. Importantly, the noncyclic geometric control reduces the gate time for small rotation angles and will be more insensitive to the decoherence effect. We accelerate the adiabatic control with the aid of shortcuts to adiabaticity to further shorten the operation time. We apply our protocol to the algorithm of quantum Fourier transformation to show the actual acceleration. Therefore, the proposed scheme will provide an analytical waveforms for arbitrary two-qubit gates and may have important use in the experiments of atomic arrays. △ Less

Submitted 1 February, 2024; originally announced February 2024.

Comments: 8 pages, 5 figures

arXiv:2401.17757 [pdf, ps, other]

Will Lanczos Iterations Generate Symmetric Quadrature Nodes?

Authors: Wenhao Li, Zongyuan Han, Andrew J Wathen, Shengxin Zhu

Abstract: The Golub-Welsch algorithm [Math. Comp., 23: 221-230 (1969)] for computing Gaussian quadrature rules is of importance in estimating quadratic forms. Quadrature rules based on this algorithm have long been assumed to be symmetric. Recent research indicates that the presence of asymmetric quadrature nodes may be more often. Such a divergence has led to varying error analyses of the Lanczos quadratur… ▽ More The Golub-Welsch algorithm [Math. Comp., 23: 221-230 (1969)] for computing Gaussian quadrature rules is of importance in estimating quadratic forms. Quadrature rules based on this algorithm have long been assumed to be symmetric. Recent research indicates that the presence of asymmetric quadrature nodes may be more often. Such a divergence has led to varying error analyses of the Lanczos quadrature method. Since symmetry often implies simplicity, it is of great interest to ask when do Lanczos iterations generate symmetric quadrature rules. This paper derives a sufficient condition that ensures symmetric quadrature nodes which partially answers the question that when the Ritz values of a symmetric matrix are symmetrically distributed. Additionally, we establish both lower and upper bounds on the disparity between the minimum Lanczos iterations required for symmetric and asymmetric quadrature. △ Less

Submitted 31 January, 2024; originally announced January 2024.

Comments: 17 pages, 2 figures

MSC Class: 65C05; 65D32; 65F15; 65F60; 65G99; 65Y20; 68Q10; 68Q87

arXiv:2401.17203 [pdf, other]

CPR++: Object Localization via Single Coarse Point Supervision

Authors: Xuehui Yu, Pengfei Chen, Kuiran Wang, Xumeng Han, Guorong Li, Zhenjun Han, Qixiang Ye, Jianbin Jiao

Abstract: Point-based object localization (POL), which pursues high-performance object sensing under low-cost data annotation, has attracted increased attention. However, the point annotation mode inevitably introduces semantic variance due to the inconsistency of annotated points. Existing POL heavily rely on strict annotation rules, which are difficult to define and apply, to handle the problem. In this s… ▽ More Point-based object localization (POL), which pursues high-performance object sensing under low-cost data annotation, has attracted increased attention. However, the point annotation mode inevitably introduces semantic variance due to the inconsistency of annotated points. Existing POL heavily rely on strict annotation rules, which are difficult to define and apply, to handle the problem. In this study, we propose coarse point refinement (CPR), which to our best knowledge is the first attempt to alleviate semantic variance from an algorithmic perspective. CPR reduces the semantic variance by selecting a semantic centre point in a neighbourhood region to replace the initial annotated point. Furthermore, We design a sampling region estimation module to dynamically compute a sampling region for each object and use a cascaded structure to achieve end-to-end optimization. We further integrate a variance regularization into the structure to concentrate the predicted scores, yielding CPR++. We observe that CPR++ can obtain scale information and further reduce the semantic variance in a global region, thus guaranteeing high-performance object localization. Extensive experiments on four challenging datasets validate the effectiveness of both CPR and CPR++. We hope our work can inspire more research on designing algorithms rather than annotation rules to address the semantic variance problem in POL. The dataset and code will be public at github.com/ucas-vg/PointTinyBenchmark. △ Less

Submitted 30 January, 2024; originally announced January 2024.

Comments: Accpted by TPAMI 2024

arXiv:2401.16566 [pdf, other]

Excitation Trajectory Optimization for Dynamic Parameter Identification Using Virtual Constraints in Hands-on Robotic System

Authors: Huanyu Tian, Martin Huber, Christopher E. Mower, Zhe Han, Changsheng Li, Xingguang Duan, Christos Bergeles

Abstract: This paper proposes a novel, more computationally efficient method for optimizing robot excitation trajectories for dynamic parameter identification, emphasizing self-collision avoidance. This addresses the system identification challenges for getting high-quality training data associated with co-manipulated robotic arms that can be equipped with a variety of tools, a common scenario in industrial… ▽ More This paper proposes a novel, more computationally efficient method for optimizing robot excitation trajectories for dynamic parameter identification, emphasizing self-collision avoidance. This addresses the system identification challenges for getting high-quality training data associated with co-manipulated robotic arms that can be equipped with a variety of tools, a common scenario in industrial but also clinical and research contexts. Utilizing the Unified Robotics Description Format (URDF) to implement a symbolic Python implementation of the Recursive Newton-Euler Algorithm (RNEA), the approach aids in dynamically estimating parameters such as inertia using regression analyses on data from real robots. The excitation trajectory was evaluated and achieved on par criteria when compared to state-of-the-art reported results which didn't consider self-collision and tool calibrations. Furthermore, physical Human-Robot Interaction (pHRI) admittance control experiments were conducted in a surgical context to evaluate the derived inverse dynamics model showing a 30.1\% workload reduction by the NASA TLX questionnaire. △ Less

Submitted 29 January, 2024; originally announced January 2024.

arXiv:2401.15654 [pdf, ps, other]

doi 10.1140/epjc/s10052-024-12856-w

Accretion of matter by a Charged dilaton black hole

Authors: Yinan Jia, Tong-Yu He, Wen-Qian Wang, Zhan-Wen Han, Rong-Jia Yang

Abstract: Considering accretion onto a charged dilaton black hole, the fundamental equations governing accretion, general analytic expressions for critical points, critical velocity, critical speed of sound, and ultimately the mass accretion rate are obtained. A new constraint on the dilation parameter coming from string theory is found and the case for polytropic gas is delved into a detailed discussion. I… ▽ More Considering accretion onto a charged dilaton black hole, the fundamental equations governing accretion, general analytic expressions for critical points, critical velocity, critical speed of sound, and ultimately the mass accretion rate are obtained. A new constraint on the dilation parameter coming from string theory is found and the case for polytropic gas is delved into a detailed discussion. It is found that the dialtion and the adiabatic index of accreted material have deep effects on the accretion process. △ Less

Submitted 18 May, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

Comments: 9 pages, 3 figures

Journal ref: Eur.Phys.J.C 84 (2024) 5, 501

arXiv:2401.14687 [pdf, other]

Heavy Neutral Leptons in Gauged $U(1)_{L_μ-L_τ}$ at Muon Collider

Authors: Ru-Yi He, Jia-Qi Huang, **-Yuan Xu, Fa-Xin Yang, Zhi-Long Han, Feng-Lan Shao

Abstract: Heavy neutral leptons $N$ are the most appealing candidates to generate the tiny neutrino masses. In this paper, we study the signature of heavy neutral leptons in gauged $U(1)_{L_μ-L_τ}$ at a muon collider. Charged under the $U(1)_{L_μ-L_τ}$ symmetry, the heavy neutral leptons can be pair produced via the new gauge boson $Z'$ at muon collider as $μ^+μ^-\to Z^{\prime *}\to NN$ and… ▽ More Heavy neutral leptons $N$ are the most appealing candidates to generate the tiny neutrino masses. In this paper, we study the signature of heavy neutral leptons in gauged $U(1)_{L_μ-L_τ}$ at a muon collider. Charged under the $U(1)_{L_μ-L_τ}$ symmetry, the heavy neutral leptons can be pair produced via the new gauge boson $Z'$ at muon collider as $μ^+μ^-\to Z^{\prime *}\to NN$ and $μ^+μ^-\to Z^{\prime (*)} γ\to NNγ$. We then perform a detailed analysis on the lepton number violation signature $μ^+μ^-\to NN\to μ^\pmμ^\pm W^\mp W^\mp$ and $μ^+μ^-\to NN γ\to μ^\pmμ^\pm W^\mp W^\mp γ$ at the 3 TeV muon collider, where the hadronic decays of $W$ boson are treated as fat-jets $J$. These lepton number violation signatures have quite clean backgrounds at the muon collider. Our simulation shows that a wide range of viable parameter space is within the reach of the 3 TeV muon collider. For instance, with new gauge coupling $g'=0.6$ and an integrated luminosity of 1000 fb$^{-1}$, the $μ^\pmμ^\pm JJ$ signal could probe $m_{Z'}\lesssim 12.5$ TeV. Meanwhile, if the gauge boson mass satisfies $2 m_N<m_{Z'}<\sqrt{s}$, the $μ^\pmμ^\pm JJγ$ signature would be more promising than the $μ^\pmμ^\pm JJ$ signature. △ Less

Submitted 26 January, 2024; originally announced January 2024.

Comments: 20 pages, 8 figures, 2 tables

arXiv:2401.14612 [pdf, ps, other]

On Inhomogeneous Infinite Products of Stochastic Matrices and Applications

Authors: Zhaoyue Xia, Jun Du, Chunxiao Jiang, H. Vincent Poor, Zhu Han, Yong Ren

Abstract: With the growth of magnitude of multi-agent networks, distributed optimization holds considerable significance within complex systems. Convergence, a pivotal goal in this domain, is contingent upon the analysis of infinite products of stochastic matrices (IPSMs). In this work, convergence properties of inhomogeneous IPSMs are investigated. The convergence rate of inhomogeneous IPSMs towards an abs… ▽ More With the growth of magnitude of multi-agent networks, distributed optimization holds considerable significance within complex systems. Convergence, a pivotal goal in this domain, is contingent upon the analysis of infinite products of stochastic matrices (IPSMs). In this work, convergence properties of inhomogeneous IPSMs are investigated. The convergence rate of inhomogeneous IPSMs towards an absolute probability sequence $π$ is derived. We also show that the convergence rate is nearly exponential, which coincides with existing results on ergodic chains. The methodology employed relies on delineating the interrelations among Sarymsakov matrices, scrambling matrices, and positive-column matrices. Based on the theoretical results on inhomogeneous IPSMs, we propose a decentralized projected subgradient method for time-varying multi-agent systems with graph-related stretches in (sub)gradient descent directions. The convergence of the proposed method is established for convex objective functions, and extended to non-convex objectives that satisfy Polyak-Lojasiewicz conditions. To corroborate the theoretical findings, we conduct numerical simulations, aligning the outcomes with the established theoretical framework. △ Less

Submitted 25 January, 2024; originally announced January 2024.

arXiv:2401.12429 [pdf, other]

A new route to massive hot subdwarfs: common envelope ejection from asymptotic giant branch stars

Authors: Zhenwei Li, Yangyang Zhang, Hailiang Chen, Hongwei Ge, Dengkai Jiang, Jiangdan Li, Xuefei Chen, Zhanwen Han

Abstract: The hot subdwarf O/B stars (sdO/Bs) are known as extreme horizontal branch stars, which is of great importance in stellar evolution theory. The sdO/Bs are generally thought to have a helium-burning core and a thin hydrogen envelope $(M_{\rm env }<0.02M_\odot)$. In the canonical binary evolution scenario, sdO/Bs are considered to be the stripped cores of red giants. However, such a scenario cannot… ▽ More The hot subdwarf O/B stars (sdO/Bs) are known as extreme horizontal branch stars, which is of great importance in stellar evolution theory. The sdO/Bs are generally thought to have a helium-burning core and a thin hydrogen envelope $(M_{\rm env }<0.02M_\odot)$. In the canonical binary evolution scenario, sdO/Bs are considered to be the stripped cores of red giants. However, such a scenario cannot explain the recently discovered sdO/B binary, SMSS J1920, where the strong Ca H$\&$K lines in the spectrum are found. It suggests that this binary is likely originated from the recent ejection of common envelope (CE). In this {work}, we proposed a new formation channel of massive sdO/Bs, namely sdO/Bs produced from a CE ejection process with an asymptotic giant branch (AGB) star (hereafter AGB CE channel). We constructed the evolutionary model of sdO/Bs and successfully explained most of the important observed parameters of the sdO/B star in SMSS J1920, including the evolutionary age, sdO/B mass, effective temperature, surface gravity and surface helium abundance. The minimum sdO/B mass produced from the AGB CE channel is about $0.48M_\odot$. The evolutionary tracks in $\log T_{\rm eff}-\log g$ plane {may explain a fraction of the observational samples} with high-$\log T_{\rm eff}$ and low-$\log g$. Considering wind mass-loss of sdO/Bs, the model could produce helium-rich hot subdwarfs with $\log (n_{\rm He}/n_{\rm H})\gtrsim-1$. △ Less

Submitted 22 January, 2024; originally announced January 2024.

Comments: 9 pages, 5 figures, accepted for publication in ApJ

arXiv:2401.11940 [pdf, other]

Low-Tubal-Rank Tensor Recovery via Factorized Gradient Descent

Authors: Zhiyu Liu, Zhi Han, Yandong Tang, Xi-Le Zhao, Yao Wang

Abstract: This paper considers the problem of recovering a tensor with an underlying low-tubal-rank structure from a small number of corrupted linear measurements. Traditional approaches tackling such a problem require the computation of tensor Singular Value Decomposition (t-SVD), that is a computationally intensive process, rendering them impractical for dealing with large-scale tensors. Aim to address th… ▽ More This paper considers the problem of recovering a tensor with an underlying low-tubal-rank structure from a small number of corrupted linear measurements. Traditional approaches tackling such a problem require the computation of tensor Singular Value Decomposition (t-SVD), that is a computationally intensive process, rendering them impractical for dealing with large-scale tensors. Aim to address this challenge, we propose an efficient and effective low-tubal-rank tensor recovery method based on a factorization procedure akin to the Burer-Monteiro (BM) method. Precisely, our fundamental approach involves decomposing a large tensor into two smaller factor tensors, followed by solving the problem through factorized gradient descent (FGD). This strategy eliminates the need for t-SVD computation, thereby reducing computational costs and storage requirements. We provide rigorous theoretical analysis to ensure the convergence of FGD under both noise-free and noisy situations. Additionally, it is worth noting that our method does not require the precise estimation of the tensor tubal-rank. Even in cases where the tubal-rank is slightly overestimated, our approach continues to demonstrate robust performance. A series of experiments have been carried out to demonstrate that, as compared to other popular ones, our approach exhibits superior performance in multiple scenarios, in terms of the faster computational speed and the smaller convergence error. △ Less

Submitted 2 February, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

Comments: 13 pages, 4 figures

arXiv:2401.11724 [pdf, other]

Augmenting Prototype Network with TransMix for Few-shot Hyperspectral Image Classification

Authors: Chun Liu, Longwei Yang, Dongmei Dong, Zheng Li, Wei Yang, Zhigang Han, Jiayao Wang

Abstract: Few-shot hyperspectral image classification aims to identify the classes of each pixel in the images by only marking few of these pixels. And in order to obtain the spatial-spectral joint features of each pixel, the fixed-size patches centering around each pixel are often used for classification. However, observing the classification results of existing methods, we found that boundary patches corr… ▽ More Few-shot hyperspectral image classification aims to identify the classes of each pixel in the images by only marking few of these pixels. And in order to obtain the spatial-spectral joint features of each pixel, the fixed-size patches centering around each pixel are often used for classification. However, observing the classification results of existing methods, we found that boundary patches corresponding to the pixels which are located at the boundary of the objects in the hyperspectral images, are hard to classify. These boundary patchs are mixed with multi-class spectral information. Inspired by this, we propose to augment the prototype network with TransMix for few-shot hyperspectrial image classification(APNT). While taking the prototype network as the backbone, it adopts the transformer as feature extractor to learn the pixel-to-pixel relation and pay different attentions to different pixels. At the same time, instead of directly using the patches which are cut from the hyperspectral images for training, it randomly mixs up two patches to imitate the boundary patches and uses the synthetic patches to train the model, with the aim to enlarge the number of hard training samples and enhance their diversity. And by following the data agumentation technique TransMix, the attention returned by the transformer is also used to mix up the labels of two patches to generate better labels for synthetic patches. Compared with existing methods, the proposed method has demonstrated sate of the art performance and better robustness for few-shot hyperspectral image classification in our experiments. △ Less

Submitted 22 January, 2024; originally announced January 2024.

arXiv:2401.11520 [pdf, ps, other]

Is it a Real CD Mismatch in Interdomain Routing?

Authors: Sun Letong, Shi Xingang, Han Fengyan, Yin Xia, Wang Zhiliang, Zhang Han

Abstract: In inter-domain routing, a packet is not always forwarded along the Autonomous System (AS) level path determined by the BGP routing protocol. This is often called control-plane and data-plane (CD) mismatch, which allows for flexible traffic control, but also leads to operation and security issues. We systematically analyze this phenomenon with path pairs collected from 128 pairs of vantage points… ▽ More In inter-domain routing, a packet is not always forwarded along the Autonomous System (AS) level path determined by the BGP routing protocol. This is often called control-plane and data-plane (CD) mismatch, which allows for flexible traffic control, but also leads to operation and security issues. We systematically analyze this phenomenon with path pairs collected from 128 pairs of vantage points over more than 5 years, and use multiple IP-to-AS map** methods to compare CD paths. What is interesting is that, working at such a large scale in turn helps us design a novel method to fairly evaluate the accuracy of various existing map** methods, and further develop a new map** method, i.e., LearnToCorrect, that can correct more than 70\% map** errors of the state-of-the-art one. Then we devise to identify real mismatches with LearnToCorrect, and estimate that the real-mismatch ratio in the wild is typically less than 6\%. At last, we use our proposed methods to detect routing security issues, which are previously difficult to accurately find out. △ Less

Submitted 21 January, 2024; originally announced January 2024.

arXiv:2401.11225 [pdf, ps, other]

Protecting Personalized Trajectory with Differential Privacy under Temporal Correlations

Authors: Mingge Cao, Haopeng Zhu, Minghui Min, Yulu Li, Shiyin Li, Hongliang Zhang, Zhu Han

Abstract: Location-based services (LBSs) in vehicular ad hoc networks (VANETs) offer users numerous conveniences. However, the extensive use of LBSs raises concerns about the privacy of users' trajectories, as adversaries can exploit temporal correlations between different locations to extract personal information. Additionally, users have varying privacy requirements depending on the time and location. To… ▽ More Location-based services (LBSs) in vehicular ad hoc networks (VANETs) offer users numerous conveniences. However, the extensive use of LBSs raises concerns about the privacy of users' trajectories, as adversaries can exploit temporal correlations between different locations to extract personal information. Additionally, users have varying privacy requirements depending on the time and location. To address these issues, this paper proposes a personalized trajectory privacy protection mechanism (PTPPM). This mechanism first uses the temporal correlation between trajectory locations to determine the possible location set for each time instant. We identify a protection location set (PLS) for each location by employing the Hilbert curve-based minimum distance search algorithm. This approach incorporates the complementary features of geo-indistinguishability and distortion privacy. We put forth a novel Permute-and-Flip mechanism for location perturbation, which maps its initial application in data publishing privacy protection to a location perturbation mechanism. This mechanism generates fake locations with smaller perturbation distances while improving the balance between privacy and quality of service (QoS). Simulation results show that our mechanism outperforms the benchmark by providing enhanced privacy protection while meeting user's QoS requirements. △ Less

Submitted 20 January, 2024; originally announced January 2024.

arXiv:2401.09716 [pdf, other]

HCVP: Leveraging Hierarchical Contrastive Visual Prompt for Domain Generalization

Authors: Guanglin Zhou, Zhongyi Han, Shiming Chen, Biwei Huang, Liming Zhu, Tongliang Liu, Lina Yao, Kun Zhang

Abstract: Domain Generalization (DG) endeavors to create machine learning models that excel in unseen scenarios by learning invariant features. In DG, the prevalent practice of constraining models to a fixed structure or uniform parameterization to encapsulate invariant features can inadvertently blend specific aspects. Such an approach struggles with nuanced differentiation of inter-domain variations and m… ▽ More Domain Generalization (DG) endeavors to create machine learning models that excel in unseen scenarios by learning invariant features. In DG, the prevalent practice of constraining models to a fixed structure or uniform parameterization to encapsulate invariant features can inadvertently blend specific aspects. Such an approach struggles with nuanced differentiation of inter-domain variations and may exhibit bias towards certain domains, hindering the precise learning of domain-invariant features. Recognizing this, we introduce a novel method designed to supplement the model with domain-level and task-specific characteristics. This approach aims to guide the model in more effectively separating invariant features from specific characteristics, thereby boosting the generalization. Building on the emerging trend of visual prompts in the DG paradigm, our work introduces the novel \textbf{H}ierarchical \textbf{C}ontrastive \textbf{V}isual \textbf{P}rompt (HCVP) methodology. This represents a significant advancement in the field, setting itself apart with a unique generative approach to prompts, alongside an explicit model structure and specialized loss functions. Differing from traditional visual prompts that are often shared across entire datasets, HCVP utilizes a hierarchical prompt generation network enhanced by prompt contrastive learning. These generative prompts are instance-dependent, catering to the unique characteristics inherent to different domains and tasks. Additionally, we devise a prompt modulation network that serves as a bridge, effectively incorporating the generated visual prompts into the vision transformer backbone. Experiments conducted on five DG datasets demonstrate the effectiveness of HCVP, outperforming both established DG algorithms and adaptation protocols. △ Less

Submitted 17 January, 2024; originally announced January 2024.

arXiv:2401.09709 [pdf, other]

P2Seg: Pointly-supervised Segmentation via Mutual Distillation

Authors: Zipeng Wang, Xuehui Yu, Xumeng Han, Wenwen Yu, Zhixun Huang, Jianbin Jiao, Zhenjun Han

Abstract: Point-level Supervised Instance Segmentation (PSIS) aims to enhance the applicability and scalability of instance segmentation by utilizing low-cost yet instance-informative annotations. Existing PSIS methods usually rely on positional information to distinguish objects, but predicting precise boundaries remains challenging due to the lack of contour annotations. Nevertheless, weakly supervised se… ▽ More Point-level Supervised Instance Segmentation (PSIS) aims to enhance the applicability and scalability of instance segmentation by utilizing low-cost yet instance-informative annotations. Existing PSIS methods usually rely on positional information to distinguish objects, but predicting precise boundaries remains challenging due to the lack of contour annotations. Nevertheless, weakly supervised semantic segmentation methods are proficient in utilizing intra-class feature consistency to capture the boundary contours of the same semantic regions. In this paper, we design a Mutual Distillation Module (MDM) to leverage the complementary strengths of both instance position and semantic information and achieve accurate instance-level object perception. The MDM consists of Semantic to Instance (S2I) and Instance to Semantic (I2S). S2I is guided by the precise boundaries of semantic regions to learn the association between annotated points and instance contours. I2S leverages discriminative relationships between instances to facilitate the differentiation of various objects within the semantic map. Extensive experiments substantiate the efficacy of MDM in fostering the synergy between instance and semantic information, consequently improving the quality of instance-level object representations. Our method achieves 55.7 mAP$_{50}$ and 17.6 mAP on the PASCAL VOC and MS COCO datasets, significantly outperforming recent PSIS methods and several box-supervised instance segmentation competitors. △ Less

Submitted 17 January, 2024; originally announced January 2024.

Comments: 14 pages, 12 figures, published to ICLR2024

arXiv:2401.09036 [pdf, other]

IRS-Enhanced Anti-Jamming Precoding Against DISCO Physical Layer Jamming Attacks

Authors: Huan Huang, Hongliang Zhang, Yi Cai, Yun**g Zhang, A. Lee Swindlehurst, Zhu Han

Abstract: Illegitimate intelligent reflective surfaces (IRSs) can pose significant physical layer security risks on multi-user multiple-input single-output (MU-MISO) systems. Recently, a DISCO approach has been proposed an illegitimate IRS with random and time-varying reflection coefficients, referred to as a "disco" IRS (DIRS). Such DIRS can attack MU-MISO systems without relying on either jamming power or… ▽ More Illegitimate intelligent reflective surfaces (IRSs) can pose significant physical layer security risks on multi-user multiple-input single-output (MU-MISO) systems. Recently, a DISCO approach has been proposed an illegitimate IRS with random and time-varying reflection coefficients, referred to as a "disco" IRS (DIRS). Such DIRS can attack MU-MISO systems without relying on either jamming power or channel state information (CSI), and classical anti-jamming techniques are ineffective for the DIRS-based fully-passive jammers (DIRS-based FPJs). In this paper, we propose an IRS-enhanced anti-jamming precoder against DIRS-based FPJs that requires only statistical rather than instantaneous CSI of the DIRS-jammed channels. Specifically, a legitimate IRS is introduced to reduce the strength of the DIRS-based jamming relative to the transmit signals at a legitimate user (LU). In addition, the active beamforming at the legitimate access point (AP) is designed to maximize the signal-to-jamming-plus-noise ratios (SJNRs). Numerical results are presented to evaluate the effectiveness of the proposed IRS-enhanced anti-jamming precoder against DIRS-based FPJs. △ Less

Submitted 17 January, 2024; originally announced January 2024.

Comments: This paper has been accepted by IEEE ICC 2024

arXiv:2401.08956 [pdf, other]

doi 10.1109/TAES.2023.3260059

A Unified NOMA Framework in Beam-Hop** Satellite Communication Systems

Authors: Xuyang Zhang, Xinwei Yue, Tian Li, Zhihao Han, Yafei Wang, Yong Ding, Rongke Liu

Abstract: This paper investigates the application of a unified non-orthogonal multiple access framework in beam hop** (U-NOMA-BH) based satellite communication systems. More specifically, the proposed U-NOMA-BH framework can be applied to code-domain NOMA based BH (CD-NOMA-BH) and power-domain NOMA based BH (PD-NOMA-BH) systems. To satisfy dynamic-uneven traffic demands, we formulate the optimization prob… ▽ More This paper investigates the application of a unified non-orthogonal multiple access framework in beam hop** (U-NOMA-BH) based satellite communication systems. More specifically, the proposed U-NOMA-BH framework can be applied to code-domain NOMA based BH (CD-NOMA-BH) and power-domain NOMA based BH (PD-NOMA-BH) systems. To satisfy dynamic-uneven traffic demands, we formulate the optimization problem to minimize the square of discrete difference by jointly optimizing power allocation, carrier assignment and beam scheduling. The non-convexity of the objective function and the constraint condition is solved through Dinkelbach's transform and variable relaxation. As a further development, the closed-from and asymptotic expressions of outage probability are derived for CD/PD-NOMA-BH systems. Based on approximated results, the diversity orders of a pair of users are obtained in detail. In addition, the system throughput of U-NOMA-BH is discussed in delay-limited transmission mode. Numerical results verify that: i) The gap between traffic requests of CD/PD-NOMA-BH systems appears to be more closely compared with orthogonal multiple access based BH (OMA-BH); ii) The CD-NOMA-BH system is capable of providing the enhanced traffic request and capacity provision; and iii) The outage behaviors of CD/PD-NOMA-BH are better than that of OMA-BH. △ Less

Submitted 16 January, 2024; originally announced January 2024.

Journal ref: IEEE Transactions on Aerospace and Electronic Systems, vol. 59, no. 5, pp. 5390-5404, Oct. 2023

arXiv:2401.07764 [pdf, other]

When Large Language Model Agents Meet 6G Networks: Perception, Grounding, and Alignment

Authors: Minrui Xu, Dusit Niyato, Jiawen Kang, Zehui Xiong, Shiwen Mao, Zhu Han, Dong In Kim, Khaled B. Letaief

Abstract: AI agents based on multimodal large language models (LLMs) are expected to revolutionize human-computer interaction and offer more personalized assistant services across various domains like healthcare, education, manufacturing, and entertainment. Deploying LLM agents in 6G networks enables users to access previously expensive AI assistant services via mobile devices democratically, thereby reduci… ▽ More AI agents based on multimodal large language models (LLMs) are expected to revolutionize human-computer interaction and offer more personalized assistant services across various domains like healthcare, education, manufacturing, and entertainment. Deploying LLM agents in 6G networks enables users to access previously expensive AI assistant services via mobile devices democratically, thereby reducing interaction latency and better preserving user privacy. Nevertheless, the limited capacity of mobile devices constrains the effectiveness of deploying and executing local LLMs, which necessitates offloading complex tasks to global LLMs running on edge servers during long-horizon interactions. In this article, we propose a split learning system for LLM agents in 6G networks leveraging the collaboration between mobile devices and edge servers, where multiple LLMs with different roles are distributed across mobile devices and edge servers to perform user-agent interactive tasks collaboratively. In the proposed system, LLM agents are split into perception, grounding, and alignment modules, facilitating inter-module communications to meet extended user requirements on 6G network functions, including integrated sensing and communication, digital twins, and task-oriented communications. Furthermore, we introduce a novel model caching algorithm for LLMs within the proposed system to improve model utilization in context, thus reducing network costs of the collaborative mobile and edge LLM agents. △ Less

Submitted 16 February, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

arXiv:2401.07754 [pdf, ps, other]

Passive Beamforming For Practical RIS-Assisted Communication Systems With Non-Ideal Hardware

Authors: Yiming Liu, Rui Wang, Zhu Han

Abstract: Reconfigurable intelligent surface (RIS) technology is a promising solution to improve the performance of existing wireless communications. To achieve its cost-effectiveness advantage, there inevitably exist certain hardware impairments in the system. Therefore, it is more reasonable to design passive beamforming in this scenario. Some existing research has considered such problems under transceiv… ▽ More Reconfigurable intelligent surface (RIS) technology is a promising solution to improve the performance of existing wireless communications. To achieve its cost-effectiveness advantage, there inevitably exist certain hardware impairments in the system. Therefore, it is more reasonable to design passive beamforming in this scenario. Some existing research has considered such problems under transceiver impairments. However, their performance still leaves room for improvement, possibly due to their algorithms not properly handling the fractional structure of the objective function. To address this, the passive beamforming is redesigned in this correspondence paper, taking into account both transceiver impairments and the practical phase-shift model. We tackle the fractional structure of the problem by employing the quadratic transform. The remaining sub-problems are addressed utilizing the penalty-based method and the difference-of-convex programming. Since we provide closed-form solutions for all sub-problems, our algorithm is highly efficient. The simulation results demonstrate the superiority of our proposed algorithm. △ Less

Submitted 15 January, 2024; originally announced January 2024.

arXiv:2401.05103 [pdf, other]

Electron-capture supernovae in NS+He star systems and the double neutron star systems

Authors: Yun-Lang Guo, Bo Wang, Wen-Cong Chen, Xiang-Dong Li, Hong-Wei Ge, Long Jiang, Zhan-Wen Han

Abstract: Electron-capture supernovae (EC-SNe) provide an alternative channel for producing neutron stars (NSs). They play an important role in the formation of double NS (DNS) systems and the chemical evolution of galaxies, and contribute to the NS mass distribution in observations. It is generally believed that EC-SNe originate from $e$-captures on $\rm^{24}Mg$ and $\rm^{20}Ne$ in the massive degenerate o… ▽ More Electron-capture supernovae (EC-SNe) provide an alternative channel for producing neutron stars (NSs). They play an important role in the formation of double NS (DNS) systems and the chemical evolution of galaxies, and contribute to the NS mass distribution in observations. It is generally believed that EC-SNe originate from $e$-captures on $\rm^{24}Mg$ and $\rm^{20}Ne$ in the massive degenerate oxygen-neon (ONe) cores with masses close to the Chandrasekhar limit ($M_{\rm Ch}$). However, the origin of EC-SNe is still uncertain. In this paper, we systematically studied the EC-SNe in NS+He star systems by considering the explosive oxygen burning that may occur in the near-$M_{\rm Ch}$ ONe core. We provided the initial parameter spaces for producing EC-SNe in the initial orbital period $-$ initial He star mass (log$P_{\rm orb}^{\rm i}-M_{\rm He}^{\rm i}$) diagram, and found that both $M_{\rm He}^{\rm i}$ and minimum $P_{\rm orb}^{\rm i}$ for EC-SNe increase with metallicity. Then, by considering NS kicks added to the newborn NS, we investigated the properties of the formed DNS systems after the He star companions collapse into NSs, such as the orbital periods, eccentricities and spin periods of recycle pulsars ($P_{\rm spin}$), etc. The results show that most of the observed DNS systems can be produced by NS kicks of $\lesssim50\rm\,km\,s^{-1}$. In addition, we found that NSs could accrete more material if the residual H envelope on the He star companions is considered, which can form the mildly recycled pulsars ($P_{\rm spin}\sim20\,$ms) in DNS systems. △ Less

Submitted 23 April, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

Comments: 13 pages, 14 figures, 3 tables, accepted for publication in MNRAS, comments welcome!

arXiv:2401.04395 [pdf, ps, other]

Formation of millisecond pulsars with wide orbits

Authors: Bo Wang, Dongdong Liu, Yunlang Guo, Hailiang Chen, Wenshi Tang, Luhan Li, Zhanwen Han

Abstract: Millisecond pulsars (MSPs) are a kind of radio pulsars with short spin periods, playing a key role in many aspects of stellar astrophysics. In recent years, some more MSPs with wide orbits ($>30\,\rm d$) have been discovered, but their origin is still highly unclear. In the present work, according to an adiabatic power-law assumption for the mass-transfer process, we carried out a large number of… ▽ More Millisecond pulsars (MSPs) are a kind of radio pulsars with short spin periods, playing a key role in many aspects of stellar astrophysics. In recent years, some more MSPs with wide orbits ($>30\,\rm d$) have been discovered, but their origin is still highly unclear. In the present work, according to an adiabatic power-law assumption for the mass-transfer process, we carried out a large number of complete binary evolution computations for the formation of MSPs with wide orbits through the iron core-collapse supernova (CCSN) channel, in which a neutron star (NS) originating from a CCSN accretes matter from a red-giant (RG) star and spun up to millisecond periods. We found that this channel can form the observed MSPs with wide orbits in the range of $30-1200\,{\rm d}$, in which the WD companions have masses in the range of $0.28-0.55\,\rm M_{\odot}$. We also found that almost all the observed MSPs can be reproduced by this channel in the WD companion mass versus orbital period diagram. We estimate that the Galactic numbers of the resulting MSPs from the CCSN channel are in the range of $\sim 0.9-1.4\times10^{6}$. Compared with the accretion-induced collapse channel, the CCSN channel provides a dominant way to produce MSPs with wide orbits. △ Less

Submitted 9 January, 2024; originally announced January 2024.

Comments: 17 pages, 7 figures, submitted to MNRAS, a revised version after referee's comments

arXiv:2401.04163 [pdf, other]

"Quantum Geometric Nesting'' and Solvable Model Flat-Band Systems

Authors: Zhaoyu Han, Jonah Herzog-Arbeitman, B. Andrei Bernevig, Steven A. Kivelson

Abstract: We introduce the concept of "quantum geometric nesting'' (QGN) to characterize the idealized ordering tendencies of certain flat-band systems implicit in the geometric structure of the flat-band subspace. Perfect QGN implies the existence of an infinite class of local interactions that can be explicitly constructed and give rise to solvable ground states with various forms of possible fermion bi-l… ▽ More We introduce the concept of "quantum geometric nesting'' (QGN) to characterize the idealized ordering tendencies of certain flat-band systems implicit in the geometric structure of the flat-band subspace. Perfect QGN implies the existence of an infinite class of local interactions that can be explicitly constructed and give rise to solvable ground states with various forms of possible fermion bi-linear order, including flavor ferromagnetism, density waves, and superconductivity. For the ideal Hamiltonians constructed in this way, we show that certain aspects of the low-energy spectrum can also be exactly computed including, in the superconducting case, the phase stiffness. Examples of perfect QGN include flat bands with certain symmetries (e.g. chiral or time-reversal), and non-symmetry-related cases exemplified with an engineered model for pair-density-wave. Extending this approach, we obtain exact superconducting ground states with nontrivial pairing symmetry. △ Less

Submitted 27 March, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

arXiv:2401.03158 [pdf, other]

Quartet Logic: A Four-Step Reasoning (QLFR) framework for advancing Short Text Classification

Authors: Hui Wu, Yuanben Zhang, Zhonghe Han, Yingyan Hou, Lei Wang, Siye Liu, Qihang Gong, Yun** Ge

Abstract: Short Text Classification (STC) is crucial for processing and comprehending the brief but substantial content prevalent on contemporary digital platforms. The STC encounters difficulties in gras** semantic and syntactic intricacies, an issue that is apparent in traditional pre-trained language models. Although Graph Convolutional Networks enhance performance by integrating external knowledge bas… ▽ More Short Text Classification (STC) is crucial for processing and comprehending the brief but substantial content prevalent on contemporary digital platforms. The STC encounters difficulties in gras** semantic and syntactic intricacies, an issue that is apparent in traditional pre-trained language models. Although Graph Convolutional Networks enhance performance by integrating external knowledge bases, these methods are limited by the quality and extent of the knowledge applied. Recently, the emergence of Large Language Models (LLMs) and Chain-of-Thought (CoT) has significantly improved the performance of complex reasoning tasks. However, some studies have highlighted the limitations of their application in fundamental NLP tasks. Consequently, this study sought to employ CoT to investigate the capabilities of LLMs in STC tasks. This study introduces Quartet Logic: A Four-Step Reasoning (QLFR) framework. This framework primarily incorporates Syntactic and Semantic Enrichment CoT, effectively decomposing the STC task into four distinct steps: (i) essential concept identification, (ii) common-sense knowledge retrieval, (iii) text rewriting, and (iv) classification. This elicits the inherent knowledge and abilities of LLMs to address the challenges in STC. Surprisingly, we found that QLFR can also improve the performance of smaller models. Therefore, we developed a CoT-Driven Multi-task learning (QLFR-CML) method to facilitate the knowledge transfer from LLMs to smaller models. Extensive experimentation across six short-text benchmarks validated the efficacy of the proposed methods. Notably, QLFR achieved state-of-the-art performance on all datasets, with significant improvements, particularly on the Ohsumed and TagMyNews datasets. △ Less

Submitted 6 January, 2024; originally announced January 2024.

arXiv:2401.02304 [pdf, ps, other]

Sending-or-not-sending quantum key distribution with phase postselection

Authors: Yang-Guang Shan, Yao Zhou, Zhen-Qiang Yin, Shuang Wang, Wei Chen, De-Yong He, Guang-Can Guo, Zheng-Fu Han

Abstract: Quantum key distribution (QKD) could help to share secure key between two distant peers. In recent years, twin-field (TF) QKD has been widely investigated because of its long transmission distance. One of the popular variants of TF QKD is sending-or-not-sending (SNS) QKD, which has been experimentally verified to realize 1000-km level fibre key distribution. In this article, the authors introduce… ▽ More Quantum key distribution (QKD) could help to share secure key between two distant peers. In recent years, twin-field (TF) QKD has been widely investigated because of its long transmission distance. One of the popular variants of TF QKD is sending-or-not-sending (SNS) QKD, which has been experimentally verified to realize 1000-km level fibre key distribution. In this article, the authors introduce phase postselection into the SNS protocol. With this modification, the probability of selecting "sending" can be substantially improved. The numerical simulation shows that the transmission distance can be improved both with and without the actively odd-parity pairing method. With discrete phase randomization, the variant can have both a larger key rate and a longer distance. △ Less

Submitted 9 January, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

arXiv:2401.01703 [pdf, other]

Proposal of detecting topological transition of quantum braiding in three-fold degenerate eigen subspace

Authors: Zhi-Wei Han, Jia-Hao Liang, Zhao-Xin Fu, Hong-Zhi Liu, Zi-Yuan Chen, Meng Wang, Ze-Rui He, Jia-Yi Huang, Qing-Xian Lv, Kai-Yu Liao, Yan-Xiong Du

Abstract: The braiding operations of quantum states have attracted substantial attention due to their great potential for realizing topological quantum computations. In this paper, we show that a three-fold degenerate eigen subspace can be obtained in a four-level Hamiltonian which is the minimal physical system. Braiding operations are proposed to apply to dressed states in the subspace. The topology of th… ▽ More The braiding operations of quantum states have attracted substantial attention due to their great potential for realizing topological quantum computations. In this paper, we show that a three-fold degenerate eigen subspace can be obtained in a four-level Hamiltonian which is the minimal physical system. Braiding operations are proposed to apply to dressed states in the subspace. The topology of the braiding diagram can be characterized through physical methods once that the sequential braiding pulses are adopted. We establish an equivalent relationship function between the permutation group and the output states where different output states correspond to different values of the function. The topological transition of the braiding happens when two operations overlap, which is detectable through the measurement of the function. Combined with the phase variation method, we can analyze the wringing pattern of the braiding. Therefore, the experimentally-feasible system provides a platform to investigate braiding dynamics, the SU(3) physics and the qutrit gates. △ Less

Submitted 3 January, 2024; originally announced January 2024.

Comments: 10 pages, 6 figures

arXiv:2401.01486 [pdf, other]

Anomalous Landau level gaps near magnetic transitions in monolayer WSe$_2$

Authors: Benjamin A. Foutty, Vladimir Calvera, Zhaoyu Han, Carlos R. Kometter, Song Liu, Kenji Watanabe, Takashi Taniguchi, James C. Hone, Steven A. Kivelson, Benjamin E. Feldman

Abstract: First-order phase transitions produce abrupt changes to the character of both ground and excited electronic states. Here we conduct electronic compressibility measurements to map the spin phase diagram and Landau level (LL) energies of monolayer WSe$_2$ in a magnetic field. We resolve a sequence of first-order phase transitions between completely spin-polarized LLs and states with LLs of both spin… ▽ More First-order phase transitions produce abrupt changes to the character of both ground and excited electronic states. Here we conduct electronic compressibility measurements to map the spin phase diagram and Landau level (LL) energies of monolayer WSe$_2$ in a magnetic field. We resolve a sequence of first-order phase transitions between completely spin-polarized LLs and states with LLs of both spins. Unexpectedly, the LL gaps are roughly constant over a wide range of magnetic fields below the transitions, which we show reflects a preference for opposite spin excitations of the spin-polarized ground state. These transitions also extend into compressible regimes, with a sawtooth boundary between full and partial spin polarization. We link these observations to the important influence of LL filling on the exchange energy beyond a smooth density-dependent contribution. Our results show that WSe$_2$ realizes a unique hierarchy of energy scales where such effects induce re-entrant magnetic phase transitions tuned by density and magnetic field. △ Less

Submitted 2 January, 2024; originally announced January 2024.

arXiv:2401.01140 [pdf, ps, other]

Joint Offloading and Resource Allocation for Hybrid Cloud and Edge Computing in SAGINs: A Decision Assisted Hybrid Action Space Deep Reinforcement Learning Approach

Authors: Chong Huang, Gaojie Chen, Pei Xiao, Yue Xiao, Zhu Han, Jonathon A. Chambers

Abstract: In recent years, the amalgamation of satellite communications and aerial platforms into space-air-ground integrated network (SAGINs) has emerged as an indispensable area of research for future communications due to the global coverage capacity of low Earth orbit (LEO) satellites and the flexible Deployment of aerial platforms. This paper presents a deep reinforcement learning (DRL)-based approach… ▽ More In recent years, the amalgamation of satellite communications and aerial platforms into space-air-ground integrated network (SAGINs) has emerged as an indispensable area of research for future communications due to the global coverage capacity of low Earth orbit (LEO) satellites and the flexible Deployment of aerial platforms. This paper presents a deep reinforcement learning (DRL)-based approach for the joint optimization of offloading and resource allocation in hybrid cloud and multi-access edge computing (MEC) scenarios within SAGINs. The proposed system considers the presence of multiple satellites, clouds and unmanned aerial vehicles (UAVs). The multiple tasks from ground users are modeled as directed acyclic graphs (DAGs). With the goal of reducing energy consumption and latency in MEC, we propose a novel multi-agent algorithm based on DRL that optimizes both the offloading strategy and the allocation of resources in the MEC infrastructure within SAGIN. A hybrid action algorithm is utilized to address the challenge of hybrid continuous and discrete action space in the proposed problems, and a decision-assisted DRL method is adopted to reduce the impact of unavailable actions in the training process of DRL. Through extensive simulations, the results demonstrate the efficacy of the proposed learning-based scheme, the proposed approach consistently outperforms benchmark schemes, highlighting its superior performance and potential for practical applications. △ Less

Submitted 2 January, 2024; originally announced January 2024.

Comments: 15 pages, accepted for publication in IEEE Journal on Selected Areas in Communications

arXiv:2312.17446 [pdf, other]

doi 10.1109/TWC.2023.3347537

ClST: A Convolutional Transformer Framework for Automatic Modulation Recognition by Knowledge Distillation

Authors: Dongbin Hou, Lixin Li, Wensheng Lin, Junli Liang, Zhu Han

Abstract: With the rapid development of deep learning (DL) in recent years, automatic modulation recognition (AMR) with DL has achieved high accuracy. However, insufficient training signal data in complicated channel environments and large-scale DL models are critical factors that make DL methods difficult to deploy in practice. Aiming to these problems, we propose a novel neural network named convolution-l… ▽ More With the rapid development of deep learning (DL) in recent years, automatic modulation recognition (AMR) with DL has achieved high accuracy. However, insufficient training signal data in complicated channel environments and large-scale DL models are critical factors that make DL methods difficult to deploy in practice. Aiming to these problems, we propose a novel neural network named convolution-linked signal transformer (ClST) and a novel knowledge distillation method named signal knowledge distillation (SKD). The ClST is accomplished through three primary modifications: a hierarchy of transformer containing convolution, a novel attention mechanism named parallel spatial-channel attention (PSCA) mechanism and a novel convolutional transformer block named convolution-transformer projection (CTP) to leverage a convolutional projection. The SKD is a knowledge distillation method to effectively reduce the parameters and complexity of neural networks. We train two lightweight neural networks using the SKD algorithm, KD-CNN and KD-MobileNet, to meet the demand that neural networks can be used on miniaturized devices. The simulation results demonstrate that the ClST outperforms advanced neural networks on all datasets. Moreover, both KD-CNN and KD-MobileNet obtain higher recognition accuracy with less network complexity, which is very beneficial for the deployment of AMR on miniaturized communication devices. △ Less

Submitted 28 December, 2023; originally announced December 2023.

arXiv:2312.17427 [pdf, other]

Phenomenology of Heavy Neutral Gauge Boson at Muon Collider

Authors: Zongyang Lu, Honglei Li, Zhi-Long Han, Zong-Guo Si, Liuxin Zhao

Abstract: Heavy neutral gauge boson $Z^\prime$ is proposed in many new physics models. It has rich phenomena at the future muon collider. We study the properties of $Z^\prime$ boson with the process of $μ^+ μ^- \rightarrow q \bar{q}$, $μ^+ μ^- \rightarrow l^+ l^-$, $μ^+ μ^- \rightarrow Z H$ and $μ^+ μ^- \rightarrow W^+ W^-$. The discrepancy of $Z^\prime$ coupling to different types of particles can be shown… ▽ More Heavy neutral gauge boson $Z^\prime$ is proposed in many new physics models. It has rich phenomena at the future muon collider. We study the properties of $Z^\prime$ boson with the process of $μ^+ μ^- \rightarrow q \bar{q}$, $μ^+ μ^- \rightarrow l^+ l^-$, $μ^+ μ^- \rightarrow Z H$ and $μ^+ μ^- \rightarrow W^+ W^-$. The discrepancy of $Z^\prime$ coupling to different types of particles can be shown in the cross section distributions around the resonance peak of various decay modes. Angular distributions of the final quark or lepton in $μ^+ μ^- \rightarrow q \bar{q}/l^+ l^- $ process are sensitive to the parameters such as mass of $Z^\prime$ and the $Z-Z^\prime$ mixing angle. The interaction of new gauge boson coupling to the standard model gauge particles and Higgs boson are also studied through $μ^+ μ^- \rightarrow Z H \rightarrow l^+l^- b \bar{b}$ and $μ^+ μ^- \rightarrow W^+W^- \rightarrow l^+l^- ν_l \barν_l$. The cross section and the final particles' angular distributions with the contribution of $Z^\prime$ boson differ from those processes with only standard model particles. A forward-backward asymmetry defined by the angular distribution is provided to show the potential of searching for new physics at the muon collider. Especially, the beam polarization with certain value can effectively enlarge the forward-backward asymmetry. △ Less

Submitted 28 December, 2023; originally announced December 2023.

Comments: 41 pages, 25 figures

arXiv:2312.15895 [pdf, other]

Semantic-aware SAM for Point-Prompted Instance Segmentation

Authors: Zhaoyang Wei, Pengfei Chen, Xuehui Yu, Guorong Li, Jianbin Jiao, Zhenjun Han

Abstract: Single-point annotation in visual tasks, with the goal of minimizing labelling costs, is becoming increasingly prominent in research. Recently, visual foundation models, such as Segment Anything (SAM), have gained widespread usage due to their robust zero-shot capabilities and exceptional annotation performance. However, SAM's class-agnostic output and high confidence in local segmentation introdu… ▽ More Single-point annotation in visual tasks, with the goal of minimizing labelling costs, is becoming increasingly prominent in research. Recently, visual foundation models, such as Segment Anything (SAM), have gained widespread usage due to their robust zero-shot capabilities and exceptional annotation performance. However, SAM's class-agnostic output and high confidence in local segmentation introduce 'semantic ambiguity', posing a challenge for precise category-specific segmentation. In this paper, we introduce a cost-effective category-specific segmenter using SAM. To tackle this challenge, we have devised a Semantic-Aware Instance Segmentation Network (SAPNet) that integrates Multiple Instance Learning (MIL) with matching capability and SAM with point prompts. SAPNet strategically selects the most representative mask proposals generated by SAM to supervise segmentation, with a specific focus on object category information. Moreover, we introduce the Point Distance Guidance and Box Mining Strategy to mitigate inherent challenges: 'group' and 'local' issues in weakly supervised segmentation. These strategies serve to further enhance the overall segmentation performance. The experimental results on Pascal VOC and COCO demonstrate the promising performance of our proposed SAPNet, emphasizing its semantic matching capabilities and its potential to advance point-prompted instance segmentation. The code will be made publicly available. △ Less

Submitted 26 May, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

Comments: 16 pages, 8 figures, CVPR2024

arXiv:2312.15867 [pdf, other]

Punctuation Matters! Stealthy Backdoor Attack for Language Models

Authors: Xuan Sheng, Zhicheng Li, Zhaoyang Han, Xiangmao Chang, Piji Li

Abstract: Recent studies have pointed out that natural language processing (NLP) models are vulnerable to backdoor attacks. A backdoored model produces normal outputs on the clean samples while performing improperly on the texts with triggers that the adversary injects. However, previous studies on textual backdoor attack pay little attention to stealthiness. Moreover, some attack methods even cause grammat… ▽ More Recent studies have pointed out that natural language processing (NLP) models are vulnerable to backdoor attacks. A backdoored model produces normal outputs on the clean samples while performing improperly on the texts with triggers that the adversary injects. However, previous studies on textual backdoor attack pay little attention to stealthiness. Moreover, some attack methods even cause grammatical issues or change the semantic meaning of the original texts. Therefore, they can easily be detected by humans or defense systems. In this paper, we propose a novel stealthy backdoor attack method against textual models, which is called \textbf{PuncAttack}. It leverages combinations of punctuation marks as the trigger and chooses proper locations strategically to replace them. Through extensive experiments, we demonstrate that the proposed method can effectively compromise multiple models in various tasks. Meanwhile, we conduct automatic evaluation and human inspection, which indicate the proposed method possesses good performance of stealthiness without bringing grammatical issues and altering the meaning of sentences. △ Less

Submitted 25 December, 2023; originally announced December 2023.

Comments: NLPCC 2023

arXiv:2312.15808 [pdf, other]

Quantum-Assisted Online Task Offloading and Resource Allocation in MEC-Enabled Satellite-Aerial-Terrestrial Integrated Networks

Authors: Yu Zhang, Yanmin Gong, Lei Fan, Yu Wang, Zhu Han, Yuanxiong Guo

Abstract: In the era of Internet of Things (IoT), multi-access edge computing (MEC)-enabled satellite-aerial-terrestrial integrated network (SATIN) has emerged as a promising technology to provide massive IoT devices with seamless and reliable communication and computation services. This paper investigates the cooperation of low Earth orbit (LEO) satellites, high altitude platforms (HAPs), and terrestrial b… ▽ More In the era of Internet of Things (IoT), multi-access edge computing (MEC)-enabled satellite-aerial-terrestrial integrated network (SATIN) has emerged as a promising technology to provide massive IoT devices with seamless and reliable communication and computation services. This paper investigates the cooperation of low Earth orbit (LEO) satellites, high altitude platforms (HAPs), and terrestrial base stations (BSs) to provide relaying and computation services for vastly distributed IoT devices. Considering the uncertainty in dynamic SATIN systems, we formulate a stochastic optimization problem to minimize the time-average expected service delay by jointly optimizing resource allocation and task offloading while satisfying the energy constraints. To solve the formulated problem, we first develop a Lyapunov-based online control algorithm to decompose it into multiple one-slot problems. Since each one-slot problem is a large-scale mixed-integer nonlinear program (MINLP) that is intractable for classical computers, we further propose novel hybrid quantum-classical generalized Benders' decomposition (HQCGBD) algorithms to solve the problem efficiently by leveraging quantum advantages in parallel computing. Numerical results validate the effectiveness of the proposed MEC-enabled SATIN schemes. △ Less

Submitted 25 December, 2023; originally announced December 2023.

arXiv:2312.15805 [pdf, other]

Astrocyte Regulated Neuromorphic Central Pattern Generator Control of Legged Robotic Locomotion

Authors: Zhuangyu Han, Abhronil Sengupta

Abstract: Neuromorphic computing systems, where information is transmitted through action potentials in a bio-plausible fashion, is gaining increasing interest due to its promise of low-power event-driven computing. Application of neuromorphic computing in robotic locomotion research have largely focused on Central Pattern Generators (CPGs) for bionics robotic control algorithms - inspired from neural circu… ▽ More Neuromorphic computing systems, where information is transmitted through action potentials in a bio-plausible fashion, is gaining increasing interest due to its promise of low-power event-driven computing. Application of neuromorphic computing in robotic locomotion research have largely focused on Central Pattern Generators (CPGs) for bionics robotic control algorithms - inspired from neural circuits governing the collaboration of the limb muscles in animal movement. Implementation of artificial CPGs on neuromorphic hardware platforms can potentially enable adaptive and energy-efficient edge robotics applications in resource constrained environments. However, underlying rewiring mechanisms in CPG for gait emergence process is not well understood. This work addresses the missing gap in literature pertaining to CPG plasticity and underscores the critical homeostatic functionality of astrocytes - a cellular component in the brain that is believed to play a major role in multiple brain functions. This paper introduces an astrocyte regulated Spiking Neural Network (SNN)-based CPG for learning locomotion gait through Reward-Modulated STDP for quadruped robots, where the astrocytes help build inhibitory connections among the artificial motor neurons in different limbs. The SNN-based CPG is simulated on a multi-object physics simulation platform resulting in the emergence of a trotting gait while running the robot on flat ground. $23.3\times$ computational power savings is observed in comparison to a state-of-the-art reinforcement learning based robot control algorithm. Such a neuroscience-algorithm co-design approach can potentially enable a quantum leap in the functionality of neuromorphic systems incorporating glial cell functionality. △ Less

Submitted 5 January, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

arXiv:2312.15133 [pdf, other]

Learning Continuous Implicit Field with Local Distance Indicator for Arbitrary-Scale Point Cloud Upsampling

Authors: Shujuan Li, Junsheng Zhou, Baorui Ma, Yu-Shen Liu, Zhizhong Han

Abstract: Point cloud upsampling aims to generate dense and uniformly distributed point sets from a sparse point cloud, which plays a critical role in 3D computer vision. Previous methods typically split a sparse point cloud into several local patches, upsample patch points, and merge all upsampled patches. However, these methods often produce holes, outliers or nonuniformity due to the splitting and mergin… ▽ More Point cloud upsampling aims to generate dense and uniformly distributed point sets from a sparse point cloud, which plays a critical role in 3D computer vision. Previous methods typically split a sparse point cloud into several local patches, upsample patch points, and merge all upsampled patches. However, these methods often produce holes, outliers or nonuniformity due to the splitting and merging process which does not maintain consistency among local patches. To address these issues, we propose a novel approach that learns an unsigned distance field guided by local priors for point cloud upsampling. Specifically, we train a local distance indicator (LDI) that predicts the unsigned distance from a query point to a local implicit surface. Utilizing the learned LDI, we learn an unsigned distance field to represent the sparse point cloud with patch consistency. At inference time, we randomly sample queries around the sparse point cloud, and project these query points onto the zero-level set of the learned implicit field to generate a dense point cloud. We justify that the implicit field is naturally continuous, which inherently enables the application of arbitrary-scale upsampling without necessarily retraining for various scales. We conduct comprehensive experiments on both synthetic data and real scans, and report state-of-the-art results under widely used benchmarks. △ Less

Submitted 22 December, 2023; originally announced December 2023.

Comments: Accepted by AAAI 2024. Project page: https://lisj575.github.io/APU-LDI

arXiv:2312.14448 [pdf, other]

Quantum-Assisted Joint Caching and Power Allocation for Integrated Satellite-Terrestrial Networks

Authors: Yu Zhang, Yanmin Gong, Lei Fan, Yu Wang, Zhu Han, Yuanxiong Guo

Abstract: Low earth orbit (LEO) satellite network can complement terrestrial networks for achieving global wireless coverage and improving delay-sensitive Internet services. This paper proposes an integrated satellite-terrestrial network (ISTN) architecture to provide ground users with seamless and reliable content delivery services. For optimal service provisioning in this architecture, we formulate an opt… ▽ More Low earth orbit (LEO) satellite network can complement terrestrial networks for achieving global wireless coverage and improving delay-sensitive Internet services. This paper proposes an integrated satellite-terrestrial network (ISTN) architecture to provide ground users with seamless and reliable content delivery services. For optimal service provisioning in this architecture, we formulate an optimization model to maximize the network throughput by jointly optimizing content delivery policy, cache placement, and transmission power allocation. The resulting optimization model is a large-scale mixed-integer nonlinear program (MINLP) that is intractable for classical computer solvers. Inspired by quantum computing techniques, we propose a hybrid quantum-classical generalized Benders' decomposition (HQCGBD) algorithm to address this challenge. Specifically, we first exploit the generalized Benders' decomposition (GBD) to decompose the problem into a master problem and a subproblem and then leverage the state-of-art quantum annealer to solve the challenging master problem. △ Less

Submitted 22 December, 2023; originally announced December 2023.

arXiv:2312.13654 [pdf, other]

Free Space Optical Integrated Sensing and Communication Based on DCO-OFDM: Performance Metrics and Resource Allocation

Authors: Yunfeng Wen, Fang Yang, Jian Song, Zhu Han

Abstract: As one of the six usage scenarios of the sixth generation (6G) mobile communication system, integrated sensing and communication (ISAC) has garnered considerable attention, and numerous studies have been conducted on radio-frequency (RF)-ISAC. Benefitting from the communication and sensing capabilities of an optical system, free space optical (FSO)-ISAC becomes a potential complement to RF-ISAC. I… ▽ More As one of the six usage scenarios of the sixth generation (6G) mobile communication system, integrated sensing and communication (ISAC) has garnered considerable attention, and numerous studies have been conducted on radio-frequency (RF)-ISAC. Benefitting from the communication and sensing capabilities of an optical system, free space optical (FSO)-ISAC becomes a potential complement to RF-ISAC. In this paper, a direct-current-biased optical orthogonal frequency division multiplexing (DCO-OFDM) scheme is proposed for FSO-ISAC. To derive the spectral efficiency for communication and the Fisher information for sensing as performance metrics, we model the clip** noise of DCO-OFDM as additive colored Gaussian noise to obtain the expression of the signal-to-noise ratio. Based on the derived performance metrics, joint power allocation problems are formulated for both communication-centric and sensing-centric scenarios. In addition, the non-convex joint optimization problems are decomposed into sub-problems for DC bias and subcarriers, which can be solved by block coordinate descent algorithms. Furthermore, numerical simulations demonstrate the proposed algorithms and reveal the trade-off between communication and sensing functionalities of the OFDM-based FSO-ISAC system. △ Less

Submitted 21 December, 2023; originally announced December 2023.

Comments: 13 pages, 8 figures

arXiv:2312.13640 [pdf, other]

Optical Integrated Sensing and Communication: Architectures, Potentials and Challenges

Authors: Yunfeng Wen, Fang Yang, Jian Song, Zhu Han

Abstract: Integrated sensing and communication (ISAC) is viewed as a crucial component of future mobile networks and has gained much interest in both academia and industry. Similar to the emergence of radio-frequency (RF) ISAC, the integration of free space optical communication and optical sensing yields optical ISAC (O-ISAC), which is regarded as a powerful complement to its RF counterpart. In this articl… ▽ More Integrated sensing and communication (ISAC) is viewed as a crucial component of future mobile networks and has gained much interest in both academia and industry. Similar to the emergence of radio-frequency (RF) ISAC, the integration of free space optical communication and optical sensing yields optical ISAC (O-ISAC), which is regarded as a powerful complement to its RF counterpart. In this article, we first introduce the generalized system structure of O-ISAC, and then elaborate on three advantages of O-ISAC, i.e., increasing communication rate, enhancing sensing precision, and reducing interference. Next, waveform design and resource allocation of O-ISAC are discussed based on pulsed waveform, constant-modulus waveform, and multi-carrier waveform. Furthermore, we put forward future trends and challenges of O-ISAC, which are expected to provide some valuable directions for future research. △ Less

Submitted 10 March, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

Comments: 7 pages, 5 figures

arXiv:2312.13612 [pdf, other]

doi 10.1038/s41550-023-02188-2

A seven-Earth-radius helium-burning star inside a 20.5-min detached binary

Authors: Jie Lin, Chengyuan Wu, Heran Xiong, Xiaofeng Wang, Peter Nemeth, Zhanwen Han, Jiangdan Li, Nancy Elias-Rosa, Irene Salmaso, Alexei V. Filippenko, Thomas G. Brink, Yi Yang, Xuefei Chen, Shengyu Yan, Jujia Zhang, Sufen Guo, Yongzhi Cai, Jun Mo, Gaobo Xi, Jialian Liu, **cheng Guo, Qiqi Xia, Danfeng Xiang, Gaici Li, Zhenwei Li , et al. (6 additional authors not shown)

Abstract: Binary evolution theory predicts that the second common envelope (CE) ejection can produce low-mass (0.32-0.36 Msun) subdwarf B (sdB) stars inside ultrashort-orbital-period binary systems, as their helium cores are ignited under nondegenerate conditions. With the orbital decay driven by gravitational-wave (GW) radiation, the minimum orbital periods of detached sdB binaries could be as short as ~20… ▽ More Binary evolution theory predicts that the second common envelope (CE) ejection can produce low-mass (0.32-0.36 Msun) subdwarf B (sdB) stars inside ultrashort-orbital-period binary systems, as their helium cores are ignited under nondegenerate conditions. With the orbital decay driven by gravitational-wave (GW) radiation, the minimum orbital periods of detached sdB binaries could be as short as ~20 minutes. However, only four sdB binaries with orbital periods below an hour have been reported so far, while none of them has an orbital period approaching the above theoretical limit. Here we report the discovery of a 20.5-minute-orbital-period ellipsoidal binary, TMTS J052610.43+593445.1, in which the visible star is being tidally deformed by an invisible carbon-oxygen white dwarf (WD) companion. The visible component is inferred to be an sdB star with a mass of ~0.33 Msun, approaching that of helium-ignition limit, although a He-core WD cannot be completely ruled out. In particular, the radius of this low-mass sdB star is only 0.066 Rsun, about seven Earth radii, possibly representing the most compact nondegenerate star ever known. Such a system provides a key clue to map the binary evolution scheme from the second CE ejection to the formation of AM CVn stars having a helium-star donor, and it will also serve as a crucial verification binary of space-borne GW detectors in the future. △ Less

Submitted 10 February, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

Comments: 24 pages, 11 figures, 1 table, published on Nature Astronomy, URL: https://www.nature.com/articles/s41550-023-02188-2

arXiv:2312.11392 [pdf, other]

SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing

Authors: Zeyinzi Jiang, Chaojie Mao, Yulin Pan, Zhen Han, **gfeng Zhang

Abstract: Image diffusion models have been utilized in various tasks, such as text-to-image generation and controllable image synthesis. Recent research has introduced tuning methods that make subtle adjustments to the original models, yielding promising results in specific adaptations of foundational generative diffusion models. Rather than modifying the main backbone of the diffusion model, we delve into… ▽ More Image diffusion models have been utilized in various tasks, such as text-to-image generation and controllable image synthesis. Recent research has introduced tuning methods that make subtle adjustments to the original models, yielding promising results in specific adaptations of foundational generative diffusion models. Rather than modifying the main backbone of the diffusion model, we delve into the role of skip connection in U-Net and reveal that hierarchical features aggregating long-distance information across encoder and decoder make a significant impact on the content and quality of image generation. Based on the observation, we propose an efficient generative tuning framework, dubbed SCEdit, which integrates and edits Skip Connection using a lightweight tuning module named SC-Tuner. Furthermore, the proposed framework allows for straightforward extension to controllable image synthesis by injecting different conditions with Controllable SC-Tuner, simplifying and unifying the network design for multi-condition inputs. Our SCEdit substantially reduces training parameters, memory usage, and computational expense due to its lightweight tuners, with backward propagation only passing to the decoder blocks. Extensive experiments conducted on text-to-image generation and controllable image synthesis tasks demonstrate the superiority of our method in terms of efficiency and performance. Project page: \url{https://scedit.github.io/} △ Less

Submitted 18 December, 2023; originally announced December 2023.

arXiv:2312.08714 [pdf, other]

Aerial STAR-RIS Empowered MEC: A DRL Approach for Energy Minimization

Authors: Pyae Sone Aung, Loc X. Nguyen, Yan Kyaw Tun, Zhu Han, Choong Seon Hong

Abstract: Multi-access Edge Computing (MEC) addresses computational and battery limitations in devices by allowing them to offload computation tasks. To overcome the difficulties in establishing line-of-sight connections, integrating unmanned aerial vehicles (UAVs) has proven beneficial, offering enhanced data exchange, rapid deployment, and mobility. The utilization of reconfigurable intelligent surfaces (… ▽ More Multi-access Edge Computing (MEC) addresses computational and battery limitations in devices by allowing them to offload computation tasks. To overcome the difficulties in establishing line-of-sight connections, integrating unmanned aerial vehicles (UAVs) has proven beneficial, offering enhanced data exchange, rapid deployment, and mobility. The utilization of reconfigurable intelligent surfaces (RIS), specifically simultaneously transmitting and reflecting RIS (STAR-RIS) technology, further extends coverage capabilities and introduces flexibility in MEC. This study explores the integration of UAV and STAR-RIS to facilitate communication between IoT devices and an MEC server. The formulated problem aims to minimize energy consumption for IoT devices and aerial STAR-RIS by jointly optimizing task offloading, aerial STAR-RIS trajectory, amplitude and phase shift coefficients, and transmit power. Given the non-convexity of the problem and the dynamic environment, solving it directly within a polynomial time frame is challenging. Therefore, deep reinforcement learning (DRL), particularly proximal policy optimization (PPO), is introduced for its sample efficiency and stability. Simulation results illustrate the effectiveness of the proposed system compared to benchmark schemes in the literature. △ Less

Submitted 14 December, 2023; originally announced December 2023.

Showing 101–150 of 1,406 results for author: Han, Z