Showing 101–150 of 1,143 results for author: Xie, L

  1. arXiv:2401.03687 [pdf, other]

    eess.AS cs.SD

    BS-PLCNet: Band-split Packet Loss Concealment Network with Multi-task Learning Framework and Multi-discriminators

    Authors: Zihan Zhang, Jiayao Sun, Xianjun Xia, Chuanzeng Huang, Yijian Xiao, Lei Xie

    Abstract: Packet loss is a common and unavoidable problem in voice over internet phone (VoIP) systems. To deal with the problem, we propose a band-split packet loss concealment network (BS-PLCNet). Specifically, we split the full-band signal into wide-band (0-8kHz) and high-band (8-24kHz). The wide-band signals are processed by a gated convolutional recurrent network (GCRN), while the high-band counterpart…

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: submitted to ICASSP 2024

  2. arXiv:2401.03473 [pdf, ps, other]

    cs.SD cs.AI eess.AS

    ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

    Authors: He Wang, Pengcheng Guo, Yue Li, Ao Zhang, Jiayao Sun, Lei Xie, Wei Chen, Pan Zhou, Hui Bu, Xin Xu, Binbin Zhang, Zhuo Chen, Jian Wu, Longbiao Wang, Eng Siong Chng, Sun Li

    Abstract: To promote speech processing and recognition research in driving scenarios, we build on the success of the Intelligent Cockpit Speech Recognition Challenge (ICSRC) held at ISCSLP 2022 and launch the ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition (ICMC-ASR) Challenge. This challenge collects over 100 hours of multi-channel speech data recorded inside a new energy vehicle and 40 hours…

    Submitted 20 February, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

    Comments: Accepted at ICASSP 2024

  3. MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition

    Authors: He Wang, Pengcheng Guo, Pan Zhou, Lei Xie

    Abstract: While automatic speech recognition (ASR) systems degrade significantly in noisy environments, audio-visual speech recognition (AVSR) systems aim to complement the audio stream with noise-invariant visual cues and improve the system's robustness. However, current studies mainly focus on fusing the well-learned modality features, like the output of modality-specific encoders, without considering the…

    Submitted 8 April, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

    Comments: 5 pages, 3 figures. Accepted at ICASSP 2024

  4. arXiv:2401.03105 [pdf, other]

    cs.CV cs.MM

    Incorporating Visual Experts to Resolve the Information Loss in Multimodal Large Language Models

    Authors: Xin He, Longhui Wei, Lingxi Xie, Qi Tian

    Abstract: Multimodal Large Language Models (MLLMs) are experiencing rapid growth, yielding a plethora of noteworthy contributions in recent months. The prevailing trend involves adopting data-driven methodologies, wherein diverse instruction-following datasets are collected. However, a prevailing challenge persists in these approaches, specifically in relation to the limited visual perception ability, as CL…

    Submitted 13 January, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

  5. arXiv:2401.02617 [pdf, ps, other]

    astro-ph.SR

    How are the abnormally hot chromosphere and corona heated by the solar magnetic fields?

    Authors: K. J. Li, J. C. Xu, W. Feng, J. L. Xie, X. J. Shi, L. H. Deng

    Abstract: The corona is a structure possessed by stars, including the sun. The abnormal heating of the solar corona and chromosphere is one of the greatest mysteries in modern astronomy. While state-of-the-art observations have identified some candidates of magnetic activity events that could be responsible for this abnormal heating, and theoretical studies have proposed various heating modes, a complete ph…

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: accepted for publication in ApJ

  6. arXiv:2401.01685 [pdf]

    eess.IV cs.CV

    Modality Exchange Network for Retinogeniculate Visual Pathway Segmentation

    Authors: Hua Han, Cheng Li, Lei Xie, Yuanjing Feng, Alou Diakite, Shanshan Wang

    Abstract: Accurate segmentation of the retinogeniculate visual pathway (RGVP) aids in the diagnosis and treatment of visual disorders by identifying disruptions or abnormalities within the pathway. However, the complex anatomical structure and connectivity of RGVP make it challenging to achieve accurate segmentation. In this study, we propose a novel Modality Exchange Network (ME-Net) that effectively utili…

    Submitted 3 January, 2024; originally announced January 2024.

  7. arXiv:2401.01654 [pdf, other]

    eess.IV cs.LG

    LESEN: Label-Efficient deep learning for Multi-parametric MRI-based Visual Pathway Segmentation

    Authors: Alou Diakite, Cheng Li, Lei Xie, Yuanjing Feng, Hua Han, Shanshan Wang

    Abstract: Recent research has shown the potential of deep learning in multi-parametric MRI-based visual pathway (VP) segmentation. However, obtaining labeled data for training is laborious and time-consuming. Therefore, it is crucial to develop effective algorithms in situations with limited labeled samples. In this work, we propose a label-efficient deep learning method with self-ensembling (LESEN). LESEN…

    Submitted 3 January, 2024; originally announced January 2024.

  8. arXiv:2401.00475 [pdf, other]

    cs.SD eess.AS

    E-chat: Emotion-sensitive Spoken Dialogue System with Large Language Models

    Authors: Hongfei Xue, Yuhao Liang, Bingshen Mu, Shiliang Zhang, Mengzhe Chen, Qian Chen, Lei Xie

    Abstract: This study focuses on emotion-sensitive spoken dialogue in human-machine speech interaction. With the advancement of Large Language Models (LLMs), dialogue systems can handle multimodal data, including audio. Recent models have enhanced the understanding of complex audio signals through the integration of various audio events. However, they are unable to generate appropriate responses based on emo…

    Submitted 6 January, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

    Comments: 6 pages, 3 figures

  9. arXiv:2312.17495 [pdf]

    cs.LG physics.bio-ph q-bio.BM

    Integrating Chemical Language and Molecular Graph in Multimodal Fused Deep Learning for Drug Property Prediction

    Authors: Xiaohua Lu, Liangxu Xie, Lei Xu, Rongzhi Mao, Shan Chang, Xiaojun Xu

    Abstract: Accurately predicting molecular properties is a challenging but essential task in drug discovery. Recently, many mono-modal deep learning methods have been successfully applied to molecular property prediction. However, the inherent limitation of mono-modal learning arises from relying solely on one modality of molecular representation, which restricts a comprehensive understanding of drug molecul…

    Submitted 29 December, 2023; originally announced December 2023.

  10. arXiv:2312.16850 [pdf, other]

    cs.SD eess.AS

    Accent-VITS: accent transfer for end-to-end TTS

    Authors: Linhan Ma, Yongmao Zhang, Xinfa Zhu, Yi Lei, Ziqian Ning, Pengcheng Zhu, Lei Xie

    Abstract: Accent transfer aims to transfer an accent from a source speaker to synthetic speech in the target speaker's voice. The main challenge is how to effectively disentangle speaker timbre and accent which are entangled in speech. This paper presents a VITS-based end-to-end accent transfer model named Accent-VITS. Based on the main structure of VITS, Accent-VITS makes substantial improvements to enable…

    Submitted 29 December, 2023; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: Accepted by NCMMSC2023

  11. arXiv:2312.15340 [pdf, other]

    eess.SY cs.LG

    Meta-Learning-Based Adaptive Stability Certificates for Dynamical Systems

    Authors: Amit Jena, Dileep Kalathil, Le Xie

    Abstract: This paper addresses the problem of Neural Network (NN) based adaptive stability certification in a dynamical system. The state-of-the-art methods, such as Neural Lyapunov Functions (NLFs), use NN-based formulations to assess the stability of a non-linear dynamical system and compute a Region of Attraction (ROA) in the state space. However, under parametric uncertainty, if the values of system par…

    Submitted 23 December, 2023; originally announced December 2023.

    Comments: This article has been accepted for AAAI-24 (The 38th Annual AAAI Conference on Artificial Intelligence)

  12. arXiv:2312.15067 [pdf, other]

    eess.SY

    Electromagnetic Transient Model of Cryptocurrency Mining Loads for Low-Voltage Ride Through Assessment in Transmission Grids

    Authors: Anindita Samanta, Subir Majumder, Hasan Ibrahim, Prasad Enjeti, Le Xie

    Abstract: In this paper, we developed an Electromagnetic Transient (EMT) model tailored for large cryptocurrency mining loads to understand the cross-interaction of these loads with the electric grid. The load model has been built using Electromagnetic Transients Program (EMTP) software. We have cross-validated the performance of the EMT model of the load with commercial application-specific integrated circ…

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: 5 pages, 10 figures, conference

  13. arXiv:2312.13076 [pdf, other]

    cs.RO eess.SY

    How to Integrate Digital Twin and Virtual Reality in Robotics Systems? Design and Implementation for Providing Robotics Maintenance Services in Data Centers

    Authors: Lin Xie, Hanyi Li

    Abstract: In the context of Industry 4.0, the physical and digital worlds are closely connected, and robots are widely used to achieve system automation. Digital twin solutions have contributed significantly to the growth of Industry 4.0. Combining various technologies is a trend that aims to improve system performance. For example, digital twinning can be combined with virtual reality in automated systems.…

    Submitted 20 December, 2023; originally announced December 2023.

  14. The stellar mass function of quiescent galaxies in 2 < z < 2.5 protoclusters

    Authors: Adit H. Edward, Michael L. Balogh, Yannick M. Bahe, Michael C. Cooper, Nina A. Hatch, Justin Marchioni, Adam Muzzin, Allison Noble, Gregory H. Rudnick, Benedetta Vulcani, Gillian Wilson, Gabriella De Lucia, Ricardo Demarco, Ben Forrest, Michaela Hirschmann, Gianluca Castignani, Pierluigi Cerulo, Rose A. Finn, Guillaume Hewitt, Pascale Jablonka, Tadayuki Kodama, Sophie Maurogordato, Julie Nantais, Lizhi Xie

    Abstract: We present an analysis of the galaxy stellar mass function (SMF) of 14 known protoclusters between $2.0 < z < 2.5$ in the COSMOS field, down to a mass limit of $10^{9.5}$ M$_{\odot}$. We use existing photometric redshifts with a statistical background subtraction, and consider star-forming and quiescent galaxies identified from $(NUV - r)$ and $(r - J)$ colours separately. Our fiducial sample incl…

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 23 pages, 22 figures. Accepted for publication in MNRAS

  15. Angle-Displacement Rigidity Theory with Application to Distributed Network Localization

    Authors: Xu Fang, Xiaolei Li, Lihua Xie

    Abstract: This paper investigates the localization problem of a network in 2-D and 3-D spaces given the positions of anchor nodes in a global frame and inter-node relative measurements in local coordinate frames. It is assumed that the local frames of different nodes have different unknown orientations. First, an angle-displacement rigidity theory is developed, which can be used to localize all the free nod…

    Submitted 19 December, 2023; originally announced December 2023.

  16. Distributed Semi-global Output Feedback Formation Maneuver Control of High-order Multi-agent Systems

    Authors: Xu Fang, Lihua Xie

    Abstract: This paper addresses the formation maneuver control problem of leader-follower multi-agent systems with high-order integrator dynamics. A distributed output feedback formation maneuver controller is proposed to achieve desired maneuvers so that the scale, orientation, translation, and shape of formation can be manipulated continuously, where the followers do not need to know or estimate the time-v…

    Submitted 18 December, 2023; originally announced December 2023.

  17. 3-D Distributed Localization with Mixed Local Relative Measurements

    Authors: Xu Fang, Xiaolei Li, Lihua Xie

    Abstract: This paper studies 3-D distributed network localization using mixed types of local relative measurements. Each node holds a local coordinate frame without a common orientation and can only measure one type of information (relative position, distance, relative bearing, angle, or ratio-of-distance measurements) about its neighboring nodes in its local coordinate frame. A novel rigidity-theory-based…

    Submitted 18 December, 2023; originally announced December 2023.

  18. Distributed Localization in Dynamic Networks via Complex Laplacian

    Authors: Xu Fang, Lihua Xie, Xiaolei Li

    Abstract: Different from most existing distributed localization approaches in static networks where the agents in a network are static, this paper addresses the distributed localization problem in dynamic networks where the positions of the agents are time-varying. Firstly, complex constraints for the positions of the agents are constructed based on local relative position (distance and local bearing) measu…

    Submitted 18 December, 2023; originally announced December 2023.

  19. arXiv:2312.10037 [pdf, ps, other]

    math.RA math.NA

    A system of dual quaternion matrix equations with its applications

    Authors: Lv-Ming Xie, Qing-Wen Wang

    Abstract: We employ the M-P inverses and ranks of quaternion matrices to establish the necessary and sufficient conditions for solving a system of the dual quaternion matrix equations $(AX, XC) = (B, D)$, along with providing an expression for its general solution. Serving as an application, we investigate the solutions to the dual quaternion matrix equations $AX = B$ and $XC=D$, including $η$-Hermitian sol…

    Submitted 13 November, 2023; originally announced December 2023.

  20. arXiv:2312.09760 [pdf, other]

    eess.AS cs.SD

    U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias

    Authors: Ao Zhang, Pan Zhou, Kaixun Huang, Yong Zou, Ming Liu, Lei Xie

    Abstract: Open-vocabulary keyword spotting (KWS), which allows users to customize keywords, has attracted increasingly more interest. However, existing methods based on acoustic models and post-processing train the acoustic model with ASR training criteria to model all phonemes, making the acoustic model under-optimized for the KWS task. To solve this problem, we propose a novel unified two-pass open-vocabu…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted by ASRU2023

  21. arXiv:2312.09747 [pdf, other]

    eess.AS eess.SP

    SELM: Speech Enhancement Using Discrete Tokens and Language Models

    Authors: Ziqian Wang, Xinfa Zhu, Zihan Zhang, YuanJun Lv, Ning Jiang, Guoqing Zhao, Lei Xie

    Abstract: Language models (LMs) have shown superior performances in various speech generation tasks recently, demonstrating their powerful ability for semantic context modeling. Given the intrinsic similarity between speech generation and speech enhancement, harnessing semantic information holds potential advantages for speech enhancement tasks. In light of this, we propose SELM, a novel paradigm for speech…

    Submitted 7 January, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP 2024

  22. arXiv:2312.09746 [pdf, other]

    cs.SD eess.AS

    Automatic channel selection and spatial feature integration for multi-channel speech recognition across various array topologies

    Authors: Bingshen Mu, Pengcheng Guo, Dake Guo, Pan Zhou, Wei Chen, Lei Xie

    Abstract: Automatic Speech Recognition (ASR) has shown remarkable progress, yet it still faces challenges in real-world distant scenarios across various array topologies each with multiple recording devices. The focal point of the CHiME-7 Distant ASR task is to devise a unified system capable of generalizing various array topologies that have multiple recording devices and offering reliable recognition perf…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP 2024

  23. arXiv:2312.06739 [pdf, other]

    cs.CV

    SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models

    Authors: Yuzhou Huang, Liangbin Xie, Xintao Wang, Ziyang Yuan, Xiaodong Cun, Yixiao Ge, Jiantao Zhou, Chao Dong, Rui Huang, Ruimao Zhang, Ying Shan

    Abstract: Current instruction-based editing methods, such as InstructPix2Pix, often fail to produce satisfactory results in complex scenarios due to their dependence on the simple CLIP text encoder in diffusion models. To rectify this, this paper introduces SmartEdit, a novel approach to instruction-based image editing that leverages Multimodal Large Language Models (MLLMs) to enhance their understanding an…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Project page: https://yuzhou914.github.io/SmartEdit/

  24. arXiv:2312.06607 [pdf, other]

    cs.CV

    DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection

    Authors: Haoyang He, Jiangning Zhang, Hongxu Chen, Xuhai Chen, Zhishan Li, Xu Chen, Yabiao Wang, Chengjie Wang, Lei Xie

    Abstract: Reconstruction-based approaches have achieved remarkable outcomes in anomaly detection. The exceptional image reconstruction capabilities of recently popular diffusion models have sparked research efforts to utilize them for enhanced reconstruction of anomalous images. Nonetheless, these methods might face challenges related to the preservation of image categories and pixel-wise structural integri…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI 2024

  25. arXiv:2312.06154 [pdf, other]

    eess.SY

    Predictive Reliability Assessment of Distribution Grids with Residential Distributed Energy Resources

    Authors: Arun Kumar Karngala, Chanan Singh, Le Xie

    Abstract: Distribution system end users are transforming from passive to active participants, marked by the push towards widespread adoption of edge-level Distributed Energy Resources (DERs). This paper addresses the challenges in distribution system planning arising from these dynamic changes. We introduce a bottom-up probabilistic approach that integrates these edge-level DERs into the reliability evaluat…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 10 Pages, 6 figures, Journal

  26. arXiv:2312.04424 [pdf, other]

    cs.CV cs.GR

    Cascade-Zero123: One Image to Highly Consistent 3D with Self-Prompted Nearby Views

    Authors: Yabo Chen, Jiemin Fang, Yuyang Huang, Taoran Yi, Xiaopeng Zhang, Lingxi Xie, Xinggang Wang, Wenrui Dai, Hongkai Xiong, Qi Tian

    Abstract: Synthesizing multi-view 3D from one single image is a significant and challenging task. For this goal, Zero-1-to-3 methods aim to extend a 2D latent diffusion model to the 3D scope. These approaches generate the target-view image with a single-view source image and the camera pose as condition information. However, the one-to-one manner adopted in Zero-1-to-3 incurs challenges for building geometr…

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Project page: https://cascadezero123.github.io/

  27. Distributed Formation Maneuver Control Using Complex Laplacian

    Authors: Xu Fang, Lihua Xie

    Abstract: This paper studies the problem of distributed formation maneuver control of multi-agent systems via complex Laplacian. We will show how to change the translation, scaling, rotation, and also the shape of formation continuously by only tuning the positions of the leaders in both 2-D and 3-D spaces, where the rotation of formation in 3-D space is realized by changing the yaw angle, pitch angle, and…

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 8 pages

  28. arXiv:2312.04131 [pdf, other]

    eess.AS cs.SD

    Joint Training or Not: An Exploration of Pre-trained Speech Models in Audio-Visual Speaker Diarization

    Authors: Huan Zhao, Li Zhang, Yue Li, Yannan Wang, Hongji Wang, Wei Rao, Qing Wang, Lei Xie

    Abstract: The scarcity of labeled audio-visual datasets is a constraint for training superior audio-visual speaker diarization systems. To improve the performance of audio-visual speaker diarization, we leverage pre-trained supervised and self-supervised speech models for audio-visual speaker diarization. Specifically, we adopt supervised (ResNet and ECAPA-TDNN) and self-supervised pre-trained models (WavLM…

    Submitted 7 December, 2023; originally announced December 2023.

  29. arXiv:2312.03016 [pdf, other]

    q-bio.QM cs.CL cs.LG

    Protein Language Model-Powered 3D Ligand Binding Site Prediction from Protein Sequence

    Authors: Shuo Zhang, Lei Xie

    Abstract: Prediction of ligand binding sites of proteins is a fundamental and important task for understanding the function of proteins and screening potential drugs. Most existing methods require experimentally determined protein holo-structures as input. However, such structures can be unavailable on novel or less-studied proteins. To tackle this limitation, we propose LaMPSite, which only takes protein s…

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Accepted by the AI for Science (AI4Science) Workshop and the New Frontiers of AI for Drug Discovery and Development (AI4D3) Workshop at NeurIPS 2023

  30. arXiv:2312.00860 [pdf, other]

    cs.CV

    Segment Any 3D Gaussians

    Authors: Jiazhong Cen, Jiemin Fang, Chen Yang, Lingxi Xie, Xiaopeng Zhang, Wei Shen, Qi Tian

    Abstract: This paper presents SAGA (Segment Any 3D GAussians), a highly efficient 3D promptable segmentation method based on 3D Gaussian Splatting (3D-GS). Given 2D visual prompts as input, SAGA can segment the corresponding 3D target represented by 3D Gaussians within 4 ms. This is achieved by attaching a scale-gated affinity feature to each 3D Gaussian to endow it with a new property towards multi-granularity…

    Submitted 27 May, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Work in progress. Project page: https://jumpat.github.io/SAGA

  31. arXiv:2311.17112 [pdf, other]

    cs.CV

    Parameter Efficient Fine-tuning via Cross Block Orchestration for Segment Anything Model

    Authors: Zelin Peng, Zhengqin Xu, Zhilin Zeng, Lingxi Xie, Qi Tian, Wei Shen

    Abstract: Parameter-efficient fine-tuning (PEFT) is an effective methodology to unleash the potential of large foundation models in novel scenarios with limited training data. In the computer vision community, PEFT has shown effectiveness in image classification, but little research has studied its ability for image segmentation. Fine-tuning segmentation models usually requires a heavier adjustment of parame…

    Submitted 28 March, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Accepted by CVPR2024

  32. arXiv:2311.16037 [pdf, other]

    cs.CV cs.GR

    GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions

    Authors: Jiemin Fang, Junjie Wang, Xiaopeng Zhang, Lingxi Xie, Qi Tian

    Abstract: Recently, impressive results have been achieved in 3D scene editing with text instructions based on a 2D diffusion model. However, current diffusion models primarily generate images by predicting noise in the latent space, and the editing is usually applied to the whole image, which makes it challenging to perform delicate, especially localized, editing for 3D scenes. Inspired by recent 3D Gaussia…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: Project page: https://GaussianEditor.github.io

  33. One-bit Supervision for Image Classification: Problem, Solution, and Beyond

    Authors: Hengtong Hu, Lingxi Xie, Xinyue Huo, Richang Hong, Qi Tian

    Abstract: This paper presents one-bit supervision, a novel setting of learning with fewer labels, for image classification. Instead of training a model using the accurate label of each sample, our setting requires the model to interact with the system by predicting the class label of each sample and learn from the answer whether the guess is correct, which provides one bit (yes or no) of information. An intri…

    Submitted 26 November, 2023; originally announced November 2023.

    Comments: ACM TOMM. arXiv admin note: text overlap with arXiv:2009.06168

  34. arXiv:2311.14861 [pdf, other]

    eess.SY

    Voltage Constrained Heavy Duty Vehicle Electrification: Formulation and Case Study

    Authors: Apurv Shukla, Rayan El Helou, Le Xie

    Abstract: The electrification of heavy-duty vehicles (HDEVs) is a rapidly emerging avenue for decarbonization of energy and transportation sectors. Compared to light duty vehicles, HDEVs exhibit unique travel and charging patterns over long distances. In this paper, we formulate an analytically tractable model that considers the routing decisions for the HDEVs and their charging implications on the power gr…

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: Accepted at CDC 2023

  35. Variation of the stellar initial mass function in semi-analytical models III: testing the cosmic ray regulated integrated galaxy-wide initial mass function

    Authors: Fabio Fontanot, Francesco La Barbera, Gabriella De Lucia, Rachele Cecchi, Lizhi Xie, Michaela Hirschmann, Gustavo Bruzual, Stéphane Charlot, Alexandre Vazdekis

    Abstract: In our previous work, we derive the CR-IGIMF: a new scenario for a variable stellar initial mass function (IMF), which combines numerical results on the role played by cosmic rays in setting the thermal state of star forming gas, with the analytical approach of the integrated galaxy-wide IMF. In this work, we study the implications of this scenario for the properties of local Early-Type galaxies (…

    Submitted 18 April, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: 13 pages, 8 figures, 2 tables, A&A accepted

    Journal ref: A&A 686, A302 (2024)

  36. A Universal Framework for Accurate and Efficient Geometric Deep Learning of Molecular Systems

    Authors: Shuo Zhang, Yang Liu, Lei Xie

    Abstract: Molecular sciences address a wide range of problems involving molecules of different types and sizes and their complexes. Recently, geometric deep learning, especially Graph Neural Networks, has shown promising performance in molecular science applications. However, most existing works often impose targeted inductive biases to a specific molecular system, and are inefficient when applied to macrom…

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: Published in Scientific Reports (DOI: 10.1038/s41598-023-46382-8)

    Journal ref: Scientific Reports 13, 19171 (2023)

  37. arXiv:2311.10806 [pdf, other]

    cs.LG cs.AI

    SEA++: Multi-Graph-based High-Order Sensor Alignment for Multivariate Time-Series Unsupervised Domain Adaptation

    Authors: Yucheng Wang, Yuecong Xu, Jianfei Yang, Min Wu, Xiaoli Li, Lihua Xie, Zhenghua Chen

    Abstract: Unsupervised Domain Adaptation (UDA) methods have been successful in reducing label dependency by minimizing the domain discrepancy between a labeled source domain and an unlabeled target domain. However, these methods face challenges when dealing with Multivariate Time-Series (MTS) data. MTS data typically consist of multiple sensors, each with its own unique distribution. This characteristic mak…

    Submitted 17 November, 2023; originally announced November 2023.

  38. arXiv:2311.10219 [pdf, other]

    cs.SI

    Measuring Moral Dimensions in Social Media with Mformer

    Authors: Tuan Dung Nguyen, Ziyu Chen, Nicholas George Carroll, Alasdair Tran, Colin Klein, Lexing Xie

    Abstract: The ever-growing textual records of contemporary social issues, often discussed online with moral rhetoric, present both an opportunity and a challenge for studying how moral concerns are debated in real life. Moral foundations theory is a taxonomy of intuitions widely used in data-driven analyses of online content, but current computational tools to detect moral foundations suffer from the incomp…

    Submitted 19 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: To be published in ICWSM 2024

  39. arXiv:2311.08814 [pdf, ps, other]

    math.GN

    The quotient spaces of topological groups with a $q$-point

    Authors: Li-Hong Xie, Hai-Hua Lin, Piyu Li

    Abstract: In this paper, we study the uniformities on the double coset spaces in topological groups. As an implication, the quotient spaces of topological groups with a $q$-point are studied. It mainly shows that: (1) Suppose that $G$ is a topological group with a $q$-point and $H$ is a closed subgroup of $G$; then the quotient space $G/H$ is an open and quasi-perfect preimage of a metrizable space; in part…

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 17

    MSC Class: 54A20; 54H11; 54B15; 54C10; 54E15

  40. arXiv:2311.08245 [pdf, other]

    cs.CV

    TENT: Connect Language Models with IoT Sensors for Zero-Shot Activity Recognition

    Authors: Yunjiao Zhou, Jianfei Yang, Han Zou, Lihua Xie

    Abstract: Recent achievements in language models have showcased their extraordinary capabilities in bridging visual information with semantic language understanding. This leads us to a novel question: can language models connect textual semantics with IoT sensory signals to perform recognition tasks, e.g., Human Activity Recognition (HAR)? If so, an intelligent HAR system with human-like cognition can be bu…

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: Preprint manuscript in submission

  41. arXiv:2311.07179 [pdf, other]

    cs.SD eess.AS

    SponTTS: modeling and transferring spontaneous style for TTS

    Authors: Hanzhao Li, Xinfa Zhu, Liumeng Xue, Yang Song, Yunlin Chen, Lei Xie

    Abstract: Spontaneous speaking style exhibits notable differences from other speaking styles due to various spontaneous phenomena (e.g., filled pauses, prolongation) and substantial prosody variation (e.g., diverse pitch and duration variation, occasional non-verbal speech like a smile), posing challenges to modeling and prediction of spontaneous style. Moreover, the limitation of high-quality spontaneous d…

    Submitted 8 January, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: 5 pages, 3 figures, Accepted by ICASSP2024

  42. arXiv:2311.07081 [pdf, other]

    cs.IT eess.SP

    Sensing Mutual Information with Random Signals in Gaussian Channels

    Authors: Lei Xie, Fan Liu, Zhanyuan Xie, Zheng Jiang, Shenghui Song

    Abstract: Sensing performance is typically evaluated by classical metrics, such as Cramer-Rao bound and signal-to-clutter-plus-noise ratio. The recent development of the integrated sensing and communication (ISAC) framework motivated the efforts to unify the metric for sensing and communication, where researchers have proposed to utilize mutual information (MI) to measure the sensing performance with determ…

    Submitted 13 November, 2023; originally announced November 2023.

  43. Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition

    Authors: Qijie Shao, Pengcheng Guo, Jinghao Yan, Pengfei Hu, Lei Xie

    Abstract: Accents, as variations from standard pronunciation, pose significant challenges for speech recognition systems. Although joint automatic speech recognition (ASR) and accent recognition (AR) training has been proven effective in handling multi-accent scenarios, current multi-task ASR-AR approaches overlook the granularity differences between tasks. Fine-grained units capture pronunciation-related a…

    Submitted 17 November, 2023; v1 submitted 12 November, 2023; originally announced November 2023.

    Comments: Accepted by IEEE Transactions on Audio, Speech and Language Processing (TASLP)

  44. arXiv:2311.02817  [pdf, other]

    cs.RO

    Safe-VLN: Collision Avoidance for Vision-and-Language Navigation of Autonomous Robots Operating in Continuous Environments

    Authors: Lu Yue, Dongliang Zhou, Liang Xie, Feitian Zhang, Ye Yan, Erwei Yin

    Abstract: The task of vision-and-language navigation in continuous environments (VLN-CE) aims at training an autonomous agent to perform low-level actions to navigate through 3D continuous surroundings using visual observations and language instructions. The significant potential of VLN-CE for mobile robots has been demonstrated across a large number of studies. However, most existing works in VLN-CE focus…

    Submitted 11 April, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

  45. arXiv:2311.02612  [pdf, other]

    cs.CV

    GPT-4V-AD: Exploring Grounding Potential of VQA-oriented GPT-4V for Zero-shot Anomaly Detection

    Authors: Jiangning Zhang, Haoyang He, Xuhai Chen, Zhucun Xue, Yabiao Wang, Chengjie Wang, Lei Xie, Yong Liu

    Abstract: Large Multimodal Model (LMM) GPT-4V(ision) endows GPT-4 with visual grounding capabilities, making it possible to handle certain tasks through the Visual Question Answering (VQA) paradigm. This paper explores the potential of VQA-oriented GPT-4V in the recently popular visual Anomaly Detection (AD) and is the first to conduct qualitative and quantitative evaluations on the popular MVTec AD and Vis…

    Submitted 16 April, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

  46. arXiv:2311.02250  [pdf, other]

    math.OC eess.SY

    Efficient Scenario Generation for Chance-constrained Economic Dispatch Considering Ambient Wind Conditions

    Authors: Qian Zhang, Apurv Shukla, Le Xie

    Abstract: Scenario generation is an effective data-driven method for solving chance-constrained optimization while ensuring desired risk guarantees with a finite number of samples. Crucial challenges in deploying this technique in the real world arise due to the absence of appropriate risk-tuning models tailored for the desired application. In this paper, we focus on designing efficient scenario generation…

    Submitted 2 January, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: 12 pages

  47. arXiv:2311.00345  [pdf, ps, other]

    math.GN

    Some characterizations of $ω$-balanced topological groups with a $q$-point

    Authors: Deng-Bin Chen, Hai-Hua Lin, Li-Hong Xie

    Abstract: In this paper, we study some characterizations of $q$-spaces, strict $q$-spaces and strong $q$-spaces under $ω$-balanced topological groups as follows: (1) A topological group $G$ is $ω$-balanced and a $q$-space if and only if for each open neighborhood $O$ of the identity in $G$, there is a countably compact invariant subgroup $H$ which is of countable character in $G$, such that…

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 11

  48. arXiv:2311.00263  [pdf, other]

    eess.SY

    The bottleneck and ceiling effects in quantized tracking control of heterogeneous multi-agent systems under DoS attacks

    Authors: Shuai Feng, Maopeng Ran, Baoyong Zhang, Lihua Xie, Shengyuan Xu

    Abstract: In this paper, we investigate tracking control of heterogeneous multi-agent systems under Denial-of-Service (DoS) attacks and state quantization. Dynamic quantized mechanisms are designed for inter-follower communication and leader-follower communication. Zooming-in and out factors, and data rates of both mechanisms for preventing quantizer saturation are provided. Our results show that by tuning…

    Submitted 31 October, 2023; originally announced November 2023.

  49. arXiv:2310.19787  [pdf]

    stat.ME stat.AP stat.ML

    $e^{\text{RPCA}}$: Robust Principal Component Analysis for Exponential Family Distributions

    Authors: Xiaojun Zheng, Simon Mak, Liyan Xie, Yao Xie

    Abstract: Robust Principal Component Analysis (RPCA) is a widely used method for recovering low-rank structure from data matrices corrupted by significant and sparse outliers. These corruptions may arise from occlusions, malicious tampering, or other causes for anomalies, and the joint identification of such corruptions with low-rank background is critical for process monitoring and diagnosis. However, exis…

    Submitted 30 October, 2023; originally announced October 2023.

  50. Integrated Relative-Measurement-Based Network Localization and Formation Maneuver Control (Extended Version)

    Authors: Xu Fang, Lihua Xie, Xiaolei Li

    Abstract: This paper studies the problem of integrated distributed network localization and formation maneuver control. We develop an integrated relative-measurement-based scheme, which only uses relative positions, distances, bearings, angles, ratio-of-distances, or their combination to achieve distributed network localization and formation maneuver control in $\mathbb{R}^d (d \ge 2)$. By exploring the loc…

    Submitted 13 November, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: 12 pages; 7 figures, title corrected, DOI added