Search | arXiv e-print repository

arXiv:2406.19640 [pdf, other]

Efficient Event Stream Super-Resolution with Recursive Multi-Branch Fusion

Authors: Quanmin Liang, Zhilin Huang, Xiawu Zheng, Feidiao Yang, Jun Peng, Kai Huang, Yonghong Tian

Abstract: Current Event Stream Super-Resolution (ESR) methods overlook the redundant and complementary information present in positive and negative events within the event stream, employing a direct mixing approach for super-resolution, which may lead to detail loss and inefficiency. To address these issues, we propose an efficient Recursive Multi-Branch Information Fusion Network (RMFNet) that separates po… ▽ More Current Event Stream Super-Resolution (ESR) methods overlook the redundant and complementary information present in positive and negative events within the event stream, employing a direct mixing approach for super-resolution, which may lead to detail loss and inefficiency. To address these issues, we propose an efficient Recursive Multi-Branch Information Fusion Network (RMFNet) that separates positive and negative events for complementary information extraction, followed by mutual supplementation and refinement. Particularly, we introduce Feature Fusion Modules (FFM) and Feature Exchange Modules (FEM). FFM is designed for the fusion of contextual information within neighboring event streams, leveraging the coupling relationship between positive and negative events to alleviate the misleading of noises in the respective branches. FEM efficiently promotes the fusion and exchange of information between positive and negative branches, enabling superior local information enhancement and global information complementation. Experimental results demonstrate that our approach achieves over 17% and 31% improvement on synthetic and real datasets, accompanied by a 2.3X acceleration. Furthermore, we evaluate our method on two downstream event-driven applications, \emph{i.e.}, object recognition and video reconstruction, achieving remarkable results that outperform existing methods. Our code and Supplementary Material are available at https://github.com/Lqm26/RMFNet. △ Less

Submitted 28 June, 2024; originally announced June 2024.

Journal ref: International Joint Conference on Artificial Intelligence 2024

arXiv:2406.19083 [pdf, other]

Axion Detection Experiments Meet the Majoron

Authors: Qiuyue Liang, Xavier Ponce Díaz, Tsutomu T. Yanagida

Abstract: The majoron is a well-motivated light (pseudo-Nambu-Goldstone) boson associated with the spontaneous breaking of a global lepton-number symmetry. In this {\it letter}, we relate the spontaneous breaking scale and its soft-breaking mass by requiring that the majoron is the main component of the dark matter. An electromagnetic-anomalous coupling can be induced by minimally modifying the original maj… ▽ More The majoron is a well-motivated light (pseudo-Nambu-Goldstone) boson associated with the spontaneous breaking of a global lepton-number symmetry. In this {\it letter}, we relate the spontaneous breaking scale and its soft-breaking mass by requiring that the majoron is the main component of the dark matter. An electromagnetic-anomalous coupling can be induced by minimally modifying the original majoron model, surprisingly, predicting a parameter region that largely overlaps with the QCD-axion dark matter band. Thus, we expect that axion search experiments meet the majoron. △ Less

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2406.08152 [pdf, other]

CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer

Authors: Hualian Sheng, Sijia Cai, Na Zhao, Bing Deng, Qiao Liang, Min-Jian Zhao, Jie** Ye

Abstract: The field of 3D object detection from point clouds is rapidly advancing in computer vision, aiming to accurately and efficiently detect and localize objects in three-dimensional space. Current 3D detectors commonly fall short in terms of flexibility and scalability, with ample room for advancements in performance. In this paper, our objective is to address these limitations by introducing two fram… ▽ More The field of 3D object detection from point clouds is rapidly advancing in computer vision, aiming to accurately and efficiently detect and localize objects in three-dimensional space. Current 3D detectors commonly fall short in terms of flexibility and scalability, with ample room for advancements in performance. In this paper, our objective is to address these limitations by introducing two frameworks for 3D object detection with minimal hand-crafted design. Firstly, we propose CT3D, which sequentially performs raw-point-based embedding, a standard Transformer encoder, and a channel-wise decoder for point features within each proposal. Secondly, we present an enhanced network called CT3D++, which incorporates geometric and semantic fusion-based embedding to extract more valuable and comprehensive proposal-aware information. Additionally, CT3D ++ utilizes a point-to-key bidirectional encoder for more efficient feature encoding with reduced computational cost. By replacing the corresponding components of CT3D with these novel modules, CT3D++ achieves state-of-the-art performance on both the KITTI dataset and the large-scale Way\-mo Open Dataset. The source code for our frameworks will be made accessible at https://github.com/hlsheng1/CT3D-plusplus. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: 19 pages, 8 figures

arXiv:2406.03609 [pdf]

Modulated Ringdown Comb Interferometry for next-generation high complexity trace gas sensing

Authors: Qizhong Liang, Apoorva Bisht, Andrew Scheck, Peter G. Schunemann, Jun Ye

Abstract: Gas samples relevant to health and environment typically contain a plethora of molecular species that span a huge concentration dynamic range. High-concentration molecules impose a strong absorption background that hinders robust identification of low-concentration species. While mid-infrared frequency comb spectroscopy with high-finesse cavity enhancement has realized many of the most sensitive m… ▽ More Gas samples relevant to health and environment typically contain a plethora of molecular species that span a huge concentration dynamic range. High-concentration molecules impose a strong absorption background that hinders robust identification of low-concentration species. While mid-infrared frequency comb spectroscopy with high-finesse cavity enhancement has realized many of the most sensitive multi-species trace gas detection to date, its robust performance requires gas samples to contain only weak absorption features to avoid dispersing cavity resonances from the comb line frequencies. Here we introduce a new technique that is free from this restriction, thus enabling the development of next-generation multi-species trace gas sensing with broad applicability to complex and dynamic molecular compositions. The principle of Modulated Ringdown Comb Interferometry is to resolve ringdown dynamics carried by massively parallel comb lines transmitted through a length-modulated cavity. This method leverages both periodicity of the field dynamics and Doppler frequency shifts introduced from a Michelson interferometer. Scalable enhancement of both spectral coverage and cavity finesse is enabled with dispersion immune and high-efficiency data collection. Built upon this platform, we realize in the mid-infrared a product of finesse and spectral coverage that is orders of magnitude better than all prior experiments. We demonstrate the power of this technique by measuring highly dispersive exhaled human breath samples over a vastly expanded spectral coverage of 1,010 cm-1 and with cavity finesse of 23,000. This allows for the first time simultaneous quantification of 20 distinct molecular species at > 1 part-per-trillion sensitivity with their concentrations varying by 7 orders of magnitude. △ Less

Submitted 5 June, 2024; originally announced June 2024.

arXiv:2405.21004 [pdf, other]

MunchSonic: Tracking Fine-grained Dietary Actions through Active Acoustic Sensing on Eyeglasses

Authors: Saif Mahmud, Devansh Agarwal, Ashwin Ajit, Qikang Liang, Thalia Viranda, Francois Guimbretiere, Cheng Zhang

Abstract: We introduce MunchSonic, an AI-powered active acoustic sensing system integrated into eyeglasses, designed to track fine-grained dietary actions like hand-to-mouth movements for food intake, chewing, and drinking. MunchSonic emits inaudible ultrasonic waves from a commodity eyeglass frame. The reflected signals contain rich information about the position and movements of various body parts, includ… ▽ More We introduce MunchSonic, an AI-powered active acoustic sensing system integrated into eyeglasses, designed to track fine-grained dietary actions like hand-to-mouth movements for food intake, chewing, and drinking. MunchSonic emits inaudible ultrasonic waves from a commodity eyeglass frame. The reflected signals contain rich information about the position and movements of various body parts, including the mouth, jaw, arms, and hands, all of which are involved in eating activities. These signals are then processed by a custom deep-learning pipeline to classify six actions: food intake, chewing, drinking, talking, face-hand touching, and other activities (null). In an unconstrained user study with 12 participants, MunchSonic achieves a 93.5% macro F1-score in a user-independent evaluation with a 2-second time resolution, demonstrating its effectiveness. Additionally, MunchSonic accurately tracks eating episodes and the frequency of food intake within those episodes. △ Less

Submitted 31 May, 2024; originally announced May 2024.

Comments: 8 pages, 7 figures

arXiv:2405.13055 [pdf, other]

Large Language Models for Medicine: A Survey

Authors: Yanxin Zheng, Wensheng Gan, Zefeng Chen, Zhenlian Qi, Qian Liang, Philip S. Yu

Abstract: To address challenges in the digital economy's landscape of digital intelligence, large language models (LLMs) have been developed. Improvements in computational power and available resources have significantly advanced LLMs, allowing their integration into diverse domains for human life. Medical LLMs are essential application tools with potential across various medical scenarios. In this paper, w… ▽ More To address challenges in the digital economy's landscape of digital intelligence, large language models (LLMs) have been developed. Improvements in computational power and available resources have significantly advanced LLMs, allowing their integration into diverse domains for human life. Medical LLMs are essential application tools with potential across various medical scenarios. In this paper, we review LLM developments, focusing on the requirements and applications of medical LLMs. We provide a concise overview of existing models, aiming to explore advanced research directions and benefit researchers for future medical applications. We emphasize the advantages of medical LLMs in applications, as well as the challenges encountered during their development. Finally, we suggest directions for technical integration to mitigate challenges and potential research directions for the future of medical LLMs, aiming to meet the demands of the medical field better. △ Less

Submitted 19 May, 2024; originally announced May 2024.

Comments: Preprint. 5 figures,5 tables

arXiv:2405.11755 [pdf, other]

Testing Gravity with Frequency-Dependent Overlap Reduction Function in Pulsar Timing Array

Authors: Qiuyue Liang, Ippei Obata, Misao Sasaki

Abstract: The positive evidence of a nano-hertz gravitational wave background recently found by several pulsar timing array (PTA) collaborations opened up a window to test modified gravity theories in a unique frequency band in parallel to other gravitational wave detection experiments. In particular, the overlap reduction function (ORF) in PTA observation is sensitive to the phase velocity of gravitational… ▽ More The positive evidence of a nano-hertz gravitational wave background recently found by several pulsar timing array (PTA) collaborations opened up a window to test modified gravity theories in a unique frequency band in parallel to other gravitational wave detection experiments. In particular, the overlap reduction function (ORF) in PTA observation is sensitive to the phase velocity of gravitational waves. In this work, we provide analytical expressions for the coefficients of the multipole moments in the ORF, and utilize these analytical results to study constraints on the phase velocity from the frequency dependent overlap reduction function obtained from the Chinese PTA (CPTA) data. While the data contain large error bars yet, interesting constraints are found in the frequency-dependent ORF in the case of subluminal phase velocity. This makes us expect that the nano-hertz band gravitational wave background will become one of the important arenas for exploring modified gravity theories. △ Less

Submitted 19 May, 2024; originally announced May 2024.

Comments: 16 pages, 3figures

arXiv:2405.10037 [pdf, other]

Bilateral Event Mining and Complementary for Event Stream Super-Resolution

Authors: Zhilin Huang, Quanmin Liang, Yijie Yu, Chujun Qin, Xiawu Zheng, Kai Huang, Zikun Zhou, Wenming Yang

Abstract: Event Stream Super-Resolution (ESR) aims to address the challenge of insufficient spatial resolution in event streams, which holds great significance for the application of event cameras in complex scenarios. Previous works for ESR often process positive and negative events in a mixed paradigm. This paradigm limits their ability to effectively model the unique characteristics of each event and mut… ▽ More Event Stream Super-Resolution (ESR) aims to address the challenge of insufficient spatial resolution in event streams, which holds great significance for the application of event cameras in complex scenarios. Previous works for ESR often process positive and negative events in a mixed paradigm. This paradigm limits their ability to effectively model the unique characteristics of each event and mutually refine each other by considering their correlations. In this paper, we propose a bilateral event mining and complementary network (BMCNet) to fully leverage the potential of each event and capture the shared information to complement each other simultaneously. Specifically, we resort to a two-stream network to accomplish comprehensive mining of each type of events individually. To facilitate the exchange of information between two streams, we propose a bilateral information exchange (BIE) module. This module is layer-wisely embedded between two streams, enabling the effective propagation of hierarchical global information while alleviating the impact of invalid information brought by inherent characteristics of events. The experimental results demonstrate that our approach outperforms the previous state-of-the-art methods in ESR, achieving performance improvements of over 11\% on both real and synthetic datasets. Moreover, our method significantly enhances the performance of event-based downstream tasks such as object recognition and video reconstruction. Our code is available at https://github.com/Lqm26/BMCNet-ESR. △ Less

Submitted 16 May, 2024; originally announced May 2024.

Comments: Accepted to CVPR2024

arXiv:2405.09883 [pdf, other]

RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception

Authors: Xiaosu Zhu, Hualian Sheng, Sijia Cai, Bing Deng, Shaopeng Yang, Qiao Liang, Ken Chen, Lianli Gao, **gkuan Song, Jie** Ye

Abstract: We introduce RoScenes, the largest multi-view roadside perception dataset, which aims to shed light on the development of vision-centric Bird's Eye View (BEV) approaches for more challenging traffic scenes. The highlights of RoScenes include significantly large perception area, full scene coverage and crowded traffic. More specifically, our dataset achieves surprising 21.13M 3D annotations within… ▽ More We introduce RoScenes, the largest multi-view roadside perception dataset, which aims to shed light on the development of vision-centric Bird's Eye View (BEV) approaches for more challenging traffic scenes. The highlights of RoScenes include significantly large perception area, full scene coverage and crowded traffic. More specifically, our dataset achieves surprising 21.13M 3D annotations within 64,000 $m^2$. To relieve the expensive costs of roadside 3D labeling, we present a novel BEV-to-3D joint annotation pipeline to efficiently collect such a large volume of data. After that, we organize a comprehensive study for current BEV methods on RoScenes in terms of effectiveness and efficiency. Tested methods suffer from the vast perception area and variation of sensor layout across scenes, resulting in performance levels falling below expectations. To this end, we propose RoBEV that incorporates feature-guided position embedding for effective 2D-3D feature assignment. With its help, our method outperforms state-of-the-art by a large margin without extra computational overhead on validation set. Our dataset and devkit will be made available at https://github.com/xiaosu-zhu/RoScenes. △ Less

Submitted 19 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

Comments: Technical report. 32 pages, 21 figures, 13 tables. https://github.com/xiaosu-zhu/RoScenes

arXiv:2405.07702 [pdf, other]

FORESEE: Multimodal and Multi-view Representation Learning for Robust Prediction of Cancer Survival

Authors: Liangrui Pan, Yijun Peng, Yan Li, Yiyi Liang, Liwen Xu, Qingchun Liang, Shaoliang Peng

Abstract: Integrating the different data modalities of cancer patients can significantly improve the predictive performance of patient survival. However, most existing methods ignore the simultaneous utilization of rich semantic features at different scales in pathology images. When collecting multimodal data and extracting features, there is a likelihood of encountering intra-modality missing data, introdu… ▽ More Integrating the different data modalities of cancer patients can significantly improve the predictive performance of patient survival. However, most existing methods ignore the simultaneous utilization of rich semantic features at different scales in pathology images. When collecting multimodal data and extracting features, there is a likelihood of encountering intra-modality missing data, introducing noise into the multimodal data. To address these challenges, this paper proposes a new end-to-end framework, FORESEE, for robustly predicting patient survival by mining multimodal information. Specifically, the cross-fusion transformer effectively utilizes features at the cellular level, tissue level, and tumor heterogeneity level to correlate prognosis through a cross-scale feature cross-fusion method. This enhances the ability of pathological image feature representation. Secondly, the hybrid attention encoder (HAE) uses the denoising contextual attention module to obtain the contextual relationship features and local detail features of the molecular data. HAE's channel attention module obtains global features of molecular data. Furthermore, to address the issue of missing information within modalities, we propose an asymmetrically masked triplet masked autoencoder to reconstruct lost information within modalities. Extensive experiments demonstrate the superiority of our method over state-of-the-art methods on four benchmark datasets in both complete and missing settings. △ Less

Submitted 13 May, 2024; originally announced May 2024.

arXiv:2404.13924 [pdf, other]

ActSonic: Recognizing Everyday Activities from Inaudible Acoustic Waves Around the Body

Authors: Saif Mahmud, Vineet Parikh, Qikang Liang, Ke Li, Ruidong Zhang, Ashwin Ajit, Vipin Gunda, Devansh Agarwal, François Guimbretière, Cheng Zhang

Abstract: We present ActSonic, an intelligent, low-power active acoustic sensing system integrated into eyeglasses that can recognize 27 different everyday activities (e.g., eating, drinking, toothbrushing) from inaudible acoustic waves around the body with a time resolution of one second. It only needs a pair of miniature speakers and microphones mounted on each hinge of eyeglasses to emit ultrasonic waves… ▽ More We present ActSonic, an intelligent, low-power active acoustic sensing system integrated into eyeglasses that can recognize 27 different everyday activities (e.g., eating, drinking, toothbrushing) from inaudible acoustic waves around the body with a time resolution of one second. It only needs a pair of miniature speakers and microphones mounted on each hinge of eyeglasses to emit ultrasonic waves to create an acoustic aura around the body. Based on the position and motion of various body parts, the acoustic signals are reflected with unique patterns captured by the microphone and analyzed by a customized self-supervised deep learning framework to infer the performed activities. ActSonic was deployed in a user study with 19 participants across 19 households to evaluate its efficacy. Without requiring any training data from a new user (leave-one-participant-out evaluation), ActSonic was able to detect 27 activities, achieving an average F1-score of 86.6% in fully unconstrained scenarios and 93.4% in prompted settings at participants' homes. △ Less

Submitted 8 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

Comments: 29 pages, 11 figures

arXiv:2404.11082 [pdf, other]

Urban traffic resilience control -- An ecological resilience perspective

Authors: Shengling Gao, Zhikun She, Quanyi Liang, Nan Zheng, Daqing Li

Abstract: Urban traffic resilience has gained increased attention, with most studies adopting an engineering perspective that assumes a single optimal equilibrium and prioritizes local recovery. On the other hand, systems may possess multiple metastable states, and ecological resilience is the ability to switch between these states according to perturbations. Control strategies from these two resilience per… ▽ More Urban traffic resilience has gained increased attention, with most studies adopting an engineering perspective that assumes a single optimal equilibrium and prioritizes local recovery. On the other hand, systems may possess multiple metastable states, and ecological resilience is the ability to switch between these states according to perturbations. Control strategies from these two resilience perspectives yield distinct outcomes. In fact, ecological resilience oriented control has rarely been viewed in urban traffic, despite the fact that traffic system is a complex system in highly uncertain environment with possible multiple metastable states. This absence highlights the necessity for urban traffic ecological resilience definition. To bridge this gap, we defines urban traffic ecological resilience as the ability to absorb uncertain perturbations by shifting to alternative states. The goal is to generate a system with greater adaptability, without necessarily returning to the original equilibrium. Our control framework comprises three aspects: portraying the recoverable scopes; designing alternative steady states; and controlling system to shift to alternative steady states for adapting large disturbances. Among them, the recoverable scopes are portrayed by attraction region; the alternative steady states are set close to the optimal state and outside the attraction region of the original equilibrium; the controller needs to ensure the local stability of the alternative steady states, without changing the trajectories inside the attraction region of the original equilibrium. Comparisons with classical engineering resilience oriented urban traffic resilience control schemes show that, proposed ecological resilience oriented control schemes can generate greater resilience. These results will contribute to the fundamental theory of future resilient intelligent transportation system. △ Less

Submitted 17 April, 2024; originally announced April 2024.

arXiv:2403.18331 [pdf, other]

Neighbor-Environment Observer: An Intelligent Agent for Immersive Working Companionship

Authors: Zhe Sun, Qixuan Liang, Meng Wang, Zhenliang Zhang

Abstract: Human-computer symbiosis is a crucial direction for the development of artificial intelligence. As intelligent systems become increasingly prevalent in our work and personal lives, it is important to develop strategies to support users across physical and virtual environments. While technological advances in personal digital devices, such as personal computers and virtual reality devices, can prov… ▽ More Human-computer symbiosis is a crucial direction for the development of artificial intelligence. As intelligent systems become increasingly prevalent in our work and personal lives, it is important to develop strategies to support users across physical and virtual environments. While technological advances in personal digital devices, such as personal computers and virtual reality devices, can provide immersive experiences, they can also disrupt users' awareness of their surroundings and enhance the frustration caused by disturbances. In this paper, we propose a joint observation strategy for artificial agents to support users across virtual and physical environments. We introduce a prototype system, neighbor-environment observer (NEO), that utilizes non-invasive sensors to assist users in dealing with disruptions to their immersive experience. System experiments evaluate NEO from different perspectives and demonstrate the effectiveness of the joint observation strategy. A user study is conducted to evaluate its usability. The results show that NEO could lessen users' workload with the learned user preference. We suggest that the proposed strategy can be applied to various smart home scenarios. △ Less

Submitted 27 March, 2024; originally announced March 2024.

Comments: UIST 2023

arXiv:2403.16112 [pdf, other]

Opportunities and challenges in the application of large artificial intelligence models in radiology

Authors: Liangrui Pan, Zhenyu Zhao, Ying Lu, Kewei Tang, Liyong Fu, Qingchun Liang, Shaoliang Peng

Abstract: Influenced by ChatGPT, artificial intelligence (AI) large models have witnessed a global upsurge in large model research and development. As people enjoy the convenience by this AI large model, more and more large models in subdivided fields are gradually being proposed, especially large models in radiology imaging field. This article first introduces the development history of large models, techn… ▽ More Influenced by ChatGPT, artificial intelligence (AI) large models have witnessed a global upsurge in large model research and development. As people enjoy the convenience by this AI large model, more and more large models in subdivided fields are gradually being proposed, especially large models in radiology imaging field. This article first introduces the development history of large models, technical details, workflow, working principles of multimodal large models and working principles of video generation large models. Secondly, we summarize the latest research progress of AI large models in radiology education, radiology report generation, applications of unimodal and multimodal radiology. Finally, this paper also summarizes some of the challenges of large AI models in radiology, with the aim of better promoting the rapid revolution in the field of radiography. △ Less

Submitted 24 March, 2024; originally announced March 2024.

arXiv:2403.11465 [pdf]

Ultra-Long Homochiral Graphene Nanoribbons Grown Within h-BN Stacks for High-Performance Electronics

Authors: Bosai Lyu, Jiajun Chen, Sen Wang, Shuo Lou, Peiyue Shen, **gxu Xie, Lu Qiu, Izaac Mitchell, Can Li, Cheng Hu, Xianliang Zhou, Kenji Watanabe, Takashi Taniguchi, Xiaoqun Wang, **feng Jia, Qi Liang, Guorui Chen, Tingxin Li, Shiyong Wang, Wengen Ouyang, Oded Hod, Feng Ding, Michael Urbakh, Zhiwen Shi

Abstract: Van der Waals encapsulation of two-dimensional materials within hexagonal boron nitride (h-BN) stacks has proven to be a promising way to create ultrahigh-performance electronic devices. However, contemporary approaches for achieving van der Waals encapsulation, which involve artificial layer stacking using mechanical transfer techniques, are difficult to control, prone to contamination, and unsca… ▽ More Van der Waals encapsulation of two-dimensional materials within hexagonal boron nitride (h-BN) stacks has proven to be a promising way to create ultrahigh-performance electronic devices. However, contemporary approaches for achieving van der Waals encapsulation, which involve artificial layer stacking using mechanical transfer techniques, are difficult to control, prone to contamination, and unscalable. Here, we report on the transfer-free direct growth of high-quality graphene nanoribbons (GNRs) within h-BN stacks. The as-grown embedded GNRs exhibit highly desirable features being ultralong (up to 0.25 mm), ultranarrow ( < 5 nm), and homochiral with zigzag edges. Our atomistic simulations reveal that the mechanism underlying the embedded growth involves ultralow GNR friction when sliding between AA'-stacked h-BN layers. Using the grown structures, we demonstrate the transfer-free fabrication of embedded GNR field-effect devices that exhibit excellent performance at room temperature with mobilities of up to 4,600 $cm^{2} V^{-1} s^{-1}$ and on-off ratios of up to $10^{6}$. This paves the way to the bottom-up fabrication of high-performance electronic devices based on embedded layered materials. △ Less

Submitted 18 March, 2024; originally announced March 2024.

arXiv:2403.09290 [pdf, other]

SELECTOR: Heterogeneous graph network with convolutional masked autoencoder for multimodal robust prediction of cancer survival

Authors: Liangrui Pan, Yijun Peng, Yan Li, Xiang Wang, Wenjuan Liu, Liwen Xu, Qingchun Liang, Shaoliang Peng

Abstract: Accurately predicting the survival rate of cancer patients is crucial for aiding clinicians in planning appropriate treatment, reducing cancer-related medical expenses, and significantly enhancing patients' quality of life. Multimodal prediction of cancer patient survival offers a more comprehensive and precise approach. However, existing methods still grapple with challenges related to missing mu… ▽ More Accurately predicting the survival rate of cancer patients is crucial for aiding clinicians in planning appropriate treatment, reducing cancer-related medical expenses, and significantly enhancing patients' quality of life. Multimodal prediction of cancer patient survival offers a more comprehensive and precise approach. However, existing methods still grapple with challenges related to missing multimodal data and information interaction within modalities. This paper introduces SELECTOR, a heterogeneous graph-aware network based on convolutional mask encoders for robust multimodal prediction of cancer patient survival. SELECTOR comprises feature edge reconstruction, convolutional mask encoder, feature cross-fusion, and multimodal survival prediction modules. Initially, we construct a multimodal heterogeneous graph and employ the meta-path method for feature edge reconstruction, ensuring comprehensive incorporation of feature information from graph edges and effective embedding of nodes. To mitigate the impact of missing features within the modality on prediction accuracy, we devised a convolutional masked autoencoder (CMAE) to process the heterogeneous graph post-feature reconstruction. Subsequently, the feature cross-fusion module facilitates communication between modalities, ensuring that output features encompass all features of the modality and relevant information from other modalities. Extensive experiments and analysis on six cancer datasets from TCGA demonstrate that our method significantly outperforms state-of-the-art methods in both modality-missing and intra-modality information-confirmed cases. Our codes are made available at https://github.com/panliangrui/Selector. △ Less

Submitted 14 March, 2024; originally announced March 2024.

Comments: Accepted on Computers in Biology and Medicine

arXiv:2403.06064 [pdf, other]

L^2GC:Lorentzian Linear Graph Convolutional Networks for Node Classification

Authors: Qiuyu Liang, Weihua Wang, Feilong Bao, Guanglai Gao

Abstract: Linear Graph Convolutional Networks (GCNs) are used to classify the node in the graph data. However, we note that most existing linear GCN models perform neural network operations in Euclidean space, which do not explicitly capture the tree-like hierarchical structure exhibited in real-world datasets that modeled as graphs. In this paper, we attempt to introduce hyperbolic space into linear GCN an… ▽ More Linear Graph Convolutional Networks (GCNs) are used to classify the node in the graph data. However, we note that most existing linear GCN models perform neural network operations in Euclidean space, which do not explicitly capture the tree-like hierarchical structure exhibited in real-world datasets that modeled as graphs. In this paper, we attempt to introduce hyperbolic space into linear GCN and propose a novel framework for Lorentzian linear GCN. Specifically, we map the learned features of graph nodes into hyperbolic space, and then perform a Lorentzian linear feature transformation to capture the underlying tree-like structure of data. Experimental results on standard citation networks datasets with semi-supervised learning show that our approach yields new state-of-the-art results of accuracy 74.7$\%$ on Citeseer and 81.3$\%$ on PubMed datasets. Furthermore, we observe that our approach can be trained up to two orders of magnitude faster than other nonlinear GCN models on PubMed dataset. Our code is publicly available at https://github.com/llqy123/LLGC-master. △ Less

Submitted 14 June, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

Comments: Accepted by LREC-COLING 2024

arXiv:2402.16539 [pdf]

Integrating Large Language Models with Graphical Session-Based Recommendation

Authors: Naicheng Guo, Hongwei Cheng, Qianqiao Liang, Linxun Chen, Bing Han

Abstract: With the rapid development of Large Language Models (LLMs), various explorations have arisen to utilize LLMs capability of context understanding on recommender systems. While pioneering strategies have primarily transformed traditional recommendation tasks into challenges of natural language generation, there has been a relative scarcity of exploration in the domain of session-based recommendation… ▽ More With the rapid development of Large Language Models (LLMs), various explorations have arisen to utilize LLMs capability of context understanding on recommender systems. While pioneering strategies have primarily transformed traditional recommendation tasks into challenges of natural language generation, there has been a relative scarcity of exploration in the domain of session-based recommendation (SBR) due to its specificity. SBR has been primarily dominated by Graph Neural Networks, which have achieved many successful outcomes due to their ability to capture both the implicit and explicit relationships between adjacent behaviors. The structural nature of graphs contrasts with the essence of natural language, posing a significant adaptation gap for LLMs. In this paper, we introduce large language models with graphical Session-Based recommendation, named LLMGR, an effective framework that bridges the aforementioned gap by harmoniously integrating LLMs with Graph Neural Networks (GNNs) for SBR tasks. This integration seeks to leverage the complementary strengths of LLMs in natural language understanding and GNNs in relational data processing, leading to a more powerful session-based recommender system that can understand and recommend items within a session. Moreover, to endow the LLM with the capability to empower SBR tasks, we design a series of prompts for both auxiliary and major instruction tuning tasks. These prompts are crafted to assist the LLM in understanding graph-structured data and align textual information with nodes, effectively translating nuanced user interactions into a format that can be understood and utilized by LLM architectures. Extensive experiments on three real-world datasets demonstrate that LLMGR outperforms several competitive baselines, indicating its effectiveness in enhancing SBR tasks and its potential as a research direction for future exploration. △ Less

Submitted 26 February, 2024; originally announced February 2024.

arXiv:2402.14634 [pdf, other]

doi 10.1145/3636534.3649376

GazeTrak: Exploring Acoustic-based Eye Tracking on a Glass Frame

Authors: Ke Li, Ruidong Zhang, Boao Chen, Siyuan Chen, Sicheng Yin, Saif Mahmud, Qikang Liang, François Guimbretière, Cheng Zhang

Abstract: In this paper, we present GazeTrak, the first acoustic-based eye tracking system on glasses. Our system only needs one speaker and four microphones attached to each side of the glasses. These acoustic sensors capture the formations of the eyeballs and the surrounding areas by emitting encoded inaudible sound towards eyeballs and receiving the reflected signals. These reflected signals are further… ▽ More In this paper, we present GazeTrak, the first acoustic-based eye tracking system on glasses. Our system only needs one speaker and four microphones attached to each side of the glasses. These acoustic sensors capture the formations of the eyeballs and the surrounding areas by emitting encoded inaudible sound towards eyeballs and receiving the reflected signals. These reflected signals are further processed to calculate the echo profiles, which are fed to a customized deep learning pipeline to continuously infer the gaze position. In a user study with 20 participants, GazeTrak achieves an accuracy of 3.6° within the same remounting session and 4.9° across different sessions with a refreshing rate of 83.3 Hz and a power signature of 287.9 mW. Furthermore, we report the performance of our gaze tracking system fully implemented on an MCU with a low-power CNN accelerator (MAX78002). In this configuration, the system runs at up to 83.3 Hz and has a total power signature of 95.4 mW with a 30 Hz FPS. △ Less

Submitted 23 February, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

Comments: 16 pages, 5 figures, 7 tables, The 30th Annual International Conference on Mobile Computing and Networking (ACM MobiCom 2024)

arXiv:2402.12074 [pdf, other]

doi 10.24963/IJCAI.2021/264

HIP Network: Historical Information Passing Network for Extrapolation Reasoning on Temporal Knowledge Graph

Authors: Yongquan He, Peng Zhang, Luchen Liu, Qi Liang, Wenyuan Zhang, Chuang Zhang

Abstract: In recent years, temporal knowledge graph (TKG) reasoning has received significant attention. Most existing methods assume that all timestamps and corresponding graphs are available during training, which makes it difficult to predict future events. To address this issue, recent works learn to infer future events based on historical information. However, these methods do not comprehensively consid… ▽ More In recent years, temporal knowledge graph (TKG) reasoning has received significant attention. Most existing methods assume that all timestamps and corresponding graphs are available during training, which makes it difficult to predict future events. To address this issue, recent works learn to infer future events based on historical information. However, these methods do not comprehensively consider the latent patterns behind temporal changes, to pass historical information selectively, update representations appropriately and predict events accurately. In this paper, we propose the Historical Information Passing (HIP) network to predict future events. HIP network passes information from temporal, structural and repetitive perspectives, which are used to model the temporal evolution of events, the interactions of events at the same time step, and the known events respectively. In particular, our method considers the updating of relation representations and adopts three scoring functions corresponding to the above dimensions. Experimental results on five benchmark datasets show the superiority of HIP network, and the significant improvements on Hits@1 prove that our method can more accurately predict what is going to happen. △ Less

Submitted 19 February, 2024; originally announced February 2024.

Comments: 7 pages, 3 figures

ACM Class: I.2.4; I.2.6; I.2.7

Journal ref: IJCAI (2021) 1915-1921

arXiv:2402.10834 [pdf, other]

Agent-based Simulation Evaluation of CBD Tolling: A Case Study from New York City

Authors: Qingnan Liang, Ruili Yao, Ruixuan Zhang, Zhibin Chen, Guoyuan Wu

Abstract: Congestion tollings have been widely developed and adopted as an effective tool to mitigate urban traffic congestion and enhance transportation system sustainability. Nevertheless, these tolling schemes are often tailored on a city-by-city or even area-by-area basis, and the cost of conducting field experiments often makes the design and evaluation process challenging. In this work, we leverage MA… ▽ More Congestion tollings have been widely developed and adopted as an effective tool to mitigate urban traffic congestion and enhance transportation system sustainability. Nevertheless, these tolling schemes are often tailored on a city-by-city or even area-by-area basis, and the cost of conducting field experiments often makes the design and evaluation process challenging. In this work, we leverage MATSim, a simulation platform that provides microscopic behaviors at the agent level, to evaluate performance on tolling schemes. Specifically, we conduct a case study of the Manhattan Central Business District (CBD) in New York City (NYC) using a fine-granularity traffic network model in the large-scale agent behavior setting. The flexibility of MATSim enables the implementation of a customized tolling policy proposed yet not deployed by the NYC agency while providing detailed interpretations. The quantitative and qualitative results indicate that the tested tolling program can regulate the personal vehicle volume in the CBD area and encourage the usage of public transportation, which proves to be a practical move towards sustainable transportation systems. More importantly, our work demonstrates that agent-based simulation helps better understand the travel pattern change subject to tollings in dense and complex urban environments, and it has the potential to facilitate efficient decision-making for the devotion to sustainable traffic management. △ Less

Submitted 16 February, 2024; originally announced February 2024.

Comments: Accepted by 2024 IEEE Forum on Integrated and Sustainable Transportation Systems

arXiv:2402.05092 [pdf, other]

doi 10.1103/PhysRevResearch.6.023217

Matter-wave interferometers with trapped strongly interacting Feshbach molecules

Authors: Chen Li, Qi Liang, Pradyumna Paranjape, RuGway Wu, Jörg Schmiedmayer

Abstract: We implement two types of matter-wave interferometers using trapped Bose-condensed Feshbach molecules, from weak to strong interactions. In each case, we focus on investigating interaction effects and their implications for the performance. In the Ramsey-type interferometer where interference between the two motional quantum states in an optical lattice is observed, interparticle interactions are… ▽ More We implement two types of matter-wave interferometers using trapped Bose-condensed Feshbach molecules, from weak to strong interactions. In each case, we focus on investigating interaction effects and their implications for the performance. In the Ramsey-type interferometer where interference between the two motional quantum states in an optical lattice is observed, interparticle interactions are found to induce energy shifts in the states. Consequently, this results in a reduction of the interferometer frequency and introduces a phase shift during the lattice pulses used for state manipulation. Furthermore, nonuniformity leads to dephasing and collisional effects contribute to the degradation of contrast. In the Michelson-type interferometer, where matter waves are spatially split and recombined in a waveguide, interference is observed in the presence of significant interaction, however coherence degrades with increasing interaction strength. Notably, coherence is also observed in thermal clouds, indicating the white-light nature of the implemented Michelson-type interferometer. △ Less

Submitted 21 May, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

Comments: 13 pages, 8 figures

Journal ref: Phys. Rev. Research 6, 023217 (2024)

arXiv:2402.03677 [pdf, other]

Physics-based Modeling of Pulse and Relaxation of High-rate Li/CF$_{x}$-SVO batteries in Implantable Medical Devices

Authors: Qiaohao Liang, Giacomo Galuppini, Partha M. Gomadam, Prabhakar A. Tamirisa, Jeffrey A. Lemmerman, Michael J. M. Mazack, Melani G. Sullivan, Richard D. Braatz, Martin Z. Bazant

Abstract: We present a physics-based model that accurately predicts the performance of Medtronic's implantable medical device battery lithium/carbon monofluoride (CF$_x$) - silver vanadium oxide (SVO) under both low-rate background monitoring and high-rate pulsing currents. The distinct properties of multiple active materials are reflected by parameterizing their thermodynamics, kinetics, and mass transport… ▽ More We present a physics-based model that accurately predicts the performance of Medtronic's implantable medical device battery lithium/carbon monofluoride (CF$_x$) - silver vanadium oxide (SVO) under both low-rate background monitoring and high-rate pulsing currents. The distinct properties of multiple active materials are reflected by parameterizing their thermodynamics, kinetics, and mass transport properties separately. Diffusion limitations of Li$^+$ in SVO are used to explain cell voltage transient behavior during pulse and post-pulse relaxation. We also introduce change in cathode electronic conductivity, Li metal anode surface morphology, and film resistance buildup to capture evolution of cell internal resistance throughout multi-year electrical tests. We share our insights on how the Li$^+$ redistribution process between active materials can restore pulse capability of the hybrid electrode, allow CF$_x$ to indirectly contribute to capacity release during pulsing, and affect the operation protocols and design principles of batteries with other hybrid electrodes. We also discuss additional complexities in porous electrode model parameterization and electrochemical characterization techniques due to parallel reactions and solid diffusion pathways across active materials. We hope our models implemented in the Hybrid Multiphase Porous Electrode Theory (Hybrid-MPET) framework can complement future experimental research and accelerate development of multi-active material electrodes with targeted performance. △ Less

Submitted 5 February, 2024; originally announced February 2024.

Comments: For code and sample usage, please visit: https://github.com/HarryQL/Hybrid-MPET/tree/medtronic_pulse

arXiv:2402.03305 [pdf, other]

Do Diffusion Models Learn Semantically Meaningful and Efficient Representations?

Authors: Qiyao Liang, Ziming Liu, Ila Fiete

Abstract: Diffusion models are capable of impressive feats of image generation with uncommon juxtapositions such as astronauts riding horses on the moon with properly placed shadows. These outputs indicate the ability to perform compositional generalization, but how do the models do so? We perform controlled experiments on conditional DDPMs learning to generate 2D spherical Gaussian bumps centered at specif… ▽ More Diffusion models are capable of impressive feats of image generation with uncommon juxtapositions such as astronauts riding horses on the moon with properly placed shadows. These outputs indicate the ability to perform compositional generalization, but how do the models do so? We perform controlled experiments on conditional DDPMs learning to generate 2D spherical Gaussian bumps centered at specified $x$- and $y$-positions. Our results show that the emergence of semantically meaningful latent representations is key to achieving high performance. En route to successful performance over learning, the model traverses three distinct phases of latent representations: (phase A) no latent structure, (phase B) a 2D manifold of disordered states, and (phase C) a 2D ordered manifold. Corresponding to each of these phases, we identify qualitatively different generation behaviors: 1) multiple bumps are generated, 2) one bump is generated but at inaccurate $x$ and $y$ locations, 3) a bump is generated at the correct $x$ and y location. Furthermore, we show that even under imbalanced datasets where features ($x$- versus $y$-positions) are represented with skewed frequencies, the learning process for $x$ and $y$ is coupled rather than factorized, demonstrating that simple vanilla-flavored diffusion models cannot learn efficient representations in which localization in $x$ and $y$ are factorized into separate 1D tasks. These findings suggest the need for future work to find inductive biases that will push generative models to discover and exploit factorizable independent structures in their inputs, which will be required to vault these models into more data-efficient regimes. △ Less

Submitted 30 April, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

Comments: 13 pages, 9 figures

arXiv:2402.02772 [pdf, other]

Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning

Authors: Yixiang Shan, Zhengbang Zhu, Ting Long, Qifan Liang, Yi Chang, Weinan Zhang, Liang Yin

Abstract: The performance of offline reinforcement learning (RL) is sensitive to the proportion of high-return trajectories in the offline dataset. However, in many simulation environments and real-world scenarios, there are large ratios of low-return trajectories rather than high-return trajectories, which makes learning an efficient policy challenging. In this paper, we propose a method called Contrastive… ▽ More The performance of offline reinforcement learning (RL) is sensitive to the proportion of high-return trajectories in the offline dataset. However, in many simulation environments and real-world scenarios, there are large ratios of low-return trajectories rather than high-return trajectories, which makes learning an efficient policy challenging. In this paper, we propose a method called Contrastive Diffuser (CDiffuser) to make full use of low-return trajectories and improve the performance of offline RL algorithms. Specifically, CDiffuser groups the states of trajectories in the offline dataset into high-return states and low-return states and treats them as positive and negative samples correspondingly. Then, it designs a contrastive mechanism to pull the trajectory of an agent toward high-return states and push them away from low-return states. Through the contrast mechanism, trajectories with low returns can serve as negative examples for policy learning, guiding the agent to avoid areas associated with low returns and achieve better performance. Experiments on 14 commonly used D4RL benchmarks demonstrate the effectiveness of our proposed method. Our code is publicly available at \url{https://anonymous.4open.science/r/CDiffuser}. △ Less

Submitted 15 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

Comments: 18 pages with appendix and references, 10 figures, 4 tables

arXiv:2401.03612 [pdf, other]

Chiral Dynamics of Ultracold Atoms under a Tunable SU(2) Synthetic Gauge Field

Authors: Qian Liang, Zhaoli Dong, Hongru Wang, Hang Li, Zhaoju Yang, Jian-Song Pan, Wei Yi, Bo Yan

Abstract: Surface currents emerge in superconductors exposed to magnetic fields, and are a key signature of the Meissner effect. Analogously, chiral dynamics were observed in quantum simulators under synthetic Abelian gauge fields. The flexible control of these simulators also facilitates the engineering of non-Abelian gauge fields, but their impact on the chiral dynamics remains elusive. Here, by employing… ▽ More Surface currents emerge in superconductors exposed to magnetic fields, and are a key signature of the Meissner effect. Analogously, chiral dynamics were observed in quantum simulators under synthetic Abelian gauge fields. The flexible control of these simulators also facilitates the engineering of non-Abelian gauge fields, but their impact on the chiral dynamics remains elusive. Here, by employing the cutting-edge momentum-lattice technique, we implement a synthetic SU(2) gauge field in a spinful 1D ladder and study the rich chiral dynamics therein. We confirm the non-Abelian nature of the synthetic potential by observing the non-Abelian Aharonov-Bohm effect on a single plaquette. More importantly, the chiral current along the two legs of the ladder is observed to be spin-dependent and highly tunable through the parameters of the gauge potential. We experimentally map out different dynamic regimes of the chiral current, and reveal the underlying competition between overlaying flux ladders with distinct spin compositions. Our experiment demonstrates the dramatic impact of non-Abelian gauge fields on the system dynamics, paving the way for future studies of exotic synthetic gauge fields on the versatile platform of momentum lattices. △ Less

Submitted 7 January, 2024; originally announced January 2024.

Comments: 6 pages, 4 figures

arXiv:2401.03611 [pdf, other]

Engineering topological chiral transport in a flat-band lattice of ultracold atoms

Authors: Hang Li, Qian Liang, Zhaoli Dong, Hongru Wang, Wei Yi, Jian-Song Pan, Bo Yan

Abstract: The manipulation of particle transport in synthetic quantum matter is an active research frontier for its theoretical importance and potential applications. Here we experimentally demonstrate an engineered topological transport in a synthetic flat-band lattice of ultracold $^{87}$Rb atoms. We implement a quasi-one-dimensional rhombic chain with staggered flux in the momentum space of the atomic co… ▽ More The manipulation of particle transport in synthetic quantum matter is an active research frontier for its theoretical importance and potential applications. Here we experimentally demonstrate an engineered topological transport in a synthetic flat-band lattice of ultracold $^{87}$Rb atoms. We implement a quasi-one-dimensional rhombic chain with staggered flux in the momentum space of the atomic condensate and observe biased local oscillations that originate from the interplay of the staggered flux and flat-band localization under the mechanism of Aharonov-Bohm caging. Based on these features, we design and experimentally confirm a state-dependent chiral transport under the periodic modulation of the synthetic flux. We show that the phenomenon is topologically protected by the winding of the Floquet Bloch bands of a coarse-grained effective Hamiltonian. The observed chiral transport offers a strategy for efficient quantum device design where topological robustness is ensured by fast Floquet driving and flat-band localization. △ Less

Submitted 7 January, 2024; originally announced January 2024.

Comments: 5 pages, 3 figures

arXiv:2401.03142 [pdf, other]

Explicit Visual Prompts for Visual Object Tracking

Authors: Liangtao Shi, Bineng Zhong, Qihua Liang, Ning Li, Sheng** Zhang, Xianxian Li

Abstract: How to effectively exploit spatio-temporal information is crucial to capture target appearance changes in visual tracking. However, most deep learning-based trackers mainly focus on designing a complicated appearance model or template updating strategy, while lacking the exploitation of context between consecutive frames and thus entailing the \textit{when-and-how-to-update} dilemma. To address th… ▽ More How to effectively exploit spatio-temporal information is crucial to capture target appearance changes in visual tracking. However, most deep learning-based trackers mainly focus on designing a complicated appearance model or template updating strategy, while lacking the exploitation of context between consecutive frames and thus entailing the \textit{when-and-how-to-update} dilemma. To address these issues, we propose a novel explicit visual prompts framework for visual tracking, dubbed \textbf{EVPTrack}. Specifically, we utilize spatio-temporal tokens to propagate information between consecutive frames without focusing on updating templates. As a result, we cannot only alleviate the challenge of \textit{when-to-update}, but also avoid the hyper-parameters associated with updating strategies. Then, we utilize the spatio-temporal tokens to generate explicit visual prompts that facilitate inference in the current frame. The prompts are fed into a transformer encoder together with the image tokens without additional processing. Consequently, the efficiency of our model is improved by avoiding \textit{how-to-update}. In addition, we consider multi-scale information as explicit visual prompts, providing multiscale template features to enhance the EVPTrack's ability to handle target scale changes. Extensive experimental results on six benchmarks (i.e., LaSOT, LaSOT\rm $_{ext}$, GOT-10k, UAV123, TrackingNet, and TNL2K.) validate that our EVPTrack can achieve competitive performance at a real-time speed by effectively exploiting both spatio-temporal and multi-scale information. Code and models are available at https://github.com/GXNU-ZhongLab/EVPTrack. △ Less

Submitted 6 January, 2024; originally announced January 2024.

arXiv:2401.01686 [pdf, other]

ODTrack: Online Dense Temporal Token Learning for Visual Tracking

Authors: Yaozong Zheng, Bineng Zhong, Qihua Liang, Zhiyi Mo, Sheng** Zhang, Xianxian Li

Abstract: Online contextual reasoning and association across consecutive video frames are critical to perceive instances in visual tracking. However, most current top-performing trackers persistently lean on sparse temporal relationships between reference and search frames via an offline mode. Consequently, they can only interact independently within each image-pair and establish limited temporal correlatio… ▽ More Online contextual reasoning and association across consecutive video frames are critical to perceive instances in visual tracking. However, most current top-performing trackers persistently lean on sparse temporal relationships between reference and search frames via an offline mode. Consequently, they can only interact independently within each image-pair and establish limited temporal correlations. To alleviate the above problem, we propose a simple, flexible and effective video-level tracking pipeline, named \textbf{ODTrack}, which densely associates the contextual relationships of video frames in an online token propagation manner. ODTrack receives video frames of arbitrary length to capture the spatio-temporal trajectory relationships of an instance, and compresses the discrimination features (localization information) of a target into a token sequence to achieve frame-to-frame association. This new solution brings the following benefits: 1) the purified token sequences can serve as prompts for the inference in the next video frame, whereby past information is leveraged to guide future inference; 2) the complex online update strategies are effectively avoided by the iterative propagation of token sequences, and thus we can achieve more efficient model representation and computation. ODTrack achieves a new \textit{SOTA} performance on seven benchmarks, while running at real-time speed. Code and models are available at \url{https://github.com/GXNU-ZhongLab/ODTrack}. △ Less

Submitted 3 January, 2024; originally announced January 2024.

arXiv:2311.14319 [pdf, ps, other]

Convolutional double copy in (Anti) de Sitter space

Authors: Qiuyue Liang, Silvia Nagy

Abstract: The double copy is a remarkable relationship between gauge theory and gravity that has been explored in a number of contexts, most notably scattering amplitudes and classical solutions. The convolutional double copy provides a straightforward method to bridge the two theories via a precise map for the fields and symmetries at the linearised level. This method has been thoroughly investigated in fl… ▽ More The double copy is a remarkable relationship between gauge theory and gravity that has been explored in a number of contexts, most notably scattering amplitudes and classical solutions. The convolutional double copy provides a straightforward method to bridge the two theories via a precise map for the fields and symmetries at the linearised level. This method has been thoroughly investigated in flat space, offering a comprehensive dictionary both with and without fixing the gauge degrees of freedom. In this paper, we extend this to curved space with an (anti) de Sitter background metric. We work in the temporal gauge, and employ a modified convolution that involves the Mellin transformation in the time direction. As an example, we show that the point-like charge in gauge theory double copies to the (dS-) Schwarzschild black hole solution. △ Less

Submitted 24 November, 2023; originally announced November 2023.

arXiv:2311.04760 [pdf, other]

Towards Open-world Cross-Domain Sequential Recommendation: A Model-Agnostic Contrastive Denoising Approach

Authors: Wujiang Xu, Xuying Ning, Wenfang Lin, Mingming Ha, Qiongxu Ma, Qianqiao Liang, Xuewen Tao, Linxun Chen, Bing Han, Minnan Luo

Abstract: Cross-domain sequential recommendation (CDSR) aims to address the data sparsity problems that exist in traditional sequential recommendation (SR) systems. The existing approaches aim to design a specific cross-domain unit that can transfer and propagate information across multiple domains by relying on overlap** users with abundant behaviors. However, in real-world recommender systems, CDSR sc… ▽ More Cross-domain sequential recommendation (CDSR) aims to address the data sparsity problems that exist in traditional sequential recommendation (SR) systems. The existing approaches aim to design a specific cross-domain unit that can transfer and propagate information across multiple domains by relying on overlap** users with abundant behaviors. However, in real-world recommender systems, CDSR scenarios usually consist of a majority of long-tailed users with sparse behaviors and cold-start users who only exist in one domain. This leads to a drop in the performance of existing CDSR methods in the real-world industry platform. Therefore, improving the consistency and effectiveness of models in open-world CDSR scenarios is crucial for constructing CDSR models (\textit{1st} CH). Recently, some SR approaches have utilized auxiliary behaviors to complement the information for long-tailed users. However, these multi-behavior SR methods cannot deliver promising performance in CDSR, as they overlook the semantic gap between target and auxiliary behaviors, as well as user interest deviation across domains (\textit{2nd} CH). △ Less

Submitted 5 June, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

arXiv:2309.16666 [pdf, other]

doi 10.1103/PhysRevD.109.083028

Probing Parity Violation in the Stochastic Gravitational Wave Background with Astrometry

Authors: Qiuyue Liang, Meng-Xiang Lin, Mark Trodden, Sam S. C. Wong

Abstract: Astrometry holds the potential for testing fundamental physics through the effects of the Stochastic Gravitational Wave Background (SGWB) in the $\sim 1-100$ nHz frequency band on precision measurements of stellar positions. Such measurements are complementary to tests made possible by the detection of the SGWB using Pulsar Timing Arrays. Here, the feasibility of using astrometry for the identific… ▽ More Astrometry holds the potential for testing fundamental physics through the effects of the Stochastic Gravitational Wave Background (SGWB) in the $\sim 1-100$ nHz frequency band on precision measurements of stellar positions. Such measurements are complementary to tests made possible by the detection of the SGWB using Pulsar Timing Arrays. Here, the feasibility of using astrometry for the identification of parity-violating signals within the SGWB is investigated. This is achieved by defining and quantifying a non-vanishing $EB$ correlation function within astrometric correlation functions, and investigating how one might estimate the detectability of such signals. △ Less

Submitted 6 May, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

Comments: Match the published version, adding references. 8 pages, 2 figures

arXiv:2309.12502 [pdf, ps, other]

doi 10.1109/TSP.2023.3310252

Secure Degree of Freedom of Wireless Networks Using Collaborative Pilots

Authors: Yingbo Hua, Qingpeng Liang, Md Saydur Rahman

Abstract: A wireless network of full-duplex nodes/users, using anti-eavesdrop** channel estimation (ANECE) based on collaborative pilots, can yield a positive secure degree-of-freedom (SDoF) regardless of the number of antennas an eavesdropper may have. This paper presents novel results on SDoF of ANECE by analyzing secret-key capacity (SKC) of each pair of nodes in a network of multiple collaborative nod… ▽ More A wireless network of full-duplex nodes/users, using anti-eavesdrop** channel estimation (ANECE) based on collaborative pilots, can yield a positive secure degree-of-freedom (SDoF) regardless of the number of antennas an eavesdropper may have. This paper presents novel results on SDoF of ANECE by analyzing secret-key capacity (SKC) of each pair of nodes in a network of multiple collaborative nodes per channel coherence period. Each transmission session of ANECE has two phases: phase 1 is used for pilots, and phase 2 is used for random symbols. This results in two parts of SDoF of ANECE. Both lower and upper bounds on the SDoF of ANECE for any number of users are shown, and the conditions for the two bounds to meet are given. This leads to important discoveries, including: a) The phase-1 SDoF is the same for both multi-user ANECE and pair-wise ANECE while the former may require only a fraction of the number of time slots needed by the latter; b) For a three-user network, the phase-2 SDoF of all-user ANECE is generally larger than that of pair-wise ANECE; c) For a two-user network, a modified ANECE deploying square-shaped nonsingular pilot matrices yields a higher total SDoF than the original ANECE. The multi-user ANECE and the modified two-user ANECE shown in this paper appear to be the best full-duplex schemes known today in terms of SDoF subject to each node using a given number of antennas for both transmitting and receiving. △ Less

Submitted 21 September, 2023; originally announced September 2023.

arXiv:2309.11895 [pdf, other]

Audio Contrastive based Fine-tuning

Authors: Yang Wang, Qibin Liang, Chenghao Xiao, Yizhi Li, Noura Al Moubayed, Chenghua Lin

Abstract: Audio classification plays a crucial role in speech and sound processing tasks with a wide range of applications. There still remains a challenge of striking the right balance between fitting the model to the training data (avoiding overfitting) and enabling it to generalise well to a new domain. Leveraging the transferability of contrastive learning, we introduce Audio Contrastive-based Fine-tuni… ▽ More Audio classification plays a crucial role in speech and sound processing tasks with a wide range of applications. There still remains a challenge of striking the right balance between fitting the model to the training data (avoiding overfitting) and enabling it to generalise well to a new domain. Leveraging the transferability of contrastive learning, we introduce Audio Contrastive-based Fine-tuning (AudioConFit), an efficient approach characterised by robust generalisability. Empirical experiments on a variety of audio classification tasks demonstrate the effectiveness and robustness of our approach, which achieves state-of-the-art results in various settings. △ Less

Submitted 19 October, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

Comments: Under review

arXiv:2309.07652 [pdf]

Photochemical reaction enabling the engineering of photonic spin-orbit coupling in organic-crystal optical microcavities

Authors: Qian Liang, Xuekai Ma, Jiahuan Ren, Teng Long, Chunling Gu, Cunbin An, Hongbing Fu, Stefan Schumacher, Qing Liao

Abstract: The control and active manipulation of spin-orbit coupling (SOC) in photonic systems is fundamental in the development of modern spin optics and topological photonic devices. Here, we demonstrate the control of an artificial Rashba-Dresselhaus (RD) SOC mediated by photochemical reactions in a microcavity filled with an organic single-crystal of photochromic phase-change character. Splitting of the… ▽ More The control and active manipulation of spin-orbit coupling (SOC) in photonic systems is fundamental in the development of modern spin optics and topological photonic devices. Here, we demonstrate the control of an artificial Rashba-Dresselhaus (RD) SOC mediated by photochemical reactions in a microcavity filled with an organic single-crystal of photochromic phase-change character. Splitting of the circular polarization components of the optical modes induced by photonic RD SOC is observed experimentally in momentum space. By applying an ultraviolet light beam, we control the spatial molecular orientation through a photochemical reaction and with that we control the energies of the photonic modes. This way we realize a reversible conversion of spin-splitting of the optical modes with different energies, leading to an optically controlled switching between circularly and linearly polarized emission from our device. Our strategy of in situ and reversible engineering of SOC induced by a light field provides a promising approach to actively design and manipulate synthetic gauge fields towards future on-chip integration in photonics and topological photonic devices. △ Less

Submitted 14 September, 2023; originally announced September 2023.

arXiv:2308.14103 [pdf, other]

Towards Unified Token Learning for Vision-Language Tracking

Authors: Yaozong Zheng, Bineng Zhong, Qihua Liang, Guorong Li, Rongrong Ji, Xianxian Li

Abstract: In this paper, we present a simple, flexible and effective vision-language (VL) tracking pipeline, termed \textbf{MMTrack}, which casts VL tracking as a token generation task. Traditional paradigms address VL tracking task indirectly with sophisticated prior designs, making them over-specialize on the features of specific architectures or mechanisms. In contrast, our proposed framework serializes… ▽ More In this paper, we present a simple, flexible and effective vision-language (VL) tracking pipeline, termed \textbf{MMTrack}, which casts VL tracking as a token generation task. Traditional paradigms address VL tracking task indirectly with sophisticated prior designs, making them over-specialize on the features of specific architectures or mechanisms. In contrast, our proposed framework serializes language description and bounding box into a sequence of discrete tokens. In this new design paradigm, all token queries are required to perceive the desired target and directly predict spatial coordinates of the target in an auto-regressive manner. The design without other prior modules avoids multiple sub-tasks learning and hand-designed loss functions, significantly reducing the complexity of VL tracking modeling and allowing our tracker to use a simple cross-entropy loss as unified optimization objective for VL tracking task. Extensive experiments on TNL2K, LaSOT, LaSOT$_{\rm{ext}}$ and OTB99-Lang benchmarks show that our approach achieves promising results, compared to other state-of-the-arts. △ Less

Submitted 27 August, 2023; originally announced August 2023.

arXiv:2308.06898 [pdf, other]

CupCleaner: A Data Cleaning Approach for Comment Updating

Authors: Qingyuan Liang, Zeyu Sun, Qihao Zhu, Junhao Hu, Yifan Zhao, Lu Zhang

Abstract: Recently, deep learning-based techniques have shown promising performance on various tasks related to software engineering. For these learning-based approaches to perform well, obtaining high-quality data is one fundamental and crucial issue. The comment updating task is an emerging software engineering task aiming at automatically updating the corresponding comments based on changes in source cod… ▽ More Recently, deep learning-based techniques have shown promising performance on various tasks related to software engineering. For these learning-based approaches to perform well, obtaining high-quality data is one fundamental and crucial issue. The comment updating task is an emerging software engineering task aiming at automatically updating the corresponding comments based on changes in source code. However, datasets for the comment updating tasks are usually crawled from committed versions in open source software repositories such as GitHub, where there is lack of quality control of comments. In this paper, we focus on cleaning existing comment updating datasets with considering some properties of the comment updating process in software development. We propose a semantic and overlap**-aware approach named CupCleaner (Comment UPdating's CLEANER) to achieve this purpose. Specifically, we calculate a score based on semantics and overlap** information of the code and comments. Based on the distribution of the scores, we filter out the data with low scores in the tail of the distribution to get rid of possible unclean data. We first conducted a human evaluation on the noise data and high-quality data identified by CupCleaner. The results show that the human ratings of the noise data identified by CupCleaner are significantly lower. Then, we applied our data cleaning approach to the training and validation sets of three existing comment updating datasets while kee** the test set unchanged. Our experimental results show that even after filtering out over 30\% of the data using CupCleaner, there is still an improvement in all performance metrics. The experimental results on the cleaned test set also suggest that CupCleaner may provide help for constructing datasets for updating-related tasks. △ Less

Submitted 13 August, 2023; originally announced August 2023.

arXiv:2307.15841 [pdf, other]

doi 10.1088/2058-9565/ad3a98

Pulse optimization for high-precision motional-mode characterization in trapped-ion quantum computers

Authors: Qiyao Liang, Mingyu Kang, Ming Li, Yunseong Nam

Abstract: High-fidelity operation of quantum computers requires precise knowledge of the physical system through characterization. For motion-mediated entanglement generation in trapped ions, it is crucial to have precise knowledge of the motional-mode parameters such as the mode frequencies and the Lamb-Dicke parameters. Unfortunately, the state-of-the-art mode-characterization schemes do not easily render… ▽ More High-fidelity operation of quantum computers requires precise knowledge of the physical system through characterization. For motion-mediated entanglement generation in trapped ions, it is crucial to have precise knowledge of the motional-mode parameters such as the mode frequencies and the Lamb-Dicke parameters. Unfortunately, the state-of-the-art mode-characterization schemes do not easily render the mode parameters in a sufficiently scalable and accurate fashion, due to the unwanted excitation of adjacent modes in the frequency space when targeting a single mode, an effect known as the \textit{cross-mode coupling}. Here, we develop an alternative scheme that leverages the degrees of freedom in pulse design for the characterization experiment such that the effects of the cross-mode coupling is actively silenced. Further, we devise stabilization methods to accurately characterize the Lamb-Dicke parameters even when the mode frequencies are not precisely known due to experimental drifts or characterization inaccuracies. We extensively benchmark our scheme in simulations of a three-ion chain and discuss the parameter regimes in which the shaped pulses significantly outperform the traditional square pulses. △ Less

Submitted 4 April, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

Comments: 20 pages, 7 figures

Journal ref: Quantum Sci. Technol. 9, 035007 (2024)

arXiv:2306.17235 [pdf, other]

doi 10.1103/PhysRevResearch.6.023118

Modeling the Performance of Early Fault-Tolerant Quantum Algorithms

Authors: Qiyao Liang, Yiqing Zhou, Archismita Dalal, Peter D. Johnson

Abstract: Progress in fault-tolerant quantum computation (FTQC) has driven the pursuit of practical applications with early fault-tolerant quantum computers (EFTQC). These devices, limited in their qubit counts and fault-tolerance capabilities, require algorithms that can accommodate some degrees of error, which are known as EFTQC algorithms. To predict the onset of early quantum advantage, a comprehensive… ▽ More Progress in fault-tolerant quantum computation (FTQC) has driven the pursuit of practical applications with early fault-tolerant quantum computers (EFTQC). These devices, limited in their qubit counts and fault-tolerance capabilities, require algorithms that can accommodate some degrees of error, which are known as EFTQC algorithms. To predict the onset of early quantum advantage, a comprehensive methodology is needed to develop and analyze EFTQC algorithms, drawing insights from both the methodologies of noisy intermediate-scale quantum (NISQ) and traditional FTQC. To address this need, we propose such a methodology for modeling algorithm performance on EFTQC devices under varying degrees of error. As a case study, we apply our methodology to analyze the performance of Randomized Fourier Estimation (RFE), an EFTQC algorithm for phase estimation. We investigate the runtime performance and the fault-tolerant overhead of RFE in comparison to the traditional quantum phase estimation algorithm. Our analysis reveals that RFE achieves significant savings in physical qubit counts while having a much higher runtime upper bound. We anticipate even greater physical qubit savings when considering more realistic assumptions about the performance of EFTQC devices. By providing insights into the performance trade-offs and resource requirements of EFTQC algorithms, our work contributes to the development of practical and efficient quantum computing solutions on the path to quantum advantage. △ Less

Submitted 12 December, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

Comments: 9 pages, 8 figures, plus appendix

Journal ref: Phys. Rev. Research 6, 023118 (2024)

arXiv:2306.05301 [pdf, other]

ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases

Authors: Qiaoyu Tang, Ziliang Deng, Hongyu Lin, Xianpei Han, Qiao Liang, Boxi Cao, Le Sun

Abstract: Enabling large language models to utilize real-world tools effectively is crucial for achieving embodied intelligence. Existing approaches to tool learning have either primarily relied on extremely large language models, such as GPT-4, to attain generalized tool-use abilities in a zero-shot manner, or utilized supervised learning to train limited scopes of tools on compact models. However, it rema… ▽ More Enabling large language models to utilize real-world tools effectively is crucial for achieving embodied intelligence. Existing approaches to tool learning have either primarily relied on extremely large language models, such as GPT-4, to attain generalized tool-use abilities in a zero-shot manner, or utilized supervised learning to train limited scopes of tools on compact models. However, it remains uncertain whether smaller language models can achieve generalized tool-use abilities without tool-specific training. To address this question, this paper introduces ToolAlpaca, a novel framework designed to automatically generate a diverse tool-use corpus and learn generalized tool-use abilities on compact language models with minimal human intervention. Specifically, ToolAlpaca first automatically creates a highly diversified tool-use corpus by building a multi-agent simulation environment. The corpus contains 3938 tool-use instances from more than 400 real-world tool APIs spanning 50 distinct categories. Subsequently, the constructed corpus is employed to fine-tune compact language models, resulting in two models, namely ToolAlpaca-7B and ToolAlpaca-13B, respectively. Finally, we evaluate the ability of these models to utilize previously unseen tools without specific training. Experimental results demonstrate that ToolAlpaca achieves effective generalized tool-use capabilities comparable to those of extremely large language models like GPT-3.5, demonstrating that learning generalized tool-use ability is feasible for compact language models. △ Less

Submitted 7 September, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

arXiv:2305.15599 [pdf, other]

Hybrid-MPET: an open-source simulation software for hybrid electrode batteries

Authors: Qiaohao Liang, Martin Z. Bazant

Abstract: As the design of single-component battery electrodes has matured, the battery industry has turned to hybrid electrodes with blends of two or more active materials to enhance battery performance. Leveraging the best properties of each material while mitigating their drawbacks, multi-component hybrid electrodes open a vast new design space that could be most efficiently explored through simulations.… ▽ More As the design of single-component battery electrodes has matured, the battery industry has turned to hybrid electrodes with blends of two or more active materials to enhance battery performance. Leveraging the best properties of each material while mitigating their drawbacks, multi-component hybrid electrodes open a vast new design space that could be most efficiently explored through simulations. In this article, we introduce a mathematical modeling framework and open-source battery simulation software package for Hybrid Multiphase Porous Electrode Theory (Hybrid-MPET), capable of accounting for the parallel reactions, phase transformations and multiscale heterogeneities in hybrid porous electrodes. Hybrid-MPET models can simulate both solid solution and multiphase active materials in hybrid electrodes at intra-particle and inter-particle scales. Its modular design also allows the combination of different active materials at any capacity fraction. To illustrate the novel features of Hybrid-MPET, we present experimentally validated models of silicon-graphite (Si-Gr) anodes used in electric vehicle batteries and carbon monofluoride (CFx) - silver vanadium oxide (SVO) cathodes used in implantable medical device batteries. The results demonstrate the potential of Hybrid-MPET models to accelerate the development of hybrid electrode batteries by providing fast predictions of their performance over a wide range of design parameters and operating protocols. △ Less

Submitted 24 May, 2023; originally announced May 2023.

Comments: For code and sample usage, please visit: https://github.com/HarryQL/Hybrid-MPET

arXiv:2305.09920 [pdf, other]

doi 10.1021/acs.jctc.3c00518

Low-data deep quantum chemical learning for accurate MP2 and coupled-cluster correlations

Authors: Wai-Pan Ng, Qiujiang Liang, Jun Yang

Abstract: Accurate ab-initio prediction of electronic energies is very expensive for macromolecules by explicitly solving post-Hartree-Fock equations. We here exploit the physically justified local correlation feature in compact basis of small molecules, and construct an expressive low-data deep neural network (dNN) model to obtain machine-learned electron correlation energies on par with MP2 and CCSD level… ▽ More Accurate ab-initio prediction of electronic energies is very expensive for macromolecules by explicitly solving post-Hartree-Fock equations. We here exploit the physically justified local correlation feature in compact basis of small molecules, and construct an expressive low-data deep neural network (dNN) model to obtain machine-learned electron correlation energies on par with MP2 and CCSD levels of theory for more complex molecules and different datasets that are not represented in the training set. We show that our dNN-powered model is data efficient and makes highly transferable prediction across alkanes of various lengths, organic molecules with non-covalent and biomolecular interactions, as well as water clusters of different sizes and morphologies. In particular, by training 800 (H$_2$O)$_8$ clusters with the local correlation descriptors, accurate MP2/cc-pVTZ correlation energies up to (H$_2$O)$_{128}$ can be predicted with a small random error within chemical accuracy from exact values, while a majority of prediction deviations are attributed to an intrinsically systematic error. Our results reveal that an extremely compact local correlation feature set, which is poor for any direct post-Hartree-Fock calculations, has however a prominent advantage in reserving important electron correlation patterns for making accurate transferable predictions across distinct molecular compositions, bond types and geometries. △ Less

Submitted 17 July, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

arXiv:2304.10026 [pdf, ps, other]

On a Sharp Estimate of Overlap** Schwarz Methods in $\mathrm{H}(\mathrm{curl};Ω)$ and $\mathrm{H}(\mathrm{div};Ω)$

Authors: Qigang Liang, Xuejun Xu, Shangyou Zhang

Abstract: The previous proved-bound is $C(1+\frac{H^2}{δ^2})$ for the condition number of the overlap** domain decomposition $\mathrm{H}(\mathrm{curl};Ω)$ and $\mathrm{H}(\mathrm{div};Ω)$ methods, where $H$ and $δ$ are the sizes of subdomains and overlaps respectively. But all numerical results indicate that the best bound is $C(1+\frac{H}δ)$. In this work, we solve this long-standing open problem by prov… ▽ More The previous proved-bound is $C(1+\frac{H^2}{δ^2})$ for the condition number of the overlap** domain decomposition $\mathrm{H}(\mathrm{curl};Ω)$ and $\mathrm{H}(\mathrm{div};Ω)$ methods, where $H$ and $δ$ are the sizes of subdomains and overlaps respectively. But all numerical results indicate that the best bound is $C(1+\frac{H}δ)$. In this work, we solve this long-standing open problem by proving that $C(1+\frac{H}δ)$ is indeed the best bound. △ Less

Submitted 19 April, 2023; originally announced April 2023.

arXiv:2304.03029 [pdf, other]

The most probable distributions with finite number of particles and the applications on Bose-Einstein condensation

Authors: Qi-Wei Liang, Wenyu Wang

Abstract: Motivated by the Asynchronous Finite Differences Method utilized for the calculation of the most probable distributions of finite particle number systems, this study employs numerical variation and central difference techniques to provide more precise estimations regarding these distributions. Specifically, three novel finite distributions are derived and applied to Bose-Einstein condensation, rev… ▽ More Motivated by the Asynchronous Finite Differences Method utilized for the calculation of the most probable distributions of finite particle number systems, this study employs numerical variation and central difference techniques to provide more precise estimations regarding these distributions. Specifically, three novel finite distributions are derived and applied to Bose-Einstein condensation, revealing that the critical condition ($nλ^3=2.612$) may be relaxed in finite particle number scenarios. Moreover, maintaining density as a constant is anticipated to result in a higher critical temperature compared to infinite number systems. Notably, the obtained condensate number on the zero-energy level surpasses that of predictions generated by canonical distributions. △ Less

Submitted 6 April, 2023; originally announced April 2023.

Comments: 13 pages,5 figures

arXiv:2304.02640 [pdf, other]

A Test of Gravity with Pulsar Timing Arrays

Authors: Qiuyue Liang, Meng-Xiang Lin, Mark Trodden

Abstract: A successful measurement of the Stochastic Gravitational Wave Background (SGWB) in Pulsar Timing Arrays (PTAs) would open up a new window through which to test the predictions of General Relativity (GR). We consider how these measurements might reveal deviations from GR by studying the overlap reduction function -- the quantity that in GR is approximated by the Hellings-Downs curve -- in some samp… ▽ More A successful measurement of the Stochastic Gravitational Wave Background (SGWB) in Pulsar Timing Arrays (PTAs) would open up a new window through which to test the predictions of General Relativity (GR). We consider how these measurements might reveal deviations from GR by studying the overlap reduction function -- the quantity that in GR is approximated by the Hellings-Downs curve -- in some sample modifications of gravity, focusing on the generic prediction of a modified dispersion relation for gravitational waves. We find a distinct signature of such modifications to GR -- a shift in the minimum angle of the angular distribution -- and demonstrate that this shift is quantitatively sensitive to any change in the phase velocity. In a given modification of gravity, this result can be used, in some regions of parameter space, to distinguish the effect of a modified dispersion relation from that due to the presence of extra polarization modes. △ Less

Submitted 14 April, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

Comments: v2: add references and discussions. 23 pages, 4 figures

arXiv:2303.13848 [pdf, other]

doi 10.1371/journal.pcbi.1011513

Patch formation driven by stochastic effects of interaction between viruses and defective interfering particles

Authors: Qiantong Liang, Johnny Yang, Wai-Tong Louis Fan, Wing-Cheong Lo

Abstract: Defective interfering particles (DIPs) are virus-like particles that occur naturally during virus infections. These particles are defective, lacking essential genetic materials for replication, but they can interact with the wild-type virus and potentially be used as therapeutic agents. However, the effect of DIPs on infection spread is still unclear due to complicated stochastic effects and nonli… ▽ More Defective interfering particles (DIPs) are virus-like particles that occur naturally during virus infections. These particles are defective, lacking essential genetic materials for replication, but they can interact with the wild-type virus and potentially be used as therapeutic agents. However, the effect of DIPs on infection spread is still unclear due to complicated stochastic effects and nonlinear spatial dynamics. In this work, we develop a model with a new hybrid method to study the spatial-temporal dynamics of viruses and DIPs co-infections within hosts. We present two different scenarios of virus production and compare the results from deterministic and stochastic models to demonstrate how the stochastic effect is involved in the spatial dynamics of virus transmission. We quantitatively study the spread features of the virus, including the formation and the speed of virus spread and the emergence of stochastic patchy patterns of virus distribution. Our simulations simultaneously capture observed spatial spread features in the experimental data, including the spread rate of the virus and its patchiness. The results demonstrate that DIPs can slow down the growth of virus particles and make the spread of the virus more patchy. △ Less

Submitted 24 March, 2023; originally announced March 2023.

Journal ref: PLoS Comput Biol 19(10), 2023

arXiv:2303.02963 [pdf, other]

doi 10.1038/s41566-023-01290-1

Quantum time reflection and refraction of ultracold atoms

Authors: Zhaoli Dong, Hang Li, Tuo Wan, Qian Liang, Zhaoju Yang, Bo Yan

Abstract: Time reflection and refraction are temporal analogies of the spatial boundary effects derived from Fermat's principle. They occur when classical waves strike a time boundary where an abrupt change in the properties of the medium is introduced. The main features of time-reflected and refracted waves are the shift of frequency and conservation of momentum, which offer a new degree of freedom for ste… ▽ More Time reflection and refraction are temporal analogies of the spatial boundary effects derived from Fermat's principle. They occur when classical waves strike a time boundary where an abrupt change in the properties of the medium is introduced. The main features of time-reflected and refracted waves are the shift of frequency and conservation of momentum, which offer a new degree of freedom for steering extreme waves and controlling phases of matter. The concept was originally proposed for manipulating optical waves more than five decades ago. However, due to the extreme challenges in the ultrafast engineering of the optical materials, the experimental realization of the time boundary effects remains elusive. Here, we introduce a time boundary into a momentum lattice of ultracold atoms and simultaneously demonstrate the time reflection and refraction experimentally. Through launching a Gaussian-superposed state into the Su-Schrieffer-Heeger (SSH) atomic chain, we observe the time-reflected and refracted waves when the input state strikes a time boundary. Furthermore, we detect a transition from time reflection/refraction to localization with increasing strength of disorder and show that the time boundary effects are robust against considerable disorder. Our work opens a new avenue for future exploration of time boundaries and spatiotemporal lattices, and their interplay with non-Hermiticity and many-body interactions. △ Less

Submitted 6 March, 2023; originally announced March 2023.

Comments: 6 pages, 4 figures

Journal ref: Nature Photonics (2023)

arXiv:2303.01778 [pdf, other]

FedML Parrot: A Scalable Federated Learning System via Heterogeneity-aware Scheduling on Sequential and Hierarchical Training

Authors: Zhenheng Tang, Xiaowen Chu, Ryan Yide Ran, Sunwoo Lee, Shaohuai Shi, Yonggang Zhang, Yuxin Wang, Alex Qiaozhong Liang, Salman Avestimehr, Chaoyang He

Abstract: Federated Learning (FL) enables collaborations among clients for train machine learning models while protecting their data privacy. Existing FL simulation platforms that are designed from the perspectives of traditional distributed training, suffer from laborious code migration between simulation and production, low efficiency, low GPU utility, low scalability with high hardware requirements and d… ▽ More Federated Learning (FL) enables collaborations among clients for train machine learning models while protecting their data privacy. Existing FL simulation platforms that are designed from the perspectives of traditional distributed training, suffer from laborious code migration between simulation and production, low efficiency, low GPU utility, low scalability with high hardware requirements and difficulty of simulating stateful clients. In this work, we firstly demystify the challenges and bottlenecks of simulating FL, and design a new FL system named as FedML \texttt{Parrot}. It improves the training efficiency, remarkably relaxes the requirements on the hardware, and supports efficient large-scale FL experiments with stateful clients by: (1) sequential training clients on devices; (2) decomposing original aggregation into local and global aggregation on devices and server respectively; (3) scheduling tasks to mitigate straggler problems and enhance computing utility; (4) distributed client state manager to support various FL algorithms. Besides, built upon our generic APIs and communication interfaces, users can seamlessly transform the simulation into the real-world deployment without modifying codes. We evaluate \texttt{Parrot} through extensive experiments for training diverse models on various FL datasets to demonstrate that \texttt{Parrot} can achieve simulating over 1000 clients (stateful or stateless) with flexible GPU devices setting ($4 \sim 32$) and high GPU utility, 1.2 $\sim$ 4 times faster than FedScale, and 10 $\sim$ 100 times memory saving than FedML. And we verify that \texttt{Parrot} works well with homogeneous and heterogeneous devices in three different clusters. Two FL algorithms with stateful clients and four algorithms with stateless clients are simulated to verify the wide adaptability of \texttt{Parrot} to different algorithms. △ Less

Submitted 3 March, 2023; originally announced March 2023.

arXiv:2302.09091 [pdf, other]

Neutrino-Assisted Early Dark Energy is a Natural Resolution of the Hubble Tension

Authors: Mariana Carrillo González, Qiuyue Liang, Jeremy Sakstein, Mark Trodden

Abstract: It has very recently been claimed that the neutrino-assisted early dark energy model -- a promising resolution of the Hubble tension that can ameliorate the theoretical fine-tuning and coincidence problems that plague other theories -- does not provide natural or cosmologically interesting results. In this short paper, we show that these conclusions are incorrect for three reasons. First, we ident… ▽ More It has very recently been claimed that the neutrino-assisted early dark energy model -- a promising resolution of the Hubble tension that can ameliorate the theoretical fine-tuning and coincidence problems that plague other theories -- does not provide natural or cosmologically interesting results. In this short paper, we show that these conclusions are incorrect for three reasons. First, we identify errors in the calculations. Second, we dispute the definition in of what constitutes an 'interesting' and 'natural' model. Finally, we demonstrate that the conclusions of were arrived at without fully exploring the full parameter space of the model. Neutrino-assisted early dark energy remains a natural and interesting potential resolution of the Hubble tension that merits further study. △ Less

Submitted 17 February, 2023; originally announced February 2023.

Comments: 5 pages, 1 figure, comment

arXiv:2302.08681 [pdf, other]

doi 10.1145/3626788

CarbonScaler: Leveraging Cloud Workload Elasticity for Optimizing Carbon-Efficiency

Authors: Walid A. Hanafy, Qianlin Liang, Noman Bashir, David Irwin, Prashant Shenoy

Abstract: Cloud platforms are increasing their emphasis on sustainability and reducing their operational carbon footprint. A common approach for reducing carbon emissions is to exploit the temporal flexibility inherent to many cloud workloads by executing them in periods with the greenest energy and suspending them at other times. Since such suspend-resume approaches can incur long delays in job completion… ▽ More Cloud platforms are increasing their emphasis on sustainability and reducing their operational carbon footprint. A common approach for reducing carbon emissions is to exploit the temporal flexibility inherent to many cloud workloads by executing them in periods with the greenest energy and suspending them at other times. Since such suspend-resume approaches can incur long delays in job completion times, we present a new approach that exploits the elasticity of batch workloads in the cloud to optimize their carbon emissions. Our approach is based on the notion of "carbon scaling," similar to cloud autoscaling, where a job dynamically varies its server allocation based on fluctuations in the carbon cost of the grid's energy. We develop a greedy algorithm for minimizing a job's carbon emissions via carbon scaling that is based on the well-known problem of marginal resource allocation. We implement a CarbonScaler prototype in Kubernetes using its autoscaling capabilities and an analytic tool to guide the carbon-efficient deployment of batch applications in the cloud. We then evaluate CarbonScaler using real-world machine learning training and MPI jobs on a commercial cloud platform and show that it can yield i) 51% carbon savings over carbon-agnostic execution; ii) 37% over a state-of-the-art suspend-resume policy; and iii) 8% over the best static scaling policy. △ Less

Submitted 19 October, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

Journal ref: Proc. ACM Meas. Anal. Comput. Syst. 7, 3, Article 57 (December 2023), 28 pages

Showing 1–50 of 224 results for author: Liang, Q