Performance Analysis for Hybrid mmWave and THz Networks with Downlink and Uplink Decoupled Cell Association

Authors: Yunbai Wang, Chen Chen, Xiaoli Chu

Abstract: It is expected that B5G/6G networks will exploit both terahertz (THz) and millimetre wave (mmWave) frequency bands and will increase flexibility in user equipment (UE)-cell association. In this paper, we introduce a novel stochastic geometry-based framework for the analysis of the signal-to-interference-plus-noise-ratio (SINR) and rate coverage in a multi-tier hybrid mmWave and THz network, where each tier has a particular base station (BS) density, transmit power, bandwidth, number of BS antennas, and cell-association bias factor. The proposed framework incorporates the effects of mmWave and THz channel characteristics, BS beamforming gain, and blockages. We investigate the downlink (DL) and uplink (UL) decoupled cell-association strategy and characterise the per-tier cell-association probability. Based on that, we analytically derive the SINR and rate coverage probabilities of a typical user for both DL and UL transmissions. The analytical results are validated via extensive Monte Carlo simulations. Numerical results demonstrate the superiority of the DL and UL decoupled cell-association strategy in terms of SINR and rate coverage over its coupled counterpart. Moreover, we observe that the superiority of using the DL and UL decoupled cell-association strategy becomes more evident with the dense deployment of THz networks.

Submitted 10 August, 2023; originally announced August 2023.

Comments: This paper has been submitted to IEEE for possible publications

arXiv:2308.04911 [pdf, other]

SLPT: Selective Labeling Meets Prompt Tuning on Label-Limited Lesion Segmentation

Authors: Fan Bai, Ke Yan, Xiaoyu Bai, Xinyu Mao, Xiaoli Yin, Jingren Zhou, Yu Shi, Le Lu, Max Q. -H. Meng

Abstract: Medical image analysis using deep learning is often challenged by limited labeled data and high annotation costs. Fine-tuning the entire network in label-limited scenarios can lead to overfitting and suboptimal performance. Recently, prompt tuning has emerged as a more promising technique that introduces a few additional tunable parameters as prompts to a task-agnostic pre-trained model, and updates only these parameters using supervision from limited labeled data while keeping the pre-trained model unchanged. However, previous work has overlooked the importance of selective labeling in downstream tasks, which aims to select the most valuable downstream samples for annotation to achieve the best performance with minimum annotation cost. To address this, we propose a framework that combines selective labeling with prompt tuning (SLPT) to boost performance in limited labels. Specifically, we introduce a feature-aware prompt updater to guide prompt tuning and a TandEm Selective LAbeling (TESLA) strategy. TESLA includes unsupervised diversity selection and supervised selection using prompt-based uncertainty. In addition, we propose a diversified visual prompt tuning strategy to provide multi-prompt-based discrepant predictions for TESLA. We evaluate our method on liver tumor segmentation and achieve state-of-the-art performance, outperforming traditional fine-tuning with only 6% of tunable parameters, also achieving 94% of full-data performance by labeling only 5% of the data.

Submitted 9 August, 2023; originally announced August 2023.

Comments: accepted by MICCAI 2023

arXiv:2308.04119 [pdf, other]

Constructing Custom Thermodynamics Using Deep Learning

Authors: Xiaoli Chen, Beatrice W. Soh, Zi-En Ooi, Eleonore Vissol-Gaudin, Haijun Yu, Kostya S. Novoselov, Kedar Hippalgaonkar, Qianxiao Li

Abstract: One of the most exciting applications of artificial intelligence (AI) is automated scientific discovery based on previously amassed data, coupled with restrictions provided by known physical principles, including symmetries and conservation laws. Such automated hypothesis creation and verification can assist scientists in studying complex phenomena, where traditional physical intuition may fail. Here we develop a platform based on a generalized Onsager principle to learn macroscopic dynamical descriptions of arbitrary stochastic dissipative systems directly from observations of their microscopic trajectories. Our method simultaneously constructs reduced thermodynamic coordinates and interprets the dynamics on these coordinates. We demonstrate its effectiveness by studying theoretically and validating experimentally the stretching of long polymer chains in an externally applied field. Specifically, we learn three interpretable thermodynamic coordinates and build a dynamical landscape of polymer stretching, including the identification of stable and transition states and the control of the stretching rate. Our general methodology can be used to address a wide range of scientific and technological applications.

Submitted 22 December, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

Comments: Fix figure visibility issue

arXiv:2308.02678 [pdf, ps, other]

Ethical Considerations and Policy Implications for Large Language Models: Guiding Responsible Development and Deployment

Authors: Jianyi Zhang, Xu Ji, Zhangchi Zhao, Xiali Hei, Kim-Kwang Raymond Choo

Abstract: This paper examines the ethical considerations and implications of large language models (LLMs) in generating content. It highlights the potential for both positive and negative uses of generative AI programs and explores the challenges in assigning responsibility for their outputs. The discussion emphasizes the need for proactive ethical frameworks and policy measures to guide the responsible development and deployment of LLMs.

Submitted 1 August, 2023; originally announced August 2023.

Comments: 5 pages

arXiv:2308.01634 [pdf, other]

doi 10.1145/3581783.3611794

Disentangling Multi-view Representations Beyond Inductive Bias

Authors: Guanzhou Ke, Yang Yu, Guoqing Chao, Xiaoli Wang, Chenyang Xu, Shengfeng He

Abstract: Multi-view (or -modality) representation learning aims to understand the relationships between different view representations. Existing methods disentangle multi-view representations into consistent and view-specific representations by introducing strong inductive biases, which can limit their generalization ability. In this paper, we propose a novel multi-view representation disentangling method that aims to go beyond inductive biases, ensuring both interpretability and generalizability of the resulting representations. Our method is based on the observation that discovering multi-view consistency in advance can determine the disentangling information boundary, leading to a decoupled learning objective. We also found that the consistency can be easily extracted by maximizing the transformation invariance and clustering consistency between views. These observations drive us to propose a two-stage framework. In the first stage, we obtain multi-view consistency by training a consistent encoder to produce semantically-consistent representations across views as well as their corresponding pseudo-labels. In the second stage, we disentangle specificity from comprehensive representations by minimizing the upper bound of mutual information between consistent and comprehensive representations. Finally, we reconstruct the original data by concatenating pseudo-labels and view-specific representations.
Our experiments on four multi-view datasets demonstrate that our proposed method outperforms 12 comparison methods in terms of clustering and classification performance. The visualization results also show that the extracted consistency and specificity are compact and interpretable. Our code can be found at https://github.com/Guanzhou-Ke/DMRIB.

Submitted 4 August, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

Comments: 9 pages, 5 figures, 4 tables

Journal ref: In Proceedings of the 31st ACM International Conference on Multimedia (MM '23), 2023

arXiv:2308.01524 [pdf]

doi 10.55092/am20240006

Unsupervised Learning of Part Similarity for Goal-Guided Accelerated Experiment Design in Metal Additive Manufacturing

Authors: Rui Liu, Sen Liu, Xiaoli Zhang

Abstract: Metal additive manufacturing is gaining broad interest and increased use in the industrial and academic fields. However, the quantification and commercialization of standard parts usually require extensive experiments and expensive post-characterization, which impedes the rapid development and adaptation of metal AM technologies. In this work, a similarity-based acceleration (S-acceleration) method for design of experiments is developed to reduce the time and costs associated with unveiling process-property (porosity defects) relationships during manufacturing. With S-acceleration, part semantic features from machine-setting parameters and physics-effects informed characteristics are explored for measuring mutual part similarities. A user-defined simplification rate of experiments is proposed to purposely remove redundant parts before conducting experiments printing without sacrificing information gain as original full factorial experiment design. This S-acceleration design of experiments is demonstrated on a Concept Laser M2 machine for the experimental plan of modeling relationships between process parameters and part porosity defects. The printed part has 2 mm diameter by 4 mm tall pin geometry considering variations in build location and orientation, laser settings and powder feedstock are held constant. In total, 242 parts are measured to create a ground truth data set of porosity levels by using X-ray tomography microscopy.
The S-acceleration method is assessed for performance considering 40%, 50%, and 60% of user-defined experiment simplification rates. The repeated experiments are removed without ignoring the minority experiments outlier, assuring a similar process-property relation in the original experiment plan. The experiment number is significantly reduced based on part similarity with minimal compromise of model accuracy and obtained knowledge.

Submitted 29 May, 2024; v1 submitted 3 August, 2023; originally announced August 2023.

Comments: 22 pages, 10 figures, journal paper published

MSC Class: J.2

Journal ref: Advanced Manufacturing 2024

arXiv:2307.16507 [pdf, other]

doi 10.1088/1612-202X/accce3

Uncertainty relations for metric adjusted skew information and Cauchy-Schwarz inequality

Authors: Xiaoli Hu, Naihuan Jing

Abstract: Skew information is a pivotal concept in quantum information, quantum measurement, and quantum metrology. Further studies have led to the uncertainty relations grounded in metric-adjusted skew information. In this work, we present an in-depth investigation using the methodologies of sampling coordinates of observables and convex functions to refine the uncertainty relations in both the product form of two observables and summation form of multiple observables.

Submitted 31 July, 2023; originally announced July 2023.

Journal ref: Laser Phys. Lett. 20 (2023) 085202 (10pp)

arXiv:2307.15374 [pdf]

Leveraging Optical Communication Fiber and AI for Distributed Water Pipe Leak Detection

Authors: Huan Wu, Huan-Feng Duan, Wallace W. L. Lai, Kun Zhu, Xin Cheng, Hao Yin, Bin Zhou, Chun-Cheung Lai, Chao Lu, Xiaoli Ding

Abstract: Detecting leaks in water networks is a costly challenge. This article introduces a practical solution: the integration of optical networks with water networks for efficient leak detection. Our approach uses a fiber-optic cable to measure vibrations, enabling accurate leak identification and localization by an intelligent algorithm. We also propose a method to assess leak severity for prioritized repairs. Our solution detects even small leaks with flow rates as low as 0.027 L/s. It offers a cost-effective way to improve leak detection, enhance water management, and increase operational efficiency.

Submitted 28 July, 2023; originally announced July 2023.

Comments: Accepted

Journal ref: IEEE Communications Magazine, 2023

arXiv:2307.13067 [pdf]

doi 10.1093/nsr/nwae149

Large negative magnetoresistance and pseudogap phase in superconducting A15-type La$_4$H$_{23}$

Authors: Jianning Guo, Dmitrii Semenok, Grigoriy Shutov, Di Zhou, Su Chen, Yulong Wang, Toni Helm, Sven Luther, Xiaoli Huang, Tian Cui

Abstract: High pressure plays a crucial role in the field of superconductivity. Compressed hydride superconductors are leaders in the race for a material that can conduct electricity without resistance at high or even room temperature. In the present work, we have discovered new lanthanum superhydride, cubic A15-type La$_4$H$_{23}$, with lower stabilization pressure compared to the reported $\textit{fcc}$ LaH$_{10}$. Superconducting La$_4$H$_{23}$ was obtained by laser heating of LaH$_3$ with ammonia borane at about 120 GPa. Transport measurements reveal the maximum critical temperature $\textit{T}$$_{C}$(onset) = 105 K and the critical field $\textit{H}$$_{C2}$(0) = 32 T at 118 GPa, as evidenced by the sharp drop of electrical resistance and the displacement of superconducting transitions in applied magnetic fields. Moreover, we provide evidence for unconventional transport associated with a pseudogap phase in La$_4$H$_{23}$ using pulsed magnetic fields up to 68 T. A large negative magnetoresistance in the non-superconducting state below 40 K, quasi $\textit{T}$-linear electrical resistance, and a sign-change of its temperature dependence mark the emergence of pseudogap in this hydride. Discovered lanthanum hydride is a new member of the A15 family of superconductors with $\textit{T}$$_C$ exceeding the boiling point of liquid nitrogen.

Submitted 6 February, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

Comments: A mistake in Figure 3(a) was fixed

Journal ref: National Science Review, nwae149, 2024

arXiv:2307.11742 [pdf]

Evidence for Pseudogap Phase in Cerium Superhydrides: CeH$_{10}$ and CeH$_9$

Authors: Dmitrii Semenok, Jianning Guo, Di Zhou, Wuhao Chen, Toni Helm, Alexander Kvashnin, Andrei Sadakov, Oleg Sobolevsky, Vladimir Pudalov, Chuanying Xi, Xiaoli Huang, Ivan Troyan

Abstract: Polyhydride superconductors have been shown to possess metallic properties with a Bardeen-Cooper-Schrieffer-type superconducting ground state. Here, we provide evidence for unconventional transport associated with a pseudogap phase in cubic cerium superhydride CeH$_{10}$ ($\textit{T}$$_C$ = 116 K) at pressure of 115-125 GPa. A large negative magnetoresistance in the non-superconducting state below 90 K, quasi $\textit{T}$-linear electrical resistance, and a sign-change of its temperature dependence mark the emergence of this phase. We studied the magnetic phase diagrams and the upper critical fields $\textit{B}$$_{C2}$(T) of CeH$_{10}$, CeH$_9$, and CeD$_9$ in pulsed fields up to 70 T. $\textit{B}$$_{C2}$(T) of CeH$_9$ and CeD$_9$ exhibits pronounced saturation at low temperatures in accordance with the Werthamer-Helfand-Hohenberg model, whereas CeH$_{10}$ stands out in particular, as it does not obey this model. Our observations, therefore, reveal the unconventional nature of non-superconducting state of cerium superhydride CeH$_{10}$.

Submitted 15 September, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

Comments: The discussion part has been improved

arXiv:2307.08268 [pdf, other]

Liver Tumor Screening and Diagnosis in CT with Pixel-Lesion-Patient Network

Authors: Ke Yan, Xiaoli Yin, Yingda Xia, Fakai Wang, Shu Wang, Yuan Gao, Jiawen Yao, Chunli Li, Xiaoyu Bai, Jingren Zhou, Ling Zhang, Le Lu, Yu Shi

Abstract: Liver tumor segmentation and classification are important tasks in computer aided diagnosis. We aim to address three problems: liver tumor screening and preliminary diagnosis in non-contrast computed tomography (CT), and differential diagnosis in dynamic contrast-enhanced CT. A novel framework named Pixel-Lesion-pAtient Network (PLAN) is proposed. It uses a mask transformer to jointly segment and classify each lesion with improved anchor queries and a foreground-enhanced sampling loss. It also has an image-wise classifier to effectively aggregate global information and predict patient-level diagnosis. A large-scale multi-phase dataset is collected containing 939 tumor patients and 810 normal subjects. 4010 tumor instances of eight types are extensively annotated. On the non-contrast tumor screening task, PLAN achieves 95% and 96% in patient-level sensitivity and specificity. On contrast-enhanced CT, our lesion-level detection precision, recall, and classification accuracy are 92%, 89%, and 86%, outperforming widely used CNN and transformers for lesion segmentation. We also conduct a reader study on a holdout set of 250 cases. PLAN is on par with a senior human radiologist, showing the clinical significance of our results.

Submitted 21 October, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

Comments: MICCAI 2023, code: https://github.com/alibaba-damo-academy/pixel-lesion-patient-network

arXiv:2307.07542 [pdf, other]

Source-Free Domain Adaptation with Temporal Imputation for Time Series Data

Authors: Mohamed Ragab, Emadeldeen Eldele, Min Wu, Chuan-Sheng Foo, Xiaoli Li, Zhenghua Chen

Abstract: Source-free domain adaptation (SFDA) aims to adapt a pretrained model from a labeled source domain to an unlabeled target domain without access to the source domain data, preserving source domain privacy. Despite its prevalence in visual applications, SFDA is largely unexplored in time series applications. The existing SFDA methods that are mainly designed for visual applications may fail to handle the temporal dynamics in time series, leading to impaired adaptation performance. To address this challenge, this paper presents a simple yet effective approach for source-free domain adaptation on time series data, namely MAsk and imPUte (MAPU). First, to capture temporal information of the source domain, our method performs random masking on the time series signals while leveraging a novel temporal imputer to recover the original signal from a masked version in the embedding space. Second, in the adaptation step, the imputer network is leveraged to guide the target model to produce target features that are temporally consistent with the source features. To this end, our MAPU can explicitly account for temporal dependency during the adaptation while avoiding the imputation in the noisy input space. Our method is the first to handle temporal consistency in SFDA for time series data and can be seamlessly equipped with other existing SFDA methods. Extensive experiments conducted on three real-world time series datasets demonstrate that our MAPU achieves significant performance gain over existing methods.
Our code is available at https://github.com/mohamedr002/MAPU_SFDA_TS.

Submitted 14 July, 2023; originally announced July 2023.

Comments: Accepted in KDD'23

arXiv:2307.04226 [pdf, other]

Seismic Data Interpolation based on Denoising Diffusion Implicit Models with Resampling

Authors: Xiaoli Wei, Chunxia Zhang, Hongtao Wang, Chengli Tan, Deng Xiong, Baisong Jiang, Jiangshe Zhang, Sang-Woon Kim

Abstract: The incompleteness of the seismic data caused by missing traces along the spatial extension is a common issue in seismic acquisition due to the existence of obstacles and economic constraints, which severely impairs the imaging quality of subsurface geological structures. Recently, deep learning-based seismic interpolation methods have attained promising progress, while achieving stable training of generative adversarial networks is not easy, and performance degradation is usually notable if the missing patterns in the testing and training do not match. In this paper, we propose a novel seismic denoising diffusion implicit model with resampling. The model training is established on the denoising diffusion probabilistic model, where U-Net is equipped with the multi-head self-attention to match the noise in each step. The cosine noise schedule, serving as the global noise configuration, promotes the high utilization of known trace information by accelerating the passage of the excessive noise stages. The model inference utilizes the denoising diffusion implicit model, conditioning on the known traces, to enable high-quality interpolation with fewer diffusion steps. To enhance the coherency between the known traces and the missing traces within each reverse step, the inference process integrates a resampling strategy to achieve an information recap on the former interpolated traces. Extensive experiments conducted on synthetic and field seismic data validate the superiority of our model and its robustness to various missing patterns.
In addition, uncertainty quantification and ablation studies are also investigated.

Submitted 13 July, 2023; v1 submitted 9 July, 2023; originally announced July 2023.

Comments: 14 pages, 13 figures

arXiv:2307.03347 [pdf, other]

Distilling Universal and Joint Knowledge for Cross-Domain Model Compression on Time Series Data

Authors: Qing Xu, Min Wu, Xiaoli Li, Kezhi Mao, Zhenghua Chen

Abstract: For many real-world time series tasks, the computational complexity of prevalent deep learning models often hinders the deployment on resource-limited environments (e.g., smartphones). Moreover, due to the inevitable domain shift between model training (source) and deploying (target) stages, compressing those deep models under cross-domain scenarios becomes more challenging. Although some existing works have already explored cross-domain knowledge distillation for model compression, they are either biased to source data or heavily tangled between source and target data. To this end, we design a novel end-to-end framework called Universal and joint knowledge distillation (UNI-KD) for cross-domain model compression. In particular, we propose to transfer both the universal feature-level knowledge across source and target domains and the joint logit-level knowledge shared by both domains from the teacher to the student model via an adversarial learning scheme. More specifically, a feature-domain discriminator is employed to align teacher's and student's representations for universal knowledge transfer. A data-domain discriminator is utilized to prioritize the domain-shared samples for joint knowledge transfer. Extensive experimental results on four time series datasets demonstrate the superiority of our proposed method over state-of-the-art (SOTA) benchmarks.

Submitted 6 July, 2023; originally announced July 2023.

Comments: Accepted by IJCAI 2023

arXiv:2307.02727 [pdf, ps, other]

On efficient linear and fully decoupled finite difference method for wormhole propagation with heat transmission process on staggered grids

Authors: Xiaoli Li, Ziyan Li, Hongxing Rui

Abstract: In this paper, we construct an efficient linear and fully decoupled finite difference scheme for wormhole propagation with heat transmission process on staggered grids, which only requires solving a sequence of linear elliptic equations at each time step. We first derive the positivity preserving properties for the discrete porosity and its difference quotient in time, and then obtain optimal error estimates for the velocity, pressure, concentration, porosity and temperature in different norms rigorously and carefully by establishing several auxiliary lemmas for the highly coupled nonlinear system. Numerical experiments in two- and three-dimensional cases are provided to verify our theoretical results and illustrate the capabilities of the constructed method.

Submitted 5 July, 2023; originally announced July 2023.

arXiv:2307.01396 [pdf, other]

doi 10.1109/CCET59170.2023.10335130

Precheck Sequence Based False Base Station Detection During Handover: A Physical Layer Security Scheme

Authors: Xiangyu Li, Kaiwen Zheng, Sidong Guo, Xiaoli Ma

Abstract: False Base Station (FBS) attack has been a severe security problem for the cellular network since 2G era. During handover, the user equipment (UE) periodically receives state information from surrounding base stations (BSs) and uploads it to the source BS. The source BS compares the uploaded signal power and shifts UE to another BS that can provide the strongest signal. An FBS can transmit signal with the proper power and attract UE to connect to it. In this paper, based on the 3GPP standard, a Precheck Sequence-based Detection (PSD) Scheme is proposed to secure the transition of legal base station (LBS) for UE. This scheme first analyzes the structure of received signals in blocks and symbols. Several additional symbols are added to the current signal sequence for verification. By designing a long table of symbol sequence, every UE which needs handover will be allocated a specific sequence from this table. The simulation results show that the performance of this PSD Scheme is better than that of any existing ones, even when a specific transmit power is designed for FBS.

Submitted 3 November, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

arXiv:2307.00233 [pdf, other]

Hierarchical Federated Learning Incentivization for Gas Usage Estimation

Authors: Has Sun, Xiaoli Tang, Chengyi Yang, Zhenpeng Yu, Xiuli Wang, Qijie Ding, Zengxiang Li, Han Yu

Abstract: Accurately estimating gas usage is essential for the efficient functioning of gas distribution networks and saving operational costs. Traditional methods rely on centralized data processing, which poses privacy risks. Federated learning (FL) offers a solution to this problem by enabling local data processing on each participant, such as gas companies and heating stations. However, local training and communication overhead may discourage gas companies and heating stations from actively participating in the FL training process. To address this challenge, we propose a Hierarchical FL Incentive Mechanism for Gas Usage Estimation (HI-GAS), which has been testbedded in the ENN Group, one of the leading players in the natural gas and green energy industry. It is designed to support horizontal FL among gas companies, and vertical FL among each gas company and heating station within a hierarchical FL ecosystem, rewarding participants based on their contributions to FL. In addition, a hierarchical FL model aggregation approach is also proposed to improve the gas usage estimation performance by aggregating models at different levels of the hierarchy. The incentive scheme employs a multi-dimensional contribution-aware reward distribution function that combines the evaluation of data quality and model contribution to incentivize both gas companies and heating stations within their jurisdiction while maintaining fairness. Results of extensive experiments validate the effectiveness of the proposed mechanism.

Submitted 1 July, 2023; originally announced July 2023.

arXiv:2306.16208 [pdf, other]

Continuous-time q-learning for mean-field control problems

Authors: Xiaoli Wei, Xiang Yu

Abstract: This paper studies the q-learning, recently coined as the continuous time counterpart of Q-learning by Jia and Zhou (2023), for continuous time McKean-Vlasov control problems in the setting of entropy-regularized reinforcement learning. In contrast to the single agent's control problem in Jia and Zhou (2023), the mean-field interaction of agents renders the definition of the q-function more subtle, for which we reveal that two distinct q-functions naturally arise: (i) the integrated q-function (denoted by $q$) as the first-order approximation of the integrated Q-function introduced in Gu, Guo, Wei and Xu (2023), which can be learnt by a weak martingale condition involving test policies; and (ii) the essential q-function (denoted by $q_e$) that is employed in the policy improvement iterations. We show that the two q-functions are related via an integral representation under all test policies. Based on the weak martingale condition and our proposed searching method of test policies, some model-free learning algorithms are devised. In two examples, one in the LQ control framework and one beyond the LQ control framework, we can obtain the exact parameterization of the optimal value function and q-functions and illustrate our algorithms with simulation experiments.

Submitted 7 March, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

Comments: Keywords: Continuous-time reinforcement learning, continuous-time q-function, McKean-Vlasov control, weak martingale characterization, test policies

arXiv:2306.15796 [pdf, other]

ConKI: Contrastive Knowledge Injection for Multimodal Sentiment Analysis

Authors: Yakun Yu, Mingjun Zhao, Shi-ang Qi, Feiran Sun, Baoxun Wang, Weidong Guo, Xiaoli Wang, Lei Yang, Di Niu

Abstract: Multimodal Sentiment Analysis leverages multimodal signals to detect the sentiment of a speaker. Previous approaches concentrate on performing multimodal fusion and representation learning based on general knowledge obtained from pretrained models, which neglects the effect of domain-specific knowledge. In this paper, we propose Contrastive Knowledge Injection (ConKI) for multimodal sentiment analysis, where specific-knowledge representations for each modality can be learned together with general knowledge representations via knowledge injection based on an adapter architecture. In addition, ConKI uses a hierarchical contrastive learning procedure performed between knowledge types within every single modality, across modalities within each sample, and across samples to facilitate the effective learning of the proposed representations, hence improving multimodal sentiment predictions. The experiments on three popular multimodal sentiment analysis benchmarks show that ConKI outperforms all prior methods on a variety of performance metrics.

Submitted 27 June, 2023; originally announced June 2023.

Comments: Accepted by ACL Findings 2023

arXiv:2306.12290 [pdf, other]

doi 10.1088/1674-4527/acf18d

Redshift Dependence of the Low-energy Spectral Index of Gamma-Ray Bursts Revisited

Authors: Xiao-Li Zhang, Yong-Feng Huang, Ze-Cheng Zou

Abstract: A negative correlation was found to exist between the low-energy spectral index and the redshift of gamma-ray bursts (GRBs) by Amati (2002). It was later confirmed by Geng (2013) and Gruber (2014), but the correlation was also found to be quite dispersive when the sample size was significantly expanded. In this study, we have established two even larger samples of gamma-ray bursts to further examine the correlation. One of our samples consists of 316 GRBs detected by the Swift satellite, and the other consists of 80 GRBs detected by the Fermi satellite. It is found that there is no correlation between the two parameters for the Swift sample, but there does exist a weak negative correlation for the Fermi sample. The correlation becomes even more significant when the spectral index at the peak flux is considered. It is argued that the absence of the correlation in the Swift sample may be due to the fact that Swift has a very narrow energy response so that it could not measure the low-energy spectral index accurately enough. Further studies based on even larger GRB samples are solicited.

Submitted 2 August, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

Journal ref: Research in Astronomy and Astrophysics (RAA), 23:125003, 2023

arXiv:2306.11990 [pdf, ps, other]

doi 10.1016/j.neucom.2024.127272

Physics-constrained Attack against Convolution-based Human Motion Prediction

Authors: Chengxu Duan, Zhicheng Zhang, Xiaoli Liu, Yonghao Dang, Jianqin Yin

Abstract: Human motion prediction has achieved a brilliant performance with the help of convolution-based neural networks. However, currently, there is no work evaluating the potential risk in human motion prediction when facing adversarial attacks. The adversarial attack will encounter problems against human motion prediction in naturalness and data scale. To solve the problems above, we propose a new adversarial attack method that generates the worst-case perturbation by maximizing the human motion predictor's prediction error with physical constraints. Specifically, we introduce a novel adaptable scheme that facilitates the attack to suit the scale of the target pose and two physical constraints to enhance the naturalness of the adversarial example. The evaluating experiments on three datasets show that the prediction errors of all target models are enlarged significantly, which means current convolution-based human motion prediction models are vulnerable to the proposed attack. Based on the experimental results, we provide insights on how to enhance the adversarial robustness of the human motion predictor and how to improve the adversarial attack against human motion prediction.

Submitted 14 January, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

arXiv:2306.08695 [pdf, other]

doi 10.1038/s42004-023-01090-2

A generative artificial intelligence framework based on a molecular diffusion model for the design of metal-organic frameworks for carbon capture

Authors: Hyun Park, Xiaoli Yan, Ruijie Zhu, E. A. Huerta, Santanu Chaudhuri, Donny Cooper, Ian Foster, Emad Tajkhorshid

Abstract: Metal-organic frameworks (MOFs) exhibit great promise for CO2 capture. However, finding the best performing materials poses computational and experimental grand challenges in view of the vast chemical space of potential building blocks. Here, we introduce GHP-MOFassemble, a generative artificial intelligence (AI), high performance framework for the rational and accelerated design of MOFs with high CO2 adsorption capacity and synthesizable linkers. GHP-MOFassemble generates novel linkers, assembled with one of three pre-selected metal nodes (Cu paddlewheel, Zn paddlewheel, Zn tetramer) into MOFs in a primitive cubic topology. GHP-MOFassemble screens and validates AI-generated MOFs for uniqueness, synthesizability, and structural validity, uses molecular dynamics simulations to study their stability and chemical consistency, and uses crystal graph neural networks and Grand Canonical Monte Carlo simulations to quantify their CO2 adsorption capacities. We present the top six AI-generated MOFs with CO2 capacities greater than 2 mmol/g, i.e., higher than 96.9% of structures in the hypothetical MOF dataset.

Submitted 12 March, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

Comments: 25 pages, 17 figures, 6 tables, accepted to Nature Communications Chemistry. This work was awarded the HPCwire 2023 Editors' Choice Award for Best Use of High Performance Data Analytics & Artificial Intelligence; see https://www.hpcwire.com/2023-readers-editors-choice-data-analytics-ai/

ACM Class: I.2

Journal ref: Commun Chem 7, 21 (2024)

arXiv:2306.05695 [pdf, other]

Power Beacon Energy Consumption Minimization in Wireless Powered Backscatter Communication Networks

Authors: Haohang Yang, Yinghui Ye, Kai Liang, Xiaoli Chu

Abstract: Internet-of-Things (IoT) networks are expected to support the wireless connection of massive energy limited IoT nodes. The emerging wireless powered backscatter communications (WPBC) enable IoT nodes to harvest energy from the incident radio frequency signals transmitted by a power beacon (PB) to support their circuit operation, but the energy consumption of the PB (a potentially high cost borne by the network operator) has not been sufficiently studied for WPBC. In this paper, we aim to minimize the energy consumption of the PB while satisfying the throughput requirement per IoT node by jointly optimizing the time division multiple access (TDMA) time slot duration and backscatter reflection coefficient of each IoT node and the PB transmit power per time slot. As the formulated joint optimization problem is non-convex, we transform it into a convex problem by using auxiliary variables, then employ the Lagrange dual method to obtain the optimal solutions. To reduce the implementation complexity required for adjusting the PB's transmit power every time slot, we keep the PB transmit power constant in each time block and solve the corresponding PB energy consumption minimization problem by using auxiliary variables, the block coordinate descent method and the successive convex approximation technique. Based on the above solutions, two iterative algorithms are proposed for the dynamic PB transmit power scheme and the static PB transmit power scheme. The simulation results show that the dynamic PB transmit power scheme and the static PB transmit power scheme both achieve a lower PB energy consumption than the benchmark schemes, and the former achieves the lowest PB energy consumption.

Submitted 9 June, 2023; originally announced June 2023.

arXiv:2306.03122 [pdf, other]

doi 10.1038/s41586-024-07026-7

Imaging the Meissner effect and flux trapping in a hydride superconductor at megabar pressures using a nanoscale quantum sensor

Authors: Prabudhya Bhattacharyya, Wuhao Chen, Xiaoli Huang, Shubhayu Chatterjee, Benchen Huang, Bryce Kobrin, Yuanqi Lyu, Thomas J. Smart, Maxwell Block, Esther Wang, Zhipan Wang, Weijie Wu, Satcher Hsieh, He Ma, Srinivas Mandyam, Bijuan Chen, Emily Davis, Zachary M. Geballe, Chong Zu, Viktor Struzhkin, Raymond Jeanloz, Joel E. Moore, Tian Cui, Giulia Galli, Bertrand I. Halperin, et al. (2 additional authors not shown)

Abstract: By directly altering microscopic interactions, pressure provides a powerful tuning knob for the exploration of condensed phases and geophysical phenomena. The megabar regime represents an exciting frontier, where recent discoveries include novel high-temperature superconductors, as well as structural and valence phase transitions. However, at such high pressures, many conventional measurement techniques fail. Here, we demonstrate the ability to perform local magnetometry inside of a diamond anvil cell with sub-micron spatial resolution at megabar pressures. Our approach utilizes a shallow layer of Nitrogen-Vacancy (NV) color centers implanted directly within the anvil; crucially, we choose a crystal cut compatible with the intrinsic symmetries of the NV center to enable functionality at megabar pressures. We apply our technique to characterize a recently discovered hydride superconductor, CeH$_9$. By performing simultaneous magnetometry and electrical transport measurements, we observe the dual signatures of superconductivity: local diamagnetism characteristic of the Meissner effect and a sharp drop of the resistance to near zero. By locally mapping the Meissner effect and flux trapping, we directly image the geometry of superconducting regions, revealing significant inhomogeneities at the micron scale. Our work brings quantum sensing to the megabar frontier and enables the closed loop optimization of superhydride materials synthesis.

Submitted 5 June, 2023; originally announced June 2023.

Journal ref: Nature 627, 73-79 (2024)

arXiv:2306.01757 [pdf, ps, other]

State estimation for one-dimensional agro-hydrological processes with model mismatch

Authors: Zhuangyu Liu, Jinfeng Liu, Shunyi Zhao, Xiaoli Luan, Fei Liu

Abstract: The importance of accurate soil moisture data for the development of modern closed-loop irrigation systems cannot be overstated. Due to the diversity of soil, it is difficult to obtain an accurate model for agro-hydrological system. In this study, soil moisture estimation in 1D agro-hydrological systems with model mismatch is the focus. To address the problem of model mismatch, a nonlinear state-space model derived from the Richards equation is utilized, along with additive unknown inputs. The determination of the number of sensors required is achieved through sensitivity analysis and the orthogonalization projection method. To estimate states and unknown inputs in real-time, a recursive expectation maximization (EM) algorithm derived from the conventional EM algorithm is employed. During the E-step, the extended Kalman filter (EKF) is used to compute states and covariance in the recursive Q-function, while in the M-step, unknown inputs are updated by locally maximizing the recursive Q-function. The estimation performance is evaluated using comprehensive simulations. Through this method, accurate soil moisture estimation can be obtained, even in the presence of model mismatch.

Submitted 24 May, 2023; originally announced June 2023.

arXiv:2306.00137 [pdf, other]

A Sequence-to-Sequence&Set Model for Text-to-Table Generation

Authors: Tong Li, Zhihao Wang, Liangying Shao, Xuling Zheng, Xiaoli Wang, Jinsong Su

Abstract: Recently, the text-to-table generation task has attracted increasing attention due to its wide applications. In this aspect, the dominant model formalizes this task as a sequence-to-sequence generation task and serializes each table into a token sequence during training by concatenating all rows in a top-down order. However, it suffers from two serious defects: 1) the predefined order introduces a wrong bias during training, which highly penalizes shifts in the order between rows; 2) the error propagation problem becomes serious when the model outputs a long token sequence. In this paper, we first conduct a preliminary study to demonstrate the generation of most rows is order-insensitive. Furthermore, we propose a novel sequence-to-sequence&set text-to-table generation model. Specifically, in addition to a text encoder encoding the input text, our model is equipped with a table header generator to first output a table header, i.e., the first row of the table, in the manner of sequence generation. Then we use a table body generator with learnable row embeddings and column embeddings to generate a set of table body rows in parallel. Particularly, to deal with the issue that there is no correspondence between each generated table body row and target during training, we propose a target assignment strategy based on the bipartite matching between the first cells of generated table body rows and targets. Experiment results show that our model significantly surpasses the baselines, achieving state-of-the-art performance on commonly-used datasets.

Submitted 31 May, 2023; originally announced June 2023.

arXiv:2305.18969 [pdf, other]

doi 10.18653/v1/2023.acl-long.77

MS-DETR: Natural Language Video Localization with Sampling Moment-Moment Interaction

Authors: Jing Wang, Aixin Sun, Hao Zhang, Xiaoli Li

Abstract: Given a query, the task of Natural Language Video Localization (NLVL) is to localize a temporal moment in an untrimmed video that semantically matches the query. In this paper, we adopt a proposal-based solution that generates proposals (i.e., candidate moments) and then selects the best matching proposal. On top of modeling the cross-modal interaction between candidate moments and the query, our proposed Moment Sampling DETR (MS-DETR) enables efficient moment-moment relation modeling. The core idea is to sample a subset of moments guided by the learnable templates with an adopted DETR (DEtection TRansformer) framework. To achieve this, we design a multi-scale visual-linguistic encoder, and an anchor-guided moment decoder paired with a set of learnable templates. Experimental results on three public datasets demonstrate the superior performance of MS-DETR.

Submitted 30 May, 2023; originally announced May 2023.

Comments: Accepted by ACL 2023

Journal ref: ACL 2023 long paper

arXiv:2305.15761 [pdf, other]

PRIMP: PRobabilistically-Informed Motion Primitives for Efficient Affordance Learning from Demonstration

Authors: Sipu Ruan, Weixiao Liu, Xiaoli Wang, Xin Meng, Gregory S. Chirikjian

Abstract: This paper proposes a learning-from-demonstration method using probability densities on the workspaces of robot manipulators. The method, named "PRobabilistically-Informed Motion Primitives (PRIMP)", learns the probability distribution of the end effector trajectories in the 6D workspace that includes both positions and orientations. It is able to adapt to new situations such as novel via poses with uncertainty and a change of viewing frame. The method itself is robot-agnostic, in which the learned distribution can be transferred to another robot with the adaptation to its workspace density. The learned trajectory distribution is then used to guide an optimization-based motion planning algorithm to further help the robot avoid novel obstacles that are unseen during the demonstration process. The proposed methods are evaluated by several sets of benchmark experiments. PRIMP runs more than 5 times faster while generalizing trajectories more than twice as close to both the demonstrations and novel desired poses. It is then combined with our robot imagination method that learns object affordances, illustrating the applicability of PRIMP to learn tool use through physical experiments.

Submitted 25 May, 2023; originally announced May 2023.

Comments: 17 pages, 19 figures

arXiv:2305.15424 [pdf, other]

PulseNet: Deep Learning ECG-signal classification using random augmentation policy and continous wavelet transform for canines

Authors: Andre Dourson, Roberto Santilli, Federica Marchesotti, Jennifer Schneiderman, Oliver Roman Stiel, Fernando Junior, Michael Fitzke, Norbert Sithirangathan, Emil Walleser, Xiaoli Qiao, Mark Parkinson

Abstract: Evaluating canine electrocardiograms (ECG) requires skilled veterinarians, but current availability of veterinary cardiologists for ECG interpretation and diagnostic support is limited. Developing tools for automated assessment of ECG sequences can improve veterinary care by providing clinicians real-time results and decision support tools. We implement a deep convolutional neural network (CNN) approach for classifying canine electrocardiogram sequences as either normal or abnormal. ECG records are converted into 8 second Lead II sequences and classified as either normal (no evidence of cardiac abnormalities) or abnormal (presence of one or more cardiac abnormalities). For training, ECG sequences are randomly augmented using RandomAugmentECG, a new augmentation library implemented specifically for this project. Each chunk is then converted using a continuous wavelet transform into a 2D scalogram. The 2D scalograms are then classified as either normal or abnormal by a binary CNN classifier. Experimental results are validated against three boarded veterinary cardiologists, achieving an AUC-ROC score of 0.9506 on the test dataset, matching human-level performance. Additionally, we describe model deployment to Microsoft Azure using an MLOps approach. To our knowledge, this work is one of the first attempts to implement a deep learning model to automatically classify ECG sequences for canines. Implementing automated ECG classification will enhance veterinary care through improved diagnostic performance and increased clinic efficiency.

Submitted 19 June, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

arXiv:2305.13799 [pdf, other]

UPNet: Uncertainty-based Picking Deep Learning Network for Robust First Break Picking

Authors: Hongtao Wang, Jiangshe Zhang, Xiaoli Wei, Li Long, Chunxia Zhang

Abstract: In seismic exploration, first break (FB) picking is a crucial aspect in the determination of subsurface velocity models, significantly influencing the placement of wells. Many deep neural networks (DNNs)-based automatic picking methods have been proposed to accelerate this processing. Significantly, the segmentation-based DNN methods provide a segmentation map and then estimate FB from the map using a picking threshold. However, the uncertainty of the results picked by DNNs still needs to be analyzed. Thus, the automatic picking methods applied in field datasets can not ensure robustness, especially in the case of a low signal-to-noise ratio (SNR). In this paper, we introduce uncertainty quantification into the FB picking task and propose a novel uncertainty-based picking deep learning network called UPNet. UPNet not only estimates the uncertainty of network output but also can filter the pickings with low confidence. Many experiments evaluate that UPNet exhibits higher accuracy and robustness than the deterministic DNN-based model, achieving State-of-the-Art (SOTA) performance in field surveys. In addition, we verify that the measurement uncertainty is meaningful, which can provide a reference for human decision-making.

Submitted 7 April, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

arXiv:2305.08316 [pdf, other]

SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Efficient and Generalizable Protein-Protein Interaction Prediction

Authors: Ziyuan Zhao, Peisheng Qian, Xulei Yang, Zeng Zeng, Cuntai Guan, Wai Leong Tam, Xiaoli Li

Abstract: Protein-protein interactions (PPIs) are crucial in various biological processes and their study has significant implications for drug development and disease diagnosis. Existing deep learning methods suffer from significant performance degradation under complex real-world scenarios due to various factors, e.g., label scarcity and domain shift. In this paper, we propose a self-ensembling multigraph neural network (SemiGNN-PPI) that can effectively predict PPIs while being both efficient and generalizable. In SemiGNN-PPI, we not only model the protein correlations but explore the label dependencies by constructing and processing multiple graphs from the perspectives of both features and labels in the graph learning process. We further marry GNN with Mean Teacher to effectively leverage unlabeled graph-structured PPI data for self-ensemble graph learning. We also design multiple graph consistency constraints to align the student and teacher graphs in the feature embedding space, enabling the student model to better learn from the teacher model by incorporating more relationships. Extensive experiments on PPI datasets of different scales with different evaluation settings demonstrate that SemiGNN-PPI outperforms state-of-the-art PPI prediction methods, particularly in challenging scenarios such as training with limited annotations and testing on unseen data.

Submitted 14 May, 2023; originally announced May 2023.

Comments: Accepted by IJCAI 2023

arXiv:2305.07854 [pdf, other]

doi 10.1109/TASE.2023.3274648

A Federated Learning-based Industrial Health Prognostics for Heterogeneous Edge Devices using Matched Feature Extraction

Authors: Anushiya Arunan, Yan Qin, Xiaoli Li, Chau Yuen

Abstract: Data-driven industrial health prognostics require rich training data to develop accurate and reliable predictive models. However, stringent data privacy laws and the abundance of edge industrial data necessitate decentralized data utilization. Thus, the industrial health prognostics field is well suited to significantly benefit from federated learning (FL), a decentralized and privacy-preserving learning technique. However, FL-based health prognostics tasks have hardly been investigated due to the complexities of meaningfully aggregating model parameters trained from heterogeneous data to form a high performing federated model. Specifically, data heterogeneity among edge devices, stemming from dissimilar degradation mechanisms and unequal dataset sizes, poses a critical statistical challenge for developing accurate federated models. We propose a pioneering FL-based health prognostic model with a feature similarity-matched parameter aggregation algorithm to discriminatingly learn from heterogeneous edge data. The algorithm searches across the heterogeneous locally trained models and matches neurons with probabilistically similar feature extraction functions first, before selectively averaging them to form the federated model parameters. As the algorithm only averages similar neurons, as opposed to conventional naive averaging of coordinate-wise neurons, the distinct feature extractors of local models are carried over with less dilution to the resultant federated model. Using both cyclic degradation data of Li-ion batteries and non-cyclic data of turbofan engines, we demonstrate that the proposed method yields accuracy improvements as high as 44.5% and 39.3% for state-of-health estimation and remaining useful life estimation, respectively.

Submitted 18 May, 2023; v1 submitted 13 May, 2023; originally announced May 2023.

Comments: 17 pages, 11 figures, and 6 tables

Journal ref: Accepted by IEEE TASE 2023

arXiv:2305.07367 [pdf, ps, other]

S-REINFORCE: A Neuro-Symbolic Policy Gradient Approach for Interpretable Reinforcement Learning

Authors: Rajdeep Dutta, Qincheng Wang, Ankur Singh, Dhruv Kumarjiguda, Li Xiaoli, Senthilnath Jayavelu

Abstract: This paper presents a novel RL algorithm, S-REINFORCE, which is designed to generate interpretable policies for dynamic decision-making tasks. The proposed algorithm leverages two types of function approximators, namely Neural Network (NN) and Symbolic Regressor (SR), to produce numerical and symbolic policies, respectively. The NN component learns to generate a numerical probability distribution over the possible actions using a policy gradient, while the SR component captures the functional form that relates the associated states with the action probabilities. The SR-generated policy expressions are then utilized through importance sampling to improve the rewards received during the learning process. We have tested the proposed S-REINFORCE algorithm on various dynamic decision-making problems with low and high dimensional action spaces, and the results demonstrate its effectiveness and impact in achieving interpretable solutions. By leveraging the strengths of both NN and SR, S-REINFORCE produces policies that are not only well-performing but also easy to interpret, making it an ideal choice for real-world applications where transparency and causality are crucial.

Submitted 12 May, 2023; originally announced May 2023.

Comments: 10 pages, 7 figures

arXiv:2305.06784 [pdf, other]

Utility-Maximizing Bidding Strategy for Data Consumers in Auction-based Federated Learning

Authors: Xiaoli Tang, Han Yu

Abstract: Auction-based Federated Learning (AFL) has attracted extensive research interest due to its ability to motivate data owners to join FL through economic means. Existing works assume that only one data consumer and multiple data owners exist in an AFL marketplace (i.e., a monopoly market). Therefore, data owners bid to join the data consumer for FL. However, this assumption is not realistic in practical AFL marketplaces in which multiple data consumers can compete to attract data owners to join their respective FL tasks. In this paper, we bridge this gap by proposing a first-of-its-kind utility-maximizing bidding strategy for data consumers in federated learning (Fed-Bidder). It enables multiple FL data consumers to compete for data owners via AFL effectively and efficiently by providing them with utility estimation capabilities which can accommodate diverse forms of winning functions, each reflecting different market dynamics. Extensive experiments based on six commonly adopted benchmark datasets show that Fed-Bidder is significantly more advantageous compared to four state-of-the-art approaches.

Submitted 14 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

arXiv:2305.04181 [pdf, other]

Shall We Trust All Relational Tuples by Open Information Extraction? A Study on Speculation Detection

Authors: Kuicai Dong, Aixin Sun, Jung-Jae Kim, Xiaoli Li

Abstract: Open Information Extraction (OIE) aims to extract factual relational tuples from open-domain sentences. Downstream tasks use the extracted OIE tuples as facts, without examining the certainty of these facts. However, uncertainty/speculation is a common linguistic phenomenon. Existing studies on speculation detection are defined at sentence level, but even if a sentence is determined to be speculative, not all tuples extracted from it may be speculative. In this paper, we propose to study speculations in OIE and aim to determine whether an extracted tuple is speculative. We formally define the research problem of tuple-level speculation detection and conduct a detailed data analysis on the LSOIE dataset which contains labels for speculative tuples. Lastly, we propose a baseline model OIE-Spec for this new research task.

Submitted 6 May, 2023; originally announced May 2023.

arXiv:2305.03299 [pdf, other]

Open Information Extraction via Chunks

Authors: Kuicai Dong, Aixin Sun, Jung-Jae Kim, Xiaoli Li

Abstract: Open Information Extraction (OIE) aims to extract relational tuples from open-domain sentences. Existing OIE systems split a sentence into tokens and recognize token spans as tuple relations and arguments. We instead propose Sentence as Chunk sequence (SaC) and recognize chunk spans as tuple relations and arguments. We argue that SaC has better quantitative and qualitative properties for OIE than sentence as token sequence, and evaluate four choices of chunks (i.e., CoNLL chunks, simple phrases, NP chunks, and spans from SpanOIE) against gold OIE tuples. Accordingly, we propose a simple BERT-based model for sentence chunking, and propose Chunk-OIE for tuple extraction on top of SaC. Chunk-OIE achieves state-of-the-art results on multiple OIE datasets, showing that SaC benefits the OIE task.

Submitted 5 May, 2023; originally announced May 2023.

arXiv:2305.02762 [pdf, ps, other]

Degree stability of graphs forbidding odd cycles

Authors: Xiaoli Yuan, Yuejian Peng

Abstract: Erdős and Simonovits asked the following question: For an integer $r\geq 2$ and a family of non-bipartite graphs $\mathcal{H}$, what is the tight bound of $α$ such that any $\mathcal{H}$-free $n$-vertex graph with minimum degree at least $αn$ has chromatic number at most $r$? We answer this question for $r=2$ and any family of odd cycles. Let $l\le k$ and $n\ge 1000k^{8}$ be positive integers. Let ${\mathcal C}$ be a family of odd cycles, $C_{2l+1}$ be the shortest odd cycle not in ${\mathcal C}$, and $C_{2k+1}$ be the longest odd cycle in ${\mathcal C}$. Let $BC_{2l+1}(n)$ denote the graph obtained by taking $2l+1$ vertex-disjoint copies of $K_{\frac{n}{2(2l+1)},\frac{n}{2(2l+1)}}$ and selecting a vertex in each of them such that these vertices form a cycle of length $2l+1$. Let $C_{2k+3}({n \over 2k+3})$ denote the balanced blow up of $C_{2k+3}$ with $n$ vertices. Note that both $BC_{2l+1}(n)$ and $C_{2k+3}({n \over 2k+3})$ are $n$-vertex ${\mathcal C}$-free non-bipartite graphs. We show that if $G$ is an $n$-vertex ${\mathcal C}$-free graph with $δ(G)>\max\{ \frac{n}{2(2l+1)}, \frac{2}{2k+3}n\}$, then $G$ is bipartite. The bound is tight, as evidenced by $BC_{2l+1}(n)$ and $C_{2k+3}({n \over 2k+3})$. Moreover, the only $n$-vertex ${\mathcal C}$-free non-bipartite graph with minimum degree $\max\{ \frac{n}{2(2l+1)}, \frac{2}{2k+3}n\}=\frac{n}{2(2l+1)}$ is $BC_{2l+1}(n)$, and the only $n$-vertex ${\mathcal C}$-free non-bipartite graph with minimum degree $\max\{ \frac{n}{2(2l+1)}, \frac{2}{2k+3}n\}=\frac{2}{2k+3}n$ is $C_{2k+3}({n \over 2k+3})$. Our result also unifies stability results of Andrásfai, Erdős and Sós, Häggkvist and Yuan and Peng for large $n$.

Submitted 4 May, 2023; originally announced May 2023.

Comments: 17 pages

arXiv:2304.11939 [pdf]

doi 10.1103/PhysRevLett.130.206001

Phase shift and magnetic anisotropy induced field splitting of impurity states in (Li1-xFex)OHFeSe superconductor

Authors: Tianzhen Zhang, Yining Hu, Wei Su, Chen Chen, Xu Wang, Dong Li, Zouyouwei Lu, Wentao Yang, Qingle Zhang, Xiaoli Dong, Rui Wang, Xiaoqun Wang, Donglai Feng, Tong Zhang

Abstract: Revealing the energy and spatial characteristics of impurity induced states in superconductors is essential for understanding their mechanism and fabricating new quantum state by manipulating impurities. Here by using high-resolution scanning tunneling microscopy/spectroscopy, we investigated the spatial distribution and magnetic field response of the impurity states in (Li1-xFex)OHFeSe. We detected two pairs of strong in-gap states on the "dumbbell" shaped defects. They display clear damped oscillations with different phase shifts and a direct phase-energy correlation. These features have long been predicted for classical Yu-Shiba-Rusinov (YSR) state, which are demonstrated here with unprecedented resolution for the first time. Moreover, upon applying magnetic field, all the in-gap state peaks remarkably split into two rather than shift, and the splitting strength is field orientation dependent. Via detailed numerical model calculations, we found such anisotropic splitting behavior can be naturally induced by a high-spin impurity coupled to anisotropic environment, highlighting how magnetic anisotropy affects the behavior of YSR states.

Submitted 24 April, 2023; originally announced April 2023.

Comments: Main text with supplementary (accepted by Phys. Rev. Lett.)

arXiv:2304.11288 [pdf, other]

A novel energy-optimal scalar auxiliary variable (EOP-SAV) approach for gradient flows

Authors: Zhengguang Liu, Yanrong Zhang, Xiaoli Li

Abstract: In recent years, the scalar auxiliary variable (SAV) approach has become very popular in the design of linear, high-order and unconditionally energy stable schemes for gradient flow models. However, the nature of SAV-based numerical schemes preserving modified energy dissipation limits its wider application. A relaxation technique to correct the modified energy for the baseline SAV method (RSAV) was proposed by Zhao et al. and Shen et al. The RSAV approach is unconditionally energy stable with respect to a modified energy that is closer to the original free energy, and provides a much improved accuracy when compared with the SAV approach. In this paper, inspired by the RSAV approach, we propose a novel technique to correct the modified energy of the SAV approach, which can be proved to be an optimal energy approximation. We construct new high-order implicit-explicit schemes based on the proposed energy-optimal SAV (EOP-SAV) approach. The constructed EOP-SAV schemes not only provide an improved accuracy but also simplify calculation, and can be viewed as the optimal relaxation. We also prove that the numerical schemes based on the EOP-SAV approach are unconditionally energy stable. Compared with the RSAV approach, the proposed EOP-SAV approach does not need to introduce any relaxation factors and can share a similar procedure for error estimates. Several interesting numerical examples have been presented to demonstrate the accuracy and effectiveness of the proposed methods.

Submitted 21 April, 2023; originally announced April 2023.

Comments: Scalar auxiliary variable, Gradient flow, Relaxation, Optimal, Error analysis

arXiv:2304.10316 [pdf, other]

Search-Map-Search: A Frame Selection Paradigm for Action Recognition

Authors: Mingjun Zhao, Yakun Yu, Xiaoli Wang, Lei Yang, Di Niu

Abstract: Despite the success of deep learning in video understanding tasks, processing every frame in a video is computationally expensive and often unnecessary in real-time applications. Frame selection aims to extract the most informative and representative frames to help a model better understand video content. Existing frame selection methods either individually sample frames based on per-frame importance prediction, without considering interaction among frames, or adopt reinforcement learning agents to find representative frames in succession, which are costly to train and may lead to potential stability issues. To overcome the limitations of existing methods, we propose a Search-Map-Search learning paradigm which combines the advantages of heuristic search and supervised learning to select the best combination of frames from a video as one entity. By combining search with learning, the proposed method can better capture frame interactions while incurring a low inference overhead. Specifically, we first propose a hierarchical search method conducted on each training video to search for the optimal combination of frames with the lowest error on the downstream task. A feature mapping function is then learned to map the frames of a video to the representation of its target optimal frame combination. During inference, another search is performed on an unseen video to select a combination of frames whose feature representation is close to the projected feature representation. Extensive experiments based on several action recognition benchmarks demonstrate that our frame selection method effectively improves performance of action recognition models, and significantly outperforms a number of competitive baselines.

Submitted 20 April, 2023; originally announced April 2023.

Comments: CVPR 2023

arXiv:2304.10310 [pdf, other]

doi 10.1007/978-3-031-19803-8_16

LA3: Efficient Label-Aware AutoAugment

Authors: Mingjun Zhao, Shan Lu, Zixuan Wang, Xiaoli Wang, Di Niu

Abstract: Automated augmentation is an emerging and effective technique to search for data augmentation policies to improve generalizability of deep neural network training. Most existing work focuses on constructing a unified policy applicable to all data samples in a given dataset, without considering sample or class variations. In this paper, we propose a novel two-stage data augmentation algorithm, named Label-Aware AutoAugment (LA3), which takes advantage of the label information, and learns augmentation policies separately for samples of different labels. LA3 consists of two learning stages, where in the first stage, individual augmentation methods are evaluated and ranked for each label via Bayesian Optimization aided by a neural predictor, which allows us to identify effective augmentation techniques for each label under a low search cost. And in the second stage, a composite augmentation policy is constructed out of a selection of effective as well as complementary augmentations, which produces significant performance boost and can be easily deployed in typical model training. Extensive experiments demonstrate that LA3 achieves excellent performance matching or surpassing existing methods on CIFAR-10 and CIFAR-100, and achieves a new state-of-the-art ImageNet accuracy of 79.97% on ResNet-50 among auto-augmentation methods, while maintaining a low computational cost.

Submitted 20 April, 2023; originally announced April 2023.

Comments: ECCV 2022

arXiv:2304.10188 [pdf, ps, other]

doi 10.3847/1538-4357/acc85f

Hard TeV Gamma-Ray Afterglows of Nearby GRB 190829A as a Tentative Signature of Ultra-High-Energy Cosmic Rays Accelerated in Gamma-Ray Burst Jets

Authors: Jian-Kun Huang, Xiao-Li Huang, Ji-Gui Cheng, Jia Ren, Lu-Lu Zhang, En-Wei Liang

Abstract: The observed hard TeV gamma-ray spectrum of the nearby gamma-ray burst (GRB) 190829A may challenge the conventional leptonic GRB afterglow model. It has been proposed that an ultra-high-energy (UHE; $\varepsilon^{'}_{\rm p}\sim 10^{20}$ eV) proton population can be pre-accelerated by internal shocks in GRB jets. We study possible signatures of the UHE protons embedded in the TeV afterglows when they escape the afterglow fireball. We show that the leptonic model can represent the observed multiwavelength lightcurves and spectral energy distributions of GRB 190829A by considering the uncertainties of the model parameters. Attributing the TeV gamma-ray afterglows to the emission of both the electron self-Compton scattering process and the UHE proton synchrotron radiations in the afterglow fireball, we obtain tentative upper limits of $\log_{10} \varepsilon_{\rm p}^{\prime}/{\rm eV}\sim 20.46$ and $\log_{10}E_{\rm p, total}/{\rm erg}\leq 50.75$, where $E_{\rm p, total}$ is the total energy of the proton population. The synchrotron radiations of the UHE protons should dominate the early TeV gamma-ray afterglows, implying that early observations are critical for revealing the UHE proton population.

Submitted 20 April, 2023; originally announced April 2023.

Comments: 9 pages, 4 figures, Accepted for Publication in ApJ

arXiv:2304.05854 [pdf]

doi 10.1038/s41467-023-37792-3

Charge order driven by multiple-Q spin fluctuations in heavily electron-doped iron selenide superconductors

Authors: Ziyuan Chen, Dong Li, Zouyouwei Lu, Yue Liu, Jiakang Zhang, Yuanji Li, Ruotong Yin, Mingzhe Li, Tong Zhang, Xiaoli Dong, Ya-Jun Yan, Dong-Lai Feng

Abstract: Intertwined spin and charge orders have been widely studied in high-temperature superconductors, since their fluctuations may facilitate electron pairing; however, they are rarely identified in heavily electron-doped iron selenides. Here, using scanning tunneling microscopy, we show that when the superconductivity of (Li0.84Fe0.16OH)Fe1-xSe is suppressed by introducing Fe-site defects, a short-ranged checkerboard charge order emerges, propagating along the Fe-Fe directions with an approximately 2aFe period. It persists throughout the whole phase space tuned by Fe-site defect density, from a defect-pinned local pattern in optimally doped samples to an extended order in samples with lower Tc or non-superconducting. Intriguingly, our simulations indicate that the charge order is likely driven by multiple-Q spin density waves originating from the spin fluctuations observed by inelastic neutron scattering. Our study proves the presence of a competing order in heavily electron-doped iron selenides, and demonstrates the potential of charge order as a tool to detect spin fluctuations.

Submitted 12 April, 2023; originally announced April 2023.

Comments: 16 pages, 5 figures

Journal ref: Nat. Commun. 14, 2023 (2023)

arXiv:2304.04869 [pdf, other]

doi 10.1088/1538-3873/acd1b5

The James Webb Space Telescope Mission

Authors: Jonathan P. Gardner, John C. Mather, Randy Abbott, James S. Abell, Mark Abernathy, Faith E. Abney, John G. Abraham, Roberto Abraham, Yasin M. Abul-Huda, Scott Acton, Cynthia K. Adams, Evan Adams, David S. Adler, Maarten Adriaensen, Jonathan Albert Aguilar, Mansoor Ahmed, Nasif S. Ahmed, Tanjira Ahmed, Rüdeger Albat, Loïc Albert, Stacey Alberts, David Aldridge, Mary Marsha Allen, Shaune S. Allen, Martin Altenburg, et al. (983 additional authors not shown)

Abstract: Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astronomers will celebrate their accomplishments for the life of the mission, potentially as long as 20 years, and beyond. This report and the scientific discoveries that follow are extended thank-you notes to the 20,000 team members. The telescope is working perfectly, with much better image quality than expected. In this and accompanying papers, we give a brief history, describe the observatory, outline its objectives and current observing program, and discuss the inventions and people who made it possible. We cite detailed reports on the design and the measured performance on orbit.

Submitted 10 April, 2023; originally announced April 2023.

Comments: Accepted by PASP for the special issue on The James Webb Space Telescope Overview, 29 pages, 4 figures

arXiv:2304.00558 [pdf]

doi 10.1007/s11433-023-2171-8

Percolation-induced resistivity drop in cold-pressed LuH2

Authors: Ningning Wang, Jun Hou, Ziyi Liu, Pengfei Shan, Congcong Chai, Shifeng Jin, Xiao Wang, Youwen Long, Yue Liu, Hua Zhang, Xiaoli Dong, Jinguang Cheng

Abstract: The stoichiometric bulk LuH2 is a paramagnetic metal with high electrical conductivity comparable to simple metals. Here we show that the resistivity of cold-pressed (CP) LuH2 samples varies sensitively upon modifying the grain size or surface conditions via the grinding process, i.e., the CP pellets made of commercially purchased LuH2 powder remain metallic but exhibit thousands of times higher resistivity, while additional grinding of LuH2 powders in air further enhances the resistivity and even results in weakly localized behaviors. For these CP samples, interestingly, we can occasionally observe abrupt resistivity drops at high temperatures, which also show dependences on magnetic fields and electrical current. Measurements of variable-temperature XRD, magnetic susceptibility, and specific heat exclude the possibilities of structural, magnetic, and superconducting transitions for the observed resistivity drops. Instead, we tentatively attribute these above observations to the presence of insulating layers on the grain surface due to the modification of hydrogen stoichiometry or the pollution by oxygen/nitrogen. Percolation of the metallic grains through the insulating surfaces can explain the sudden drop in resistivity. The present results thus call for caution in asserting the resistivity drops as superconductivity and invalidate the background subtraction in analyzing the resistivity data.

Submitted 2 April, 2023; originally announced April 2023.

Comments: 17 pages, 10 figures

Journal ref: Sci. China Phys. Mech. Astron. 66, 297412 (2023)

arXiv:2304.00212 [pdf, other]

Devil is in the Queries: Advancing Mask Transformers for Real-world Medical Image Segmentation and Out-of-Distribution Localization

Authors: Mingze Yuan, Yingda Xia, Hexin Dong, Zifan Chen, Jiawen Yao, Mingyan Qiu, Ke Yan, Xiaoli Yin, Yu Shi, Xin Chen, Zaiyi Liu, Bin Dong, Jingren Zhou, Le Lu, Ling Zhang, Li Zhang

Abstract: Real-world medical image segmentation has tremendous long-tailed complexity of objects, among which tail conditions correlate with relatively rare diseases and are clinically significant. A trustworthy medical AI algorithm should demonstrate its effectiveness on tail conditions to avoid clinically dangerous damage in these out-of-distribution (OOD) cases. In this paper, we adopt the concept of object queries in Mask Transformers to formulate semantic segmentation as a soft cluster assignment. The queries fit the feature-level cluster centers of inliers during training. Therefore, when performing inference on a medical image in real-world scenarios, the similarity between pixels and the queries detects and localizes OOD regions. We term this OOD localization as MaxQuery. Furthermore, the foregrounds of real-world medical images, whether OOD objects or inliers, are lesions. The difference between them is less than that between the foreground and background, possibly misleading the object queries to focus redundantly on the background. Thus, we propose a query-distribution (QD) loss to enforce clear boundaries between segmentation targets and other regions at the query level, improving the inlier segmentation and OOD indication. Our proposed framework is tested on two real-world segmentation tasks, i.e., segmentation of pancreatic and liver tumors, outperforming previous state-of-the-art algorithms by an average of 7.39% on AUROC, 14.69% on AUPR, and 13.79% on FPR95 for OOD localization. On the other hand, our framework improves the performance of inlier segmentation by an average of 5.27% DSC when compared with the leading baseline nnUNet.

Submitted 31 March, 2023; originally announced April 2023.

Comments: CVPR 2023 Highlight

arXiv:2303.15496 [pdf, ps, other]

Holonomic Bessel modules and generating functions

Authors: Yik Man Chiang, Avery Ching, Xiaoli Lin

Abstract: We have solved a number of holonomic PDEs derived from the Bessel modules which are related to the generating functions of classical Bessel functions and the difference Bessel functions recently discovered by Bohner and Cuchta. This $D$-module approach both unifies and extends generating functions of the classical and the difference Bessel functions. It shows that the algebraic structures of the Bessel modules and related modules determine the possible formats of Bessel's generating functions studied in this article. As a consequence of these $D$-module structures, a number of new recursion formulae, integral representations and new difference Bessel polynomials have been discovered. The key ingredients of our argument involve new transmutation formulae related to the Bessel modules and the construction of $D$-linear maps between different appropriately constructed submodules. This work can be viewed as a $D$-module approach to Truesdell's $F$-equation theory specialised to Bessel functions. The framework presented in this article can be applied to other special functions.

Submitted 27 March, 2023; originally announced March 2023.

Comments: 97 pages including one blank page

MSC Class: Primary 32S40; 33E99; 30D10; Secondary 47B37; 47B47; 12H05; 12H10; 13N10

arXiv:2303.10452 [pdf, other]

Confidence Attention and Generalization Enhanced Distillation for Continuous Video Domain Adaptation

Authors: Xiyu Wang, Yuecong Xu, Jianfei Yang, Bihan Wen, Alex C. Kot

Abstract: Continuous Video Domain Adaptation (CVDA) is a scenario where a source model is required to adapt to a series of individually available changing target domains continuously without source data or target supervision. It has wide applications, such as robotic vision and autonomous driving. The main underlying challenge of CVDA is to learn helpful information only from the unsupervised target data while avoiding forgetting previously learned knowledge catastrophically, which is out of the capability of previous Video-based Unsupervised Domain Adaptation methods. Therefore, we propose a Confidence-Attentive network with geneRalization enhanced self-knowledge disTillation (CART) to address the challenge in CVDA. Firstly, to learn from unsupervised domains, we propose to learn from pseudo labels. However, in continuous adaptation, prediction errors can accumulate rapidly in pseudo labels, and CART effectively tackles this problem with two key modules. Specifically, the first module generates refined pseudo labels using model predictions and deploys a novel attentive learning strategy. The second module compares the outputs of augmented data from the current model to the outputs of weakly augmented data from the source model, forming a novel consistency regularization on the model to alleviate the accumulation of prediction errors. Extensive experiments suggest that the CVDA performance of CART outperforms existing methods by a considerable margin.

Submitted 29 August, 2023; v1 submitted 18 March, 2023; originally announced March 2023.

Comments: 16 pages, 9 tables, 10 figures

arXiv:2303.10451

Augmenting and Aligning Snippets for Few-Shot Video Domain Adaptation

Authors: Yuecong Xu, Jianfei Yang, Yunjiao Zhou, Zhenghua Chen, Min Wu, Xiaoli Li

Abstract: For video models to be transferred and applied seamlessly across video tasks in varied environments, Video Unsupervised Domain Adaptation (VUDA) has been introduced to improve the robustness and transferability of video models. However, current VUDA methods rely on a vast amount of high-quality unlabeled target data, which may not be available in real-world cases. We thus consider a more realistic Few-Shot Video-based Domain Adaptation (FSVDA) scenario where we adapt video models with only a few target video samples. While a few methods have touched upon Few-Shot Domain Adaptation (FSDA) in images and in FSVDA, they rely primarily on spatial augmentation for target domain expansion, with alignment performed statistically at the instance level. However, videos contain richer temporal and semantic information, which should be fully considered while augmenting target domains and performing alignment in FSVDA. We propose a novel SSA2lign to address FSVDA at the snippet level, where the target domain is expanded through a simple snippet-level augmentation followed by the attentive alignment of snippets both semantically and statistically, with semantic alignment of snippets conducted from multiple perspectives. Empirical results demonstrate state-of-the-art performance of SSA2lign across multiple cross-domain action recognition benchmarks.

Submitted 18 March, 2023; originally announced March 2023.

Comments: 15 pages, 9 tables, 5 figures

arXiv:2303.10062

Confidence-aware 3D Gaze Estimation and Evaluation Metric

Authors: Qiaojie Zheng, Jiucai Zhang, Amy Zhang, Xiaoli Zhang

Abstract: Deep learning appearance-based 3D gaze estimation is gaining popularity due to its minimal hardware requirements and freedom from constraints. Unreliable and overconfident inferences, however, still limit the adoption of this gaze estimation method. To address unreliable and overconfident inferences, we introduce a confidence-aware model that predicts uncertainties together with gaze angle estimations. We also introduce a novel effectiveness evaluation method, based on the causality between eye feature degradation and the rise in inference uncertainty, to assess the uncertainty estimation. Our confidence-aware model demonstrates reliable uncertainty estimations while providing angular estimation accuracies on par with the state of the art. Compared with the existing statistical uncertainty-angular-error evaluation metric, the proposed effectiveness evaluation approach can more effectively judge the quality of the inferred uncertainty at each prediction.

Submitted 17 March, 2023; originally announced March 2023.

Comments: 9 pages, 12 figures

Showing 101–150 of 671 results for author: Xiali