-
Quasibound and quasinormal modes of a thick brane in Rastall gravity
Authors:
Qin Tan,
Yi Zhong,
Wen-Di Guo
Abstract:
In this work, we study the gravitational quasinormal modes of the thick brane in Rastall gravity. Using the asymptotic iteration and direct integration methods, we solve the quasinormal frequencies of the Rastall thick brane. We also obtained the waveforms of these quasinormal modes through numerical evolution. The results indicate that although the Rastall thick brane lacks a bound zero mode, whe…
▽ More
In this work, we study the gravitational quasinormal modes of the thick brane in Rastall gravity. Using the asymptotic iteration and direct integration methods, we solve the quasinormal frequencies of the Rastall thick brane. We also obtained the waveforms of these quasinormal modes through numerical evolution. The results indicate that although the Rastall thick brane lacks a bound zero mode, when the Rastall parameter $λ\gtrsim0$, a long-lived quasinormal mode appears. This long-lived quasinormal mode may restore the four-dimensional effective Newtonian potential on the brane on a large scale. This may provide a new perspective for the localization of gravity on thick branes, that a thick brane does not necessarily require the gravity to be localized, perhaps quasi-localized is sufficient.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
Revisiting Noise Resilience Strategies in Gesture Recognition: Short-Term Enhancement in Surface Electromyographic Signal Analysis
Authors:
Weiyu Guo,
Ziyue Qiao,
Ying Sun,
Hui Xiong
Abstract:
Gesture recognition based on surface electromyography (sEMG) has been gaining importance in many 3D Interactive Scenes. However, sEMG is easily influenced by various forms of noise in real-world environments, leading to challenges in providing long-term stable interactions through sEMG. Existing methods often struggle to enhance model noise resilience through various predefined data augmentation t…
▽ More
Gesture recognition based on surface electromyography (sEMG) has been gaining importance in many 3D Interactive Scenes. However, sEMG is easily influenced by various forms of noise in real-world environments, leading to challenges in providing long-term stable interactions through sEMG. Existing methods often struggle to enhance model noise resilience through various predefined data augmentation techniques. In this work, we revisit the problem from a short term enhancement perspective to improve precision and robustness against various common noisy scenarios with learnable denoise using sEMG intrinsic pattern information and sliding-window attention. We propose a Short Term Enhancement Module(STEM) which can be easily integrated with various models. STEM offers several benefits: 1) Learnable denoise, enabling noise reduction without manual data augmentation; 2) Scalability, adaptable to various models; and 3) Cost-effectiveness, achieving short-term enhancement through minimal weight-sharing in an efficient attention mechanism. In particular, we incorporate STEM into a transformer, creating the Short Term Enhanced Transformer (STET). Compared with best-competing approaches, the impact of noise on STET is reduced by more than 20%. We also report promising results on both classification and regression datasets and demonstrate that STEM generalizes across different gesture recognition tasks.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
Design of Artificial Interference Signals for Covert Communication Aided by Multiple Friendly Nodes
Authors:
Xuyang Zhao. Wei Guo,
Yongchao Wang
Abstract:
In this paper, we consider a scenario of covert communication aided by multiple friendly interference nodes. The objective is to conceal the legitimate communication link under the surveillance of a warden. The main content is as follows: first, we propose a novel strategy for generating artificial noise signals in the considered covert scenario. Then, we leverage the statistical information of ch…
▽ More
In this paper, we consider a scenario of covert communication aided by multiple friendly interference nodes. The objective is to conceal the legitimate communication link under the surveillance of a warden. The main content is as follows: first, we propose a novel strategy for generating artificial noise signals in the considered covert scenario. Then, we leverage the statistical information of channel coefficients to optimize the basis matrix of the artificial noise signals space in the absence of accurate channel fading information between the friendly interference nodes and the legitimate receiver. The optimization problem aims to design artificial noise signals within the space to facilitate covert communication while minimizing the impact on the performance of legitimate communication. Second, a customized Rimannian Stochastic Variance Reduced Gradient (R-SVRG) algorithm is proposed to solve the non-convex problem. In the algorithm, we employ the Riemannian optimization framework to analyze the geometric structure of the basis matrix constraints and transform the original non-convex optimization problem into an unconstrained problem on the complex Stiefel manifold for solution. Third, we theoretically prove the convergence of the proposed algorithm to a stationary point. In the end, we evaluate the performance of the proposed strategy for generating artificial noise signals through numerical simulations. The results demonstrate that our approach significantly outperforms the Gaussian artificial noise strategy without optimization.
△ Less
Submitted 9 May, 2024; v1 submitted 13 April, 2024;
originally announced April 2024.
-
Meply: A Large-scale Dataset and Baseline Evaluations for Metastatic Perirectal Lymph Node Detection and Segmentation
Authors:
Weidong Guo,
Hantao Zhang,
Shouhong Wan,
Bingbing Zou,
Wanqin Wang,
Chenyang Qiu,
Jun Li,
Peiquan **
Abstract:
Accurate segmentation of metastatic lymph nodes in rectal cancer is crucial for the staging and treatment of rectal cancer. However, existing segmentation approaches face challenges due to the absence of pixel-level annotated datasets tailored for lymph nodes around the rectum. Additionally, metastatic lymph nodes are characterized by their relatively small size, irregular shapes, and lower contra…
▽ More
Accurate segmentation of metastatic lymph nodes in rectal cancer is crucial for the staging and treatment of rectal cancer. However, existing segmentation approaches face challenges due to the absence of pixel-level annotated datasets tailored for lymph nodes around the rectum. Additionally, metastatic lymph nodes are characterized by their relatively small size, irregular shapes, and lower contrast compared to the background, further complicating the segmentation task. To address these challenges, we present the first large-scale perirectal metastatic lymph node CT image dataset called Meply, which encompasses pixel-level annotations of 269 patients diagnosed with rectal cancer. Furthermore, we introduce a novel lymph-node segmentation model named CoSAM. The CoSAM utilizes sequence-based detection to guide the segmentation of metastatic lymph nodes in rectal cancer, contributing to improved localization performance for the segmentation model. It comprises three key components: sequence-based detection module, segmentation module, and collaborative convergence unit. To evaluate the effectiveness of CoSAM, we systematically compare its performance with several popular segmentation methods using the Meply dataset. Our code and dataset will be publicly available at: https://github.com/kanydao/CoSAM.
△ Less
Submitted 13 April, 2024;
originally announced April 2024.
-
Generalization Gap in Data Augmentation: Insights from Illumination
Authors:
Jianqiang Xiao,
Weiwen Guo,
Junfeng Liu,
Mengze Li
Abstract:
In the field of computer vision, data augmentation is widely used to enrich the feature complexity of training datasets with deep learning techniques. However, regarding the generalization capabilities of models, the difference in artificial features generated by data augmentation and natural visual features has not been fully revealed. This study focuses on the visual representation variable 'ill…
▽ More
In the field of computer vision, data augmentation is widely used to enrich the feature complexity of training datasets with deep learning techniques. However, regarding the generalization capabilities of models, the difference in artificial features generated by data augmentation and natural visual features has not been fully revealed. This study focuses on the visual representation variable 'illumination', by simulating its distribution degradation and examining how data augmentation techniques enhance model performance on a classification task. Our goal is to investigate the differences in generalization between models trained with augmented data and those trained under real-world illumination conditions. Results indicate that after undergoing various data augmentation methods, model performance has been significantly improved. Yet, a noticeable generalization gap still exists after utilizing various data augmentation methods, emphasizing the critical role of feature diversity in the training set for enhancing model generalization.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
Field-induced Peierls phase in $S=1$ Heisenberg spins coupled to quantum phonons
Authors:
Shifeng Cui,
Wenan Guo,
G. G. Batrouni,
Pinaki Sengupta
Abstract:
Spin-Peierls transition occurs in a one-dimensional $S=1$ Heisenberg antiferromagnetic model with single-ion anisotropy, coupled to finite frequency bond phonons, in a magnetic field. Our results indicate that for the pure Heisenberg model, any Peierls transition is suppressed by quantum fluctuations of the phonon field. However, a novel magnetic field-induced Spin-Peierls phase is realized in the…
▽ More
Spin-Peierls transition occurs in a one-dimensional $S=1$ Heisenberg antiferromagnetic model with single-ion anisotropy, coupled to finite frequency bond phonons, in a magnetic field. Our results indicate that for the pure Heisenberg model, any Peierls transition is suppressed by quantum fluctuations of the phonon field. However, a novel magnetic field-induced Spin-Peierls phase is realized in the presence of strong single-ion anisotropy. Contrary to the standard Peierls state, the periodicity of bond strength modulation in this field-induced Spin-Peierls state is variable and depends on the strength of the applied field. The nature of the ground state in this new phase and the associated field-driven transitions to and out of this phase are explored using extensive numerical simulations. In particular, we explore the spin and bond correlations and the evolution of bond order modulation with varying magnetic field.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Constraining Axion-Like Particles Dark Matter in Coma Berenices with FAST
Authors:
Wen-Qing Guo,
Zi-Qing Xia,
Xiaoyuan Huang
Abstract:
Axions and axion-like particles (ALPs) appear in many extensions of the Standard Model and are being investigated as promising dark matter (DM) candidates. One viable methodology for their detection involves the investigation of the line-like radio emissions from the dwarf spheroidal galaxy, potentially originating from the radiative decay of ALPs or the conversion of ALPs in the magnetic field. I…
▽ More
Axions and axion-like particles (ALPs) appear in many extensions of the Standard Model and are being investigated as promising dark matter (DM) candidates. One viable methodology for their detection involves the investigation of the line-like radio emissions from the dwarf spheroidal galaxy, potentially originating from the radiative decay of ALPs or the conversion of ALPs in the magnetic field. In this work, we constrain the properties of ALPs using the 2-hour radio observation of Coma Berenices through the Five-hundred-meter Aperture Spherical radio Telescope (FAST). The $\rm 95\%$ upper limits of the ALP-photon coupling are calculated for the ALP decay and conversion scenarios, respectively. Note that the sensitive ALP masses for FAST range from $\sim μ\rm eV$ to tens of $μ\rm eV$, where ALP can explain the DM abundance naturally. However, our limits are weaker than those of the CAST helioscope, which can provide an independent and complementary check on the ALP non-detection for ground experiments. Furthermore, we evaluate the expected sensitivity on the ALP of FAST with its full designed bandwidth (70 $\rm MHz$ - 3 $\rm GHz$) for 100 hours of observation time. Our results indicate that, even with the exceptional sensitivity of the FAST, it is challenging to surpass the existing experimental constraints on ALP DM using radio observation of dSphs, unless the possible enhancements of ALP signals by compact stars in dSphs are considered.
△ Less
Submitted 17 April, 2024; v1 submitted 7 April, 2024;
originally announced April 2024.
-
Event Camera Demosaicing via Swin Transformer and Pixel-focus Loss
Authors:
Yunfan Lu,
Yijie Xu,
Wenzong Ma,
Weiyu Guo,
Hui Xiong
Abstract:
Recent research has highlighted improvements in high-quality imaging guided by event cameras, with most of these efforts concentrating on the RGB domain. However, these advancements frequently neglect the unique challenges introduced by the inherent flaws in the sensor design of event cameras in the RAW domain. Specifically, this sensor design results in the partial loss of pixel values, posing ne…
▽ More
Recent research has highlighted improvements in high-quality imaging guided by event cameras, with most of these efforts concentrating on the RGB domain. However, these advancements frequently neglect the unique challenges introduced by the inherent flaws in the sensor design of event cameras in the RAW domain. Specifically, this sensor design results in the partial loss of pixel values, posing new challenges for RAW domain processes like demosaicing. The challenge intensifies as most research in the RAW domain is based on the premise that each pixel contains a value, making the straightforward adaptation of these methods to event camera demosaicing problematic. To end this, we present a Swin-Transformer-based backbone and a pixel-focus loss function for demosaicing with missing pixel values in RAW domain processing. Our core motivation is to refine a general and widely applicable foundational model from the RGB domain for RAW domain processing, thereby broadening the model's applicability within the entire imaging process. Our method harnesses multi-scale processing and space-to-depth techniques to ensure efficiency and reduce computing complexity. We also proposed the Pixel-focus Loss function for network fine-tuning to improve network convergence based on our discovery of a long-tailed distribution in training loss. Our method has undergone validation on the MIPI Demosaic Challenge dataset, with subsequent analytical experimentation confirming its efficacy. All code and trained models are released here: https://github.com/yunfanLu/ev-demosaic
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
Model-Driven Deep Learning for Distributed Detection with Binary Quantization
Authors:
Wei Guo,
Meng He,
Chuan Huang,
Hengtao He,
Shenghui Song,
Jun Zhang,
Khaled B. Letaief
Abstract:
Within the realm of rapidly advancing wireless sensor networks (WSNs), distributed detection assumes a significant role in various practical applications. However, critical challenge lies in maintaining robust detection performance while operating within the constraints of limited bandwidth and energy resources. This paper introduces a novel approach that combines model-driven deep learning (DL) w…
▽ More
Within the realm of rapidly advancing wireless sensor networks (WSNs), distributed detection assumes a significant role in various practical applications. However, critical challenge lies in maintaining robust detection performance while operating within the constraints of limited bandwidth and energy resources. This paper introduces a novel approach that combines model-driven deep learning (DL) with binary quantization to strike a balance between communication overhead and detection performance in WSNs. We begin by establishing the lower bound of detection error probability for distributed detection using the maximum a posteriori (MAP) criterion. Furthermore, we prove the global optimality of employing identical local quantizers across sensors, thereby maximizing the corresponding Chernoff information. Subsequently, the paper derives the minimum MAP detection error probability (MAPDEP) by inplementing identical binary probabilistic quantizers across the sensors. Moreover, the paper establishes the equivalence between utilizing all quantized data and their average as input to the detector at the fusion center (FC). In particular, we derive the Kullback-Leibler (KL) divergence, which measures the difference between the true posterior probability and output of the proposed detector. Leveraging the MAPDEP and KL divergence as loss functions, the paper proposes model-driven DL method to separately train the probability controller module in the quantizer and the detector module at the FC. Numerical results validate the convergence and effectiveness of the proposed method, which achieves near-optimal performance with reduced complexity for Gaussian hypothesis testing.
△ Less
Submitted 30 March, 2024;
originally announced April 2024.
-
A contribution to the theory of $σ$-properties of a finite group
Authors:
A-Ming Liu,
Wenbin Guo,
Vasily G. Safonov,
Alexander N. Skiba
Abstract:
We characterize some classes of finite soluble groups. In particular, we prove that: a finite group $G$ is supersoluble if and only if $G$ has a normal subgroup $D$ such that $G/D$ is supersoluble and $D$ avoids every chief factor of $G$ between $V^{G}$ and $V_{G}$ for every maximal subgroup $V$ of the generalized Fitting subgroup $F^{*}(G)$ of $G$; a finite soluble group $G$ is a $PST$-group (tha…
▽ More
We characterize some classes of finite soluble groups. In particular, we prove that: a finite group $G$ is supersoluble if and only if $G$ has a normal subgroup $D$ such that $G/D$ is supersoluble and $D$ avoids every chief factor of $G$ between $V^{G}$ and $V_{G}$ for every maximal subgroup $V$ of the generalized Fitting subgroup $F^{*}(G)$ of $G$; a finite soluble group $G$ is a $PST$-group (that is, Sylow permutability is a transitive relation on $G$) if and only if $G$ has a normal subgroup $D$ such that $G/D$ is nilpotent and $D$ avoids every chief factor of $G$ between $V^{G}$ and $V_{G}$ for every subnormal subgroup $A$ of $G$.
△ Less
Submitted 18 February, 2024;
originally announced April 2024.
-
Large-field CO (J=1-0) observations toward SNR G150.3+4.5
Authors:
Jian-Cheng Feng,
Xuepeng Chen,
Yang Su,
Li Sun,
Shiyu Zhang,
Xin Zhou,
Weihua Guo
Abstract:
Aims. We aim to investigate the molecular environment of the supernova remnant (SNR) G150.3+4.5, and explore its association with ambient molecular clouds (MCs). Methods. We present large-field CO (J=1-0) molecular line observations toward SNR G150.3+4.5, using the 13.7 m millimeter telescope of the Purple Mountain Observatory. The observations have an angular resolution of $\sim 55 ''$. We analyz…
▽ More
Aims. We aim to investigate the molecular environment of the supernova remnant (SNR) G150.3+4.5, and explore its association with ambient molecular clouds (MCs). Methods. We present large-field CO (J=1-0) molecular line observations toward SNR G150.3+4.5, using the 13.7 m millimeter telescope of the Purple Mountain Observatory. The observations have an angular resolution of $\sim 55 ''$. We analyzed the spatial distribution of MCs in relation to the SNR shell detected in previous Urumqi $λ$ 6 cm radio observations and examined the CO spectra for kinematics information. Results. We find that MCs at the velocity range of [-14, -2] km s$^{-1}$ are spatially distributed along the radio shell of the SNR. Line broadening and asymmetries are observed in the CO spectra of the clouds. Moreover, we find that the molecular clouds around the shell have systematic velocity gradients in the position-velocity (PV) diagrams. Both morphology alignment and gas kinematics suggest that the SNR is associated with the ambient MCs at $\sim$ 740 pc. Based on the CO gas distance, the dimension and the age of the SNR is estimated to be 40 pc $\times$ 33 pc and 3.8 $ \times 10^4$ years, respectively. The very high energy emission of 1LHAASO J0428+5531 toward the SNR may originate from the interaction between the SNR and the surrounding MCs.
△ Less
Submitted 3 April, 2024; v1 submitted 28 March, 2024;
originally announced March 2024.
-
DODA: Diffusion for Object-detection Domain Adaptation in Agriculture
Authors:
Shuai Xiang,
Pieter M. Blok,
James Burridge,
Haozhou Wang,
Wei Guo
Abstract:
The diverse and high-quality content generated by recent generative models demonstrates the great potential of using synthetic data to train downstream models. However, in vision, especially in objection detection, related areas are not fully explored, the synthetic images are merely used to balance the long tails of existing datasets, and the accuracy of the generated labels is low, the full pote…
▽ More
The diverse and high-quality content generated by recent generative models demonstrates the great potential of using synthetic data to train downstream models. However, in vision, especially in objection detection, related areas are not fully explored, the synthetic images are merely used to balance the long tails of existing datasets, and the accuracy of the generated labels is low, the full potential of generative models has not been exploited. In this paper, we propose DODA, a data synthesizer that can generate high-quality object detection data for new domains in agriculture. Specifically, we improve the controllability of layout-to-image through encoding layout as an image, thereby improving the quality of labels, and use a visual encoder to provide visual clues for the diffusion model to decouple visual features from the diffusion model, and empowering the model the ability to generate data in new domains. On the Global Wheat Head Detection (GWHD) Dataset, which is the largest dataset in agriculture and contains diverse domains, using the data synthesized by DODA improves the performance of the object detector by 12.74-17.76 AP$_{50}$ in the domain that was significantly shifted from the training data.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
END4Rec: Efficient Noise-Decoupling for Multi-Behavior Sequential Recommendation
Authors:
Yongqiang Han,
Hao Wang,
Kefan Wang,
Likang Wu,
Zhi Li,
Wei Guo,
Yong Liu,
Defu Lian,
Enhong Chen
Abstract:
In recommendation systems, users frequently engage in multiple types of behaviors, such as clicking, adding to a cart, and purchasing. However, with diversified behavior data, user behavior sequences will become very long in the short term, which brings challenges to the efficiency of the sequence recommendation model. Meanwhile, some behavior data will also bring inevitable noise to the modeling…
▽ More
In recommendation systems, users frequently engage in multiple types of behaviors, such as clicking, adding to a cart, and purchasing. However, with diversified behavior data, user behavior sequences will become very long in the short term, which brings challenges to the efficiency of the sequence recommendation model. Meanwhile, some behavior data will also bring inevitable noise to the modeling of user interests. To address the aforementioned issues, firstly, we develop the Efficient Behavior Sequence Miner (EBM) that efficiently captures intricate patterns in user behavior while maintaining low time complexity and parameter count. Secondly, we design hard and soft denoising modules for different noise types and fully explore the relationship between behaviors and noise. Finally, we introduce a contrastive loss function along with a guided training strategy to compare the valid information in the data with the noisy signal, and seamlessly integrate the two denoising processes to achieve a high degree of decoupling of the noisy signal. Sufficient experiments on real-world datasets demonstrate the effectiveness and efficiency of our approach in dealing with multi-behavior sequential recommendation.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
Coupler-Assisted Leakage Reduction for Scalable Quantum Error Correction with Superconducting Qubits
Authors:
Xiaohan Yang,
Ji Chu,
Zechen Guo,
Wenhui Huang,
Yongqi Liang,
Jiawei Liu,
Jiawei Qiu,
Xuandong Sun,
Ziyu Tao,
Jiawei Zhang,
Jiajian Zhang,
Libo Zhang,
Yuxuan Zhou,
Weijie Guo,
Ling Hu,
Ji Jiang,
Yang Liu,
Xiayu Linpeng,
Tingyong Chen,
Yuanzhen Chen,
**g**g Niu,
Song Liu,
Youpeng Zhong,
Dapeng Yu
Abstract:
Superconducting qubits are a promising platform for building fault-tolerant quantum computers, with recent achievement showing the suppression of logical error with increasing code size. However, leakage into non-computational states, a common issue in practical quantum systems including superconducting circuits, introduces correlated errors that undermine QEC scalability. Here, we propose and dem…
▽ More
Superconducting qubits are a promising platform for building fault-tolerant quantum computers, with recent achievement showing the suppression of logical error with increasing code size. However, leakage into non-computational states, a common issue in practical quantum systems including superconducting circuits, introduces correlated errors that undermine QEC scalability. Here, we propose and demonstrate a leakage reduction scheme utilizing tunable couplers, a widely adopted ingredient in large-scale superconducting quantum processors. Leveraging the strong frequency tunability of the couplers and stray interaction between the couplers and readout resonators, we eliminate state leakage on the couplers, thus suppressing space-correlated errors caused by population propagation among the couplers. Assisted by the couplers, we further reduce leakage to higher qubit levels with high efficiency (98.1%) and low error rate on the computational subspace (0.58%), suppressing time-correlated errors during QEC cycles. The performance of our scheme demonstrates its potential as an indispensable building block for scalable QEC with superconducting qubits.
△ Less
Submitted 24 March, 2024;
originally announced March 2024.
-
Qibo: A Large Language Model for Traditional Chinese Medicine
Authors:
Heyi Zhang,
Xin Wang,
Zhaopeng Meng,
Zhe Chen,
Pengwei Zhuang,
Yongzhe Jia,
Dawei Xu,
Wenbin Guo
Abstract:
Large Language Models (LLMs) has made significant progress in a number of professional fields, including medicine, law, and finance. However, in traditional Chinese medicine (TCM), there are challenges such as the essential differences between theory and modern medicine, the lack of specialized corpus resources, and the fact that relying only on supervised fine-tuning may lead to overconfident pre…
▽ More
Large Language Models (LLMs) has made significant progress in a number of professional fields, including medicine, law, and finance. However, in traditional Chinese medicine (TCM), there are challenges such as the essential differences between theory and modern medicine, the lack of specialized corpus resources, and the fact that relying only on supervised fine-tuning may lead to overconfident predictions. To address these challenges, we propose a two-stage training approach that combines continuous pre-training and supervised fine-tuning. A notable contribution of our study is the processing of a 2GB corpus dedicated to TCM, constructing pre-training and instruction fine-tuning datasets for TCM, respectively. In addition, we have developed Qibo-Benchmark, a tool that evaluates the performance of LLM in the TCM on multiple dimensions, including subjective, objective, and three TCM NLP tasks. The medical LLM trained with our pipeline, named $\textbf{Qibo}$, exhibits significant performance boosts. Compared to the baselines, the average subjective win rate is 63%, the average objective accuracy improved by 23% to 58%, and the Rouge-L scores for the three TCM NLP tasks are 0.72, 0.61, and 0.55. Finally, we propose a pipline to apply Qibo to TCM consultation and demonstrate the model performance through the case study.
△ Less
Submitted 22 June, 2024; v1 submitted 24 March, 2024;
originally announced March 2024.
-
A general-purpose neural network potential for Ti-Al-Nb alloys towards large-scale molecular dynamics with ab initio accuracy
Authors:
Zhiqiang Zhao,
Wanlin Guo,
Zhuhua Zhang
Abstract:
High Nb-containing TiAl alloys exhibit exceptional high-temperature strength and room-temperature ductility, making them widely used in hot-section components of automotive and aerospace engines. However, the lack of accurate interatomic interaction potentials for large-scale modeling severely hampers a comprehensive understanding of the failure mechanism of Ti-Al-Nb alloys and the development of…
▽ More
High Nb-containing TiAl alloys exhibit exceptional high-temperature strength and room-temperature ductility, making them widely used in hot-section components of automotive and aerospace engines. However, the lack of accurate interatomic interaction potentials for large-scale modeling severely hampers a comprehensive understanding of the failure mechanism of Ti-Al-Nb alloys and the development of strategies to enhance the mechanical properties. Here, we develop a general-purpose machine-learned potential (MLP) for the Ti-Al-Nb ternary system by combining the neural evolution potentials framework with an active learning scheme. The developed MLP, trained on extensive first-principles datasets, demonstrates remarkable accuracy in predicting various lattice and defect properties, as well as high-temperature characteristics such as thermal expansion and melting point for TiAl systems. Notably, this potential can effectively describe the key effect of Nb do** on stacking fault energies and formation energies. Of practical importance is that our MLP enables large-scale molecular dynamics simulations involving tens of millions of atoms with ab initio accuracy, achieving an outstanding balance between computational speed and accuracy. These results pave the way for studying micro-mechanical behaviors in TiAl lamellar structures and develo** high-performance TiAl alloys towards applications at elevated temperatures.
△ Less
Submitted 14 March, 2024;
originally announced March 2024.
-
Noise-aware neural network for stochastic dynamics simulation
Authors:
Pei-Fang Wu,
Wei-Chen Guo,
Liang He
Abstract:
In the presence of system-environment coupling, classical complex systems undergo stochastic dynamics, where rich phenomena can emerge at large spatio-temporal scales. To investigate these phenomena, numerical approaches for simulating stochastic dynamics are indispensable and can be computationally expensive. In light of the recent fast development in machine learning techniques, here, we establi…
▽ More
In the presence of system-environment coupling, classical complex systems undergo stochastic dynamics, where rich phenomena can emerge at large spatio-temporal scales. To investigate these phenomena, numerical approaches for simulating stochastic dynamics are indispensable and can be computationally expensive. In light of the recent fast development in machine learning techniques, here, we establish a generic machine learning approach to simulate the stochastic dynamics, dubbed the noise-aware neural network (NANN). One key feature of this approach is its ability to generate the long-time stochastic dynamics of complex large-scale systems by just training NANN with the one-step dynamics of smaller-scale systems, thus reducing the computational cost. Furthermore, this NANN based approach is quite generic. Case-by-case special design of the architecture of NANN is not necessary when it is employed to investigate different stochastic complex systems. Using the noisy Kuramoto model and the Vicsek model as concrete examples, we demonstrate its capability in simulating stochastic dynamics. We believe that this novel machine learning approach can be a useful tool in investigating the large spatio-temporal scaling behavior of complex systems subjected to the influences of the environmental noise.
△ Less
Submitted 14 March, 2024;
originally announced March 2024.
-
ADEdgeDrop: Adversarial Edge Drop** for Robust Graph Neural Networks
Authors:
Zhaoliang Chen,
Zhihao Wu,
Ylli Sadikaj,
Claudia Plant,
Hong-Ning Dai,
Shi** Wang,
Wenzhong Guo
Abstract:
Although Graph Neural Networks (GNNs) have exhibited the powerful ability to gather graph-structured information from neighborhood nodes via various message-passing mechanisms, the performance of GNNs is limited by poor generalization and fragile robustness caused by noisy and redundant graph data. As a prominent solution, Graph Augmentation Learning (GAL) has recently received increasing attentio…
▽ More
Although Graph Neural Networks (GNNs) have exhibited the powerful ability to gather graph-structured information from neighborhood nodes via various message-passing mechanisms, the performance of GNNs is limited by poor generalization and fragile robustness caused by noisy and redundant graph data. As a prominent solution, Graph Augmentation Learning (GAL) has recently received increasing attention. Among prior GAL approaches, edge-drop** methods that randomly remove edges from a graph during training are effective techniques to improve the robustness of GNNs. However, randomly drop** edges often results in bypassing critical edges, consequently weakening the effectiveness of message passing. In this paper, we propose a novel adversarial edge-drop** method (ADEdgeDrop) that leverages an adversarial edge predictor guiding the removal of edges, which can be flexibly incorporated into diverse GNN backbones. Employing an adversarial training framework, the edge predictor utilizes the line graph transformed from the original graph to estimate the edges to be dropped, which improves the interpretability of the edge-drop** method. The proposed ADEdgeDrop is optimized alternately by stochastic gradient descent and projected gradient descent. Comprehensive experiments on six graph benchmark datasets demonstrate that the proposed ADEdgeDrop outperforms state-of-the-art baselines across various GNN backbones, demonstrating improved generalization and robustness.
△ Less
Submitted 14 March, 2024;
originally announced March 2024.
-
HRLAIF: Improvements in Helpfulness and Harmlessness in Open-domain Reinforcement Learning From AI Feedback
Authors:
Ang Li,
Qiugen Xiao,
Peng Cao,
Jian Tang,
Yi Yuan,
Zijie Zhao,
Xiaoyuan Chen,
Liang Zhang,
Xiangyang Li,
Kaitong Yang,
Weidong Guo,
Yukang Gan,
Xu Yu,
Daniell Wang,
Ying Shan
Abstract:
Reinforcement Learning from AI Feedback (RLAIF) has the advantages of shorter annotation cycles and lower costs over Reinforcement Learning from Human Feedback (RLHF), making it highly efficient during the rapid strategy iteration periods of large language model (LLM) training. Using ChatGPT as a labeler to provide feedback on open-domain prompts in RLAIF training, we observe an increase in human…
▽ More
Reinforcement Learning from AI Feedback (RLAIF) has the advantages of shorter annotation cycles and lower costs over Reinforcement Learning from Human Feedback (RLHF), making it highly efficient during the rapid strategy iteration periods of large language model (LLM) training. Using ChatGPT as a labeler to provide feedback on open-domain prompts in RLAIF training, we observe an increase in human evaluators' preference win ratio for model responses, but a decrease in evaluators' satisfaction rate. Analysis suggests that the decrease in satisfaction rate is mainly due to some responses becoming less helpful, particularly in terms of correctness and truthfulness, highlighting practical limitations of basic RLAIF. In this paper, we propose Hybrid Reinforcement Learning from AI Feedback (HRLAIF). This method enhances the accuracy of AI annotations for responses, making the model's helpfulness more robust in training process. Additionally, it employs AI for Red Teaming, further improving the model's harmlessness. Human evaluation results show that HRLAIF inherits the ability of RLAIF to enhance human preference for outcomes at a low cost while also improving the satisfaction rate of responses. Compared to the policy model before Reinforcement Learning (RL), it achieves an increase of 2.08\% in satisfaction rate, effectively addressing the issue of a decrease of 4.58\% in satisfaction rate after basic RLAIF.
△ Less
Submitted 14 March, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
SIFiD: Reassess Summary Factual Inconsistency Detection with LLM
Authors:
Jiuding Yang,
Hui Liu,
Weidong Guo,
Zhuwei Rao,
Yu Xu,
Di Niu
Abstract:
Ensuring factual consistency between the summary and the original document is paramount in summarization tasks. Consequently, considerable effort has been dedicated to detecting inconsistencies. With the advent of Large Language Models (LLMs), recent studies have begun to leverage their advanced language understanding capabilities for inconsistency detection. However, early attempts have shown tha…
▽ More
Ensuring factual consistency between the summary and the original document is paramount in summarization tasks. Consequently, considerable effort has been dedicated to detecting inconsistencies. With the advent of Large Language Models (LLMs), recent studies have begun to leverage their advanced language understanding capabilities for inconsistency detection. However, early attempts have shown that LLMs underperform traditional models due to their limited ability to follow instructions and the absence of an effective detection methodology. In this study, we reassess summary inconsistency detection with LLMs, comparing the performances of GPT-3.5 and GPT-4. To advance research in LLM-based inconsistency detection, we propose SIFiD (Summary Inconsistency Detection with Filtered Document) that identify key sentences within documents by either employing natural language inference or measuring semantic similarity between summaries and documents.
△ Less
Submitted 12 March, 2024;
originally announced March 2024.
-
Thermoelectric transport of the coexistence topological semimetal in the quantum limit
Authors:
L. W. Guo,
C. M. Wang
Abstract:
We explore the thermoelectric transport properties of a coexistence topological semimetal, characterized by the presence of both a pair of Weyl points and a nodal ring in the quantum limit. This system gives rise to complex Landau bands when subjected to a magnetic field aligned with the direction connecting two Weyl points. In the longitudinal configuration, where the magnetic field is parallel t…
▽ More
We explore the thermoelectric transport properties of a coexistence topological semimetal, characterized by the presence of both a pair of Weyl points and a nodal ring in the quantum limit. This system gives rise to complex Landau bands when subjected to a magnetic field aligned with the direction connecting two Weyl points. In the longitudinal configuration, where the magnetic field is parallel to the electric field or the temperature gradient, the thermoelectric conductivity indicates a plateau independent of the magnetic field and the Fermi energy at $δ$-form short-range scattering. This platform structure should also exist in pure two-node Weyl semimetals. However, the thermoelectric conductivity and the Seebeck coefficient are significantly influenced by the parameters of long-ranged Gaussian or screened Coulomb scattering potentials for both fixed carrier density and Fermi energy scenarios. In the transverse configuration, both Gaussian and screened Coulomb scatterings yield substantial positive magnetoresistance and thermoelectric conductance. Since the Hall conductivity is larger than the longitudinal one, the Seebeck coefficient, exhibiting a quadratic increase with the magnetic field, is close to the dissipationless limit irrespective of scatterings, while the Nernst response is notably dependent on the scattering mechanism. Additionally, the model parameter, distinct from the two-node Weyl model, influences the thermoelectric transport properties. The magnetic field response of the thermoelectric coefficients to different scattering potentials can be used as a basis for distinguishing scattering mechanisms in materials.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
Self-supervised Photographic Image Layout Representation Learning
Authors:
Zhaoran Zhao,
Peng Lu,
Xujun Peng,
Wenhao Guo
Abstract:
In the domain of image layout representation learning, the critical process of translating image layouts into succinct vector forms is increasingly significant across diverse applications, such as image retrieval, manipulation, and generation. Most approaches in this area heavily rely on costly labeled datasets and notably lack in adapting their modeling and learning methods to the specific nuance…
▽ More
In the domain of image layout representation learning, the critical process of translating image layouts into succinct vector forms is increasingly significant across diverse applications, such as image retrieval, manipulation, and generation. Most approaches in this area heavily rely on costly labeled datasets and notably lack in adapting their modeling and learning methods to the specific nuances of photographic image layouts. This shortfall makes the learning process for photographic image layouts suboptimal. In our research, we directly address these challenges. We innovate by defining basic layout primitives that encapsulate various levels of layout information and by map** these, along with their interconnections, onto a heterogeneous graph structure. This graph is meticulously engineered to capture the intricate layout information within the pixel domain explicitly. Advancing further, we introduce novel pretext tasks coupled with customized loss functions, strategically designed for effective self-supervised learning of these layout graphs. Building on this foundation, we develop an autoencoder-based network architecture skilled in compressing these heterogeneous layout graphs into precise, dimensionally-reduced layout representations. Additionally, we introduce the LODB dataset, which features a broader range of layout categories and richer semantics, serving as a comprehensive benchmark for evaluating the effectiveness of layout representation learning methods. Our extensive experimentation on this dataset demonstrates the superior performance of our approach in the realm of photographic image layout representation learning.
△ Less
Submitted 6 March, 2024;
originally announced March 2024.
-
Ultralight vector dark matter search using data from the KAGRA O3GK run
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
H. Abe,
I. Abouelfettouh,
F. Acernese,
K. Ackley,
C. Adamcewicz,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi
, et al. (1778 additional authors not shown)
Abstract:
Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese…
▽ More
Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we present the result of a search for $U(1)_{B-L}$ gauge boson DM using the KAGRA data from auxiliary length channels during the first joint observation run together with GEO600. By applying our search pipeline, which takes into account the stochastic nature of ultralight DM, upper bounds on the coupling strength between the $U(1)_{B-L}$ gauge boson and ordinary matter are obtained for a range of DM masses. While our constraints are less stringent than those derived from previous experiments, this study demonstrates the applicability of our method to the lower-mass vector DM search, which is made difficult in this measurement by the short observation time compared to the auto-correlation time scale of DM.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
SCHEMA: State CHangEs MAtter for Procedure Planning in Instructional Videos
Authors:
Yulei Niu,
Wenliang Guo,
Long Chen,
Xudong Lin,
Shih-Fu Chang
Abstract:
We study the problem of procedure planning in instructional videos, which aims to make a goal-oriented sequence of action steps given partial visual state observations. The motivation of this problem is to learn a structured and plannable state and action space. Recent works succeeded in sequence modeling of steps with only sequence-level annotations accessible during training, which overlooked th…
▽ More
We study the problem of procedure planning in instructional videos, which aims to make a goal-oriented sequence of action steps given partial visual state observations. The motivation of this problem is to learn a structured and plannable state and action space. Recent works succeeded in sequence modeling of steps with only sequence-level annotations accessible during training, which overlooked the roles of states in the procedures. In this work, we point out that State CHangEs MAtter (SCHEMA) for procedure planning in instructional videos. We aim to establish a more structured state space by investigating the causal relations between steps and states in procedures. Specifically, we explicitly represent each step as state changes and track the state changes in procedures. For step representation, we leveraged the commonsense knowledge in large language models (LLMs) to describe the state changes of steps via our designed chain-of-thought prompting. For state change tracking, we align visual state observations with language state descriptions via cross-modal contrastive learning, and explicitly model the intermediate states of the procedure using LLM-generated state descriptions. Experiments on CrossTask, COIN, and NIV benchmark datasets demonstrate that our proposed SCHEMA model achieves state-of-the-art performance and obtains explainable visualizations.
△ Less
Submitted 3 March, 2024;
originally announced March 2024.
-
CR-LT-KGQA: A Knowledge Graph Question Answering Dataset Requiring Commonsense Reasoning and Long-Tail Knowledge
Authors:
Willis Guo,
Armin Toroghi,
Scott Sanner
Abstract:
Knowledge graph question answering (KGQA) is a well-established field that seeks to provide factual answers to natural language (NL) questions by leveraging knowledge graphs (KGs). However, existing KGQA datasets suffer from two significant limitations: (1) no existing KGQA dataset requires commonsense reasoning to arrive at an answer and (2) existing KGQA datasets focus on popular entities for wh…
▽ More
Knowledge graph question answering (KGQA) is a well-established field that seeks to provide factual answers to natural language (NL) questions by leveraging knowledge graphs (KGs). However, existing KGQA datasets suffer from two significant limitations: (1) no existing KGQA dataset requires commonsense reasoning to arrive at an answer and (2) existing KGQA datasets focus on popular entities for which large language models (LLMs) can directly answer without hallucinating and without leveraging the KG. In this work, we seek a novel KGQA dataset that supports commonsense reasoning and focuses on long-tail entities (e.g., non-mainstream and recent entities) where LLMs frequently hallucinate, and thus create the need for novel methodologies that leverage the KG for factual and attributable commonsense inference. We create a novel Commonsense Reasoning (CR) and Long-Tail (LT) KGQA dataset with two subtasks -- question answering and claim verification -- that address both limitations (1) and (2). We construct CR-LT-KGQA by building extensions to existing reasoning datasets StrategyQA and CREAK over Wikidata. While existing KGQA methods are not applicable due to their lack of commonsense inference support, baseline evaluation of LLMs on CR-LT KGQA demonstrate a high rate of hallucination. Thus, CR-LT KGQA poses significant challenges for hallucination-prone LLMs, hence paving the way for future commonsense KGQA research to provide accurate and factual answers for long-tail entities in the era of LLMs.
△ Less
Submitted 2 March, 2024;
originally announced March 2024.
-
Right for Right Reasons: Large Language Models for Verifiable Commonsense Knowledge Graph Question Answering
Authors:
Armin Toroghi,
Willis Guo,
Mohammad Mahdi Abdollah Pour,
Scott Sanner
Abstract:
Knowledge Graph Question Answering (KGQA) methods seek to answer Natural Language questions using the relational information stored in Knowledge Graphs (KGs). With the recent advancements of Large Language Models (LLMs) and their remarkable reasoning abilities, there is a growing trend to leverage them for KGQA. However, existing methodologies have only focused on answering factual questions, e.g.…
▽ More
Knowledge Graph Question Answering (KGQA) methods seek to answer Natural Language questions using the relational information stored in Knowledge Graphs (KGs). With the recent advancements of Large Language Models (LLMs) and their remarkable reasoning abilities, there is a growing trend to leverage them for KGQA. However, existing methodologies have only focused on answering factual questions, e.g., "In which city was Silvio Berlusconi's first wife born?", leaving questions involving commonsense reasoning that real-world users may pose more often, e.g., "Do I need separate visas to see the Venus of Willendorf and attend the Olympics this summer?" unaddressed. In this work, we first observe that existing LLM-based methods for KGQA struggle with hallucination on such questions, especially on queries targeting long-tail entities (e.g., non-mainstream and recent entities), thus hindering their applicability in real-world applications especially since their reasoning processes are not easily verifiable. In response, we propose Right for Right Reasons (R3), a commonsense KGQA methodology that allows for a verifiable reasoning procedure by axiomatically surfacing intrinsic commonsense knowledge of LLMs and grounding every factual reasoning step on KG triples. Through experimental evaluations across three different tasks--question answering, claim verification, and preference matching--our findings showcase R3 as a superior approach, outperforming existing methodologies and notably reducing instances of hallucination and reasoning errors.
△ Less
Submitted 2 March, 2024;
originally announced March 2024.
-
A Comprehensive Survey of Federated Transfer Learning: Challenges, Methods and Applications
Authors:
Wei Guo,
Fuzhen Zhuang,
Xiao Zhang,
Yiqi Tong,
** Dong
Abstract:
Federated learning (FL) is a novel distributed machine learning paradigm that enables participants to collaboratively train a centralized model with privacy preservation by eliminating the requirement of data sharing. In practice, FL often involves multiple participants and requires the third party to aggregate global information to guide the update of the target participant. Therefore, many FL me…
▽ More
Federated learning (FL) is a novel distributed machine learning paradigm that enables participants to collaboratively train a centralized model with privacy preservation by eliminating the requirement of data sharing. In practice, FL often involves multiple participants and requires the third party to aggregate global information to guide the update of the target participant. Therefore, many FL methods do not work well due to the training and test data of each participant may not be sampled from the same feature space and the same underlying distribution. Meanwhile, the differences in their local devices (system heterogeneity), the continuous influx of online data (incremental data), and labeled data scarcity may further influence the performance of these methods. To solve this problem, federated transfer learning (FTL), which integrates transfer learning (TL) into FL, has attracted the attention of numerous researchers. However, since FL enables a continuous share of knowledge among participants with each communication round while not allowing local data to be accessed by other participants, FTL faces many unique challenges that are not present in TL. In this survey, we focus on categorizing and reviewing the current progress on federated transfer learning, and outlining corresponding solutions and applications. Furthermore, the common setting of FTL scenarios, available datasets, and significant related research are summarized in this survey.
△ Less
Submitted 2 March, 2024;
originally announced March 2024.
-
SAR-AE-SFP: SAR Imagery Adversarial Example in Real Physics domain with Target Scattering Feature Parameters
Authors:
Jiahao Cui,
Jiale Duan,
Binyan Luo,
Hang Cao,
Wang Guo,
Haifeng Li
Abstract:
Deep neural network-based Synthetic Aperture Radar (SAR) target recognition models are susceptible to adversarial examples. Current adversarial example generation methods for SAR imagery primarily operate in the 2D digital domain, known as image adversarial examples. Recent work, while considering SAR imaging scatter mechanisms, fails to account for the actual imaging process, rendering attacks in…
▽ More
Deep neural network-based Synthetic Aperture Radar (SAR) target recognition models are susceptible to adversarial examples. Current adversarial example generation methods for SAR imagery primarily operate in the 2D digital domain, known as image adversarial examples. Recent work, while considering SAR imaging scatter mechanisms, fails to account for the actual imaging process, rendering attacks in the three-dimensional physical domain infeasible, termed pseudo physics adversarial examples. To address these challenges, this paper proposes SAR-AE-SFP-Attack, a method to generate real physics adversarial examples by altering the scattering feature parameters of target objects. Specifically, we iteratively optimize the coherent energy accumulation of the target echo by perturbing the reflection coefficient and scattering coefficient in the scattering feature parameters of the three-dimensional target object, and obtain the adversarial example after echo signal processing and imaging processing in the RaySAR simulator. Experimental results show that compared to digital adversarial attack methods, SAR-AE-SFP Attack significantly improves attack efficiency on CNN-based models (over 30\%) and Transformer-based models (over 13\%), demonstrating significant transferability of attack effects across different models and perspectives.
△ Less
Submitted 2 March, 2024;
originally announced March 2024.
-
Multi-objective Optimal Roadside Units Deployment in Urban Vehicular Networks
Authors:
Weian Guo,
Zecheng Kang,
Dongyang Li,
Lun Zhang,
Li Li
Abstract:
The significance of transportation efficiency, safety, and related services is increasing in urban vehicular networks. Within such networks, roadside units (RSUs) serve as intermediates in facilitating communication. Therefore, the deployment of RSUs is of utmost importance in ensuring the quality of communication services. However, the optimization objectives, such as time delay and deployment co…
▽ More
The significance of transportation efficiency, safety, and related services is increasing in urban vehicular networks. Within such networks, roadside units (RSUs) serve as intermediates in facilitating communication. Therefore, the deployment of RSUs is of utmost importance in ensuring the quality of communication services. However, the optimization objectives, such as time delay and deployment cost, are commonly developed from diverse perspectives. As a result, it is possible that conflicts may arise among the objectives. Furthermore, in urban environments, the presence of various obstacles, such as buildings, gardens, lakes, and other infrastructure, poses challenges for the deployment of RSUs. Hence, the deployment encounters significant difficulties due to the existence of multiple objectives, constraints imposed by obstacles, and the necessity to explore a large-scale optimization space. To address this issue, two versions of multi-objective optimization algorithms are proposed in this paper. By utilizing a multi-population strategy and an adaptive exploration technique, the methods efficiently explore a large-scale decision-variable space. In order to mitigate the issue of an overcrowded deployment of RSUs, a calibrating mechanism is adopted to adjust RSU density during the optimization procedures. The proposed methods also take care of data offloading between vehicles and RSUs by setting up an iterative best response sequence game (IBRSG). By comparing the proposed algorithms with several state-of-the-art algorithms, the results demonstrate that our strategies perform better in both high-density and low-density urban scenarios. The results also indicate that the proposed solutions substantially improve the efficiency of vehicular networks.
△ Less
Submitted 14 January, 2024;
originally announced February 2024.
-
Defect Detection in Tire X-Ray Images: Conventional Methods Meet Deep Structures
Authors:
Andrei Cozma,
Landon Harris,
Hairong Qi,
** Ji,
Wenpeng Guo,
Song Yuan
Abstract:
This paper introduces a robust approach for automated defect detection in tire X-ray images by harnessing traditional feature extraction methods such as Local Binary Pattern (LBP) and Gray Level Co-Occurrence Matrix (GLCM) features, as well as Fourier and Wavelet-based features, complemented by advanced machine learning techniques. Recognizing the challenges inherent in the complex patterns and te…
▽ More
This paper introduces a robust approach for automated defect detection in tire X-ray images by harnessing traditional feature extraction methods such as Local Binary Pattern (LBP) and Gray Level Co-Occurrence Matrix (GLCM) features, as well as Fourier and Wavelet-based features, complemented by advanced machine learning techniques. Recognizing the challenges inherent in the complex patterns and textures of tire X-ray images, the study emphasizes the significance of feature engineering to enhance the performance of defect detection systems. By meticulously integrating combinations of these features with a Random Forest (RF) classifier and comparing them against advanced models like YOLOv8, the research not only benchmarks the performance of traditional features in defect detection but also explores the synergy between classical and modern approaches. The experimental results demonstrate that these traditional features, when fine-tuned and combined with machine learning models, can significantly improve the accuracy and reliability of tire defect detection, aiming to set a new standard in automated quality assurance in tire manufacturing.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
Minimize Control Inputs for Strong Structural Controllability Using Reinforcement Learning with Graph Neural Network
Authors:
Mengbang Zou,
Weisi Guo,
Bailu **
Abstract:
Strong structural controllability (SSC) guarantees networked system with linear-invariant dynamics controllable for all numerical realizations of parameters. Current research has established algebraic and graph-theoretic conditions of SSC for zero/nonzero or zero/nonzero/arbitrary structure. One relevant practical problem is how to fully control the system with the minimal number of input signals…
▽ More
Strong structural controllability (SSC) guarantees networked system with linear-invariant dynamics controllable for all numerical realizations of parameters. Current research has established algebraic and graph-theoretic conditions of SSC for zero/nonzero or zero/nonzero/arbitrary structure. One relevant practical problem is how to fully control the system with the minimal number of input signals and identify which nodes must be imposed signals. Previous work shows that this optimization problem is NP-hard and it is difficult to find the solution. To solve this problem, we formulate the graph coloring process as a Markov decision process (MDP) according to the graph-theoretical condition of SSC for both zero/nonzero and zero/nonzero/arbitrary structure. We use Actor-critic method with Directed graph neural network which represents the color information of graph to optimize MDP. Our method is validated in a social influence network with real data and different complex network models. We find that the number of input nodes is determined by the average degree of the network and the input nodes tend to select nodes with low in-degree and avoid high-degree nodes.
△ Less
Submitted 26 February, 2024;
originally announced February 2024.
-
Quasinormal modes of a charged black hole with scalar hair
Authors:
Wen-Di Guo,
Qin Tan
Abstract:
From a five-dimensional Einstein-Maxwell theory, Bah et al. constructed a singularity free topology star/black hole [Phys. Rev. Lett. 126, 151101 (2021)]. After the Klein-Kluza reduction, i.e., integrating the extra space dimension, it can obtain an effective four-dimensional static spherical charged black hole with scalar hair. In this paper, we study the quasinormal modes (QNMs) of the scalar fi…
▽ More
From a five-dimensional Einstein-Maxwell theory, Bah et al. constructed a singularity free topology star/black hole [Phys. Rev. Lett. 126, 151101 (2021)]. After the Klein-Kluza reduction, i.e., integrating the extra space dimension, it can obtain an effective four-dimensional static spherical charged black hole with scalar hair. In this paper, we study the quasinormal modes (QNMs) of the scalar field, electromagnetic field, and gravitational field on the background of this effective four-dimensional charged black hole. The radial parts of the perturbed fields all satisfy a Schrödinger-like equation. Using the asymptotic iteration method, we obtain the QNM frequencies semianalytically. For low overtone QNMs, the results obtained from the asymptotic iteration method and the Wentzel-Kramers-Brillouin approximation method agree well. In the null coordinates, the evolution of a Gaussian package is also studied. The QNM frequencies obtained by fitting the evolution data also agree well with the results obtained by the asymptotic iteration method.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
Variable Projection Algorithms: Theoretical Insights and A Novel Approach for Problems with Large Residual
Authors:
Guangyong Chen,
Peng Xue,
Min Gan,
**g Chen,
Wenzhong Guo,
C. L. Philip. Chen
Abstract:
This paper delves into an in-depth exploration of the Variable Projection (VP) algorithm, a powerful tool for solving separable nonlinear optimization problems across multiple domains, including system identification, image processing, and machine learning. We first establish a theoretical framework to examine the effect of the approximate treatment of the coupling relationship among parameters on…
▽ More
This paper delves into an in-depth exploration of the Variable Projection (VP) algorithm, a powerful tool for solving separable nonlinear optimization problems across multiple domains, including system identification, image processing, and machine learning. We first establish a theoretical framework to examine the effect of the approximate treatment of the coupling relationship among parameters on the local convergence of the VP algorithm and theoretically prove that the Kaufman's VP algorithm can achieve a similar convergence rate as the Golub \& Pereyra's form. These studies fill the gap in the existing convergence theory analysis, and provide a solid foundation for understanding the mechanism of VP algorithm and broadening its application horizons. Furthermore, drawing inspiration from these theoretical revelations, we design a refined VP algorithm for handling separable nonlinear optimization problems characterized by large residual, called VPLR, which boosts the convergence performance by addressing the interdependence of parameters within the separable model and by continually correcting the approximated Hessian matrix to counteract the influence of large residual during the iterative process. The effectiveness of this refined algorithm is corroborated through numerical experimentation.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
FGAD: Self-boosted Knowledge Distillation for An Effective Federated Graph Anomaly Detection Framework
Authors:
**yu Cai,
Yunhe Zhang,
Zhoumin Lu,
Wenzhong Guo,
See-kiong Ng
Abstract:
Graph anomaly detection (GAD) aims to identify anomalous graphs that significantly deviate from other ones, which has raised growing attention due to the broad existence and complexity of graph-structured data in many real-world scenarios. However, existing GAD methods usually execute with centralized training, which may lead to privacy leakage risk in some sensitive cases, thereby impeding collab…
▽ More
Graph anomaly detection (GAD) aims to identify anomalous graphs that significantly deviate from other ones, which has raised growing attention due to the broad existence and complexity of graph-structured data in many real-world scenarios. However, existing GAD methods usually execute with centralized training, which may lead to privacy leakage risk in some sensitive cases, thereby impeding collaboration among organizations seeking to collectively develop robust GAD models. Although federated learning offers a promising solution, the prevalent non-IID problems and high communication costs present significant challenges, particularly pronounced in collaborations with graph data distributed among different participants. To tackle these challenges, we propose an effective federated graph anomaly detection framework (FGAD). We first introduce an anomaly generator to perturb the normal graphs to be anomalous, and train a powerful anomaly detector by distinguishing generated anomalous graphs from normal ones. Then, we leverage a student model to distill knowledge from the trained anomaly detector (teacher model), which aims to maintain the personality of local models and alleviate the adverse impact of non-IID problems. Moreover, we design an effective collaborative learning mechanism that facilitates the personalization preservation of local models and significantly reduces communication costs among clients. Empirical results of the GAD tasks on non-IID graphs compared with state-of-the-art baselines demonstrate the superiority and efficiency of the proposed FGAD method.
△ Less
Submitted 20 February, 2024;
originally announced February 2024.
-
HyCubE: Efficient Knowledge Hypergraph 3D Circular Convolutional Embedding
Authors:
Zhao Li,
Xin Wang,
Jun Zhao,
Wenbin Guo,
Jianxin Li
Abstract:
Knowledge hypergraph embedding models are usually computationally expensive due to the inherent complex semantic information. However, existing works mainly focus on improving the effectiveness of knowledge hypergraph embedding, making the model architecture more complex and redundant. It is desirable and challenging for knowledge hypergraph embedding to reach a trade-off between model effectivene…
▽ More
Knowledge hypergraph embedding models are usually computationally expensive due to the inherent complex semantic information. However, existing works mainly focus on improving the effectiveness of knowledge hypergraph embedding, making the model architecture more complex and redundant. It is desirable and challenging for knowledge hypergraph embedding to reach a trade-off between model effectiveness and efficiency. In this paper, we propose an end-to-end efficient n-ary knowledge hypergraph embedding model, HyCubE, which designs a novel 3D circular convolutional neural network and the alternate mask stack strategy to enhance the interaction and extraction of feature information comprehensively. Furthermore, our proposed model achieves a better trade-off between effectiveness and efficiency by adaptively adjusting the 3D circular convolutional layer structure to handle different arity knowledge hypergraphs with fewer parameters. In addition, we use 1-N multilinear scoring based on the entity mask mechanism to further accelerate the model training efficiency. Finally, extensive experimental results on all datasets demonstrate that our proposed model consistently outperforms state-of-the-art baselines, with an average improvement of 7.30%-9.53% and a maximum improvement of 33.82% across all metrics. Meanwhile, HyCubE is 4.12x faster, GPU memory usage is 52.19% lower, and the number of parameters is reduced by 85.21% compared with the average metric of the latest state-of-the-art baselines.
△ Less
Submitted 3 June, 2024; v1 submitted 14 February, 2024;
originally announced February 2024.
-
Explainable Adversarial Learning Framework on Physical Layer Secret Keys Combating Malicious Reconfigurable Intelligent Surface
Authors:
Zhuangkun Wei,
Wenxiu Hu,
Weisi Guo
Abstract:
The development of reconfigurable intelligent surfaces (RIS) is a double-edged sword to physical layer security (PLS). Whilst a legitimate RIS can yield beneficial impacts including increased channel randomness to enhance physical layer secret key generation (PL-SKG), malicious RIS can poison legitimate channels and crack most of existing PL-SKGs. In this work, we propose an adversarial learning f…
▽ More
The development of reconfigurable intelligent surfaces (RIS) is a double-edged sword to physical layer security (PLS). Whilst a legitimate RIS can yield beneficial impacts including increased channel randomness to enhance physical layer secret key generation (PL-SKG), malicious RIS can poison legitimate channels and crack most of existing PL-SKGs. In this work, we propose an adversarial learning framework between legitimate parties (namely Alice and Bob) to address this Man-in-the-middle malicious RIS (MITM-RIS) eavesdrop**. First, the theoretical mutual information gap between legitimate pairs and MITM-RIS is deduced. Then, Alice and Bob leverage generative adversarial networks (GANs) to learn to achieve a common feature surface that does not have mutual information overlap with MITM-RIS. Next, we aid signal processing interpretation of black-box neural networks by using a symbolic explainable AI (xAI) representation. These symbolic terms of dominant neurons aid feature engineering-based validation and future design of PLS common feature space. Simulation results show that our proposed GAN-based and symbolic-based PL-SKGs can achieve high key agreement rates between legitimate users, and is even resistant to MITM-RIS Eve with the knowledge of legitimate feature generation (NNs or formulas). This therefore paves the way to secure wireless communications with untrusted reflective devices in future 6G.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Generalizable Entity Grounding via Assistance of Large Language Model
Authors:
Lu Qi,
Yi-Wen Chen,
Lehan Yang,
Tiancheng Shen,
Xiangtai Li,
Weidong Guo,
Yu Xu,
Ming-Hsuan Yang
Abstract:
In this work, we propose a novel approach to densely ground visual entities from a long caption. We leverage a large multimodal model (LMM) to extract semantic nouns, a class-agnostic segmentation model to generate entity-level segmentation, and the proposed multi-modal feature fusion module to associate each semantic noun with its corresponding segmentation mask. Additionally, we introduce a stra…
▽ More
In this work, we propose a novel approach to densely ground visual entities from a long caption. We leverage a large multimodal model (LMM) to extract semantic nouns, a class-agnostic segmentation model to generate entity-level segmentation, and the proposed multi-modal feature fusion module to associate each semantic noun with its corresponding segmentation mask. Additionally, we introduce a strategy of encoding entity segmentation masks into a colormap, enabling the preservation of fine-grained predictions from features of high-resolution masks. This approach allows us to extract visual features from low-resolution images using the CLIP vision encoder in the LMM, which is more computationally efficient than existing approaches that use an additional encoder for high-resolution images. Our comprehensive experiments demonstrate the superiority of our method, outperforming state-of-the-art techniques on three tasks, including panoptic narrative grounding, referring expression segmentation, and panoptic segmentation.
△ Less
Submitted 4 February, 2024;
originally announced February 2024.
-
NeuV-SLAM: Fast Neural Multiresolution Voxel Optimization for RGBD Dense SLAM
Authors:
Wenzhi Guo,
Bing Wang,
Lijun Chen
Abstract:
We introduce NeuV-SLAM, a novel dense simultaneous localization and map** pipeline based on neural multiresolution voxels, characterized by ultra-fast convergence and incremental expansion capabilities. This pipeline utilizes RGBD images as input to construct multiresolution neural voxels, achieving rapid convergence while maintaining robust incremental scene reconstruction and camera tracking.…
▽ More
We introduce NeuV-SLAM, a novel dense simultaneous localization and map** pipeline based on neural multiresolution voxels, characterized by ultra-fast convergence and incremental expansion capabilities. This pipeline utilizes RGBD images as input to construct multiresolution neural voxels, achieving rapid convergence while maintaining robust incremental scene reconstruction and camera tracking. Central to our methodology is to propose a novel implicit representation, termed VDF that combines the implementation of neural signed distance field (SDF) voxels with an SDF activation strategy. This approach entails the direct optimization of color features and SDF values anchored within the voxels, substantially enhancing the rate of scene convergence. To ensure the acquisition of clear edge delineation, SDF activation is designed, which maintains exemplary scene representation fidelity even under constraints of voxel resolution. Furthermore, in pursuit of advancing rapid incremental expansion with low computational overhead, we developed hashMV, a novel hash-based multiresolution voxel management structure. This architecture is complemented by a strategically designed voxel generation technique that synergizes with a two-dimensional scene prior. Our empirical evaluations, conducted on the Replica and ScanNet Datasets, substantiate NeuV-SLAM's exceptional efficacy in terms of convergence speed, tracking accuracy, scene reconstruction, and rendering quality.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Relation between timelike and spacelike entanglement entropy
Authors:
Wu-zhong Guo,
Song He,
Yu-Xuan Zhang
Abstract:
In this study, we establish a connection between timelike and spacelike entanglement entropy. Specifically, for a diverse range of states, the timelike entanglement entropy is uniquely determined by a linear combination of the spacelike entanglement entropy and its first-order temporal derivative. This framework reveals that the imaginary component of the timelike entanglement entropy primarily or…
▽ More
In this study, we establish a connection between timelike and spacelike entanglement entropy. Specifically, for a diverse range of states, the timelike entanglement entropy is uniquely determined by a linear combination of the spacelike entanglement entropy and its first-order temporal derivative. This framework reveals that the imaginary component of the timelike entanglement entropy primarily originates from the non-commutativity between the twist operator and its first-order temporal derivative. Furthermore, we analyze the constraints of this relation and highlight the possible extension to accommodate more complex state configurations.
△ Less
Submitted 31 January, 2024;
originally announced February 2024.
-
Superconductivity in freestanding infinite-layer nickelate membranes
Authors:
Shengjun Yan,
Wei Mao,
Wenjie Sun,
Yueying Li,
Haoying Sun,
Jiangfeng Yang,
Bo Hao,
Wei Guo,
Leyan Nian,
Zhengbin Gu,
Peng Wang,
Yuefeng Nie
Abstract:
The observation of superconductivity in infinite-layer nickelates has attracted significant attention due to its potential as a new platform for exploring high $ \mathrm{\textit{T}}_{c} $ superconductivity. However, thus far, superconductivity has only been observed in epitaxial thin films, which limits the manipulation capabilities and modulation methods compared to two-dimensional exfoliated mat…
▽ More
The observation of superconductivity in infinite-layer nickelates has attracted significant attention due to its potential as a new platform for exploring high $ \mathrm{\textit{T}}_{c} $ superconductivity. However, thus far, superconductivity has only been observed in epitaxial thin films, which limits the manipulation capabilities and modulation methods compared to two-dimensional exfoliated materials. Given the exceptionally giant strain tunability and stacking capability of freestanding membranes, separating superconducting nickelates from the as-grown substrate is a novel way to engineer the superconductivity and uncover the underlying physics. Herein, we report the synthesis of the superconducting freestanding $ \mathrm{La}_{0.8}\mathrm{Sr}_{0.2}\mathrm{Ni}\mathrm{O}_{2} $ membranes ($ \mathrm{\textit{T}}_{c}\mathrm{=}\mathrm{10.9}\;\mathrm{K} $), emphasizing the crucial roles of the interface engineering in the precursor phase film growth and the quick transfer process in achieving superconductivity. Our work offers a new versatile platform for investigating the superconductivity in nickelates, such as the pairing symmetry via constructing Josephson tunneling junctions and higher $ \mathrm{\textit{T}}_{c} $ values via high-pressure experiments.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
A ThermalKinetic Inductance Detectors Pixel Design for Cosmic Microwave Background Observations at 90/150 GHz bands
Authors:
Ye Chai,
Shibo Shu,
Yong** Li,
Jiamin Sun,
Zhouhui Liu,
Yu Xu,
Daikang Yan,
Zhengwei Li,
Yang Liu,
Yiwen Wang,
Weijie Guo,
Juexian Cao,
Congzhan Liu
Abstract:
The highly sensitive millimeter-wave telescope is an important tool for accurate measurement of Cosmic Microwave Background (CMB) radiation, and its core component is a detector array located in a cryogenic focal plane. The feasibility of utilizing thermal kinetic inductance detectors (TKIDs) for CMB observations has been demonstrated. We propose a pixel design of TKIDs for observing CMB through a…
▽ More
The highly sensitive millimeter-wave telescope is an important tool for accurate measurement of Cosmic Microwave Background (CMB) radiation, and its core component is a detector array located in a cryogenic focal plane. The feasibility of utilizing thermal kinetic inductance detectors (TKIDs) for CMB observations has been demonstrated. We propose a pixel design of TKIDs for observing CMB through atmospheric windows for observations in the 90/150 GHz bands. Assuming lossless dielectric, the coupling efficiency of a single pixel is around 90%. This pixel design will be utilized for future large-scale TKIDs array designs for CMB observations.
△ Less
Submitted 26 January, 2024;
originally announced January 2024.
-
Learning to Manipulate Artistic Images
Authors:
Wei Guo,
Yuqi Zhang,
De Ma,
Qian Zheng
Abstract:
Recent advancement in computer vision has significantly lowered the barriers to artistic creation. Exemplar-based image translation methods have attracted much attention due to flexibility and controllability. However, these methods hold assumptions regarding semantics or require semantic information as the input, while accurate semantics is not easy to obtain in artistic images. Besides, these me…
▽ More
Recent advancement in computer vision has significantly lowered the barriers to artistic creation. Exemplar-based image translation methods have attracted much attention due to flexibility and controllability. However, these methods hold assumptions regarding semantics or require semantic information as the input, while accurate semantics is not easy to obtain in artistic images. Besides, these methods suffer from cross-domain artifacts due to training data prior and generate imprecise structure due to feature compression in the spatial domain. In this paper, we propose an arbitrary Style Image Manipulation Network (SIM-Net), which leverages semantic-free information as guidance and a region transportation strategy in a self-supervised manner for image generation. Our method balances computational efficiency and high resolution to a certain extent. Moreover, our method facilitates zero-shot style image manipulation. Both qualitative and quantitative experiments demonstrate the superiority of our method over state-of-the-art methods.Code is available at https://github.com/SnailForce/SIM-Net.
△ Less
Submitted 25 January, 2024;
originally announced January 2024.
-
Unveiling a Novel Metal-to-Metal Transition in LuH2: Critically Challenging Superconductivity Claims in Lutetium Hydrides
Authors:
Dong Wang,
Ningning Wang,
Caoshun Zhang,
Chunsheng Xia,
Weicheng Guo,
Xia Yin,
Kejun Bu,
Takeshi Nakagawa,
Jianbo Zhang,
Federico Gorelli,
Philip Dalladay-Simpson,
Thomas Meier,
Xujie Lü,
Liling Sun,
**guang Cheng,
Qiaoshi Zeng,
Yang Ding,
Ho-kwang Mao
Abstract:
Following the recent report by Dasenbrock-Gammon et al. (2023) of near-ambient superconductivity in nitrogen-doped lutetium trihydride (LuH3-δNε), significant debate has emerged surrounding the composition and interpretation of the observed sharp resistance drop. Here, we meticulously revisit these claims through comprehensive characterization and investigations. We definitively identify the repor…
▽ More
Following the recent report by Dasenbrock-Gammon et al. (2023) of near-ambient superconductivity in nitrogen-doped lutetium trihydride (LuH3-δNε), significant debate has emerged surrounding the composition and interpretation of the observed sharp resistance drop. Here, we meticulously revisit these claims through comprehensive characterization and investigations. We definitively identify the reported material as lutetium dihydride (LuH2), resolving the ambiguity surrounding its composition. Under similar conditions (270-295 K and 1-2 GPa), we replicate the reported sharp decrease in electrical resistance with a 30% success rate, aligning with Dasenbrock-Gammon et al.'s observations. However, our extensive investigations reveal this phenomenon to be a novel, pressure-induced metal-to-metal transition intrinsic to LuH2, distinct from superconductivity. Intriguingly, nitrogen do** exerts minimal impact on this transition. Our work not only elucidates the fundamental properties of LuH2 and LuH3 but also critically challenges the notion of superconductivity in these lutetium hydride systems. These findings pave the way for future research on lutetium hydride systems while emphasizing the crucial importance of rigorous verification in claims of ambient temperature superconductivity.
△ Less
Submitted 28 January, 2024; v1 submitted 25 January, 2024;
originally announced January 2024.
-
Diagnosing $SO(5)$ Symmetry and First-Order Transition in the $J-Q_3$ Model via Entanglement Entropy
Authors:
Zehui Deng,
Lu Liu,
Wenan Guo,
Hai-qing Lin
Abstract:
We study the scaling behavior of the Rényi entanglement entropy with smooth boundaries at the phase transition point of the two-dimensional $J-Q_3$ model. Using the recently developed scaling formula [Deng {\it et al.}, Phys. Rev. B {\textbf{108}, 125144 (2023)}], we find a subleading logarithmic term with a coefficient showing that the number of Goldstone modes is four, indicating the existence o…
▽ More
We study the scaling behavior of the Rényi entanglement entropy with smooth boundaries at the phase transition point of the two-dimensional $J-Q_3$ model. Using the recently developed scaling formula [Deng {\it et al.}, Phys. Rev. B {\textbf{108}, 125144 (2023)}], we find a subleading logarithmic term with a coefficient showing that the number of Goldstone modes is four, indicating the existence of the spontaneous symmetry breaking from an emergent $SO(5)$ to $O(4)$ in the thermodynamic limit, but restored in a finite size. This result shows that the believed deconfined quantum critical point of the $J-Q_{3}$ model is a weak first-order transition point. Our work provides a new way to distinguish a state with spontaneously broken continuous symmetry from a critical state. The method is particularly useful in identifying weak first-order phase transitions, which are hard to determine using conventional methods.
△ Less
Submitted 23 January, 2024;
originally announced January 2024.
-
Experience-Learning Inspired Two-Step Reward Method for Efficient Legged Locomotion Learning Towards Natural and Robust Gaits
Authors:
Yinghui Li,
**ze Wu,
Xin Liu,
Weizhong Guo,
Yufei Xue
Abstract:
Multi-legged robots offer enhanced stability in complex terrains, yet autonomously learning natural and robust motions in such environments remains challenging. Drawing inspiration from animals' progressive learning patterns, from simple to complex tasks, we introduce a universal two-stage learning framework with two-step reward setting based on self-acquired experience, which efficiently enables…
▽ More
Multi-legged robots offer enhanced stability in complex terrains, yet autonomously learning natural and robust motions in such environments remains challenging. Drawing inspiration from animals' progressive learning patterns, from simple to complex tasks, we introduce a universal two-stage learning framework with two-step reward setting based on self-acquired experience, which efficiently enables legged robots to incrementally learn natural and robust movements. In the first stage, robots learn through gait-related rewards to track velocity on flat terrain, acquiring natural, robust movements and generating effective motion experience data. In the second stage, mirroring animal learning from existing experiences, robots learn to navigate challenging terrains with natural and robust movements using adversarial imitation learning. To demonstrate our method's efficacy, we trained both quadruped robots and a hexapod robot, and the policy were successfully transferred to a physical quadruped robot GO1, which exhibited natural gait patterns and remarkable robustness in various terrains.
△ Less
Submitted 22 January, 2024;
originally announced January 2024.
-
Robust Semi-Supervised Learning for Self-learning Open-World Classes
Authors:
Wenjuan Xi,
Xin Song,
Weili Guo,
Yang Yang
Abstract:
Existing semi-supervised learning (SSL) methods assume that labeled and unlabeled data share the same class space. However, in real-world applications, unlabeled data always contain classes not present in the labeled set, which may cause classification performance degradation of known classes. Therefore, open-world SSL approaches are researched to handle the presence of multiple unknown classes in…
▽ More
Existing semi-supervised learning (SSL) methods assume that labeled and unlabeled data share the same class space. However, in real-world applications, unlabeled data always contain classes not present in the labeled set, which may cause classification performance degradation of known classes. Therefore, open-world SSL approaches are researched to handle the presence of multiple unknown classes in the unlabeled data, which aims to accurately classify known classes while fine-grained distinguishing different unknown classes. To address this challenge, in this paper, we propose an open-world SSL method for Self-learning Open-world Classes (SSOC), which can explicitly self-learn multiple unknown classes. Specifically, SSOC first defines class center tokens for both known and unknown classes and autonomously learns token representations according to all samples with the cross-attention mechanism. To effectively discover novel classes, SSOC further designs a pairwise similarity loss in addition to the entropy loss, which can wisely exploit the information available in unlabeled data from instances' predictions and relationships. Extensive experiments demonstrate that SSOC outperforms the state-of-the-art baselines on multiple popular classification benchmarks. Specifically, on the ImageNet-100 dataset with a novel ratio of 90%, SSOC achieves a remarkable 22% improvement.
△ Less
Submitted 15 January, 2024;
originally announced January 2024.
-
Phase diagram of a square lattice model of XY Spins with direction-dependent interactions
Authors:
Fan Zhang,
Wenan Guo,
Ribhu K. Kaul
Abstract:
We study a generalization of the well-known classical two-dimensional square lattice compass model of XY spins (sometimes referred to as the 90$^\circ$ compass model), which interpolates between the XY model and the compass model. Our model possesses the combined $C_4$ lattice and spin rotation symmetry of the compass model but is free of its fine-tuned subsystem symmetries. Using both field theor…
▽ More
We study a generalization of the well-known classical two-dimensional square lattice compass model of XY spins (sometimes referred to as the 90$^\circ$ compass model), which interpolates between the XY model and the compass model. Our model possesses the combined $C_4$ lattice and spin rotation symmetry of the compass model but is free of its fine-tuned subsystem symmetries. Using both field theoretic arguments and Monte Carlo simulations, we find that our model possesses a line of critical points with continuously varying exponents of the Ashkin-Teller type terminating at the four-state Potts point. Further, our Monte Carlo study uncovers that beyond the four-state Potts point, the line of phase transition is connected to the lattice-nematic Ising phase transition in the square lattice compass model through a region of first-order transitions.
△ Less
Submitted 17 January, 2024; v1 submitted 15 January, 2024;
originally announced January 2024.
-
JMA: a General Algorithm to Craft Nearly Optimal Targeted Adversarial Example
Authors:
Benedetta Tondi,
Wei Guo,
Mauro Barni
Abstract:
Most of the approaches proposed so far to craft targeted adversarial examples against Deep Learning classifiers are highly suboptimal and typically rely on increasing the likelihood of the target class, thus implicitly focusing on one-hot encoding settings. In this paper, we propose a more general, theoretically sound, targeted attack that resorts to the minimization of a Jacobian-induced MAhalano…
▽ More
Most of the approaches proposed so far to craft targeted adversarial examples against Deep Learning classifiers are highly suboptimal and typically rely on increasing the likelihood of the target class, thus implicitly focusing on one-hot encoding settings. In this paper, we propose a more general, theoretically sound, targeted attack that resorts to the minimization of a Jacobian-induced MAhalanobis distance (JMA) term, taking into account the effort (in the input space) required to move the latent space representation of the input sample in a given direction. The minimization is solved by exploiting the Wolfe duality theorem, reducing the problem to the solution of a Non-Negative Least Square (NNLS) problem. The proposed algorithm provides an optimal solution to a linearized version of the adversarial example problem originally introduced by Szegedy et al. \cite{szegedy2013intriguing}. The experiments we carried out confirm the generality of the proposed attack which is proven to be effective under a wide variety of output encoding schemes. Noticeably, the JMA attack is also effective in a multi-label classification scenario, being capable to induce a targeted modification of up to half the labels in a complex multilabel classification scenario with 20 labels, a capability that is out of reach of all the attacks proposed so far. As a further advantage, the JMA attack usually requires very few iterations, thus resulting more efficient than existing methods.
△ Less
Submitted 2 January, 2024;
originally announced January 2024.
-
Non-prompt $\mathrm{J}/ψ$ production in proton-proton collisions with ALICE
Authors:
Wenda Guo
Abstract:
$\mathrm{J}/ψ…
▽ More
$\mathrm{J}/ψ$ production in high-energy hadronic collisions is sensitive to both perturbative and non-perturbative aspects of quantum chromodynamics (QCD) calculations. The production of a heavy-quark pair is well-described by perturbative QCD, whereas the formation of the bound state involves non-perturbative processes, treated in different ways by various available theoretical models. ALICE can measure inclusive $\mathrm{J}/ψ$ at both forward and midrapidity down to low ${p_{\rm T}}$ and the prompt and non-prompt $\mathrm{J}/ψ$ separation can be performed at midrapidity. The study of the production of non-prompt $\mathrm{J}/ψ$ originating from the decay of beauty hadrons, besides allowing to isolate the prompt $\mathrm{J}/ψ$ cross section from the inclusive $\mathrm{J}/ψ$ cross section, can be used to estimate open beauty-hadron production. Heavy-flavour particle production in pp collisions as a function of charged-particle multiplicity can provide insight into the processes occuring in the collision at the partonic level, as well as the interplay between the hard and soft mechanisms in particle production.
△ Less
Submitted 13 March, 2024; v1 submitted 29 December, 2023;
originally announced January 2024.
-
Quasinormal modes and greybody factor of a Lorentz-violating black hole
Authors:
Wen-Di Guo,
Qin Tan,
Yu-Xiao Liu
Abstract:
Recently, a static spherically symmetric black hole solution was found in gravity nonminimally coupled a background Kalb-Ramond field. The Lorentz symmetry is spontaneously broken when the Kalb-Ramond field has a nonvanishing vacuum expectation value. In this work, we focus on the quasinormal modes and greybody factor of this black hole. The master equations for the perturbed scalar field, electro…
▽ More
Recently, a static spherically symmetric black hole solution was found in gravity nonminimally coupled a background Kalb-Ramond field. The Lorentz symmetry is spontaneously broken when the Kalb-Ramond field has a nonvanishing vacuum expectation value. In this work, we focus on the quasinormal modes and greybody factor of this black hole. The master equations for the perturbed scalar field, electromagnetic field, and gravitational field can be written into a uniform form. We use three methods to solve the quasinormal frequencies in the frequency domain. The results agree well with each other. The time evolution of a Gaussian wave packet is studied. The quasinormal frequencies fitted from the time evolution data agree well with that of frequency domain. The greybody factor is calculated by Wentzel-Kramers-Brillouin (WKB) method. The effect of the Lorentz-violating parameter on the quasinormal modes and greybody factor are also studied.
△ Less
Submitted 27 December, 2023;
originally announced December 2023.