-
Text-Guided Vector Graphics Customization
Authors:
Peiying Zhang,
Nanxuan Zhao,
**g Liao
Abstract:
Vector graphics are widely used in digital art and valued by designers for their scalability and layer-wise topological properties. However, the creation and editing of vector graphics necessitate creativity and design expertise, leading to a time-consuming process. In this paper, we propose a novel pipeline that generates high-quality customized vector graphics based on textual prompts while pres…
▽ More
Vector graphics are widely used in digital art and valued by designers for their scalability and layer-wise topological properties. However, the creation and editing of vector graphics necessitate creativity and design expertise, leading to a time-consuming process. In this paper, we propose a novel pipeline that generates high-quality customized vector graphics based on textual prompts while preserving the properties and layer-wise information of a given exemplar SVG. Our method harnesses the capabilities of large pre-trained text-to-image models. By fine-tuning the cross-attention layers of the model, we generate customized raster images guided by textual prompts. To initialize the SVG, we introduce a semantic-based path alignment method that preserves and transforms crucial paths from the exemplar SVG. Additionally, we optimize path parameters using both image-level and vector-level losses, ensuring smooth shape deformation while aligning with the customized raster image. We extensively evaluate our method using multiple metrics from vector-level, image-level, and text-level perspectives. The evaluation results demonstrate the effectiveness of our pipeline in generating diverse customizations of vector graphics with exceptional quality. The project page is https://intchous.github.io/SVGCustomization.
△ Less
Submitted 21 September, 2023;
originally announced September 2023.
-
A class-weighted supervised contrastive learning long-tailed bearing fault diagnosis approach using quadratic neural network
Authors:
Wei-En Yu,
**wei Sun,
Shi** Zhang,
Xiaoge Zhang,
**g-Xiao Liao
Abstract:
Deep learning has achieved remarkable success in bearing fault diagnosis. However, its performance oftentimes deteriorates when dealing with highly imbalanced or long-tailed data, while such cases are prevalent in industrial settings because fault is a rare event that occurs with an extremely low probability. Conventional data augmentation methods face fundamental limitations due to the scarcity o…
▽ More
Deep learning has achieved remarkable success in bearing fault diagnosis. However, its performance oftentimes deteriorates when dealing with highly imbalanced or long-tailed data, while such cases are prevalent in industrial settings because fault is a rare event that occurs with an extremely low probability. Conventional data augmentation methods face fundamental limitations due to the scarcity of samples pertaining to the minority class. In this paper, we propose a supervised contrastive learning approach with a class-aware loss function to enhance the feature extraction capability of neural networks for fault diagnosis. The developed class-weighted contrastive learning quadratic network (CCQNet) consists of a quadratic convolutional residual network backbone, a contrastive learning branch utilizing a class-weighted contrastive loss, and a classifier branch employing logit-adjusted cross-entropy loss. By utilizing class-weighted contrastive loss and logit-adjusted cross-entropy loss, our approach encourages equidistant representation of class features, thereby inducing equal attention on all the classes. We further analyze the superior feature extraction ability of quadratic network by establishing the connection between quadratic neurons and autocorrelation in signal processing. Experimental results on public and proprietary datasets are used to validate the effectiveness of CCQNet, and computational results reveal that CCQNet outperforms SOTA methods in handling extremely imbalanced data substantially.
△ Less
Submitted 20 September, 2023;
originally announced September 2023.
-
Preserving Tumor Volumes for Unsupervised Medical Image Registration
Authors:
Qihua Dong,
Hao Du,
Ying Song,
Yan Xu,
**g Liao
Abstract:
Medical image registration is a critical task that estimates the spatial correspondence between pairs of images. However, current traditional and deep-learning-based methods rely on similarity measures to generate a deforming field, which often results in disproportionate volume changes in dissimilar regions, especially in tumor regions. These changes can significantly alter the tumor size and und…
▽ More
Medical image registration is a critical task that estimates the spatial correspondence between pairs of images. However, current traditional and deep-learning-based methods rely on similarity measures to generate a deforming field, which often results in disproportionate volume changes in dissimilar regions, especially in tumor regions. These changes can significantly alter the tumor size and underlying anatomy, which limits the practical use of image registration in clinical diagnosis. To address this issue, we have formulated image registration with tumors as a constraint problem that preserves tumor volumes while maximizing image similarity in other normal regions. Our proposed strategy involves a two-stage process. In the first stage, we use similarity-based registration to identify potential tumor regions by their volume change, generating a soft tumor mask accordingly. In the second stage, we propose a volume-preserving registration with a novel adaptive volume-preserving loss that penalizes the change in size adaptively based on the masks calculated from the previous stage. Our approach balances image similarity and volume preservation in different regions, i.e., normal and tumor regions, by using soft tumor masks to adjust the imposition of volume-preserving loss on each one. This ensures that the tumor volume is preserved during the registration process. We have evaluated our strategy on various datasets and network architectures, demonstrating that our method successfully preserves the tumor volume while achieving comparable registration results with state-of-the-art methods. Our codes is available at: \url{https://dddraxxx.github.io/Volume-Preserving-Registration/}.
△ Less
Submitted 9 May, 2024; v1 submitted 18 September, 2023;
originally announced September 2023.
-
Two-mode correlated multiphoton bundle emission
Authors:
Yi Wang,
Fen Zou,
Jie-Qiao Liao
Abstract:
The preparation of correlated multiphoton sources is an important research topic in quantum optics and quantum information science. Here, two-mode correlated multiphoton bundle emission in a nondegenerate multiphoton Jaynes-Cummings model, which is comprised of a two-level system coupled with two cavity modes is studied. The two-level system is driven by a near-resonant strong laser such that the…
▽ More
The preparation of correlated multiphoton sources is an important research topic in quantum optics and quantum information science. Here, two-mode correlated multiphoton bundle emission in a nondegenerate multiphoton Jaynes-Cummings model, which is comprised of a two-level system coupled with two cavity modes is studied. The two-level system is driven by a near-resonant strong laser such that the Mollow regime dominates the physical processes in this system. Under certain resonance conditions, a perfect super-Rabi oscillation between the zero-photon state $|0\rangle_{a}|0\rangle_{b}$ and the ($n+m$)-photon state $|n\rangle_{a}|m\rangle_{b}$ of the two cavity modes can take place. Induced by the photon decay, the two-mode correlated multiphoton bundle emission occurs in this system. More importantly, the results show that there is an antibunching effect between the strongly-correlated photon bundles, so that the system behaves as an antibunched ($n+m$)-photon source. The work opens up a route towards achieving two-mode correlated multiphoton source device, which has potential applications in modern quantum technology.
△ Less
Submitted 26 October, 2023; v1 submitted 15 September, 2023;
originally announced September 2023.
-
Entangling two giant atoms via a topological waveguide
Authors:
Wen-Bin Luo,
Xian-Li Yin,
Jie-Qiao Liao
Abstract:
The entanglement generation of two two-level giant atoms coupled to a photonic waveguide, which is formed by a Su-Schrieffer-Heeger (SSH) type coupled-cavity array is studied. Here, each atom is coupled to the waveguide through two coupling points. The two-atom separate-coupling case is studied, and 16 coupling configurations are considered for the coupling-point distributions between the two atom…
▽ More
The entanglement generation of two two-level giant atoms coupled to a photonic waveguide, which is formed by a Su-Schrieffer-Heeger (SSH) type coupled-cavity array is studied. Here, each atom is coupled to the waveguide through two coupling points. The two-atom separate-coupling case is studied, and 16 coupling configurations are considered for the coupling-point distributions between the two atoms and the waveguide. Quantum master equations are derived to govern the evolution of the two atoms and characterize atomic entanglement by calculating the concurrence of the two-atom states. It is found that the two giant-atom entanglement depends on the coupling configurations and the coupling-point distance of the giant atoms. In particular, the entanglement dynamics of the two giant atoms in 14 coupling configurations depend on the dimerization parameter of the SSH waveguide. According to the self-energies of the two giant atoms, it is found that ten of these 16 coupling configurations can be divided into five pairs. It is also showed that the delayed sudden birth of entanglement between the two giant atoms is largely enhanced in these five pairs of coupling configurations. This work will promote the study of quantum effects and coherent manipulation in giant-atom topological-waveguide-QED systems.
△ Less
Submitted 8 May, 2024; v1 submitted 15 September, 2023;
originally announced September 2023.
-
Real-time Monitoring for the Next Core-Collapse Supernova in JUNO
Authors:
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato,
Marco Beretta,
Antonio Bergnoli
, et al. (606 additional authors not shown)
Abstract:
The core-collapse supernova (CCSN) is considered one of the most energetic astrophysical events in the universe. The early and prompt detection of neutrinos before (pre-SN) and during the supernova (SN) burst presents a unique opportunity for multi-messenger observations of CCSN events. In this study, we describe the monitoring concept and present the sensitivity of the system to pre-SN and SN neu…
▽ More
The core-collapse supernova (CCSN) is considered one of the most energetic astrophysical events in the universe. The early and prompt detection of neutrinos before (pre-SN) and during the supernova (SN) burst presents a unique opportunity for multi-messenger observations of CCSN events. In this study, we describe the monitoring concept and present the sensitivity of the system to pre-SN and SN neutrinos at the Jiangmen Underground Neutrino Observatory (JUNO), a 20 kton liquid scintillator detector currently under construction in South China. The real-time monitoring system is designed to ensure both prompt alert speed and comprehensive coverage of progenitor stars. It incorporates prompt monitors on the electronic board as well as online monitors at the data acquisition stage. Assuming a false alert rate of 1 per year, this monitoring system exhibits sensitivity to pre-SN neutrinos up to a distance of approximately 1.6 (0.9) kiloparsecs and SN neutrinos up to about 370 (360) kiloparsecs for a progenitor mass of 30 solar masses, considering both normal and inverted mass ordering scenarios. The pointing ability of the CCSN is evaluated by analyzing the accumulated event anisotropy of inverse beta decay interactions from pre-SN or SN neutrinos. This, along with the early alert, can play a crucial role in facilitating follow-up multi-messenger observations of the next galactic or nearby extragalactic CCSN.
△ Less
Submitted 4 December, 2023; v1 submitted 13 September, 2023;
originally announced September 2023.
-
Recursive Error Reduction for Regular Branching Programs
Authors:
Eshan Chattopadhyay,
Jyun-Jie Liao
Abstract:
In a recent work, Chen, Hoza, Lyu, Tal and Wu (FOCS 2023) showed an improved error reduction framework for the derandomization of regular read-once branching programs (ROBPs). Their result is based on a clever modification to the inverse Laplacian perspective of space-bounded derandomization, which was originally introduced by Ahmadinejad, Kelner, Murtagh, Peebles, Sidford and Vadhan (FOCS 2020).…
▽ More
In a recent work, Chen, Hoza, Lyu, Tal and Wu (FOCS 2023) showed an improved error reduction framework for the derandomization of regular read-once branching programs (ROBPs). Their result is based on a clever modification to the inverse Laplacian perspective of space-bounded derandomization, which was originally introduced by Ahmadinejad, Kelner, Murtagh, Peebles, Sidford and Vadhan (FOCS 2020).
In this work, we give an alternative error reduction framework for regular ROBPs. Our new framework is based on a binary recursive formula from the work of Chattopadhyay and Liao (CCC 2020), that they used to construct weighted pseudorandom generators (WPRGs) for general ROBPs.
Based on our new error reduction framework, we give alternative proofs to the following results for regular ROBPs of length $n$ and width $w$, both of which were proved in the work of Chen et al. using their error reduction:
$\bullet$ There is a WPRG with error $\varepsilon$ that has seed length $\tilde{O}(\log(n)(\sqrt{\log(1/\varepsilon)}+\log(w))+\log(1/\varepsilon)).$
$\bullet$ There is a (non-black-box) deterministic algorithm which estimates the expectation of any such program within error $\pm\varepsilon$ with space complexity $\tilde{O}(\log(nw)\cdot\log\log(1/\varepsilon)).$ (This was first proved in the work of Ahmadinejad et al., but the proof by Chen et al. is simpler.)
Because of the binary recursive nature of our new framework, both of our proofs are based on a straightforward induction that is arguably simpler than the Laplacian-based proof in the work of Chen et al.
△ Less
Submitted 6 December, 2023; v1 submitted 8 September, 2023;
originally announced September 2023.
-
How to Evaluate the Generalization of Detection? A Benchmark for Comprehensive Open-Vocabulary Detection
Authors:
Yiyang Yao,
Peng Liu,
Tiancheng Zhao,
Qianqian Zhang,
Jiajia Liao,
Chunxin Fang,
Kyusong Lee,
Qing Wang
Abstract:
Object detection (OD) in computer vision has made significant progress in recent years, transitioning from closed-set labels to open-vocabulary detection (OVD) based on large-scale vision-language pre-training (VLP). However, current evaluation methods and datasets are limited to testing generalization over object types and referral expressions, which do not provide a systematic, fine-grained, and…
▽ More
Object detection (OD) in computer vision has made significant progress in recent years, transitioning from closed-set labels to open-vocabulary detection (OVD) based on large-scale vision-language pre-training (VLP). However, current evaluation methods and datasets are limited to testing generalization over object types and referral expressions, which do not provide a systematic, fine-grained, and accurate benchmark of OVD models' abilities. In this paper, we propose a new benchmark named OVDEval, which includes 9 sub-tasks and introduces evaluations on commonsense knowledge, attribute understanding, position understanding, object relation comprehension, and more. The dataset is meticulously created to provide hard negatives that challenge models' true understanding of visual and linguistic input. Additionally, we identify a problem with the popular Average Precision (AP) metric when benchmarking models on these fine-grained label datasets and propose a new metric called Non-Maximum Suppression Average Precision (NMS-AP) to address this issue. Extensive experimental results show that existing top OVD models all fail on the new tasks except for simple object types, demonstrating the value of the proposed dataset in pinpointing the weakness of current OVD models and guiding future research. Furthermore, the proposed NMS-AP metric is verified by experiments to provide a much more truthful evaluation of OVD models, whereas traditional AP metrics yield deceptive results. Data is available at \url{https://github.com/om-ai-lab/OVDEval}
△ Less
Submitted 18 December, 2023; v1 submitted 25 August, 2023;
originally announced August 2023.
-
A Successive Two-stage Method for Sparse Generalized Eigenvalue Problems
Authors:
Qia Li,
Jianmin Liao,
Lixin Shen,
Na Zhang
Abstract:
The Sparse Generalized Eigenvalue Problem (sGEP), a pervasive challenge in statistical learning methods including sparse principal component analysis, sparse Fisher's discriminant analysis, and sparse canonical correlation analysis, presents significant computational complexity due to its NP-hardness. The primary aim of sGEP is to derive a sparse vector approximation of the largest generalized eig…
▽ More
The Sparse Generalized Eigenvalue Problem (sGEP), a pervasive challenge in statistical learning methods including sparse principal component analysis, sparse Fisher's discriminant analysis, and sparse canonical correlation analysis, presents significant computational complexity due to its NP-hardness. The primary aim of sGEP is to derive a sparse vector approximation of the largest generalized eigenvector, effectively posing this as a sparse optimization problem. Conventional algorithms for sGEP, however, often succumb to local optima and exhibit significant dependency on initial points. This predicament necessitates a more refined approach to avoid local optima and achieve an improved solution in terms of sGEP's objective value, which we address in this paper through a novel successive two-stage method. The first stage of this method incorporates an algorithm for sGEP capable of yielding a stationary point from any initial point. The subsequent stage refines this stationary point by adjusting its support, resulting in a point with an enhanced objective value relative to the original stationary point. This support adjustment is achieved through a novel procedure we have named support alteration. The final point derived from the second stage then serves as the initial point for the algorithm in the first stage, creating a cyclical process that continues until a predetermined stop** criterion is satisfied. We also provide a comprehensive convergence analysis of this process. Through extensive experimentation under various settings, our method has demonstrated significant improvements in the objective value of sGEP compared to existing methodologies, underscoring its potential as a valuable tool in statistical learning and optimization.
△ Less
Submitted 23 August, 2023;
originally announced August 2023.
-
Room temperature magnetic phase transition in an electrically-tuned van der Waals ferromagnet
Authors:
Cheng Tan,
Ji-Hai Liao,
Guolin Zheng,
Meri Algarni,
Jia-Yi Lin,
Xiang Ma,
Edwin L. H. Mayes,
Matthew R. Field,
Sultan Albarakati,
Majid Panahandeh-Fard,
Saleh Alzahrani,
Guopeng Wang,
Yuanjun Yang,
Dimitrie Culcer,
James Partridge,
Mingliang Tian,
Bin Xiang,
Yu-Jun Zhao,
Lan Wang
Abstract:
Finding tunable van der Waals (vdW) ferromagnets that operate at above room temperature is an important research focus in physics and materials science. Most vdW magnets are only intrinsically magnetic far below room temperature and magnetism with square-shaped hysteresis at room-temperature has yet to be observed. Here, we report magnetism in a quasi-2D magnet Cr1.2Te2 observed at room temperatur…
▽ More
Finding tunable van der Waals (vdW) ferromagnets that operate at above room temperature is an important research focus in physics and materials science. Most vdW magnets are only intrinsically magnetic far below room temperature and magnetism with square-shaped hysteresis at room-temperature has yet to be observed. Here, we report magnetism in a quasi-2D magnet Cr1.2Te2 observed at room temperature (290 K). This magnetism was tuned via a protonic gate with an electron do** concentration up to 3.8 * 10^21 cm^-3. We observed non-monotonic evolutions in both coercivity and anomalous Hall resistivity. Under increased electron do**, the coercivities and anomalous Hall effects (AHEs) vanished, indicating a do**-induced magnetic phase transition. This occurred up to room temperature. DFT calculations showed the formation of an antiferromagnetic (AFM) phase caused by the intercalation of protons which induced significant electron do** in the Cr1.2Te2. The tunability of the magnetic properties and phase in room temperature magnetic vdW Cr1.2Te2 is a significant step towards practical spintronic devices.
△ Less
Submitted 19 March, 2024; v1 submitted 20 August, 2023;
originally announced August 2023.
-
Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling
Authors:
Guiqin Wang,
Peng Zhao,
Cong Zhao,
Shusen Yang,
Jie Cheng,
Luziwei Leng,
Jianxing Liao,
Qinghai Guo
Abstract:
Weakly-supervised action localization aims to recognize and localize action instancese in untrimmed videos with only video-level labels. Most existing models rely on multiple instance learning(MIL), where the predictions of unlabeled instances are supervised by classifying labeled bags. The MIL-based methods are relatively well studied with cogent performance achieved on classification but not on…
▽ More
Weakly-supervised action localization aims to recognize and localize action instancese in untrimmed videos with only video-level labels. Most existing models rely on multiple instance learning(MIL), where the predictions of unlabeled instances are supervised by classifying labeled bags. The MIL-based methods are relatively well studied with cogent performance achieved on classification but not on localization. Generally, they locate temporal regions by the video-level classification but overlook the temporal variations of feature semantics. To address this problem, we propose a novel attention-based hierarchically-structured latent model to learn the temporal variations of feature semantics. Specifically, our model entails two components, the first is an unsupervised change-points detection module that detects change-points by learning the latent representations of video features in a temporal hierarchy based on their rates of change, and the second is an attention-based classification model that selects the change-points of the foreground as the boundaries. To evaluate the effectiveness of our model, we conduct extensive experiments on two benchmark datasets, THUMOS-14 and ActivityNet-v1.3. The experiments show that our method outperforms current state-of-the-art methods, and even achieves comparable performance with fully-supervised methods.
△ Less
Submitted 25 September, 2023; v1 submitted 19 August, 2023;
originally announced August 2023.
-
Generation of two-giant-atom entanglement in waveguide-QED systems
Authors:
Xian-Li Yin,
Jie-Qiao Liao
Abstract:
We study the generation of quantum entanglement between two giant atoms coupled to a one-dimensional waveguide. Since each giant atom interacts with the waveguide at two separate coupling points, there exist three different coupling configurations in the two-atom waveguide system: separated, braided, and nested couplings. Within the Wigner-Weisskopf framework for single coupling points, the quantu…
▽ More
We study the generation of quantum entanglement between two giant atoms coupled to a one-dimensional waveguide. Since each giant atom interacts with the waveguide at two separate coupling points, there exist three different coupling configurations in the two-atom waveguide system: separated, braided, and nested couplings. Within the Wigner-Weisskopf framework for single coupling points, the quantum master equations governing the evolution of the two giant atoms are obtained. For each coupling configuration, the entanglement dynamics of the two giant atoms is studied, including the cases of two different atomic initial states: single- and double-excitation states. It is shown that the generated entanglement depends on the coupling configuration, phase shift, and atomic initial state. For the single-excitation initial state, there exists steady-state entanglement for these three couplings due to the appearance of the dark state. For the double-excitation initial state, an entanglement sudden birth is observed via adjusting the phase shift. In particular, the maximal entanglement for the nested coupling is about one order of magnitude larger than those of separate and braided couplings. In addition, the influence of the atomic frequency detuning on the entanglement generation is studied. This work can be utilized for the generation and control of atomic entanglement in quantum networks based on giant-atom waveguide-QED systems, which have wide potential applications in quantum information processing.
△ Less
Submitted 30 August, 2023; v1 submitted 15 August, 2023;
originally announced August 2023.
-
SAILOR: Structural Augmentation Based Tail Node Representation Learning
Authors:
Jie Liao,
**tang Li,
Liang Chen,
Bingzhe Wu,
Yatao Bian,
Zibin Zheng
Abstract:
Graph Neural Networks (GNNs) have achieved state-of-the-art performance in representation learning for graphs recently. However, the effectiveness of GNNs, which capitalize on the key operation of message propagation, highly depends on the quality of the topology structure. Most of the graphs in real-world scenarios follow a long-tailed distribution on their node degrees, that is, a vast majority…
▽ More
Graph Neural Networks (GNNs) have achieved state-of-the-art performance in representation learning for graphs recently. However, the effectiveness of GNNs, which capitalize on the key operation of message propagation, highly depends on the quality of the topology structure. Most of the graphs in real-world scenarios follow a long-tailed distribution on their node degrees, that is, a vast majority of the nodes in the graph are tail nodes with only a few connected edges. GNNs produce inferior node representations for tail nodes since they lack structural information. In the pursuit of promoting the expressiveness of GNNs for tail nodes, we explore how the deficiency of structural information deteriorates the performance of tail nodes and propose a general Structural Augmentation based taIL nOde Representation learning framework, dubbed as SAILOR, which can jointly learn to augment the graph structure and extract more informative representations for tail nodes. Extensive experiments on public benchmark datasets demonstrate that SAILOR can significantly improve the tail node representations and outperform the state-of-the-art baselines.
△ Less
Submitted 14 August, 2023; v1 submitted 13 August, 2023;
originally announced August 2023.
-
Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination
Authors:
Haoxuan Li,
Yi Bin,
Junrong Liao,
Yang Yang,
Heng Tao Shen
Abstract:
Most existing image-text matching methods adopt triplet loss as the optimization objective, and choosing a proper negative sample for the triplet of <anchor, positive, negative> is important for effectively training the model, e.g., hard negatives make the model learn efficiently and effectively. However, we observe that existing methods mainly employ the most similar samples as hard negatives, wh…
▽ More
Most existing image-text matching methods adopt triplet loss as the optimization objective, and choosing a proper negative sample for the triplet of <anchor, positive, negative> is important for effectively training the model, e.g., hard negatives make the model learn efficiently and effectively. However, we observe that existing methods mainly employ the most similar samples as hard negatives, which may not be true negatives. In other words, the samples with high similarity but not paired with the anchor may reserve positive semantic associations, and we call them false negatives. Repelling these false negatives in triplet loss would mislead the semantic representation learning and result in inferior retrieval performance. In this paper, we propose a novel False Negative Elimination (FNE) strategy to select negatives via sampling, which could alleviate the problem introduced by false negatives. Specifically, we first construct the distributions of positive and negative samples separately via their similarities with the anchor, based on the features extracted from image and text encoders. Then we calculate the false negative probability of a given sample based on its similarity with the anchor and the above distributions via the Bayes' rule, which is employed as the sampling weight during negative sampling process. Since there may not exist any false negative in a small batch size, we design a memory module with momentum to retain a large negative buffer and implement our negative sampling strategy spanning over the buffer. In addition, to make the model focus on hard negatives, we reassign the sampling weights for the simple negatives with a cut-down strategy. The extensive experiments are conducted on Flickr30K and MS-COCO, and the results demonstrate the superiority of our proposed false negative elimination strategy. The code is available at https://github.com/LuminosityX/FNE.
△ Less
Submitted 8 August, 2023;
originally announced August 2023.
-
Hilton-Milner theorem for $k$-multisets
Authors:
Jiaqi Liao,
Zequn Lv,
Mengyu Cao,
Mei Lu
Abstract:
Let $ k, n \in \mathbb{N}^+ $ and $ m \in \mathbb{N}^+ \cup \{\infty \} $. A $ k $-multiset in $ [n]_m $ is a $ k $-set whose elements are integers from $ \{1, 2, \ldots, n\} $, and each element is allowed to have at most $ m $ repetitions. A family of $ k $-multisets in $ [n]_m $ is said to be intersecting if every pair of $ k $-multisets from the family have non-empty intersection. In this paper…
▽ More
Let $ k, n \in \mathbb{N}^+ $ and $ m \in \mathbb{N}^+ \cup \{\infty \} $. A $ k $-multiset in $ [n]_m $ is a $ k $-set whose elements are integers from $ \{1, 2, \ldots, n\} $, and each element is allowed to have at most $ m $ repetitions. A family of $ k $-multisets in $ [n]_m $ is said to be intersecting if every pair of $ k $-multisets from the family have non-empty intersection. In this paper, we give the size and structure of the largest non-trivial intersecting family of $ k $-multisets in $ [n]_m $ for $ n \geq k + \lceil k/m \rceil $. In the special case when $m=\infty$, our result gives rise to an unbounded multiset version for Hilton-Milner Theorem given by Meagher and Purdy. Furthermore, our main theorem unites the statements of the Hilton-Milner Theorem for finite sets and unbounded multisets.
△ Less
Submitted 6 July, 2024; v1 submitted 7 August, 2023;
originally announced August 2023.
-
Rethinking Class Activation Maps for Segmentation: Revealing Semantic Information in Shallow Layers by Reducing Noise
Authors:
Hang-Cheng Dong,
Yuhao Jiang,
Yingyan Huang,
**gxiao Liao,
Bingguo Liu,
Dong Ye,
Guodong Liu
Abstract:
Class activation maps are widely used for explaining deep neural networks. Due to its ability to highlight regions of interest, it has evolved in recent years as a key step in weakly supervised learning. A major limitation to the performance of the class activation maps is the small spatial resolution of the feature maps in the last layer of the convolutional neural network. Therefore, we expect t…
▽ More
Class activation maps are widely used for explaining deep neural networks. Due to its ability to highlight regions of interest, it has evolved in recent years as a key step in weakly supervised learning. A major limitation to the performance of the class activation maps is the small spatial resolution of the feature maps in the last layer of the convolutional neural network. Therefore, we expect to generate high-resolution feature maps that result in high-quality semantic information. In this paper, we rethink the properties of semantic information in shallow feature maps. We find that the shallow feature maps still have fine-grained non-discriminative features while mixing considerable non-target noise. Furthermore, we propose a simple gradient-based denoising method to filter the noise by truncating the positive gradient. Our proposed scheme can be easily deployed in other CAM-related methods, facilitating these methods to obtain higher-quality class activation maps. We evaluate the proposed approach through a weakly-supervised semantic segmentation task, and a large number of experiments demonstrate the effectiveness of our approach.
△ Less
Submitted 3 August, 2023;
originally announced August 2023.
-
BearingPGA-Net: A Lightweight and Deployable Bearing Fault Diagnosis Network via Decoupled Knowledge Distillation and FPGA Acceleration
Authors:
**g-Xiao Liao,
Sheng-Lai Wei,
Chen-Long Xie,
Tieyong Zeng,
**wei Sun,
Shi** Zhang,
Xiaoge Zhang,
Feng-Lei Fan
Abstract:
Deep learning has achieved remarkable success in the field of bearing fault diagnosis. However, this success comes with larger models and more complex computations, which cannot be transferred into industrial fields requiring models to be of high speed, strong portability, and low power consumption. In this paper, we propose a lightweight and deployable model for bearing fault diagnosis, referred…
▽ More
Deep learning has achieved remarkable success in the field of bearing fault diagnosis. However, this success comes with larger models and more complex computations, which cannot be transferred into industrial fields requiring models to be of high speed, strong portability, and low power consumption. In this paper, we propose a lightweight and deployable model for bearing fault diagnosis, referred to as BearingPGA-Net, to address these challenges. Firstly, aided by a well-trained large model, we train BearingPGA-Net via decoupled knowledge distillation. Despite its small size, our model demonstrates excellent fault diagnosis performance compared to other lightweight state-of-the-art methods. Secondly, we design an FPGA acceleration scheme for BearingPGA-Net using Verilog. This scheme involves the customized quantization and designing programmable logic gates for each layer of BearingPGA-Net on the FPGA, with an emphasis on parallel computing and module reuse to enhance the computational speed. To the best of our knowledge, this is the first instance of deploying a CNN-based bearing fault diagnosis model on an FPGA. Experimental results reveal that our deployment scheme achieves over 200 times faster diagnosis speed compared to CPU, while achieving a lower-than-0.4\% performance drop in terms of F1, Recall, and Precision score on our independently-collected bearing dataset. Our code is available at \url{https://github.com/asdvfghg/BearingPGA-Net}.
△ Less
Submitted 30 July, 2023;
originally announced July 2023.
-
How Does Diffusion Influence Pretrained Language Models on Out-of-Distribution Data?
Authors:
Huazheng Wang,
Daixuan Cheng,
Haifeng Sun,
**gyu Wang,
Qi Qi,
Jianxin Liao,
**g Wang,
Cong Liu
Abstract:
Transformer-based pretrained language models (PLMs) have achieved great success in modern NLP. An important advantage of PLMs is good out-of-distribution (OOD) robustness. Recently, diffusion models have attracted a lot of work to apply diffusion to PLMs. It remains under-explored how diffusion influences PLMs on OOD data. The core of diffusion models is a forward diffusion process which gradually…
▽ More
Transformer-based pretrained language models (PLMs) have achieved great success in modern NLP. An important advantage of PLMs is good out-of-distribution (OOD) robustness. Recently, diffusion models have attracted a lot of work to apply diffusion to PLMs. It remains under-explored how diffusion influences PLMs on OOD data. The core of diffusion models is a forward diffusion process which gradually applies Gaussian noise to inputs, and a reverse denoising process which removes noise. The noised input reconstruction is a fundamental ability of diffusion models. We directly analyze OOD robustness by measuring the reconstruction loss, including testing the abilities to reconstruct OOD data, and to detect OOD samples. Experiments are conducted by analyzing different training parameters and data statistical features on eight datasets. It shows that finetuning PLMs with diffusion degrades the reconstruction ability on OOD data. The comparison also shows that diffusion models can effectively detect OOD samples, achieving state-of-the-art performance in most of the datasets with an absolute accuracy improvement up to 18%. These results indicate that diffusion reduces OOD robustness of PLMs.
△ Less
Submitted 26 July, 2023;
originally announced July 2023.
-
Automotive Object Detection via Learning Sparse Events by Spiking Neurons
Authors:
Hu Zhang,
Yanchen Li,
Luziwei Leng,
Kaiwei Che,
Qian Liu,
Qinghai Guo,
Jianxing Liao,
Ran Cheng
Abstract:
Event-based sensors, distinguished by their high temporal resolution of 1 $\mathrmμ\text{s}$ and a dynamic range of 120 $\text{dB}$, stand out as ideal tools for deployment in fast-paced settings like vehicles and drones. Traditional object detection techniques that utilize Artificial Neural Networks (ANNs) face challenges due to the sparse and asynchronous nature of the events these sensors captu…
▽ More
Event-based sensors, distinguished by their high temporal resolution of 1 $\mathrmμ\text{s}$ and a dynamic range of 120 $\text{dB}$, stand out as ideal tools for deployment in fast-paced settings like vehicles and drones. Traditional object detection techniques that utilize Artificial Neural Networks (ANNs) face challenges due to the sparse and asynchronous nature of the events these sensors capture. In contrast, Spiking Neural Networks (SNNs) offer a promising alternative, providing a temporal representation that is inherently aligned with event-based data. This paper explores the unique membrane potential dynamics of SNNs and their ability to modulate sparse events. We introduce an innovative spike-triggered adaptive threshold mechanism designed for stable training. Building on these insights, we present a specialized spiking feature pyramid network (SpikeFPN) optimized for automotive event-based object detection. Comprehensive evaluations demonstrate that SpikeFPN surpasses both traditional SNNs and advanced ANNs enhanced with attention mechanisms. Evidently, SpikeFPN achieves a mean Average Precision (mAP) of 0.477 on the GEN1 Automotive Detection (GAD) benchmark dataset, marking significant increases over the selected SNN baselines. Moreover, the efficient design of SpikeFPN ensures robust performance while optimizing computational resources, attributed to its innate sparse computation capabilities. Source codes are publicly accessible at https://github.com/EMI-Group/spikefpn.
△ Less
Submitted 10 June, 2024; v1 submitted 24 July, 2023;
originally announced July 2023.
-
Convex Optimal Power Flow Based on Power Injection-based Equations and Its Application in Bipolar DC Distribution Network
Authors:
Yiyao Zhou,
Qianggang Wang,
Yuan Chi,
Jianquan Liao,
Tao Huang,
Niancheng Zhou,
Xiaolong Xu,
Xuefei Zhang
Abstract:
Optimal power flow (OPF) is a fundamental tool for analyzing the characteristics of bipolar DC distribution network (DCDN). However, existing OPF models face challenges in reflecting the power distribution and exchange of bipolar DCDN directly since its decision variables are voltage and current. This paper addresses this issue by establishing a convex OPF model that can be used for the planning a…
▽ More
Optimal power flow (OPF) is a fundamental tool for analyzing the characteristics of bipolar DC distribution network (DCDN). However, existing OPF models face challenges in reflecting the power distribution and exchange of bipolar DCDN directly since its decision variables are voltage and current. This paper addresses this issue by establishing a convex OPF model that can be used for the planning and operation of bipolar DCDN. First, the power flow characteristics of bipolar DCDN are revealed through power injection-based equations, upon which the original OPF model is established. Next, the original OPF model undergoes a transformation into a convex OPF model based on second-order cone programming (SOCP) through variable substitution, secondorder cone relaxation, McCormick relaxation, and first-order Taylor expansion, respectively. Finally, the sequence bound tightening algorithm (STBA) is employed to tighten the boundaries of McCormick envelopes in each iteration to ensure the exactness of the convex OPF model. The effectiveness of this novel OPF model for bipolar DCDN is verified through two case studies, i.e., capacity configuration of distributed generation (DG) and operation optimization of bipolar DCDN.
△ Less
Submitted 5 July, 2023;
originally announced July 2023.
-
Effective Transfer of Pretrained Large Visual Model for Fabric Defect Segmentation via Specifc Knowledge Injection
Authors:
Zhewei Chen,
Wai Keung Wong,
Zuofeng Zhong,
**piao Liao,
Ying Qu
Abstract:
Fabric defect segmentation is integral to textile quality control. Despite this, the scarcity of high-quality annotated data and the diversity of fabric defects present significant challenges to the application of deep learning in this field. These factors limit the generalization and segmentation performance of existing models, impeding their ability to handle the complexity of diverse fabric typ…
▽ More
Fabric defect segmentation is integral to textile quality control. Despite this, the scarcity of high-quality annotated data and the diversity of fabric defects present significant challenges to the application of deep learning in this field. These factors limit the generalization and segmentation performance of existing models, impeding their ability to handle the complexity of diverse fabric types and defects. To overcome these obstacles, this study introduces an innovative method to infuse specialized knowledge of fabric defects into the Segment Anything Model (SAM), a large-scale visual model. By introducing and training a unique set of fabric defect-related parameters, this approach seamlessly integrates domain-specific knowledge into SAM without the need for extensive modifications to the pre-existing model parameters. The revamped SAM model leverages generalized image understanding learned from large-scale natural image datasets while incorporating fabric defect-specific knowledge, ensuring its proficiency in fabric defect segmentation tasks. The experimental results reveal a significant improvement in the model's segmentation performance, attributable to this novel amalgamation of generic and fabric-specific knowledge. When benchmarking against popular existing segmentation models across three datasets, our proposed model demonstrates a substantial leap in performance. Its impressive results in cross-dataset comparisons and few-shot learning experiments further demonstrate its potential for practical applications in textile quality control.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
Continuous Layout Editing of Single Images with Diffusion Models
Authors:
Zhiyuan Zhang,
Zhitong Huang,
**g Liao
Abstract:
Recent advancements in large-scale text-to-image diffusion models have enabled many applications in image editing. However, none of these methods have been able to edit the layout of single existing images. To address this gap, we propose the first framework for layout editing of a single image while preserving its visual properties, thus allowing for continuous editing on a single image. Our appr…
▽ More
Recent advancements in large-scale text-to-image diffusion models have enabled many applications in image editing. However, none of these methods have been able to edit the layout of single existing images. To address this gap, we propose the first framework for layout editing of a single image while preserving its visual properties, thus allowing for continuous editing on a single image. Our approach is achieved through two key modules. First, to preserve the characteristics of multiple objects within an image, we disentangle the concepts of different objects and embed them into separate textual tokens using a novel method called masked textual inversion. Next, we propose a training-free optimization method to perform layout control for a pre-trained diffusion model, which allows us to regenerate images with learned concepts and align them with user-specified layouts. As the first framework to edit the layout of existing images, we demonstrate that our method is effective and outperforms other baselines that were modified to support this task. Our code will be freely available for public use upon acceptance.
△ Less
Submitted 22 June, 2023;
originally announced June 2023.
-
Efficient Deep Spiking Multi-Layer Perceptrons with Multiplication-Free Inference
Authors:
Boyan Li,
Luziwei Leng,
Shuaijie Shen,
Kaixuan Zhang,
Jianguo Zhang,
Jianxing Liao,
Ran Cheng
Abstract:
Advancements in adapting deep convolution architectures for Spiking Neural Networks (SNNs) have significantly enhanced image classification performance and reduced computational burdens. However, the inability of Multiplication-Free Inference (MFI) to align with attention and transformer mechanisms, which are critical to superior performance on high-resolution vision tasks, imposing limitations on…
▽ More
Advancements in adapting deep convolution architectures for Spiking Neural Networks (SNNs) have significantly enhanced image classification performance and reduced computational burdens. However, the inability of Multiplication-Free Inference (MFI) to align with attention and transformer mechanisms, which are critical to superior performance on high-resolution vision tasks, imposing limitations on these gains. To address this, our research explores a new pathway, drawing inspiration from the progress made in Multi-Layer Perceptrons (MLPs). We propose an innovative spiking MLP architecture that uses batch normalization to retain MFI compatibility and introducing a spiking patch encoding layer to enhance local feature extraction capabilities. As a result, we establish an efficient multi-stage spiking MLP network that blends effectively global receptive fields with local feature extraction for comprehensive spike-based computation. Without relying on pre-training or sophisticated SNN training techniques, our network secures a top-1 accuracy of 66.39% on the ImageNet-1K dataset, surpassing the directly trained spiking ResNet-34 by 2.67%. Furthermore, we curtail computational costs, model parameters, and simulation steps. An expanded version of our network compares with the performance of the spiking VGG-16 network with a 71.64% top-1 accuracy, all while operating with a model capacity 2.1 times smaller. Our findings highlight the potential of our deep SNN architecture in effectively integrating global and local learning abilities. Interestingly, the trained receptive field in our network mirrors the activity patterns of cortical cells. Source codes are publicly accessible at https://github.com/EMI-Group/mixer-snn.
△ Less
Submitted 26 April, 2024; v1 submitted 21 June, 2023;
originally announced June 2023.
-
Adaptive DNN Surgery for Selfish Inference Acceleration with On-demand Edge Resource
Authors:
Xiang Yang,
Dezhi Chen,
Qi Qi,
**gyu Wang,
Haifeng Sun,
Jianxin Liao,
Song Guo
Abstract:
Deep Neural Networks (DNNs) have significantly improved the accuracy of intelligent applications on mobile devices. DNN surgery, which partitions DNN processing between mobile devices and multi-access edge computing (MEC) servers, can enable real-time inference despite the computational limitations of mobile devices. However, DNN surgery faces a critical challenge: determining the optimal computin…
▽ More
Deep Neural Networks (DNNs) have significantly improved the accuracy of intelligent applications on mobile devices. DNN surgery, which partitions DNN processing between mobile devices and multi-access edge computing (MEC) servers, can enable real-time inference despite the computational limitations of mobile devices. However, DNN surgery faces a critical challenge: determining the optimal computing resource demand from the server and the corresponding partition strategy, while considering both inference latency and MEC server usage costs. This problem is compounded by two factors: (1) the finite computing capacity of the MEC server, which is shared among multiple devices, leading to inter-dependent demands, and (2) the shift in modern DNN architecture from chains to directed acyclic graphs (DAGs), which complicates potential solutions.
In this paper, we introduce a novel Decentralized DNN Surgery (DDS) framework. We formulate the partition strategy as a min-cut and propose a resource allocation game to adaptively schedule the demands of mobile devices in an MEC environment. We prove the existence of a Nash Equilibrium (NE), and develop an iterative algorithm to efficiently reach the NE for each device. Our extensive experiments demonstrate that DDS can effectively handle varying MEC scenarios, achieving up to 1.25$\times$ acceleration compared to the state-of-the-art algorithm.
△ Less
Submitted 21 June, 2023;
originally announced June 2023.
-
Chiral and nonreciprocal single-photon scattering in a chiral-giant-molecule waveguide-QED system
Authors:
Juan Zhou,
Xian-Li Yin,
Jie-Qiao Liao
Abstract:
We study chiral and nonreciprocal single-photon scattering in a chiral-giant-molecule waveguide-QED system. Here, the giant molecule consists of two coupled giant atoms, which interact with two linear waveguides, forming a four-port quantum device. We obtain the exact analytical expressions of the four scattering amplitudes using a real-space method. Under the Markovian limit, we find that the sin…
▽ More
We study chiral and nonreciprocal single-photon scattering in a chiral-giant-molecule waveguide-QED system. Here, the giant molecule consists of two coupled giant atoms, which interact with two linear waveguides, forming a four-port quantum device. We obtain the exact analytical expressions of the four scattering amplitudes using a real-space method. Under the Markovian limit, we find that the single-photon scattering behavior is determined by the coupling strength between the giant atoms and the waveguides, the coupling strength between the two giant atoms, and the nondipole effect caused by the phase accumulation of photons travelling between the coupling points. It is also found that chiral and nonreciprocal single-photon scattering can be realized by introducing the chiral coupling to break the symmetry in the coupling configuration between the giant molecule and the waveguides. In addition, an ideal chiral emitter-waveguide coupling enables a directional single-photon routing. In the non-Markovian regime, the scattering spectra are characterized by more abundant structures with multiple peaks and dips. In particular, we demonstrate that the non-Markovian retarded effect can induce the nonreciprocal single-photon scattering. Our results have potential applications in the design of optical quantum devices involving giant atoms, which can provide an efficient platform for studying chiral quantum optics.
△ Less
Submitted 19 June, 2023;
originally announced June 2023.
-
The First GECAM Observation Results on Terrestrial Gamma-ray Flashes and Terrestrial Electron Beams
Authors:
Y. Zhao,
J. C. Liu,
S. L. Xiong,
W. C. Xue,
Q. B. Yi,
G. P. Lu,
W. Xu,
F. C. Lyu,
J. C. Sun,
W. X. Peng,
C. Zheng,
Y. Q. Zhang,
C. Cai,
S. Xiao,
S. L. Xie,
C. W. Wang,
W. J. Tan,
Z. H. An,
G. Chen,
Y. Q. Du,
Y. Huang,
M. Gao,
K. Gong,
D. Y. Guo,
J. J. He
, et al. (37 additional authors not shown)
Abstract:
Gravitational-wave high-energy Electromagnetic Counterpart All-sky Monitor (GECAM) is a space-borne instrument dedicated to monitoring high-energy transients, including Terrestrial Gamma-ray Flashes (TGFs) and Terrestrial Electron Beams (TEBs). We implemented a TGF/TEB search algorithm for GECAM, with which 147 bright TGFs, 2 typical TEBs and 2 special TEB-like events are identified during an effe…
▽ More
Gravitational-wave high-energy Electromagnetic Counterpart All-sky Monitor (GECAM) is a space-borne instrument dedicated to monitoring high-energy transients, including Terrestrial Gamma-ray Flashes (TGFs) and Terrestrial Electron Beams (TEBs). We implemented a TGF/TEB search algorithm for GECAM, with which 147 bright TGFs, 2 typical TEBs and 2 special TEB-like events are identified during an effective observation time of $\sim$9 months. We show that, with gamma-ray and charged particle detectors, GECAM can effectively identify and distinguish TGFs and TEBs, and measure their temporal and spectral properties in detail. A very high TGF-lightning association rate of $\sim$80\% is obtained between GECAM and GLD360 in east Asia region.
△ Less
Submitted 17 June, 2023;
originally announced June 2023.
-
Insight-HXMT Measurements of the Diffuse X-ray Background
Authors:
Rui Huang,
Wei Cui,
**-Yuan Liao,
Shuo Zhang,
Si-Fan Wang,
**g **,
Xue Feng Lu,
Cheng-Cheng Guo,
Yuan You,
Gang Li,
Juan Zhang
Abstract:
We present an X-ray spectrum of the diffuse X-ray background (DXRB) between 1.5 and 120 keV, as measured with the Low-Energy Detector (LE) and the High-Energy Detector (HE) aboard the Insight-HXMT satellite, based on 'blank-sky' observations. LE covers a nominal energy range of 1-15 keV and HE 20-250 keV, but calibration issues and data quality narrowed the energy range for this work. The LE backg…
▽ More
We present an X-ray spectrum of the diffuse X-ray background (DXRB) between 1.5 and 120 keV, as measured with the Low-Energy Detector (LE) and the High-Energy Detector (HE) aboard the Insight-HXMT satellite, based on 'blank-sky' observations. LE covers a nominal energy range of 1-15 keV and HE 20-250 keV, but calibration issues and data quality narrowed the energy range for this work. The LE background was directly measured with `blind' detector modules, while the HE background was derived from Earth-occultation data. With the LE data alone, the measured DXRB spectrum can be well described by a power law; fitting the LE and HE data jointly, however, a spectral cut-off must be introduced in the model to account for the measurements above 30 keV. Modelling the combined spectrum with a cut-off power law, the best-fit photon index is 1.40, normalisation $9.57$~$\rm ph~cm^{-2}~s^{-1}~keV^{-1}~sr^{-1} $ (at 1 keV), and cut-off energy 55 keV, after correcting for the effects of the Earth albedo and atmospheric emission (which are significant in the HE band). Based on the best-fit cut-off power law, we derived the spectral energy distribution (SED) of the DXRB. The shape of the SED is in general agreement with the published measurements, but the overall normalization is lower by varying amounts, except for the HEAO-1 result, with which our result is in good agreement.
△ Less
Submitted 16 June, 2023;
originally announced June 2023.
-
Probing general $U(1)'$ models with non-universal lepton charges at FASER/FASER2, COHERENT and long-baseline oscillation experiments
Authors:
Tobias Felkl,
Tong Li,
Jiajun Liao,
Michael A. Schmidt
Abstract:
The general anomaly-free $U(1)'$ models allow non-universal lepton charges. We explore the sensitivities of FASER/FASER2, COHERENT and DUNE/T2HK precision experiments to the new gauge boson $Z'$ and the new CP-even scalar $φ$. With non-universal lepton charges, distinctive reaches at FASER/FASER2 emerge in the regime of low $m_{Z'}$ and small gauge coupling $g_{BL}$ for different $U(1)'$ charge se…
▽ More
The general anomaly-free $U(1)'$ models allow non-universal lepton charges. We explore the sensitivities of FASER/FASER2, COHERENT and DUNE/T2HK precision experiments to the new gauge boson $Z'$ and the new CP-even scalar $φ$. With non-universal lepton charges, distinctive reaches at FASER/FASER2 emerge in the regime of low $m_{Z'}$ and small gauge coupling $g_{BL}$ for different $U(1)'$ charge setups. The COHERENT experiment and the future long-baseline experiments DUNE/T2HK also provide complementary probes to the available parameter space. For $m_φ< 2m_{Z'}$, the search for the scalar $φ$ at FASER/FASER2 is sensitive to the mixing angle between the scalar singlet and the SM Higgs. In the case of $m_φ> 2m_{Z'}$, the kinematically allowed decay $φ\to Z' Z'$ changes the lifetime and decay rates of the scalar $φ$. The sensitivity reach highly depends on the $Z'$ mass and the gauge coupling $g_{BL}$.
△ Less
Submitted 14 September, 2023; v1 submitted 15 June, 2023;
originally announced June 2023.
-
JUNO sensitivity to the annihilation of MeV dark matter in the galactic halo
Authors:
JUNO Collaboration,
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato
, et al. (581 additional authors not shown)
Abstract:
We discuss JUNO sensitivity to the annihilation of MeV dark matter in the galactic halo via detecting inverse beta decay reactions of electron anti-neutrinos resulting from the annihilation. We study possible backgrounds to the signature, including the reactor neutrinos, diffuse supernova neutrino background, charged- and neutral-current interactions of atmospheric neutrinos, backgrounds from muon…
▽ More
We discuss JUNO sensitivity to the annihilation of MeV dark matter in the galactic halo via detecting inverse beta decay reactions of electron anti-neutrinos resulting from the annihilation. We study possible backgrounds to the signature, including the reactor neutrinos, diffuse supernova neutrino background, charged- and neutral-current interactions of atmospheric neutrinos, backgrounds from muon-induced fast neutrons and cosmogenic isotopes. A fiducial volume cut, as well as the pulse shape discrimination and the muon veto are applied to suppress the above backgrounds. It is shown that JUNO sensitivity to the thermally averaged dark matter annihilation rate in 10 years of exposure would be significantly better than the present-day best limit set by Super-Kamiokande and would be comparable to that expected by Hyper-Kamiokande.
△ Less
Submitted 13 September, 2023; v1 submitted 15 June, 2023;
originally announced June 2023.
-
Strong Interaction Physics at the Luminosity Frontier with 22 GeV Electrons at Jefferson Lab
Authors:
A. Accardi,
P. Achenbach,
D. Adhikari,
A. Afanasev,
C. S. Akondi,
N. Akopov,
M. Albaladejo,
H. Albataineh,
M. Albrecht,
B. Almeida-Zamora,
M. Amaryan,
D. Androić,
W. Armstrong,
D. S. Armstrong,
M. Arratia,
J. Arrington,
A. Asaturyan,
A. Austregesilo,
H. Avagyan,
T. Averett,
C. Ayerbe Gayoso,
A. Bacchetta,
A. B. Balantekin,
N. Baltzell,
L. Barion
, et al. (419 additional authors not shown)
Abstract:
This document presents the initial scientific case for upgrading the Continuous Electron Beam Accelerator Facility (CEBAF) at Jefferson Lab (JLab) to 22 GeV. It is the result of a community effort, incorporating insights from a series of workshops conducted between March 2022 and April 2023. With a track record of over 25 years in delivering the world's most intense and precise multi-GeV electron…
▽ More
This document presents the initial scientific case for upgrading the Continuous Electron Beam Accelerator Facility (CEBAF) at Jefferson Lab (JLab) to 22 GeV. It is the result of a community effort, incorporating insights from a series of workshops conducted between March 2022 and April 2023. With a track record of over 25 years in delivering the world's most intense and precise multi-GeV electron beams, CEBAF's potential for a higher energy upgrade presents a unique opportunity for an innovative nuclear physics program, which seamlessly integrates a rich historical background with a promising future. The proposed physics program encompass a diverse range of investigations centered around the nonperturbative dynamics inherent in hadron structure and the exploration of strongly interacting systems. It builds upon the exceptional capabilities of CEBAF in high-luminosity operations, the availability of existing or planned Hall equipment, and recent advancements in accelerator technology. The proposed program cover various scientific topics, including Hadron Spectroscopy, Partonic Structure and Spin, Hadronization and Transverse Momentum, Spatial Structure, Mechanical Properties, Form Factors and Emergent Hadron Mass, Hadron-Quark Transition, and Nuclear Dynamics at Extreme Conditions, as well as QCD Confinement and Fundamental Symmetries. Each topic highlights the key measurements achievable at a 22 GeV CEBAF accelerator. Furthermore, this document outlines the significant physics outcomes and unique aspects of these programs that distinguish them from other existing or planned facilities. In summary, this document provides an exciting rationale for the energy upgrade of CEBAF to 22 GeV, outlining the transformative scientific potential that lies within reach, and the remarkable opportunities it offers for advancing our understanding of hadron physics and related fundamental phenomena.
△ Less
Submitted 24 August, 2023; v1 submitted 13 June, 2023;
originally announced June 2023.
-
Telecom-band integrated multimode photonic quantum memory
Authors:
Xueying Zhang,
Bin Zhang,
Shihai Wei,
Hao Li,
**yu Liao,
Cheng Li,
Guangwei Deng,
You Wang,
Haizhi Song,
Lixing You,
Bo **g,
Feng Chen,
Guang-Can Guo,
Qiang Zhou
Abstract:
Telecom-band integrated quantum memory is an elementary building block for develo** quantum networks compatible with fiber communication infrastructures. Towards such a network with large capacity, an integrated multimode photonic quantum memory at telecom band has yet been demonstrated. Here we report a fiber-integrated multimode quantum storage of single photon at telecom band on a laser-writt…
▽ More
Telecom-band integrated quantum memory is an elementary building block for develo** quantum networks compatible with fiber communication infrastructures. Towards such a network with large capacity, an integrated multimode photonic quantum memory at telecom band has yet been demonstrated. Here we report a fiber-integrated multimode quantum storage of single photon at telecom band on a laser-written chip. The storage device is a fiber-pigtailed Er3+:LiNbO3 waveguide and allows a storage of up to 330 temporal modes of heralded single photon with 4-GHz-wide bandwidth at 1532 nm and a 167-fold increasing of coincidence detection rate with respect to single mode. Our memory system with all-fiber addressing is performed using telecom-band fiber-integrated and on-chip devices. The results represent an important step for the future quantum networks using integrated photonics devices.
△ Less
Submitted 13 June, 2023;
originally announced June 2023.
-
DualHGNN: A Dual Hypergraph Neural Network for Semi-Supervised Node Classification based on Multi-View Learning and Density Awareness
Authors:
Jianpeng Liao,
Jun Yan,
Qian Tao
Abstract:
Graph-based semi-supervised node classification has been shown to become a state-of-the-art approach in many applications with high research value and significance. Most existing methods are only based on the original intrinsic or artificially established graph structure which may not accurately reflect the "true" correlation among data and are not optimal for semi-supervised node classification i…
▽ More
Graph-based semi-supervised node classification has been shown to become a state-of-the-art approach in many applications with high research value and significance. Most existing methods are only based on the original intrinsic or artificially established graph structure which may not accurately reflect the "true" correlation among data and are not optimal for semi-supervised node classification in the downstream graph neural networks. Besides, while existing graph-based methods mostly utilize the explicit graph structure, some implicit information, for example, the density information, can also provide latent information that can be further exploited. To address these limitations, this paper proposes the Dual Hypergraph Neural Network (DualHGNN), a new dual connection model integrating both hypergraph structure learning and hypergraph representation learning simultaneously in a unified architecture. The DualHGNN first leverages a multi-view hypergraph learning network to explore the optimal hypergraph structure from multiple views, constrained by a consistency loss proposed to improve its generalization. Then, DualHGNN employs a density-aware hypergraph attention network to explore the high-order semantic correlation among data points based on the density-aware attention mechanism. Extensive experiments are conducted in various benchmark datasets, and the results demonstrate the effectiveness of the proposed approach.
△ Less
Submitted 7 June, 2023;
originally announced June 2023.
-
Simultaneous and panchromatic observations of the Fast Radio Burst FRB 20180916B
Authors:
M. Trudu,
M. Pilia,
L. Nicastro,
C. Guidorzi,
M. Orlandini,
L. Zampieri,
V. R. Marthi,
F. Ambrosino,
A. Possenti,
M. Burgay,
C. Casentini,
I. Mereminskiy,
V. Savchenko,
E. Palazzi,
F. Panessa,
A. Ridolfi,
F. Verrecchia,
M. Anedda,
G. Bernardi,
M. Bachetti,
R. Burenin,
A. Burtovoi,
P. Casella,
M. Fiori,
F. Frontera
, et al. (25 additional authors not shown)
Abstract:
Aims. Fast Radio Bursts are bright radio transients whose origin has not yet explained. The search for a multi-wavelength counterpart of those events can put a tight constrain on the emission mechanism and the progenitor source. Methods. We conducted a multi-wavelength observational campaign on FRB 20180916B between October 2020 and August 2021 during eight activity cycles of the source. Observati…
▽ More
Aims. Fast Radio Bursts are bright radio transients whose origin has not yet explained. The search for a multi-wavelength counterpart of those events can put a tight constrain on the emission mechanism and the progenitor source. Methods. We conducted a multi-wavelength observational campaign on FRB 20180916B between October 2020 and August 2021 during eight activity cycles of the source. Observations were led in the radio band by the SRT both at 336 MHz and 1547 MHz and the uGMRT at 400 MHz. Simultaneous observations have been conducted by the optical telescopes Asiago (Galileo and Copernico), CMO SAI MSU, CAHA 2.2m, RTT-150 and TNG, and X/Gamma-ray detectors on board the AGILE, Insight-HXMT, INTEGRAL and Swift satellites. Results. We present the detection of 14 new bursts detected with the SRT at 336 MHz and seven new bursts with the uGMRT from this source. We provide the deepest prompt upper limits in the optical band fro FRB 20180916B to date. In fact, the TNG/SiFAP2 observation simultaneous to a burst detection by uGMRT gives an upper limit E_optical / E_radio < 1.3 x 10^2. Another burst detected by the SRT at 336 MHz was also co-observed by Insight-HMXT. The non-detection in the X-rays yields an upper limit (1-30 keV band) of E_X-ray / E_radio in the range of (0.9-1.3) x 10^7, depending on which model is considered for the X-ray emission.
△ Less
Submitted 29 May, 2023;
originally announced May 2023.
-
COVID-19 spreading patterns in family clusters reveal gender roles in China
Authors:
**gyi Liao,
Xiao Fan Liu,
Xiao-Ke Xu,
Tao Zhou
Abstract:
Unfolding different gender roles is preceding the efforts to reduce gender inequality. This paper analyzes COVID-19 family clusters outside Hubei Province in mainland China during the 2020 outbreak, revealing significant differences in spreading patterns across gender and family roles. Results show that men are more likely to be the imported cases of a family cluster, and women are more likely to…
▽ More
Unfolding different gender roles is preceding the efforts to reduce gender inequality. This paper analyzes COVID-19 family clusters outside Hubei Province in mainland China during the 2020 outbreak, revealing significant differences in spreading patterns across gender and family roles. Results show that men are more likely to be the imported cases of a family cluster, and women are more likely to be infected within the family. This finding provides new supportive evidence of the men as breadwinner and women as homemaker (MBWH) gender roles in China. Further analyses reveal that the MBWH pattern is stronger in eastern than in western China, stronger for younger than for elder people. This paper offers not only valuable references for formulating gender-differentiated epidemic prevention policies but also an exemplification for studying group differences in similar scenarios.
△ Less
Submitted 28 May, 2023;
originally announced May 2023.
-
A spectral-timing study of the inner flow geometry in MAXI J1535--571 with $Insight$-HXMT and NICER
Authors:
Wei Yu,
Qing-Cui Bu,
He-Xin Liu,
Yue Huang,
Liang Zhang,
Zi-Xu Yang,
**-Lu Qu,
Shu Zhang,
Li-Ming Song,
Shuang-Nan Zhang,
Shu-Mei Jia,
Xiang Ma,
Lian Tao,
Ming-Yu Ge,
Qing-Zhong Liu,
**g-Zhi Yan,
Xue-Lei Cao,
Zhi Chang,
Li Chen,
Yong Chen,
Yu-Peng Chen,
Guo-Qiang Ding,
Ju Guan,
**g **,
Ling-Da Kong
, et al. (26 additional authors not shown)
Abstract:
We have performed a spectral-timing analysis on the black hole X-ray binary MAXI J1535--571 during its 2017 outburst, with the aim of exploring the evolution of the inner accretion flow geometry. X-ray reverberation lags are observed in the hard-intermediate state (HIMS) and soft-intermediate state (SIMS) of the outburst. During the HIMS, the characteristic frequency of the reverberation lags…
▽ More
We have performed a spectral-timing analysis on the black hole X-ray binary MAXI J1535--571 during its 2017 outburst, with the aim of exploring the evolution of the inner accretion flow geometry. X-ray reverberation lags are observed in the hard-intermediate state (HIMS) and soft-intermediate state (SIMS) of the outburst. During the HIMS, the characteristic frequency of the reverberation lags $ν_0$ (the frequency at which the soft lag turns to zero in the lag-frequency spectra) increases when the spectrum softens. This reflects a reduction of the spatial distance between the corona and accretion disc, when assuming the measured time lags are associated with the light travel time. We also find a strong correlation between $ν_0$ and type-C Quasi Periodic Oscillation (QPO) centroid frequency $ν_{QPO}$, which can be well explained by the Lense-Thirring (L-T) precession model under a truncated disk geometry. Despite the degeneracy in the spectral modellings, our results suggest that the accretion disc is largely truncated in the low hard state (LHS), and moves inward as the spectrum softens. Combine the spectral modelling results with the $ν_0$ - $ν_{QPO}$ evolution, we are inclined to believe that this source probably have a truncated disk geometry in the hard state.
△ Less
Submitted 3 July, 2023; v1 submitted 26 May, 2023;
originally announced May 2023.
-
TabGSL: Graph Structure Learning for Tabular Data Prediction
Authors:
Jay Chiehen Liao,
Cheng-Te Li
Abstract:
This work presents a novel approach to tabular data prediction leveraging graph structure learning and graph neural networks. Despite the prevalence of tabular data in real-world applications, traditional deep learning methods often overlook the potentially valuable associations between data instances. Such associations can offer beneficial insights for classification tasks, as instances may exhib…
▽ More
This work presents a novel approach to tabular data prediction leveraging graph structure learning and graph neural networks. Despite the prevalence of tabular data in real-world applications, traditional deep learning methods often overlook the potentially valuable associations between data instances. Such associations can offer beneficial insights for classification tasks, as instances may exhibit similar patterns of correlations among features and target labels. This information can be exploited by graph neural networks, necessitating robust graph structures. However, existing studies primarily focus on improving graph structure from noisy data, largely neglecting the possibility of deriving graph structures from tabular data. We present a novel solution, Tabular Graph Structure Learning (TabGSL), to enhance tabular data prediction by simultaneously learning instance correlation and feature interaction within a unified framework. This is achieved through a proposed graph contrastive learning module, along with transformer-based feature extractor and graph neural network. Comprehensive experiments conducted on 30 benchmark tabular datasets demonstrate that TabGSL markedly outperforms both tree-based models and recent deep learning-based tabular models. Visualizations of the learned instance embeddings further substantiate the effectiveness of TabGSL.
△ Less
Submitted 25 May, 2023;
originally announced May 2023.
-
The case for an EIC Theory Alliance: Theoretical Challenges of the EIC
Authors:
Raktim Abir,
Igor Akushevich,
Tolga Altinoluk,
Daniele Paolo Anderle,
Fatma P. Aslan,
Alessandro Bacchetta,
Baha Balantekin,
Joao Barata,
Marco Battaglieri,
Carlos A. Bertulani,
Guillaume Beuf,
Chiara Bissolotti,
Daniël Boer,
M. Boglione,
Radja Boughezal,
Eric Braaten,
Nora Brambilla,
Vladimir Braun,
Duane Byer,
Francesco Giovanni Celiberto,
Yang-Ting Chien,
Ian C. Cloët,
Martha Constantinou,
Wim Cosyn,
Aurore Courtoy
, et al. (146 additional authors not shown)
Abstract:
We outline the physics opportunities provided by the Electron Ion Collider (EIC). These include the study of the parton structure of the nucleon and nuclei, the onset of gluon saturation, the production of jets and heavy flavor, hadron spectroscopy and tests of fundamental symmetries. We review the present status and future challenges in EIC theory that have to be addressed in order to realize thi…
▽ More
We outline the physics opportunities provided by the Electron Ion Collider (EIC). These include the study of the parton structure of the nucleon and nuclei, the onset of gluon saturation, the production of jets and heavy flavor, hadron spectroscopy and tests of fundamental symmetries. We review the present status and future challenges in EIC theory that have to be addressed in order to realize this ambitious and impactful physics program, including how to engage a diverse and inclusive workforce. In order to address these many-fold challenges, we propose a coordinated effort involving theory groups with differing expertise is needed. We discuss the scientific goals and scope of such an EIC Theory Alliance.
△ Less
Submitted 23 May, 2023;
originally announced May 2023.
-
Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields
Authors:
**gbo Zhang,
Xiaoyu Li,
Ziyu Wan,
Can Wang,
**g Liao
Abstract:
Text-driven 3D scene generation is widely applicable to video gaming, film industry, and metaverse applications that have a large demand for 3D scenes. However, existing text-to-3D generation methods are limited to producing 3D objects with simple geometries and dreamlike styles that lack realism. In this work, we present Text2NeRF, which is able to generate a wide range of 3D scenes with complica…
▽ More
Text-driven 3D scene generation is widely applicable to video gaming, film industry, and metaverse applications that have a large demand for 3D scenes. However, existing text-to-3D generation methods are limited to producing 3D objects with simple geometries and dreamlike styles that lack realism. In this work, we present Text2NeRF, which is able to generate a wide range of 3D scenes with complicated geometric structures and high-fidelity textures purely from a text prompt. To this end, we adopt NeRF as the 3D representation and leverage a pre-trained text-to-image diffusion model to constrain the 3D reconstruction of the NeRF to reflect the scene description. Specifically, we employ the diffusion model to infer the text-related image as the content prior and use a monocular depth estimation method to offer the geometric prior. Both content and geometric priors are utilized to update the NeRF model. To guarantee textured and geometric consistency between different views, we introduce a progressive scene inpainting and updating strategy for novel view synthesis of the scene. Our method requires no additional training data but only a natural language description of the scene as the input. Extensive experiments demonstrate that our Text2NeRF outperforms existing methods in producing photo-realistic, multi-view consistent, and diverse 3D scenes from a variety of natural language prompts. Our code is available at https://github.com/eckertzhang/Text2NeRF.
△ Less
Submitted 31 January, 2024; v1 submitted 19 May, 2023;
originally announced May 2023.
-
Inter-frame Accelerate Attack against Video Interpolation Models
Authors:
Junpei Liao,
Zhikai Chen,
Liang Yi,
Wenyuan Yang,
Baoyuan Wu,
Xiaochun Cao
Abstract:
Deep learning based video frame interpolation (VIF) method, aiming to synthesis the intermediate frames to enhance video quality, have been highly developed in the past few years. This paper investigates the adversarial robustness of VIF models. We apply adversarial attacks to VIF models and find that the VIF models are very vulnerable to adversarial examples. To improve attack efficiency, we sugg…
▽ More
Deep learning based video frame interpolation (VIF) method, aiming to synthesis the intermediate frames to enhance video quality, have been highly developed in the past few years. This paper investigates the adversarial robustness of VIF models. We apply adversarial attacks to VIF models and find that the VIF models are very vulnerable to adversarial examples. To improve attack efficiency, we suggest to make full use of the property of video frame interpolation task. The intuition is that the gap between adjacent frames would be small, leading to the corresponding adversarial perturbations being similar as well. Then we propose a novel attack method named Inter-frame Accelerate Attack (IAA) that initializes the perturbation as the perturbation for the previous adjacent frame and reduces the number of attack iterations. It is shown that our method can improve attack efficiency greatly while achieving comparable attack performance with traditional methods. Besides, we also extend our method to video recognition models which are higher level vision tasks and achieves great attack efficiency.
△ Less
Submitted 10 May, 2023;
originally announced May 2023.
-
Dynamics in near-threshold $J/ψ$ photoproduction
Authors:
JPAC Collaboration,
D. Winney,
C. Fernandez-Ramirez,
A. Pilloni,
A. N. Hiller Blin,
M. Albaladejo,
L. Bibrzycki,
N. Hammoud,
J. Liao,
V. Mathieu,
G. Montana,
R. J. Perry,
V. Shastry,
W. A. Smith,
A. P. Szczepaniak
Abstract:
The study of $J/ψ$ photoproduction at low energies has consequences for the understanding of multiple aspects of nonperturbative QCD, ranging from mechanical properties of the proton, to the binding inside nuclei, and the existence of hidden-charm pentaquarks. Factorization of the photon-$c \bar c$ and nucleon dynamics or Vector Meson Dominance are often invoked to justify these studies. Alternati…
▽ More
The study of $J/ψ$ photoproduction at low energies has consequences for the understanding of multiple aspects of nonperturbative QCD, ranging from mechanical properties of the proton, to the binding inside nuclei, and the existence of hidden-charm pentaquarks. Factorization of the photon-$c \bar c$ and nucleon dynamics or Vector Meson Dominance are often invoked to justify these studies. Alternatively, open charm intermediate states have been proposed as the dominant mechanism underlying $J/ψ$ photoproduction. As the latter violates this factorization, it is important to estimate the relevance of such contributions. We analyse the latest differential and integrated photoproduction cross sections from the GlueX and $J/ψ$-007 experiments. We show that the data can be adequately described by a small number of partial waves, which we parameterize with generic models enforcing low-energy unitarity. The results suggest a nonnegligible contribution from open charm intermediate states. Furthermore, most of the models present an elastic scattering length incompatible with previous extractions based on Vector Meson Dominance, and thus call into question its applicability to heavy mesons. Our results indicate a wide array of physics possibilities that are compatible with present data and need to be disentangled.
△ Less
Submitted 13 September, 2023; v1 submitted 2 May, 2023;
originally announced May 2023.
-
IconShop: Text-Guided Vector Icon Synthesis with Autoregressive Transformers
Authors:
Ronghuan Wu,
Wanchao Su,
Kede Ma,
**g Liao
Abstract:
Scalable Vector Graphics (SVG) is a popular vector image format that offers good support for interactivity and animation. Despite its appealing characteristics, creating custom SVG content can be challenging for users due to the steep learning curve required to understand SVG grammars or get familiar with professional editing software. Recent advancements in text-to-image generation have inspired…
▽ More
Scalable Vector Graphics (SVG) is a popular vector image format that offers good support for interactivity and animation. Despite its appealing characteristics, creating custom SVG content can be challenging for users due to the steep learning curve required to understand SVG grammars or get familiar with professional editing software. Recent advancements in text-to-image generation have inspired researchers to explore vector graphics synthesis using either image-based methods (i.e., text -> raster image -> vector graphics) combining text-to-image generation models with image vectorization, or language-based methods (i.e., text -> vector graphics script) through pretrained large language models. However, these methods still suffer from limitations in terms of generation quality, diversity, and flexibility. In this paper, we introduce IconShop, a text-guided vector icon synthesis method using autoregressive transformers. The key to success of our approach is to sequentialize and tokenize SVG paths (and textual descriptions as guidance) into a uniquely decodable token sequence. With that, we are able to fully exploit the sequence learning power of autoregressive transformers, while enabling both unconditional and text-conditioned icon synthesis. Through standard training to predict the next token on a large-scale vector icon dataset accompanied by textural descriptions, the proposed IconShop consistently exhibits better icon synthesis capability than existing image-based and language-based methods both quantitatively and qualitatively. Meanwhile, we observe a dramatic improvement in generation diversity, which is validated by the objective Uniqueness and Novelty measures. More importantly, we demonstrate the flexibility of IconShop with multiple novel icon synthesis tasks, including icon editing, icon interpolation, icon semantic combination, and icon design auto-suggestion.
△ Less
Submitted 6 June, 2023; v1 submitted 27 April, 2023;
originally announced April 2023.
-
Accurate and Efficient Event-based Semantic Segmentation Using Adaptive Spiking Encoder-Decoder Network
Authors:
Rui Zhang,
Luziwei Leng,
Kaiwei Che,
Hu Zhang,
Jie Cheng,
Qinghai Guo,
Jiangxing Liao,
Ran Cheng
Abstract:
Leveraging the low-power, event-driven computation and the inherent temporal dynamics, spiking neural networks (SNNs) are potentially ideal solutions for processing dynamic and asynchronous signals from event-based sensors. However, due to the challenges in training and the restrictions in architectural design, there are limited examples of competitive SNNs in the realm of event-based dense predic…
▽ More
Leveraging the low-power, event-driven computation and the inherent temporal dynamics, spiking neural networks (SNNs) are potentially ideal solutions for processing dynamic and asynchronous signals from event-based sensors. However, due to the challenges in training and the restrictions in architectural design, there are limited examples of competitive SNNs in the realm of event-based dense prediction when compared to artificial neural networks (ANNs). In this paper, we present an efficient spiking encoder-decoder network designed for large-scale event-based semantic segmentation tasks. This is achieved by optimizing the encoder using a hierarchical search method. To enhance learning from dynamic event streams, we harness the inherent adaptive threshold of spiking neurons to modulate network activation. Moreover, we introduce a dual-path Spiking Spatially-Adaptive Modulation (SSAM) block, specifically designed to enhance the representation of sparse events, thereby considerably improving network performance. Our proposed network achieves a 72.57% mean intersection over union (MIoU) on the DDD17 dataset and a 57.22% MIoU on the recently introduced, larger DSEC-Semantic dataset. This performance surpasses the current state-of-the-art ANNs by 4%, whilst consuming significantly less computational resources. To the best of our knowledge, this is the first study demonstrating SNNs outperforming ANNs in demanding event-based semantic segmentation tasks, thereby establishing the vast potential of SNNs in the field of event-based vision. Our source code will be made publicly accessible.
△ Less
Submitted 9 July, 2023; v1 submitted 24 April, 2023;
originally announced April 2023.
-
Dynamical $N$-photon bundle emission
Authors:
Fen Zou,
Yong Li,
Jie-Qiao Liao
Abstract:
Engineering multiphoton resources is of importance in quantum metrology, quantum lithography, and biological sensing. Here we propose a concept of dynamical emission of $N$ strongly-correlated photons. This is realized in a circuit quantum electrodynamical system driven by two Gaussian-pulse sequences. The underlying physical mechanism relies on the stimulated Raman adiabatic passage that allows e…
▽ More
Engineering multiphoton resources is of importance in quantum metrology, quantum lithography, and biological sensing. Here we propose a concept of dynamical emission of $N$ strongly-correlated photons. This is realized in a circuit quantum electrodynamical system driven by two Gaussian-pulse sequences. The underlying physical mechanism relies on the stimulated Raman adiabatic passage that allows efficient and selective preparation of target multiphoton states. Assisted by the photon decay, a highly pure $N$-photon bundle emission takes place in this system. In particular, the dynamical $N$-photon bundle emission can be tuned by controlling the time interval between consecutive pulses so that the device behaves as an $N$-photon gun, which can be triggered on demand. Our work opens up a route to achieve multiphoton source devices, which have wide potential applications in quantum information processing and quantum metrology.
△ Less
Submitted 7 May, 2023; v1 submitted 21 April, 2023;
originally announced April 2023.
-
Learning Neural Duplex Radiance Fields for Real-Time View Synthesis
Authors:
Ziyu Wan,
Christian Richardt,
Aljaž Božič,
Chao Li,
Vijay Rengarajan,
Seonghyeon Nam,
Xiaoyu Xiang,
Tuotuo Li,
Bo Zhu,
Rakesh Ranjan,
**g Liao
Abstract:
Neural radiance fields (NeRFs) enable novel view synthesis with unprecedented visual quality. However, to render photorealistic images, NeRFs require hundreds of deep multilayer perceptron (MLP) evaluations - for each pixel. This is prohibitively expensive and makes real-time rendering infeasible, even on powerful modern GPUs. In this paper, we propose a novel approach to distill and bake NeRFs in…
▽ More
Neural radiance fields (NeRFs) enable novel view synthesis with unprecedented visual quality. However, to render photorealistic images, NeRFs require hundreds of deep multilayer perceptron (MLP) evaluations - for each pixel. This is prohibitively expensive and makes real-time rendering infeasible, even on powerful modern GPUs. In this paper, we propose a novel approach to distill and bake NeRFs into highly efficient mesh-based neural representations that are fully compatible with the massively parallel graphics rendering pipeline. We represent scenes as neural radiance features encoded on a two-layer duplex mesh, which effectively overcomes the inherent inaccuracies in 3D surface reconstruction by learning the aggregated radiance information from a reliable interval of ray-surface intersections. To exploit local geometric relationships of nearby pixels, we leverage screen-space convolutions instead of the MLPs used in NeRFs to achieve high-quality appearance. Finally, the performance of the whole framework is further boosted by a novel multi-view distillation optimization strategy. We demonstrate the effectiveness and superiority of our approach via extensive experiments on a range of standard datasets.
△ Less
Submitted 20 April, 2023;
originally announced April 2023.
-
The kernels of powers of linear operator via Weyr characteristic
Authors:
Jie Jian,
Jun Liao,
Heguo Liu
Abstract:
The adjoint of a matrix in the Lie algebra associated with a matrix algebra is a fundamental operator, which can be generalized to a more general operator $\varphi_{AB}: X\rightarrow AX-XB$ by two matrices $A$ and $B$. The kernel of the operator is very well-known and it can be found in Gantmacher's book. The formulas for the dimensions of the kernels of arbitrary powers of the operator…
▽ More
The adjoint of a matrix in the Lie algebra associated with a matrix algebra is a fundamental operator, which can be generalized to a more general operator $\varphi_{AB}: X\rightarrow AX-XB$ by two matrices $A$ and $B$. The kernel of the operator is very well-known and it can be found in Gantmacher's book. The formulas for the dimensions of the kernels of arbitrary powers of the operator $\varphi_{AB}$ were given in terms of the Segre characteristics of these two matrices by the second and third authors in this paper and their collaborators. This paper provides an alternative approach to this problem via the Weyr characteristic in a more essential method. We obtain formulas for the dimensions of the kernels of arbitrary powers of the operator in terms of the Weyr characteristics. Furthermore, the basis for kernel of each power of the operator is described explicitly. As a consequence, for arbitrary square matrices $A$ and $B$ over an algebraically closed field, the dimension of the kernel of each power of the operator $\varphi_{A-λI,B}$ for eigenvalues $λ$ of $\varphi_{AB}$ can be viewed as a similarity invariant of the operator $\varphi_{AB}$, so we characterise the operator within similarity, which should be of interest to a number of people (including physicists).
△ Less
Submitted 17 February, 2024; v1 submitted 19 April, 2023;
originally announced April 2023.
-
Deep-Learning-based Vasculature Extraction for Single-Scan Optical Coherence Tomography Angiography
Authors:
**peng Liao,
Tianyu Zhang,
Yilong Zhang,
Chunhui Li,
Zhihong Huang
Abstract:
Optical coherence tomography angiography (OCTA) is a non-invasive imaging modality that extends the functionality of OCT by extracting moving red blood cell signals from surrounding static biological tissues. OCTA has emerged as a valuable tool for analyzing skin microvasculature, enabling more accurate diagnosis and treatment monitoring. Most existing OCTA extraction algorithms, such as speckle v…
▽ More
Optical coherence tomography angiography (OCTA) is a non-invasive imaging modality that extends the functionality of OCT by extracting moving red blood cell signals from surrounding static biological tissues. OCTA has emerged as a valuable tool for analyzing skin microvasculature, enabling more accurate diagnosis and treatment monitoring. Most existing OCTA extraction algorithms, such as speckle variance (SV)- and eigen-decomposition (ED)-OCTA, implement a larger number of repeated (NR) OCT scans at the same position to produce high-quality angiography images. However, a higher NR requires a longer data acquisition time, leading to more unpredictable motion artifacts. In this study, we propose a vasculature extraction pipeline that uses only one-repeated OCT scan to generate OCTA images. The pipeline is based on the proposed Vasculature Extraction Transformer (VET), which leverages convolutional projection to better learn the spatial relationships between image patches. In comparison to OCTA images obtained via the SV-OCTA (PSNR: 17.809) and ED-OCTA (PSNR: 18.049) using four-repeated OCT scans, OCTA images extracted by VET exhibit moderate quality (PSNR: 17.515) and higher image contrast while reducing the required data acquisition time from ~8 s to ~2 s. Based on visual observations, the proposed VET outperforms SV and ED algorithms when using neck and face OCTA data in areas that are challenging to scan. This study represents that the VET has the capacity to extract vascularture images from a fast one-repeated OCT scan, facilitating accurate diagnosis for patients.
△ Less
Submitted 3 May, 2023; v1 submitted 17 April, 2023;
originally announced April 2023.
-
Controllable generation of mechanical quadrature squeezing via dark-mode engineering in cavity optomechanics
Authors:
Jian Huang,
Deng-Gao Lai,
Jie-Qiao Liao
Abstract:
Quantum squeezing is an important resource in modern quantum technologies, such as quantum precision measurement and continuous-variable quantum information processing. The generation of squeezed states of mechanical modes is a significant task in cavity optomechanics. Motivated by recent interest in multimode optomechanics, it becomes an interesting topic to create quadrature squeezing in multipl…
▽ More
Quantum squeezing is an important resource in modern quantum technologies, such as quantum precision measurement and continuous-variable quantum information processing. The generation of squeezed states of mechanical modes is a significant task in cavity optomechanics. Motivated by recent interest in multimode optomechanics, it becomes an interesting topic to create quadrature squeezing in multiple mechanical resonators. However, in the multiple-degenerate-mechanical-mode optomechanical systems, the dark-mode effect strongly suppresses the quantum effects in mechanical modes. Here we study the generation of mechanical squeezing in a two-mechanical-mode optomechanical system by breaking the dark-mode effect with the synthetic-gauge-field method. We find that when the mechanical modes work at a finite temperature, the mechanical squeezing is weak or even disappeared due to the dark-mode effect, while the strong mechanical squeezing can be generated once the dark-mode effect is broken. In particular, the thermal-phonon-occupation tolerance of the mechanical squeezing is approximately three orders of magnitude larger than that without breaking the dark-mode effect. We also generalize this method to break the dark modes and to create the mechanical squeezing in a multiple-mechanical-mode optomechanical system. Our results describe a general physical mechanism and pave the way towards the generation of noise-resistant quantum resources.
△ Less
Submitted 27 July, 2023; v1 submitted 3 April, 2023;
originally announced April 2023.
-
AvatarCraft: Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control
Authors:
Ruixiang Jiang,
Can Wang,
**gbo Zhang,
Menglei Chai,
Mingming He,
Dongdong Chen,
**g Liao
Abstract:
Neural implicit fields are powerful for representing 3D scenes and generating high-quality novel views, but it remains challenging to use such implicit representations for creating a 3D human avatar with a specific identity and artistic style that can be easily animated. Our proposed method, AvatarCraft, addresses this challenge by using diffusion models to guide the learning of geometry and textu…
▽ More
Neural implicit fields are powerful for representing 3D scenes and generating high-quality novel views, but it remains challenging to use such implicit representations for creating a 3D human avatar with a specific identity and artistic style that can be easily animated. Our proposed method, AvatarCraft, addresses this challenge by using diffusion models to guide the learning of geometry and texture for a neural avatar based on a single text prompt. We carefully design the optimization framework of neural implicit fields, including a coarse-to-fine multi-bounding box training strategy, shape regularization, and diffusion-based constraints, to produce high-quality geometry and texture. Additionally, we make the human avatar animatable by deforming the neural implicit field with an explicit war** field that maps the target human mesh to a template human mesh, both represented using parametric human models. This simplifies animation and resha** of the generated avatar by controlling pose and shape parameters. Extensive experiments on various text descriptions show that AvatarCraft is effective and robust in creating human avatars and rendering novel views, poses, and shapes. Our project page is: https://avatar-craft.github.io/.
△ Less
Submitted 21 August, 2023; v1 submitted 30 March, 2023;
originally announced March 2023.
-
Hot QCD White Paper
Authors:
M. Arslandok,
S. A. Bass,
A. A. Baty,
I. Bautista,
C. Beattie,
F. Becattini,
R. Bellwied,
Y. Berdnikov,
A. Berdnikov,
J. Bielcik,
J. T. Blair,
F. Bock,
B. Boimska,
H. Bossi,
H. Caines,
Y. Chen,
Y. -T. Chien,
M. Chiu,
M. E. Connors,
M. Csanád,
C. L. da Silva,
A. P. Dash,
G. David,
K. Dehmelt,
V. Dexheimer
, et al. (149 additional authors not shown)
Abstract:
Hot QCD physics studies the nuclear strong force under extreme temperature and densities. Experimentally these conditions are achieved via high-energy collisions of heavy ions at the Relativistic Heavy Ion Collider (RHIC) and the Large Hadron Collider (LHC). In the past decade, a unique and substantial suite of data was collected at RHIC and the LHC, probing hydrodynamics at the nucleon scale, the…
▽ More
Hot QCD physics studies the nuclear strong force under extreme temperature and densities. Experimentally these conditions are achieved via high-energy collisions of heavy ions at the Relativistic Heavy Ion Collider (RHIC) and the Large Hadron Collider (LHC). In the past decade, a unique and substantial suite of data was collected at RHIC and the LHC, probing hydrodynamics at the nucleon scale, the temperature dependence of the transport properties of quark-gluon plasma, the phase diagram of nuclear matter, the interaction of quarks and gluons at different scales and much more. This document, as part of the 2023 nuclear science long range planning process, was written to review the progress in hot QCD since the 2015 Long Range Plan for Nuclear Science, as well as highlight the realization of previous recommendations, and present opportunities for the next decade, building on the accomplishments and investments made in theoretical developments and the construction of new detectors. Furthermore, this document provides additional context to support the recommendations voted on at the Joint Hot and Cold QCD Town Hall Meeting, which are reported in a separate document.
△ Less
Submitted 30 March, 2023;
originally announced March 2023.
-
Giant-atom entanglement in waveguide-QED systems including non-Markovian effect
Authors:
Xian-Li Yin,
Jie-Qiao Liao
Abstract:
We study the generation of quantum entanglement between two giant atoms coupled to a common one-dimensional waveguide. Here each giant atom interacts with the waveguide at two separate coupling points. Within the Wigner-Weisskopf framework for single coupling points, we obtain the time-delayed quantum master equations governing the evolution of the two giant atoms for three different coupling conf…
▽ More
We study the generation of quantum entanglement between two giant atoms coupled to a common one-dimensional waveguide. Here each giant atom interacts with the waveguide at two separate coupling points. Within the Wigner-Weisskopf framework for single coupling points, we obtain the time-delayed quantum master equations governing the evolution of the two giant atoms for three different coupling configurations: separated, braided, and nested couplings. For each coupling configuration, we consider both the Markovian and non-Markovian entanglement dynamics of the giant atoms, which are initially in two different separable states: single- and double-excitation states. Our results show that the generated entanglement depends on the phase shift, time delay, atomic initial state, and the coupling configuration. For the single-excitation initial state, there exists the steady-state entanglement for each coupling in both the Markovian and non-Markovian regimes due to the appearance of the dark state. For the double-excitation initial state, we observe entanglement sudden birth via adjusting the phase shift in both regimes. In particular, the maximally achievable entanglement for the nested coupling is about one order of magnitude larger than those of separate and braided couplings. We also find that the maximal entanglement for these three coupling configurations can be enhanced in the case of small time delays. This work can be utilized for the generation and control of entanglement in quantum networks based on giant-atom waveguide-QED systems, which have wide potential applications in quantum information processing.
△ Less
Submitted 8 June, 2023; v1 submitted 26 March, 2023;
originally announced March 2023.