Search | arXiv e-print repository

Text-Guided Vector Graphics Customization

Authors: Peiying Zhang, Nanxuan Zhao, **g Liao

Abstract: Vector graphics are widely used in digital art and valued by designers for their scalability and layer-wise topological properties. However, the creation and editing of vector graphics necessitate creativity and design expertise, leading to a time-consuming process. In this paper, we propose a novel pipeline that generates high-quality customized vector graphics based on textual prompts while pres… ▽ More Vector graphics are widely used in digital art and valued by designers for their scalability and layer-wise topological properties. However, the creation and editing of vector graphics necessitate creativity and design expertise, leading to a time-consuming process. In this paper, we propose a novel pipeline that generates high-quality customized vector graphics based on textual prompts while preserving the properties and layer-wise information of a given exemplar SVG. Our method harnesses the capabilities of large pre-trained text-to-image models. By fine-tuning the cross-attention layers of the model, we generate customized raster images guided by textual prompts. To initialize the SVG, we introduce a semantic-based path alignment method that preserves and transforms crucial paths from the exemplar SVG. Additionally, we optimize path parameters using both image-level and vector-level losses, ensuring smooth shape deformation while aligning with the customized raster image. We extensively evaluate our method using multiple metrics from vector-level, image-level, and text-level perspectives. The evaluation results demonstrate the effectiveness of our pipeline in generating diverse customizations of vector graphics with exceptional quality. The project page is https://intchous.github.io/SVGCustomization. △ Less

Submitted 21 September, 2023; originally announced September 2023.

Comments: Accepted by SIGGRAPH Asia 2023. Project page: https://intchous.github.io/SVGCustomization

arXiv:2309.11717 [pdf, other]

A class-weighted supervised contrastive learning long-tailed bearing fault diagnosis approach using quadratic neural network

Authors: Wei-En Yu, **wei Sun, Shi** Zhang, Xiaoge Zhang, **g-Xiao Liao

Abstract: Deep learning has achieved remarkable success in bearing fault diagnosis. However, its performance oftentimes deteriorates when dealing with highly imbalanced or long-tailed data, while such cases are prevalent in industrial settings because fault is a rare event that occurs with an extremely low probability. Conventional data augmentation methods face fundamental limitations due to the scarcity o… ▽ More Deep learning has achieved remarkable success in bearing fault diagnosis. However, its performance oftentimes deteriorates when dealing with highly imbalanced or long-tailed data, while such cases are prevalent in industrial settings because fault is a rare event that occurs with an extremely low probability. Conventional data augmentation methods face fundamental limitations due to the scarcity of samples pertaining to the minority class. In this paper, we propose a supervised contrastive learning approach with a class-aware loss function to enhance the feature extraction capability of neural networks for fault diagnosis. The developed class-weighted contrastive learning quadratic network (CCQNet) consists of a quadratic convolutional residual network backbone, a contrastive learning branch utilizing a class-weighted contrastive loss, and a classifier branch employing logit-adjusted cross-entropy loss. By utilizing class-weighted contrastive loss and logit-adjusted cross-entropy loss, our approach encourages equidistant representation of class features, thereby inducing equal attention on all the classes. We further analyze the superior feature extraction ability of quadratic network by establishing the connection between quadratic neurons and autocorrelation in signal processing. Experimental results on public and proprietary datasets are used to validate the effectiveness of CCQNet, and computational results reveal that CCQNet outperforms SOTA methods in handling extremely imbalanced data substantially. △ Less

Submitted 20 September, 2023; originally announced September 2023.

arXiv:2309.10153 [pdf, other]

Preserving Tumor Volumes for Unsupervised Medical Image Registration

Authors: Qihua Dong, Hao Du, Ying Song, Yan Xu, **g Liao

Abstract: Medical image registration is a critical task that estimates the spatial correspondence between pairs of images. However, current traditional and deep-learning-based methods rely on similarity measures to generate a deforming field, which often results in disproportionate volume changes in dissimilar regions, especially in tumor regions. These changes can significantly alter the tumor size and und… ▽ More Medical image registration is a critical task that estimates the spatial correspondence between pairs of images. However, current traditional and deep-learning-based methods rely on similarity measures to generate a deforming field, which often results in disproportionate volume changes in dissimilar regions, especially in tumor regions. These changes can significantly alter the tumor size and underlying anatomy, which limits the practical use of image registration in clinical diagnosis. To address this issue, we have formulated image registration with tumors as a constraint problem that preserves tumor volumes while maximizing image similarity in other normal regions. Our proposed strategy involves a two-stage process. In the first stage, we use similarity-based registration to identify potential tumor regions by their volume change, generating a soft tumor mask accordingly. In the second stage, we propose a volume-preserving registration with a novel adaptive volume-preserving loss that penalizes the change in size adaptively based on the masks calculated from the previous stage. Our approach balances image similarity and volume preservation in different regions, i.e., normal and tumor regions, by using soft tumor masks to adjust the imposition of volume-preserving loss on each one. This ensures that the tumor volume is preserved during the registration process. We have evaluated our strategy on various datasets and network architectures, demonstrating that our method successfully preserves the tumor volume while achieving comparable registration results with state-of-the-art methods. Our codes is available at: \url{https://dddraxxx.github.io/Volume-Preserving-Registration/}. △ Less

Submitted 9 May, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

Comments: ICCV 2023 Poster

arXiv:2309.08858 [pdf, other]

doi 10.1002/qute.202300244

Two-mode correlated multiphoton bundle emission

Authors: Yi Wang, Fen Zou, Jie-Qiao Liao

Abstract: The preparation of correlated multiphoton sources is an important research topic in quantum optics and quantum information science. Here, two-mode correlated multiphoton bundle emission in a nondegenerate multiphoton Jaynes-Cummings model, which is comprised of a two-level system coupled with two cavity modes is studied. The two-level system is driven by a near-resonant strong laser such that the… ▽ More The preparation of correlated multiphoton sources is an important research topic in quantum optics and quantum information science. Here, two-mode correlated multiphoton bundle emission in a nondegenerate multiphoton Jaynes-Cummings model, which is comprised of a two-level system coupled with two cavity modes is studied. The two-level system is driven by a near-resonant strong laser such that the Mollow regime dominates the physical processes in this system. Under certain resonance conditions, a perfect super-Rabi oscillation between the zero-photon state $|0\rangle_{a}|0\rangle_{b}$ and the ($n+m$)-photon state $|n\rangle_{a}|m\rangle_{b}$ of the two cavity modes can take place. Induced by the photon decay, the two-mode correlated multiphoton bundle emission occurs in this system. More importantly, the results show that there is an antibunching effect between the strongly-correlated photon bundles, so that the system behaves as an antibunched ($n+m$)-photon source. The work opens up a route towards achieving two-mode correlated multiphoton source device, which has potential applications in modern quantum technology. △ Less

Submitted 26 October, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

Comments: 16 pages, 6 figures

Journal ref: Adv. Quantum Technol. 2023, 2300244

arXiv:2309.08856 [pdf, other]

doi 10.1002/qute.202400030

Entangling two giant atoms via a topological waveguide

Authors: Wen-Bin Luo, Xian-Li Yin, Jie-Qiao Liao

Abstract: The entanglement generation of two two-level giant atoms coupled to a photonic waveguide, which is formed by a Su-Schrieffer-Heeger (SSH) type coupled-cavity array is studied. Here, each atom is coupled to the waveguide through two coupling points. The two-atom separate-coupling case is studied, and 16 coupling configurations are considered for the coupling-point distributions between the two atom… ▽ More The entanglement generation of two two-level giant atoms coupled to a photonic waveguide, which is formed by a Su-Schrieffer-Heeger (SSH) type coupled-cavity array is studied. Here, each atom is coupled to the waveguide through two coupling points. The two-atom separate-coupling case is studied, and 16 coupling configurations are considered for the coupling-point distributions between the two atoms and the waveguide. Quantum master equations are derived to govern the evolution of the two atoms and characterize atomic entanglement by calculating the concurrence of the two-atom states. It is found that the two giant-atom entanglement depends on the coupling configurations and the coupling-point distance of the giant atoms. In particular, the entanglement dynamics of the two giant atoms in 14 coupling configurations depend on the dimerization parameter of the SSH waveguide. According to the self-energies of the two giant atoms, it is found that ten of these 16 coupling configurations can be divided into five pairs. It is also showed that the delayed sudden birth of entanglement between the two giant atoms is largely enhanced in these five pairs of coupling configurations. This work will promote the study of quantum effects and coherent manipulation in giant-atom topological-waveguide-QED systems. △ Less

Submitted 8 May, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

Comments: 19 pages, 5 figures

Journal ref: Adv. Quantum Technol. 2024, 2400030

arXiv:2309.07109 [pdf, ps, other]

Real-time Monitoring for the Next Core-Collapse Supernova in JUNO

Authors: Angel Abusleme, Thomas Adam, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Muhammad Akram, Abid Aleem, Fengpeng An, Qi An, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, Burin Asavapibhop, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Wander Baldini, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Bellato, Marco Beretta, Antonio Bergnoli , et al. (606 additional authors not shown)

Abstract: The core-collapse supernova (CCSN) is considered one of the most energetic astrophysical events in the universe. The early and prompt detection of neutrinos before (pre-SN) and during the supernova (SN) burst presents a unique opportunity for multi-messenger observations of CCSN events. In this study, we describe the monitoring concept and present the sensitivity of the system to pre-SN and SN neu… ▽ More The core-collapse supernova (CCSN) is considered one of the most energetic astrophysical events in the universe. The early and prompt detection of neutrinos before (pre-SN) and during the supernova (SN) burst presents a unique opportunity for multi-messenger observations of CCSN events. In this study, we describe the monitoring concept and present the sensitivity of the system to pre-SN and SN neutrinos at the Jiangmen Underground Neutrino Observatory (JUNO), a 20 kton liquid scintillator detector currently under construction in South China. The real-time monitoring system is designed to ensure both prompt alert speed and comprehensive coverage of progenitor stars. It incorporates prompt monitors on the electronic board as well as online monitors at the data acquisition stage. Assuming a false alert rate of 1 per year, this monitoring system exhibits sensitivity to pre-SN neutrinos up to a distance of approximately 1.6 (0.9) kiloparsecs and SN neutrinos up to about 370 (360) kiloparsecs for a progenitor mass of 30 solar masses, considering both normal and inverted mass ordering scenarios. The pointing ability of the CCSN is evaluated by analyzing the accumulated event anisotropy of inverse beta decay interactions from pre-SN or SN neutrinos. This, along with the early alert, can play a crucial role in facilitating follow-up multi-messenger observations of the next galactic or nearby extragalactic CCSN. △ Less

Submitted 4 December, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

Comments: 24 pages, 9 figures, accepted for the publication at JCAP

arXiv:2309.04551 [pdf, ps, other]

doi 10.4230/LIPIcs.ITCS.2024.27

Recursive Error Reduction for Regular Branching Programs

Authors: Eshan Chattopadhyay, Jyun-Jie Liao

Abstract: In a recent work, Chen, Hoza, Lyu, Tal and Wu (FOCS 2023) showed an improved error reduction framework for the derandomization of regular read-once branching programs (ROBPs). Their result is based on a clever modification to the inverse Laplacian perspective of space-bounded derandomization, which was originally introduced by Ahmadinejad, Kelner, Murtagh, Peebles, Sidford and Vadhan (FOCS 2020).… ▽ More In a recent work, Chen, Hoza, Lyu, Tal and Wu (FOCS 2023) showed an improved error reduction framework for the derandomization of regular read-once branching programs (ROBPs). Their result is based on a clever modification to the inverse Laplacian perspective of space-bounded derandomization, which was originally introduced by Ahmadinejad, Kelner, Murtagh, Peebles, Sidford and Vadhan (FOCS 2020). In this work, we give an alternative error reduction framework for regular ROBPs. Our new framework is based on a binary recursive formula from the work of Chattopadhyay and Liao (CCC 2020), that they used to construct weighted pseudorandom generators (WPRGs) for general ROBPs. Based on our new error reduction framework, we give alternative proofs to the following results for regular ROBPs of length $n$ and width $w$, both of which were proved in the work of Chen et al. using their error reduction: $\bullet$ There is a WPRG with error $\varepsilon$ that has seed length $\tilde{O}(\log(n)(\sqrt{\log(1/\varepsilon)}+\log(w))+\log(1/\varepsilon)).$ $\bullet$ There is a (non-black-box) deterministic algorithm which estimates the expectation of any such program within error $\pm\varepsilon$ with space complexity $\tilde{O}(\log(nw)\cdot\log\log(1/\varepsilon)).$ (This was first proved in the work of Ahmadinejad et al., but the proof by Chen et al. is simpler.) Because of the binary recursive nature of our new framework, both of our proofs are based on a straightforward induction that is arguably simpler than the Laplacian-based proof in the work of Chen et al. △ Less

Submitted 6 December, 2023; v1 submitted 8 September, 2023; originally announced September 2023.

arXiv:2308.13177 [pdf, other]

How to Evaluate the Generalization of Detection? A Benchmark for Comprehensive Open-Vocabulary Detection

Authors: Yiyang Yao, Peng Liu, Tiancheng Zhao, Qianqian Zhang, Jiajia Liao, Chunxin Fang, Kyusong Lee, Qing Wang

Abstract: Object detection (OD) in computer vision has made significant progress in recent years, transitioning from closed-set labels to open-vocabulary detection (OVD) based on large-scale vision-language pre-training (VLP). However, current evaluation methods and datasets are limited to testing generalization over object types and referral expressions, which do not provide a systematic, fine-grained, and… ▽ More Object detection (OD) in computer vision has made significant progress in recent years, transitioning from closed-set labels to open-vocabulary detection (OVD) based on large-scale vision-language pre-training (VLP). However, current evaluation methods and datasets are limited to testing generalization over object types and referral expressions, which do not provide a systematic, fine-grained, and accurate benchmark of OVD models' abilities. In this paper, we propose a new benchmark named OVDEval, which includes 9 sub-tasks and introduces evaluations on commonsense knowledge, attribute understanding, position understanding, object relation comprehension, and more. The dataset is meticulously created to provide hard negatives that challenge models' true understanding of visual and linguistic input. Additionally, we identify a problem with the popular Average Precision (AP) metric when benchmarking models on these fine-grained label datasets and propose a new metric called Non-Maximum Suppression Average Precision (NMS-AP) to address this issue. Extensive experimental results show that existing top OVD models all fail on the new tasks except for simple object types, demonstrating the value of the proposed dataset in pinpointing the weakness of current OVD models and guiding future research. Furthermore, the proposed NMS-AP metric is verified by experiments to provide a much more truthful evaluation of OVD models, whereas traditional AP metrics yield deceptive results. Data is available at \url{https://github.com/om-ai-lab/OVDEval} △ Less

Submitted 18 December, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

Comments: Long paper accepted at AAAI 2024

arXiv:2308.11904 [pdf, other]

A Successive Two-stage Method for Sparse Generalized Eigenvalue Problems

Authors: Qia Li, Jianmin Liao, Lixin Shen, Na Zhang

Abstract: The Sparse Generalized Eigenvalue Problem (sGEP), a pervasive challenge in statistical learning methods including sparse principal component analysis, sparse Fisher's discriminant analysis, and sparse canonical correlation analysis, presents significant computational complexity due to its NP-hardness. The primary aim of sGEP is to derive a sparse vector approximation of the largest generalized eig… ▽ More The Sparse Generalized Eigenvalue Problem (sGEP), a pervasive challenge in statistical learning methods including sparse principal component analysis, sparse Fisher's discriminant analysis, and sparse canonical correlation analysis, presents significant computational complexity due to its NP-hardness. The primary aim of sGEP is to derive a sparse vector approximation of the largest generalized eigenvector, effectively posing this as a sparse optimization problem. Conventional algorithms for sGEP, however, often succumb to local optima and exhibit significant dependency on initial points. This predicament necessitates a more refined approach to avoid local optima and achieve an improved solution in terms of sGEP's objective value, which we address in this paper through a novel successive two-stage method. The first stage of this method incorporates an algorithm for sGEP capable of yielding a stationary point from any initial point. The subsequent stage refines this stationary point by adjusting its support, resulting in a point with an enhanced objective value relative to the original stationary point. This support adjustment is achieved through a novel procedure we have named support alteration. The final point derived from the second stage then serves as the initial point for the algorithm in the first stage, creating a cyclical process that continues until a predetermined stop** criterion is satisfied. We also provide a comprehensive convergence analysis of this process. Through extensive experimentation under various settings, our method has demonstrated significant improvements in the objective value of sGEP compared to existing methodologies, underscoring its potential as a valuable tool in statistical learning and optimization. △ Less

Submitted 23 August, 2023; originally announced August 2023.

MSC Class: 90C26; 90C32; 90C59; 90C90

arXiv:2308.10324 [pdf]

doi 10.1103/PhysRevLett.131.166703

Room temperature magnetic phase transition in an electrically-tuned van der Waals ferromagnet

Authors: Cheng Tan, Ji-Hai Liao, Guolin Zheng, Meri Algarni, Jia-Yi Lin, Xiang Ma, Edwin L. H. Mayes, Matthew R. Field, Sultan Albarakati, Majid Panahandeh-Fard, Saleh Alzahrani, Guopeng Wang, Yuanjun Yang, Dimitrie Culcer, James Partridge, Mingliang Tian, Bin Xiang, Yu-Jun Zhao, Lan Wang

Abstract: Finding tunable van der Waals (vdW) ferromagnets that operate at above room temperature is an important research focus in physics and materials science. Most vdW magnets are only intrinsically magnetic far below room temperature and magnetism with square-shaped hysteresis at room-temperature has yet to be observed. Here, we report magnetism in a quasi-2D magnet Cr1.2Te2 observed at room temperatur… ▽ More Finding tunable van der Waals (vdW) ferromagnets that operate at above room temperature is an important research focus in physics and materials science. Most vdW magnets are only intrinsically magnetic far below room temperature and magnetism with square-shaped hysteresis at room-temperature has yet to be observed. Here, we report magnetism in a quasi-2D magnet Cr1.2Te2 observed at room temperature (290 K). This magnetism was tuned via a protonic gate with an electron do** concentration up to 3.8 * 10^21 cm^-3. We observed non-monotonic evolutions in both coercivity and anomalous Hall resistivity. Under increased electron do**, the coercivities and anomalous Hall effects (AHEs) vanished, indicating a do**-induced magnetic phase transition. This occurred up to room temperature. DFT calculations showed the formation of an antiferromagnetic (AFM) phase caused by the intercalation of protons which induced significant electron do** in the Cr1.2Te2. The tunability of the magnetic properties and phase in room temperature magnetic vdW Cr1.2Te2 is a significant step towards practical spintronic devices. △ Less

Submitted 19 March, 2024; v1 submitted 20 August, 2023; originally announced August 2023.

Comments: 18 pages, 4 figures

Journal ref: Phys. Rev. Lett. 131, 166703 (2023)

arXiv:2308.09946 [pdf, other]

Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling

Authors: Guiqin Wang, Peng Zhao, Cong Zhao, Shusen Yang, Jie Cheng, Luziwei Leng, Jianxing Liao, Qinghai Guo

Abstract: Weakly-supervised action localization aims to recognize and localize action instancese in untrimmed videos with only video-level labels. Most existing models rely on multiple instance learning(MIL), where the predictions of unlabeled instances are supervised by classifying labeled bags. The MIL-based methods are relatively well studied with cogent performance achieved on classification but not on… ▽ More Weakly-supervised action localization aims to recognize and localize action instancese in untrimmed videos with only video-level labels. Most existing models rely on multiple instance learning(MIL), where the predictions of unlabeled instances are supervised by classifying labeled bags. The MIL-based methods are relatively well studied with cogent performance achieved on classification but not on localization. Generally, they locate temporal regions by the video-level classification but overlook the temporal variations of feature semantics. To address this problem, we propose a novel attention-based hierarchically-structured latent model to learn the temporal variations of feature semantics. Specifically, our model entails two components, the first is an unsupervised change-points detection module that detects change-points by learning the latent representations of video features in a temporal hierarchy based on their rates of change, and the second is an attention-based classification model that selects the change-points of the foreground as the boundaries. To evaluate the effectiveness of our model, we conduct extensive experiments on two benchmark datasets, THUMOS-14 and ActivityNet-v1.3. The experiments show that our method outperforms current state-of-the-art methods, and even achieves comparable performance with fully-supervised methods. △ Less

Submitted 25 September, 2023; v1 submitted 19 August, 2023; originally announced August 2023.

Comments: Accepted to ICCV 2023. arXiv admin note: text overlap with arXiv:2203.15187, arXiv:2003.12424, arXiv:2104.02967 by other authors

arXiv:2308.08108 [pdf, ps, other]

doi 10.1103/PhysRevA.108.023728

Generation of two-giant-atom entanglement in waveguide-QED systems

Authors: Xian-Li Yin, Jie-Qiao Liao

Abstract: We study the generation of quantum entanglement between two giant atoms coupled to a one-dimensional waveguide. Since each giant atom interacts with the waveguide at two separate coupling points, there exist three different coupling configurations in the two-atom waveguide system: separated, braided, and nested couplings. Within the Wigner-Weisskopf framework for single coupling points, the quantu… ▽ More We study the generation of quantum entanglement between two giant atoms coupled to a one-dimensional waveguide. Since each giant atom interacts with the waveguide at two separate coupling points, there exist three different coupling configurations in the two-atom waveguide system: separated, braided, and nested couplings. Within the Wigner-Weisskopf framework for single coupling points, the quantum master equations governing the evolution of the two giant atoms are obtained. For each coupling configuration, the entanglement dynamics of the two giant atoms is studied, including the cases of two different atomic initial states: single- and double-excitation states. It is shown that the generated entanglement depends on the coupling configuration, phase shift, and atomic initial state. For the single-excitation initial state, there exists steady-state entanglement for these three couplings due to the appearance of the dark state. For the double-excitation initial state, an entanglement sudden birth is observed via adjusting the phase shift. In particular, the maximal entanglement for the nested coupling is about one order of magnitude larger than those of separate and braided couplings. In addition, the influence of the atomic frequency detuning on the entanglement generation is studied. This work can be utilized for the generation and control of atomic entanglement in quantum networks based on giant-atom waveguide-QED systems, which have wide potential applications in quantum information processing. △ Less

Submitted 30 August, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

Comments: 13 pages, 8 figures. arXiv admin note: substantial text overlap with arXiv:2303.14746

Journal ref: Phys. Rev. A 108, 023728 (2023)

arXiv:2308.06801 [pdf, other]

SAILOR: Structural Augmentation Based Tail Node Representation Learning

Authors: Jie Liao, **tang Li, Liang Chen, Bingzhe Wu, Yatao Bian, Zibin Zheng

Abstract: Graph Neural Networks (GNNs) have achieved state-of-the-art performance in representation learning for graphs recently. However, the effectiveness of GNNs, which capitalize on the key operation of message propagation, highly depends on the quality of the topology structure. Most of the graphs in real-world scenarios follow a long-tailed distribution on their node degrees, that is, a vast majority… ▽ More Graph Neural Networks (GNNs) have achieved state-of-the-art performance in representation learning for graphs recently. However, the effectiveness of GNNs, which capitalize on the key operation of message propagation, highly depends on the quality of the topology structure. Most of the graphs in real-world scenarios follow a long-tailed distribution on their node degrees, that is, a vast majority of the nodes in the graph are tail nodes with only a few connected edges. GNNs produce inferior node representations for tail nodes since they lack structural information. In the pursuit of promoting the expressiveness of GNNs for tail nodes, we explore how the deficiency of structural information deteriorates the performance of tail nodes and propose a general Structural Augmentation based taIL nOde Representation learning framework, dubbed as SAILOR, which can jointly learn to augment the graph structure and extract more informative representations for tail nodes. Extensive experiments on public benchmark datasets demonstrate that SAILOR can significantly improve the tail node representations and outperform the state-of-the-art baselines. △ Less

Submitted 14 August, 2023; v1 submitted 13 August, 2023; originally announced August 2023.

Comments: Accepted by CIKM 2023; Code is available at https://github.com/Jie-Re/SAILOR

arXiv:2308.04380 [pdf, other]

doi 10.1145/3581783.3612101

Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination

Authors: Haoxuan Li, Yi Bin, Junrong Liao, Yang Yang, Heng Tao Shen

Abstract: Most existing image-text matching methods adopt triplet loss as the optimization objective, and choosing a proper negative sample for the triplet of <anchor, positive, negative> is important for effectively training the model, e.g., hard negatives make the model learn efficiently and effectively. However, we observe that existing methods mainly employ the most similar samples as hard negatives, wh… ▽ More Most existing image-text matching methods adopt triplet loss as the optimization objective, and choosing a proper negative sample for the triplet of <anchor, positive, negative> is important for effectively training the model, e.g., hard negatives make the model learn efficiently and effectively. However, we observe that existing methods mainly employ the most similar samples as hard negatives, which may not be true negatives. In other words, the samples with high similarity but not paired with the anchor may reserve positive semantic associations, and we call them false negatives. Repelling these false negatives in triplet loss would mislead the semantic representation learning and result in inferior retrieval performance. In this paper, we propose a novel False Negative Elimination (FNE) strategy to select negatives via sampling, which could alleviate the problem introduced by false negatives. Specifically, we first construct the distributions of positive and negative samples separately via their similarities with the anchor, based on the features extracted from image and text encoders. Then we calculate the false negative probability of a given sample based on its similarity with the anchor and the above distributions via the Bayes' rule, which is employed as the sampling weight during negative sampling process. Since there may not exist any false negative in a small batch size, we design a memory module with momentum to retain a large negative buffer and implement our negative sampling strategy spanning over the buffer. In addition, to make the model focus on hard negatives, we reassign the sampling weights for the simple negatives with a cut-down strategy. The extensive experiments are conducted on Flickr30K and MS-COCO, and the results demonstrate the superiority of our proposed false negative elimination strategy. The code is available at https://github.com/LuminosityX/FNE. △ Less

Submitted 8 August, 2023; originally announced August 2023.

Comments: Accepted at ACM MM 2023

arXiv:2308.03585 [pdf, ps, other]

Hilton-Milner theorem for $k$-multisets

Authors: Jiaqi Liao, Zequn Lv, Mengyu Cao, Mei Lu

Abstract: Let $ k, n \in \mathbb{N}^+ $ and $ m \in \mathbb{N}^+ \cup \{\infty \} $. A $ k $-multiset in $ [n]_m $ is a $ k $-set whose elements are integers from $ \{1, 2, \ldots, n\} $, and each element is allowed to have at most $ m $ repetitions. A family of $ k $-multisets in $ [n]_m $ is said to be intersecting if every pair of $ k $-multisets from the family have non-empty intersection. In this paper… ▽ More Let $ k, n \in \mathbb{N}^+ $ and $ m \in \mathbb{N}^+ \cup \{\infty \} $. A $ k $-multiset in $ [n]_m $ is a $ k $-set whose elements are integers from $ \{1, 2, \ldots, n\} $, and each element is allowed to have at most $ m $ repetitions. A family of $ k $-multisets in $ [n]_m $ is said to be intersecting if every pair of $ k $-multisets from the family have non-empty intersection. In this paper, we give the size and structure of the largest non-trivial intersecting family of $ k $-multisets in $ [n]_m $ for $ n \geq k + \lceil k/m \rceil $. In the special case when $m=\infty$, our result gives rise to an unbounded multiset version for Hilton-Milner Theorem given by Meagher and Purdy. Furthermore, our main theorem unites the statements of the Hilton-Milner Theorem for finite sets and unbounded multisets. △ Less

Submitted 6 July, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

Comments: 14 pages

MSC Class: 05D05; 05C35; 05A15

arXiv:2308.02118 [pdf, other]

Rethinking Class Activation Maps for Segmentation: Revealing Semantic Information in Shallow Layers by Reducing Noise

Authors: Hang-Cheng Dong, Yuhao Jiang, Yingyan Huang, **gxiao Liao, Bingguo Liu, Dong Ye, Guodong Liu

Abstract: Class activation maps are widely used for explaining deep neural networks. Due to its ability to highlight regions of interest, it has evolved in recent years as a key step in weakly supervised learning. A major limitation to the performance of the class activation maps is the small spatial resolution of the feature maps in the last layer of the convolutional neural network. Therefore, we expect t… ▽ More Class activation maps are widely used for explaining deep neural networks. Due to its ability to highlight regions of interest, it has evolved in recent years as a key step in weakly supervised learning. A major limitation to the performance of the class activation maps is the small spatial resolution of the feature maps in the last layer of the convolutional neural network. Therefore, we expect to generate high-resolution feature maps that result in high-quality semantic information. In this paper, we rethink the properties of semantic information in shallow feature maps. We find that the shallow feature maps still have fine-grained non-discriminative features while mixing considerable non-target noise. Furthermore, we propose a simple gradient-based denoising method to filter the noise by truncating the positive gradient. Our proposed scheme can be easily deployed in other CAM-related methods, facilitating these methods to obtain higher-quality class activation maps. We evaluate the proposed approach through a weakly-supervised semantic segmentation task, and a large number of experiments demonstrate the effectiveness of our approach. △ Less

Submitted 3 August, 2023; originally announced August 2023.

arXiv:2307.16363 [pdf, other]

BearingPGA-Net: A Lightweight and Deployable Bearing Fault Diagnosis Network via Decoupled Knowledge Distillation and FPGA Acceleration

Authors: **g-Xiao Liao, Sheng-Lai Wei, Chen-Long Xie, Tieyong Zeng, **wei Sun, Shi** Zhang, Xiaoge Zhang, Feng-Lei Fan

Abstract: Deep learning has achieved remarkable success in the field of bearing fault diagnosis. However, this success comes with larger models and more complex computations, which cannot be transferred into industrial fields requiring models to be of high speed, strong portability, and low power consumption. In this paper, we propose a lightweight and deployable model for bearing fault diagnosis, referred… ▽ More Deep learning has achieved remarkable success in the field of bearing fault diagnosis. However, this success comes with larger models and more complex computations, which cannot be transferred into industrial fields requiring models to be of high speed, strong portability, and low power consumption. In this paper, we propose a lightweight and deployable model for bearing fault diagnosis, referred to as BearingPGA-Net, to address these challenges. Firstly, aided by a well-trained large model, we train BearingPGA-Net via decoupled knowledge distillation. Despite its small size, our model demonstrates excellent fault diagnosis performance compared to other lightweight state-of-the-art methods. Secondly, we design an FPGA acceleration scheme for BearingPGA-Net using Verilog. This scheme involves the customized quantization and designing programmable logic gates for each layer of BearingPGA-Net on the FPGA, with an emphasis on parallel computing and module reuse to enhance the computational speed. To the best of our knowledge, this is the first instance of deploying a CNN-based bearing fault diagnosis model on an FPGA. Experimental results reveal that our deployment scheme achieves over 200 times faster diagnosis speed compared to CPU, while achieving a lower-than-0.4\% performance drop in terms of F1, Recall, and Precision score on our independently-collected bearing dataset. Our code is available at \url{https://github.com/asdvfghg/BearingPGA-Net}. △ Less

Submitted 30 July, 2023; originally announced July 2023.

arXiv:2307.13949 [pdf, other]

How Does Diffusion Influence Pretrained Language Models on Out-of-Distribution Data?

Authors: Huazheng Wang, Daixuan Cheng, Haifeng Sun, **gyu Wang, Qi Qi, Jianxin Liao, **g Wang, Cong Liu

Abstract: Transformer-based pretrained language models (PLMs) have achieved great success in modern NLP. An important advantage of PLMs is good out-of-distribution (OOD) robustness. Recently, diffusion models have attracted a lot of work to apply diffusion to PLMs. It remains under-explored how diffusion influences PLMs on OOD data. The core of diffusion models is a forward diffusion process which gradually… ▽ More Transformer-based pretrained language models (PLMs) have achieved great success in modern NLP. An important advantage of PLMs is good out-of-distribution (OOD) robustness. Recently, diffusion models have attracted a lot of work to apply diffusion to PLMs. It remains under-explored how diffusion influences PLMs on OOD data. The core of diffusion models is a forward diffusion process which gradually applies Gaussian noise to inputs, and a reverse denoising process which removes noise. The noised input reconstruction is a fundamental ability of diffusion models. We directly analyze OOD robustness by measuring the reconstruction loss, including testing the abilities to reconstruct OOD data, and to detect OOD samples. Experiments are conducted by analyzing different training parameters and data statistical features on eight datasets. It shows that finetuning PLMs with diffusion degrades the reconstruction ability on OOD data. The comparison also shows that diffusion models can effectively detect OOD samples, achieving state-of-the-art performance in most of the datasets with an absolute accuracy improvement up to 18%. These results indicate that diffusion reduces OOD robustness of PLMs. △ Less

Submitted 26 July, 2023; originally announced July 2023.

Comments: Accepted by ECAI 2023

arXiv:2307.12900 [pdf, other]

doi 10.1109/TCDS.2024.3410371

Automotive Object Detection via Learning Sparse Events by Spiking Neurons

Authors: Hu Zhang, Yanchen Li, Luziwei Leng, Kaiwei Che, Qian Liu, Qinghai Guo, Jianxing Liao, Ran Cheng

Abstract: Event-based sensors, distinguished by their high temporal resolution of 1 $\mathrmμ\text{s}$ and a dynamic range of 120 $\text{dB}$, stand out as ideal tools for deployment in fast-paced settings like vehicles and drones. Traditional object detection techniques that utilize Artificial Neural Networks (ANNs) face challenges due to the sparse and asynchronous nature of the events these sensors captu… ▽ More Event-based sensors, distinguished by their high temporal resolution of 1 $\mathrmμ\text{s}$ and a dynamic range of 120 $\text{dB}$, stand out as ideal tools for deployment in fast-paced settings like vehicles and drones. Traditional object detection techniques that utilize Artificial Neural Networks (ANNs) face challenges due to the sparse and asynchronous nature of the events these sensors capture. In contrast, Spiking Neural Networks (SNNs) offer a promising alternative, providing a temporal representation that is inherently aligned with event-based data. This paper explores the unique membrane potential dynamics of SNNs and their ability to modulate sparse events. We introduce an innovative spike-triggered adaptive threshold mechanism designed for stable training. Building on these insights, we present a specialized spiking feature pyramid network (SpikeFPN) optimized for automotive event-based object detection. Comprehensive evaluations demonstrate that SpikeFPN surpasses both traditional SNNs and advanced ANNs enhanced with attention mechanisms. Evidently, SpikeFPN achieves a mean Average Precision (mAP) of 0.477 on the GEN1 Automotive Detection (GAD) benchmark dataset, marking significant increases over the selected SNN baselines. Moreover, the efficient design of SpikeFPN ensures robust performance while optimizing computational resources, attributed to its innate sparse computation capabilities. Source codes are publicly accessible at https://github.com/EMI-Group/spikefpn. △ Less

Submitted 10 June, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

Comments: IEEE Transactions on Cognitive and Developmental Systems

arXiv:2307.02036 [pdf]

Convex Optimal Power Flow Based on Power Injection-based Equations and Its Application in Bipolar DC Distribution Network

Authors: Yiyao Zhou, Qianggang Wang, Yuan Chi, Jianquan Liao, Tao Huang, Niancheng Zhou, Xiaolong Xu, Xuefei Zhang

Abstract: Optimal power flow (OPF) is a fundamental tool for analyzing the characteristics of bipolar DC distribution network (DCDN). However, existing OPF models face challenges in reflecting the power distribution and exchange of bipolar DCDN directly since its decision variables are voltage and current. This paper addresses this issue by establishing a convex OPF model that can be used for the planning a… ▽ More Optimal power flow (OPF) is a fundamental tool for analyzing the characteristics of bipolar DC distribution network (DCDN). However, existing OPF models face challenges in reflecting the power distribution and exchange of bipolar DCDN directly since its decision variables are voltage and current. This paper addresses this issue by establishing a convex OPF model that can be used for the planning and operation of bipolar DCDN. First, the power flow characteristics of bipolar DCDN are revealed through power injection-based equations, upon which the original OPF model is established. Next, the original OPF model undergoes a transformation into a convex OPF model based on second-order cone programming (SOCP) through variable substitution, secondorder cone relaxation, McCormick relaxation, and first-order Taylor expansion, respectively. Finally, the sequence bound tightening algorithm (STBA) is employed to tighten the boundaries of McCormick envelopes in each iteration to ensure the exactness of the convex OPF model. The effectiveness of this novel OPF model for bipolar DCDN is verified through two case studies, i.e., capacity configuration of distributed generation (DG) and operation optimization of bipolar DCDN. △ Less

Submitted 5 July, 2023; originally announced July 2023.

Comments: 10 pages, 13 figures, under review in IEEE transactions on power systems

arXiv:2306.16186 [pdf, other]

Effective Transfer of Pretrained Large Visual Model for Fabric Defect Segmentation via Specifc Knowledge Injection

Authors: Zhewei Chen, Wai Keung Wong, Zuofeng Zhong, **piao Liao, Ying Qu

Abstract: Fabric defect segmentation is integral to textile quality control. Despite this, the scarcity of high-quality annotated data and the diversity of fabric defects present significant challenges to the application of deep learning in this field. These factors limit the generalization and segmentation performance of existing models, impeding their ability to handle the complexity of diverse fabric typ… ▽ More Fabric defect segmentation is integral to textile quality control. Despite this, the scarcity of high-quality annotated data and the diversity of fabric defects present significant challenges to the application of deep learning in this field. These factors limit the generalization and segmentation performance of existing models, impeding their ability to handle the complexity of diverse fabric types and defects. To overcome these obstacles, this study introduces an innovative method to infuse specialized knowledge of fabric defects into the Segment Anything Model (SAM), a large-scale visual model. By introducing and training a unique set of fabric defect-related parameters, this approach seamlessly integrates domain-specific knowledge into SAM without the need for extensive modifications to the pre-existing model parameters. The revamped SAM model leverages generalized image understanding learned from large-scale natural image datasets while incorporating fabric defect-specific knowledge, ensuring its proficiency in fabric defect segmentation tasks. The experimental results reveal a significant improvement in the model's segmentation performance, attributable to this novel amalgamation of generic and fabric-specific knowledge. When benchmarking against popular existing segmentation models across three datasets, our proposed model demonstrates a substantial leap in performance. Its impressive results in cross-dataset comparisons and few-shot learning experiments further demonstrate its potential for practical applications in textile quality control. △ Less

Submitted 28 June, 2023; originally announced June 2023.

Comments: 13 pages,4 figures, 3 tables

ACM Class: I.2.10; I.4.9; I.5.4

arXiv:2306.13078 [pdf, other]

Continuous Layout Editing of Single Images with Diffusion Models

Authors: Zhiyuan Zhang, Zhitong Huang, **g Liao

Abstract: Recent advancements in large-scale text-to-image diffusion models have enabled many applications in image editing. However, none of these methods have been able to edit the layout of single existing images. To address this gap, we propose the first framework for layout editing of a single image while preserving its visual properties, thus allowing for continuous editing on a single image. Our appr… ▽ More Recent advancements in large-scale text-to-image diffusion models have enabled many applications in image editing. However, none of these methods have been able to edit the layout of single existing images. To address this gap, we propose the first framework for layout editing of a single image while preserving its visual properties, thus allowing for continuous editing on a single image. Our approach is achieved through two key modules. First, to preserve the characteristics of multiple objects within an image, we disentangle the concepts of different objects and embed them into separate textual tokens using a novel method called masked textual inversion. Next, we propose a training-free optimization method to perform layout control for a pre-trained diffusion model, which allows us to regenerate images with learned concepts and align them with user-specified layouts. As the first framework to edit the layout of existing images, we demonstrate that our method is effective and outperforms other baselines that were modified to support this task. Our code will be freely available for public use upon acceptance. △ Less

Submitted 22 June, 2023; originally announced June 2023.

arXiv:2306.12465 [pdf, other]

doi 10.1109/TNNLS.2024.3394837

Efficient Deep Spiking Multi-Layer Perceptrons with Multiplication-Free Inference

Authors: Boyan Li, Luziwei Leng, Shuaijie Shen, Kaixuan Zhang, Jianguo Zhang, Jianxing Liao, Ran Cheng

Abstract: Advancements in adapting deep convolution architectures for Spiking Neural Networks (SNNs) have significantly enhanced image classification performance and reduced computational burdens. However, the inability of Multiplication-Free Inference (MFI) to align with attention and transformer mechanisms, which are critical to superior performance on high-resolution vision tasks, imposing limitations on… ▽ More Advancements in adapting deep convolution architectures for Spiking Neural Networks (SNNs) have significantly enhanced image classification performance and reduced computational burdens. However, the inability of Multiplication-Free Inference (MFI) to align with attention and transformer mechanisms, which are critical to superior performance on high-resolution vision tasks, imposing limitations on these gains. To address this, our research explores a new pathway, drawing inspiration from the progress made in Multi-Layer Perceptrons (MLPs). We propose an innovative spiking MLP architecture that uses batch normalization to retain MFI compatibility and introducing a spiking patch encoding layer to enhance local feature extraction capabilities. As a result, we establish an efficient multi-stage spiking MLP network that blends effectively global receptive fields with local feature extraction for comprehensive spike-based computation. Without relying on pre-training or sophisticated SNN training techniques, our network secures a top-1 accuracy of 66.39% on the ImageNet-1K dataset, surpassing the directly trained spiking ResNet-34 by 2.67%. Furthermore, we curtail computational costs, model parameters, and simulation steps. An expanded version of our network compares with the performance of the spiking VGG-16 network with a 71.64% top-1 accuracy, all while operating with a model capacity 2.1 times smaller. Our findings highlight the potential of our deep SNN architecture in effectively integrating global and local learning abilities. Interestingly, the trained receptive field in our network mirrors the activity patterns of cortical cells. Source codes are publicly accessible at https://github.com/EMI-Group/mixer-snn. △ Less

Submitted 26 April, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

Comments: IEEE TNNLS

arXiv:2306.12185 [pdf, other]

Adaptive DNN Surgery for Selfish Inference Acceleration with On-demand Edge Resource

Authors: Xiang Yang, Dezhi Chen, Qi Qi, **gyu Wang, Haifeng Sun, Jianxin Liao, Song Guo

Abstract: Deep Neural Networks (DNNs) have significantly improved the accuracy of intelligent applications on mobile devices. DNN surgery, which partitions DNN processing between mobile devices and multi-access edge computing (MEC) servers, can enable real-time inference despite the computational limitations of mobile devices. However, DNN surgery faces a critical challenge: determining the optimal computin… ▽ More Deep Neural Networks (DNNs) have significantly improved the accuracy of intelligent applications on mobile devices. DNN surgery, which partitions DNN processing between mobile devices and multi-access edge computing (MEC) servers, can enable real-time inference despite the computational limitations of mobile devices. However, DNN surgery faces a critical challenge: determining the optimal computing resource demand from the server and the corresponding partition strategy, while considering both inference latency and MEC server usage costs. This problem is compounded by two factors: (1) the finite computing capacity of the MEC server, which is shared among multiple devices, leading to inter-dependent demands, and (2) the shift in modern DNN architecture from chains to directed acyclic graphs (DAGs), which complicates potential solutions. In this paper, we introduce a novel Decentralized DNN Surgery (DDS) framework. We formulate the partition strategy as a min-cut and propose a resource allocation game to adaptively schedule the demands of mobile devices in an MEC environment. We prove the existence of a Nash Equilibrium (NE), and develop an iterative algorithm to efficiently reach the NE for each device. Our extensive experiments demonstrate that DDS can effectively handle varying MEC scenarios, achieving up to 1.25$\times$ acceleration compared to the state-of-the-art algorithm. △ Less

Submitted 21 June, 2023; originally announced June 2023.

Comments: Under Review

arXiv:2306.10957 [pdf, other]

doi 10.1103/PhysRevA.107.063703

Chiral and nonreciprocal single-photon scattering in a chiral-giant-molecule waveguide-QED system

Authors: Juan Zhou, Xian-Li Yin, Jie-Qiao Liao

Abstract: We study chiral and nonreciprocal single-photon scattering in a chiral-giant-molecule waveguide-QED system. Here, the giant molecule consists of two coupled giant atoms, which interact with two linear waveguides, forming a four-port quantum device. We obtain the exact analytical expressions of the four scattering amplitudes using a real-space method. Under the Markovian limit, we find that the sin… ▽ More We study chiral and nonreciprocal single-photon scattering in a chiral-giant-molecule waveguide-QED system. Here, the giant molecule consists of two coupled giant atoms, which interact with two linear waveguides, forming a four-port quantum device. We obtain the exact analytical expressions of the four scattering amplitudes using a real-space method. Under the Markovian limit, we find that the single-photon scattering behavior is determined by the coupling strength between the giant atoms and the waveguides, the coupling strength between the two giant atoms, and the nondipole effect caused by the phase accumulation of photons travelling between the coupling points. It is also found that chiral and nonreciprocal single-photon scattering can be realized by introducing the chiral coupling to break the symmetry in the coupling configuration between the giant molecule and the waveguides. In addition, an ideal chiral emitter-waveguide coupling enables a directional single-photon routing. In the non-Markovian regime, the scattering spectra are characterized by more abundant structures with multiple peaks and dips. In particular, we demonstrate that the non-Markovian retarded effect can induce the nonreciprocal single-photon scattering. Our results have potential applications in the design of optical quantum devices involving giant atoms, which can provide an efficient platform for studying chiral quantum optics. △ Less

Submitted 19 June, 2023; originally announced June 2023.

Comments: 13 pages,10 figures

Journal ref: Phys. Rev. A 107, 063703 (2023)

arXiv:2306.10255 [pdf, other]

doi 10.1029/2022GL102325

The First GECAM Observation Results on Terrestrial Gamma-ray Flashes and Terrestrial Electron Beams

Authors: Y. Zhao, J. C. Liu, S. L. Xiong, W. C. Xue, Q. B. Yi, G. P. Lu, W. Xu, F. C. Lyu, J. C. Sun, W. X. Peng, C. Zheng, Y. Q. Zhang, C. Cai, S. Xiao, S. L. Xie, C. W. Wang, W. J. Tan, Z. H. An, G. Chen, Y. Q. Du, Y. Huang, M. Gao, K. Gong, D. Y. Guo, J. J. He , et al. (37 additional authors not shown)

Abstract: Gravitational-wave high-energy Electromagnetic Counterpart All-sky Monitor (GECAM) is a space-borne instrument dedicated to monitoring high-energy transients, including Terrestrial Gamma-ray Flashes (TGFs) and Terrestrial Electron Beams (TEBs). We implemented a TGF/TEB search algorithm for GECAM, with which 147 bright TGFs, 2 typical TEBs and 2 special TEB-like events are identified during an effe… ▽ More Gravitational-wave high-energy Electromagnetic Counterpart All-sky Monitor (GECAM) is a space-borne instrument dedicated to monitoring high-energy transients, including Terrestrial Gamma-ray Flashes (TGFs) and Terrestrial Electron Beams (TEBs). We implemented a TGF/TEB search algorithm for GECAM, with which 147 bright TGFs, 2 typical TEBs and 2 special TEB-like events are identified during an effective observation time of $\sim$9 months. We show that, with gamma-ray and charged particle detectors, GECAM can effectively identify and distinguish TGFs and TEBs, and measure their temporal and spectral properties in detail. A very high TGF-lightning association rate of $\sim$80\% is obtained between GECAM and GLD360 in east Asia region. △ Less

Submitted 17 June, 2023; originally announced June 2023.

Comments: The paper was accepted by Geophysical Research Letters on June 16th, 2023

arXiv:2306.10114 [pdf, other]

doi 10.1093/mnras/stac1129

Insight-HXMT Measurements of the Diffuse X-ray Background

Authors: Rui Huang, Wei Cui, **-Yuan Liao, Shuo Zhang, Si-Fan Wang, **g **, Xue Feng Lu, Cheng-Cheng Guo, Yuan You, Gang Li, Juan Zhang

Abstract: We present an X-ray spectrum of the diffuse X-ray background (DXRB) between 1.5 and 120 keV, as measured with the Low-Energy Detector (LE) and the High-Energy Detector (HE) aboard the Insight-HXMT satellite, based on 'blank-sky' observations. LE covers a nominal energy range of 1-15 keV and HE 20-250 keV, but calibration issues and data quality narrowed the energy range for this work. The LE backg… ▽ More We present an X-ray spectrum of the diffuse X-ray background (DXRB) between 1.5 and 120 keV, as measured with the Low-Energy Detector (LE) and the High-Energy Detector (HE) aboard the Insight-HXMT satellite, based on 'blank-sky' observations. LE covers a nominal energy range of 1-15 keV and HE 20-250 keV, but calibration issues and data quality narrowed the energy range for this work. The LE background was directly measured with `blind' detector modules, while the HE background was derived from Earth-occultation data. With the LE data alone, the measured DXRB spectrum can be well described by a power law; fitting the LE and HE data jointly, however, a spectral cut-off must be introduced in the model to account for the measurements above 30 keV. Modelling the combined spectrum with a cut-off power law, the best-fit photon index is 1.40, normalisation $9.57$~$\rm ph~cm^{-2}~s^{-1}~keV^{-1}~sr^{-1} $ (at 1 keV), and cut-off energy 55 keV, after correcting for the effects of the Earth albedo and atmospheric emission (which are significant in the HE band). Based on the best-fit cut-off power law, we derived the spectral energy distribution (SED) of the DXRB. The shape of the SED is in general agreement with the published measurements, but the overall normalization is lower by varying amounts, except for the HEAO-1 result, with which our result is in good agreement. △ Less

Submitted 16 June, 2023; originally announced June 2023.

Comments: 9 pages, 14 figures, published in MNRAS

Journal ref: MNRAS, 513, 4074 (2022)

arXiv:2306.09569 [pdf, other]

Probing general $U(1)'$ models with non-universal lepton charges at FASER/FASER2, COHERENT and long-baseline oscillation experiments

Authors: Tobias Felkl, Tong Li, Jiajun Liao, Michael A. Schmidt

Abstract: The general anomaly-free $U(1)'$ models allow non-universal lepton charges. We explore the sensitivities of FASER/FASER2, COHERENT and DUNE/T2HK precision experiments to the new gauge boson $Z'$ and the new CP-even scalar $φ$. With non-universal lepton charges, distinctive reaches at FASER/FASER2 emerge in the regime of low $m_{Z'}$ and small gauge coupling $g_{BL}$ for different $U(1)'$ charge se… ▽ More The general anomaly-free $U(1)'$ models allow non-universal lepton charges. We explore the sensitivities of FASER/FASER2, COHERENT and DUNE/T2HK precision experiments to the new gauge boson $Z'$ and the new CP-even scalar $φ$. With non-universal lepton charges, distinctive reaches at FASER/FASER2 emerge in the regime of low $m_{Z'}$ and small gauge coupling $g_{BL}$ for different $U(1)'$ charge setups. The COHERENT experiment and the future long-baseline experiments DUNE/T2HK also provide complementary probes to the available parameter space. For $m_φ< 2m_{Z'}$, the search for the scalar $φ$ at FASER/FASER2 is sensitive to the mixing angle between the scalar singlet and the SM Higgs. In the case of $m_φ> 2m_{Z'}$, the kinematically allowed decay $φ\to Z' Z'$ changes the lifetime and decay rates of the scalar $φ$. The sensitivity reach highly depends on the $Z'$ mass and the gauge coupling $g_{BL}$. △ Less

Submitted 14 September, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

Comments: 30 pages, 6 figures, 2 tables. Accepted for publication in JHEP

Report number: CPPC-2023-03

arXiv:2306.09567 [pdf, other]

doi 10.1088/1475-7516/2023/09/001

JUNO sensitivity to the annihilation of MeV dark matter in the galactic halo

Authors: JUNO Collaboration, Angel Abusleme, Thomas Adam, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Muhammad Akram, Abid Aleem, Tsagkarakis Alexandros, Fengpeng An, Qi An, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, Burin Asavapibhop, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Wander Baldini, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Bellato , et al. (581 additional authors not shown)

Abstract: We discuss JUNO sensitivity to the annihilation of MeV dark matter in the galactic halo via detecting inverse beta decay reactions of electron anti-neutrinos resulting from the annihilation. We study possible backgrounds to the signature, including the reactor neutrinos, diffuse supernova neutrino background, charged- and neutral-current interactions of atmospheric neutrinos, backgrounds from muon… ▽ More We discuss JUNO sensitivity to the annihilation of MeV dark matter in the galactic halo via detecting inverse beta decay reactions of electron anti-neutrinos resulting from the annihilation. We study possible backgrounds to the signature, including the reactor neutrinos, diffuse supernova neutrino background, charged- and neutral-current interactions of atmospheric neutrinos, backgrounds from muon-induced fast neutrons and cosmogenic isotopes. A fiducial volume cut, as well as the pulse shape discrimination and the muon veto are applied to suppress the above backgrounds. It is shown that JUNO sensitivity to the thermally averaged dark matter annihilation rate in 10 years of exposure would be significantly better than the present-day best limit set by Super-Kamiokande and would be comparable to that expected by Hyper-Kamiokande. △ Less

Submitted 13 September, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

Comments: 25 pages, 9 figures, matches the publised version

Journal ref: JCAP 09 (2023) 001

arXiv:2306.09360 [pdf, other]

Strong Interaction Physics at the Luminosity Frontier with 22 GeV Electrons at Jefferson Lab

Authors: A. Accardi, P. Achenbach, D. Adhikari, A. Afanasev, C. S. Akondi, N. Akopov, M. Albaladejo, H. Albataineh, M. Albrecht, B. Almeida-Zamora, M. Amaryan, D. Androić, W. Armstrong, D. S. Armstrong, M. Arratia, J. Arrington, A. Asaturyan, A. Austregesilo, H. Avagyan, T. Averett, C. Ayerbe Gayoso, A. Bacchetta, A. B. Balantekin, N. Baltzell, L. Barion , et al. (419 additional authors not shown)

Abstract: This document presents the initial scientific case for upgrading the Continuous Electron Beam Accelerator Facility (CEBAF) at Jefferson Lab (JLab) to 22 GeV. It is the result of a community effort, incorporating insights from a series of workshops conducted between March 2022 and April 2023. With a track record of over 25 years in delivering the world's most intense and precise multi-GeV electron… ▽ More This document presents the initial scientific case for upgrading the Continuous Electron Beam Accelerator Facility (CEBAF) at Jefferson Lab (JLab) to 22 GeV. It is the result of a community effort, incorporating insights from a series of workshops conducted between March 2022 and April 2023. With a track record of over 25 years in delivering the world's most intense and precise multi-GeV electron beams, CEBAF's potential for a higher energy upgrade presents a unique opportunity for an innovative nuclear physics program, which seamlessly integrates a rich historical background with a promising future. The proposed physics program encompass a diverse range of investigations centered around the nonperturbative dynamics inherent in hadron structure and the exploration of strongly interacting systems. It builds upon the exceptional capabilities of CEBAF in high-luminosity operations, the availability of existing or planned Hall equipment, and recent advancements in accelerator technology. The proposed program cover various scientific topics, including Hadron Spectroscopy, Partonic Structure and Spin, Hadronization and Transverse Momentum, Spatial Structure, Mechanical Properties, Form Factors and Emergent Hadron Mass, Hadron-Quark Transition, and Nuclear Dynamics at Extreme Conditions, as well as QCD Confinement and Fundamental Symmetries. Each topic highlights the key measurements achievable at a 22 GeV CEBAF accelerator. Furthermore, this document outlines the significant physics outcomes and unique aspects of these programs that distinguish them from other existing or planned facilities. In summary, this document provides an exciting rationale for the energy upgrade of CEBAF to 22 GeV, outlining the transformative scientific potential that lies within reach, and the remarkable opportunities it offers for advancing our understanding of hadron physics and related fundamental phenomena. △ Less

Submitted 24 August, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

Comments: Updates to the list of authors; Preprint number changed from theory to experiment; Updates to sections 4 and 6, including additional figures

Report number: JLAB-PHY-23-3840

arXiv:2306.08229 [pdf, other]

Telecom-band integrated multimode photonic quantum memory

Authors: Xueying Zhang, Bin Zhang, Shihai Wei, Hao Li, **yu Liao, Cheng Li, Guangwei Deng, You Wang, Haizhi Song, Lixing You, Bo **g, Feng Chen, Guang-Can Guo, Qiang Zhou

Abstract: Telecom-band integrated quantum memory is an elementary building block for develo** quantum networks compatible with fiber communication infrastructures. Towards such a network with large capacity, an integrated multimode photonic quantum memory at telecom band has yet been demonstrated. Here we report a fiber-integrated multimode quantum storage of single photon at telecom band on a laser-writt… ▽ More Telecom-band integrated quantum memory is an elementary building block for develo** quantum networks compatible with fiber communication infrastructures. Towards such a network with large capacity, an integrated multimode photonic quantum memory at telecom band has yet been demonstrated. Here we report a fiber-integrated multimode quantum storage of single photon at telecom band on a laser-written chip. The storage device is a fiber-pigtailed Er3+:LiNbO3 waveguide and allows a storage of up to 330 temporal modes of heralded single photon with 4-GHz-wide bandwidth at 1532 nm and a 167-fold increasing of coincidence detection rate with respect to single mode. Our memory system with all-fiber addressing is performed using telecom-band fiber-integrated and on-chip devices. The results represent an important step for the future quantum networks using integrated photonics devices. △ Less

Submitted 13 June, 2023; originally announced June 2023.

arXiv:2306.04214 [pdf, other]

DualHGNN: A Dual Hypergraph Neural Network for Semi-Supervised Node Classification based on Multi-View Learning and Density Awareness

Authors: Jianpeng Liao, Jun Yan, Qian Tao

Abstract: Graph-based semi-supervised node classification has been shown to become a state-of-the-art approach in many applications with high research value and significance. Most existing methods are only based on the original intrinsic or artificially established graph structure which may not accurately reflect the "true" correlation among data and are not optimal for semi-supervised node classification i… ▽ More Graph-based semi-supervised node classification has been shown to become a state-of-the-art approach in many applications with high research value and significance. Most existing methods are only based on the original intrinsic or artificially established graph structure which may not accurately reflect the "true" correlation among data and are not optimal for semi-supervised node classification in the downstream graph neural networks. Besides, while existing graph-based methods mostly utilize the explicit graph structure, some implicit information, for example, the density information, can also provide latent information that can be further exploited. To address these limitations, this paper proposes the Dual Hypergraph Neural Network (DualHGNN), a new dual connection model integrating both hypergraph structure learning and hypergraph representation learning simultaneously in a unified architecture. The DualHGNN first leverages a multi-view hypergraph learning network to explore the optimal hypergraph structure from multiple views, constrained by a consistency loss proposed to improve its generalization. Then, DualHGNN employs a density-aware hypergraph attention network to explore the high-order semantic correlation among data points based on the density-aware attention mechanism. Extensive experiments are conducted in various benchmark datasets, and the results demonstrate the effectiveness of the proposed approach. △ Less

Submitted 7 June, 2023; originally announced June 2023.

Comments: This work has been accepted by 2023 International Joint Conference on Neural Networks (IJCNN 2023). arXiv admin note: text overlap with arXiv:2201.11511

arXiv:2305.18628 [pdf, other]

doi 10.1051/0004-6361/202245303

Simultaneous and panchromatic observations of the Fast Radio Burst FRB 20180916B

Authors: M. Trudu, M. Pilia, L. Nicastro, C. Guidorzi, M. Orlandini, L. Zampieri, V. R. Marthi, F. Ambrosino, A. Possenti, M. Burgay, C. Casentini, I. Mereminskiy, V. Savchenko, E. Palazzi, F. Panessa, A. Ridolfi, F. Verrecchia, M. Anedda, G. Bernardi, M. Bachetti, R. Burenin, A. Burtovoi, P. Casella, M. Fiori, F. Frontera , et al. (25 additional authors not shown)

Abstract: Aims. Fast Radio Bursts are bright radio transients whose origin has not yet explained. The search for a multi-wavelength counterpart of those events can put a tight constrain on the emission mechanism and the progenitor source. Methods. We conducted a multi-wavelength observational campaign on FRB 20180916B between October 2020 and August 2021 during eight activity cycles of the source. Observati… ▽ More Aims. Fast Radio Bursts are bright radio transients whose origin has not yet explained. The search for a multi-wavelength counterpart of those events can put a tight constrain on the emission mechanism and the progenitor source. Methods. We conducted a multi-wavelength observational campaign on FRB 20180916B between October 2020 and August 2021 during eight activity cycles of the source. Observations were led in the radio band by the SRT both at 336 MHz and 1547 MHz and the uGMRT at 400 MHz. Simultaneous observations have been conducted by the optical telescopes Asiago (Galileo and Copernico), CMO SAI MSU, CAHA 2.2m, RTT-150 and TNG, and X/Gamma-ray detectors on board the AGILE, Insight-HXMT, INTEGRAL and Swift satellites. Results. We present the detection of 14 new bursts detected with the SRT at 336 MHz and seven new bursts with the uGMRT from this source. We provide the deepest prompt upper limits in the optical band fro FRB 20180916B to date. In fact, the TNG/SiFAP2 observation simultaneous to a burst detection by uGMRT gives an upper limit E_optical / E_radio < 1.3 x 10^2. Another burst detected by the SRT at 336 MHz was also co-observed by Insight-HMXT. The non-detection in the X-rays yields an upper limit (1-30 keV band) of E_X-ray / E_radio in the range of (0.9-1.3) x 10^7, depending on which model is considered for the X-ray emission. △ Less

Submitted 29 May, 2023; originally announced May 2023.

Comments: A&A accepted

Journal ref: A&A 676, A17 (2023)

arXiv:2305.17833 [pdf]

COVID-19 spreading patterns in family clusters reveal gender roles in China

Authors: **gyi Liao, Xiao Fan Liu, Xiao-Ke Xu, Tao Zhou

Abstract: Unfolding different gender roles is preceding the efforts to reduce gender inequality. This paper analyzes COVID-19 family clusters outside Hubei Province in mainland China during the 2020 outbreak, revealing significant differences in spreading patterns across gender and family roles. Results show that men are more likely to be the imported cases of a family cluster, and women are more likely to… ▽ More Unfolding different gender roles is preceding the efforts to reduce gender inequality. This paper analyzes COVID-19 family clusters outside Hubei Province in mainland China during the 2020 outbreak, revealing significant differences in spreading patterns across gender and family roles. Results show that men are more likely to be the imported cases of a family cluster, and women are more likely to be infected within the family. This finding provides new supportive evidence of the men as breadwinner and women as homemaker (MBWH) gender roles in China. Further analyses reveal that the MBWH pattern is stronger in eastern than in western China, stronger for younger than for elder people. This paper offers not only valuable references for formulating gender-differentiated epidemic prevention policies but also an exemplification for studying group differences in similar scenarios. △ Less

Submitted 28 May, 2023; originally announced May 2023.

Comments: 13 pages, 5 figures, 2 tables

arXiv:2305.16716 [pdf, other]

A spectral-timing study of the inner flow geometry in MAXI J1535--571 with $Insight$-HXMT and NICER

Authors: Wei Yu, Qing-Cui Bu, He-Xin Liu, Yue Huang, Liang Zhang, Zi-Xu Yang, **-Lu Qu, Shu Zhang, Li-Ming Song, Shuang-Nan Zhang, Shu-Mei Jia, Xiang Ma, Lian Tao, Ming-Yu Ge, Qing-Zhong Liu, **g-Zhi Yan, Xue-Lei Cao, Zhi Chang, Li Chen, Yong Chen, Yu-Peng Chen, Guo-Qiang Ding, Ju Guan, **g **, Ling-Da Kong , et al. (26 additional authors not shown)

Abstract: We have performed a spectral-timing analysis on the black hole X-ray binary MAXI J1535--571 during its 2017 outburst, with the aim of exploring the evolution of the inner accretion flow geometry. X-ray reverberation lags are observed in the hard-intermediate state (HIMS) and soft-intermediate state (SIMS) of the outburst. During the HIMS, the characteristic frequency of the reverberation lags… ▽ More We have performed a spectral-timing analysis on the black hole X-ray binary MAXI J1535--571 during its 2017 outburst, with the aim of exploring the evolution of the inner accretion flow geometry. X-ray reverberation lags are observed in the hard-intermediate state (HIMS) and soft-intermediate state (SIMS) of the outburst. During the HIMS, the characteristic frequency of the reverberation lags $ν_0$ (the frequency at which the soft lag turns to zero in the lag-frequency spectra) increases when the spectrum softens. This reflects a reduction of the spatial distance between the corona and accretion disc, when assuming the measured time lags are associated with the light travel time. We also find a strong correlation between $ν_0$ and type-C Quasi Periodic Oscillation (QPO) centroid frequency $ν_{QPO}$, which can be well explained by the Lense-Thirring (L-T) precession model under a truncated disk geometry. Despite the degeneracy in the spectral modellings, our results suggest that the accretion disc is largely truncated in the low hard state (LHS), and moves inward as the spectrum softens. Combine the spectral modelling results with the $ν_0$ - $ν_{QPO}$ evolution, we are inclined to believe that this source probably have a truncated disk geometry in the hard state. △ Less

Submitted 3 July, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

arXiv:2305.15843 [pdf, other]

TabGSL: Graph Structure Learning for Tabular Data Prediction

Authors: Jay Chiehen Liao, Cheng-Te Li

Abstract: This work presents a novel approach to tabular data prediction leveraging graph structure learning and graph neural networks. Despite the prevalence of tabular data in real-world applications, traditional deep learning methods often overlook the potentially valuable associations between data instances. Such associations can offer beneficial insights for classification tasks, as instances may exhib… ▽ More This work presents a novel approach to tabular data prediction leveraging graph structure learning and graph neural networks. Despite the prevalence of tabular data in real-world applications, traditional deep learning methods often overlook the potentially valuable associations between data instances. Such associations can offer beneficial insights for classification tasks, as instances may exhibit similar patterns of correlations among features and target labels. This information can be exploited by graph neural networks, necessitating robust graph structures. However, existing studies primarily focus on improving graph structure from noisy data, largely neglecting the possibility of deriving graph structures from tabular data. We present a novel solution, Tabular Graph Structure Learning (TabGSL), to enhance tabular data prediction by simultaneously learning instance correlation and feature interaction within a unified framework. This is achieved through a proposed graph contrastive learning module, along with transformer-based feature extractor and graph neural network. Comprehensive experiments conducted on 30 benchmark tabular datasets demonstrate that TabGSL markedly outperforms both tree-based models and recent deep learning-based tabular models. Visualizations of the learned instance embeddings further substantiate the effectiveness of TabGSL. △ Less

Submitted 25 May, 2023; originally announced May 2023.

arXiv:2305.14572 [pdf, ps, other]

The case for an EIC Theory Alliance: Theoretical Challenges of the EIC

Authors: Raktim Abir, Igor Akushevich, Tolga Altinoluk, Daniele Paolo Anderle, Fatma P. Aslan, Alessandro Bacchetta, Baha Balantekin, Joao Barata, Marco Battaglieri, Carlos A. Bertulani, Guillaume Beuf, Chiara Bissolotti, Daniël Boer, M. Boglione, Radja Boughezal, Eric Braaten, Nora Brambilla, Vladimir Braun, Duane Byer, Francesco Giovanni Celiberto, Yang-Ting Chien, Ian C. Cloët, Martha Constantinou, Wim Cosyn, Aurore Courtoy , et al. (146 additional authors not shown)

Abstract: We outline the physics opportunities provided by the Electron Ion Collider (EIC). These include the study of the parton structure of the nucleon and nuclei, the onset of gluon saturation, the production of jets and heavy flavor, hadron spectroscopy and tests of fundamental symmetries. We review the present status and future challenges in EIC theory that have to be addressed in order to realize thi… ▽ More We outline the physics opportunities provided by the Electron Ion Collider (EIC). These include the study of the parton structure of the nucleon and nuclei, the onset of gluon saturation, the production of jets and heavy flavor, hadron spectroscopy and tests of fundamental symmetries. We review the present status and future challenges in EIC theory that have to be addressed in order to realize this ambitious and impactful physics program, including how to engage a diverse and inclusive workforce. In order to address these many-fold challenges, we propose a coordinated effort involving theory groups with differing expertise is needed. We discuss the scientific goals and scope of such an EIC Theory Alliance. △ Less

Submitted 23 May, 2023; originally announced May 2023.

Comments: 44 pages, ReVTeX, White Paper on EIC Theory Alliance

arXiv:2305.11588 [pdf, other]

Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields

Authors: **gbo Zhang, Xiaoyu Li, Ziyu Wan, Can Wang, **g Liao

Abstract: Text-driven 3D scene generation is widely applicable to video gaming, film industry, and metaverse applications that have a large demand for 3D scenes. However, existing text-to-3D generation methods are limited to producing 3D objects with simple geometries and dreamlike styles that lack realism. In this work, we present Text2NeRF, which is able to generate a wide range of 3D scenes with complica… ▽ More Text-driven 3D scene generation is widely applicable to video gaming, film industry, and metaverse applications that have a large demand for 3D scenes. However, existing text-to-3D generation methods are limited to producing 3D objects with simple geometries and dreamlike styles that lack realism. In this work, we present Text2NeRF, which is able to generate a wide range of 3D scenes with complicated geometric structures and high-fidelity textures purely from a text prompt. To this end, we adopt NeRF as the 3D representation and leverage a pre-trained text-to-image diffusion model to constrain the 3D reconstruction of the NeRF to reflect the scene description. Specifically, we employ the diffusion model to infer the text-related image as the content prior and use a monocular depth estimation method to offer the geometric prior. Both content and geometric priors are utilized to update the NeRF model. To guarantee textured and geometric consistency between different views, we introduce a progressive scene inpainting and updating strategy for novel view synthesis of the scene. Our method requires no additional training data but only a natural language description of the scene as the input. Extensive experiments demonstrate that our Text2NeRF outperforms existing methods in producing photo-realistic, multi-view consistent, and diverse 3D scenes from a variety of natural language prompts. Our code is available at https://github.com/eckertzhang/Text2NeRF. △ Less

Submitted 31 January, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

Comments: Accepted by TVCG; Homepage: https://eckertzhang.github.io/Text2NeRF.github.io/ Code:https://github.com/eckertzhang/Text2NeRF

arXiv:2305.06540 [pdf, other]

Inter-frame Accelerate Attack against Video Interpolation Models

Authors: Junpei Liao, Zhikai Chen, Liang Yi, Wenyuan Yang, Baoyuan Wu, Xiaochun Cao

Abstract: Deep learning based video frame interpolation (VIF) method, aiming to synthesis the intermediate frames to enhance video quality, have been highly developed in the past few years. This paper investigates the adversarial robustness of VIF models. We apply adversarial attacks to VIF models and find that the VIF models are very vulnerable to adversarial examples. To improve attack efficiency, we sugg… ▽ More Deep learning based video frame interpolation (VIF) method, aiming to synthesis the intermediate frames to enhance video quality, have been highly developed in the past few years. This paper investigates the adversarial robustness of VIF models. We apply adversarial attacks to VIF models and find that the VIF models are very vulnerable to adversarial examples. To improve attack efficiency, we suggest to make full use of the property of video frame interpolation task. The intuition is that the gap between adjacent frames would be small, leading to the corresponding adversarial perturbations being similar as well. Then we propose a novel attack method named Inter-frame Accelerate Attack (IAA) that initializes the perturbation as the perturbation for the previous adjacent frame and reduces the number of attack iterations. It is shown that our method can improve attack efficiency greatly while achieving comparable attack performance with traditional methods. Besides, we also extend our method to video recognition models which are higher level vision tasks and achieves great attack efficiency. △ Less

Submitted 10 May, 2023; originally announced May 2023.

arXiv:2305.01449 [pdf, other]

Dynamics in near-threshold $J/ψ$ photoproduction

Authors: JPAC Collaboration, D. Winney, C. Fernandez-Ramirez, A. Pilloni, A. N. Hiller Blin, M. Albaladejo, L. Bibrzycki, N. Hammoud, J. Liao, V. Mathieu, G. Montana, R. J. Perry, V. Shastry, W. A. Smith, A. P. Szczepaniak

Abstract: The study of $J/ψ$ photoproduction at low energies has consequences for the understanding of multiple aspects of nonperturbative QCD, ranging from mechanical properties of the proton, to the binding inside nuclei, and the existence of hidden-charm pentaquarks. Factorization of the photon-$c \bar c$ and nucleon dynamics or Vector Meson Dominance are often invoked to justify these studies. Alternati… ▽ More The study of $J/ψ$ photoproduction at low energies has consequences for the understanding of multiple aspects of nonperturbative QCD, ranging from mechanical properties of the proton, to the binding inside nuclei, and the existence of hidden-charm pentaquarks. Factorization of the photon-$c \bar c$ and nucleon dynamics or Vector Meson Dominance are often invoked to justify these studies. Alternatively, open charm intermediate states have been proposed as the dominant mechanism underlying $J/ψ$ photoproduction. As the latter violates this factorization, it is important to estimate the relevance of such contributions. We analyse the latest differential and integrated photoproduction cross sections from the GlueX and $J/ψ$-007 experiments. We show that the data can be adequately described by a small number of partial waves, which we parameterize with generic models enforcing low-energy unitarity. The results suggest a nonnegligible contribution from open charm intermediate states. Furthermore, most of the models present an elastic scattering length incompatible with previous extractions based on Vector Meson Dominance, and thus call into question its applicability to heavy mesons. Our results indicate a wide array of physics possibilities that are compatible with present data and need to be disentangled. △ Less

Submitted 13 September, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

Comments: 15 pages, 7 figures, 2 tables. Version to appear on Phys. Rev. D

Report number: JLAB-THY-23-3802

Journal ref: Phys. Rev. D 108 (2023) 5, 054018

arXiv:2304.14400 [pdf, other]

IconShop: Text-Guided Vector Icon Synthesis with Autoregressive Transformers

Authors: Ronghuan Wu, Wanchao Su, Kede Ma, **g Liao

Abstract: Scalable Vector Graphics (SVG) is a popular vector image format that offers good support for interactivity and animation. Despite its appealing characteristics, creating custom SVG content can be challenging for users due to the steep learning curve required to understand SVG grammars or get familiar with professional editing software. Recent advancements in text-to-image generation have inspired… ▽ More Scalable Vector Graphics (SVG) is a popular vector image format that offers good support for interactivity and animation. Despite its appealing characteristics, creating custom SVG content can be challenging for users due to the steep learning curve required to understand SVG grammars or get familiar with professional editing software. Recent advancements in text-to-image generation have inspired researchers to explore vector graphics synthesis using either image-based methods (i.e., text -> raster image -> vector graphics) combining text-to-image generation models with image vectorization, or language-based methods (i.e., text -> vector graphics script) through pretrained large language models. However, these methods still suffer from limitations in terms of generation quality, diversity, and flexibility. In this paper, we introduce IconShop, a text-guided vector icon synthesis method using autoregressive transformers. The key to success of our approach is to sequentialize and tokenize SVG paths (and textual descriptions as guidance) into a uniquely decodable token sequence. With that, we are able to fully exploit the sequence learning power of autoregressive transformers, while enabling both unconditional and text-conditioned icon synthesis. Through standard training to predict the next token on a large-scale vector icon dataset accompanied by textural descriptions, the proposed IconShop consistently exhibits better icon synthesis capability than existing image-based and language-based methods both quantitatively and qualitatively. Meanwhile, we observe a dramatic improvement in generation diversity, which is validated by the objective Uniqueness and Novelty measures. More importantly, we demonstrate the flexibility of IconShop with multiple novel icon synthesis tasks, including icon editing, icon interpolation, icon semantic combination, and icon design auto-suggestion. △ Less

Submitted 6 June, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

Comments: Project Page: https://icon-shop.github.io/

arXiv:2304.11857 [pdf, other]

Accurate and Efficient Event-based Semantic Segmentation Using Adaptive Spiking Encoder-Decoder Network

Authors: Rui Zhang, Luziwei Leng, Kaiwei Che, Hu Zhang, Jie Cheng, Qinghai Guo, Jiangxing Liao, Ran Cheng

Abstract: Leveraging the low-power, event-driven computation and the inherent temporal dynamics, spiking neural networks (SNNs) are potentially ideal solutions for processing dynamic and asynchronous signals from event-based sensors. However, due to the challenges in training and the restrictions in architectural design, there are limited examples of competitive SNNs in the realm of event-based dense predic… ▽ More Leveraging the low-power, event-driven computation and the inherent temporal dynamics, spiking neural networks (SNNs) are potentially ideal solutions for processing dynamic and asynchronous signals from event-based sensors. However, due to the challenges in training and the restrictions in architectural design, there are limited examples of competitive SNNs in the realm of event-based dense prediction when compared to artificial neural networks (ANNs). In this paper, we present an efficient spiking encoder-decoder network designed for large-scale event-based semantic segmentation tasks. This is achieved by optimizing the encoder using a hierarchical search method. To enhance learning from dynamic event streams, we harness the inherent adaptive threshold of spiking neurons to modulate network activation. Moreover, we introduce a dual-path Spiking Spatially-Adaptive Modulation (SSAM) block, specifically designed to enhance the representation of sparse events, thereby considerably improving network performance. Our proposed network achieves a 72.57% mean intersection over union (MIoU) on the DDD17 dataset and a 57.22% MIoU on the recently introduced, larger DSEC-Semantic dataset. This performance surpasses the current state-of-the-art ANNs by 4%, whilst consuming significantly less computational resources. To the best of our knowledge, this is the first study demonstrating SNNs outperforming ANNs in demanding event-based semantic segmentation tasks, thereby establishing the vast potential of SNNs in the field of event-based vision. Our source code will be made publicly accessible. △ Less

Submitted 9 July, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

arXiv:2304.11298 [pdf, other]

doi 10.1088/1367-2630/accec2

Dynamical $N$-photon bundle emission

Authors: Fen Zou, Yong Li, Jie-Qiao Liao

Abstract: Engineering multiphoton resources is of importance in quantum metrology, quantum lithography, and biological sensing. Here we propose a concept of dynamical emission of $N$ strongly-correlated photons. This is realized in a circuit quantum electrodynamical system driven by two Gaussian-pulse sequences. The underlying physical mechanism relies on the stimulated Raman adiabatic passage that allows e… ▽ More Engineering multiphoton resources is of importance in quantum metrology, quantum lithography, and biological sensing. Here we propose a concept of dynamical emission of $N$ strongly-correlated photons. This is realized in a circuit quantum electrodynamical system driven by two Gaussian-pulse sequences. The underlying physical mechanism relies on the stimulated Raman adiabatic passage that allows efficient and selective preparation of target multiphoton states. Assisted by the photon decay, a highly pure $N$-photon bundle emission takes place in this system. In particular, the dynamical $N$-photon bundle emission can be tuned by controlling the time interval between consecutive pulses so that the device behaves as an $N$-photon gun, which can be triggered on demand. Our work opens up a route to achieve multiphoton source devices, which have wide potential applications in quantum information processing and quantum metrology. △ Less

Submitted 7 May, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

Comments: 17 pages, 5 figures

Journal ref: New J. Phys. 25(4), 043027 (2023)

arXiv:2304.10537 [pdf, other]

Learning Neural Duplex Radiance Fields for Real-Time View Synthesis

Authors: Ziyu Wan, Christian Richardt, Aljaž Božič, Chao Li, Vijay Rengarajan, Seonghyeon Nam, Xiaoyu Xiang, Tuotuo Li, Bo Zhu, Rakesh Ranjan, **g Liao

Abstract: Neural radiance fields (NeRFs) enable novel view synthesis with unprecedented visual quality. However, to render photorealistic images, NeRFs require hundreds of deep multilayer perceptron (MLP) evaluations - for each pixel. This is prohibitively expensive and makes real-time rendering infeasible, even on powerful modern GPUs. In this paper, we propose a novel approach to distill and bake NeRFs in… ▽ More Neural radiance fields (NeRFs) enable novel view synthesis with unprecedented visual quality. However, to render photorealistic images, NeRFs require hundreds of deep multilayer perceptron (MLP) evaluations - for each pixel. This is prohibitively expensive and makes real-time rendering infeasible, even on powerful modern GPUs. In this paper, we propose a novel approach to distill and bake NeRFs into highly efficient mesh-based neural representations that are fully compatible with the massively parallel graphics rendering pipeline. We represent scenes as neural radiance features encoded on a two-layer duplex mesh, which effectively overcomes the inherent inaccuracies in 3D surface reconstruction by learning the aggregated radiance information from a reliable interval of ray-surface intersections. To exploit local geometric relationships of nearby pixels, we leverage screen-space convolutions instead of the MLPs used in NeRFs to achieve high-quality appearance. Finally, the performance of the whole framework is further boosted by a novel multi-view distillation optimization strategy. We demonstrate the effectiveness and superiority of our approach via extensive experiments on a range of standard datasets. △ Less

Submitted 20 April, 2023; originally announced April 2023.

Comments: CVPR 2023. Project page: http://raywzy.com/NDRF

arXiv:2304.10040 [pdf, ps, other]

The kernels of powers of linear operator via Weyr characteristic

Authors: Jie Jian, Jun Liao, Heguo Liu

Abstract: The adjoint of a matrix in the Lie algebra associated with a matrix algebra is a fundamental operator, which can be generalized to a more general operator $\varphi_{AB}: X\rightarrow AX-XB$ by two matrices $A$ and $B$. The kernel of the operator is very well-known and it can be found in Gantmacher's book. The formulas for the dimensions of the kernels of arbitrary powers of the operator… ▽ More The adjoint of a matrix in the Lie algebra associated with a matrix algebra is a fundamental operator, which can be generalized to a more general operator $\varphi_{AB}: X\rightarrow AX-XB$ by two matrices $A$ and $B$. The kernel of the operator is very well-known and it can be found in Gantmacher's book. The formulas for the dimensions of the kernels of arbitrary powers of the operator $\varphi_{AB}$ were given in terms of the Segre characteristics of these two matrices by the second and third authors in this paper and their collaborators. This paper provides an alternative approach to this problem via the Weyr characteristic in a more essential method. We obtain formulas for the dimensions of the kernels of arbitrary powers of the operator in terms of the Weyr characteristics. Furthermore, the basis for kernel of each power of the operator is described explicitly. As a consequence, for arbitrary square matrices $A$ and $B$ over an algebraically closed field, the dimension of the kernel of each power of the operator $\varphi_{A-λI,B}$ for eigenvalues $λ$ of $\varphi_{AB}$ can be viewed as a similarity invariant of the operator $\varphi_{AB}$, so we characterise the operator within similarity, which should be of interest to a number of people (including physicists). △ Less

Submitted 17 February, 2024; v1 submitted 19 April, 2023; originally announced April 2023.

MSC Class: 15A24; 15A27

arXiv:2304.08282 [pdf]

Deep-Learning-based Vasculature Extraction for Single-Scan Optical Coherence Tomography Angiography

Authors: **peng Liao, Tianyu Zhang, Yilong Zhang, Chunhui Li, Zhihong Huang

Abstract: Optical coherence tomography angiography (OCTA) is a non-invasive imaging modality that extends the functionality of OCT by extracting moving red blood cell signals from surrounding static biological tissues. OCTA has emerged as a valuable tool for analyzing skin microvasculature, enabling more accurate diagnosis and treatment monitoring. Most existing OCTA extraction algorithms, such as speckle v… ▽ More Optical coherence tomography angiography (OCTA) is a non-invasive imaging modality that extends the functionality of OCT by extracting moving red blood cell signals from surrounding static biological tissues. OCTA has emerged as a valuable tool for analyzing skin microvasculature, enabling more accurate diagnosis and treatment monitoring. Most existing OCTA extraction algorithms, such as speckle variance (SV)- and eigen-decomposition (ED)-OCTA, implement a larger number of repeated (NR) OCT scans at the same position to produce high-quality angiography images. However, a higher NR requires a longer data acquisition time, leading to more unpredictable motion artifacts. In this study, we propose a vasculature extraction pipeline that uses only one-repeated OCT scan to generate OCTA images. The pipeline is based on the proposed Vasculature Extraction Transformer (VET), which leverages convolutional projection to better learn the spatial relationships between image patches. In comparison to OCTA images obtained via the SV-OCTA (PSNR: 17.809) and ED-OCTA (PSNR: 18.049) using four-repeated OCT scans, OCTA images extracted by VET exhibit moderate quality (PSNR: 17.515) and higher image contrast while reducing the required data acquisition time from ~8 s to ~2 s. Based on visual observations, the proposed VET outperforms SV and ED algorithms when using neck and face OCTA data in areas that are challenging to scan. This study represents that the VET has the capacity to extract vascularture images from a fast one-repeated OCT scan, facilitating accurate diagnosis for patients. △ Less

Submitted 3 May, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

arXiv:2304.00963 [pdf, other]

doi 10.1103/PhysRevA.108.013516

Controllable generation of mechanical quadrature squeezing via dark-mode engineering in cavity optomechanics

Authors: Jian Huang, Deng-Gao Lai, Jie-Qiao Liao

Abstract: Quantum squeezing is an important resource in modern quantum technologies, such as quantum precision measurement and continuous-variable quantum information processing. The generation of squeezed states of mechanical modes is a significant task in cavity optomechanics. Motivated by recent interest in multimode optomechanics, it becomes an interesting topic to create quadrature squeezing in multipl… ▽ More Quantum squeezing is an important resource in modern quantum technologies, such as quantum precision measurement and continuous-variable quantum information processing. The generation of squeezed states of mechanical modes is a significant task in cavity optomechanics. Motivated by recent interest in multimode optomechanics, it becomes an interesting topic to create quadrature squeezing in multiple mechanical resonators. However, in the multiple-degenerate-mechanical-mode optomechanical systems, the dark-mode effect strongly suppresses the quantum effects in mechanical modes. Here we study the generation of mechanical squeezing in a two-mechanical-mode optomechanical system by breaking the dark-mode effect with the synthetic-gauge-field method. We find that when the mechanical modes work at a finite temperature, the mechanical squeezing is weak or even disappeared due to the dark-mode effect, while the strong mechanical squeezing can be generated once the dark-mode effect is broken. In particular, the thermal-phonon-occupation tolerance of the mechanical squeezing is approximately three orders of magnitude larger than that without breaking the dark-mode effect. We also generalize this method to break the dark modes and to create the mechanical squeezing in a multiple-mechanical-mode optomechanical system. Our results describe a general physical mechanism and pave the way towards the generation of noise-resistant quantum resources. △ Less

Submitted 27 July, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

Comments: 11 pages, 6 figures

Journal ref: Phys. Rev. A 108, 013516 (2023)

arXiv:2303.17606 [pdf, other]

AvatarCraft: Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control

Authors: Ruixiang Jiang, Can Wang, **gbo Zhang, Menglei Chai, Mingming He, Dongdong Chen, **g Liao

Abstract: Neural implicit fields are powerful for representing 3D scenes and generating high-quality novel views, but it remains challenging to use such implicit representations for creating a 3D human avatar with a specific identity and artistic style that can be easily animated. Our proposed method, AvatarCraft, addresses this challenge by using diffusion models to guide the learning of geometry and textu… ▽ More Neural implicit fields are powerful for representing 3D scenes and generating high-quality novel views, but it remains challenging to use such implicit representations for creating a 3D human avatar with a specific identity and artistic style that can be easily animated. Our proposed method, AvatarCraft, addresses this challenge by using diffusion models to guide the learning of geometry and texture for a neural avatar based on a single text prompt. We carefully design the optimization framework of neural implicit fields, including a coarse-to-fine multi-bounding box training strategy, shape regularization, and diffusion-based constraints, to produce high-quality geometry and texture. Additionally, we make the human avatar animatable by deforming the neural implicit field with an explicit war** field that maps the target human mesh to a template human mesh, both represented using parametric human models. This simplifies animation and resha** of the generated avatar by controlling pose and shape parameters. Extensive experiments on various text descriptions show that AvatarCraft is effective and robust in creating human avatars and rendering novel views, poses, and shapes. Our project page is: https://avatar-craft.github.io/. △ Less

Submitted 21 August, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

Comments: ICCV 2023 Camera Ready

arXiv:2303.17254 [pdf, other]

Hot QCD White Paper

Authors: M. Arslandok, S. A. Bass, A. A. Baty, I. Bautista, C. Beattie, F. Becattini, R. Bellwied, Y. Berdnikov, A. Berdnikov, J. Bielcik, J. T. Blair, F. Bock, B. Boimska, H. Bossi, H. Caines, Y. Chen, Y. -T. Chien, M. Chiu, M. E. Connors, M. Csanád, C. L. da Silva, A. P. Dash, G. David, K. Dehmelt, V. Dexheimer , et al. (149 additional authors not shown)

Abstract: Hot QCD physics studies the nuclear strong force under extreme temperature and densities. Experimentally these conditions are achieved via high-energy collisions of heavy ions at the Relativistic Heavy Ion Collider (RHIC) and the Large Hadron Collider (LHC). In the past decade, a unique and substantial suite of data was collected at RHIC and the LHC, probing hydrodynamics at the nucleon scale, the… ▽ More Hot QCD physics studies the nuclear strong force under extreme temperature and densities. Experimentally these conditions are achieved via high-energy collisions of heavy ions at the Relativistic Heavy Ion Collider (RHIC) and the Large Hadron Collider (LHC). In the past decade, a unique and substantial suite of data was collected at RHIC and the LHC, probing hydrodynamics at the nucleon scale, the temperature dependence of the transport properties of quark-gluon plasma, the phase diagram of nuclear matter, the interaction of quarks and gluons at different scales and much more. This document, as part of the 2023 nuclear science long range planning process, was written to review the progress in hot QCD since the 2015 Long Range Plan for Nuclear Science, as well as highlight the realization of previous recommendations, and present opportunities for the next decade, building on the accomplishments and investments made in theoretical developments and the construction of new detectors. Furthermore, this document provides additional context to support the recommendations voted on at the Joint Hot and Cold QCD Town Hall Meeting, which are reported in a separate document. △ Less

Submitted 30 March, 2023; originally announced March 2023.

Comments: 190 pages, 69 figures

arXiv:2303.14746

Giant-atom entanglement in waveguide-QED systems including non-Markovian effect

Authors: Xian-Li Yin, Jie-Qiao Liao

Abstract: We study the generation of quantum entanglement between two giant atoms coupled to a common one-dimensional waveguide. Here each giant atom interacts with the waveguide at two separate coupling points. Within the Wigner-Weisskopf framework for single coupling points, we obtain the time-delayed quantum master equations governing the evolution of the two giant atoms for three different coupling conf… ▽ More We study the generation of quantum entanglement between two giant atoms coupled to a common one-dimensional waveguide. Here each giant atom interacts with the waveguide at two separate coupling points. Within the Wigner-Weisskopf framework for single coupling points, we obtain the time-delayed quantum master equations governing the evolution of the two giant atoms for three different coupling configurations: separated, braided, and nested couplings. For each coupling configuration, we consider both the Markovian and non-Markovian entanglement dynamics of the giant atoms, which are initially in two different separable states: single- and double-excitation states. Our results show that the generated entanglement depends on the phase shift, time delay, atomic initial state, and the coupling configuration. For the single-excitation initial state, there exists the steady-state entanglement for each coupling in both the Markovian and non-Markovian regimes due to the appearance of the dark state. For the double-excitation initial state, we observe entanglement sudden birth via adjusting the phase shift in both regimes. In particular, the maximally achievable entanglement for the nested coupling is about one order of magnitude larger than those of separate and braided couplings. We also find that the maximal entanglement for these three coupling configurations can be enhanced in the case of small time delays. This work can be utilized for the generation and control of entanglement in quantum networks based on giant-atom waveguide-QED systems, which have wide potential applications in quantum information processing. △ Less

Submitted 8 June, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

Comments: We withdraw this submission because the obtained time-delayed quantum master equations do not have complete positivity during some periods of the dynamic evolution

Showing 101–150 of 725 results for author: Lia, J