Search | arXiv e-print repository

Curvatures of metric Jordan algebras

Authors: Hui Zhang, Zaili Yan, Zhiqi Chen

Abstract: In this paper, we investigate metric Jordan algebras, and follow the lines of the paper (J. Milnor: Curvatures of left invariant metrics on Lie groups. Adv. Math. (1976)). Firstly, we define the Jordan-Levi-Civita connection, then we show that every metric Jordan algebra admits a unique Jordan-Levi-Civita connection. Secondly, using the Jordan-Levi-Civita connection, we introduce three natural cur… ▽ More In this paper, we investigate metric Jordan algebras, and follow the lines of the paper (J. Milnor: Curvatures of left invariant metrics on Lie groups. Adv. Math. (1976)). Firstly, we define the Jordan-Levi-Civita connection, then we show that every metric Jordan algebra admits a unique Jordan-Levi-Civita connection. Secondly, using the Jordan-Levi-Civita connection, we introduce three natural curvature tensors on metric Jordan algebras, and obtain the corresponding curvature formulas. Thirdly, based on these curvature formulas, we prove that every formally real Jordan algebra admits both a metric of non-positive Jordan curvature, and a Jordan-Einstein metric of negative Jordan scalar curvature. Besides, for nilpotent Jordan algebras, we prove that they admit no Jordan-Einstein metrics. △ Less

Submitted 19 May, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

Comments: Comments are welcome

MSC Class: 17C37; 17C50; 53C25; 53C99; 22E60

arXiv:2309.01123 [pdf, other]

On the determinant of the $Q$-walk matrix of rooted product with a path

Authors: Zhidan Yan, Lihuan Mao, Wei Wang

Abstract: Let $G$ be an $n$-vertex graph and $Q(G)$ be its signless Laplacian matrix. The $Q$-walk matrix of $G$, denoted by $W_Q(G)$, is $[e,Q(G)e,\ldots,Q^{n-1}(G)e]$, where $e$ is the all-one vector. Let $G\circ P_m$ be the graph obtained from $G$ and $n$ copies of the path $P_m$ by identifying the $i$-th vertex of $G$ with an endvertex of the $i$-th copy of $P_m$ for each $i$. We prove that,… ▽ More Let $G$ be an $n$-vertex graph and $Q(G)$ be its signless Laplacian matrix. The $Q$-walk matrix of $G$, denoted by $W_Q(G)$, is $[e,Q(G)e,\ldots,Q^{n-1}(G)e]$, where $e$ is the all-one vector. Let $G\circ P_m$ be the graph obtained from $G$ and $n$ copies of the path $P_m$ by identifying the $i$-th vertex of $G$ with an endvertex of the $i$-th copy of $P_m$ for each $i$. We prove that, $$\det W_Q(G\circ P_m)=\pm (\det Q(G))^{m-1}(\det W_Q(G))^m$$ holds for any $m\ge 2$. This gives a signless Laplacian counterpart of the following recently established identity [17]: $$\det W_A(G\circ P_m)=\pm (\det A(G))^{\lfloor\frac{m}{2}\rfloor}(\det W_A(G))^m,$$ where $A(G)$ is the adjacency matrix of $G$ and $W_A(G)=[e,A(G)e,\ldots,A^{n-1}(G)e]$. We also propose a conjecture to unify the above two equalities. △ Less

Submitted 3 September, 2023; originally announced September 2023.

Comments: 16 pages, 1 figure

MSC Class: 05C50

arXiv:2309.00655 [pdf, other]

RigNet++: Semantic Assisted Repetitive Image Guided Network for Depth Completion

Authors: Zhiqiang Yan, Xiang Li, Le Hui, Zhenyu Zhang, Jun Li, Jian Yang

Abstract: Depth completion aims to recover dense depth maps from sparse ones, where color images are often used to facilitate this task. Recent depth methods primarily focus on image guided learning frameworks. However, blurry guidance in the image and unclear structure in the depth still impede their performance. To tackle these challenges, we explore a repetitive design in our image guided network to grad… ▽ More Depth completion aims to recover dense depth maps from sparse ones, where color images are often used to facilitate this task. Recent depth methods primarily focus on image guided learning frameworks. However, blurry guidance in the image and unclear structure in the depth still impede their performance. To tackle these challenges, we explore a repetitive design in our image guided network to gradually and sufficiently recover depth values. Specifically, the repetition is embodied in both the image guidance branch and depth generation branch. In the former branch, we design a dense repetitive hourglass network (DRHN) to extract discriminative image features of complex environments, which can provide powerful contextual instruction for depth prediction. In the latter branch, we present a repetitive guidance (RG) module based on dynamic convolution, in which an efficient convolution factorization is proposed to reduce the complexity while modeling high-frequency structures progressively. Furthermore, in the semantic guidance branch, we utilize the well-known large vision model, i.e., segment anything (SAM), to supply RG with semantic prior. In addition, we propose a region-aware spatial propagation network (RASPN) for further depth refinement based on the semantic prior constraint. Finally, we collect a new dataset termed TOFDC for the depth completion task, which is acquired by the time-of-flight (TOF) sensor and the color camera on smartphones. Extensive experiments demonstrate that our method achieves state-of-the-art performance on KITTI, NYUv2, Matterport3D, 3D60, VKITTI, and our TOFDC. △ Less

Submitted 28 February, 2024; v1 submitted 1 September, 2023; originally announced September 2023.

Comments: 20 pages

arXiv:2309.00221 [pdf, other]

A multinode quantum network over a metropolitan area

Authors: Jian-Long Liu, Xi-Yu Luo, Yong Yu, Chao-Yang Wang, Bin Wang, Yi Hu, Jun Li, Ming-Yang Zheng, Bo Yao, Zi Yan, Da Teng, **-Wei Jiang, Xiao-Bing Liu, Xiu-** Xie, Jun Zhang, Qing-He Mao, Xiao Jiang, Qiang Zhang, Xiao-Hui Bao, Jian-Wei Pan

Abstract: Towards realizing the future quantum internet, a pivotal milestone entails the transition from two-node proof-of-principle experiments conducted in laboratories to comprehensive, multi-node setups on large scales. Here, we report on the debut implementation of a multi-node entanglement-based quantum network over a metropolitan area. We equipped three quantum nodes with atomic quantum memories and… ▽ More Towards realizing the future quantum internet, a pivotal milestone entails the transition from two-node proof-of-principle experiments conducted in laboratories to comprehensive, multi-node setups on large scales. Here, we report on the debut implementation of a multi-node entanglement-based quantum network over a metropolitan area. We equipped three quantum nodes with atomic quantum memories and their telecom interfaces, and combined them into a scalable phase-stabilized architecture through a server node. We demonstrated heralded entanglement generation between two quantum nodes situated 12.5 km apart, and the storage of entanglement exceeding the round-trip communication time. We also showed the concurrent entanglement generation on three links. Our work provides a metropolitan-scale testbed for the evaluation and exploration of multi-node quantum network protocols and starts a new stage of quantum internet research. △ Less

Submitted 31 August, 2023; originally announced September 2023.

Comments: 21 pages in total, 4 figures and 1 table in the main text, 5 figures and 8 tables in the supplementary material

arXiv:2309.00200 [pdf, other]

doi 10.1126/science.abo4504

Observations of a black hole X-ray binary indicate formation of a magnetically arrested disk

Authors: Bei You, Xinwu Cao, Zhen Yan, Jean-Marie Hameury, Bozena Czerny, Yue Wu, Tianyu Xia, Marek Sikora, Shuang-Nan Zhang, Pu Du, Piotr T. Zycki

Abstract: Accretion of material onto a black hole drags any magnetic fields present inwards, increasing their strength. Theory predicts that sufficiently strong magnetic fields can halt the accretion flow, producing a magnetically arrested disk (MAD). We analyze archival multi-wavelength observations of an outburst from the black hole X-ray binary MAXI J1820+070 in 2018. The radio and optical fluxes are del… ▽ More Accretion of material onto a black hole drags any magnetic fields present inwards, increasing their strength. Theory predicts that sufficiently strong magnetic fields can halt the accretion flow, producing a magnetically arrested disk (MAD). We analyze archival multi-wavelength observations of an outburst from the black hole X-ray binary MAXI J1820+070 in 2018. The radio and optical fluxes are delayed by about 8 and 17 days respectively, compared to the X-ray flux. We interpret this as evidence for the formation of a MAD. In this scenario, the magnetic field is amplified by an expanding corona, forming a MAD around the time of the radio peak. The optical delay is then due to thermal viscous instability in the outer disk. △ Less

Submitted 11 October, 2023; v1 submitted 31 August, 2023; originally announced September 2023.

Comments: The author's version of the article which will appear in Science on 31 August 2023, 49 pages including the extended data. The online publication version can be found at https://doi.org/10.1126/science.abo4504

Journal ref: Science 381, 961-964 (2023)

arXiv:2308.16246 [pdf, other]

Active Neural Map**

Authors: Zike Yan, Haoxiang Yang, Hongbin Zha

Abstract: We address the problem of active map** with a continually-learned neural scene representation, namely Active Neural Map**. The key lies in actively finding the target space to be explored with efficient agent movement, thus minimizing the map uncertainty on-the-fly within a previously unseen environment. In this paper, we examine the weight space of the continually-learned neural field, and sh… ▽ More We address the problem of active map** with a continually-learned neural scene representation, namely Active Neural Map**. The key lies in actively finding the target space to be explored with efficient agent movement, thus minimizing the map uncertainty on-the-fly within a previously unseen environment. In this paper, we examine the weight space of the continually-learned neural field, and show empirically that the neural variability, the prediction robustness against random weight perturbation, can be directly utilized to measure the instant uncertainty of the neural map. Together with the continuous geometric information inherited in the neural map, the agent can be guided to find a traversable path to gradually gain knowledge of the environment. We present for the first time an active map** system with a coordinate-based implicit neural representation for online scene reconstruction. Experiments in the visually-realistic Gibson and Matterport3D environment demonstrate the efficacy of the proposed method. △ Less

Submitted 30 August, 2023; originally announced August 2023.

Comments: ICCV 2023, project page: https://zikeyan.github.io/active-INR/index.html

arXiv:2308.12951 [pdf, other]

Directly imaging spin polarons in a kinetically frustrated Hubbard system

Authors: Max L. Prichard, Benjamin M. Spar, Ivan Morera, Eugene Demler, Zoe Z. Yan, Waseem S. Bakr

Abstract: The emergence of quasiparticles in quantum many-body systems underlies the rich phenomenology in many strongly interacting materials. In the context of doped Mott insulators, magnetic polarons are quasiparticles that usually arise from an interplay between the kinetic energy of doped charge carriers and superexchange spin interactions. However, in kinetically frustrated lattices, itinerant spin po… ▽ More The emergence of quasiparticles in quantum many-body systems underlies the rich phenomenology in many strongly interacting materials. In the context of doped Mott insulators, magnetic polarons are quasiparticles that usually arise from an interplay between the kinetic energy of doped charge carriers and superexchange spin interactions. However, in kinetically frustrated lattices, itinerant spin polarons - bound states of a dopant and a spin-flip - have been theoretically predicted even in the absence of superexchange coupling. Despite their important role in the theory of kinetic magnetism, a microscopic observation of these polarons is lacking. Here we directly image itinerant spin polarons in a triangular lattice Hubbard system realised with ultracold atoms, revealing enhanced antiferromagnetic correlations in the local environment of a hole dopant. In contrast, around a charge dopant, we find ferromagnetic correlations, a manifestation of the elusive Nagaoka effect. We study the evolution of these correlations with interactions and do**, and use higher-order correlation functions to further elucidate the relative contributions of superexchange and kinetic mechanisms. The robustness of itinerant spin polarons at high temperature paves the way for exploring potential mechanisms for hole pairing and superconductivity in frustrated systems. Furthermore, our work provides microscopic insights into related phenomena in triangular lattice moiré materials. △ Less

Submitted 24 August, 2023; originally announced August 2023.

Comments: 7 pages (4 figures) + 6 pages methods (7 figures)

arXiv:2308.12698 [pdf, other]

Potato: A Data-Oriented Programming 3D Simulator for Large-Scale Heterogeneous Swarm Robotics

Authors: **jie Li, Liang Han, Haoyang Yu, Zhaotian Wang, Pengzhi Yang, Ziwei Yan, Zhang Ren

Abstract: Large-scale simulation with realistic nonlinear dynamic models is crucial for algorithms development for swarm robotics. However, existing platforms are mainly developed based on Object-Oriented Programming (OOP) and either use simple kinematic models to pursue a large number of simulating nodes or implement realistic dynamic models with limited simulating nodes. In this paper, we develop a simula… ▽ More Large-scale simulation with realistic nonlinear dynamic models is crucial for algorithms development for swarm robotics. However, existing platforms are mainly developed based on Object-Oriented Programming (OOP) and either use simple kinematic models to pursue a large number of simulating nodes or implement realistic dynamic models with limited simulating nodes. In this paper, we develop a simulator based on Data-Oriented Programming (DOP) that utilizes GPU parallel computing to achieve large-scale swarm robotic simulations. Specifically, we use a multi-process approach to simulate heterogeneous agents and leverage PyTorch with GPU to simulate homogeneous agents with a large number. We test our approach using a nonlinear quadrotor model and demonstrate that this DOP approach can maintain almost the same computational speed when quadrotors are less than 5,000. We also provide two examples to present the functionality of the platform. △ Less

Submitted 24 August, 2023; originally announced August 2023.

Comments: 4 pages, 5 figures, accepted by ICRA 2023 Workshop on "The Role of Robotics Simulators for Unmanned Aerial Vehicles"

arXiv:2308.11957 [pdf, other]

CED: Consistent ensemble distillation for audio tagging

Authors: Heinrich Dinkel, Yongqing Wang, Zhiyong Yan, Junbo Zhang, Yujun Wang

Abstract: Augmentation and knowledge distillation (KD) are well-established techniques employed in audio classification tasks, aimed at enhancing performance and reducing model sizes on the widely recognized Audioset (AS) benchmark. Although both techniques are effective individually, their combined use, called consistent teaching, hasn't been explored before. This paper proposes CED, a simple training fram… ▽ More Augmentation and knowledge distillation (KD) are well-established techniques employed in audio classification tasks, aimed at enhancing performance and reducing model sizes on the widely recognized Audioset (AS) benchmark. Although both techniques are effective individually, their combined use, called consistent teaching, hasn't been explored before. This paper proposes CED, a simple training framework that distils student models from large teacher ensembles with consistent teaching. To achieve this, CED efficiently stores logits as well as the augmentation methods on disk, making it scalable to large-scale datasets. Central to CED's efficacy is its label-free nature, meaning that only the stored logits are used for the optimization of a student model only requiring 0.3\% additional disk space for AS. The study trains various transformer-based models, including a 10M parameter model achieving a 49.0 mean average precision (mAP) on AS. Pretrained models and code are available at https://github.com/RicherMans/CED. △ Less

Submitted 7 September, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

arXiv:2308.10181 [pdf]

Stochastic Optimization of Coupled Power Distribution-Urban Transportation Network Operations with Autonomous Mobility on Demand Systems

Authors: Han Wang, Xiaoyuan Xu, Yue Chen, Zheng Yan, Mohammad Shahidehpour, Jiaqi Li, Shaolun Xu

Abstract: Autonomous mobility on demand systems (AMoDS) will significantly affect the operation of coupled power distribution-urban transportation networks (PTNs) by the optimal dispatch of electric vehicles (EVs). This paper proposes an uncertainty method to analyze the operational states of PTNs with AMoDS. First, a PTN operation framework is designed considering the controllable EVs dispatched by AMoDS a… ▽ More Autonomous mobility on demand systems (AMoDS) will significantly affect the operation of coupled power distribution-urban transportation networks (PTNs) by the optimal dispatch of electric vehicles (EVs). This paper proposes an uncertainty method to analyze the operational states of PTNs with AMoDS. First, a PTN operation framework is designed considering the controllable EVs dispatched by AMoDS as well as the uncontrollable driving behaviors of other vehicle users. Then, a bi-level power-traffic flow (PTF) model is proposed to characterize the interaction of power distribution networks (PDNs) and urban transportation networks (UTNs). In the upper level, a social optimum model is established to minimize the operating cost of PDNs and UTNs embedded with controllable EVs. In the lower level, a stochastic user equilibrium (SUE) model is established to minimize the operating cost of uncontrollable EVs and gasoline vehicles (GVs) in UTNs. Finally, a probabilistic PTF analysis method is developed to evaluate PTN operations under environmental and human uncertainties. A regional sensitivity analysis method is proposed to identify the critical uncertainties and quantify the impacts of their distribution ranges on PTN operations. The effectiveness of the proposed method is verified by the PTN consisting of a 21-bus PDN and a 20-node UTN. △ Less

Submitted 20 August, 2023; originally announced August 2023.

Comments: 10 pages, 13 figures

arXiv:2308.10001 [pdf, other]

AltNeRF: Learning Robust Neural Radiance Field via Alternating Depth-Pose Optimization

Authors: Kun Wang, Zhiqiang Yan, Huang Tian, Zhenyu Zhang, Xiang Li, Jun Li, Jian Yang

Abstract: Neural Radiance Fields (NeRF) have shown promise in generating realistic novel views from sparse scene images. However, existing NeRF approaches often encounter challenges due to the lack of explicit 3D supervision and imprecise camera poses, resulting in suboptimal outcomes. To tackle these issues, we propose AltNeRF -- a novel framework designed to create resilient NeRF representations using sel… ▽ More Neural Radiance Fields (NeRF) have shown promise in generating realistic novel views from sparse scene images. However, existing NeRF approaches often encounter challenges due to the lack of explicit 3D supervision and imprecise camera poses, resulting in suboptimal outcomes. To tackle these issues, we propose AltNeRF -- a novel framework designed to create resilient NeRF representations using self-supervised monocular depth estimation (SMDE) from monocular videos, without relying on known camera poses. SMDE in AltNeRF masterfully learns depth and pose priors to regulate NeRF training. The depth prior enriches NeRF's capacity for precise scene geometry depiction, while the pose prior provides a robust starting point for subsequent pose refinement. Moreover, we introduce an alternating algorithm that harmoniously melds NeRF outputs into SMDE through a consistence-driven mechanism, thus enhancing the integrity of depth priors. This alternation empowers AltNeRF to progressively refine NeRF representations, yielding the synthesis of realistic novel views. Extensive experiments showcase the compelling capabilities of AltNeRF in generating high-fidelity and robust novel views that closely resemble reality. △ Less

Submitted 23 February, 2024; v1 submitted 19 August, 2023; originally announced August 2023.

Comments: Accepted by AAAI-24

arXiv:2308.08222 [pdf, other]

HyperSNN: A new efficient and robust deep learning model for resource constrained control applications

Authors: Zhanglu Yan, Shida Wang, Kaiwen Tang, Weng-Fai Wong

Abstract: In light of the increasing adoption of edge computing in areas such as intelligent furniture, robotics, and smart homes, this paper introduces HyperSNN, an innovative method for control tasks that uses spiking neural networks (SNNs) in combination with hyperdimensional computing. HyperSNN substitutes expensive 32-bit floating point multiplications with 8-bit integer additions, resulting in reduced… ▽ More In light of the increasing adoption of edge computing in areas such as intelligent furniture, robotics, and smart homes, this paper introduces HyperSNN, an innovative method for control tasks that uses spiking neural networks (SNNs) in combination with hyperdimensional computing. HyperSNN substitutes expensive 32-bit floating point multiplications with 8-bit integer additions, resulting in reduced energy consumption while enhancing robustness and potentially improving accuracy. Our model was tested on AI Gym benchmarks, including Cartpole, Acrobot, MountainCar, and Lunar Lander. HyperSNN achieves control accuracies that are on par with conventional machine learning methods but with only 1.36% to 9.96% of the energy expenditure. Furthermore, our experiments showed increased robustness when using HyperSNN. We believe that HyperSNN is especially suitable for interactive, mobile, and wearable devices, promoting energy-efficient and robust system design. Furthermore, it paves the way for the practical implementation of complex algorithms like model predictive control (MPC) in real-world industrial scenarios. △ Less

Submitted 17 August, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

arXiv:2308.07903 [pdf, other]

Relightable and Animatable Neural Avatar from Sparse-View Video

Authors: Zhen Xu, Sida Peng, Chen Geng, Linzhan Mou, Zihan Yan, Jiaming Sun, Hujun Bao, Xiaowei Zhou

Abstract: This paper tackles the challenge of creating relightable and animatable neural avatars from sparse-view (or even monocular) videos of dynamic humans under unknown illumination. Compared to studio environments, this setting is more practical and accessible but poses an extremely challenging ill-posed problem. Previous neural human reconstruction methods are able to reconstruct animatable avatars fr… ▽ More This paper tackles the challenge of creating relightable and animatable neural avatars from sparse-view (or even monocular) videos of dynamic humans under unknown illumination. Compared to studio environments, this setting is more practical and accessible but poses an extremely challenging ill-posed problem. Previous neural human reconstruction methods are able to reconstruct animatable avatars from sparse views using deformed Signed Distance Fields (SDF) but cannot recover material parameters for relighting. While differentiable inverse rendering-based methods have succeeded in material recovery of static objects, it is not straightforward to extend them to dynamic humans as it is computationally intensive to compute pixel-surface intersection and light visibility on deformed SDFs for inverse rendering. To solve this challenge, we propose a Hierarchical Distance Query (HDQ) algorithm to approximate the world space distances under arbitrary human poses. Specifically, we estimate coarse distances based on a parametric human model and compute fine distances by exploiting the local deformation invariance of SDF. Based on the HDQ algorithm, we leverage sphere tracing to efficiently estimate the surface intersection and light visibility. This allows us to develop the first system to recover animatable and relightable neural avatars from sparse view (or monocular) inputs. Experiments demonstrate that our approach is able to produce superior results compared to state-of-the-art methods. Our code will be released for reproducibility. △ Less

Submitted 17 August, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

Comments: Project page: https://zju3dv.github.io/relightable_avatar

arXiv:2308.06639 [pdf, other]

doi 10.1145/3586183.3606804

3D Printing Magnetophoretic Displays

Authors: Zeyu Yan, Hsuanling Lee, Liang He, Huaishu Peng

Abstract: We present a pipeline for printing interactive and always-on magnetophoretic displays using affordable Fused Deposition Modeling (FDM) 3D printers. Using our pipeline, an end-user can convert the surface of a 3D shape into a matrix of voxels. The generated model can be sent to an FDM 3D printer equipped with an additional syringe-based injector. During the printing process, an oil and iron powder-… ▽ More We present a pipeline for printing interactive and always-on magnetophoretic displays using affordable Fused Deposition Modeling (FDM) 3D printers. Using our pipeline, an end-user can convert the surface of a 3D shape into a matrix of voxels. The generated model can be sent to an FDM 3D printer equipped with an additional syringe-based injector. During the printing process, an oil and iron powder-based liquid mixture is injected into each voxel cell, allowing the appearance of the once-printed object to be editable with external magnetic sources. To achieve this, we made modifications to the 3D printer hardware and the firmware. We also developed a 3D editor to prepare printable models. We demonstrate our pipeline with a variety of examples, including a printed Stanford bunny with customizable appearances, a small espresso mug that can be used as a post-it note surface, a board game figurine with a computationally updated display, and a collection of flexible wearable accessories with editable visuals. △ Less

Submitted 12 August, 2023; originally announced August 2023.

Journal ref: UIST 2023

arXiv:2308.06496 [pdf, ps, other]

Performance Analysis for Resource Constrained Decentralized Federated Learning Over Wireless Networks

Authors: Zhigang Yan, Dong Li

Abstract: Federated learning (FL) can lead to significant communication overhead and reliance on a central server. To address these challenges, decentralized federated learning (DFL) has been proposed as a more resilient framework. DFL involves parameter exchange between devices through a wireless network. This study analyzes the performance of resource-constrained DFL using different communication schemes… ▽ More Federated learning (FL) can lead to significant communication overhead and reliance on a central server. To address these challenges, decentralized federated learning (DFL) has been proposed as a more resilient framework. DFL involves parameter exchange between devices through a wireless network. This study analyzes the performance of resource-constrained DFL using different communication schemes (digital and analog) over wireless networks to optimize communication efficiency. Specifically, we provide convergence bounds for both digital and analog transmission approaches, enabling analysis of the model performance trained on DFL. Furthermore, for digital transmission, we investigate and analyze resource allocation between computation and communication and convergence rates, obtaining its communication complexity and the minimum probability of correction communication required for convergence guarantee. For analog transmission, we discuss the impact of channel fading and noise on the model performance and the maximum errors accumulation with convergence guarantee over fading channels. Finally, we conduct numerical simulations to evaluate the performance and convergence rate of convolutional neural networks (CNNs) and Vision Transformer (ViT) trained in the DFL framework on fashion-MNIST and CIFAR-10 datasets. Our simulation results validate our analysis and discussion, revealing how to improve performance by optimizing system parameters under different communication conditions. △ Less

Submitted 12 August, 2023; originally announced August 2023.

arXiv:2308.01857 [pdf, other]

iEDA: An Open-Source Intelligent Physical Implementation Toolkit and Library

Authors: Xingquan Li, Simin Tao, Zengrong Huang, Shijian Chen, Zhisheng Zeng, Liwei Ni, Zhipeng Huang, Chunan Zhuang, Hongxi Wu, Weiguo Li1, Xueyan Zhao, He Liu, Shuaiying Long, Wei He, Bojun Liu, Sifeng Gan, Zihao Yu, Tong Liu, Yuchi Miao, Zhiyuan Yan, Hao Wang, Jie Zhao, Yifan Li, Ruizhi Liu, Xiaoze Lin , et al. (31 additional authors not shown)

Abstract: Open-source EDA shows promising potential in unleashing EDA innovation and lowering the cost of chip design. This paper presents an open-source EDA project, iEDA, aiming for building a basic infrastructure for EDA technology evolution and closing the industrial-academic gap in the EDA area. iEDA now covers the whole flow of physical design (including Floorplan, Placement, CTS, Routing, Timing Opti… ▽ More Open-source EDA shows promising potential in unleashing EDA innovation and lowering the cost of chip design. This paper presents an open-source EDA project, iEDA, aiming for building a basic infrastructure for EDA technology evolution and closing the industrial-academic gap in the EDA area. iEDA now covers the whole flow of physical design (including Floorplan, Placement, CTS, Routing, Timing Optimization etc.), and part of the analysis tools (Static Timing Analysis and Power Analysis). To demonstrate the effectiveness of iEDA, we implement and tape out three chips of different scales (from 700k to 1.5M gates) on different process nodes (110nm and 28nm) with iEDA. iEDA is publicly available from the project home page http://ieda.oscc.cc. △ Less

Submitted 3 August, 2023; originally announced August 2023.

arXiv:2308.00772 [pdf, other]

doi 10.3847/1538-4357/ad14fb

One Fits All: A Unified Synchrotron Model Explains GRBs with FRED-Shaped Pulses

Authors: Zhen-Yu Yan, Jun Yang, Xiao-Hong Zhao, Yan-Zhi Meng, Bin-Bin Zhang

Abstract: The analysis of gamma-ray burst (GRB) spectra often relies on empirical models lacking a distinct physical explanation. Previous attempts to couple physical models with observed data focus on individual burst studies, fitting models to segmented spectra with independent physical parameters. However, these approaches typically neglect to explain the time evolution of observed spectra. In this study… ▽ More The analysis of gamma-ray burst (GRB) spectra often relies on empirical models lacking a distinct physical explanation. Previous attempts to couple physical models with observed data focus on individual burst studies, fitting models to segmented spectra with independent physical parameters. However, these approaches typically neglect to explain the time evolution of observed spectra. In this study, we propose a novel approach by incorporating the synchrotron radiation model to provide a self-consistent explanation for a selection of single-pulse GRBs. Our study comprehensively tests the synchrotron model under a unified physical condition, such as a single injection event of electrons. By tracing the evolution of cooling electrons in a decaying magnetic field, our model predicts time-dependent observed spectra that align well with the data. Using a single set of physical parameters, our model successfully fits all time-resolved spectra within each burst. Our model suggests that the rising phase of the GRB light curve results from the increasing number of radiating electrons, while the declining phase is attributed to the curvature effect, electron cooling, and the decaying magnetic field. Our model provides a straightforward interpretation of the peak energy's evolution, linked to the decline of the magnetic field and electron cooling due to the expansion of the GRB emission region. Our findings strongly support the notion that spectral and temporal evolution in GRB pulses originates from the expansion of the GRB emission region, with an initial radius of approximately $10^{15}$ cm, and synchrotron radiation as the underlying emission mechanism. △ Less

Submitted 15 January, 2024; v1 submitted 1 August, 2023; originally announced August 2023.

Comments: ApJ in press, author version, 28 pages, 19 figures, 4 tables

Journal ref: 2024, ApJ, 962, 85

arXiv:2307.16355 [pdf, other]

Temporal and Spectral Properties of the Persistent Radio Source Associated with FRB 20190520B with the VLA

Authors: Xian Zhang, Wenfei Yu, Casey Law, Di Li, Shami Chatterjee, Paul Demorest, Zhen Yan, Chenhui Niu, Kshitij Aggarwal, Reshma Anna-Thomas, Sarah Burke-Spolaor, Liam Connor, Chao-wei Tsai, Weiwei Zhu, Gan Luo

Abstract: Among more than 800 known fast radio bursts (FRBs), only two, namely FRB 20121102A and FRB 20190520B, are confirmed to be associated with a persistent radio sources (PRS). Here we report evidence of apparent temporal variability in the PRS associated with the bursting FRB 20190520B based on the Karl G. Jansky Very Large Array (VLA) observations taken in 2020 and 2021. Based on the analysis of epoc… ▽ More Among more than 800 known fast radio bursts (FRBs), only two, namely FRB 20121102A and FRB 20190520B, are confirmed to be associated with a persistent radio sources (PRS). Here we report evidence of apparent temporal variability in the PRS associated with the bursting FRB 20190520B based on the Karl G. Jansky Very Large Array (VLA) observations taken in 2020 and 2021. Based on the analysis of epoch-to-epoch variability of the PRS at L, S, C, and X band in 1-12 GHz, we detected not only overall marginal variability but also a likely radio flux decrease ($\sim$ 3.2 $σ$) between the observations taken in 2020 and 2021 at 3 GHz. Assuming no spectral variation in the PRS during these observations, we found the evidence for an overall broadband radio flux decrease by about 20 percent between the 2020 and the 2021 observations, suggesting that the PRS probably evolves on the yearly time scale. If we attribute the marginal variability at 3 GHz as intrinsic or due to scintillation, the size of potential variable component of the PRS is constrained to be sub-parsec. On the other hand, the size of the PRS can be also constrained to be larger than about 0.22 parsec from the averaged radio spectrum and the integrated radio luminosity in the 1-12 GHz band based on equipartition and self-absorption arguments. We discuss potential origins of the PRS and suggest that an accreting compact object origin might be able to explain the PRS's temporal and spectral properties. Confirmation of variability or flux decline of the PRS would be critical to our understanding of the PRS and its relation to the bursting source. △ Less

Submitted 23 October, 2023; v1 submitted 30 July, 2023; originally announced July 2023.

Comments: 12 pages, 3 figures, accepted for publication in ApJ

arXiv:2307.15853 [pdf, other]

Improving Realistic Worst-Case Performance of NVCiM DNN Accelerators through Training with Right-Censored Gaussian Noise

Authors: Zheyu Yan, Yifan Qin, Wujie Wen, Xiaobo Sharon Hu, Yiyu Shi

Abstract: Compute-in-Memory (CiM), built upon non-volatile memory (NVM) devices, is promising for accelerating deep neural networks (DNNs) owing to its in-situ data processing capability and superior energy efficiency. Unfortunately, the well-trained model parameters, after being mapped to NVM devices, can often exhibit large deviations from their intended values due to device variations, resulting in notab… ▽ More Compute-in-Memory (CiM), built upon non-volatile memory (NVM) devices, is promising for accelerating deep neural networks (DNNs) owing to its in-situ data processing capability and superior energy efficiency. Unfortunately, the well-trained model parameters, after being mapped to NVM devices, can often exhibit large deviations from their intended values due to device variations, resulting in notable performance degradation in these CiM-based DNN accelerators. There exists a long list of solutions to address this issue. However, they mainly focus on improving the mean performance of CiM DNN accelerators. How to guarantee the worst-case performance under the impact of device variations, which is crucial for many safety-critical applications such as self-driving cars, has been far less explored. In this work, we propose to use the k-th percentile performance (KPP) to capture the realistic worst-case performance of DNN models executing on CiM accelerators. Through a formal analysis of the properties of KPP and the noise injection-based DNN training, we demonstrate that injecting a novel right-censored Gaussian noise, as opposed to the conventional Gaussian noise, significantly improves the KPP of DNNs. We further propose an automated method to determine the optimal hyperparameters for injecting this right-censored Gaussian noise during the training process. Our method achieves up to a 26% improvement in KPP compared to the state-of-the-art methods employed to enhance DNN robustness under the impact of device variations. △ Less

Submitted 28 July, 2023; originally announced July 2023.

arXiv:2307.15058 [pdf, other]

MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving

Authors: Zirui Wu, Tianyu Liu, Liyi Luo, Zhide Zhong, Jianteng Chen, Hongmin Xiao, Chao Hou, Haozhe Lou, Yuantao Chen, Runyi Yang, Yuxin Huang, Xiaoyu Ye, Zike Yan, Yongliang Shi, Yiyi Liao, Hao Zhao

Abstract: Nowadays, autonomous cars can drive smoothly in ordinary cases, and it is widely recognized that realistic sensor simulation will play a critical role in solving remaining corner cases by simulating them. To this end, we propose an autonomous driving simulator based upon neural radiance fields (NeRFs). Compared with existing works, ours has three notable features: (1) Instance-aware. Our simulator… ▽ More Nowadays, autonomous cars can drive smoothly in ordinary cases, and it is widely recognized that realistic sensor simulation will play a critical role in solving remaining corner cases by simulating them. To this end, we propose an autonomous driving simulator based upon neural radiance fields (NeRFs). Compared with existing works, ours has three notable features: (1) Instance-aware. Our simulator models the foreground instances and background environments separately with independent networks so that the static (e.g., size and appearance) and dynamic (e.g., trajectory) properties of instances can be controlled separately. (2) Modular. Our simulator allows flexible switching between different modern NeRF-related backbones, sampling strategies, input modalities, etc. We expect this modular design to boost academic progress and industrial deployment of NeRF-based autonomous driving simulation. (3) Realistic. Our simulator set new state-of-the-art photo-realism results given the best module selection. Our simulator will be open-sourced while most of our counterparts are not. Project page: https://open-air-sun.github.io/mars/. △ Less

Submitted 27 July, 2023; originally announced July 2023.

Comments: CICAI 2023, project page with code: https://open-air-sun.github.io/mars/

arXiv:2307.13891 [pdf, other]

Jet-hadron correlations with respect to the event plane in $\sqrt{s_{\mathrm{NN}}}$ = 200 GeV Au+Au collisions in STAR

Authors: STAR Collaboration, M. I. Abdulhamid, B. E. Aboona, J. Adam, L. Adamczyk, J. R. Adams, I. Aggarwal, M. M. Aggarwal, Z. Ahammed, E. C. Aschenauer, S. Aslam, J. Atchison, V. Bairathi, J. G. Ball Cap, K. Barish, R. Bellwied, P. Bhagat, A. Bhasin, S. Bhatta, S. R. Bhosale, J. Bielcik, J. Bielcikova, J. D. Brandenburg, X. Z. Cai, H. Caines , et al. (340 additional authors not shown)

Abstract: Angular distributions of charged particles relative to jet axes are studied in $\sqrt{s_{\mathrm{NN}}}$ = 200 GeV Au+Au collisions as a function of the jet orientation with respect to the event plane. This differential study tests the expected path-length dependence of energy loss experienced by a hard-scattered parton as it traverses the hot and dense medium formed in heavy-ion collisions. A seco… ▽ More Angular distributions of charged particles relative to jet axes are studied in $\sqrt{s_{\mathrm{NN}}}$ = 200 GeV Au+Au collisions as a function of the jet orientation with respect to the event plane. This differential study tests the expected path-length dependence of energy loss experienced by a hard-scattered parton as it traverses the hot and dense medium formed in heavy-ion collisions. A second-order event plane is used in the analysis as an experimental estimate of the reaction plane formed by the collision impact parameter and the beam direction. Charged-particle jets with $15 < p_{\rm T, jet} <$ 20 and $20 < p_{\rm T, jet} <$ 40 GeV/$c$ were reconstructed with the anti-$k_{\rm T}$ algorithm with radius parameter setting of (R=0.4) in the 20-50\% centrality bin to maximize the initial-state eccentricity of the interaction region. The reaction plane fit method is implemented to remove the flow-modulated background with better precision than prior methods. Yields and widths of jet-associated charged-hadron distributions are extracted in three angular bins between the jet axis and the event plane. The event-plane (EP) dependence is further quantified by ratios of the associated yields in different EP bins. No dependence on orientation of the jet axis with respect to the event plane is seen within the uncertainties in the kinematic regime studied. This finding is consistent with a similar experimental observation by ALICE in $\sqrt{s_{\mathrm{NN}}}$ = 2.76 TeV Pb+Pb collision data. △ Less

Submitted 20 March, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

arXiv:2307.13643 [pdf, other]

Backdoor Attacks against Voice Recognition Systems: A Survey

Authors: Baochen Yan, Jiahe Lan, Zheng Yan

Abstract: Voice Recognition Systems (VRSs) employ deep learning for speech recognition and speaker recognition. They have been widely deployed in various real-world applications, from intelligent voice assistance to telephony surveillance and biometric authentication. However, prior research has revealed the vulnerability of VRSs to backdoor attacks, which pose a significant threat to the security and priva… ▽ More Voice Recognition Systems (VRSs) employ deep learning for speech recognition and speaker recognition. They have been widely deployed in various real-world applications, from intelligent voice assistance to telephony surveillance and biometric authentication. However, prior research has revealed the vulnerability of VRSs to backdoor attacks, which pose a significant threat to the security and privacy of VRSs. Unfortunately, existing literature lacks a thorough review on this topic. This paper fills this research gap by conducting a comprehensive survey on backdoor attacks against VRSs. We first present an overview of VRSs and backdoor attacks, elucidating their basic knowledge. Then we propose a set of evaluation criteria to assess the performance of backdoor attack methods. Next, we present a comprehensive taxonomy of backdoor attacks against VRSs from different perspectives and analyze the characteristic of different categories. After that, we comprehensively review existing attack methods and analyze their pros and cons based on the proposed criteria. Furthermore, we review classic backdoor defense methods and generic audio defense techniques. Then we discuss the feasibility of deploying them on VRSs. Finally, we figure out several open issues and further suggest future research directions to motivate the research of VRSs security. △ Less

Submitted 22 July, 2023; originally announced July 2023.

Comments: 33 pages, 7 figures

arXiv:2307.13321 [pdf, other]

doi 10.1103/PhysRevLett.131.253603

Super-radiant and Sub-radiant Cavity Scattering by Atom Arrays

Authors: Zhenjie Yan, Jacquelyn Ho, Yue-Hui Lu, Stuart J. Masson, Ana Asenjo-Garcia, Dan M. Stamper-Kurn

Abstract: We realize collective enhancement and suppression of light scattered by an array of tweezer-trapped $^{87}$Rb atoms positioned within a strongly coupled Fabry-Pérot optical cavity. We illuminate the array with light directed transverse to the cavity axis, in the low saturation regime, and detect photons scattered into the cavity. For an array with integer-optical-wavelength spacing each atom scatt… ▽ More We realize collective enhancement and suppression of light scattered by an array of tweezer-trapped $^{87}$Rb atoms positioned within a strongly coupled Fabry-Pérot optical cavity. We illuminate the array with light directed transverse to the cavity axis, in the low saturation regime, and detect photons scattered into the cavity. For an array with integer-optical-wavelength spacing each atom scatters light into the cavity with nearly identical scattering amplitude, leading to an observed $N^2$ scaling of cavity photon number as the atom number increases stepwise from $N=1$ to $N=8$. By contrast, for an array with half-integer-wavelength spacing, destructive interference of scattering amplitudes yields a non-monotonic, sub-radiant cavity intensity versus $N$. By analyzing the polarization of light emitted from the cavity, we find that Rayleigh scattering can be collectively enhanced or suppressed with respect to Raman scattering. We observe also that atom-induced shifts and broadenings of the cavity resonance are precisely tuned by varying the atom number and positions. Altogether, tweezer arrays provide exquisite control of atomic cavity QED spanning from the single- to the many-body regime. △ Less

Submitted 8 November, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

Journal ref: Phys. Rev. Lett. 131, 253603 (2023)

arXiv:2307.13158 [pdf, other]

Multi-UAV Speed Control with Collision Avoidance and Handover-aware Cell Association: DRL with Action Branching

Authors: Zijiang Yan, Wael Jaafar, Bassant Selim, Hina Tabassum

Abstract: This paper presents a deep reinforcement learning solution for optimizing multi-UAV cell-association decisions and their moving velocity on a 3D aerial highway. The objective is to enhance transportation and communication performance, including collision avoidance, connectivity, and handovers. The problem is formulated as a Markov decision process (MDP) with UAVs' states defined by velocities and… ▽ More This paper presents a deep reinforcement learning solution for optimizing multi-UAV cell-association decisions and their moving velocity on a 3D aerial highway. The objective is to enhance transportation and communication performance, including collision avoidance, connectivity, and handovers. The problem is formulated as a Markov decision process (MDP) with UAVs' states defined by velocities and communication data rates. We propose a neural architecture with a shared decision module and multiple network branches, each dedicated to a specific action dimension in a 2D transportation-communication space. This design efficiently handles the multi-dimensional action space, allowing independence for individual action dimensions. We introduce two models, Branching Dueling Q-Network (BDQ) and Branching Dueling Double Deep Q-Network (Dueling DDQN), to demonstrate the approach. Simulation results show a significant improvement of 18.32% compared to existing benchmarks. △ Less

Submitted 21 January, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

Comments: IEEE Globecom 2023 Accepted

arXiv:2307.11462 [pdf, other]

Improve Long-term Memory Learning Through Rescaling the Error Temporally

Authors: Shida Wang, Zhanglu Yan

Abstract: This paper studies the error metric selection for long-term memory learning in sequence modelling. We examine the bias towards short-term memory in commonly used errors, including mean absolute/squared error. Our findings show that all temporally positive-weighted errors are biased towards short-term memory in learning linear functionals. To reduce this bias and improve long-term memory learning,… ▽ More This paper studies the error metric selection for long-term memory learning in sequence modelling. We examine the bias towards short-term memory in commonly used errors, including mean absolute/squared error. Our findings show that all temporally positive-weighted errors are biased towards short-term memory in learning linear functionals. To reduce this bias and improve long-term memory learning, we propose the use of a temporally rescaled error. In addition to reducing the bias towards short-term memory, this approach can also alleviate the vanishing gradient issue. We conduct numerical experiments on different long-memory tasks and sequence models to validate our claims. Numerical results confirm the importance of appropriate temporally rescaled error for effective long-term memory learning. To the best of our knowledge, this is the first work that quantitatively analyzes different errors' memory bias towards short-term memory in sequence modelling. △ Less

Submitted 21 July, 2023; originally announced July 2023.

Comments: 12 pages, 7 figures

arXiv:2307.09288 [pdf, other]

Llama 2: Open Foundation and Fine-Tuned Chat Models

Authors: Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini , et al. (43 additional authors not shown)

Abstract: In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Our fine-tuned LLMs, called Llama 2-Chat, are optimized for dialogue use cases. Our models outperform open-source chat models on most benchmarks we tested, and based on our human evaluations for helpfulness and safety, may be… ▽ More In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Our fine-tuned LLMs, called Llama 2-Chat, are optimized for dialogue use cases. Our models outperform open-source chat models on most benchmarks we tested, and based on our human evaluations for helpfulness and safety, may be a suitable substitute for closed-source models. We provide a detailed description of our approach to fine-tuning and safety improvements of Llama 2-Chat in order to enable the community to build on our work and contribute to the responsible development of LLMs. △ Less

Submitted 19 July, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

arXiv:2307.07954 [pdf, other]

A new variability pattern in GRS 1915+105 with NICER and Insight-HXMT observations

Authors: Zhihong Shi, Qingwen Wu, Zhen Yan, Bing Lyu, Hao Liu

Abstract: We explore the timing and spectral properties of GRS 1915+105 based on X-ray observations of NICER and Insight-HXMT during the long outburst from 2017 to 2021. We find a new class of variability in the rising stage of the outburst that differs from the formerly reported patterns of light curves. This new variability pattern, which we name class $ψ$, is characterized by several periodic mini pulses… ▽ More We explore the timing and spectral properties of GRS 1915+105 based on X-ray observations of NICER and Insight-HXMT during the long outburst from 2017 to 2021. We find a new class of variability in the rising stage of the outburst that differs from the formerly reported patterns of light curves. This new variability pattern, which we name class $ψ$, is characterized by several periodic mini pulses superposed on another longer periodic pulse. The periods are around $\sim$130 seconds and $\sim$10 seconds for the main pulses and mini pulses respectively based on the analysis of power spectrum density (PSD) and step-wise filter correlation (SFC), where the SFC method has an advantage in finding the superimposed periodic components. The mini pulses become weak or disappear when the luminosity increases and the light curves change into the classical class $κ$. The class $ψ$ shows a softer spectrum with lower count rates compared to the class $κ$ during the main pulse. The new class $ψ$ shows peculiar timing and spectral properties compared to those of classic class $κ$, which can help us to explore the class transition mechanism in GRS 1915+105. △ Less

Submitted 16 July, 2023; originally announced July 2023.

Comments: 10 pages, 6 figures, accepted by MNRAS

arXiv:2307.07709 [pdf, other]

doi 10.1051/0004-6361/202346309

Intermittent QPO properties of MAXI J1820+070 revealed by Insight-HXMT

Authors: P. Zhang, R. Soria, S. Zhang, L. Ji, L. D. Kong, Y. P. Chen, S. N. Zhang, Z. Chang, M. Y. Ge, J. Li, G. C. Liu, Q. Z. Liu, X. Ma, J. Q. Peng, J. L. Qu, Q. C. Shui, L. Tao, H. J. Tian, P. J. Wang, J. Z. Yan, X. Y. Zeng

Abstract: We investigate the dynamical properties of low frequency quasi-periodic oscillations (QPOs) observed from the black hole X-ray binary MAXI J1820+070 during the early part of its 2018 outburst, when the system was in a bright hard state. To this aim, we use a series of observations from the Hard X-ray Modulation Telescope Insight-HXMT, and apply a wavelet decomposition (weighted wavelet Z-transform… ▽ More We investigate the dynamical properties of low frequency quasi-periodic oscillations (QPOs) observed from the black hole X-ray binary MAXI J1820+070 during the early part of its 2018 outburst, when the system was in a bright hard state. To this aim, we use a series of observations from the Hard X-ray Modulation Telescope Insight-HXMT, and apply a wavelet decomposition (weighted wavelet Z-transforms) to the X-ray light-curve. We find that the QPO phenomenon is intermittent within each individual observation, with some sub-intervals where the oscillation is strongly detected (high root-mean-square amplitude) and others where it is weak or absent. The average life time of individual QPO segments is ~ 5 oscillation cycles, with a 3 sigma tail up to ~ 20 cycles. There is no substantial difference between the energy spectra during intervals with strong and weak/absent QPOs. We discuss two possible reasons for the intermittent QPO strength, within the precessing jet model previously proposed for MAXI J1820+070. In the rigid precession model, intermittent QPOs are predicted to occur with a coherence Q ~ a few when the disk alignment time-scale is only a few times the precession time-scale. Alternatively, we suggest that changes in oscillation amplitude can be caused by changes in the jet speed. We discuss a possible reason for the intermittent QPO strength, within the precessing jet model previously proposed for MAXI J1820+070: we suggest that changes in oscillation amplitude are caused by changes in the jet speed. We argue that a misaligned, precessing jet scenario is also consistent with other recent observational findings that suggest an oscillation of the Compton reflection component in phase with the QPOs. △ Less

Submitted 15 July, 2023; originally announced July 2023.

Comments: 8 pages, 4 figures

Journal ref: A&A 677, A178 (2023)

arXiv:2307.05689 [pdf, other]

Magnetar emergence in a peculiar gamma-ray burst from a compact star merger

Authors: H. Sun, C. -W. Wang, J. Yang, B. -B. Zhang, S. -L. Xiong, Y. -H. I. Yin, Y. Liu, Y. Li, W. -C. Xue, Z. Yan, C. Zhang, W. -J. Tan, H. -W. Pan, J. -C. Liu, H. -Q. Cheng, Y. -Q. Zhang, J. -W. Hu, C. Zheng, Z. -H. An, C. Cai, L. Hu, C. **, D. -Y. Li, X. -Q. Li, H. -Y. Liu , et al. (19 additional authors not shown)

Abstract: The central engine that powers gamma-ray bursts (GRBs), the most powerful explosions in the universe, is still not identified. Besides hyper-accreting black holes, rapidly spinning and highly magnetized neutron stars, known as millisecond magnetars, have been suggested to power both long and short GRBs. The presence of a magnetar engine following compact star mergers is of particular interest as i… ▽ More The central engine that powers gamma-ray bursts (GRBs), the most powerful explosions in the universe, is still not identified. Besides hyper-accreting black holes, rapidly spinning and highly magnetized neutron stars, known as millisecond magnetars, have been suggested to power both long and short GRBs. The presence of a magnetar engine following compact star mergers is of particular interest as it would provide essential constraints on the poorly understood equation of state for neutron stars. Indirect indications of a magnetar engine in these merger sources have been observed in the form of plateau features present in the X-ray afterglow light curves of some short GRBs. Additionally, some X-ray transients lacking gamma-ray bursts (GRB-less) have been identified as potential magnetar candidates originating from compact star mergers. Nevertheless, smoking gun evidence is still lacking for a magnetar engine in short GRBs, and the associated theoretical challenges have been addressed. Here we present a comprehensive analysis of the broad-band prompt emission data of a peculiar, very bright GRB 230307A. Despite its apparently long duration, the prompt emission and host galaxy properties point toward a compact star merger origin, being consistent with its association with a kilonova. More intriguingly, an extended X-ray emission component emerges as the $γ$-ray emission dies out, signifying the emergence of a magnetar central engine. We also identify an achromatic temporal break in the high-energy band during the prompt emission phase, which was never observed in previous bursts and reveals a narrow jet with half opening angle of approximately $3.4^\circ$. △ Less

Submitted 11 July, 2023; originally announced July 2023.

Comments: 44 pages, 10 figures, 5 tables

arXiv:2307.03597 [pdf, other]

doi 10.1093/mnras/stad2063

An apparent positive relation between spin and orbital angular momentum in X-ray binaries

Authors: Zhen Yan, Wenda Zhang, Wenfei Yu

Abstract: The origin of current angular momentum (AM) of the black hole (BH) in X-ray binary (XRB) is still unclear, which is related with the birth and/or the growth of the BH. Here we collect the spin parameters $a_{*}$ measured in BH XRBs and find an apparent bimodal distribution centered at $\sim$ 0.17 and 0.83. We find a positive relation between the spin parameter and the orbital period/orbital separa… ▽ More The origin of current angular momentum (AM) of the black hole (BH) in X-ray binary (XRB) is still unclear, which is related with the birth and/or the growth of the BH. Here we collect the spin parameters $a_{*}$ measured in BH XRBs and find an apparent bimodal distribution centered at $\sim$ 0.17 and 0.83. We find a positive relation between the spin parameter and the orbital period/orbital separation through combining distinct XRB categories, including neutron star (NS) low-mass X-ray binaries (LMXBs), Roche-lobe overflow (RLOF) BH XRBs and wind-fed BH XRBs. It seems that the AM of the compact star and the binary orbit correlates by combining the different XRB systems. These positive relations imply that accretion process is a common mechanism for spinning up the compact star in these diverse XRB systems. We infer that the low and high spin BH XRBs may experience different evolution and accretion history, which corresponds to the bimodal distribution of the BH spin parameters. The low spin BHs ($a_{*}<0.3$) are similar to the NS LMXBs, the compact star of which is spun-up by the low-level accretion, and the high spin BHs ($a_{*}>0.5$) had experienced a short hypercritical accretion ($\gg \dot{M}_\mathrm{Edd}$) period, during which, the BH spin dramatically increased. △ Less

Submitted 7 July, 2023; originally announced July 2023.

Comments: 13 pages, 7 figures, accepted for publication in MNRAS

arXiv:2307.03449 [pdf, other]

Universal Semi-supervised Model Adaptation via Collaborative Consistency Training

Authors: Zizheng Yan, Yushuang Wu, Yipeng Qin, Xiaoguang Han, Shuguang Cui, Guanbin Li

Abstract: In this paper, we introduce a realistic and challenging domain adaptation problem called Universal Semi-supervised Model Adaptation (USMA), which i) requires only a pre-trained source model, ii) allows the source and target domain to have different label sets, i.e., they share a common label set and hold their own private label set, and iii) requires only a few labeled samples in each class of the… ▽ More In this paper, we introduce a realistic and challenging domain adaptation problem called Universal Semi-supervised Model Adaptation (USMA), which i) requires only a pre-trained source model, ii) allows the source and target domain to have different label sets, i.e., they share a common label set and hold their own private label set, and iii) requires only a few labeled samples in each class of the target domain. To address USMA, we propose a collaborative consistency training framework that regularizes the prediction consistency between two models, i.e., a pre-trained source model and its variant pre-trained with target data only, and combines their complementary strengths to learn a more powerful model. The rationale of our framework stems from the observation that the source model performs better on common categories than the target-only model, while on target-private categories, the target-only model performs better. We also propose a two-perspective, i.e., sample-wise and class-wise, consistency regularization to improve the training. Experimental results demonstrate the effectiveness of our method on several benchmark datasets. △ Less

Submitted 3 November, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

arXiv:2307.01851 [pdf, other]

Boundary Flat Bands with Topological Spin Textures Protected by Sub-chiral Symmetry

Authors: Yijie Mo, Xiao-Jiao Wang, Rui Yu, Zhongbo Yan

Abstract: Chiral symmetry plays an indispensable role in topological classifications as well as in the understanding of the origin of bulk or boundary flat bands. The conventional definition of chiral symmetry refers to the existence of a constant unitary matrix anticommuting with the Hamiltonian. As a constant unitary matrix has constant eigenvectors, boundary flat bands enforced by chiral symmetry, which… ▽ More Chiral symmetry plays an indispensable role in topological classifications as well as in the understanding of the origin of bulk or boundary flat bands. The conventional definition of chiral symmetry refers to the existence of a constant unitary matrix anticommuting with the Hamiltonian. As a constant unitary matrix has constant eigenvectors, boundary flat bands enforced by chiral symmetry, which share the same eigenvectors with the chiral symmetry operator, are known to carry fixed (pseudo)spin polarizations and be featureless in quantum geometry. In this work, we generalize the chiral symmetry and introduce a concept termed sub-chiral symmetry. Unlike the conventional chiral symmetry operator defined as constant, the sub-chiral symmetry operator depends on partial components of the momentum vector, so as its eigenvectors. We show that topological gapped or gapless systems without the chiral symmetry but with the sub-chiral symmetry can support boundary flat bands, which exhibit topological spin textures and quantized Berry phases. We expect that such intriguing boundary flat bands could give rise to a variety of exotic physics in the presence of interactions or disorders. △ Less

Submitted 4 July, 2023; originally announced July 2023.

Comments: 7+4 pages, 2 figures

arXiv:2307.01426 [pdf, other]

DeepfakeBench: A Comprehensive Benchmark of Deepfake Detection

Authors: Zhiyuan Yan, Yong Zhang, Xinhang Yuan, Siwei Lyu, Baoyuan Wu

Abstract: A critical yet frequently overlooked challenge in the field of deepfake detection is the lack of a standardized, unified, comprehensive benchmark. This issue leads to unfair performance comparisons and potentially misleading results. Specifically, there is a lack of uniformity in data processing pipelines, resulting in inconsistent data inputs for detection models. Additionally, there are noticeab… ▽ More A critical yet frequently overlooked challenge in the field of deepfake detection is the lack of a standardized, unified, comprehensive benchmark. This issue leads to unfair performance comparisons and potentially misleading results. Specifically, there is a lack of uniformity in data processing pipelines, resulting in inconsistent data inputs for detection models. Additionally, there are noticeable differences in experimental settings, and evaluation strategies and metrics lack standardization. To fill this gap, we present the first comprehensive benchmark for deepfake detection, called DeepfakeBench, which offers three key contributions: 1) a unified data management system to ensure consistent input across all detectors, 2) an integrated framework for state-of-the-art methods implementation, and 3) standardized evaluation metrics and protocols to promote transparency and reproducibility. Featuring an extensible, modular-based codebase, DeepfakeBench contains 15 state-of-the-art detection methods, 9 deepfake datasets, a series of deepfake detection evaluation protocols and analysis tools, as well as comprehensive evaluations. Moreover, we provide new insights based on extensive analysis of these evaluations from various perspectives (e.g., data augmentations, backbones). We hope that our efforts could facilitate future research and foster innovation in this increasingly critical domain. All codes, evaluations, and analyses of our benchmark are publicly available at https://github.com/SCLBD/DeepfakeBench. △ Less

Submitted 28 October, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

arXiv:2307.01187 [pdf, other]

SAMAug: Point Prompt Augmentation for Segment Anything Model

Authors: Haixing Dai, Chong Ma, Zhiling Yan, Zhengliang Liu, Enze Shi, Yiwei Li, Peng Shu, Xiaozheng Wei, Lin Zhao, Zihao Wu, Fang Zeng, Dajiang Zhu, Wei Liu, Quanzheng Li, Lichao Sun, Shu Zhang Tianming Liu, Xiang Li

Abstract: This paper introduces SAMAug, a novel visual point augmentation method for the Segment Anything Model (SAM) that enhances interactive image segmentation performance. SAMAug generates augmented point prompts to provide more information about the user's intention to SAM. Starting with an initial point prompt, SAM produces an initial mask, which is then fed into our proposed SAMAug to generate augmen… ▽ More This paper introduces SAMAug, a novel visual point augmentation method for the Segment Anything Model (SAM) that enhances interactive image segmentation performance. SAMAug generates augmented point prompts to provide more information about the user's intention to SAM. Starting with an initial point prompt, SAM produces an initial mask, which is then fed into our proposed SAMAug to generate augmented point prompts. By incorporating these extra points, SAM can generate augmented segmentation masks based on both the augmented point prompts and the initial prompt, resulting in improved segmentation performance. We conducted evaluations using four different point augmentation strategies: random sampling, sampling based on maximum difference entropy, maximum distance, and saliency. Experiment results on the COCO, Fundus, COVID QUEx, and ISIC2018 datasets show that SAMAug can boost SAM's segmentation results, especially using the maximum distance and saliency. SAMAug demonstrates the potential of visual prompt augmentation for computer vision. Codes of SAMAug are available at github.com/yhydhx/SAMAug △ Less

Submitted 19 March, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

arXiv:2307.00599 [pdf, other]

RH-Map: Online Map Construction Framework of Dynamic Objects Removal Based on Region-wise Hash Map Structure

Authors: Zihong Yan, Xiaoyi Wu, Zhuozhu Jian, Bin Lan Xueqian Wang, Bin Liang

Abstract: Mobile robots navigating in outdoor environments frequently encounter the issue of undesired traces left by dynamic objects and manifested as obstacles on map, impeding robots from achieving accurate localization and effective navigation. To tackle the problem, a novel map construction framework based on 3D region-wise hash map structure (RH-Map) is proposed, consisting of front-end scan fresher a… ▽ More Mobile robots navigating in outdoor environments frequently encounter the issue of undesired traces left by dynamic objects and manifested as obstacles on map, impeding robots from achieving accurate localization and effective navigation. To tackle the problem, a novel map construction framework based on 3D region-wise hash map structure (RH-Map) is proposed, consisting of front-end scan fresher and back-end removal modules, which realizes real-time map construction and online dynamic object removal (DOR). First, a two-layer 3D region-wise hash map structure of map management is proposed for effective online DOR. Then, in scan fresher, region-wise ground plane estimation (R-GPE) is adopted for estimating and preserving ground information and Scan-to-Map Removal (S2M-R) is proposed to discriminate and remove dynamic regions. Moreover, the lightweight back-end removal module maintaining keyframes is proposed for further DOR. As experimentally verified on SemanticKITTI, our proposed framework yields promising performance on online DOR of map construction compared with the state-of-the-art methods. And we also validate the proposed framework in real-world environments. △ Less

Submitted 24 July, 2023; v1 submitted 2 July, 2023; originally announced July 2023.

arXiv:2306.16241 [pdf, other]

Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information

Authors: Jiuxin Lin, Peng Wang, Heinrich Dinkel, Jun Chen, Zhiyong Wu, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang

Abstract: Previously, Target Speaker Extraction (TSE) has yielded outstanding performance in certain application scenarios for speech enhancement and source separation. However, obtaining auxiliary speaker-related information is still challenging in noisy environments with significant reverberation. inspired by the recently proposed distance-based sound separation, we propose the near sound (NS) extractor,… ▽ More Previously, Target Speaker Extraction (TSE) has yielded outstanding performance in certain application scenarios for speech enhancement and source separation. However, obtaining auxiliary speaker-related information is still challenging in noisy environments with significant reverberation. inspired by the recently proposed distance-based sound separation, we propose the near sound (NS) extractor, which leverages distance information for TSE to reliably extract speaker information without requiring previous speaker enrolment, called speaker embedding self-enrollment (SESE). Full- & sub-band modeling is introduced to enhance our NS-Extractor's adaptability towards environments with significant reverberation. Experimental results on several cross-datasets demonstrate the effectiveness of our improvements and the excellent performance of our proposed NS-Extractor in different application scenarios. △ Less

Submitted 7 October, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

Comments: Proc. INTERSPEECH 2023, 2488-2492, doi: 10.21437/Interspeech.2023-218

arXiv:2306.16197 [pdf, other]

Multi-IMU with Online Self-Consistency for Freehand 3D Ultrasound Reconstruction

Authors: Mingyuan Luo, Xin Yang, Zhongnuo Yan, Junyu Li, Yuanji Zhang, Jiongquan Chen, Xindi Hu, Jikuan Qian, Jun Cheng, Dong Ni

Abstract: Ultrasound (US) imaging is a popular tool in clinical diagnosis, offering safety, repeatability, and real-time capabilities. Freehand 3D US is a technique that provides a deeper understanding of scanned regions without increasing complexity. However, estimating elevation displacement and accumulation error remains challenging, making it difficult to infer the relative position using images alone.… ▽ More Ultrasound (US) imaging is a popular tool in clinical diagnosis, offering safety, repeatability, and real-time capabilities. Freehand 3D US is a technique that provides a deeper understanding of scanned regions without increasing complexity. However, estimating elevation displacement and accumulation error remains challenging, making it difficult to infer the relative position using images alone. The addition of external lightweight sensors has been proposed to enhance reconstruction performance without adding complexity, which has been shown to be beneficial. We propose a novel online self-consistency network (OSCNet) using multiple inertial measurement units (IMUs) to improve reconstruction performance. OSCNet utilizes a modal-level self-supervised strategy to fuse multiple IMU information and reduce differences between reconstruction results obtained from each IMU data. Additionally, a sequence-level self-consistency strategy is proposed to improve the hierarchical consistency of prediction results among the scanning sequence and its sub-sequences. Experiments on large-scale arm and carotid datasets with multiple scanning tactics demonstrate that our OSCNet outperforms previous methods, achieving state-of-the-art reconstruction performance. △ Less

Submitted 18 July, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

Comments: Accepted by MICCAI-2023

arXiv:2306.15430 [pdf, other]

KnowPrefix-Tuning: A Two-Stage Prefix-Tuning Framework for Knowledge-Grounded Dialogue Generation

Authors: Jiaqi Bai, Zhao Yan, Jian Yang, Xinnian Liang, Hongcheng Guo, Zhoujun Li

Abstract: Existing knowledge-grounded conversation systems generate responses typically in a retrieve-then-generate manner. They require a large knowledge base and a strong knowledge retrieval component, which is time- and resource-consuming. In this paper, we address the challenge by leveraging the inherent knowledge encoded in the pre-trained language models (PLMs). We propose Knowledgeable Prefix Tuning… ▽ More Existing knowledge-grounded conversation systems generate responses typically in a retrieve-then-generate manner. They require a large knowledge base and a strong knowledge retrieval component, which is time- and resource-consuming. In this paper, we address the challenge by leveraging the inherent knowledge encoded in the pre-trained language models (PLMs). We propose Knowledgeable Prefix Tuning (KnowPrefix-Tuning), a two-stage tuning framework, bypassing the retrieval process in a knowledge-grounded conversation system by injecting prior knowledge into the lightweight knowledge prefix. The knowledge prefix is a sequence of continuous knowledge-specific vectors that can be learned during training. In addition, we propose a novel interactive re-parameterization mechanism that allows the prefix to interact fully with the PLM during the optimization of response generation. Experimental results demonstrate that KnowPrefix-Tuning outperforms fine-tuning and other lightweight tuning approaches, and performs comparably with strong retrieval-based baselines while being $3\times$ faster during inference. △ Less

Submitted 27 June, 2023; originally announced June 2023.

Comments: Accepted by ECML-PKDD 2023 (Research Track)

arXiv:2306.14538 [pdf, other]

Learnable Differencing Center for Nighttime Depth Perception

Authors: Zhiqiang Yan, Yupeng Zheng, Chongyi Li, Jun Li, Jian Yang

Abstract: Depth completion is the task of recovering dense depth maps from sparse ones, usually with the help of color images. Existing image-guided methods perform well on daytime depth perception self-driving benchmarks, but struggle in nighttime scenarios with poor visibility and complex illumination. To address these challenges, we propose a simple yet effective framework called LDCNet. Our key idea is… ▽ More Depth completion is the task of recovering dense depth maps from sparse ones, usually with the help of color images. Existing image-guided methods perform well on daytime depth perception self-driving benchmarks, but struggle in nighttime scenarios with poor visibility and complex illumination. To address these challenges, we propose a simple yet effective framework called LDCNet. Our key idea is to use Recurrent Inter-Convolution Differencing (RICD) and Illumination-Affinitive Intra-Convolution Differencing (IAICD) to enhance the nighttime color images and reduce the negative effects of the varying illumination, respectively. RICD explicitly estimates global illumination by differencing two convolutions with different kernels, treating the small-kernel-convolution feature as the center of the large-kernel-convolution feature in a new perspective. IAICD softly alleviates local relative light intensity by differencing a single convolution, where the center is dynamically aggregated based on neighboring pixels and the estimated illumination map in RICD. On both nighttime depth completion and depth estimation tasks, extensive experiments demonstrate the effectiveness of our LDCNet, reaching the state of the art. △ Less

Submitted 4 September, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

Comments: 8 pages

arXiv:2306.14170 [pdf, other]

AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker Extraction

Authors: Jiuxin Lin, Xinyu Cai, Heinrich Dinkel, Jun Chen, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Zhiyong Wu, Yujun Wang, Helen Meng

Abstract: Visual information can serve as an effective cue for target speaker extraction (TSE) and is vital to improving extraction performance. In this paper, we propose AV-SepFormer, a SepFormer-based attention dual-scale model that utilizes cross- and self-attention to fuse and model features from audio and visual. AV-SepFormer splits the audio feature into a number of chunks, equivalent to the length of… ▽ More Visual information can serve as an effective cue for target speaker extraction (TSE) and is vital to improving extraction performance. In this paper, we propose AV-SepFormer, a SepFormer-based attention dual-scale model that utilizes cross- and self-attention to fuse and model features from audio and visual. AV-SepFormer splits the audio feature into a number of chunks, equivalent to the length of the visual feature. Then self- and cross-attention are employed to model and fuse the multi-modal features. Furthermore, we use a novel 2D positional encoding, that introduces the positional information between and within chunks and provides significant gains over the traditional positional encoding. Our model has two key advantages: the time granularity of audio chunked feature is synchronized to the visual feature, which alleviates the harm caused by the inconsistency of audio and video sampling rate; by combining self- and cross-attention, feature fusion and speech extraction processes are unified within an attention paradigm. The experimental results show that AV-SepFormer significantly outperforms other existing methods. △ Less

Submitted 25 June, 2023; originally announced June 2023.

Comments: Accepted by ICASSP2023

arXiv:2306.13339 [pdf, other]

doi 10.1109/TDSC.2024.3353548

TrustGuard: GNN-based Robust and Explainable Trust Evaluation with Dynamicity Support

Authors: Jie Wang, Zheng Yan, Jiahe Lan, Elisa Bertino, Witold Pedrycz

Abstract: Trust evaluation assesses trust relationships between entities and facilitates decision-making. Machine Learning (ML) shows great potential for trust evaluation owing to its learning capabilities. In recent years, Graph Neural Networks (GNNs), as a new ML paradigm, have demonstrated superiority in dealing with graph data. This has motivated researchers to explore their use in trust evaluation, as… ▽ More Trust evaluation assesses trust relationships between entities and facilitates decision-making. Machine Learning (ML) shows great potential for trust evaluation owing to its learning capabilities. In recent years, Graph Neural Networks (GNNs), as a new ML paradigm, have demonstrated superiority in dealing with graph data. This has motivated researchers to explore their use in trust evaluation, as trust relationships among entities can be modeled as a graph. However, current trust evaluation methods that employ GNNs fail to fully satisfy the dynamic nature of trust, overlook the adverse effects of trust-related attacks, and cannot provide convincing explanations on evaluation results. To address these problems, we propose TrustGuard, a GNN-based accurate trust evaluation model that supports trust dynamicity, is robust against typical attacks, and provides explanations through visualization. Specifically, TrustGuard is designed with a layered architecture that contains a snapshot input layer, a spatial aggregation layer, a temporal aggregation layer, and a prediction layer. Among them, the spatial aggregation layer adopts a defense mechanism to robustly aggregate local trust, and the temporal aggregation layer applies an attention mechanism for effective learning of temporal patterns. Extensive experiments on two real-world datasets show that TrustGuard outperforms state-of-the-art GNN-based trust evaluation models with respect to trust prediction across single-timeslot and multi-timeslot, even in the presence of attacks. In addition, TrustGuard can explain its evaluation results by visualizing both spatial and temporal views. △ Less

Submitted 4 February, 2024; v1 submitted 23 June, 2023; originally announced June 2023.

Comments: Accepted by IEEE TDSC. Code: https://github.com/Jieerbobo/TrustGuard

arXiv:2306.13016 [pdf]

Axion Insulator State in Hundred-Nanometer-Thick Magnetic Topological Insulator Sandwich Heterostructures

Authors: Deyi Zhuo, Zi-Jie Yan, Zi-Ting Sun, Ling-Jie Zhou, Yi-Fan Zhao, Ruoxi Zhang, Ruobing Mei, Hemian Yi, Ke Wang, Moses H. W. Chan, Chao-Xing Liu, K. T. Law, Cui-Zu Chang

Abstract: An axion insulator is a three-dimensional (3D) topological insulator (TI), in which the bulk maintains the time-reversal symmetry or inversion symmetry but the surface states are gapped by surface magnetization. The axion insulator state has been observed in molecular beam epitaxy (MBE)-grown magnetically doped TI sandwiches and exfoliated intrinsic magnetic TI MnBi2Te4 flakes with an even number… ▽ More An axion insulator is a three-dimensional (3D) topological insulator (TI), in which the bulk maintains the time-reversal symmetry or inversion symmetry but the surface states are gapped by surface magnetization. The axion insulator state has been observed in molecular beam epitaxy (MBE)-grown magnetically doped TI sandwiches and exfoliated intrinsic magnetic TI MnBi2Te4 flakes with an even number layer. All these samples have a thickness of ~10 nm, near the 2D-to-3D boundary. The coupling between the top and bottom surface states in thin samples may hinder the observation of quantized topological magnetoelectric response. Here, we employ MBE to synthesize magnetic TI sandwich heterostructures and find that the axion insulator state persists in a 3D sample with a thickness of ~106 nm. Our transport results show that the axion insulator state starts to emerge when the thickness of the middle undoped TI layer is greater than ~3 nm. The 3D hundred-nanometer-thick axion insulator provides a promising platform for the exploration of the topological magnetoelectric effect and other emergent magnetic topological states, such as the high-order TI phase. △ Less

Submitted 3 October, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

Comments: 25 pages, 5 figures. Comments are very much welcome

arXiv:2306.09760 [pdf]

A filtered embedded weighted compact nonlinear scheme for hyperbolic conservation law

Authors: Xuan Liu, Yaobing Min, **sheng Cai, Yankai Ma, Zhenguo Yan

Abstract: In situations where a wide range of flow scales are involved, the nonlinear scheme used should be capable of both shock capturing and low-dissipation.Most of the existing WCNS schemes are too dissipative because the weights deviate from ideal weights in the smooth regions caused by small-scale fluctuations. Moreover, due to the defect of the weighting strategy, the two smooth stencils located on t… ▽ More In situations where a wide range of flow scales are involved, the nonlinear scheme used should be capable of both shock capturing and low-dissipation.Most of the existing WCNS schemes are too dissipative because the weights deviate from ideal weights in the smooth regions caused by small-scale fluctuations. Moreover, due to the defect of the weighting strategy, the two smooth stencils located on the same side of a discontinuity cannot achieve fourth-order when the discontinuity only crosses S0 or S2. In this paper, we proposed the filtered embedded WCNS scheme which is applicable for complex flow simulations involving both shock and small-scale features. In order to overcome the above deficiency of existing WCNS scheme, a new map** function is proposed to filter the weights deviation out which can map the weights to ideal weights in smooth region. Meanwhile, the embedded process also implemented by this function which is utilized to improve the resolution of shock capturing in certain discontinuity distributions. The approximate-dispersion-relation analysis indicates that the scheme with the map** function we proposed has lower dispersion error and numerical dissipation as compared to the WCNS-JS and WCNS-Z schemes. The improved performance is demonstrated by the simulation of linear advection problem and nonlinear hyperbolic conservation laws. △ Less

Submitted 16 June, 2023; originally announced June 2023.

arXiv:2306.08043 [pdf, other]

Circuit QED detection of induced two-fold anisotropic pairing in a hybrid superconductor-ferromagnet bilayer

Authors: C. G. L. Bøttcher, N. R. Poniatowski, A. Grankin, M. E. Wesson, Z. Yan, U. Vool, V. M. Galitski, A. Yacoby

Abstract: Hybrid systems represent one of the frontiers in the study of unconventional superconductivity and are a promising platform to realize topological superconducting states. Owing to their mesoscopic dimensions, these materials are challenging to probe using many conventional measurement techniques, and require new experimental probes to successfully characterize. In this work, we develop a probe tha… ▽ More Hybrid systems represent one of the frontiers in the study of unconventional superconductivity and are a promising platform to realize topological superconducting states. Owing to their mesoscopic dimensions, these materials are challenging to probe using many conventional measurement techniques, and require new experimental probes to successfully characterize. In this work, we develop a probe that enables us to measure the superfluid density of micron-size superconductors using microwave techniques drawn from circuit quantum electrodynamics (cQED). We apply this technique to a paradigmatic hybrid system, the superconductor/ferromagnet bilayer, and find that the proximity-induced superfluid density is two-fold anisotropic within the plane of the sample and exhibits power law temperature-scaling which is indicative of a nodal superconducting state. These experimental results are consistent with the theoretically predicted signatures of induced triplet pairing with a nodal $p$-wave order parameter. Moreover, we unexpectedly observe drastic modifications to the microwave response at frequencies near the ferromagnetic resonance, suggesting a coupling between the spin dynamics and induced superconducting order in the ferromagnetic layer. Our results offer new insights into the unconventional superconducting states induced in superconductor/ferromagnet heterostructures and simultaneously establish a new avenue for the study of fragile unconventional superconductivity in low-dimensional materials such as van der Waals heterostructures. △ Less

Submitted 13 June, 2023; originally announced June 2023.

Comments: 7 main text pages + 4 supplemental text pages, 4 main text figures + 4 supplemental figures

arXiv:2306.08009 [pdf, other]

DHBE: Data-free Holistic Backdoor Erasing in Deep Neural Networks via Restricted Adversarial Distillation

Authors: Zhicong Yan, Shenghong Li, Ruijie Zhao, Yuan Tian, Yuanyuan Zhao

Abstract: Backdoor attacks have emerged as an urgent threat to Deep Neural Networks (DNNs), where victim DNNs are furtively implanted with malicious neurons that could be triggered by the adversary. To defend against backdoor attacks, many works establish a staged pipeline to remove backdoors from victim DNNs: inspecting, locating, and erasing. However, in a scenario where a few clean data can be accessible… ▽ More Backdoor attacks have emerged as an urgent threat to Deep Neural Networks (DNNs), where victim DNNs are furtively implanted with malicious neurons that could be triggered by the adversary. To defend against backdoor attacks, many works establish a staged pipeline to remove backdoors from victim DNNs: inspecting, locating, and erasing. However, in a scenario where a few clean data can be accessible, such pipeline is fragile and cannot erase backdoors completely without sacrificing model accuracy. To address this issue, in this paper, we propose a novel data-free holistic backdoor erasing (DHBE) framework. Instead of the staged pipeline, the DHBE treats the backdoor erasing task as a unified adversarial procedure, which seeks equilibrium between two different competing processes: distillation and backdoor regularization. In distillation, the backdoored DNN is distilled into a proxy model, transferring its knowledge about clean data, yet backdoors are simultaneously transferred. In backdoor regularization, the proxy model is holistically regularized to prevent from infecting any possible backdoor transferred from distillation. These two processes jointly proceed with data-free adversarial optimization until a clean, high-accuracy proxy model is obtained. With the novel adversarial design, our framework demonstrates its superiority in three aspects: 1) minimal detriment to model accuracy, 2) high tolerance for hyperparameters, and 3) no demand for clean data. Extensive experiments on various backdoor attacks and datasets are performed to verify the effectiveness of the proposed framework. Code is available at \url{https://github.com/yanzhicong/DHBE} △ Less

Submitted 13 June, 2023; originally announced June 2023.

Comments: It has been accepted by asiaccs

arXiv:2306.06923 [pdf, other]

On the Viability of using LLMs for SW/HW Co-Design: An Example in Designing CiM DNN Accelerators

Authors: Zheyu Yan, Yifan Qin, Xiaobo Sharon Hu, Yiyu Shi

Abstract: Deep Neural Networks (DNNs) have demonstrated impressive performance across a wide range of tasks. However, deploying DNNs on edge devices poses significant challenges due to stringent power and computational budgets. An effective solution to this issue is software-hardware (SW-HW) co-design, which allows for the tailored creation of DNN models and hardware architectures that optimally utilize ava… ▽ More Deep Neural Networks (DNNs) have demonstrated impressive performance across a wide range of tasks. However, deploying DNNs on edge devices poses significant challenges due to stringent power and computational budgets. An effective solution to this issue is software-hardware (SW-HW) co-design, which allows for the tailored creation of DNN models and hardware architectures that optimally utilize available resources. However, SW-HW co-design traditionally suffers from slow optimization speeds because their optimizers do not make use of heuristic knowledge, also known as the ``cold start'' problem. In this study, we present a novel approach that leverages Large Language Models (LLMs) to address this issue. By utilizing the abundant knowledge of pre-trained LLMs in the co-design optimization process, we effectively bypass the cold start problem, substantially accelerating the design process. The proposed method achieves a significant speedup of 25x. This advancement paves the way for the rapid and efficient deployment of DNNs on edge devices. △ Less

Submitted 12 June, 2023; originally announced June 2023.

arXiv:2306.06425 [pdf, other]

An Adaptive Hybrid Channel Reservation Medium Access Control Protocol for Differentiated QoS

Authors: Ze Liu, Bo Li, Mao Yang, Zhongjiang Yan, Xichan Liu

Abstract: In a densely deployed distributed wireless network, there may be various types of traffic with differentiated Quality of Service (QoS) requirements. However, when the network is heavily loaded, the collision increases significantly, making it difficult to guarantee the QoS of traffic. Designing an efficient Medium Access Control (MAC) protocol to guarantee the QoS of different types of traffic is… ▽ More In a densely deployed distributed wireless network, there may be various types of traffic with differentiated Quality of Service (QoS) requirements. However, when the network is heavily loaded, the collision increases significantly, making it difficult to guarantee the QoS of traffic. Designing an efficient Medium Access Control (MAC) protocol to guarantee the QoS of different types of traffic is an essential research direction. Channel reservation mechanism is a promising approach to improving QoS. However, few studies have focused on the channel reservation mechanism for differentiated traffic. It is difficult to take into account both the QoS of real-time traffic and the collision issue for ordinary traffic. To address this issue, this paper proposes the Differentiated Service Guarantee Adaptive Reservation Mechanism (DSGARM) protocol. A hybrid reservation mechanism is proposed by combining the absolute reservation mechanism and the relative reservation mechanism. The absolute reservation mechanism is adopted for real-time traffic. Meanwhile, the relative reservation mechanism is adopted for ordinary traffic. An adaptive algorithm is proposed to calculate the reservation parameters that meet the delay requirements based on the network conditions. The proposed work can be widely applied in the densely deployed distributed wireless network with differentiated QoS requirements. In addition, this paper establishes a mathematical model for the proposed mechanism and theoretically analyzes the performance. Simulations verify that the mathematical model provides a good approximation of the protocol performance and demonstrates the advantages of the proposed protocol. △ Less

Submitted 10 June, 2023; originally announced June 2023.

Comments: 14 pages, 11 figures

arXiv:2306.06300

NERFBK: A High-Quality Benchmark for NERF-Based 3D Reconstruction

Authors: Ali Karami, Simone Rigon, Gabriele Mazzacca, Ziyang Yan, Fabio Remondino

Abstract: This paper introduces a new real and synthetic dataset called NeRFBK specifically designed for testing and comparing NeRF-based 3D reconstruction algorithms. High-quality 3D reconstruction has significant potential in various fields, and advancements in image-based algorithms make it essential to evaluate new advanced techniques. However, gathering diverse data with precise ground truth is challen… ▽ More This paper introduces a new real and synthetic dataset called NeRFBK specifically designed for testing and comparing NeRF-based 3D reconstruction algorithms. High-quality 3D reconstruction has significant potential in various fields, and advancements in image-based algorithms make it essential to evaluate new advanced techniques. However, gathering diverse data with precise ground truth is challenging and may not encompass all relevant applications. The NeRFBK dataset addresses this issue by providing multi-scale, indoor and outdoor datasets with high-resolution images and videos and camera parameters for testing and comparing NeRF-based algorithms. This paper presents the design and creation of the NeRFBK benchmark, various examples and application scenarios, and highlights its potential for advancing the field of 3D reconstruction. △ Less

Submitted 15 June, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

Comments: paper result has problem

arXiv:2306.05145 [pdf, other]

Variable Radiance Field for Real-Life Category-Specifc Reconstruction from Single Image

Authors: Kun Wang, Zhiqiang Yan, Zhenyu Zhang, Xiang Li, Jun Li, Jian Yang

Abstract: Reconstructing category-specific objects from a single image is a challenging task that requires inferring the geometry and appearance of an object from a limited viewpoint. Existing methods typically rely on local feature retrieval based on re-projection with known camera intrinsic, which are slow and prone to distortion at viewpoints distant from the input image. In this paper, we present Variab… ▽ More Reconstructing category-specific objects from a single image is a challenging task that requires inferring the geometry and appearance of an object from a limited viewpoint. Existing methods typically rely on local feature retrieval based on re-projection with known camera intrinsic, which are slow and prone to distortion at viewpoints distant from the input image. In this paper, we present Variable Radiance Field (VRF), a novel framework that can efficiently reconstruct category-specific objects from a single image without known camera parameters. Our key contributions are: (1) We parameterize the geometry and appearance of the object using a multi-scale global feature extractor, which avoids frequent point-wise feature retrieval and camera dependency. We also propose a contrastive learning-based pretraining strategy to improve the feature extractor. (2) We reduce the geometric complexity of the object by learning a category template, and use hypernetworks to generate a small neural radiance field for fast and instance-specific rendering. (3) We align each training instance to the template space using a learned similarity transformation, which enables semantic-consistent learning across different objects. We evaluate our method on the CO3D dataset and show that it outperforms existing methods in terms of quality and speed. We also demonstrate its applicability to shape interpolation and object placement tasks. △ Less

Submitted 8 June, 2023; originally announced June 2023.

arXiv:2306.04741 [pdf, other]

doi 10.1007/JHEP12(2023)022

Non-Lorentzian IIB Supergravity from a Polynomial Realization of SL(2,R)

Authors: Eric Bergshoeff, Kevin T. Grosvenor, Johannes Lahnsteiner, Ziqi Yan, Utku Zorba

Abstract: We derive the action and symmetries of the bosonic sector of non-Lorentzian IIB supergravity by taking the non-relativistic string limit. We find that the bosonic field content is extended by a Lagrange multiplier that implements a restriction on the Ramond-Ramond fluxes. We show that the SL(2,R) transformation rules of non-Lorentzian IIB supergravity form a novel, nonlinear polynomial realization… ▽ More We derive the action and symmetries of the bosonic sector of non-Lorentzian IIB supergravity by taking the non-relativistic string limit. We find that the bosonic field content is extended by a Lagrange multiplier that implements a restriction on the Ramond-Ramond fluxes. We show that the SL(2,R) transformation rules of non-Lorentzian IIB supergravity form a novel, nonlinear polynomial realization. Using classical invariant theory of polynomial equations and binary forms, we will develop a general formalism describing the polynomial realization of SL(2,R) and apply it to the special case of non-Lorentzian IIB supergravity. Using the same formalism, we classify all the relevant SL(2,R) invariants. Invoking other bosonic symmetries, such as the local boost and dilatation symmetry, we show how the bosonic part of the non-Lorentzian IIB supergravity action is formed uniquely from these SL(2,R) invariants. This work also points towards the concept of a non-Lorentzian bootstrap, where bosonic symmetries in non-Lorentzian supergravity are used to bootstrap the bosonic dynamics in Lorentzian supergravity, without considering the fermions. △ Less

Submitted 11 December, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

Comments: 45 pages; v3: published version, typos corrected, references updated

Report number: NORDITA 2023-030

Journal ref: JHEP 12 (2023) 022

Showing 201–250 of 1,205 results for author: Yan, Z