Search | arXiv e-print repository

arXiv:2208.12074 [pdf, other]

doi 10.1063/5.0144147

Ionization Induced by the Ponderomotive Force in Intense and High-Frequency Laser Fields

Authors: Mingyu Zhu, Yuxiang Liu, Chunli Wei, Hongcheng Ni, Qi Wei

Abstract: Atomic stabilization is a universal phenomenon that occurs when atoms interact with intense and high-frequency laser fields. In this work, we systematically study the influence of the ponderomotive (PM) force, present around the laser focus, on atomic stabilization. We show that the PM force could induce tunneling and even over-barrier ionization in otherwise stabilized atoms. Such an effect may outweigh the typical multiphoton ionization under moderate laser intensities. Our work highlights the importance of an improved treatment of atomic stabilization that includes the influence of the PM force.

Submitted 5 May, 2023; v1 submitted 25 August, 2022; originally announced August 2022.

Journal ref: J. Chem. Phys. 158, 164306 (2023)

arXiv:2208.10912 [pdf, other]

Learning Instrumental Variable from Data Fusion for Treatment Effect Estimation

Authors: Anpeng Wu, Kun Kuang, Ruoxuan Xiong, Minqing Zhu, Yuxuan Liu, Bo Li, Furui Liu, Zhihua Wang, Fei Wu

Abstract: The advent of the big data era brought new opportunities and challenges for estimating treatment effects in data fusion, that is, a mixed dataset collected from multiple sources (each source with an independent treatment assignment mechanism). Due to possibly omitted source labels and unmeasured confounders, traditional methods cannot estimate individual treatment assignment probability and infer treatment effect effectively. Therefore, we propose to reconstruct the source label and model it as a Group Instrumental Variable (GIV) to implement IV-based Regression for treatment effect estimation. In this paper, we conceptualize this line of thought and develop a unified framework (Meta-EM) to (1) map the raw data into a representation space to construct Linear Mixed Models for the assigned treatment variable; (2) estimate the distribution differences and model the GIV for the different treatment assignment mechanisms; and (3) adopt an alternating training strategy to iteratively optimize the representations and the joint distribution to model GIV for IV regression. Empirical results demonstrate the advantages of our Meta-EM compared with state-of-the-art methods.

Submitted 7 December, 2022; v1 submitted 23 August, 2022; originally announced August 2022.

arXiv:2208.09735 [pdf, other]

How a Small Amount of Data Sharing Benefits Distributed Optimization and Learning

Authors: Mingxi Zhu, Yinyu Ye

Abstract: Distributed optimization algorithms have been widely used in machine learning. While those algorithms have merits in parallel processing and protecting data security, they often suffer from slow convergence. This paper focuses on how a small amount of data sharing could benefit distributed optimization and learning. Specifically, we examine higher-order optimization algorithms including distributed multi-block alternating direction method of multipliers (ADMM) and preconditioned conjugate gradient method (PCG). The contribution of this paper is threefold. First, in theory, we answer when and why distributed optimization algorithms are slow by identifying the worst data structure. Surprisingly, while the PCG algorithm converges slowly under heterogeneous data structure, for distributed ADMM, data homogeneity leads to the worst performance. This result challenges the common belief that data heterogeneity hurts convergence, highlighting the need for a universal approach to altering data structure for different algorithms. Second, in practice, we propose a meta-algorithm of data sharing, with its tailored applications in multi-block ADMM and PCG methods. By only sharing a small amount of prefixed data (e.g. 1%), our algorithms provide good quality estimators in different machine learning tasks in far fewer iterations, while purely distributed optimization algorithms may take hundreds of times more iterations to converge. Finally, in philosophy, we argue that even minimal collaboration can have huge synergy, which is a concept that extends beyond the realm of optimization analysis. We hope that the discovery resulting from this paper would encourage even a small amount of data sharing among different regions to combat difficult global learning problems.

Submitted 2 January, 2024; v1 submitted 20 August, 2022; originally announced August 2022.

MSC Class: 90C06 (Primary); 90C25; 68U04 (Secondary)

arXiv:2208.08848 [pdf, other]

A Two-stream Convolutional Network for Musculoskeletal and Neurological Disorders Prediction

Authors: Manli Zhu, Qianhui Men, Edmond S. L. Ho, Howard Leung, Hubert P. H. Shum

Abstract: Musculoskeletal and neurological disorders are the most common causes of walking problems among older people, and they often lead to diminished quality of life. Analyzing walking motion data manually requires trained professionals and the evaluations may not always be objective. To facilitate early diagnosis, recent deep learning-based methods have shown promising results for automated analysis, which can discover patterns that have not been found in traditional machine learning methods. We observe that existing work mostly applies deep learning on individual joint features such as the time series of joint positions. Due to the challenge of discovering inter-joint features such as the distance between feet (i.e. the stride width) from generally smaller-scale medical datasets, these methods usually perform sub-optimally. As a result, we propose a solution that explicitly takes both individual joint features and inter-joint features as input, relieving the system from the need of discovering more complicated features from small data. Due to the distinctive nature of the two types of features, we introduce a two-stream framework, with one stream learning from the time series of joint position and the other from the time series of relative joint displacement. We further develop a mid-layer fusion module to combine the discovered patterns in these two streams for diagnosis, which results in a complementary representation of the data for better prediction performance. We validate our system with a benchmark dataset of 3D skeleton motion that involves 45 patients with musculoskeletal and neurological disorders, and achieve a prediction accuracy of 95.56%, outperforming state-of-the-art methods.

Submitted 18 August, 2022; originally announced August 2022.

Comments: Journal of Medical Systems

arXiv:2208.05037 [pdf]

Characterization of 250 MeV protons from Varian ProBeam pencil beam scanning system for FLASH radiation therapy

Authors: Serdar Charyyev, Chih-Wei Chang, Mingyao Zhu, Liyong Lin, Katja Langen, Anees Dhabaan

Abstract: Recently, shoot-through proton FLASH has been proposed where the highest energy is extracted from the cyclotron to maximize the dose rate (DR). Even though our proton pencil beam scanning system can deliver 250 MeV (the highest energy), it is not typical to use 250 MeV protons for routine clinical treatments and as such 250 MeV may not have been characterized in the commissioning. In this study, we aim to characterize 250 MeV protons from Varian ProBeam system for FLASH RT as well as assess the ability of clinical monitoring ionization chamber (MIC) for FLASH-readiness. We measured data needed for beam commissioning: integral depth dose (IDD) curve, spot sigma, and absolute dose calibration. To evaluate MIC, we measured output as a function of beam current. To characterize a 250 MeV FLASH beam, we measured: (1) central axis DR as a function of current and spot spacing and arrangement, (2) for a fixed spot spacing, the maximum field size that still achieves FLASH DR (i.e., > 40 Gy/s), (3) DR reproducibility. All FLASH DR measurements were performed using ion chamber for the absolute dose and irradiation times were obtained from log files. We verified dose measurements using EBT-XD films and irradiation times using a fast, pixelated spectral detector. R90 and R80 from IDD were 37.58 and 37.69 cm, and spot sigma at isocenter were σx=3.336 and σy=3.332 mm, respectively. The absolute dose output was measured as 0.377 GyE*mm2/MU for the commissioning conditions. Output was stable for beam currents up to 15 nA, and it gradually increased to 12-fold for 115 nA. DR depended on beam current, spot spacing and arrangement and could be reproduced within 4.2% variations. Even though FLASH was achieved and the largest field size that delivers FLASH DR was determined as 35x35 mm2, current MIC has DR dependence and users should measure DR each time for their FLASH applications.

Submitted 25 January, 2023; v1 submitted 9 August, 2022; originally announced August 2022.

Comments: 11 pages, 6 figures

arXiv:2208.04586 [pdf]

Single crystal synthesis and low-lying electronic structure of V$_3$S$_4$

Authors: Yu-Jie Hao, Ming-Yuan Zhu, Xiao-Ming Ma, Chengcheng Zhang, Hongtao Rong, Qi Jiang, Yichen Yang, Zhicheng Jiang, Xiang-Rui Liu, Yupeng Zhu, Meng Zeng, Ruie Lu, Tianhao Shao, Xin Liu, Hu Xu, Zhengtai Liu, Mao Ye, Dawei Shen, Chaoyu Chen, Chang Liu

Abstract: We report successful growth of millimeter-sized high quality single crystals of V$_3$S$_4$, a candidate topological semimetal belonging to a low-symmetry space group and consisting of only low atomic number elements. Using density functional theory calculations and angle-resolved photoemission spectroscopy, we show that the nonmagnetic phase of monoclinic V$_3$S$_4$ hosts type-II Dirac-like quasiparticles which open a sizable gap due to spin orbit coupling, as well as multiple theoretical nodal lines that are also eliminated by spin orbit coupling. These results suggest that relativistic effects give rise to profound modifications of the topological properties even in compounds with low-weight elements.

Submitted 25 March, 2023; v1 submitted 9 August, 2022; originally announced August 2022.

Comments: 19 Pages, 3 Figures. To be published in Journal of Alloys and Compounds

arXiv:2208.02955 [pdf, ps, other]

ZLPR: A Novel Loss for Multi-label Classification

Authors: Jianlin Su, Mingren Zhu, Ahmed Murtadha, Shengfeng Pan, Bo Wen, Yunfeng Liu

Abstract: In the era of deep learning, loss functions determine the range of tasks available to models and algorithms. To support the application of deep learning in multi-label classification (MLC) tasks, we propose the ZLPR (zero-bounded log-sum-exp & pairwise rank-based) loss in this paper. Compared to other rank-based losses for MLC, ZLPR can handle problems in which the number of target labels is uncertain, which, from this point of view, makes it as capable as the other two strategies often used in MLC, namely the binary relevance (BR) and the label powerset (LP). Additionally, ZLPR takes the correlation between labels into consideration, which makes it more comprehensive than the BR methods. In terms of computational complexity, ZLPR can compete with the BR methods because its prediction is also label-independent, which makes it take less time and memory than the LP methods. Our experiments demonstrate the effectiveness of ZLPR on multiple benchmark datasets and multiple evaluation metrics. Moreover, we propose the soft version and the corresponding KL-divergence calculation method of ZLPR, which makes it possible to apply some regularization tricks such as label smoothing to enhance the generalization of models.

Submitted 4 August, 2022; originally announced August 2022.
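
The abstract above names a zero-bounded log-sum-exp loss but does not spell out the formula. The following is a minimal sketch under an assumed form, log(1 + Σ exp(-s_i)) over target labels plus log(1 + Σ exp(s_j)) over non-target labels, with 0 as the decision threshold. The function names `zlpr_loss` and `predict` are hypothetical and not taken from the paper.

```python
import math


def _log1p_sum_exp(values):
    """Return log(1 + sum(exp(v))) computed stably.

    Including 0.0 in the list supplies the "+1" term (exp(0) == 1),
    which is what bounds the scores against zero.
    """
    vals = [0.0] + list(values)
    m = max(vals)
    return m + math.log(sum(math.exp(v - m) for v in vals))


def zlpr_loss(scores, targets):
    """Assumed ZLPR form for one sample.

    scores:  raw per-label scores (floats, no sigmoid applied)
    targets: 0/1 flags, 1 marking a target label
    """
    pos = [-s for s, t in zip(scores, targets) if t == 1]  # push target scores above 0
    neg = [s for s, t in zip(scores, targets) if t == 0]   # push other scores below 0
    return _log1p_sum_exp(pos) + _log1p_sum_exp(neg)


def predict(scores):
    """Label-independent prediction: keep every label scoring above 0."""
    return [1 if s > 0 else 0 for s in scores]
```

Because the threshold is fixed at zero, the number of predicted labels is not set in advance, which matches the abstract's point about an uncertain number of target labels.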

arXiv:2208.01190 [pdf, ps, other]

Toward 6G TK$μ$ Extreme Connectivity: Architecture, Key Technologies and Experiments

Authors: Xiaohu You, Yongming Huang, Shengheng Liu, Dongming Wang, Junchao Ma, Chuan Zhang, Hang Zhan, Cheng Zhang, Jiao Zhang, ** Li, Min Zhu, Jianjie You, Dongjie Liu, Shiwen He, Guanghui He, Fengyi Yang, Yang Liu, Jianjun Wu, Jianmin Lu, Ge Li, Xiaowu Chen, Wenguang Chen, Wen Gao

Abstract: Sixth-generation (6G) networks are evolving towards new features and order-of-magnitude enhancement of systematic performance metrics compared to the current 5G. In particular, the 6G networks are expected to achieve extreme connectivity performance with Tbps-scale data rate, Kbps/Hz-scale spectral efficiency, and $μ$s-scale latency. To this end, an original three-layer 6G network architecture is designed to realise uniform full-spectrum cell-free radio access and provide task-centric agile proximate support for diverse applications. The designed architecture is featured by super edge node (SEN) which integrates connectivity, computing, AI, data, etc. On this basis, a technological framework of pervasive multi-level (PML) AI is established in the centralised unit to enable task-centric near-real-time resource allocation and network automation. We then introduce a radio access network (RAN) architecture of full spectrum uniform cell-free networks, which is among the most attractive RAN candidates for 6G TK$μ$ extreme connectivity. A few most promising key technologies, i.e., cell-free massive MIMO, photonics-assisted Terahertz wireless access and spatiotemporal two-dimensional channel coding are further discussed. A testbed is implemented and extensive trials are conducted to evaluate innovative technologies and methodologies. The proposed 6G network architecture and technological framework demonstrate exciting potentials for full-service and full-scenario applications.

Submitted 11 October, 2022; v1 submitted 1 August, 2022; originally announced August 2022.

Comments: 8 pages, 4 figures, in peer review with IEEE Wireless Communications Magazine

arXiv:2207.14532 [pdf, other]

doi 10.1088/1475-7516/2023/01/015

Enhance Primordial Black Hole Abundance through the Non-linear Processes around Bounce Point

Authors: Jie-Wen Chen, Mian Zhu, Sheng-Feng Yan, Qing-Qing Wang, Yifu Cai

Abstract: The non-singular bouncing cosmology is an alternative paradigm to inflation, wherein the background energy density vanishes at the bounce point, in the context of Einstein gravity. Therefore, the non-linear effects in the evolution of density fluctuations ($δρ$) may be strong in the bounce phase, which potentially provides a mechanism to enhance the abundance of primordial black holes (PBHs). This article presents a comprehensive illustration of PBH enhancement due to the bounce phase. To calculate the non-linear evolution of $δρ$, the Raychaudhuri equation is numerically solved here. Since the non-linear processes may lead to a non-Gaussian probability distribution function for $δρ$ after the bounce point, the PBH abundance is calculated in a modified Press-Schechter formalism. In this case, the criterion of PBH formation is complicated, due to the complicated non-linear evolutionary behavior of $δρ$ during the bounce phase. Our results indicate that the bounce phase indeed has the potential to enhance the PBH abundance sufficiently. Furthermore, the PBH abundance is applied to constrain the parameters of the bounce phase, providing a complement to surveys of the cosmic microwave background and large scale structure.

Submitted 29 July, 2022; originally announced July 2022.

Comments: 17 pages, 6 figures

Journal ref: JCAP01 (2023) 015

arXiv:2207.12179 [pdf, other]

Time-constrained Dynamic Mechanisms for College Admissions

Authors: Li Chen, Juan S. Pereyra, Min Zhu

Abstract: Recent literature shows that dynamic matching mechanisms may outperform the standard mechanisms to deliver desirable results. We highlight an under-explored design dimension, the time constraints that students face under such a dynamic mechanism. First, we theoretically explore the effect of time constraints and show that the outcome can be worse than the outcome produced by the student-proposing deferred acceptance mechanism. Second, we present evidence from the Inner Mongolian university admissions that time constraints can prevent dynamic mechanisms from achieving stable outcomes, creating losers and winners among students.

Submitted 25 July, 2022; originally announced July 2022.

arXiv:2207.10289 [pdf, other]

doi 10.1016/j.cma.2022.115671

A comprehensive study of non-adaptive and residual-based adaptive sampling for physics-informed neural networks

Authors: Chenxi Wu, Min Zhu, Qinyang Tan, Yadhu Kartha, Lu Lu

Abstract: Physics-informed neural networks (PINNs) have been shown to be an effective tool for solving forward and inverse problems of partial differential equations (PDEs). PINNs embed the PDEs into the loss of the neural network, and this PDE loss is evaluated at a set of scattered residual points. The distribution of these points is highly important to the performance of PINNs. However, in the existing studies on PINNs, only a few simple residual point sampling methods have mainly been used. Here, we present a comprehensive study of two categories of sampling: non-adaptive uniform sampling and adaptive nonuniform sampling. We consider six uniform sampling methods, including (1) equispaced uniform grid, (2) uniformly random sampling, (3) Latin hypercube sampling, (4) Halton sequence, (5) Hammersley sequence, and (6) Sobol sequence. We also consider a resampling strategy for uniform sampling. To improve the sampling efficiency and the accuracy of PINNs, we propose two new residual-based adaptive sampling methods: residual-based adaptive distribution (RAD) and residual-based adaptive refinement with distribution (RAR-D), which dynamically improve the distribution of residual points based on the PDE residuals during training. Hence, we have considered a total of 10 different sampling methods, including six non-adaptive uniform sampling methods, uniform sampling with resampling, two proposed adaptive sampling methods, and an existing adaptive sampling method. We extensively tested the performance of these sampling methods for four forward problems and two inverse problems in many setups. Our numerical results presented in this study are summarized from more than 6000 simulations of PINNs. We show that the proposed adaptive sampling methods of RAD and RAR-D significantly improve the accuracy of PINNs with fewer residual points. The results obtained in this study can also be used as a practical guideline in choosing sampling methods.

Submitted 20 July, 2022; originally announced July 2022.
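
The abstract above describes drawing residual points from a distribution driven by the PDE residual. The following is a rough sketch of that idea for a one-dimensional toy case. The weighting used here (residual magnitude to the power `k`, divided by its mean, plus a constant `c`) is an assumption made for illustration, since the abstract does not state the exact density. The function name `residual_based_resample` and all parameter names are hypothetical.

```python
import random


def residual_based_resample(candidates, residual_fn, n_points, k=1.0, c=1.0, rng=None):
    """Draw n_points from candidates with probability tied to the residual.

    candidates:  dense list of candidate locations in the domain
    residual_fn: callable returning the PDE residual at a location
    k, c:        assumed hyperparameters; larger k sharpens the focus on
                 high-residual regions, larger c moves toward uniform sampling
    Sampling is done with replacement to keep the sketch short.
    """
    if rng is None:
        rng = random.Random(0)
    eps_k = [abs(residual_fn(x)) ** k for x in candidates]
    mean = sum(eps_k) / len(eps_k)
    if mean == 0.0:
        # Residual is zero everywhere: fall back to uniform weights.
        weights = [1.0] * len(candidates)
    else:
        weights = [e / mean + c for e in eps_k]
    return rng.choices(candidates, weights=weights, k=n_points)
```

In a training loop, a call like this would be repeated every few iterations with the current network's residual, so the point set follows the regions where the PDE is least satisfied.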

arXiv:2207.08167 [pdf, ps, other]

Multiplicity and orbital stability of normalized solutions to non-autonomous Schrödinger equation with mixed nonlinearities

Authors: Xinfu Li, Li Xu, Meiling Zhu

Abstract: This paper studies the multiplicity of normalized solutions to the Schrödinger equation with mixed nonlinearities \begin{equation*} \begin{cases} -Δu=λu+h(εx)|u|^{q-2}u+η|u|^{p-2}u,\quad x\in \mathbb{R}^N, \\ \int_{\mathbb{R}^N}|u|^2dx=a^2, \end{cases} \end{equation*} where $a, ε, η>0$, $q$ is $L^2$-subcritical, $p$ is $L^2$-supercritical, $λ\in \mathbb{R}$ is an unknown parameter that appears as a Lagrange multiplier, and $h$ is a positive and continuous function. It is proved that the number of normalized solutions is at least the number of global maximum points of $h$ when $ε$ is small enough. Moreover, the orbital stability of the solutions obtained is analyzed as well. In particular, our results cover the Sobolev critical case $p=2N/(N-2)$.

Submitted 17 July, 2022; originally announced July 2022.

arXiv:2207.07824 [pdf, other]

Distributed Safe Learning and Planning for Multi-robot Systems

Authors: Zhenyuan Yuan, Minghui Zhu

Abstract: This paper considers the problem of online multi-robot motion planning with general nonlinear dynamics subject to unknown external disturbances. We propose dSLAP, a distributed safe learning and planning framework that allows the robots to safely navigate through the environments by coupling online learning and motion planning. Gaussian process regression is used to online learn the disturbances with uncertainty quantification. The safe motion planning algorithm ensures collision avoidance against the learning uncertainty and utilizes set-valued analysis to achieve fast adaptation in response to the newly learned models. A model predictive control problem is then formulated and solved to return a control policy that balances between actively exploring the unknown disturbances and reaching goal regions. Sufficient conditions are established to guarantee the safety of the robots in the absence of backup policy. Monte Carlo simulations are conducted for evaluation.

Submitted 19 November, 2023; v1 submitted 15 July, 2022; originally announced July 2022.

arXiv:2207.05733 [pdf, other]

A Skeleton-aware Graph Convolutional Network for Human-Object Interaction Detection

Authors: Manli Zhu, Edmond S. L. Ho, Hubert P. H. Shum

Abstract: Detecting human-object interactions is essential for comprehensive understanding of visual scenes. In particular, spatial connections between humans and objects are important cues for reasoning interactions. To this end, we propose a skeleton-aware graph convolutional network for human-object interaction detection, named SGCN4HOI. Our network exploits the spatial connections between human keypoints and object keypoints to capture their fine-grained structural interactions via graph convolutions. It fuses such geometric features with visual features and spatial configuration features obtained from human-object pairs. Furthermore, to better preserve the object structural information and facilitate human-object interaction detection, we propose a novel skeleton-based object keypoints representation. The performance of SGCN4HOI is evaluated in the public benchmark V-COCO dataset. Experimental results show that the proposed approach outperforms the state-of-the-art pose-based models and achieves competitive performance against other models.

Submitted 11 July, 2022; originally announced July 2022.

Comments: Accepted by IEEE SMC 2022

arXiv:2207.05406 [pdf, other]

doi 10.1103/PhysRevD.107.024022

Microlensing effect of charged spherically symmetric wormhole

Authors: Lei-Hua Liu, Mian Zhu, Wentao Luo, Yi-Fu Cai, Yi Wang

Abstract: We systematically investigate the microlensing effect of charged spherically symmetric wormhole, where the light source is remote from the throat. Remarkably, there will be at most three images by considering the charge part. We study all situations including three images, two images, and one image, respectively. The numerical result shows that the range of total magnification is from $10^5$ to $10^{-2}$ depending on various metrics. In the case of three images, there will be two maximal values of magnification (a peak, and a gentle peak) when the contribution via mass is much less than that of charge. However, we cannot distinguish the case that forms three images or only one image as the total magnification is of order $10^5$. Finally, our theoretical investigation could shed new light on exploring the wormhole with the microlensing effect.

Submitted 19 January, 2023; v1 submitted 12 July, 2022; originally announced July 2022.

Comments: 10 pages, 9 figures

Journal ref: Phys. Rev. D 107, 024022 (2023)

arXiv:2207.04584 [pdf, other]

HEGrid: A High Efficient Multi-Channel Radio Astronomical Data Gridding Framework in Heterogeneous Computing Environments

Authors: Hao Wang, Ce Yu, Jian Xiao, Shanjiang Tang, Min Long, Ming Zhu

Abstract: The challenge to fully exploit the potential of existing and upcoming scientific instruments like large single-dish radio telescopes is to process the collected massive data effectively and efficiently. As a "quasi 2D stencil computation" with the "Moore neighborhood pattern," gridding is the most computationally intensive step in data reduction pipeline for radio astronomy studies, enabling astronomers to create correct sky images for further analysis. However, the existing gridding frameworks can either only run on multi-core CPU architecture or do not support high-concurrency, multi-channel data gridding. Their performance is then limited, and there are emerging needs for innovative gridding frameworks to process data from large single-dish radio telescopes like the Five-hundred-meter Aperture Spherical Telescope (FAST). To address those challenges, we developed a High Efficient Gridding framework, HEGrid, by overcoming the above limitations. Specifically, we propose and construct the gridding pipeline in heterogeneous computing environments and achieve multi-pipeline concurrency for high performance multi-channel processing. Furthermore, we propose pipeline-based co-optimization to alleviate the potential negative performance impact of possible intra- and inter-pipeline low computation and I/O utilization, including component share-based redundancy elimination, thread-level data reuse and overlapping I/O and computation. Our experiments are based on both simulated datasets and actual FAST observational datasets. The results show that HEGrid outperforms other state-of-the-art gridding frameworks by up to 5.5x and has robust hardware portability, including AMD Radeon Instinct GPU and NVIDIA GPU.

Submitted 10 July, 2022; originally announced July 2022.

Comments: 12 pages, 17 figures

ACM Class: I.4.9; J.2

arXiv:2207.04448 [pdf, other]

Mix-Teaching: A Simple, Unified and Effective Semi-Supervised Learning Framework for Monocular 3D Object Detection

Authors: Lei Yang, Xinyu Zhang, Li Wang, Minghan Zhu, Chuang Zhang, Jun Li

Abstract: Monocular 3D object detection is an essential perception task for autonomous driving. However, the high reliance on large-scale labeled data makes it costly and time-consuming during model optimization. To reduce such over-reliance on human annotations, we propose Mix-Teaching, an effective semi-supervised learning framework that employs both labeled and unlabeled images in the training stage. Mix-Teaching first generates pseudo-labels for unlabeled images by self-training. The student model is then trained on the mixed images possessing much more intensive and precise labeling by merging instance-level image patches into empty backgrounds or labeled images. This is the first to break the image-level limitation and put high-quality pseudo labels from multi frames into one image for semi-supervised training. Besides, as a result of the misalignment between confidence score and localization quality, it's hard to discriminate high-quality pseudo-labels from noisy predictions using only confidence-based criterion. To that end, we further introduce an uncertainty-based filter to help select reliable pseudo boxes for the above mixing operation. To the best of our knowledge, this is the first unified SSL framework for monocular 3D object detection. Mix-Teaching consistently improves MonoFlex and GUPNet by significant margins under various labeling ratios on KITTI dataset. For example, our method achieves around +6.34% AP@0.7 improvement against the GUPNet baseline on validation set when using only 10% labeled data. Besides, by leveraging full training set and the additional 48K raw images of KITTI, it can further improve the MonoFlex by +4.65% on AP@0.7 for car detection, reaching 18.54% AP@0.7, which ranks the 1st place among all monocular based methods on KITTI test leaderboard. The code and pretrained models will be released at https://github.com/yanglei18/Mix-Teaching.

Submitted 10 July, 2022; originally announced July 2022.

Comments: 11 pages, 5 figures, 7 tables

arXiv:2207.00221 [pdf, other]

VL-CheckList: Evaluating Pre-trained Vision-Language Models with Objects, Attributes and Relations

Authors: Tiancheng Zhao, Tianqi Zhang, Mingwei Zhu, Haozhan Shen, Kyusong Lee, Xiaopeng Lu, Jianwei Yin

Abstract: Vision-Language Pretraining (VLP) models have recently successfully facilitated many cross-modal downstream tasks. Most existing works evaluated their systems by comparing the fine-tuned downstream task performance. However, average downstream task accuracy alone provides little information about the pros and cons of each VLP method, let alone insights on how the community can improve the systems in the future. Inspired by the CheckList for testing natural language processing, we exploit VL-CheckList, a novel framework to understand the capabilities of VLP models. The proposed method divides the image-texting ability of a VLP model into three categories: objects, attributes, and relations, and uses a novel taxonomy to further break down these three aspects. We conduct comprehensive studies to analyze seven recently popular VLP models via the proposed framework. Results confirm the effectiveness of the proposed method by revealing fine-grained differences among the compared models that were not visible from downstream task-only evaluation. Further results show a promising research direction in building better VLP models. Our data and code are available at: https://github.com/om-ai-lab/VL-CheckList.

Submitted 22 June, 2023; v1 submitted 1 July, 2022; originally announced July 2022.

Comments: 9 pages, preprint

arXiv:2206.14540 [pdf, ps, other]

Hardy-Sobolev inequalities with distance to the boundary weight functions

Authors: Lei Wang, Meijun Zhu

Abstract: This is the first part of our research on certain sharp Hardy-Sobolev inequalities and the related elliptic equations. In this part we shall establish some sharp weighted Hardy-Sobolev inequalities whose weights are distance functions to the boundary.

Submitted 29 June, 2022; originally announced June 2022.

Comments: 24 pages

MSC Class: 35A23 (Primary); 35B09; 35J70 (Secondary)

arXiv:2206.12281 [pdf]

Real-time Dual-channel 2 * 2 MIMO Fiber-THz-Fiber Seamless Integration System at 385 GHz and 435 GHz

Authors: Jiao Zhang, Min Zhu, Bingchang Hua, Mingzheng Lei, Yuancheng Cai, Liang Tian, Yucong Zou, Like Ma, Yongming Huang, Jianjun Yu, Xiaohu You

Abstract: We demonstrate the first practical real-time dual-channel fiber-THz-fiber 2 * 2 MIMO seamless integration system with a record net data rate of 2 * 103.125 Gb/s at 385 GHz and 435 GHz over two spans of 20 km SSMF and 3 m wireless link.

Submitted 24 June, 2022; originally announced June 2022.

Comments: This paper has been accepted by ECOC 2022

arXiv:2206.08474 [pdf, other]

XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence

Authors: Ming Zhu, Aneesh Jain, Karthik Suresh, Roshan Ravindran, Sindhu Tipirneni, Chandan K. Reddy

Abstract: Recent advances in machine learning have significantly improved the understanding of source code data and achieved good performance on a number of downstream tasks. Open source repositories like GitHub enable this process with rich unlabeled code data. However, the lack of high quality labeled data has largely hindered the progress of several code related tasks, such as program translation, summarization, synthesis, and code search. This paper introduces XLCoST, Cross-Lingual Code SnippeT dataset, a new benchmark dataset for cross-lingual code intelligence. Our dataset contains fine-grained parallel data from 8 languages (7 commonly used programming languages and English), and supports 10 cross-lingual code tasks. To the best of our knowledge, it is the largest parallel dataset for source code both in terms of size and the number of languages. We also provide the performance of several state-of-the-art baseline models for each task. We believe this new dataset can be a valuable asset for the research community and facilitate the development and validation of new methods for cross-lingual code intelligence.

Submitted 16 June, 2022; originally announced June 2022.

Comments: 20 pages, 11 tables, 2 figures

arXiv:2206.08024 [pdf, other]

doi 10.1016/j.nuclphysb.2022.116004

ODE/IM correspondence and supersymmetric affine Toda field equations

Authors: Katsushi Ito, Mingshuo Zhu

Abstract: We study the linear differential system associated with the supersymmetric affine Toda field equations for affine Lie superalgebras, which has a purely odd simple root system. For an affine Lie algebra, the linear problem modified by conformal transformation leads to an ordinary differential equation (ODE) that provides the functional relations in the integrable models. This is known as the ODE/IM correspondence. For the affine Lie superalgebras, the linear equations modified by a superconformal transformation are shown to reduce to a couple of ODEs for each bosonic subalgebra. In particular, for $osp(2,2)^{(2)}$, the corresponding ODE becomes the second-order ODE with squared potential, which is related to the ${\cal N}=1$ supersymmetric minimal model via the ODE/IM correspondence. We also find ODEs for classical affine Lie superalgebras with purely odd simple root systems.

Submitted 4 November, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

Comments: 28 pages; published version

Report number: TIT/HEP-690

arXiv:2206.05398 [pdf, other]

E2PN: Efficient SE(3)-Equivariant Point Network

Authors: Minghan Zhu, Maani Ghaffari, William A. Clark, Huei Peng

Abstract: This paper proposes a convolution structure for learning SE(3)-equivariant features from 3D point clouds. It can be viewed as an equivariant version of kernel point convolutions (KPConv), a widely used convolution form to process point cloud data. Compared with existing equivariant networks, our design is simple, lightweight, fast, and easy to be integrated with existing task-specific point cloud learning pipelines. We achieve these desirable properties by combining group convolutions and quotient representations. Specifically, we discretize SO(3) to finite groups for their simplicity while using SO(2) as the stabilizer subgroup to form spherical quotient feature fields to save computations. We also propose a permutation layer to recover SO(3) features from spherical features to preserve the capacity to distinguish rotations. Experiments show that our method achieves comparable or superior performance in various tasks, including object classification, pose estimation, and keypoint-matching, while consuming much less memory and running faster than existing work. The proposed method can foster the development of equivariant models for real-world applications based on point clouds.

Submitted 13 June, 2023; v1 submitted 10 June, 2022; originally announced June 2022.

Comments: CVPR 2023, 16 pages. See https://github.com/minghanz/E2PN for code

arXiv:2206.05239 [pdf, other]

StructCoder: Structure-Aware Transformer for Code Generation

Authors: Sindhu Tipirneni, Ming Zhu, Chandan K. Reddy

Abstract: There has been a recent surge of interest in automating software engineering tasks using deep learning. This paper addresses the problem of code generation, where the goal is to generate target code given source code in a different language or a natural language description. Most state-of-the-art deep learning models for code generation use training strategies primarily designed for natural language. However, understanding and generating code requires a more rigorous comprehension of the code syntax and semantics. With this motivation, we develop an encoder-decoder Transformer model where both the encoder and decoder are explicitly trained to recognize the syntax and data flow in the source and target codes, respectively. We not only make the encoder structure-aware by leveraging the source code's syntax tree and data flow graph, but we also support the decoder in preserving the syntax and data flow of the target code by introducing two novel auxiliary tasks: AST (Abstract Syntax Tree) paths prediction and data flow prediction. To the best of our knowledge, this is the first work to introduce a structure-aware Transformer decoder that models both syntax and data flow to enhance the quality of generated code. The proposed StructCoder model achieves state-of-the-art performance on code translation and text-to-code generation tasks in the CodeXGLUE benchmark, and improves over baselines of similar size on the APPS code generation benchmark. Our code is publicly available at https://github.com/reddy-lab-code-research/StructCoder/.

Submitted 30 January, 2024; v1 submitted 10 June, 2022; originally announced June 2022.

arXiv:2206.01339 [pdf, other]

A peristaltic soft, wearable robot for compression and massage therapy

Authors: Mengjia Zhu, Adrian Ferstera, Stejara Dinulescu, Nikolas Kastor, Max Linnander, Elliot W. Hawkes, Yon Visell

Abstract: Soft robotics is attractive for wearable applications that require conformal interactions with the human body. Soft wearable robotic garments hold promise for supplying dynamic compression or massage therapies, such as are applied for disorders affecting lymphatic and blood circulation. In this paper, we present a wearable robot capable of supplying dynamic compression and massage therapy via peristaltic motion of finger-sized soft, fluidic actuators. We show that this peristaltic wearable robot can supply dynamic compression pressures exceeding 22 kPa at frequencies of 14 Hz or more, meeting requirements for compression and massage therapy. A large variety of software-programmable compression wave patterns can be generated by varying frequency, amplitude, phase delay, and duration parameters. We first demonstrate the utility of this peristaltic wearable robot for compression therapy, showing fluid transport in a laboratory model of the upper limb. We theoretically and empirically identify driving regimes that optimize fluid transport. We second demonstrate the utility of this garment for dynamic massage therapy. These findings show the potential of such a wearable robot for the treatment of several health disorders associated with lymphatic and blood circulation, such as lymphedema and blood clots.

Submitted 2 June, 2022; originally announced June 2022.

Comments: 10 pages, 10 figures

arXiv:2205.13294 [pdf, other]

Analytical Interpretation of Latent Codes in InfoGAN with SAR Images

Authors: Zhenpeng Feng, Milos Dakovic, Hongbing Ji, Mingzhe Zhu, Ljubisa Stankovic

Abstract: Generative Adversarial Networks (GANs) can synthesize abundant photo-realistic synthetic aperture radar (SAR) images. Some recent GANs (e.g., InfoGAN) are even able to edit specific properties of the synthesized images by introducing latent codes. This is crucial for SAR image synthesis, since the targets in real SAR images have different properties due to the imaging mechanism. Despite the success of InfoGAN in manipulating properties, a clear explanation of how these latent codes affect synthesized properties is still lacking, so editing specific properties usually relies on empirical trials, which are unreliable and time-consuming. In this paper, we show that latent codes are disentangled to affect the properties of SAR images in a non-linear manner. By introducing some property estimators for latent codes, we are able to provide a completely analytical nonlinear model to decompose the entangled causality between latent codes and different properties. The qualitative and quantitative experimental results further reveal that the properties can be calculated from latent codes and, inversely, that satisfying latent codes can be estimated given desired properties. In this case, properties can be manipulated by latent codes as we expect.

Submitted 26 May, 2022; originally announced May 2022.

Comments: 13 pages, 14 figures

arXiv:2205.12042 [pdf, other]

HCFRec: Hash Collaborative Filtering via Normalized Flow with Structural Consensus for Efficient Recommendation

Authors: Fan Wang, Weiming Liu, Chaochao Chen, Mengying Zhu, Xiaolin Zheng

Abstract: The ever-increasing data scale of user-item interactions makes it challenging for an effective and efficient recommender system. Recently, hash-based collaborative filtering (Hash-CF) approaches employ efficient Hamming distance of learned binary representations of users and items to accelerate recommendations. However, Hash-CF often faces two challenging problems, i.e., optimization on discrete representations and preserving semantic information in learned representations. To address the above two challenges, we propose HCFRec, a novel Hash-CF approach for effective and efficient recommendations. Specifically, HCFRec not only innovatively introduces normalized flow to learn the optimal hash code by efficiently fitting a proposed approximate mixture multivariate normal distribution, a continuous but approximately discrete distribution, but also deploys a cluster consistency preserving mechanism to preserve the semantic structure in representations for more accurate recommendations. Extensive experiments conducted on six real-world datasets demonstrate the superiority of our HCFRec compared to the state-of-the-art methods in terms of effectiveness and efficiency.

Submitted 24 May, 2022; originally announced May 2022.

arXiv:2205.09305 [pdf, other]

FedILC: Weighted Geometric Mean and Invariant Gradient Covariance for Federated Learning on Non-IID Data

Authors: Mike He Zhu, Léna Néhale Ezzine, Dianbo Liu, Yoshua Bengio

Abstract: Federated learning is a distributed machine learning approach which enables a shared server model to learn by aggregating the locally-computed parameter updates with the training data from spatially-distributed client silos. Though successfully possessing advantages in both scale and privacy, federated learning is hurt by domain shift problems, where the learning models are unable to generalize to unseen domains whose data distribution is non-i.i.d. with respect to the training domains. In this study, we propose the Federated Invariant Learning Consistency (FedILC) approach, which leverages the gradient covariance and the geometric mean of Hessians to capture both inter-silo and intra-silo consistencies of environments and unravel the domain shift problems in federated networks. The benchmark and real-world dataset experiments bring evidence that our proposed algorithm outperforms conventional baselines and similar federated learning algorithms. This is relevant to various fields such as medical healthcare, computer vision, and the Internet of Things (IoT). The code is released at https://github.com/mikemikezhu/FedILC.

Submitted 18 May, 2022; originally announced May 2022.

arXiv:2205.06479 [pdf, ps, other]

Frame set for Gabor systems with Haar window

Authors: Xin-Rong Dai, Meng Zhu

Abstract: We show the full structure of the frame set for the Gabor system $\mathcal{G}(g;α,β):=\{e^{-2πi mβ\cdot}g(\cdot-nα):m,n\in\Bbb Z\}$ with the window being the Haar function $g=-χ_{[-1/2,0)}+χ_{[0,1/2)}$. The strategy of this paper is to introduce the piecewise linear transformation $\mathcal{M}$ on the unit circle, and to provide a complete characterization of structures for its (symmetric) maximal invariant sets. This transformation is related to the famous three gap theorem of Steinhaus which may be of independent interest. Furthermore, a classical criterion on Gabor frames is improved, which allows us to establish a necessary and sufficient condition for the Gabor system $\mathcal{G}(g;α,β)$ to be a frame, i.e., the symmetric invariant set of the transformation $\mathcal{M}$ is empty. Compared with the previous studies, the present paper provides a self-contained environment to study Gabor frames by a new perspective, which includes that the techniques developed here are new and all the proofs could be understood thoroughly by the readers without reference to the known results in the previous literature.

Submitted 13 May, 2022; originally announced May 2022.

MSC Class: Primary 42C15; 42C40; Secondary 28D05; 37A05; 94A20

arXiv:2205.03990 [pdf, other]

doi 10.1038/s42005-024-01521-z

Multi-resolution partial differential equations preserved learning framework for spatiotemporal dynamics

Authors: Xin-Yang Liu, Min Zhu, Lu Lu, Hao Sun, Jian-Xun Wang

Abstract: Traditional data-driven deep learning models often struggle with high training costs, error accumulation, and poor generalizability in complex physical processes. Physics-informed deep learning (PiDL) addresses these challenges by incorporating physical principles into the model. Most PiDL approaches regularize training by embedding governing equations into the loss function, yet this depends heavily on extensive hyperparameter tuning to weigh each loss term. To this end, we propose to leverage physics prior knowledge by ``baking'' the discretized governing equations into the neural network architecture via the connection between the partial differential equations (PDE) operators and network structures, resulting in a PDE-preserved neural network (PPNN). This method, embedding discretized PDEs through convolutional residual networks in a multi-resolution setting, largely improves the generalizability and long-term prediction accuracy, outperforming conventional black-box models. The effectiveness and merit of the proposed methods have been demonstrated across various spatiotemporal dynamical systems governed by spatiotemporal PDEs, including reaction-diffusion, Burgers', and Navier-Stokes equations.

Submitted 13 January, 2024; v1 submitted 8 May, 2022; originally announced May 2022.

Comments: 51 pages, 27 figures

Journal ref: Commun Phys 7, 31 (2024)

arXiv:2205.01805 [pdf, other]

doi 10.1109/MIPR.2019.00024

Splicing Detection and Localization In Satellite Imagery Using Conditional GANs

Authors: Emily R. Bartusiak, Sri Kalyan Yarlagadda, David Güera, Paolo Bestagini, Stefano Tubaro, Fengqing M. Zhu, Edward J. Delp

Abstract: The widespread availability of image editing tools and improvements in image processing techniques allow image manipulation to be very easy. Oftentimes, easy-to-use yet sophisticated image manipulation tools yield distortions/changes imperceptible to the human observer. Distribution of forged images can have drastic ramifications, especially when coupled with the speed and vastness of the Internet. Therefore, verifying image integrity poses an immense and important challenge to the digital forensic community. Satellite images specifically can be modified in a number of ways, including the insertion of objects to hide existing scenes and structures. In this paper, we describe the use of a Conditional Generative Adversarial Network (cGAN) to identify the presence of such spliced forgeries within satellite images. Additionally, we identify their locations and shapes. Trained on pristine and falsified images, our method achieves high success on these detection and localization objectives.

Submitted 3 May, 2022; originally announced May 2022.

Comments: Accepted to the 2019 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)

Journal ref: IEEE Conference on Multimedia Information Processing and Retrieval, pp. 91-96, March 2019, San Jose, CA

arXiv:2205.00906 [pdf]

doi 10.1109/JLT.2022.3176401

Scattering-assisted and logic-controllable WGM laser in liquid crystal micropillar

Authors: ** Chuan Zhang, Hong Yang Zhu, Xiao Mei Zhu, Yan Li Zhang, Zhao Wang, Fei Liang Chen, Ke Li, Xiao Feng Li, Wei Li Zhang

Abstract: Whispering gallery mode (WGM) microcavities can efficiently store and manipulate light with strong light confinement and long photon lifetime, while coupling light into and from WGMs is intrinsically hindered by their unique feature of rotational symmetry. Here, a scattering-assisted liquid crystal (LC) micropillar WGM laser is proposed. WGM lasing at the surface of the micropillar is obviously enhanced by fluorescence scattering in the core of the micropillar. Besides, weak scattering of LC molecules also builds efficient coupling channels between the laser modes and the axial transmission modes of the micropillar-based waveguide, providing an all-in-one liquid WGM laser with functions of self-seeding and self-guiding. Furthermore, based on the hysteresis characteristics of the electrically anchored LC molecules under the interaction of thermal force, an erasable read-write liquid memory device is proposed, paving the way for the application of logic-controllable WGM lasers in optical storage and optical control.

Submitted 2 May, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

arXiv:2204.08002 [pdf]

Multiplication of freestanding semiconductor membranes from a single wafer by advanced remote epitaxy

Authors: Hyunseok Kim, Yunpeng Liu, Kuangye Lu, Celesta S. Chang, Kuan Qiao, Ki Seok Kim, Bo-In Park, Junseok Jeong, Menglin Zhu, Jun Min Suh, Yongmin Baek, You ** Ji, Sungsu Kang, Sangho Lee, Ne Myo Han, Chansoo Kim, Chanyeol Choi, Xinyuan Zhang, Haozhe Wang, Ling** Kong, Jungwon Park, Kyusang Lee, Geun Young Yeom, Sungkyu Kim, **woo Hwang , et al. (4 additional authors not shown)

Abstract: Freestanding single-crystalline membranes are an important building block for functional electronics. In particular, compound semiconductor membranes such as III-N and III-V offer great opportunities for optoelectronics, high-power electronics, and high-speed computing. Despite huge efforts to produce such membranes by detaching epitaxial layers from donor wafers, it is still challenging to harvest epitaxial layers using practical processes. Here, we demonstrate a method to grow and harvest multiple epitaxial membranes with extremely high throughput at the wafer scale. For this, 2D materials are directly formed on III-N and III-V substrates in epitaxy systems, which enables an advanced remote epitaxy scheme comprised of multiple alternating layers of 2D materials and epitaxial layers that can be formed by a single epitaxy run. Each epilayer in the multi-stack structure is then harvested by layer-by-layer peeling, producing multiple freestanding membranes with unprecedented throughput from a single wafer. Because 2D materials allow peeling at the interface without damaging the epilayer or the substrate, wafers can be reused for subsequent membrane production. Therefore, this work represents a meaningful step toward high-throughput and low-cost production of single-crystal membranes that can be heterointegrated.

Submitted 7 April, 2022; originally announced April 2022.

arXiv:2204.05840 [pdf, other]

doi 10.1088/1674-4527/ac6796

Extragalactic HI survey with FAST: First look of the pilot survey results

Authors: Jiangang Kang, Ming Zhu, Mei Ai, Haiyang Yu, Chun Sun

Abstract: As the first data release of a pilot extragalactic HI survey with the Five-hundred-meter Aperture Spherical radio Telescope (FAST), we extracted 544 extragalaxies from three-dimensional (3D) spectral data to perform interactive searching and computing, yielding global parameters for these detections, extending redshift ranges of the HI 21cm line up to z = 0.04, which covers part of the sky region in right ascension (R.A. or $α$) and declination (Dec or $δ$) range $00^{\rm h} 47^{\rm m}< \rm R.A.(J2000)<23^{\rm h}22^{\rm m}$ and $+24^{\circ}<\rm Dec.(J2000) <+43^{\circ}$. The S/N of the 544 HI detections is greater than 5, and each detection is flagged with a code from 1 to 4 based on baseline quality or RFI contamination. In addition, we find that 16 of them have no counterparts in the existing galaxy catalogs. The catalog can provide guidance for future HI observations with FAST.

Submitted 24 April, 2022; v1 submitted 12 April, 2022; originally announced April 2022.

Comments: 23 pages,17 figures, 4 tables. Accepted for publication in the Research in Astronomy and Astrophysics

arXiv:2203.02286 [pdf, other]

Semi-parametric Makeup Transfer via Semantic-aware Correspondence

Authors: Mingrui Zhu, Yun Yi, Nannan Wang, Xiaoyu Wang, Xinbo Gao

Abstract: The large discrepancy between the source non-makeup image and the reference makeup image is one of the key challenges in makeup transfer. Conventional approaches for makeup transfer either learn disentangled representation or perform pixel-wise correspondence in a parametric way between two images. We argue that non-parametric techniques have a high potential for addressing the pose, expression, and occlusion discrepancies. To this end, this paper proposes a \textbf{S}emi-\textbf{p}arametric \textbf{M}akeup \textbf{T}ransfer (SpMT) method, which combines the reciprocal strengths of non-parametric and parametric mechanisms. The non-parametric component is a novel \textbf{S}emantic-\textbf{a}ware \textbf{C}orrespondence (SaC) module that explicitly reconstructs content representation with makeup representation under the strong constraint of component semantics. The reconstructed representation is desired to preserve the spatial and identity information of the source image while "wearing" the makeup of the reference image. The output image is synthesized via a parametric decoder that draws on the reconstructed representation. Extensive experiments demonstrate the superiority of our method in terms of visual quality, robustness, and flexibility. Code and pre-trained model are available at \url{https://github.com/AnonymScholar/SpMT}.

Submitted 4 March, 2022; originally announced March 2022.

Comments: 20 pages, 2 tables, 17 figures

arXiv:2203.02104 [pdf, other]

Interactive Image Synthesis with Panoptic Layout Generation

Authors: Bo Wang, Tao Wu, Minfeng Zhu, Peng Du

Abstract: Interactive image synthesis from user-guided input is a challenging task when users wish to control the scene structure of a generated image with ease. Although remarkable progress has been made on layout-based image synthesis approaches, existing methods require high-precision inputs to produce realistic synthetic images in interactive scenes; such inputs often need several rounds of adjustment and are unfriendly to novice users. When placement of bounding boxes is subject to perturbation, layout-based models suffer from "missing regions" in the constructed semantic layouts and hence undesirable artifacts in the generated images. In this work, we propose Panoptic Layout Generative Adversarial Networks (PLGAN) to address this challenge. The PLGAN employs panoptic theory which distinguishes object categories between "stuff" with amorphous boundaries and "things" with well-defined shapes, such that stuff and instance layouts are constructed through separate branches and later fused into panoptic layouts. In particular, the stuff layouts can take amorphous shapes and fill up the missing regions left out by the instance layouts. We experimentally compare our PLGAN with state-of-the-art layout-based models on the COCO-Stuff, Visual Genome, and Landscape datasets. The advantages of PLGAN are not only visually demonstrated but quantitatively verified in terms of inception score, Fréchet inception distance, classification accuracy score, and coverage.

Submitted 28 March, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

Comments: Accepted by CVPR 2022

arXiv:2203.00852 [pdf, other]

doi 10.1103/PhysRevA.106.022412

Preserving multi-level quantum coherence by dynamical decoupling

Authors: Xinxing Yuan, Yue Li, Mengxiang Zhang, Chang Liu, Mingdong Zhu, Xi Qin, Nikolay V. Vitanov, Yiheng Lin, Jiangfeng Du

Abstract: Quantum information processing with multi-level systems (qudits) provides features and applications beyond those of two-level systems. However, qudits are more prone to dephasing, and dynamical decoupling for qudits has never been experimentally demonstrated. Here, as a proof-of-principle demonstration, we experimentally apply dynamical decoupling to protect superpositions with three levels of a trapped $^9\rm{Be}^+$ ion from ambient magnetic field noise, prolonging coherence by up to approximately an order of magnitude. Our demonstration, straightforwardly scalable to more levels, may open up a path toward long coherence quantum memory, metrology and information processing with qudits.

Submitted 1 March, 2022; originally announced March 2022.

Comments: 6 pages, 7 figures

Journal ref: Phys. Rev. A 106, 022412 (2022)

arXiv:2202.08935 [pdf, other]

A Formal Safety Characterization of Advanced Driver Assist Systems in the Car-Following Regime with Scenario-Sampling

Authors: Bowen Weng, Minghao Zhu, Keith Redmill

Abstract: The capability to follow a lead-vehicle and avoid rear-end collisions is one of the most important functionalities for human drivers and various Advanced Driver Assist Systems (ADAS). Existing safety performance justification of the car-following systems either relies on simple concrete scenarios with biased surrogate metrics or requires a significantly long driving distance for risk observation and inference. In this paper, we propose a guaranteed unbiased and sampling efficient scenario-based safety evaluation framework inspired by the previous work on $εδ$-almost safe set quantification. The proposal characterizes the complete safety performance of the test subject in the car-following regime. The performance of the proposed method is also demonstrated in challenging cases including some widely adopted car-following decision-making modules and the commercially available Openpilot driving stack by CommaAI.

Submitted 23 May, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

arXiv:2202.07099 [pdf, other]

Two Gaussian regularization methods for time-varying networks

Authors: Jie Jian, Peijun Sang, Mu Zhu

Abstract: We model time-varying network data as realizations from multivariate Gaussian distributions with precision matrices that change over time. To facilitate parameter estimation, we require not only that each precision matrix at any given time point be sparse, but also that precision matrices at neighboring time points be similar. We accomplish this with two different algorithms, by generalizing the elastic net and the fused LASSO, respectively. Our main focuses are efficient computational algorithms and convenient degree-of-freedom formulae for choosing tuning parameters. We illustrate our methods with two simulation studies. By applying them to an fMRI data set, we also detect some interesting differences in brain connectivity between healthy individuals and ADHD patients.

Submitted 9 March, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

arXiv:2202.04494 [pdf]

A Circle Grid-based Approach for Obstacle Avoidance Motion Planning of Unmanned Surface Vehicles

Authors: Man Zhu, Changshi Xiao, Shangding Gu, Zhe Du, Yuanqiao Wen

Abstract: Aiming at an obstacle avoidance problem with dynamic constraints for Unmanned Surface Vehicle (USV), a method based on Circle Grid Trajectory Cell (CGTC) is proposed. Firstly, the ship model and standardization rules are constructed to develop and constrain the trajectory, respectively. Secondly, by analyzing the properties of the circle grid, the circle grid tree is produced to guide the motion of the USV. Then, the kinematics and dynamics of the USV are considered through the on-line trajectory generator by designing a relational function that links the rudder angle, heading angle, and the central angle of the circle grid. Finally, obstacle avoidance is achieved by leveraging the on-line trajectory generator to choose a safe, smooth, and efficient path for the USV. The experimental results indicate that the proposed method can avoid both static and dynamic obstacles and performs better in terms of distance cost and steering cost compared with related methods, incurring only 50% of the steering cost of the grid-based method. The collision avoidance path not only conforms to the USV dynamic characteristics but also provides a reference for steering commands.

Submitted 9 February, 2022; originally announced February 2022.

arXiv:2202.04362 [pdf, ps, other]

Some integral inequalities on weighted Riemannian manifolds with boundary

Authors: Guangyue Huang, Mingfang Zhu

Abstract: In this paper, we continue to study some applications with respect to a Reilly type integral formula associated with the $φ$-Laplacian. Some inequalities of Brascamp-Lieb type and Colesanti type are provided.

Submitted 23 February, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

Comments: All comments are welcome

arXiv:2202.04225 [pdf, other]

doi 10.1021/acsaelm.2c00185

Kinetically-controlled epitaxial growth of Fe$_3$GeTe$_2$ van der Waals ferromagnetic films

Authors: Wenyi Zhou, Alexander J. Bishop, Menglin Zhu, Igor Lyalin, Robert C. Walko, Jay A. Gupta, Jinwoo Hwang, Roland K. Kawakami

Abstract: We demonstrate that kinetics play an important role in the epitaxial growth of Fe$_3$GeTe$_2$ (FGT) van der Waals (vdW) ferromagnetic films by molecular beam epitaxy. By varying the deposition rate, we control the formation or suppression of an initial tellurium-deficient non-van der Waals phase (Fe$_3$Ge$_2$) prior to realizing epitaxial growth of the vdW FGT phase. Using cross-sectional scanning transmission electron microscopy and scanning tunneling microscopy, we optimize the FGT films to have atomically smooth surfaces and abrupt interfaces with the Ge(111) substrate. The magnetic properties of our high quality material are confirmed through magneto-optic, magnetotransport, and spin-polarized STM studies. Importantly, this demonstrates how the interplay of energetics and kinetics can help tune the re-evaporation rate of chalcogen atoms and interdiffusion from the underlayer, which paves the way for future studies of van der Waals epitaxy.

Submitted 8 February, 2022; originally announced February 2022.

Comments: 18 pages, 4 figures

Journal ref: ACS Appl. Electron. Mater. 4, 3190 (2022)

arXiv:2202.04003 [pdf, ps, other]

Differentiable N-gram Objective on Abstractive Summarization

Authors: Yunqi Zhu, Xuebing Yang, Yuanyuan Wu, Mingjin Zhu, Wensheng Zhang

Abstract: ROUGE is a standard automatic evaluation metric based on n-grams for sequence-to-sequence tasks, while cross-entropy loss is an essential objective of neural network language models that optimizes at a unigram level. We present differentiable n-gram objectives, attempting to alleviate the discrepancy between the training criterion and the evaluation criterion. The objective maximizes the probabilistic weight of matched sub-sequences, and the novelty of our work is that the objective weights the matched sub-sequences equally and does not cap the number of matched sub-sequences at the ground-truth count of n-grams in the reference sequence. We jointly optimize cross-entropy loss and the proposed objective, providing decent ROUGE score enhancement on the abstractive summarization datasets CNN/DM and XSum and outperforming alternative n-gram objectives.

Submitted 25 December, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

arXiv:2202.03735 [pdf, other]

Navigating to Objects in Unseen Environments by Distance Prediction

Authors: Minzhao Zhu, Binglei Zhao, Tao Kong

Abstract: The Object Goal Navigation (ObjectNav) task is to navigate an agent to an object category in unseen environments without a pre-built map. In this paper, we solve this task by predicting the distance to the target using semantically-related objects as cues. Based on the estimated distance to the target object, our method directly chooses optimal mid-term goals that are more likely to have a shorter path to the target. Specifically, based on the learned knowledge, our model takes a bird's-eye view semantic map as input, and estimates the path length from the frontier map cells to the target object. With the estimated distance map, the agent can simultaneously explore the environment and navigate to the target objects based on a simple human-designed strategy. Empirical results in visually realistic simulation environments show that the proposed method outperforms a wide range of baselines on success rate and efficiency. A real-robot experiment also demonstrates that our method generalizes well to the real world. Video at https://www.youtube.com/watch?v=R79pWVGFKS4

Submitted 13 July, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

Comments: IROS 2022, Video at https://www.youtube.com/watch?v=R79pWVGFKS4

arXiv:2202.03716 [pdf, other]

Binary Neural Networks as a general-propose compute paradigm for on-device computer vision

Authors: Guhong Nie, Lirui Xiao, Menglong Zhu, Dongliang Chu, Yue Shen, Peng Li, Kang Yang, Li Du, Bo Chen

Abstract: For binary neural networks (BNNs) to become the mainstream on-device computer vision algorithm, they must achieve a speed-vs-accuracy tradeoff superior to that of 8-bit quantization and establish a similar degree of general applicability in vision tasks. To this end, we propose a BNN framework comprising 1) a minimalistic inference scheme for hardware-friendliness, 2) an over-parameterized training scheme for high accuracy, and 3) a simple procedure to adapt to different vision tasks. The resultant framework overtakes 8-bit quantization in the speed-vs-accuracy tradeoff for classification, detection, segmentation, super-resolution and matching: our BNNs not only retain the accuracy levels of their 8-bit baselines but also showcase 1.3-2.4$\times$ faster FPS on mobile CPUs. Similar conclusions can be drawn for prototypical systolic-array-based AI accelerators, where our BNNs promise 2.8-7$\times$ fewer execution cycles than 8-bit and 2.1-2.7$\times$ fewer cycles than alternative BNN designs. These results suggest that the time for large-scale BNN adoption could be upon us.

Submitted 8 February, 2022; originally announced February 2022.

Comments: 13 pages, 3 figures

arXiv:2202.03406 [pdf, other]

Dependence model assessment and selection with DecoupleNets

Authors: Marius Hofert, Avinash Prasad, Mu Zhu

Abstract: Neural networks are suggested for learning a map from $d$-dimensional samples with any underlying dependence structure to multivariate uniformity in $d'$ dimensions. This map, termed DecoupleNet, is used for dependence model assessment and selection. If the data-generating dependence model was known, and if it was among the few analytically tractable ones, one such transformation for $d'=d$ is Rosenblatt's transform. DecoupleNets have multiple advantages. For example, they only require an available sample and are applicable to $d'<d$, in particular $d'=2$. This allows for simpler model assessment and selection, both numerically and, because $d'=2$, especially graphically. A graphical assessment method has the advantage of being able to identify why, or in which region of the domain, a candidate model does not provide an adequate fit, thus leading to model selection in particular regions of interest or improved model building strategies in such regions. Through simulation studies with data from various copulas, the feasibility and validity of this novel DecoupleNet approach is demonstrated. Applications to real world data illustrate its usefulness for model assessment and selection.

Submitted 5 October, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

MSC Class: 62H99; 65C60; 60E05; 62M45; 00A72; 65C10; 62M10

arXiv:2202.03183 [pdf, other]

TransFollower: Long-Sequence Car-Following Trajectory Prediction through Transformer

Authors: Meixin Zhu, Simon S. Du, Xuesong Wang, Hao Yang, Ziyuan Pu, Yinhai Wang

Abstract: Car-following refers to a control process in which the following vehicle (FV) tries to keep a safe distance between itself and the lead vehicle (LV) by adjusting its acceleration in response to the actions of the vehicle ahead. The corresponding car-following models, which describe how one vehicle follows another vehicle in the traffic flow, form the cornerstone for microscopic traffic simulation and intelligent vehicle development. One major motivation of car-following models is to replicate human drivers' longitudinal driving trajectories. To model the long-term dependency of future actions on historical driving situations, we developed a long-sequence car-following trajectory prediction model based on the attention-based Transformer model. The model follows a general format of encoder-decoder architecture. The encoder takes historical speed and spacing data as inputs and forms a mixed representation of historical driving context using multi-head self-attention. The decoder takes the future LV speed profile as input and outputs the predicted future FV speed profile in a generative way (instead of an auto-regressive way, avoiding compounding errors). Through cross-attention between encoder and decoder, the decoder learns to build a connection between historical driving and future LV speed, based on which a prediction of future FV speed can be obtained. We train and test our model with 112,597 real-world car-following events extracted from the Shanghai Naturalistic Driving Study (SH-NDS).
Results show that the model outperforms the traditional intelligent driver model (IDM), a fully connected neural network model, and a long short-term memory (LSTM) based model in terms of long-sequence trajectory prediction accuracy. We also visualized the self-attention and cross-attention heatmaps to explain how the model derives its predictions.

Submitted 4 February, 2022; originally announced February 2022.

arXiv:2201.11719 [pdf, other]

doi 10.1038/s41567-022-01825-3

Spin fluctuations associated with the collapse of the pseudogap in a cuprate superconductor

Authors: M. Zhu, D. J. Voneshen, S. Raymond, O. J. Lipscombe, C. C. Tam, S. M. Hayden

Abstract: Theories of the origin of superconductivity in cuprates are dependent on an understanding of their normal state which exhibits various competing orders. Transport and thermodynamic measurements on La$_{2-x}$Sr$_x$CuO$_4$ show signatures of a quantum critical point, including a peak in the electronic specific heat $C$ versus doping $p$, near the doping $p^{\star}$ where the pseudogap collapses. The fundamental nature of the fluctuations associated with this peak is unclear. Here we use inelastic neutron scattering to show that close to $T_c$ and near $p^{\star}$, there are very-low-energy collective spin excitations with characteristic energies $\hbar Γ\approx$~5 meV. Cooling and applying an 8.8~T magnetic field creates a mixed state with a stronger magnetic response below 10~meV. We conclude that the low-energy spin-fluctuations are due to the collapse of the pseudogap combined with an underlying tendency to magnetic order. We show that the large specific heat near $p^{\star}$ can be understood in terms of collective spin fluctuations. The spin fluctuations we measure exist across the superconducting phase diagram and may be related to the strange metal behaviour observed in overdoped cuprates.

Submitted 22 August, 2023; v1 submitted 27 January, 2022; originally announced January 2022.

Comments: Final author version

Journal ref: Nature Physics 19, 99 (2023)

arXiv:2201.11685 [pdf, other]

Generative Adversarial Exploration for Reinforcement Learning

Authors: Weijun Hong, Menghui Zhu, Minghuan Liu, Weinan Zhang, Ming Zhou, Yong Yu, Peng Sun

Abstract: Exploration is crucial for training the optimal reinforcement learning (RL) policy, where the key is to discriminate whether a visited state is novel. Most previous work focuses on designing heuristic rules or distance metrics to check whether a state is novel, without considering that such a discrimination process can be learned. In this paper, we propose a novel method called generative adversarial exploration (GAEX) to encourage exploration in RL by introducing an intrinsic reward output from a generative adversarial network, where the generator provides fake samples of states that help the discriminator identify less frequently visited states. Thus the agent is encouraged to visit those states which the discriminator is less confident to judge as visited. GAEX is easy to implement and has high training efficiency. In our experiments, we apply GAEX to DQN, and the DQN-GAEX algorithm achieves convincing performance on challenging exploration problems, including the games Venture, Montezuma's Revenge and Super Mario Bros, without further fine-tuning on complicated learning algorithms. To our knowledge, this is the first work to employ GAN in RL exploration problems.

Submitted 27 January, 2022; originally announced January 2022.

arXiv:2201.10135 [pdf, other]

doi 10.1103/PhysRevLett.129.250501

Observation of spin-tensor induced topological phase transitions of triply degenerate points with a trapped ion

Authors: Mengxiang Zhang, Xinxing Yuan, Xi-Wang Luo, Chang Liu, Yue Li, Mingdong Zhu, Xi Qin, Yiheng Lin, Jiangfeng Du

Abstract: Triply degenerate points (TDPs), which correspond to new types of topological semimetals, can support novel quasiparticles possessing effective integer spins while preserving Fermi statistics. Here by mapping the momentum space to the parameter space of a three-level system in a trapped ion, we experimentally explore the transitions between different types of TDPs driven by spin-tensor--momentum couplings. We observe the phase transitions between TDPs with different topological charges by measuring the Berry flux on a loop surrounding the gap-closing lines, and the jump of the Berry flux gives the jump of the topological charge (up to a $2π$ factor) across the transitions. For the Berry flux measurement, we employ a new method by examining the geometric rotations of both spin vectors and tensors, which lead to a generalized solid angle equal to the Berry flux. The controllability of multi-level ion offers a versatile platform to study high-spin physics and our work paves the way to explore novel topological phenomena therein.

Submitted 25 January, 2022; originally announced January 2022.

Comments: 9 pages, 10 figures

Journal ref: Phys. Rev. Lett. 129, 250501 (2022)

Showing 301–350 of 865 results for author: Zhu, M