Search | arXiv e-print repository

FAST polarization mapping of the SNR VRO 42.05.01

Authors: Li Xiao, Ming Zhu, Xiao-Hui Sun, Peng Jiang, Chun Sun

Abstract: We have obtained the polarization data cube of the VRO 42.05.01 supernova remnant at 1240 MHz using the Five-hundred-meter Aperture Spherical radio Telescope (FAST). Three-dimensional Faraday Synthesis is applied to the FAST data to derive the Faraday depth spectrum. The peak Faraday depth map shows a large area of enhanced foreground RM of ~60 rad m$^{-2}$ extending along the remnant's "wing" section, which coincides with a large-scale HI shell at -20 km/s. The two depolarization patches within the "wing" region with RM of 97 rad m$^{-2}$ and 55 rad m$^{-2}$ coincide with two HI structures in the HI shell. Faraday screen model fitting on the Canadian Galactic Plane Survey (CGPS) 1420 MHz full-scale polarization data reveals a distance of 0.7-0.8 $d_{\mathrm{SNR}}$ in front of the SNR with enhanced regular magnetic field there. The highly piled-up magnetic field indicates that the HI shell at -20 km/s could originate from an old evolved SNR.

Submitted 2 June, 2023; originally announced June 2023.

Comments: 9 pages, 8 figures, accepted by ApJ
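To give a feel for the rotation measures quoted in the abstract above, the short sketch below evaluates the standard Faraday rotation relation, angle = RM × λ², at the 1240 MHz observing frequency. It is only an arithmetic illustration: the function name is hypothetical, and the paper's Faraday synthesis pipeline is not reproduced here.

```python
import math

C_M_PER_S = 299_792_458.0  # speed of light in vacuum


def faraday_rotation_rad(rm_rad_per_m2, freq_hz):
    """Polarization-angle rotation (radians) for a given rotation measure.

    Uses the textbook relation delta_chi = RM * lambda^2.
    The function name is illustrative, not taken from the paper.
    """
    wavelength_m = C_M_PER_S / freq_hz
    return rm_rad_per_m2 * wavelength_m ** 2


# RM values quoted in the abstract, evaluated at 1240 MHz.
for rm in (55.0, 60.0, 97.0):
    angle = faraday_rotation_rad(rm, 1240e6)
    print(f"RM = {rm:5.1f} rad m^-2 -> {angle:.2f} rad ({math.degrees(angle):.0f} deg)")
```

At this frequency an RM of 60 rad m$^{-2}$ corresponds to about 3.5 rad of rotation, more than half a turn of the polarization angle.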

arXiv:2305.17289 [pdf, other]

Fourier-DeepONet: Fourier-enhanced deep operator networks for full waveform inversion with improved accuracy, generalizability, and robustness

Authors: Min Zhu, Shihang Feng, Youzuo Lin, Lu Lu

Abstract: Full waveform inversion (FWI) infers the subsurface structure information from seismic waveform data by solving a non-convex optimization problem. Data-driven FWI has been increasingly studied with various neural network architectures to improve accuracy and computational efficiency. Nevertheless, the applicability of pre-trained neural networks is severely restricted by potential discrepancies between the source function used in the field survey and the one utilized during training. Here, we develop a Fourier-enhanced deep operator network (Fourier-DeepONet) for FWI with the generalization of seismic sources, including the frequencies and locations of sources. Specifically, we employ the Fourier neural operator as the decoder of DeepONet, and we utilize source parameters as one input of Fourier-DeepONet, facilitating the resolution of FWI with variable sources. To test Fourier-DeepONet, we develop three new and realistic FWI benchmark datasets (FWI-F, FWI-L, and FWI-FL) with varying source frequencies, locations, or both. Our experiments demonstrate that compared with existing data-driven FWI methods, Fourier-DeepONet obtains more accurate predictions of subsurface structures in a wide range of source parameters. Moreover, the proposed Fourier-DeepONet exhibits superior robustness when handling data with Gaussian noise or missing traces and sources with Gaussian noise, paving the way for more reliable and accurate subsurface imaging across diverse real conditions.

Submitted 24 July, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

arXiv:2305.15847 [pdf, other]

Stability Improvement of Nuclear Magnetic Resonance Gyroscope with Self-Calibrating Parametric Magnetometer

Authors: Guoping Gao, Pingbo Hu, Feng Tang, Wenhui Liu, Xiangdong Zhang, Baoxu Wang, Dongge Deng, Mingzhi Zhu, Nan Zhao

Abstract: In this paper, we study the stability of a nuclear magnetic resonance gyroscope (NMRG), which employs Xe nuclear spins to measure inertial rotation rate. The Xe spin polarization is sensed by an in-situ Rb-magnetometer. The Rb-magnetometer works in a parametric oscillation mode (henceforth referred to as the Rb parametric magnetometer, or Rb-PM), in which the Larmor frequency of the Rb spins is modulated and the transverse components of Xe nuclear spin polarization are measured. As the measurement output of the Rb-PM, the phase of the Xe nuclear spin precession is eventually converted to the Xe nuclear magnetic resonance (NMR) frequencies and the inertial rotation rate. Here we provide a comprehensive study of the NMR phase measured by the Rb-PM, and analyze the influence of various control parameters, including the DC magnetic field, the frequency and phase of the modulation field, and the Rb resonance linewidth, on the stability of the NMR phase. Based on this analysis, we propose and implement a self-calibrating method to compensate the NMR phase drift during the Rb-PM measurement. With the self-calibrating Rb-PM, we demonstrate a significant improvement of the bias stability of the NMRG.

Submitted 25 May, 2023; originally announced May 2023.

Comments: 16 pages, 11 figures

arXiv:2305.14211 [pdf, other]

Towards Graph-hop Retrieval and Reasoning in Complex Question Answering over Textual Database

Authors: Minjun Zhu, Yixuan Weng, Shizhu He, Kang Liu, Jun Zhao

Abstract: In textual question answering (TQA) systems, complex questions often require retrieving multiple textual fact chains with multiple reasoning steps. However, existing benchmarks are limited to single-chain or single-hop retrieval scenarios. In this paper, we propose to conduct Graph-Hop -- a novel multi-chain and multi-hop retrieval and reasoning paradigm in complex question answering. We construct a new benchmark called ReasonGraphQA, which provides explicit and fine-grained evidence graphs for complex questions to support interpretable, comprehensive, and detailed reasoning. ReasonGraphQA also shows an advantage in reasoning diversity and scale. Moreover, we propose a strong graph-hop baseline called the Bidirectional Graph Retrieval (BGR) method for generating an explanation graph of textual evidence in knowledge reasoning and question answering. We have thoroughly evaluated existing evidence retrieval and reasoning models on ReasonGraphQA. Experiments highlight that Graph-Hop is a promising direction for answering complex questions, but it still has certain limitations. We have further studied mitigation strategies to meet these challenges and discuss future directions.

Submitted 23 May, 2023; originally announced May 2023.

arXiv:2305.13310 [pdf, other]

Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching

Authors: Yang Liu, Muzhi Zhu, Hengtao Li, Hao Chen, Xinlong Wang, Chunhua Shen

Abstract: Powered by large-scale pre-training, vision foundation models exhibit significant potential in open-world image understanding. However, unlike large language models that excel at directly tackling various language tasks, vision foundation models require a task-specific model structure followed by fine-tuning on specific tasks. In this work, we present Matcher, a novel perception paradigm that utilizes off-the-shelf vision foundation models to address various perception tasks. Matcher can segment anything by using an in-context example without training. Additionally, we design three effective components within the Matcher framework to collaborate with these foundation models and unleash their full potential in diverse perception tasks. Matcher demonstrates impressive generalization performance across various segmentation tasks, all without training. For example, it achieves 52.7% mIoU on COCO-20$^i$ with one example, surpassing the state-of-the-art specialist model by 1.6%. In addition, Matcher achieves 33.0% mIoU on the proposed LVIS-92$^i$ for one-shot semantic segmentation, outperforming the state-of-the-art generalist model by 14.4%. Our visualization results further showcase the open-world generality and flexibility of Matcher when applied to images in the wild. Our code can be found at https://github.com/aim-uofa/Matcher.

Submitted 19 January, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

Comments: Accepted to ICLR2024

arXiv:2305.10933 [pdf, other]

Primordial black holes from null energy condition violation during inflation

Authors: Yong Cai, Mian Zhu, Yun-Song Piao

Abstract: The violation of the null energy condition (NEC) is closely related to potential solutions for the cosmological singularity problem and may therefore play a crucial role in the very early universe. We explore a novel approach to generate primordial black holes (PBHs) via the violation of the NEC in a single-field inflationary scenario. In our scenario, the universe transitions from a first slow-roll inflation stage with a Hubble parameter H = Hinf1 to a second slow-roll inflation stage with H = Hinf2 > Hinf1, passing through an intermediate stage of NEC violation. The resulting primordial scalar power spectrum is naturally enhanced by the NEC violation at a certain wavelength. As a result, PBHs with masses and abundances of observational interest can be produced in our scenario. We also examine the phenomenological signatures of scalar-induced gravitational waves (SIGWs). Our work highlights the significance of utilizing a combination of PBHs, SIGWs, and primordial gravitational waves as a powerful probe for exploring the NEC violation during inflation.

Submitted 18 May, 2023; originally announced May 2023.

Comments: 8 pages, 5 figures

arXiv:2305.10338 [pdf]

Inertial-based Navigation by Polynomial Optimization: Inertial-Magnetic Attitude Estimation

Authors: Maoran Zhu, Yuanxin Wu

Abstract: Inertial-based navigation refers to the navigation methods or systems that have inertial information or sensors as the core part and integrate a spectrum of other kinds of sensors for enhanced performance. Through a series of papers, the authors attempt to explore information blending of inertial-based navigation by a polynomial optimization method. The basic idea is to model rigid motions as finite-order polynomials and then attack the involved navigation problems by optimally solving their coefficients, taking into consideration the constraints posed by inertial sensors and others. In the current paper, a continuous-time attitude estimation approach is proposed, which transforms the attitude estimation into a constant parameter determination problem by the polynomial optimization. Specifically, the continuous attitude is first approximated by a Chebyshev polynomial, of which the unknown Chebyshev coefficients are determined by minimizing the weighted residuals of initial conditions, dynamics and measurements. We apply the derived estimator to the attitude estimation with the magnetic and inertial sensors. Simulation and field tests show that the estimator has much better stability and faster convergence than the traditional extended Kalman filter does, especially in the challenging large initial state error scenarios.

Submitted 17 May, 2023; originally announced May 2023.

Comments: 12 pages, 15 figures

arXiv:2305.06334 [pdf]

Optimizing Drug Design by Merging Generative AI With Active Learning Frameworks

Authors: Isaac Filella-Merce, Alexis Molina, Marek Orzechowski, Lucía Díaz, Yang Ming Zhu, Julia Vilalta Mor, Laura Malo, Ajay S Yekkirala, Soumya Ray, Victor Guallar

Abstract: Traditional drug discovery programs are being transformed by the advent of machine learning methods. Among these, Generative AI methods (GM) have gained attention due to their ability to design new molecules and enhance specific properties of existing ones. However, current GM methods have limitations, such as low affinity towards the target, unknown ADME/PK properties, or the lack of synthetic tractability. To improve the applicability domain of GM methods, we have developed a workflow based on a variational autoencoder coupled with active learning steps. The designed GM workflow iteratively learns from molecular metrics, including drug likeliness, synthesizability, similarity, and docking scores. In addition, we also included a hierarchical set of criteria based on advanced molecular modeling simulations during a final selection step. We tested our GM workflow on two model systems, CDK2 and KRAS. In both cases, our model generated chemically viable molecules with a high predicted affinity toward the targets. Particularly, the proportion of high-affinity molecules inferred by our GM workflow was significantly greater than that in the training data. Notably, we also uncovered novel scaffolds significantly dissimilar to those known for each target. These results highlight the potential of our GM workflow to explore novel chemical space for specific targets, thereby opening up new possibilities for drug discovery endeavors.

Submitted 4 May, 2023; originally announced May 2023.

arXiv:2305.05410 [pdf, other]

Large Language Models Need Holistically Thought in Medical Conversational QA

Authors: Yixuan Weng, Bin Li, Fei Xia, Minjun Zhu, Bin Sun, Shizhu He, Kang Liu, Jun Zhao

Abstract: The medical conversational question answering (CQA) system aims at providing a series of professional medical services to improve the efficiency of medical care. Despite the success of large language models (LLMs) in complex reasoning tasks in various fields, such as mathematics, logic, and commonsense QA, they still need to improve with the increased complexity and specialization of the medical field. This is because medical CQA tasks require not only strong medical reasoning, but also the ability to think broadly and deeply. In this paper, to address these challenges in medical CQA tasks that need to be considered and understood in many aspects, we propose the Holistically Thought (HoT) method, which is designed to guide the LLMs to perform diffused and focused thinking for generating high-quality medical responses. The proposed HoT method has been evaluated through automated and manual assessments on three different medical CQA datasets in English and Chinese. The extensive experimental results show that our method can produce more correct, professional, and considerate answers than several state-of-the-art (SOTA) methods, manifesting its effectiveness. Our code is available at https://github.com/WENGSYX/HoT.

Submitted 10 May, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

arXiv:2305.05144 [pdf, other]

Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval

Authors: Shiyin Dong, Mingrui Zhu, Nannan Wang, Xinbo Gao

Abstract: Zero-shot sketch-based image retrieval (ZS-SBIR) is challenging due to the cross-domain nature of sketches and photos, as well as the semantic gap between seen and unseen image distributions. Previous methods fine-tune pre-trained models with various side information and learning strategies to learn a compact feature space that is shared between the sketch and photo domains and bridges seen and unseen classes. However, these efforts are inadequate in adapting domains and transferring knowledge from seen to unseen classes. In this paper, we present an effective "Adapt and Align" approach to address the key challenges. Specifically, we insert simple and lightweight domain adapters to learn new abstract concepts of the sketch domain and improve cross-domain representation capabilities. Inspired by recent advances in image-text foundation models (e.g., CLIP) on zero-shot scenarios, we explicitly align the learned image embedding with a more semantic text embedding to achieve the desired knowledge transfer from seen to unseen classes. Extensive experiments on three benchmark datasets and two popular backbones demonstrate the superiority of our method in terms of retrieval accuracy and flexibility.

Submitted 9 August, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

Comments: 10 pages, 7 figures, 6 tables

arXiv:2305.05128 [pdf]

A Kriging-Random Forest Hybrid Model for Real-time Ground Property Prediction during Earth Pressure Balance Shield Tunneling

Authors: Ziheng Geng, Chao Zhang, Yuhao Ren, Minxiang Zhu, Renpeng Chen, Hongzhan Cheng

Abstract: A kriging-random forest hybrid model is developed for real-time ground property prediction ahead of the earth pressure balanced shield by integrating Kriging extrapolation and random forest, which can guide shield operating parameter selection and thereby mitigate construction risks. The proposed KRF algorithm synergizes two types of information: prior information and real-time information. The previously predicted ground properties with EPB operating parameters are extrapolated via the Kriging algorithm to provide prior information for the prediction of the ground properties currently being excavated. The real-time information refers to the real-time operating parameters of the EPB shield, which are input into random forest to provide a real-time prediction of ground properties. The integration of these two predictions is achieved by assigning weights to each prediction according to their uncertainties, ensuring the prediction of KRF with minimum uncertainty. The performance of the KRF algorithm is assessed via a case study of the Changsha Metro Line 4 project. It reveals that the proposed KRF algorithm can predict ground properties with an accuracy of 93%, outperforming the existing algorithms of LightGBM, AdaBoost-CART, and DNN by 29%, 8%, and 12%, respectively. Another dataset from the Shenzhen Metro Line 13 project is utilized to further evaluate the model generalization performance, revealing that the model can transfer its learned knowledge from one region to another with an accuracy of 89%.

Submitted 8 May, 2023; originally announced May 2023.
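The abstract above describes combining a Kriging-based prior prediction with a random forest prediction by weighting each according to its uncertainty. One common way to do this is inverse-variance weighting, sketched below. This is an assumption for illustration only: the abstract does not state the exact weighting rule, and the sketch treats each prediction as a scalar with a variance, which may differ from how the paper represents ground properties. All names are hypothetical.

```python
def fuse_predictions(mean_prior, var_prior, mean_realtime, var_realtime):
    """Inverse-variance weighted fusion of two independent scalar estimates.

    For independent, unbiased estimates this choice of weights gives the
    linear combination with minimum variance. Hypothetical helper, not the
    KRF implementation from the paper.
    """
    if var_prior <= 0 or var_realtime <= 0:
        raise ValueError("variances must be positive")
    total = var_prior + var_realtime
    w_prior = var_realtime / total      # less uncertain estimate gets more weight
    w_realtime = var_prior / total
    fused_mean = w_prior * mean_prior + w_realtime * mean_realtime
    fused_var = var_prior * var_realtime / total
    return fused_mean, fused_var


# Example: a confident prior (variance 1.0) and a noisier real-time estimate (variance 3.0).
mean, var = fuse_predictions(10.0, 1.0, 14.0, 3.0)
print(mean, var)
```

The fused variance is never larger than the smaller of the two input variances, which is the sense in which such a combination has "minimum uncertainty".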

arXiv:2305.03283 [pdf, other]

doi 10.1007/JHEP08(2023)007

WKB analysis of the linear problem for modified affine Toda field equations

Authors: Katsushi Ito, Mingshuo Zhu

Abstract: We study the WKB analysis of the solutions to the linear problem for a modified affine Toda field equation, which is equivalent to the higher-order ordinary differential equation (ODE) studied in the ODE/IM correspondence. After gauge transformation, we diagonalize the flat connection of the linear problem to reduce the latter to a set of independent first-order linear differential equations. We explicitly perform this procedure for classical affine Lie algebras with lower ranks. In particular, we study the WKB solutions of the $D_r^{(1)}$- and $D^{(2)}_{r+1}$-type linear problems, which correspond to the higher-order ODEs with the pseudo-differential operator. The diagonalized connection is obtained from the Riccati equations of the adjoint linear problem and related to the conserved currents of the integrable hierarchy constructed by Drinfeld and Sokolov up to total derivatives.

Submitted 21 July, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

Comments: 42 pages

Report number: TIT/HEP-694

Journal ref: JHEP08(2023)007

arXiv:2304.14908 [pdf, other]

Compiler Auto-tuning through Multiple Phase Learning

Authors: Mingxuan Zhu, Dan Hao, Junjie Chen

Abstract: Widely used compilers like GCC and LLVM usually have hundreds of optimizations controlled by optimization flags, which are enabled or disabled during compilation to improve runtime performance (e.g., small execution time) of the compiled program. Due to the large number of optimization flags and their combinations, it is difficult for compiler users to manually tune compiler optimization flags. In the literature, a number of auto-tuning techniques have been proposed, which tune optimization flags for a compiled program by comparing its actual runtime performance with different optimization flag combinations. Due to the huge search space and heavy actual runtime cost, these techniques suffer from the widely-recognized efficiency problem. To reduce the heavy runtime cost, in this paper we propose a lightweight learning approach which uses a small number of actual runtime performance data to predict the runtime performance of a compiled program with various optimization flag combinations. Furthermore, to reduce the search space, we design a novel particle swarm algorithm which tunes compiler optimization flags with the prediction model. To evaluate the performance of the proposed approach CompTuner, we conduct an extensive experimental study on two popular C compilers GCC and LLVM with two widely used benchmarks cBench and PolyBench. The experimental results show that CompTuner significantly outperforms the five compared techniques, including the state-of-the-art technique BOCA.

Submitted 27 April, 2023; originally announced April 2023.

arXiv:2304.13964 [pdf, other]

doi 10.1093/mnras/stad1281

FAST discovery of long tidal tails in NGC 4490/85

Authors: Yao Liu, Ming Zhu, Haiyang Yu, Mei Ai, Peng Jiang, Siqi Liu, Ruilei Zhou, Lixia Yuan

Abstract: We report the discovery of a 100 kpc HI tail in the merging galaxy pair NGC 4490/85 detected by the Five-Hundred-meter Aperture Spherical radio Telescope (FAST). The tidal tails extend in both the south and north directions, and they are much longer than those reported previously based on the VLA interferometric maps. The NGC 4490/85 pair is surrounded by a large gas envelope, and a starburst low metallicity dwarf galaxy MAPS 1231+42 is found to be connected with the gas envelope, indicating that galaxy interaction triggered the intense star formation in it. Based on the fact that the metallicity in MAPS 1231+42 is one order of magnitude lower than that in the two disks of NGC 4490 and NGC 4485, we speculate that the gas near this galaxy should be primordial and could be due to gas inflow from the circum-galactic medium (CGM). We also found a collimated gas component pointing at a nearby dwarf galaxy KK 149, suggesting that this galaxy might also be interacting with the NGC 4490 pair. We discuss the possible origin of the long tidal tails and the extended gas envelope in this merging system based on the new data from FAST.

Submitted 28 April, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

Comments: 10 pages, 8 figures, 1 table. Accepted by MNRAS. The raw data used in the article will be published on the FAST website: https://fast.bao.ac.cn. The PID is N2021_4. Please contact the author ([email protected], [email protected]) for processed data

arXiv:2304.13174 [pdf, other]

Dynamic Datasets and Market Environments for Financial Reinforcement Learning

Authors: Xiao-Yang Liu, Ziyi Xia, Hongyang Yang, Jiechao Gao, Daochen Zha, Ming Zhu, Christina Dan Wang, Zhaoran Wang, Jian Guo

Abstract: The financial market is a particularly challenging playground for deep reinforcement learning due to its unique feature of dynamic datasets. Building high-quality market environments for training financial reinforcement learning (FinRL) agents is difficult due to major factors such as the low signal-to-noise ratio of financial data, survivorship bias of historical data, and model overfitting. In this paper, we present FinRL-Meta, a data-centric and openly accessible library that processes dynamic datasets from real-world markets into gym-style market environments and has been actively maintained by the AI4Finance community. First, following a DataOps paradigm, we provide hundreds of market environments through an automatic data curation pipeline. Second, we provide homegrown examples and reproduce popular research papers as stepping stones for users to design new trading strategies. We also deploy the library on cloud platforms so that users can visualize their own results and assess the relative performance via community-wise competitions. Third, we provide dozens of Jupyter/Python demos organized into a curriculum and a documentation website to serve the rapidly growing community. The open-source codes for the data curation pipeline are available at https://github.com/AI4Finance-Foundation/FinRL-Meta

Submitted 25 April, 2023; originally announced April 2023.

Comments: 49 pages, 15 figures. arXiv admin note: substantial text overlap with arXiv:2211.03107

arXiv:2304.12485 [pdf, other]

doi 10.1103/PhysRevB.108.L201111

Interplay between atomic fluctuations and charge density waves in La$_{2-x}$Sr$_{x}$CuO$_{4}$

Authors: L. Shen, V. Esposito, N. G. Burdet, M. Zhu, A. N. Petsch, T. P. Croft, S. P. Collins, Z. Ren, F. Westermeier, M. Sprung, S. M. Hayden, J. J. Turner, E. Blackburn

Abstract: In the cuprate superconductors, the spatial coherence of the charge density wave (CDW) state grows rapidly below a characteristic temperature $T_\mathrm{CDW}$, the nature of which is debated. We have combined a set of x-ray scattering techniques to study La$_{1.88}$Sr$_{0.12}$CuO$_{4}$ ($T_\mathrm{CDW}$~$\approx$~80\,K) to shed light on this discussion. We observe the emergence of a crystal structure, which is consistent with the CDW modulation in symmetry, well above $T_\mathrm{CDW}$. This global structural change also induces strong fluctuations of local atomic disorder in the intermediate temperature region. At $T_\mathrm{CDW}$, the temperature dependence of this structure develops a kink, while the atomic disorder is minimized. We find that the atomic relaxation dynamics cross over from a cooperative to an incoherent response at $T_\mathrm{CDW}$. These results reveal a rich interplay between the CDWs and atomic fluctuations of distinct spatio-temporal scales. For example, the CDW coherence is enhanced on quasi-elastic timescales by incoherent atomic relaxation.

Submitted 24 April, 2023; originally announced April 2023.

arXiv:2304.11895 [pdf, other]

doi 10.1088/1674-4527/acd58e

Performance of FAST with an Ultra-Wide Bandwidth Receiver at 500-3300 MHz

Authors: Chuan-Peng Zhang, Peng Jiang, Ming Zhu, Jun Pan, Cheng Cheng, Hong-Fei Liu, Yan Zhu, Chun Sun, FAST Collaboration

Abstract: The Five-hundred-meter Aperture Spherical radio Telescope (FAST) has been running for several years. A new Ultra-Wide Bandwidth (UWB) receiver, simultaneously covering 500-3300 MHz, has been mounted in the FAST feed cabin and passed a series of observational tests. The whole UWB band is separated into four independent bands. Each band has 1048576 channels in total, resulting in a spectral resolution of 1 kHz. At 500-3300 MHz, the antenna gain is around 14.3-7.7 K/Jy, the aperture efficiency is around 0.56-0.30, the system temperature is around 88-130 K, and the HPBW is around 7.6-1.6 arcmin. The measured standard deviation of pointing accuracy is better than ~7.9 arcsec when the zenith angle (ZA) is within 26.4 deg. Spectral observations, e.g., of HI and OH, confirm that the sensitivity and stability of the UWB receiver meet expectations. The FAST UWB receiver already performs well for sensitive observations across various scientific goals.

Submitted 24 June, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

Comments: 11 pages, 7 figures, 2 tables, published in Research in Astronomy and Astrophysics

Journal ref: Res. Astron. Astrophys. 23 075016 (2023)

arXiv:2304.11823 [pdf, other]

Enhancing Fine-Tuning Based Backdoor Defense with Sharpness-Aware Minimization

Authors: Mingli Zhu, Shaokui Wei, Li Shen, Yanbo Fan, Baoyuan Wu

Abstract: Backdoor defense, which aims to detect or mitigate the effect of malicious triggers introduced by attackers, is becoming increasingly critical for machine learning security and integrity. Fine-tuning based on benign data is a natural defense to erase the backdoor effect in a backdoored model. However, recent studies show that, given limited benign data, vanilla fine-tuning has poor defense performance. In this work, we provide a deep study of fine-tuning the backdoored model from the neuron perspective and find that backdoor-related neurons fail to escape the local minimum in the fine-tuning process. Inspired by observing that the backdoor-related neurons often have larger norms, we propose FTSAM, a novel backdoor defense paradigm that aims to shrink the norms of backdoor-related neurons by incorporating sharpness-aware minimization with fine-tuning. We demonstrate the effectiveness of our method on several benchmark datasets and network architectures, where it achieves state-of-the-art defense performance. Overall, our work provides a promising avenue for improving the robustness of machine learning models against backdoor attacks.

Submitted 24 April, 2023; originally announced April 2023.

arXiv:2304.07378 [pdf]

doi 10.1109/TMTT.2023.3293054

A Reconfigurable Linear RF Analog Processor for Realizing Microwave Artificial Neural Network

Authors: Minning Zhu, Tzu-Wei Kuo, Chung-Tse Michael Wu

Abstract: Owing to the data explosion and rapid development of artificial intelligence (AI), particularly deep neural networks (DNNs), the ever-increasing demand for large-scale matrix-vector multiplication has become one of the major issues in machine learning (ML). Training and evaluating such neural networks rely on heavy computational resources, resulting in significant system latency and power consumption. To overcome these issues, analog computing using optical interferometric-based linear processors has recently emerged as a promising candidate for accelerating matrix-vector multiplication and lowering power consumption. On the other hand, radio frequency (RF) electromagnetic waves can exhibit similar advantages to their optical counterpart by performing analog computation at light speed with lower power. Furthermore, RF devices have extra benefits such as lower cost, mature fabrication, and analog-digital mixed design simplicity, which have great potential in realizing an affordable, scalable, low-latency, low-power, near-sensor radio frequency neural network (RFNN) that may greatly enrich RF signal processing capability. In this work, we propose a 2X2 reconfigurable linear RF analog processor in theory and experiment, which can be applied as a matrix multiplier in an artificial neural network (ANN). The proposed device can be utilized to realize a simple 2X2 RFNN for data classification. An 8X8 linear analog processor formed by 28 RFNN devices is also applied in a 4-layer ANN for Modified National Institute of Standards and Technology (MNIST) dataset classification.

Submitted 24 July, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

Comments: 11 pages, 16 figures

arXiv:2304.06627 [pdf, other]

CoSDA: Continual Source-Free Domain Adaptation

Authors: Haozhe Feng, Zhaorui Yang, Hesun Chen, Tianyu Pang, Chao Du, Minfeng Zhu, Wei Chen, Shuicheng Yan

Abstract: Without access to the source data, source-free domain adaptation (SFDA) transfers knowledge from a source-domain trained model to target domains. Recently, SFDA has gained popularity due to the need to protect the data privacy of the source domain, but it suffers from catastrophic forgetting on the source domain due to the lack of data. To systematically investigate the mechanism of catastrophic forgetting, we first reimplement previous SFDA approaches within a unified framework and evaluate them on four benchmarks. We observe that there is a trade-off between adaptation gain and forgetting loss, which motivates us to design a consistency regularization to mitigate forgetting. In particular, we propose a continual source-free domain adaptation approach named CoSDA, which employs a dual-speed optimized teacher-student model pair and is equipped with consistency learning capability. Our experiments demonstrate that CoSDA outperforms state-of-the-art approaches in continuous adaptation. Notably, our CoSDA can also be integrated with other SFDA methods to alleviate forgetting.

Submitted 13 April, 2023; originally announced April 2023.

Comments: 15 pages, 6 figures

arXiv:2304.05749

Boosting long-term forecasting performance for continuous-time dynamic graph networks via data augmentation

Authors: Yuxing Tian, Mingjie Zhu, Jiachi Luo, Song Li

Abstract: This study focuses on long-term forecasting (LTF) on continuous-time dynamic graph networks (CTDGNs), which is important for real-world modeling. Existing CTDGNs are effective for modeling temporal graph data due to their ability to capture complex temporal dependencies but perform poorly on LTF due to the substantial requirement for historical data, which is not practical in most cases. To relieve this problem, the most intuitive approach is data augmentation. In this study, we propose Uncertainty Masked MixUp (UmmU): a plug-and-play module that conducts uncertainty estimation to introduce uncertainty into the embedding of the intermediate layer of CTDGNs, and performs masked mixup to further enhance the uncertainty of the embedding so that it generalizes to more situations. UmmU can be easily inserted into arbitrary CTDGNs without increasing the number of parameters. We conduct comprehensive experiments on three real-world dynamic graph datasets; the results demonstrate that UmmU can effectively improve the long-term forecasting performance of CTDGNs.

Submitted 26 May, 2024; v1 submitted 12 April, 2023; originally announced April 2023.

Comments: Recent evidence has emerged that directly contradicts the main findings of our paper, particularly those in Section 4. We need to conduct further research to address these discrepancies

arXiv:2304.05011 [pdf, other]

Towards an Understanding and Explanation for Mixed-Initiative Artificial Scientific Text Detection

Authors: Luoxuan Weng, Minfeng Zhu, Kam Kwai Wong, Shi Liu, Jiashun Sun, Hang Zhu, Dongming Han, Wei Chen

Abstract: Large language models (LLMs) have gained popularity in various fields for their exceptional capability of generating human-like text. Their potential misuse has raised social concerns about plagiarism in academic contexts. However, effective artificial scientific text detection is a non-trivial task due to several challenges, including 1) the lack of a clear understanding of the differences between machine-generated and human-written scientific text, 2) the poor generalization performance of existing methods caused by out-of-distribution issues, and 3) the limited support for human-machine collaboration with sufficient interpretability during the detection process. In this paper, we first identify the critical distinctions between machine-generated and human-written scientific text through a quantitative experiment. Then, we propose a mixed-initiative workflow that combines human experts' prior knowledge with machine intelligence, along with a visual analytics prototype to facilitate efficient and trustworthy scientific text detection. Finally, we demonstrate the effectiveness of our approach through two case studies and a controlled user study with proficient researchers. We also provide design implications for interactive artificial text detection tools in high-stakes decision-making scenarios.

Submitted 11 April, 2023; originally announced April 2023.

arXiv:2304.04741 [pdf, other]

Nanophotonic cavity cooling of a single atom

Authors: Chenwei Lv, Ming Zhu, Sambit Banerjee, Chen-Lung Hung

Abstract: We investigate external and internal dynamics of a two-level atom strongly coupled to a weakly pumped nanophotonic cavity. We calculate the dipole force, friction force, and stochastic force due to the cavity pump field, and show that a three-dimensional cooling region exists near the surface of a cavity. Using a two-color evanescent field trap as an example, we perform three-dimensional Monte-Carlo simulations to demonstrate efficient loading of single atoms into a trap by momentum diffusion, and the stability of cavity cooling near the trap center. Our analyses show that cavity cooling can be a promising method for directly loading cold atoms from free-space into a surface micro-trap. We further discuss the impact of pump intensity on atom trapping and loading efficiency.

Submitted 10 April, 2023; originally announced April 2023.

Comments: 14 pages, 11 figures, 1 table

arXiv:2304.03917 [pdf]

MC-MLP: Multiple Coordinate Frames in all-MLP Architecture for Vision

Authors: Zhimin Zhu, Jianguo Zhao, Tong Mu, Yuliang Yang, Mengyu Zhu

Abstract: In deep learning, Multi-Layer Perceptrons (MLPs) have once again garnered attention from researchers. This paper introduces MC-MLP, a general MLP-like backbone for computer vision that is composed of a series of fully-connected (FC) layers. In MC-MLP, we propose that the same semantic information has varying levels of difficulty in learning, depending on the coordinate frame of features. To address this, we perform an orthogonal transform on the feature information, equivalent to changing the coordinate frame of features. Through this design, MC-MLP is equipped with multi-coordinate frame receptive fields and the ability to learn information across different coordinate frames. Experiments demonstrate that MC-MLP outperforms most MLPs in image classification tasks, achieving better performance at the same parameter level. The code will be available at: https://github.com/ZZM11/MC-MLP.

Submitted 8 April, 2023; originally announced April 2023.

arXiv:2304.03556 [pdf]

Construction of unbiased dental template and parametric dental model for precision digital dentistry

Authors: Lei Ma, Jingyang Zhang, Ke Deng, Peng Xue, Zhiming Cui, Yu Fang, Minhui Tang, Yue Zhao, Min Zhu, Zhongxiang Ding, Dinggang Shen

Abstract: Dental templates and parametric dental models are important tools for various applications in digital dentistry. However, constructing an unbiased dental template and accurate parametric dental models remains a challenging task due to the complex anatomical and morphological dental structures and the low volume ratio of the teeth. In this study, we develop an unbiased dental template by constructing an accurate dental atlas from CBCT images with the guidance of teeth segmentation. First, to address the challenges, we propose to enhance the CBCT images and their segmentation images, including image cropping, image masking and segmentation intensity reassigning. Then, we further use the segmentation images to perform co-registration with the CBCT images to generate an accurate dental atlas, from which an unbiased dental template can be generated. By leveraging the unbiased dental template, we construct parametric dental models by estimating point-to-point correspondences between the dental models and employing Principal Component Analysis to determine shape subspaces of the parametric dental models. A total of 159 CBCT images of real subjects are collected to perform the constructions. Experimental results demonstrate the effectiveness of our proposed method in constructing an unbiased dental template and parametric dental model. The developed dental template and parametric dental models are available at https://github.com/Marvin0724/Teeth_template.

Submitted 7 April, 2023; originally announced April 2023.

arXiv:2304.01665 [pdf, other]

Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks

Authors: Yixuan Weng, Minjun Zhu, Fei Xia, Bin Li, Shizhu He, Kang Liu, Jun Zhao

Abstract: Language models' (LMs) proficiency in handling deterministic symbolic reasoning and rule-based tasks remains limited due to their dependence on implicit learning from textual data. To endow LMs with genuine rule comprehension abilities, we propose "Neural Comprehension" - a framework that synergistically integrates compiled neural networks (CoNNs) into the standard transformer architecture. CoNNs are neural modules designed to explicitly encode rules through artificially generated attention weights. By incorporating CoNN modules, the Neural Comprehension framework enables LMs to accurately and robustly execute rule-intensive symbolic tasks. Extensive experiments demonstrate the superiority of our approach over existing techniques in terms of length generalization, efficiency, and interpretability for symbolic operations. Furthermore, it can be applied to LMs across different model scales, outperforming tool-calling methods in arithmetic reasoning tasks while maintaining superior inference efficiency. Our work highlights the potential of seamlessly unifying explicit rule learning via CoNNs and implicit pattern learning in LMs, paving the way for true symbolic comprehension capabilities.

Submitted 9 March, 2024; v1 submitted 4 April, 2023; originally announced April 2023.

Comments: Accepted in ICLR 2024

arXiv:2304.01453 [pdf, ps, other]

doi 10.1007/s00208-024-02824-w

Linear stability of compact shrinking Ricci solitons

Authors: Huai-Dong Cao, Meng Zhu

Abstract: In this paper, we continue investigating the second variation of Perelman's $ν$-entropy for compact shrinking Ricci solitons. In particular, we improve some of our previous work in "H.-D. Cao and M. Zhu, Math. Ann. 353 (2012), No. 3, 747-763", as well as the more recent work in "M. Mehrmohamadi and A. Razavi, arXiv:2104.08343", and obtain a necessary and sufficient condition for a compact shrinking Ricci soliton to be linearly stable. Our work also extends similar results of Hamilton, Ilmanen and the first author in "arXiv:math.DG/0404165" (see also "H.-D. Cao and C. He, J. Reine Angew. Math. 2015 (2015), no. 709, 229-246.") for positive Einstein manifolds to the compact shrinking Ricci soliton case.

Submitted 2 February, 2024; v1 submitted 3 April, 2023; originally announced April 2023.

Comments: Final version; to appear in Math. Ann

Journal ref: Math. Ann. (2024)

arXiv:2304.01114 [pdf, other]

Associating Spatially-Consistent Grouping with Text-supervised Semantic Segmentation

Authors: Yabo Zhang, Zihao Wang, Jun Hao Liew, Jingjia Huang, Manyu Zhu, Jiashi Feng, Wangmeng Zuo

Abstract: In this work, we investigate performing semantic segmentation solely through training on image-sentence pairs. Due to the lack of dense annotations, existing text-supervised methods can only learn to group an image into semantic regions via pixel-insensitive feedback. As a result, their grouped results are coarse and often contain small spurious regions, limiting the upper-bound performance of segmentation. On the other hand, we observe that grouped results from self-supervised models are more semantically consistent and break the bottleneck of existing methods. Motivated by this, we propose associating self-supervised spatially-consistent grouping with text-supervised semantic segmentation. Considering the part-like grouped results, we further adapt a text-supervised model from image-level to region-level recognition with two core designs. First, we encourage fine-grained alignment with a one-way noun-to-region contrastive loss, which reduces the mismatched noun-region pairs. Second, we adopt a contextually aware masking strategy to enable simultaneous recognition of all grouped regions. Coupled with spatially-consistent grouping and region-adapted recognition, our method achieves 59.2% mIoU and 32.4% mIoU on Pascal VOC and Pascal Context benchmarks, significantly surpassing the state-of-the-art methods.

Submitted 3 April, 2023; originally announced April 2023.

arXiv:2303.15651 [pdf, other]

4D Panoptic Segmentation as Invariant and Equivariant Field Prediction

Authors: Minghan Zhu, Shizhong Han, Hong Cai, Shubhankar Borse, Maani Ghaffari, Fatih Porikli

Abstract: In this paper, we develop rotation-equivariant neural networks for 4D panoptic segmentation. 4D panoptic segmentation is a benchmark task for autonomous driving that requires recognizing semantic classes and object instances on the road based on LiDAR scans, as well as assigning temporally consistent IDs to instances across time. We observe that the driving scenario is symmetric to rotations on the ground plane. Therefore, rotation-equivariance could provide better generalization and more robust feature learning. Specifically, we review the object instance clustering strategies and restate the centerness-based approach and the offset-based approach as the prediction of invariant scalar fields and equivariant vector fields. Other sub-tasks are also unified from this perspective, and different invariant and equivariant layers are designed to facilitate their predictions. Through evaluation on the standard 4D panoptic segmentation benchmark of SemanticKITTI, we show that our equivariant models achieve higher accuracy with lower computational costs compared to their non-equivariant counterparts. Moreover, our method sets the new state-of-the-art performance and achieves 1st place on the SemanticKITTI 4D Panoptic Segmentation leaderboard.

Submitted 12 September, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

Comments: 13 pages. Accepted at ICCV 2023

arXiv:2303.14598 [pdf, other]

Dynamic Game of the Dual-Channel Supply Chain Under a Carbon Subsidy Policy

Authors: Yi Tian, Li Zhao, Meihong Zhu

Abstract: This study investigates the dynamic game behaviors of dual-channel supply chains involving an oligopoly manufacturer selling low-carbon products to online and offline retailers. The price game models under government subsidy are discussed under three scenarios: (1) simultaneous decision, (2) manufacturer dominates the market, and (3) retailer dominates the market. The equilibrium strategies are compared under the government subsidy policy. Using numerical simulation, complex characteristics of the dual-channel supply chain under the carbon subsidy policy are investigated. The complexity of the wholesale price and sales commission of each channel is analyzed by bifurcation, largest Lyapunov exponent and basin of attraction diagrams. Furthermore, parameter adjustment and delayed feedback control methods are proven to be effective approaches to chaos control.

Submitted 25 March, 2023; originally announced March 2023.

arXiv:2303.13087 [pdf, other]

Robust Generalization against Photon-Limited Corruptions via Worst-Case Sharpness Minimization

Authors: Zhuo Huang, Miaoxi Zhu, Xiaobo Xia, Li Shen, Jun Yu, Chen Gong, Bo Han, Bo Du, Tongliang Liu

Abstract: Robust generalization aims to tackle the most challenging data distributions which are rare in the training set and contain severe noises, i.e., photon-limited corruptions. Common solutions such as distributionally robust optimization (DRO) focus on the worst-case empirical risk to ensure low training error on the uncommon noisy distributions. However, due to the over-parameterized model being optimized on scarce worst-case data, DRO fails to produce a smooth loss landscape, thus struggling to generalize well to the test set. Therefore, instead of focusing on the worst-case risk minimization, we propose SharpDRO by penalizing the sharpness of the worst-case distribution, which measures the loss changes in the neighborhood of the learning parameters. Through worst-case sharpness minimization, the proposed method successfully produces a flat loss curve on the corrupted distributions, thus achieving robust generalization. Moreover, by considering whether the distribution annotation is available, we apply SharpDRO to two problem settings and design a worst-case selection process for robust generalization. Theoretically, we show that SharpDRO has a great convergence guarantee. Experimentally, we simulate photon-limited corruptions using CIFAR10/100 and ImageNet30 datasets and show that SharpDRO exhibits a strong generalization ability against severe corruptions and exceeds well-known baseline methods with large performance gains.

Submitted 23 March, 2023; originally announced March 2023.

Comments: CVPR 2023

arXiv:2303.12292 [pdf, ps, other]

On explicit birational geometry for polarised varieties

Authors: Minzhe Zhu

Abstract: In this paper, we investigate the explicit birational geometry for projective $ε$-lc varieties polarised by nef and big Weil divisors. We show that if $X$ is a projective $ε$-lc variety, $H$ is a nef and big Weil divisor with $\dim\overline{\varphi_{H}(X)}\geq n-1$ and $L$ is an effective Weil divisor such that $|L-K_X|\neq \emptyset$ or $L-K_X$ is nef, then we can find an explicit lower bound of $\text{vol}(H)$ and prove that $|L+m^\prime H|$ is birational for $m^\prime\geq m$, where $m$ is an explicit number which depends only on $n$ and $ε$. This result can be applied to polarised Calabi-Yau varieties, Fano varieties and varieties of general type, generalizing the results in [CEW22] and [Zhu23].

Submitted 21 March, 2023; originally announced March 2023.

Comments: 15 pages, comments are welcome!

MSC Class: 14J32; 14J40; 14J45

arXiv:2303.10036 [pdf, other]

Individual differences in knowledge network navigation

Authors: Manran Zhu, Taha Yasseri, János Kertész

Abstract: With the rapid accumulation of online information, efficient web navigation has grown vital yet challenging. To create an easily navigable cyberspace catering to diverse demographics, understanding how people navigate differently is paramount. While previous research has unveiled individual differences in spatial navigation, studies of such differences in knowledge space navigation remain sparse. To bridge this gap, we conducted an online experiment where participants played a navigation game on Wikipedia and completed personal information questionnaires. Our analysis shows that age negatively affects knowledge space navigation performance, while multilingualism enhances it. Under time pressure, participants' performance improves across trials and males outperform females, an effect not observed in games without time pressure. In our experiment, successful route-finding is usually not related to abilities of innovative exploration of routes. Our results underline the importance of age, multilingualism and time constraints in knowledge space navigation.

Submitted 19 March, 2024; v1 submitted 17 March, 2023; originally announced March 2023.

Comments: 14 pages, 4 figures

arXiv:2303.07657 [pdf, other]

Code Will Tell: Visual Identification of Ponzi Schemes on Ethereum

Authors: Xiaolin Wen, Kim Siang Yeo, Yong Wang, Ling Cheng, Feida Zhu, Min Zhu

Abstract: Ethereum has become a popular blockchain with smart contracts for investors nowadays. Due to the decentralization and anonymity of Ethereum, Ponzi schemes have been easily deployed and caused significant losses to investors. However, there are still no explainable and effective methods to help investors easily identify Ponzi schemes and validate whether a smart contract is actually a Ponzi scheme. To fill the research gap, we propose PonziLens, a novel visualization approach to help investors achieve early identification of Ponzi schemes by investigating the operation codes of smart contracts. Specifically, we conduct symbolic execution of opcode and extract the control flow for investing and rewarding with critical opcode instructions. Then, an intuitive directed-graph based visualization is proposed to display the investing and rewarding flows and the crucial execution paths, enabling easy identification of Ponzi schemes on Ethereum. Two usage scenarios involving both Ponzi and non-Ponzi schemes demonstrate the effectiveness of PonziLens.

Submitted 14 March, 2023; originally announced March 2023.

arXiv:2303.05139 [pdf, other]

doi 10.1007/978-3-031-27481-7_35

Specification-Guided Critical Scenario Identification for Automated Driving

Authors: Adam Molin, Edgar A. Aguilar, Dejan Ničković, Mengjia Zhu, Alberto Bemporad, Hasan Esen

Abstract: To test automated driving systems, we present a case study for finding critical scenarios in driving environments guided by formal specifications. To that aim, we devise a framework for critical scenario identification, which we base on open-source libraries that combine scenario specification, testing, formal methods, and optimization.

Submitted 9 March, 2023; originally announced March 2023.

Journal ref: Formal Methods. FM 2023. Lecture Notes in Computer Science, vol 14000

arXiv:2303.04778 [pdf, other]

Fourier-MIONet: Fourier-enhanced multiple-input neural operators for multiphase modeling of geological carbon sequestration

Authors: Zhongyi Jiang, Min Zhu, Dongzhuo Li, Qiuzi Li, Yanhua O. Yuan, Lu Lu

Abstract: Geologic Carbon Storage (GCS) is an important technology that aims to reduce the amount of carbon dioxide in the atmosphere. Multiphase flow in porous media is essential to understand CO2 migration and pressure fields in the subsurface associated with GCS. However, numerical simulation for such problems in 4D is computationally challenging and expensive, due to the multiphysics and multiscale nature of the highly nonlinear governing partial differential equations (PDEs). It prevents us from considering multiple subsurface scenarios and conducting real-time optimization. Here, we develop a Fourier-enhanced multiple-input neural operator (Fourier-MIONet) to learn the solution operator of the problem of multiphase flow in porous media. Fourier-MIONet utilizes the recently developed framework of the multiple-input deep neural operators (MIONet) and incorporates the Fourier neural operator (FNO) in the network architecture. Once Fourier-MIONet is trained, it can predict the evolution of saturation and pressure of the multiphase flow under various reservoir conditions, such as permeability and porosity heterogeneity, anisotropy, injection configurations, and multiphase flow properties. Compared to the enhanced FNO (U-FNO), the proposed Fourier-MIONet has 90% fewer unknown parameters, and it can be trained in significantly less time (about 3.5 times faster) with much lower CPU memory (< 15%) and GPU memory (< 35%) requirements, to achieve similar prediction accuracy.
In addition to the lower computational cost, Fourier-MIONet can be trained with only 6 snapshots of time to predict the PDE solutions for 30 years. The excellent generalizability of Fourier-MIONet is enabled by its adherence to the physical principle that the solution to a PDE is continuous over time.

Submitted 8 March, 2023; originally announced March 2023.

arXiv:2303.04549 [pdf]

doi 10.1038/s41586-024-07023-w

Observation of plaid-like spin splitting in a noncoplanar antiferromagnet

Authors: Yu-Peng Zhu, Xiaobing Chen, Xiang-Rui Liu, Yuntian Liu, Pengfei Liu, Heming Zha, Gexing Qu, Caiyun Hong, Jiayu Li, Zhicheng Jiang, Xiao-Ming Ma, Yu-Jie Hao, Ming-Yuan Zhu, Wenjing Liu, Meng Zeng, Sreehari Jayaram, Malik Lenger, Jianyang Ding, Shu Mo, Kiyohisa Tanaka, Masashi Arita, Zhengtai Liu, Mao Ye, Dawei Shen, Jörg Wrachtrup, et al. (5 additional authors not shown)

Abstract: Spatial, momentum and energy separation of electronic spins in condensed matter systems guides the development of novel devices where spin-polarized current is generated and manipulated. Recent attention on a set of previously overlooked symmetry operations in magnetic materials leads to the emergence of a new type of spin splitting, enabling giant and momentum-dependent spin polarization of energy bands on selected antiferromagnets. Despite the ever-growing theoretical predictions, the direct spectroscopic proof of such spin splitting is still lacking. Here, we provide solid spectroscopic and computational evidence for the existence of such materials. In the noncoplanar antiferromagnet MnTe$_2$, the in-plane components of spin are found to be antisymmetric about the high-symmetry planes of the Brillouin zone, comprising a plaid-like spin texture in the antiferromagnetic (AFM) ground state. Such an unconventional spin pattern, further found to diminish at the high-temperature paramagnetic state, stems from the intrinsic AFM order instead of spin-orbit coupling (SOC). Our finding demonstrates a new type of quadratic spin texture induced by time-reversal breaking, placing AFM spintronics on a firm basis and paving the way for studying exotic quantum phenomena in related materials.

Submitted 4 January, 2024; v1 submitted 8 March, 2023; originally announced March 2023.

Comments: Version 3, 49 pages, 4 main figures, 13 extended data figures and 2 extended data tables. Nature in press (2024)

Journal ref: Nature 626, 523-528 (2024)

arXiv:2303.03811 [pdf, other]

Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning

Authors: Pengqin Wang, Meixin Zhu, Shaojie Shen

Abstract: Interacting with the actual environment to acquire data is often costly and time-consuming in robotic tasks. Model-based offline reinforcement learning (RL) provides a feasible solution. On the one hand, it eliminates the requirements of interaction with the actual environment. On the other hand, it learns the transition dynamics and reward function from the offline datasets and generates simulated rollouts to accelerate training. Previous model-based offline RL methods adopt probabilistic ensemble neural networks (NN) to model aleatoric uncertainty and epistemic uncertainty. However, this results in an exponential increase in training time and computing resource requirements. Furthermore, these methods are easily disturbed by the accumulative errors of the environment dynamics models when simulating long-term rollouts. To solve the above problems, we propose an uncertainty-aware sequence modeling architecture called Environment Transformer. It models the probability distribution of the environment dynamics and reward function to capture aleatoric uncertainty and treats epistemic uncertainty as a learnable noise parameter. Benefiting from the accurate modeling of the transition dynamics and reward function, Environment Transformer can be combined with arbitrary planning, dynamic programming, or policy optimization algorithms for offline RL. In this case, we perform Conservative Q-Learning (CQL) to learn a conservative Q-function.
Through simulation experiments, we demonstrate that our method achieves or exceeds state-of-the-art performance in widely studied offline RL benchmarks. Moreover, we show that Environment Transformer's simulated rollout quality, sample efficiency, and long-term rollout simulation capability are superior to those of previous model-based offline RL methods.

Submitted 16 October, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

Comments: ICRA2024

arXiv:2302.14752 [pdf, other]

Multi-Robot-Guided Crowd Evacuation: Two-Scale Modeling and Control

Authors: Tongjia Zheng, Zhenyuan Yuan, Mollik Nayyar, Alan R. Wagner, Minghui Zhu, Hai Lin

Abstract: Emergency evacuation describes a complex situation involving time-critical decision-making by evacuees. Mobile robots are being actively explored as a potential solution to provide timely guidance. In this work, we study a robot-guided crowd evacuation problem where a small group of robots is used to guide a large human crowd to safe locations. The challenge lies in how to use micro-level human-robot interactions to indirectly influence a population that significantly outnumbers the robots to achieve the collective evacuation objective. To address the challenge, we follow a two-scale modeling strategy and explore hydrodynamic models, which consist of a family of microscopic social force models that describe how human movements are locally affected by other humans, the environment, and robots, and associated macroscopic equations for the temporal and spatial evolution of the crowd density and flow velocity. We design controllers for the robots such that they not only automatically explore the environment (with unknown dynamic obstacles) to cover it as much as possible, but also dynamically adjust the directions of their local navigation force fields based on the real-time macrostates of the crowd to guide the crowd to a safe location. We prove the stability of the proposed evacuation algorithm and conduct extensive simulations to investigate the performance of the algorithm with different combinations of human numbers, robot numbers, and obstacle settings.

Submitted 11 January, 2024; v1 submitted 28 February, 2023; originally announced February 2023.

arXiv:2302.13704 [pdf, other]

Higher-order correction to weak-field lensing of an Ellis-Bronnikov wormhole

Authors: Tingqi Cai, Hyat Huang, Zun Wang, Mian Zhu

Abstract: The gravitational lensing effect at higher order under weak-field approximation is believed to be important to distinguish black holes and other compact objects such as wormholes. The deflection angle of a generic wormhole is difficult to solve analytically; thus approximation methods are implemented. In this paper, we investigate the weak-field deflection angle of a specific wormhole, the Ellis-Bronnikov wormhole, up to the 1/b^4 order. We use different approximation formalisms, study their precision at 1/b^4 order by a comparison to a purely numerical result, and finally rank these formalisms by their accuracy. Moreover, we find that certain formalisms are sensitive to the choice of coordinate system; thus it is important to choose the coordinate system appropriately when evaluating lensing physics.

Submitted 1 August, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

Comments: PRD published version
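
Note: the comparison of a truncated 1/b^4 expansion against a numerical reference, as described in the abstract above, can be illustrated with a minimal sketch. It is not taken from the paper. It assumes the standard Ellis-Bronnikov line element ds^2 = -dt^2 + dr^2 + (r^2 + a^2) dΩ^2 with throat radius a and impact parameter b > a, for which the deflection angle has the textbook closed form 2K(a/b) - π, where K is the complete elliptic integral of the first kind. The function names are hypothetical, and K is evaluated with the arithmetic-geometric mean so that only the standard library is needed.

```python
import math


def ellipk(k: float) -> float:
    """Complete elliptic integral of the first kind K(k) for modulus 0 <= k < 1."""
    if not 0.0 <= k < 1.0:
        raise ValueError("modulus k must satisfy 0 <= k < 1")
    a, g = 1.0, math.sqrt(1.0 - k * k)
    # K(k) = pi / (2 * AGM(1, sqrt(1 - k^2))); the AGM converges quadratically,
    # so a fixed, generous iteration count is enough in double precision.
    for _ in range(40):
        a, g = 0.5 * (a + g), math.sqrt(a * g)
    return math.pi / (2.0 * a)


def deflection_exact(throat: float, b: float) -> float:
    """Closed-form deflection angle for the assumed metric (requires b > throat)."""
    return 2.0 * ellipk(throat / b) - math.pi


def deflection_series(throat: float, b: float) -> float:
    """Weak-field expansion of the same angle, truncated after the 1/b^4 term."""
    k2 = (throat / b) ** 2
    return math.pi * (k2 / 4.0 + 9.0 * k2 * k2 / 64.0)


if __name__ == "__main__":
    for b in (2.0, 5.0, 10.0, 100.0):
        exact = deflection_exact(1.0, b)
        approx = deflection_series(1.0, b)
        print(f"b = {b:6.1f}  exact = {exact:.3e}  series = {approx:.3e}  "
              f"residual = {exact - approx:.3e}")
```

The residual shrinks roughly like (a/b)^6, which is the first term dropped by the truncation. The coordinate-dependence issue raised in the abstract is not addressed by this sketch.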

arXiv:2302.11774 [pdf, other]

Semantic-Fused Multi-Granularity Cross-City Traffic Prediction

Authors: Kehua Chen, Yuxuan Liang, Jindong Han, Siyuan Feng, Meixin Zhu, Hai Yang

Abstract: Accurate traffic prediction is essential for effective urban management and the improvement of transportation efficiency. Recently, data-driven traffic prediction methods have been widely adopted, with better performance than traditional approaches. However, they often require large amounts of data for effective training, which becomes challenging given the prevalence of data scarcity in regions with inadequate sensing infrastructures. To address this issue, we propose a Semantic-Fused Multi-Granularity Transfer Learning (SFMGTL) model to achieve knowledge transfer across cities with fused semantics at different granularities. In detail, we design a semantic fusion module to fuse various semantics while conserving static spatial dependencies via reconstruction losses. Then, a fused graph is constructed based on node features through graph structure learning. Afterwards, we implement hierarchical node clustering to generate graphs with different granularity. To extract feasible meta-knowledge, we further introduce common and private memories and obtain domain-invariant features via adversarial training. It is worth noting that our work jointly addresses semantic fusion and multi-granularity issues in transfer learning. We conduct extensive experiments on six real-world datasets to verify the effectiveness of our SFMGTL model by comparing it with other state-of-the-art baselines. Afterwards, we also perform ablation and case studies, demonstrating that our model possesses substantially fewer parameters compared to baseline models.
Moreover, we illustrate how knowledge transfer aids the model in accurately predicting demands, especially during peak hours. The codes can be found at https://github.com/zeonchen/SFMGTL.

Submitted 31 March, 2024; v1 submitted 22 February, 2023; originally announced February 2023.

arXiv:2302.11085 [pdf, other]

Learning to Generalize Provably in Learning to Optimize

Authors: Junjie Yang, Tianlong Chen, Mingkang Zhu, Fengxiang He, Dacheng Tao, Yingbin Liang, Zhangyang Wang

Abstract: Learning to optimize (L2O) has gained increasing popularity, which automates the design of optimizers by data-driven approaches. However, current L2O methods often suffer from poor generalization performance in at least two folds: (i) applying the L2O-learned optimizer to unseen optimizees, in terms of lowering their loss function values (optimizer generalization, or "generalizable learning of optimizers"); and (ii) the test performance of an optimizee (itself as a machine learning model), trained by the optimizer, in terms of the accuracy over unseen data (optimizee generalization, or "learning to generalize"). While the optimizer generalization has been recently studied, the optimizee generalization (or learning to generalize) has not been rigorously studied in the L2O context, which is the aim of this paper. We first theoretically establish an implicit connection between the local entropy and the Hessian, and hence unify their roles in the handcrafted design of generalizable optimizers as equivalent metrics of the landscape flatness of loss functions. We then propose to incorporate these two metrics as flatness-aware regularizers into the L2O framework in order to meta-train optimizers to learn to generalize, and theoretically show that such generalization ability can be learned during the L2O meta-training process and then transformed to the optimizee loss function. Extensive experiments consistently validate the effectiveness of our proposals with substantially improved generalization on multiple sophisticated L2O models and diverse optimizees.
Our code is available at: https://github.com/VITA-Group/Open-L2O/tree/main/Model_Free_L2O/L2O-Entropy.

Submitted 28 March, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

Comments: This paper is accepted in AISTATS 2023

arXiv:2302.07609 [pdf, other]

DiffSeer: Difference-based Dynamic Weighted Graph Visualization

Authors: Xiaolin Wen, Yong Wang, Meixuan Wu, Fengjie Wang, Xuanwu Yue, Qiaomu Shen, Yuxin Ma, Min Zhu

Abstract: Existing dynamic weighted graph visualization approaches rely on users' mental comparison to perceive temporal evolution of dynamic weighted graphs, hindering users from effectively analyzing changes across multiple timeslices. We propose DiffSeer, a novel approach for dynamic weighted graph visualization by explicitly visualizing the differences of graph structures (e.g., edge weight differences) between adjacent timeslices. Specifically, we present a novel nested matrix design that overviews the graph structure differences over a time period as well as shows graph structure details in the timeslices of user interest. By collectively considering the overall temporal evolution and structure details in each timeslice, an optimization-based node reordering strategy is developed to group nodes with similar evolution patterns and highlight interesting graph structure details in each timeslice. We conducted two case studies on real-world graph datasets and in-depth interviews with 12 target users to evaluate DiffSeer. The results demonstrate its effectiveness in visualizing dynamic weighted graphs.

Submitted 15 February, 2023; originally announced February 2023.

arXiv:2302.05863 [pdf, other]

NFTDisk: Visual Detection of Wash Trading in NFT Markets

Authors: Xiaolin Wen, Yong Wang, Xuanwu Yue, Feida Zhu, Min Zhu

Abstract: With the growing popularity of Non-Fungible Tokens (NFT), a new type of digital assets, various fraudulent activities have appeared in NFT markets. Among them, wash trading has become one of the most common frauds in NFT markets, which attempts to mislead investors by creating fake trading volumes. Due to the sophisticated patterns of wash trading, only a subset of them can be detected by automatic algorithms, and manual inspection is usually required. We propose NFTDisk, a novel visualization for investors to identify wash trading activities in NFT markets, where two linked visualization modules are presented: a radial visualization module with a disk metaphor to overview NFT transactions and a flow-based visualization module to reveal detailed NFT flows at multiple levels. We conduct two case studies and an in-depth user interview with 14 NFT investors to evaluate NFTDisk. The results demonstrate its effectiveness in exploring wash trading activities in NFT markets.

Submitted 12 February, 2023; originally announced February 2023.

arXiv:2302.05855 [pdf]

Investigation of Enhanced Inertial Navigation Algorithms by Functional Iteration

Authors: Hongyan Jiang, Maoran Zhu, Yuanxin Wu

Abstract: The defects of the traditional strapdown inertial navigation algorithms have become well acknowledged, and the corresponding enhanced algorithms have quite recently been proposed to mitigate both theoretical and algorithmic defects. In this paper, the analytical accuracy evaluation of both the traditional algorithms and the enhanced algorithms is investigated against the true reference, for the first time enabled by the functional iteration approach having provable convergence. The analyses with the help of the MATLAB Symbolic Toolbox show that the resultant error orders of all algorithms under investigation are consistent with those in the existing literature, and the enhanced attitude algorithm notably reduces error orders of the traditional counterpart, while the impact of the enhanced velocity algorithm on error order reduction is insignificant. Simulation results agree with analyses that the superiority of the enhanced algorithm over the traditional one in the body-frame attitude computation scenario diminishes significantly in the entire inertial navigation computation scenario, while the functional iteration approach possesses significant accuracy superiority even under sustained low-dynamic conditions.

Submitted 20 March, 2024; v1 submitted 11 February, 2023; originally announced February 2023.

Comments: 12 pages, 3 figs

arXiv:2302.04686 [pdf, other]

Global and Preference-based Optimization with Mixed Variables using Piecewise Affine Surrogates

Authors: Mengjia Zhu, Alberto Bemporad

Abstract: Optimization problems involving mixed variables, i.e., variables of numerical and categorical nature, can be challenging to solve, especially in the presence of complex constraints. Moreover, when the objective function is the result of a complicated simulation or experiment, it may be expensive to evaluate. This paper proposes a novel surrogate-based global optimization algorithm to solve linearly constrained mixed-variable problems up to medium-large size (around 100 variables after encoding and 20 constraints) based on constructing a piecewise affine surrogate of the objective function over feasible samples. We introduce two types of exploration functions to efficiently search the feasible domain via mixed-integer linear programming solvers. We also provide a preference-based version of the algorithm, which can be used when only pairwise comparisons between samples can be acquired while the underlying objective function to minimize remains unquantified. The two algorithms are tested on mixed-variable benchmark problems with and without constraints. The results show that, within a small number of acquisitions, the proposed algorithms can often achieve better or comparable results than other existing methods.

Submitted 7 June, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

Comments: code available at https://github.com/mjzhu-p/PWAS

arXiv:2302.04407 [pdf, other]

Bayesian Non-parametric Hidden Markov Model for Agile Radar Pulse Sequences Streaming Analysis

Authors: Jiadi Bao, Yunjie Li, Mengtao Zhu, Shafei Wang

Abstract: Multi-function radars (MFRs) are sophisticated types of sensors with the capabilities of complex agile inter-pulse modulation implementation and dynamic work mode scheduling. The developments in MFRs pose great challenges to modern electronic reconnaissance systems or radar warning receivers for recognition and inference of MFR work modes. To address this issue, this paper proposes an online processing framework for parameter estimation and change point detection of MFR work modes. First, this paper designs a fully-conjugate Bayesian non-parametric hidden Markov model with a designed prior distribution (agile BNP-HMM) to represent the MFR pulse agility characteristics. The proposed model allows fully-variational Bayesian inference. Then, the proposed framework is constructed from two main parts. The first part is the agile BNP-HMM model for automatically inferring the number of HMM hidden states and emission distribution of the corresponding hidden states. An estimation error lower bound on performance is derived and the proposed algorithm is shown to be close to the bound. The second part utilizes the streaming Bayesian updating to facilitate computation, and designs an online work mode change detection framework based upon a weighted sequential probability ratio test. We demonstrate that the proposed framework is consistently highly effective and robust compared with baseline methods on diverse simulated data-sets.

Submitted 22 August, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

Comments: 15 pages, 10 figures, submitted to IEEE transactions on signal processing
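
Note: the abstract above builds its change detection on a weighted sequential probability ratio test. As a point of reference only, the classical unweighted Wald SPRT for a shift in the mean of Gaussian samples can be sketched as follows. The weighting scheme, the BNP-HMM likelihoods, and the streaming updates of the paper are not reproduced. The function name, the Gaussian model, and the default error rates are illustrative assumptions.

```python
import math
from typing import Iterable, Optional, Tuple


def sprt_gaussian(
    samples: Iterable[float],
    mu0: float,
    mu1: float,
    sigma: float,
    alpha: float = 0.01,
    beta: float = 0.01,
) -> Tuple[Optional[str], int]:
    """Classical Wald SPRT for H0: mean = mu0 against H1: mean = mu1.

    alpha is the target false-alarm rate, beta the target miss rate.
    Returns (decision, samples_used); decision is "H0", "H1", or None
    when the data run out before either threshold is crossed.
    """
    upper = math.log((1.0 - beta) / alpha)   # accept H1 at or above this value
    lower = math.log(beta / (1.0 - alpha))   # accept H0 at or below this value
    llr = 0.0
    used = 0
    for used, x in enumerate(samples, start=1):
        # Log-likelihood ratio of one Gaussian sample with known sigma.
        llr += (mu1 - mu0) * (x - 0.5 * (mu0 + mu1)) / sigma ** 2
        if llr >= upper:
            return "H1", used
        if llr <= lower:
            return "H0", used
    return None, used


if __name__ == "__main__":
    print(sprt_gaussian([1.0] * 20, mu0=0.0, mu1=1.0, sigma=1.0))
    print(sprt_gaussian([0.0] * 20, mu0=0.0, mu1=1.0, sigma=1.0))
```

The test stops as soon as the accumulated evidence is decisive, which is what makes this family of tests attractive for online processing of streaming data.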

arXiv:2302.04387 [pdf, other]

Catch Planner: Catching High-Speed Targets in the Flight

Authors: Huan Yu, Pengqin Wang, Jin Wang, Jialin Ji, Zhi Zheng, Jie Tu, Guodong Lu, Jun Meng, Meixin Zhu, Shaojie Shen, Fei Gao

Abstract: Catching high-speed targets in the flight is a complex and typical highly dynamic task. In this paper, we propose Catch Planner, a planning-with-decision scheme for catching. For sequential decision making, we propose a policy search method based on deep reinforcement learning. In order to make catching adaptive and flexible, we propose a trajectory optimization method to jointly optimize the highly coupled catching time and terminal state while considering the dynamic feasibility and safety. We also propose a flexible constraint transcription method to catch targets at any reasonable attitude and terminal position bias. The proposed Catch Planner provides a new paradigm for the combination of learning and planning and is integrated on the quadrotor designed by ourselves, which runs at 100 Hz on the onboard computer. Extensive experiments are carried out in real and simulated scenes to verify the robustness of the proposed method and its expansibility when facing a variety of high-speed flying targets.

Submitted 26 June, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

Comments: 11 pages, 8 figures, accepted by IEEE/ASME Transactions on Mechatronics

arXiv:2302.04114 [pdf, ps, other]

Resistance Distances in Directed Graphs: Definitions, Properties, and Applications

Authors: Mingzhe Zhu, Liwang Zhu, Huan Li, Wei Li, Zhongzhi Zhang

Abstract: Resistance distance has been studied extensively in the past years, with the majority of previous studies devoted to undirected networks, in spite of the fact that various realistic networks are directed. Although several generalizations of resistance distance on directed graphs have been proposed, they either have no physical interpretation or are not a metric. In this paper, we first extend the definition of resistance distance to strongly connected directed graphs based on random walks and show that the two-node resistance distance on directed graphs is a metric. Then, we introduce the Laplacian matrix for directed graphs that subsumes the Laplacian matrix of undirected graphs as a particular case and use its pseudoinverse to express the two-node resistance distance, and many other relevant quantities derived from resistance distances. Moreover, we define the resistance distance between a vertex and a vertex group on directed graphs and further define a problem of optimally selecting a group of fixed number of nodes, such that their resistance distance is minimized. Since this combinatorial optimization problem is NP-hard, we present a greedy algorithm with a proved approximation ratio, and conduct experiments on model and realistic networks to validate the performance of this approximation algorithm.

Submitted 8 February, 2023; originally announced February 2023.

Comments: Submitted to IEEE Transactions on Information Theory
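
Note: the abstract above generalizes resistance distance to directed graphs. The sketch below covers only the classical undirected, unit-weight case that the directed definition is said to subsume, and it is not the paper's construction. For an undirected graph the two-node resistance distance equals L+[i][i] + L+[j][j] - 2 L+[i][j], where L+ is the pseudoinverse of the Laplacian. To stay within the standard library, the sketch computes the same value by grounding node j and solving the reduced Laplacian system exactly with fractions. The function name and the edge-list input format are illustrative assumptions.

```python
from fractions import Fraction
from typing import List, Sequence, Tuple


def resistance_distance(n: int, edges: Sequence[Tuple[int, int]], i: int, j: int) -> Fraction:
    """Effective resistance between nodes i and j of a connected, undirected,
    unit-weight graph on nodes 0..n-1 (self-loops are not supported)."""
    if i == j:
        return Fraction(0)
    # Laplacian L = D - A.
    lap: List[List[Fraction]] = [[Fraction(0)] * n for _ in range(n)]
    for u, v in edges:
        lap[u][u] += 1
        lap[v][v] += 1
        lap[u][v] -= 1
        lap[v][u] -= 1
    # Ground node j (potential 0) by deleting its row and column, then solve
    # L_reduced * x = e_i: x holds the potentials for a unit current i -> j.
    keep = [k for k in range(n) if k != j]
    m = len(keep)
    aug = [[lap[r][c] for c in keep] + [Fraction(1 if r == i else 0)] for r in keep]
    # Gauss-Jordan elimination with exact rational arithmetic. The reduced
    # Laplacian of a connected graph is nonsingular, so a pivot always exists;
    # a disconnected graph makes next() raise StopIteration.
    for col in range(m):
        pivot = next(r for r in range(col, m) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [x / scale for x in aug[col]]
        for r in range(m):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    # The potential at node i equals the resistance, since node j sits at 0.
    return aug[keep.index(i)][m]


if __name__ == "__main__":
    path = [(0, 1), (1, 2)]              # two unit resistors in series
    triangle = [(0, 1), (1, 2), (0, 2)]  # one resistor in parallel with two in series
    print(resistance_distance(3, path, 0, 2))
    print(resistance_distance(3, triangle, 0, 1))
```

On the path the two edges add in series, and on the triangle a direct edge sits in parallel with a two-edge detour, so the results can be checked by hand with the usual series and parallel rules.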

arXiv:2302.03669 [pdf, other]

Deep Reinforcement Learning for Traffic Light Control in Intelligent Transportation Systems

Authors: Xiao-Yang Liu, Ming Zhu, Sem Borst, Anwar Walid

Abstract: Smart traffic lights in intelligent transportation systems (ITSs) are envisioned to greatly increase traffic efficiency and reduce congestion. Deep reinforcement learning (DRL) is a promising approach to adaptively control traffic lights based on the real-time traffic situation in a road network. However, conventional methods may suffer from poor scalability. In this paper, we investigate deep reinforcement learning to control traffic lights, and both theoretical analysis and numerical experiments show that the intelligent behavior "greenwave" (i.e., a vehicle will see a progressive cascade of green lights, and not have to brake at any intersection) emerges naturally in a grid road network, which is proved to be the optimal policy in an avenue with multiple cross streets. As a first step, we use two DRL algorithms for the traffic light control problems in two scenarios. In a single road intersection, we verify that the deep Q-network (DQN) algorithm delivers a thresholding policy; and in a grid road network, we adopt the deep deterministic policy gradient (DDPG) algorithm. Secondly, numerical experiments show that the DQN algorithm delivers the optimal control, and the DDPG algorithm with passive observations has the capability to produce on its own a high-level intelligent behavior in a grid road network, namely, the "greenwave" policy emerges. We also verify the "greenwave" patterns in a $5 \times 10$ grid road network.
Thirdly, the "greenwave" patterns demonstrate that DRL algorithms produce favorable solutions since the "greenwave" policy shown in experiment results is proved to be optimal in a specified traffic model (an avenue with multiple cross streets). The delivered policies both in a single road intersection and a grid road network demonstrate the scalability of DRL algorithms.

Submitted 5 March, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

Comments: 17 pages

Journal ref: IEEE Transactions on Network Science and Engineering, 2023

Showing 201–250 of 860 results for author: Zhu, M