Search | arXiv e-print repository

doi 10.1103/PhysRevResearch.5.L042043

Dynamical phase transitions of information flow in random quantum circuits

Authors: J. -Z. Zhuang, Y. -K. Wu, L. -M. Duan

Abstract: We study how the information flows in many-body dynamics governed by random quantum circuits and discover a rich set of dynamical phase transitions in this information flow. The phase transition points and their critical exponents are established across Clifford and Haar random circuits through finite-size scaling. The flow of both classical and quantum information, measured respectively by Holevo… ▽ More We study how the information flows in many-body dynamics governed by random quantum circuits and discover a rich set of dynamical phase transitions in this information flow. The phase transition points and their critical exponents are established across Clifford and Haar random circuits through finite-size scaling. The flow of both classical and quantum information, measured respectively by Holevo and coherent information, shows similar dynamical phase transition behaviors. We investigate how the phase transitions depend on the initial location of the information and the final probe region, and find ubiquitous behaviors in these transitions, revealing interesting properties about the information propagation and scrambling in this quantum many-body model. Our work underscores rich behaviors of the information flow in large systems with numerous phase transitions, thereby sheds new light on the understanding of quantum many-body dynamics. △ Less

Submitted 21 November, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

Comments: 11 pages, 12 figures

arXiv:2303.11583 [pdf]

doi 10.1016/j.cpc.2023.108845

pgm: A Python package for free energy calculations within the phonon gas model

Authors: Hong** Wang, **gyi Zhuang, Zhen Zhang, Qi Zhang, Renata M. Wentzcovitch

Abstract: The quasi-harmonic approximation (QHA) is a powerful method that uses the volume dependence of non-interacting phonons to compute the free energy of materials at high pressures (P) and temperatures (T). However, anharmonicity, electronic excitations in metals, or both, introduce an intrinsic T-dependence on phonon frequencies, rendering the QHA inadequate. Here we present a Python code, pgm, to co… ▽ More The quasi-harmonic approximation (QHA) is a powerful method that uses the volume dependence of non-interacting phonons to compute the free energy of materials at high pressures (P) and temperatures (T). However, anharmonicity, electronic excitations in metals, or both, introduce an intrinsic T-dependence on phonon frequencies, rendering the QHA inadequate. Here we present a Python code, pgm, to compute the free energy and thermodynamic property within the phonon gas model (PGM) that uses T-dependent phonon quasiparticle frequencies. In this case, the vibrational contribution to the Helmholtz free energy is obtained by integrating the vibrational entropy, which can be readily calculated for a system of phonon quasiparticles. Other thermodynamic properties are then obtained from standard thermodynamic relations. We demonstrate the successful applications of pgm to two cases of geophysical significance: cubic CaSiO3-perovskite (cCaPv), a strongly anharmonic insulator and the third most abundant phase of the Earth's lower mantle, and NiAs-type (B8) FeO, a partially covalent-metallic system. This is the oxide endmember of a recently discovered iron-rich Fe$_n$O alloy phase likely to exit in the Earth's inner core. △ Less

Submitted 21 March, 2023; originally announced March 2023.

Comments: 26 pages, 9 figures, 5 tables

arXiv:2303.08774 [pdf, other]

GPT-4 Technical Report

Authors: OpenAI, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-Shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko , et al. (256 additional authors not shown)

Abstract: We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo… ▽ More We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based model pre-trained to predict the next token in a document. The post-training alignment process results in improved performance on measures of factuality and adherence to desired behavior. A core component of this project was develo** infrastructure and optimization methods that behave predictably across a wide range of scales. This allowed us to accurately predict some aspects of GPT-4's performance based on models trained with no more than 1/1,000th the compute of GPT-4. △ Less

Submitted 4 March, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

Comments: 100 pages; updated authors list; fixed author names and added citation

arXiv:2303.05476 [pdf]

Full-waveform tomography reveals iron spin crossover in Earth lower mantle

Authors: Laura Cobden, **gyi Zhuang, Wenjie Lei, Renata Wentzcovitch, Jeannot Trampert, Jeroen Tromp

Abstract: Joint interpretation of bulk and shear wave speeds constrains the chemistry of the deep mantle. At all depths, the diversity of wave speeds cannot be explained by an isochemical mantle. Between 1000 and 2500 km depth, hypothetical mantle models containing an electronic spin crossover in (Mg,Fe)O provide a significantly better fit to the wave-speed distributions, as well as more realistic temperatu… ▽ More Joint interpretation of bulk and shear wave speeds constrains the chemistry of the deep mantle. At all depths, the diversity of wave speeds cannot be explained by an isochemical mantle. Between 1000 and 2500 km depth, hypothetical mantle models containing an electronic spin crossover in (Mg,Fe)O provide a significantly better fit to the wave-speed distributions, as well as more realistic temperatures and silica contents, than models without a spin crossover. Below 2500 km, wave speed distributions are explained by enrichment in silica towards the core-mantle-boundary. This silica enrichment may represent the fractionated remains of an ancient basal magma ocean. △ Less

Submitted 9 March, 2023; originally announced March 2023.

arXiv:2303.01047 [pdf, other]

Task-Specific Context Decoupling for Object Detection

Authors: Jiayuan Zhuang, Zheng Qin, Hao Yu, Xucan Chen

Abstract: Classification and localization are two main sub-tasks in object detection. Nonetheless, these two tasks have inconsistent preferences for feature context, i.e., localization expects more boundary-aware features to accurately regress the bounding box, while more semantic context is preferred for object classification. Exsiting methods usually leverage disentangled heads to learn different feature… ▽ More Classification and localization are two main sub-tasks in object detection. Nonetheless, these two tasks have inconsistent preferences for feature context, i.e., localization expects more boundary-aware features to accurately regress the bounding box, while more semantic context is preferred for object classification. Exsiting methods usually leverage disentangled heads to learn different feature context for each task. However, the heads are still applied on the same input features, which leads to an imperfect balance between classifcation and localization. In this work, we propose a novel Task-Specific COntext DEcoupling (TSCODE) head which further disentangles the feature encoding for two tasks. For classification, we generate spatially-coarse but semantically-strong feature encoding. For localization, we provide high-resolution feature map containing more edge information to better regress object boundaries. TSCODE is plug-and-play and can be easily incorperated into existing detection pipelines. Extensive experiments demonstrate that our method stably improves different detectors by over 1.0 AP with less computational cost. Our code and models will be publicly released. △ Less

Submitted 2 March, 2023; originally announced March 2023.

arXiv:2302.00827 [pdf, other]

Including stress relaxation in point-process model for seismic occurrence

Authors: Giuseppe Petrillo, Jiancang Zhuang, Eugenio Lippiello

Abstract: Physics-based and statistic-based models for describing seismic occurrence are two sides of the same coin. In this article we compare the temporal organization of events obtained in a spring-block model for the seismic fault with the one predicted by probabilistic models for seismic occurrence. Thanks to the optimization of the parameters, by means of a Maximum Likelihood Estimation, it is possibl… ▽ More Physics-based and statistic-based models for describing seismic occurrence are two sides of the same coin. In this article we compare the temporal organization of events obtained in a spring-block model for the seismic fault with the one predicted by probabilistic models for seismic occurrence. Thanks to the optimization of the parameters, by means of a Maximum Likelihood Estimation, it is possible to identify the statistical model which fits better the physical one. The results show that the best statistical model must take into account the non trivial interplay between temporal clustering, related to aftershock occurrence, and the stress discharge following the occurrence of high magnitude mainshocks. The two mechanisms contribute in different ways according to the minimum magnitude considered in the data fitting catalog. △ Less

Submitted 1 February, 2023; originally announced February 2023.

arXiv:2301.03832 [pdf, other]

Video Semantic Segmentation with Inter-Frame Feature Fusion and Inner-Frame Feature Refinement

Authors: Jiafan Zhuang, Zilei Wang, Junjie Li

Abstract: Video semantic segmentation aims to generate accurate semantic maps for each video frame. To this end, many works dedicate to integrate diverse information from consecutive frames to enhance the features for prediction, where a feature alignment procedure via estimated optical flow is usually required. However, the optical flow would inevitably suffer from inaccuracy, and then introduce noises in… ▽ More Video semantic segmentation aims to generate accurate semantic maps for each video frame. To this end, many works dedicate to integrate diverse information from consecutive frames to enhance the features for prediction, where a feature alignment procedure via estimated optical flow is usually required. However, the optical flow would inevitably suffer from inaccuracy, and then introduce noises in feature fusion and further result in unsatisfactory segmentation results. In this paper, to tackle the misalignment issue, we propose a spatial-temporal fusion (STF) module to model dense pairwise relationships among multi-frame features. Different from previous methods, STF uniformly and adaptively fuses features at different spatial and temporal positions, and avoids error-prone optical flow estimation. Besides, we further exploit feature refinement within a single frame and propose a novel memory-augmented refinement (MAR) module to tackle difficult predictions among semantic boundaries. Specifically, MAR can store the boundary features and prototypes extracted from the training samples, which together form the task-specific memory, and then use them to refine the features during inference. Essentially, MAR can move the hard features closer to the most likely category and thus make them more discriminative. We conduct extensive experiments on Cityscapes and CamVid, and the results show that our proposed methods significantly outperform previous methods and achieves the state-of-the-art performance. Code and pretrained models are available at https://github.com/jfzhuang/ST_Memory. △ Less

Submitted 10 January, 2023; originally announced January 2023.

arXiv:2301.02359 [pdf, other]

CHARM: Composing Heterogeneous Accelerators for Matrix Multiply on Versal ACAP Architecture

Authors: **ming Zhuang, Jason Lau, Hanchen Ye, Zhuo** Yang, Yubo Du, Jack Lo, Kristof Denolf, Stephen Neuendorffer, Alex Jones, **gtong Hu, Deming Chen, Jason Cong, Peipei Zhou

Abstract: Dense matrix multiply (MM) serves as one of the most heavily used kernels in deep learning applications. To cope with the high computation demands of these applications, heterogeneous architectures featuring both FPGA and dedicated ASIC accelerators have emerged as promising platforms. For example, the AMD/Xilinx Versal ACAP architecture combines general-purpose CPU cores and programmable logic wi… ▽ More Dense matrix multiply (MM) serves as one of the most heavily used kernels in deep learning applications. To cope with the high computation demands of these applications, heterogeneous architectures featuring both FPGA and dedicated ASIC accelerators have emerged as promising platforms. For example, the AMD/Xilinx Versal ACAP architecture combines general-purpose CPU cores and programmable logic with AI Engine processors optimized for AI/ML. With 400 AIEs, it provides up to 6.4 TFLOPs performance for 32-bit floating-point data. However, machine learning models often contain both large and small MM operations. While large MM operations can be parallelized efficiently across many cores, small MM operations typically cannot. We observe that executing some small MM layers from the BERT natural language processing model on a large, monolithic MM accelerator in Versal ACAP achieved less than 5% of the theoretical peak performance. Therefore, one key question arises: How can we design accelerators to fully use the abundant computation resources under limited communication bandwidth for applications with multiple MM layers of diverse sizes? We identify the biggest system throughput bottleneck resulting from the mismatch of massive computation resources of one monolithic accelerator and the various MM layers of small sizes in the application. To resolve this problem, we propose the CHARM framework to compose multiple diverse MM accelerator architectures working concurrently towards different layers in one application. We deploy the CHARM framework for four different applications, including BERT, ViT, NCF, MLP, on the AMD Versal ACAP VCK190 evaluation board. Our experiments show that we achieve 1.46 TFLOPs, 1.61 TFLOPs, 1.74 TFLOPs, and 2.94 TFLOPs inference throughput for BERT, ViT, NCF and MLP, which obtain 5.40x, 32.51x, 1.00x and 1.00x throughput gains compared to one monolithic accelerator. △ Less

Submitted 5 January, 2023; originally announced January 2023.

arXiv:2211.01607 [pdf, other]

ImageCAS: A Large-Scale Dataset and Benchmark for Coronary Artery Segmentation based on Computed Tomography Angiography Images

Authors: An Zeng, Chunbiao Wu, Mei** Huang, Jian Zhuang, Shanshan Bi, Dan Pan, Najeeb Ullah, Kaleem Nawaz Khan, Tianchen Wang, Yiyu Shi, Xiaomeng Li, Guisen Lin, Xiaowei Xu

Abstract: Cardiovascular disease (CVD) accounts for about half of non-communicable diseases. Vessel stenosis in the coronary artery is considered to be the major risk of CVD. Computed tomography angiography (CTA) is one of the widely used noninvasive imaging modalities in coronary artery diagnosis due to its superior image resolution. Clinically, segmentation of coronary arteries is essential for the diagno… ▽ More Cardiovascular disease (CVD) accounts for about half of non-communicable diseases. Vessel stenosis in the coronary artery is considered to be the major risk of CVD. Computed tomography angiography (CTA) is one of the widely used noninvasive imaging modalities in coronary artery diagnosis due to its superior image resolution. Clinically, segmentation of coronary arteries is essential for the diagnosis and quantification of coronary artery disease. Recently, a variety of works have been proposed to address this problem. However, on one hand, most works rely on in-house datasets, and only a few works published their datasets to the public which only contain tens of images. On the other hand, their source code have not been published, and most follow-up works have not made comparison with existing works, which makes it difficult to judge the effectiveness of the methods and hinders the further exploration of this challenging yet critical problem in the community. In this paper, we propose a large-scale dataset for coronary artery segmentation on CTA images. In addition, we have implemented a benchmark in which we have tried our best to implement several typical existing methods. Furthermore, we propose a strong baseline method which combines multi-scale patch fusion and two-stage processing to extract the details of vessels. Comprehensive experiments show that the proposed method achieves better performance than existing works on the proposed large-scale dataset. The benchmark and the dataset are published at https://github.com/XiaoweiXu/ImageCAS-A-Large-Scale-Dataset-and-Benchmark-for-Coronary-Artery-Segmentation-based-on-CT. △ Less

Submitted 17 October, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

Comments: 17 pages, 12 figures, 4 tables

Journal ref: Computerized Medical Imaging and Graphics, 2023

arXiv:2209.06656 [pdf, ps, other]

Syndrome decoding meets multiple instances

Authors: Haoxuan Wu, **cheng Zhuang

Abstract: The NP-hard problem of decoding random linear codes is crucial to both coding theory and cryptography. In particular, this problem underpins the security of many code based post-quantum cryptographic schemes. The state-of-art algorithms for solving this problem are the information syndrome decoding algorithm and its advanced variants. In this work, we consider syndrome decoding in the multiple ins… ▽ More The NP-hard problem of decoding random linear codes is crucial to both coding theory and cryptography. In particular, this problem underpins the security of many code based post-quantum cryptographic schemes. The state-of-art algorithms for solving this problem are the information syndrome decoding algorithm and its advanced variants. In this work, we consider syndrome decoding in the multiple instances setting. Two strategies are applied for different scenarios. The first strategy is to solve all instances with the aid of the precomputation technique. We adjust the current framework and distinguish the offline phase and online phase to reduce the amortized complexity. Further, we discuss the impact on the concrete security of some post-quantum schemes. The second strategy is to solve one out of many instances. Adapting the analysis for some earlier algorithm, we discuss the effectiveness of using advanced variants and confirm a related folklore conjecture. △ Less

Submitted 14 September, 2022; originally announced September 2022.

arXiv:2209.00726 [pdf, other]

Learning correspondences of cardiac motion from images using biomechanics-informed modeling

Authors: Xiaoran Zhang, Chenyu You, Shawn Ahn, Juntang Zhuang, Lawrence Staib, James Duncan

Abstract: Learning spatial-temporal correspondences in cardiac motion from images is important for understanding the underlying dynamics of cardiac anatomical structures. Many methods explicitly impose smoothness constraints such as the $\mathcal{L}_2$ norm on the displacement vector field (DVF), while usually ignoring biomechanical feasibility in the transformation. Other geometric constraints either regul… ▽ More Learning spatial-temporal correspondences in cardiac motion from images is important for understanding the underlying dynamics of cardiac anatomical structures. Many methods explicitly impose smoothness constraints such as the $\mathcal{L}_2$ norm on the displacement vector field (DVF), while usually ignoring biomechanical feasibility in the transformation. Other geometric constraints either regularize specific regions of interest such as imposing incompressibility on the myocardium or introduce additional steps such as training a separate network-based regularizer on physically simulated datasets. In this work, we propose an explicit biomechanics-informed prior as regularization on the predicted DVF in modeling a more generic biomechanically plausible transformation within all cardiac structures without introducing additional training complexity. We validate our methods on two publicly available datasets in the context of 2D MRI data and perform extensive experiments to illustrate the effectiveness and robustness of our proposed methods compared to other competing regularization schemes. Our proposed methods better preserve biomechanical properties by visual assessment and show advantages in segmentation performance using quantitative evaluation metrics. The code is publicly available at \url{https://github.com/Voldemort108X/bioinformed_reg}. △ Less

Submitted 1 September, 2022; originally announced September 2022.

Comments: Accepted by MICCAI-STACOM 2022 as an oral presentation

arXiv:2208.12207 [pdf]

The postperovskite transition in Fe- and Al-bearing bridgmanite: effects on seismic observables

Authors: Juan J. Valencia-Cardona, Renata M. Wentzcovitch, **gyi Zhuang, Gaurav Shukla, Kanchan Sarkar

Abstract: The primary phase of the Earth's lower mantle, (Al, Fe)-bearing bridgmanite, transitions to the postperovskite (PPv) phase at Earth's deep mantle conditions. Despite extensive experimental and ab initio investigations, there are still important aspects of this transformation that need clarification. Here, we address this transition in (Al3+, Fe3+)-, (Al3+)-, (Fe2+)-, and (Fe3+)-bearing bridgmanite… ▽ More The primary phase of the Earth's lower mantle, (Al, Fe)-bearing bridgmanite, transitions to the postperovskite (PPv) phase at Earth's deep mantle conditions. Despite extensive experimental and ab initio investigations, there are still important aspects of this transformation that need clarification. Here, we address this transition in (Al3+, Fe3+)-, (Al3+)-, (Fe2+)-, and (Fe3+)-bearing bridgmanite using ab initio calculations and validate our results against experiments on similar compositions. Consistent with experiments, our results show that the onset transition pressure and the width of the two-phase region depend distinctly on the chemical composition: a) Fe3+-, Al3+-, or (Al3+, Fe3+)-alloying increases the transition pressure, while Fe2+-alloying has the opposite effect; b) in the absence of coexisting phases, the pressure-depth range of the Pv-PPv transition seems quite broad to cause a sharp D" discontinuity (< 30 km); c) the average Clapeyron slope of the two-phase regions are consistent with previous measurements, calculations in MgSiO3, and inferences from seismic data. In addition, d) we observe a softening of the bulk modulus in the two-phase region. The consistency between our results and experiments gives us the confidence to proceed and examine this transition in aggregates with different compositions computationally, which will be fundamental for resolving the most likely chemical composition of the D" region by analyses of tomographic images. △ Less

Submitted 15 September, 2022; v1 submitted 25 August, 2022; originally announced August 2022.

Comments: 20 pages, 5 figures, 1 table

arXiv:2208.09779 [pdf, other]

doi 10.1145/3511808.3557437

Robust Node Classification on Graphs: Jointly from Bayesian Label Transition and Topology-based Label Propagation

Authors: Jun Zhuang, Mohammad Al Hasan

Abstract: Node classification using Graph Neural Networks (GNNs) has been widely applied in various real-world scenarios. However, in recent years, compelling evidence emerges that the performance of GNN-based node classification may deteriorate substantially by topological perturbation, such as random connections or adversarial attacks. Various solutions, such as topological denoising methods and mechanism… ▽ More Node classification using Graph Neural Networks (GNNs) has been widely applied in various real-world scenarios. However, in recent years, compelling evidence emerges that the performance of GNN-based node classification may deteriorate substantially by topological perturbation, such as random connections or adversarial attacks. Various solutions, such as topological denoising methods and mechanism design methods, have been proposed to develop robust GNN-based node classifiers but none of these works can fully address the problems related to topological perturbations. Recently, the Bayesian label transition model is proposed to tackle this issue but its slow convergence may lead to inferior performance. In this work, we propose a new label inference model, namely LInDT, which integrates both Bayesian label transition and topology-based label propagation for improving the robustness of GNNs against topological perturbations. LInDT is superior to existing label transition methods as it improves the label prediction of uncertain nodes by utilizing neighborhood-based label propagation leading to better convergence of label inference. Besides, LIndT adopts asymmetric Dirichlet distribution as a prior, which also helps it to improve label inference. Extensive experiments on five graph datasets demonstrate the superiority of LInDT for GNN-based node classification under three scenarios of topological perturbations. △ Less

Submitted 20 August, 2022; originally announced August 2022.

Comments: The paper is accepted for CIKM 2022

arXiv:2208.05616 [pdf, other]

OpenMedIA: Open-Source Medical Image Analysis Toolbox and Benchmark under Heterogeneous AI Computing Platforms

Authors: Jia-Xin Zhuang, Xiansong Huang, Yang Yang, Jiancong Chen, Yue Yu, Wei Gao, Ge Li, Jie Chen, Tong Zhang

Abstract: In this paper, we present OpenMedIA, an open-source toolbox library containing a rich set of deep learning methods for medical image analysis under heterogeneous Artificial Intelligence (AI) computing platforms. Various medical image analysis methods, including 2D/3D medical image classification, segmentation, localisation, and detection, have been included in the toolbox with PyTorch and/or MindS… ▽ More In this paper, we present OpenMedIA, an open-source toolbox library containing a rich set of deep learning methods for medical image analysis under heterogeneous Artificial Intelligence (AI) computing platforms. Various medical image analysis methods, including 2D/3D medical image classification, segmentation, localisation, and detection, have been included in the toolbox with PyTorch and/or MindSpore implementations under heterogeneous NVIDIA and Huawei Ascend computing systems. To our best knowledge, OpenMedIA is the first open-source algorithm library providing compared PyTorch and MindSpore implementations and results on several benchmark datasets. The source codes and models are available at https://git.openi.org.cn/OpenMedIA. △ Less

Submitted 7 September, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

Comments: 12 pages, 1 figure

arXiv:2208.00243 [pdf, other]

doi 10.1088/1361-648X/ac8465

2D Gapless Topological Superfluids Generated by Pairing Phases

Authors: Jiapei Zhuang, Ching-Yu Huang, Po-Yao Chang, Daw-Wei Wang

Abstract: We systematically investigate the ground state phase diagram and the finite temperature phase transitions for a Rydberg-dressed Fermi gas loaded in a bilayer optical lattice. When an effective finite-ranged attraction is induced, our self-consistent mean-field calculation shows that the gapped topological ( $p$-wave) superfluids in each layer are coupled together by the $s$-wave pairing in an inte… ▽ More We systematically investigate the ground state phase diagram and the finite temperature phase transitions for a Rydberg-dressed Fermi gas loaded in a bilayer optical lattice. When an effective finite-ranged attraction is induced, our self-consistent mean-field calculation shows that the gapped topological ( $p$-wave) superfluids in each layer are coupled together by the $s$-wave pairing in an intermediate inter-layer distance with a spontaneously modulated phases between these two order parameters. The obtained ground state is a gapless topological superfluid with quantized topological charges characterizing the gapless points, leading to a zero energy flat band at the edges. Finally, we calculate the finite temperature phase diagrams of this two-dimensional gapless superfluid and observe two distinct critical temperatures, demonstrating the fruitful many-body effects on a paired topological superfluids. △ Less

Submitted 30 July, 2022; originally announced August 2022.

Comments: 11 pages, 5 figures

arXiv:2207.04519 [pdf, other]

A multi-cubic-kilometre neutrino telescope in the western Pacific Ocean

Authors: Z. P. Ye, F. Hu, W. Tian, Q. C. Chang, Y. L. Chang, Z. S. Cheng, J. Gao, T. Ge, G. H. Gong, J. Guo, X. X. Guo, X. G. He, J. T. Huang, K. Jiang, P. K. Jiang, Y. P. **g, H. L. Li, J. L. Li, L. Li, W. L. Li, Z. Li, N. Y. Liao, Q. Lin, F. Liu, J. L. Liu , et al. (33 additional authors not shown)

Abstract: Next-generation neutrino telescopes with significantly improved sensitivity are required to pinpoint the sources of the diffuse astrophysical neutrino flux detected by IceCube and uncover the century-old puzzle of cosmic ray origins. A detector near the equator will provide a unique viewpoint of the neutrino sky, complementing IceCube and other neutrino telescopes in the Northern Hemisphere. Here… ▽ More Next-generation neutrino telescopes with significantly improved sensitivity are required to pinpoint the sources of the diffuse astrophysical neutrino flux detected by IceCube and uncover the century-old puzzle of cosmic ray origins. A detector near the equator will provide a unique viewpoint of the neutrino sky, complementing IceCube and other neutrino telescopes in the Northern Hemisphere. Here we present results from an expedition to the north-eastern region of the South China Sea, in the western Pacific Ocean. A favorable neutrino telescope site was found on an abyssal plain at a depth of $\sim$ 3.5km. At depths below 3km, the sea current speed, water absorption and scattering lengths for Cherenkov light, were measured to be $v_{\mathrm{c}}<$10cm/s, $λ_{\mathrm{abs} }\simeq$ 27m and $λ_{\mathrm{sca} }\simeq$ 63m, respectively. Accounting for these measurements, we present the design and expected performance of a next-generation neutrino telescope, TRopIcal DEep-sea Neutrino Telescope (TRIDENT). With its advanced photon-detection technology and large dimensions, TRIDENT expects to observe the IceCube steady source candidate NGC 1068 with 5$σ$ significance within 1 year of operation. This level of sensitivity will open a new arena for diagnosing the origin of cosmic rays and probing fundamental physics over astronomical baselines. △ Less

Submitted 13 May, 2024; v1 submitted 10 July, 2022; originally announced July 2022.

Comments: 34 pages,12 figures. Correspondence should be addressed to D. L. Xu: [email protected]

arXiv:2206.12558 [pdf, other]

FastBVP-Net: a lightweight pulse extraction network for measuring heart rhythm via facial videos

Authors: Jialiang Zhuang, Yuheng Chen, Yun Zhang, Xiujuan Zheng

Abstract: Remote photoplethysmography (rPPG) is an attractive camera-based health monitoring method that can measure the heart rhythm from facial videos. Many well-established deep-learning models have been reported to measure heart rate (HR) and heart rate variability (HRV). However, most of these models usually require a 30-second facial video and enormous computational resources to obtain accurate and ro… ▽ More Remote photoplethysmography (rPPG) is an attractive camera-based health monitoring method that can measure the heart rhythm from facial videos. Many well-established deep-learning models have been reported to measure heart rate (HR) and heart rate variability (HRV). However, most of these models usually require a 30-second facial video and enormous computational resources to obtain accurate and robust results, which significantly limits their applications in real-world scenarios. Hence, we propose a lightweight pulse extraction network, FastBVP-Net, to quickly measure heart rhythm via facial videos. The proposed FastBVP-Net uses a multi-frequency mode signal fusion (MMSF) mechanism to characterize the different modes of the raw signals in a decompose module and reconstruct the blood volume pulse (BVP) signal under a complex noise environment in a compose module. Meanwhile, an oversampling training scheme is used to solve the over-fitting problem caused by the limitations of the datasets. Then, the HR and HRV can be estimated based on the extracted BVP signals. Comprehensive experiments are conducted on the benchmark datasets to validate the proposed FastBVP-Net. For intra-dataset and cross-dataset testing, the proposed approach achieves better performance for HR and HRV estimation from 30-second facial videos with fewer computational burdens than the current well-established methods. Moreover, the proposed approach also achieves competitive results from 15-second facial videos. Therefore, the proposed FastBVP-Net has the potential to be applied in many real-world scenarios with shorter videos. △ Less

Submitted 21 December, 2022; v1 submitted 25 June, 2022; originally announced June 2022.

Comments: 9 pages, 2figures

arXiv:2206.02295 [pdf, other]

HIFI-Net: A Novel Network for Enhancement to Underwater Images

Authors: Jiajia Zhou, Junbin Zhuang, Yan Zheng, Di Wu

Abstract: A novel network for enhancement to underwater images is proposed in this paper. It contains a Reinforcement Fusion Module for Haar wavelet images (RFM-Haar) based on Reinforcement Fusion Unit (RFU), which is used to fuse an original image and some important information within it. Fusion is achieved for better enhancement. As this network make "Haar Images into Fusion Images", it is called HIFI-Net… ▽ More A novel network for enhancement to underwater images is proposed in this paper. It contains a Reinforcement Fusion Module for Haar wavelet images (RFM-Haar) based on Reinforcement Fusion Unit (RFU), which is used to fuse an original image and some important information within it. Fusion is achieved for better enhancement. As this network make "Haar Images into Fusion Images", it is called HIFI-Net. The experimental results show the proposed HIFI-Net performs best among many state-of-the-art methods on three datasets at three normal metrics and a new metric. △ Less

Submitted 5 June, 2022; originally announced June 2022.

Comments: 7 pages, 4 figures

arXiv:2205.14169 [pdf, other]

doi 10.1103/PhysRevB.106.144308

Phase-transition-like behavior in information retrieval of a quantum scrambled random circuit system

Authors: J. -Z. Zhuang, Y. -K. Wu, L. -M. Duan

Abstract: Information in a chaotic quantum system will scramble across the system, preventing any local measurement from reconstructing it. The scrambling dynamics is key to understanding a wide range of quantum many-body systems. Here we use Holevo information to quantify the scrambling dynamics, which shows a phase-transition-like behavior. When applying long random Clifford circuits to a large system, no… ▽ More Information in a chaotic quantum system will scramble across the system, preventing any local measurement from reconstructing it. The scrambling dynamics is key to understanding a wide range of quantum many-body systems. Here we use Holevo information to quantify the scrambling dynamics, which shows a phase-transition-like behavior. When applying long random Clifford circuits to a large system, no information can be recovered from a subsystem of less than half the system size. When exceeding half the system size, the amount of stored information grows by two bits of classical information per qubit until saturation through another sharp unanalytical change. We also study critical behavior near the transition points. Finally, we use coherent information to quantify the scrambling of quantum information in the system, which shows similar phase-transition-like behavior. △ Less

Submitted 11 October, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

Comments: 10 pages, 5 figures

arXiv:2203.08065 [pdf, other]

Surrogate Gap Minimization Improves Sharpness-Aware Training

Authors: Juntang Zhuang, Boqing Gong, Liangzhe Yuan, Yin Cui, Hartwig Adam, Nicha Dvornek, Sekhar Tatikonda, James Duncan, Ting Liu

Abstract: The recently proposed Sharpness-Aware Minimization (SAM) improves generalization by minimizing a \textit{perturbed loss} defined as the maximum loss within a neighborhood in the parameter space. However, we show that both sharp and flat minima can have a low perturbed loss, implying that SAM does not always prefer flat minima. Instead, we define a \textit{surrogate gap}, a measure equivalent to th… ▽ More The recently proposed Sharpness-Aware Minimization (SAM) improves generalization by minimizing a \textit{perturbed loss} defined as the maximum loss within a neighborhood in the parameter space. However, we show that both sharp and flat minima can have a low perturbed loss, implying that SAM does not always prefer flat minima. Instead, we define a \textit{surrogate gap}, a measure equivalent to the dominant eigenvalue of Hessian at a local minimum when the radius of the neighborhood (to derive the perturbed loss) is small. The surrogate gap is easy to compute and feasible for direct minimization during training. Based on the above observations, we propose Surrogate \textbf{G}ap Guided \textbf{S}harpness-\textbf{A}ware \textbf{M}inimization (GSAM), a novel improvement over SAM with negligible computation overhead. Conceptually, GSAM consists of two steps: 1) a gradient descent like SAM to minimize the perturbed loss, and 2) an \textit{ascent} step in the \textit{orthogonal} direction (after gradient decomposition) to minimize the surrogate gap and yet not affect the perturbed loss. GSAM seeks a region with both small loss (by step 1) and low sharpness (by step 2), giving rise to a model with high generalization capabilities. Theoretically, we show the convergence of GSAM and provably better generalization than SAM. Empirically, GSAM consistently improves generalization (e.g., +3.2\% over SAM and +5.4\% over AdamW on ImageNet top-1 accuracy for ViT-B/32). Code is released at \url{ https://sites.google.com/view/gsam-iclr22/home}. △ Less

Submitted 19 March, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

Comments: Paper accepted by ICLR22, https://openreview.net/forum?id=edONMAnhLu-

arXiv:2203.03762 [pdf, other]

Defending Graph Convolutional Networks against Dynamic Graph Perturbations via Bayesian Self-supervision

Authors: Jun Zhuang, Mohammad Al Hasan

Abstract: In recent years, plentiful evidence illustrates that Graph Convolutional Networks (GCNs) achieve extraordinary accomplishments on the node classification task. However, GCNs may be vulnerable to adversarial attacks on label-scarce dynamic graphs. Many existing works aim to strengthen the robustness of GCNs; for instance, adversarial training is used to shield GCNs against malicious perturbations.… ▽ More In recent years, plentiful evidence illustrates that Graph Convolutional Networks (GCNs) achieve extraordinary accomplishments on the node classification task. However, GCNs may be vulnerable to adversarial attacks on label-scarce dynamic graphs. Many existing works aim to strengthen the robustness of GCNs; for instance, adversarial training is used to shield GCNs against malicious perturbations. However, these works fail on dynamic graphs for which label scarcity is a pressing issue. To overcome label scarcity, self-training attempts to iteratively assign pseudo-labels to highly confident unlabeled nodes but such attempts may suffer serious degradation under dynamic graph perturbations. In this paper, we generalize noisy supervision as a kind of self-supervised learning method and then propose a novel Bayesian self-supervision model, namely GraphSS, to address the issue. Extensive experiments demonstrate that GraphSS can not only affirmatively alert the perturbations on dynamic graphs but also effectively recover the prediction of a node classifier when the graph is under such perturbations. These two advantages prove to be generalized over three classic GCNs across five public graph datasets. △ Less

Submitted 7 March, 2022; originally announced March 2022.

Comments: The paper is accepted by AAAI 2022

arXiv:2203.03634 [pdf, other]

Remote blood pressure measurement via spatiotemporal map** of a short-time facial video

Authors: Jialiang Zhuang, Bin Li, Yun Zhang, Yuheng Chen, Xiujuan Zheng

Abstract: Blood pressure (BP) monitoring is vital in daily healthcare, especially for cardiovascular diseases. However, BP values are mainly acquired through the contact sensing method, which is inconvenient and unfriendly to continuous BP measurement. Hence, we propose an efficient end-to-end network to estimate the BP values from a facial video to achieve remote BP measurement in daily life. In this study… ▽ More Blood pressure (BP) monitoring is vital in daily healthcare, especially for cardiovascular diseases. However, BP values are mainly acquired through the contact sensing method, which is inconvenient and unfriendly to continuous BP measurement. Hence, we propose an efficient end-to-end network to estimate the BP values from a facial video to achieve remote BP measurement in daily life. In this study, we first derived a Spatial-temporal map of a short-time (~15s) facial video. According to the Spatial-temporal map, we then regressed the BP ranges by a designed blood pressure classifier and simultaneously calculated the specific value by a blood pressure calculator in each BP range. In addition, we also developed an innovative oversampling training strategy to handle the unbalanced data distribution problem. Finally, we trained the proposed network on a private dataset ASPD and tested it on the popular dataset MMSE-HR. As a result, the proposed network achieved a state-of-the-art MAE of 12.35 mmHg and 9.5 mmHg on systolic and diastolic BP measurements, which is better than the recent works. It concludes that the proposed method has excellent potential for camera-based BP monitoring in real-world scenarios. △ Less

Submitted 23 June, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

Comments: 7 pages, 7 figures

arXiv:2203.03329 [pdf, other]

Open Set Domain Adaptation By Novel Class Discovery

Authors: **gyu Zhuang, Ziliang Chen, Pengxu Wei, Guanbin Li, Liang Lin

Abstract: In Open Set Domain Adaptation (OSDA), large amounts of target samples are drawn from the implicit categories that never appear in the source domain. Due to the lack of their specific belonging, existing methods indiscriminately regard them as a single class unknown. We challenge this broadly-adopted practice that may arouse unexpected detrimental effects because the decision boundaries between the… ▽ More In Open Set Domain Adaptation (OSDA), large amounts of target samples are drawn from the implicit categories that never appear in the source domain. Due to the lack of their specific belonging, existing methods indiscriminately regard them as a single class unknown. We challenge this broadly-adopted practice that may arouse unexpected detrimental effects because the decision boundaries between the implicit categories have been fully ignored. Instead, we propose Self-supervised Class-Discovering Adapter (SCDA) that attempts to achieve OSDA by gradually discovering those implicit classes, then incorporating them to restructure the classifier and update the domain-adaptive features iteratively. SCDA performs two alternate steps to achieve implicit class discovery and self-supervised OSDA, respectively. By jointly optimizing for two tasks, SCDA achieves the state-of-the-art in OSDA and shows a competitive performance to unearth the implicit target classes. △ Less

Submitted 7 March, 2022; originally announced March 2022.

arXiv:2203.02876 [pdf]

Displacement calibration of optical tweezers with absolute gravitational acceleration

Authors: Jianyu Yang, Nan Li, Xunmin Zhu, Ming Chen, Xingfan Chen, Cheng Liu, Jian Zhuang, Huizhu Hu

Abstract: In recent years, levitated particles of optical traps in vacuum have shown enormous potential in precision sensor development and searching for new physics. The accuracy of the calibration relating the detected signal to absolute displacement of the trapped particle is a critical factor for absolute measurement performance. In this paper, we suggest and experimentally demonstrate a novel calibrati… ▽ More In recent years, levitated particles of optical traps in vacuum have shown enormous potential in precision sensor development and searching for new physics. The accuracy of the calibration relating the detected signal to absolute displacement of the trapped particle is a critical factor for absolute measurement performance. In this paper, we suggest and experimentally demonstrate a novel calibration method for optical tweezers based on free-falling particles in vacuum, where the gravitational acceleration is introduced as an absolute reference. Our work provides a calibration protocol with great certainty and traceability, which is significant in improving the accuracy of precision sensing based on optically levitated particles. △ Less

Submitted 6 March, 2022; originally announced March 2022.

Comments: 10 pages, 5 figures

arXiv:2202.08199 [pdf, other]

Less is More: Surgical Phase Recognition from Timestamp Supervision

Authors: Xinpeng Ding, Xinjian Yan, Zixun Wang, Wei Zhao, Jian Zhuang, Xiaowei Xu, Xiaomeng Li

Abstract: Surgical phase recognition is a fundamental task in computer-assisted surgery systems. Most existing works are under the supervision of expensive and time-consuming full annotations, which require the surgeons to repeat watching videos to find the precise start and end time for a surgical phase. In this paper, we introduce timestamp supervision for surgical phase recognition to train the models wi… ▽ More Surgical phase recognition is a fundamental task in computer-assisted surgery systems. Most existing works are under the supervision of expensive and time-consuming full annotations, which require the surgeons to repeat watching videos to find the precise start and end time for a surgical phase. In this paper, we introduce timestamp supervision for surgical phase recognition to train the models with timestamp annotations, where the surgeons are asked to identify only a single timestamp within the temporal boundary of a phase. This annotation can significantly reduce the manual annotation cost compared to the full annotations. To make full use of such timestamp supervisions, we propose a novel method called uncertainty-aware temporal diffusion (UATD) to generate trustworthy pseudo labels for training. Our proposed UATD is motivated by the property of surgical videos, i.e., the phases are long events consisting of consecutive frames. To be specific, UATD diffuses the single labelled timestamp to its corresponding high confident ( i.e., low uncertainty) neighbour frames in an iterative way. Our study uncovers unique insights of surgical phase recognition with timestamp supervisions: 1) timestamp annotation can reduce 74% annotation time compared with the full annotation, and surgeons tend to annotate those timestamps near the middle of phases; 2) extensive experiments demonstrate that our method can achieve competitive results compared with full supervision methods, while reducing manual annotation cost; 3) less is more in surgical phase recognition, i.e., less but discriminative pseudo labels outperform full but containing ambiguous frames; 4) the proposed UATD can be used as a plug and play method to clean ambiguous labels near boundaries between phases, and improve the performance of the current surgical phase recognition methods. △ Less

Submitted 30 November, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

arXiv:2201.09206 [pdf, other]

doi 10.1109/TCSVT.2021.3135013

A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization

Authors: Ming Dai, Jianhong Hu, Jiedong Zhuang, Enhui Zheng

Abstract: Cross-view geo-localization is a task of matching the same geographic image from different views, e.g., unmanned aerial vehicle (UAV) and satellite. The most difficult challenges are the position shift and the uncertainty of distance and scale. Existing methods are mainly aimed at digging for more comprehensive fine-grained information. However, it underestimates the importance of extracting robus… ▽ More Cross-view geo-localization is a task of matching the same geographic image from different views, e.g., unmanned aerial vehicle (UAV) and satellite. The most difficult challenges are the position shift and the uncertainty of distance and scale. Existing methods are mainly aimed at digging for more comprehensive fine-grained information. However, it underestimates the importance of extracting robust feature representation and the impact of feature alignment. The CNN-based methods have achieved great success in cross-view geo-localization. However it still has some limitations, e.g., it can only extract part of the information in the neighborhood and some scale reduction operations will make some fine-grained information lost. In particular, we introduce a simple and efficient transformer-based structure called Feature Segmentation and Region Alignment (FSRA) to enhance the model's ability to understand contextual information as well as to understand the distribution of instances. Without using additional supervisory information, FSRA divides regions based on the heat distribution of the transformer's feature map, and then aligns multiple specific regions in different views one on one. Finally, FSRA integrates each region into a set of feature representations. The difference is that FSRA does not divide regions manually, but automatically based on the heat distribution of the feature map. So that specific instances can still be divided and aligned when there are significant shifts and scale changes in the image. In addition, a multiple sampling strategy is proposed to overcome the disparity in the number of satellite images and that of images from other sources. Experiments show that the proposed method has superior performance and achieves the state-of-the-art in both tasks of drone view target localization and drone navigation. Code will be released at https://github.com/Dmmm1997/FSRA △ Less

Submitted 23 January, 2022; originally announced January 2022.

Comments: 14 pages, 13 figures, IEEE Transactions on Circuits and Systems for Video Technology

arXiv:2201.09201 [pdf, other]

Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments

Authors: Ming Dai, Enhui Zheng, Zhenhua Feng, Jiedong Zhuang, Wankou Yang

Abstract: Unmanned Aerial Vehicles (UAVs) rely on satellite systems for stable positioning. However, due to limited satellite coverage or communication disruptions, UAVs may lose signals from satellite-based positioning systems. In such situations, vision-based techniques can serve as an alternative, ensuring the self-positioning capability of UAVs. However, most of the existing datasets are developed for t… ▽ More Unmanned Aerial Vehicles (UAVs) rely on satellite systems for stable positioning. However, due to limited satellite coverage or communication disruptions, UAVs may lose signals from satellite-based positioning systems. In such situations, vision-based techniques can serve as an alternative, ensuring the self-positioning capability of UAVs. However, most of the existing datasets are developed for the geo-localization tasks of the objects identified by UAVs, rather than the self-positioning task of UAVs. Furthermore, the current UAV datasets use discrete sampling on synthetic data, such as Google Maps, thereby neglecting the crucial aspects of dense sampling and the uncertainties commonly experienced in real-world scenarios. To address these issues, this paper presents a new dataset, DenseUAV, which is the first publicly available dataset designed for the UAV self-positioning task. DenseUAV adopts dense sampling on UAV images obtained in low-altitude urban settings. In total, over 27K UAV-view and satellite-view images of 14 university campuses are collected and annotated, establishing a new benchmark. In terms of model development, we first verify the superiority of Transformers over CNNs in this task. Then, we incorporate metric learning into representation learning to enhance the discriminative capacity of the model and to lessen the modality discrepancy. Besides, to facilitate joint learning from both perspectives, we propose a mutually supervised learning approach. Last, we enhance the Recall@K metric and introduce a new measurement, SDM@K, to evaluate the performance of a trained model from both the retrieval and localization perspectives simultaneously. As a result, the proposed baseline method achieves a remarkable Recall@1 score of 83.05% and an SDM@1 score of 86.24% on DenseUAV. The dataset and code will be made publicly available on https://github.com/Dmmm1997/DenseUAV. △ Less

Submitted 10 August, 2023; v1 submitted 23 January, 2022; originally announced January 2022.

Comments: 13 pages,8 figures

arXiv:2112.14027 [pdf, other]

doi 10.1103/PhysRevB.106.024506

Two-dimensional Paired Topological Superfluids of Rydberg Fermi Gases

Authors: Ching-Yu Huang, Jiapei Zhuang, Po-Yao Chang, Daw-Wei Wang

Abstract: We systematically investigate the topological properties of spin polarized Rydberg-dressed fermionic atoms loaded in a bilayer optical lattice. Through tuning the Rydberg coupling strength and the inter-layer tunneling amplitude, we identify different types of topological superfluid states generated from the inter-layer pairing and relative gauge phase modulation of the couples 2D $p$-wave superfl… ▽ More We systematically investigate the topological properties of spin polarized Rydberg-dressed fermionic atoms loaded in a bilayer optical lattice. Through tuning the Rydberg coupling strength and the inter-layer tunneling amplitude, we identify different types of topological superfluid states generated from the inter-layer pairing and relative gauge phase modulation of the couples 2D $p$-wave superfluids. These phases includes gapped/gapless with/without time reversal symmetry. One of the most interesting states is a gapless paired topological superfluid with both the time-reversal symmetry and particle-hole symmetry. This state is equivalent to a topological Kondo lattice model with the spin-orbit coupling, an in-plane magnetic field, and an additional particle-hole symmetry. The flexibility of experimental manipulation in such Rydberg-dressed ferminoic systems therefore becomes a promising system for realizing interesting topological superfluids. △ Less

Submitted 28 December, 2021; originally announced December 2021.

arXiv:2112.06379 [pdf, other]

5th Place Solution for VSPW 2021 Challenge

Authors: Jiafan Zhuang, Yixin Zhang, Xinyu Hu, Junjie Li, Zilei Wang

Abstract: In this article, we introduce the solution we used in the VSPW 2021 Challenge. Our experiments are based on two baseline models, Swin Transformer and MaskFormer. To further boost performance, we adopt stochastic weight averaging technique and design hierarchical ensemble strategy. Without using any external semantic segmentation dataset, our solution ranked the 5th place in the private leaderboard… ▽ More In this article, we introduce the solution we used in the VSPW 2021 Challenge. Our experiments are based on two baseline models, Swin Transformer and MaskFormer. To further boost performance, we adopt stochastic weight averaging technique and design hierarchical ensemble strategy. Without using any external semantic segmentation dataset, our solution ranked the 5th place in the private leaderboard. Besides, we have some interesting attempts to tackle long-tail recognition and overfitting issues, which achieves improvement on val subset. Maybe due to distribution difference, these attempts don't work on test subset. We will also introduce these attempts and hope to inspire other researchers. △ Less

Submitted 12 December, 2021; originally announced December 2021.

Comments: Presented in ICCV'21 Workshop

arXiv:2111.11007 [pdf]

doi 10.1103/PhysRevApplied.17.064017

Facet Dependent Topological Phase Transition in Bi4Br4

Authors: **gyuan Zhong, Ming Yang, Fei Ye, Chen Liu, Jiaou Wang, Weichang Hao, **cheng Zhuang, Yi Du

Abstract: The realization of the coexistence of various topologically nontrivial surface states in one material is expected to lay a foundation for new electric applications with selective robust spin current. Here we apply the magnetoconductivity characteristic and angle-resolved photoemission spectroscopy (ARPES) to visualize the surface-selected electronic features evolution of quasi-one-dimensional mate… ▽ More The realization of the coexistence of various topologically nontrivial surface states in one material is expected to lay a foundation for new electric applications with selective robust spin current. Here we apply the magnetoconductivity characteristic and angle-resolved photoemission spectroscopy (ARPES) to visualize the surface-selected electronic features evolution of quasi-one-dimensional material Bi4Br4. The transport measurements indicate the quantum interference correction to conductivity possesses symbolic spin rotational characteristic correlated to the value of Berry phase with the effects of weak localization and weak antilocalization for (001) and (100) surfaces, respectively. The ARPES spectra provide the experimental evidence for quasi-one-dimensional massless Dirac surface state at the side (100) surface and anisotropic massive Dirac surface state at the top (001) surface, respectively, which is highly coincide with the angle-dependent scaling behavior of magnetoconductivity. Our results reveal the facet dependent topological phases in quasi-one-dimensional Bi4Br4, stimulating the further investigations of this dual topology classes and the applications of the feasible technologies of topological spintronics. △ Less

Submitted 22 November, 2021; originally announced November 2021.

Journal ref: Physical Review Applied 17, 064017 (2022)

arXiv:2111.00687 [pdf, other]

RMNet: Equivalently Removing Residual Connection from Networks

Authors: Fanxu Meng, Hao Cheng, Jiaxin Zhuang, Ke Li, Xing Sun

Abstract: Although residual connection enables training very deep neural networks, it is not friendly for online inference due to its multi-branch topology. This encourages many researchers to work on designing DNNs without residual connections at inference. For example, RepVGG re-parameterizes multi-branch topology to a VGG-like (single-branch) model when deploying, showing great performance when the netwo… ▽ More Although residual connection enables training very deep neural networks, it is not friendly for online inference due to its multi-branch topology. This encourages many researchers to work on designing DNNs without residual connections at inference. For example, RepVGG re-parameterizes multi-branch topology to a VGG-like (single-branch) model when deploying, showing great performance when the network is relatively shallow. However, RepVGG can not transform ResNet to VGG equivalently because re-parameterizing methods can only be applied to linear blocks and the non-linear layers (ReLU) have to be put outside of the residual connection which results in limited representation ability, especially for deeper networks. In this paper, we aim to remedy this problem and propose to remove the residual connection in a vanilla ResNet equivalently by a reserving and merging (RM) operation on ResBlock. Specifically, the RM operation allows input feature maps to pass through the block while reserving their information and merges all the information at the end of each block, which can remove residual connections without changing the original output. As a plug-in method, RM Operation basically has three advantages: 1) its implementation makes it naturally friendly for high ratio network pruning. 2) it helps break the depth limitation of RepVGG. 3) it leads to better accuracy-speed trade-off network (RMNet) compared to ResNet and RepVGG. We believe the ideology of RM Operation can inspire many insights on model design for the community in the future. Code is available at: https://github.com/fxmeng/RMNet. △ Less

Submitted 1 November, 2021; originally announced November 2021.

Comments: Equivalently removing residual connection from ResBlock with non-linear layer inside it, towards an efficient plain model

arXiv:2110.12637 [pdf, other]

doi 10.1088/2516-1075/ac522b

Thermodynamics of spin crossover in ferropericlase: an improved LDA+$U_{sc}$ calculation

Authors: Yang Sun, **gyi Zhuang, Renata M. Wentzcovitch

Abstract: We present LDA+$U_{sc}$ calculations of high-spin (HS) and low-spin (LS) states in ferropericlase (fp) with an iron concentration of 18.75$\%$. The Hubbard parameter $U$ is determined self-consistently with structures optimized at arbitrary pressures. We confirm a strong dependence of $U$ on the pressure and spin state. Static calculations confirm that the antiferromagnetic configuration is more s… ▽ More We present LDA+$U_{sc}$ calculations of high-spin (HS) and low-spin (LS) states in ferropericlase (fp) with an iron concentration of 18.75$\%$. The Hubbard parameter $U$ is determined self-consistently with structures optimized at arbitrary pressures. We confirm a strong dependence of $U$ on the pressure and spin state. Static calculations confirm that the antiferromagnetic configuration is more stable than the ferromagnetic one in the HS state, consistent with low-temperature measurements. Phonon calculations guarantee the dynamical stability of HS and LS states throughout the pressure range of the Earth mantle. Compression curves for HS and LS states agree well with experiments. Using a non-ideal mixing model for the HS to LS states solid solution, we obtain a crossover starting at $\sim$45 GPa at room temperature and considerably broader than previous results. The spin-crossover phase diagram is calculated, including vibrational, magnetic, electronic, and non-ideal HS-LS entropic contributions. Our results suggest the mixed-spin state predominates in fp in most of the lower mantle. △ Less

Submitted 9 April, 2022; v1 submitted 25 October, 2021; originally announced October 2021.

Comments: 8 pages, 8 figures

Journal ref: Electron. Struct. 4 (2022) 014008

arXiv:2110.12274 [pdf, other]

"One-Shot" Reduction of Additive Artifacts in Medical Images

Authors: Yu-Jen Chen, Yen-Jung Chang, Shao-Cheng Wen, Yiyu Shi, Xiaowei Xu, Tsung-Yi Ho, Mei** Huang, Haiyun Yuan, Jian Zhuang

Abstract: Medical images may contain various types of artifacts with different patterns and mixtures, which depend on many factors such as scan setting, machine condition, patients' characteristics, surrounding environment, etc. However, existing deep-learning-based artifact reduction methods are restricted by their training set with specific predetermined artifact types and patterns. As such, they have lim… ▽ More Medical images may contain various types of artifacts with different patterns and mixtures, which depend on many factors such as scan setting, machine condition, patients' characteristics, surrounding environment, etc. However, existing deep-learning-based artifact reduction methods are restricted by their training set with specific predetermined artifact types and patterns. As such, they have limited clinical adoption. In this paper, we introduce One-Shot medical image Artifact Reduction (OSAR), which exploits the power of deep learning but without using pre-trained general networks. Specifically, we train a light-weight image-specific artifact reduction network using data synthesized from the input image at test-time. Without requiring any prior large training data set, OSAR can work with almost any medical images that contain varying additive artifacts which are not in any existing data sets. In addition, Computed Tomography (CT) and Magnetic Resonance Imaging (MRI) are used as vehicles and show that the proposed method can reduce artifacts better than state-of-the-art both qualitatively and quantitatively using shorter test time. △ Less

Submitted 23 October, 2021; originally announced October 2021.

arXiv:2110.06658 [pdf]

Large-Gap Quantum Spin Hall State and Temperature-Induced Lifshitz Transition in Bi4Br4

Authors: Ming Yang, Yundan Liu, Wei Zhou, Chen Liu, Dan Mu, Yani Liu, Jiaou Wang, Weichang Hao, ** Li, Jianxin Zhong, Yi Du, **cheng Zhuang

Abstract: Searching for new quantum spin Hall insulators with large fully opened energy gap to overcome the thermal disturbance at room temperature has attracted tremendous attention due to the one-dimensional (1D) spin-momentum locked topological edge states serving as dissipationless channels for the practical applications in low consumption electronics and high performance spintronics. Here, we report th… ▽ More Searching for new quantum spin Hall insulators with large fully opened energy gap to overcome the thermal disturbance at room temperature has attracted tremendous attention due to the one-dimensional (1D) spin-momentum locked topological edge states serving as dissipationless channels for the practical applications in low consumption electronics and high performance spintronics. Here, we report the investigation of topological nature of monolayer Bi4Br4 by the techniques of scanning tunneling microscopy and angle-resolved photoemission spectroscopy (ARPES). The topological non-triviality of 1D edge state integrals within the large bulk energy gap (~ 0.2 eV) is revealed by the first-principle calculations. The ARPES measurements at different temperature show a temperature-induced Lifshitz transition, corresponding to the resistivity anomaly caused by the shift of chemical potential. The connection between the emergency of superconductivity and the Lifshitz transition is discussed. △ Less

Submitted 13 October, 2021; originally announced October 2021.

arXiv:2110.05454 [pdf, other]

Momentum Centering and Asynchronous Update for Adaptive Gradient Methods

Authors: Juntang Zhuang, Yifan Ding, Tommy Tang, Nicha Dvornek, Sekhar Tatikonda, James S. Duncan

Abstract: We propose ACProp (Asynchronous-centering-Prop), an adaptive optimizer which combines centering of second momentum and asynchronous update (e.g. for $t$-th update, denominator uses information up to step $t-1$, while numerator uses gradient at $t$-th step). ACProp has both strong theoretical properties and empirical performance. With the example by Reddi et al. (2018), we show that asynchronous op… ▽ More We propose ACProp (Asynchronous-centering-Prop), an adaptive optimizer which combines centering of second momentum and asynchronous update (e.g. for $t$-th update, denominator uses information up to step $t-1$, while numerator uses gradient at $t$-th step). ACProp has both strong theoretical properties and empirical performance. With the example by Reddi et al. (2018), we show that asynchronous optimizers (e.g. AdaShift, ACProp) have weaker convergence condition than synchronous optimizers (e.g. Adam, RMSProp, AdaBelief); within asynchronous optimizers, we show that centering of second momentum further weakens the convergence condition. We demonstrate that ACProp has a convergence rate of $O(\frac{1}{\sqrt{T}})$ for the stochastic non-convex case, which matches the oracle rate and outperforms the $O(\frac{logT}{\sqrt{T}})$ rate of RMSProp and Adam. We validate ACProp in extensive empirical studies: ACProp outperforms both SGD and other adaptive optimizers in image classification with CNN, and outperforms well-tuned adaptive optimizers in the training of various GAN models, reinforcement learning and transformers. To sum up, ACProp has good theoretical properties including weak convergence condition and optimal convergence rate, and strong empirical performance including good generalization like SGD and training stability like Adam. We provide the implementation at https://github.com/juntang-zhuang/ACProp-Optimizer. △ Less

Submitted 1 December, 2021; v1 submitted 11 October, 2021; originally announced October 2021.

arXiv:2109.11724 [pdf, other]

$\texttt{express}$: extensible, high-level workflows for swifter $\textit{ab initio}$ materials modeling

Authors: Qi Zhang, Chaoxuan Gu, **gyi Zhuang, Renata M. Wentzcovitch

Abstract: In this work, we introduce an open-source $\texttt{Julia}$ project, $\texttt{express}$, an extensible, high-throughput, high-level workflow framework that aims to automate $\textit{ab initio}$ calculations for the materials science community. $\texttt{Express}$ is shipped with well-tested workflow templates, including structure optimization, equation of state (EOS) fitting, phonon spectrum (lattic… ▽ More In this work, we introduce an open-source $\texttt{Julia}$ project, $\texttt{express}$, an extensible, high-throughput, high-level workflow framework that aims to automate $\textit{ab initio}$ calculations for the materials science community. $\texttt{Express}$ is shipped with well-tested workflow templates, including structure optimization, equation of state (EOS) fitting, phonon spectrum (lattice dynamics) calculation, and thermodynamic property calculation in the framework of the quasi-harmonic approximation (QHA). It is designed to be highly modularized so that its components can be reused across various occasions, and customized workflows can be built on top of that. Users can also track the status of workflows in real-time, and rerun failed jobs thanks to the data lineage feature $\texttt{express}$ provides. Two working examples, i.e., all workflows applied to lime and akimotoite, are also presented in the code and this paper. △ Less

Submitted 23 September, 2021; originally announced September 2021.

arXiv:2109.06909 [pdf, other]

Hardware-aware Real-time Myocardial Segmentation Quality Control in Contrast Echocardiography

Authors: Dewen Zeng, Yukun Ding, Haiyun Yuan, Mei** Huang, Xiaowei Xu, Jian Zhuang, **gtong Hu, Yiyu Shi

Abstract: Automatic myocardial segmentation of contrast echocardiography has shown great potential in the quantification of myocardial perfusion parameters. Segmentation quality control is an important step to ensure the accuracy of segmentation results for quality research as well as its clinical application. Usually, the segmentation quality control happens after the data acquisition. At the data acquisit… ▽ More Automatic myocardial segmentation of contrast echocardiography has shown great potential in the quantification of myocardial perfusion parameters. Segmentation quality control is an important step to ensure the accuracy of segmentation results for quality research as well as its clinical application. Usually, the segmentation quality control happens after the data acquisition. At the data acquisition time, the operator could not know the quality of the segmentation results. On-the-fly segmentation quality control could help the operator to adjust the ultrasound probe or retake data if the quality is unsatisfied, which can greatly reduce the effort of time-consuming manual correction. However, it is infeasible to deploy state-of-the-art DNN-based models because the segmentation module and quality control module must fit in the limited hardware resource on the ultrasound machine while satisfying strict latency constraints. In this paper, we propose a hardware-aware neural architecture search framework for automatic myocardial segmentation and quality control of contrast echocardiography. We explicitly incorporate the hardware latency as a regularization term into the loss function during training. The proposed method searches the best neural network architecture for the segmentation module and quality prediction module with strict latency. △ Less

Submitted 14 September, 2021; originally announced September 2021.

Comments: 4 pages, DAC'21 invited paper

arXiv:2109.00374 [pdf, other]

ImageTBAD: A 3D Computed Tomography Angiography Image Dataset for Automatic Segmentation of Type-B Aortic Dissection

Authors: Zeyang Yao, Jiawei Zhang, Hailong Qiu, Tianchen Wang, Yiyu Shi, Jian Zhuang, Yuhao Dong, Mei** Huang, Xiaowei Xu

Abstract: Type-B Aortic Dissection (TBAD) is one of the most serious cardiovascular events characterized by a growing yearly incidence,and the severity of disease prognosis. Currently, computed tomography angiography (CTA) has been widely adopted for the diagnosis and prognosis of TBAD. Accurate segmentation of true lumen (TL), false lumen (FL), and false lumen thrombus (FLT) in CTA are crucial for the prec… ▽ More Type-B Aortic Dissection (TBAD) is one of the most serious cardiovascular events characterized by a growing yearly incidence,and the severity of disease prognosis. Currently, computed tomography angiography (CTA) has been widely adopted for the diagnosis and prognosis of TBAD. Accurate segmentation of true lumen (TL), false lumen (FL), and false lumen thrombus (FLT) in CTA are crucial for the precise quantification of anatomical features. However, existing works only focus on only TL and FL without considering FLT. In this paper, we propose ImageTBAD, the first 3D computed tomography angiography (CTA) image dataset of TBAD with annotation of TL, FL, and FLT. The proposed dataset contains 100 TBAD CTA images, which is of decent size compared with existing medical imaging datasets. As FLT can appear almost anywhere along the aorta with irregular shapes, segmentation of FLT presents a wide class of segmentation problems where targets exist in a variety of positions with irregular shapes. We further propose a baseline method for automatic segmentation of TBAD. Results show that the baseline method can achieve comparable results with existing works on aorta and TL segmentation. However, the segmentation accuracy of FLT is only 52%, which leaves large room for improvement and also shows the challenge of our dataset. To facilitate further research on this challenging problem, our dataset and codes are released to the public. △ Less

Submitted 1 September, 2021; originally announced September 2021.

arXiv:2108.11054 [pdf, other]

Understanding of Kernels in CNN Models by Suppressing Irrelevant Visual Features in Images

Authors: Jia-Xin Zhuang, Wanying Tao, Jianfei Xing, Wei Shi, Ruixuan Wang, Wei-shi Zheng

Abstract: Deep learning models have shown their superior performance in various vision tasks. However, the lack of precisely interpreting kernels in convolutional neural networks (CNNs) is becoming one main obstacle to wide applications of deep learning models in real scenarios. Although existing interpretation methods may find certain visual patterns which are associated with the activation of a specific k… ▽ More Deep learning models have shown their superior performance in various vision tasks. However, the lack of precisely interpreting kernels in convolutional neural networks (CNNs) is becoming one main obstacle to wide applications of deep learning models in real scenarios. Although existing interpretation methods may find certain visual patterns which are associated with the activation of a specific kernel, those visual patterns may not be specific or comprehensive enough for interpretation of a specific activation of kernel of interest. In this paper, a simple yet effective optimization method is proposed to interpret the activation of any kernel of interest in CNN models. The basic idea is to simultaneously preserve the activation of the specific kernel and suppress the activation of all other kernels at the same layer. In this way, only visual information relevant to the activation of the specific kernel is remained in the input. Consistent visual information from multiple modified inputs would help users understand what kind of features are specifically associated with specific kernel. Comprehensive evaluation shows that the proposed method can help better interpret activation of specific kernels than widely used methods, even when two kernels have very similar activation regions from the same input image. △ Less

Submitted 25 August, 2021; originally announced August 2021.

arXiv:2107.03642 [pdf]

Image restoration quality assessment based on regional differential information entropy

Authors: Zhiyu Wang, Jiayan Zhuang, Ningyuan Xu, Sichao Ye, Jiangjian Xiao, Chengbin Peng

Abstract: With the development of image recovery models,especially those based on adversarial and perceptual losses,the detailed texture portions of images are being recovered more naturally.However,these restored images are similar but not identical in detail texture to their reference images.With traditional image quality assessment methods,results with better subjective perceived quality often score lowe… ▽ More With the development of image recovery models,especially those based on adversarial and perceptual losses,the detailed texture portions of images are being recovered more naturally.However,these restored images are similar but not identical in detail texture to their reference images.With traditional image quality assessment methods,results with better subjective perceived quality often score lower in objective scoring.Assessment methods suffer from subjective and objective inconsistencies.This paper proposes a regional differential information entropy (RDIE) method for image quality assessment to address this problem.This approach allows better assessment of similar but not identical textural details and achieves good agreement with perceived quality.Neural networks are used to reshape the process of calculating information entropy,improving the speed and efficiency of the operation. Experiments conducted with this study image quality assessment dataset and the PIPAL dataset show that the proposed RDIE method yields a high degree of agreement with people average opinion scores compared to other image quality assessment metrics,proving that RDIE can better quantify the perceived quality of images. △ Less

Submitted 26 November, 2022; v1 submitted 8 July, 2021; originally announced July 2021.

Comments: 14 pages, 8 figures, 5 tables

arXiv:2107.03640 [pdf, other]

A Dataset and Method for Hallux Valgus Angle Estimation Based on Deep Learing

Authors: Ningyuan Xu, Jiayan Zhuang, Yaojun Wu, Jiangjian Xiao

Abstract: Angular measurements is essential to make a resonable treatment for Hallux valgus (HV), a common forefoot deformity. However, it still depends on manual labeling and measurement, which is time-consuming and sometimes unreliable. Automating this process is a thing of concern. However, it lack of dataset and the keypoints based method which made a great success in pose estimation is not suitable for… ▽ More Angular measurements is essential to make a resonable treatment for Hallux valgus (HV), a common forefoot deformity. However, it still depends on manual labeling and measurement, which is time-consuming and sometimes unreliable. Automating this process is a thing of concern. However, it lack of dataset and the keypoints based method which made a great success in pose estimation is not suitable for this field.To solve the problems, we made a dataset and developed an algorithm based on deep learning and linear regression. It shows great fitting ability to the ground truth. △ Less

Submitted 8 July, 2021; originally announced July 2021.

Comments: 7pages, 12 figures

ACM Class: I.4.7; I.2.10; I.5.1

arXiv:2106.15597 [pdf, other]

Segmentation with Multiple Acceptable Annotations: A Case Study of Myocardial Segmentation in Contrast Echocardiography

Authors: Dewen Zeng, Mingqi Li, Yukun Ding, Xiaowei Xu, Qiu Xie, Ruixue Xu, Hongwen Fei, Mei** Huang, Jian Zhuang, Yiyu Shi

Abstract: Most existing deep learning-based frameworks for image segmentation assume that a unique ground truth is known and can be used for performance evaluation. This is true for many applications, but not all. Myocardial segmentation of Myocardial Contrast Echocardiography (MCE), a critical task in automatic myocardial perfusion analysis, is an example. Due to the low resolution and serious artifacts in… ▽ More Most existing deep learning-based frameworks for image segmentation assume that a unique ground truth is known and can be used for performance evaluation. This is true for many applications, but not all. Myocardial segmentation of Myocardial Contrast Echocardiography (MCE), a critical task in automatic myocardial perfusion analysis, is an example. Due to the low resolution and serious artifacts in MCE data, annotations from different cardiologists can vary significantly, and it is hard to tell which one is the best. In this case, how can we find a good way to evaluate segmentation performance and how do we train the neural network? In this paper, we address the first problem by proposing a new extended Dice to effectively evaluate the segmentation performance when multiple accepted ground truth is available. Then based on our proposed metric, we solve the second problem by further incorporating the new metric into a loss function that enables neural networks to flexibly learn general features of myocardium. Experiment results on our clinical MCE data set demonstrate that the neural network trained with the proposed loss function outperforms those existing ones that try to obtain a unique ground truth from multiple annotations, both quantitatively and qualitatively. Finally, our grading study shows that using extended Dice as an evaluation metric can better identify segmentation results that need manual correction compared with using Dice. △ Less

Submitted 29 June, 2021; originally announced June 2021.

Comments: 12 pages

arXiv:2106.14415 [pdf, ps, other]

Exact simulation of extrinsic stress-release processes

Authors: Young Lee, Patrick J. Laub, Thomas Taimre, Hongbiao Zhao, Jiancang Zhuang

Abstract: We present a new and straightforward algorithm that simulates exact sample paths for a generalized stress-release process. The computation of the exact law of the joint interarrival times is detailed and used to derive this algorithm. Furthermore, the martingale generator of the process is derived and induces theoretical moments which generalize some results of Borovkov & Vere-Jones (2000) and are… ▽ More We present a new and straightforward algorithm that simulates exact sample paths for a generalized stress-release process. The computation of the exact law of the joint interarrival times is detailed and used to derive this algorithm. Furthermore, the martingale generator of the process is derived and induces theoretical moments which generalize some results of Borovkov & Vere-Jones (2000) and are used to demonstrate the validity of our simulation algorithm. △ Less

Submitted 28 June, 2021; originally announced June 2021.

MSC Class: 60G20 (Primary) 60G55; 65C05 (Secondary)

arXiv:2106.14344 [pdf, other]

Non-Exhaustive Learning Using Gaussian Mixture Generative Adversarial Networks

Authors: Jun Zhuang, Mohammad Al Hasan

Abstract: Supervised learning, while deployed in real-life scenarios, often encounters instances of unknown classes. Conventional algorithms for training a supervised learning model do not provide an option to detect such instances, so they miss-classify such instances with 100% probability. Open Set Recognition (OSR) and Non-Exhaustive Learning (NEL) are potential solutions to overcome this problem. Most e… ▽ More Supervised learning, while deployed in real-life scenarios, often encounters instances of unknown classes. Conventional algorithms for training a supervised learning model do not provide an option to detect such instances, so they miss-classify such instances with 100% probability. Open Set Recognition (OSR) and Non-Exhaustive Learning (NEL) are potential solutions to overcome this problem. Most existing methods of OSR first classify members of existing classes and then identify instances of new classes. However, many of the existing methods of OSR only makes a binary decision, i.e., they only identify the existence of the unknown class. Hence, such methods cannot distinguish test instances belonging to incremental unseen classes. On the other hand, the majority of NEL methods often make a parametric assumption over the data distribution, which either fail to return good results, due to the reason that real-life complex datasets may not follow a well-known data distribution. In this paper, we propose a new online non-exhaustive learning model, namely, Non-Exhaustive Gaussian Mixture Generative Adversarial Networks (NE-GM-GAN) to address these issues. Our proposed model synthesizes Gaussian mixture based latent representation over a deep generative model, such as GAN, for incremental detection of instances of emerging classes in the test data. Extensive experimental results on several benchmark datasets show that NE-GM-GAN significantly outperforms the state-of-the-art methods in detecting instances of novel classes in streaming data. △ Less

Submitted 2 July, 2021; v1 submitted 27 June, 2021; originally announced June 2021.

Comments: Accepted by ECML-PKDD 2021

arXiv:2106.10683 [pdf, other]

Solution for Large-scale Long-tailed Recognition with Noisy Labels

Authors: Yuqiao Xian, Jia-Xin Zhuang, Fufu Yu

Abstract: This is a technical report for CVPR 2021 AliProducts Challenge. AliProducts Challenge is a competition proposed for studying the large-scale and fine-grained commodity image recognition problem encountered by worldleading ecommerce companies. The large-scale product recognition simultaneously meets the challenge of noisy annotations, imbalanced (long-tailed) data distribution and fine-grained clas… ▽ More This is a technical report for CVPR 2021 AliProducts Challenge. AliProducts Challenge is a competition proposed for studying the large-scale and fine-grained commodity image recognition problem encountered by worldleading ecommerce companies. The large-scale product recognition simultaneously meets the challenge of noisy annotations, imbalanced (long-tailed) data distribution and fine-grained classification. In our solution, we adopt stateof-the-art model architectures of both CNNs and Transformer, including ResNeSt, EfficientNetV2, and DeiT. We found that iterative data cleaning, classifier weight normalization, high-resolution finetuning, and test time augmentation are key components to improve the performance of training with the noisy and imbalanced dataset. Finally, we obtain 6.4365% mean class error rate in the leaderboard with our ensemble model. △ Less

Submitted 20 June, 2021; originally announced June 2021.

Comments: 3 pages

Journal ref: CVPR 2021 AliProducts Challenge: CVPR 2021 AliProducts Challenge:Large-scale Product Recognition, Technical Report

arXiv:2106.09157 [pdf, other]

Positional Contrastive Learning for Volumetric Medical Image Segmentation

Authors: Dewen Zeng, Yawen Wu, Xinrong Hu, Xiaowei Xu, Haiyun Yuan, Mei** Huang, Jian Zhuang, **gtong Hu, Yiyu Shi

Abstract: The success of deep learning heavily depends on the availability of large labeled training sets. However, it is hard to get large labeled datasets in medical image domain because of the strict privacy concern and costly labeling efforts. Contrastive learning, an unsupervised learning technique, has been proved powerful in learning image-level representations from unlabeled data. The learned encode… ▽ More The success of deep learning heavily depends on the availability of large labeled training sets. However, it is hard to get large labeled datasets in medical image domain because of the strict privacy concern and costly labeling efforts. Contrastive learning, an unsupervised learning technique, has been proved powerful in learning image-level representations from unlabeled data. The learned encoder can then be transferred or fine-tuned to improve the performance of downstream tasks with limited labels. A critical step in contrastive learning is the generation of contrastive data pairs, which is relatively simple for natural image classification but quite challenging for medical image segmentation due to the existence of the same tissue or organ across the dataset. As a result, when applied to medical image segmentation, most state-of-the-art contrastive learning frameworks inevitably introduce a lot of false-negative pairs and result in degraded segmentation quality. To address this issue, we propose a novel positional contrastive learning (PCL) framework to generate contrastive data pairs by leveraging the position information in volumetric medical images. Experimental results on CT and MRI datasets demonstrate that the proposed PCL method can substantially improve the segmentation performance compared to existing methods in both semi-supervised setting and transfer learning setting. △ Less

Submitted 28 September, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

Comments: 8 pages, conference

arXiv:2106.08793 [pdf, other]

doi 10.1016/j.jheap.2021.06.001

The late flare in tidal disruption events due to the interaction of disk wind with dusty torus

Authors: Jialun Zhuang, Rong-Feng Shen

Abstract: A late (t $\sim$ 1,500 days) multi-wavelength (UV, optical, IR, and X-ray) flare was found in PS1-10adi, a tidal disruption event (TDE) candidate that took place in an active galactic nucleus (AGN). TDEs usually involve super-Eddington accretion, which drives fast mass outflow (disk wind). So here we explore a possible scenario that such a flare might be produced by the interaction of the disk win… ▽ More A late (t $\sim$ 1,500 days) multi-wavelength (UV, optical, IR, and X-ray) flare was found in PS1-10adi, a tidal disruption event (TDE) candidate that took place in an active galactic nucleus (AGN). TDEs usually involve super-Eddington accretion, which drives fast mass outflow (disk wind). So here we explore a possible scenario that such a flare might be produced by the interaction of the disk wind with a dusty torus for TDEs in AGN. Due to the high velocity of the disk wind, strong shocks will emerge and convert the bulk of the kinetic energy of the disk wind to radiation. We calculate the dynamics and then predict the associated radiation signatures, taking into account the widths of the wind and torus. We compare our model with the bolometric light curve of the late flare in PS1-10adi constructed from observations. We find from our modeling that the disk wind has a total kinetic energy of about $10^{51}$ erg and a velocity of 0.1 c (i.e., a mass of 0.3 $M_{\odot}$); the gas number density of the clouds in the torus is $3\times 10^{7}$ $\rm cm^{-3}$. Observation of such a late flare can be an evidence of the disk wind in TDEs and can be used as a tool to explore the nuclear environment of the host. △ Less

Submitted 18 June, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

Comments: 12 pages, 7 figures. Accepted for publication in Journal of High Energy Astrophysics

Journal ref: 2021, JHEAp, 32, 11

arXiv:2106.06144 [pdf, other]

doi 10.1103/PhysRevLett.128.083202

Defect-free arbitrary-geometry assembly of mixed-species atom arrays

Authors: Cheng Sheng, Jiayi Hou, Xiaodong He, Kunpeng Wang, Ruijun Guo, Jun Zhuang, Bahtiyar Mamat, Peng Xu, Min Liu, ** Wang, Mingsheng Zhan

Abstract: Optically trapped mixed-species single atom arrays with arbitrary geometries are an attractive and promising platform for various applications, because tunable quantum systems with multiple components provide extra degrees of freedom for experimental control. Here, we report the first demonstration of two-dimensional $6\times4$ dual-species atom assembly with a filling fraction of 0.88 (0.89) for… ▽ More Optically trapped mixed-species single atom arrays with arbitrary geometries are an attractive and promising platform for various applications, because tunable quantum systems with multiple components provide extra degrees of freedom for experimental control. Here, we report the first demonstration of two-dimensional $6\times4$ dual-species atom assembly with a filling fraction of 0.88 (0.89) for $^{85}$Rb ($^{87}$Rb) atoms. This mixed-species atomic synthetic is achieved via rearranging initially randomly distributed atoms using a sorting algorithm (heuristic heteronuclear algorithm) which is proposed for bottom-up atom assembly with both user-defined geometries and two-species atom number ratios. Our fully tunable hybrid-atom system of scalable advantages is a good starting point for high-fidelity quantum logic, many-body quantum simulation and forming defect-free single molecule arrays. △ Less

Submitted 10 June, 2021; originally announced June 2021.

Journal ref: Phys. Rev. Lett. 128, 083202 (2022)

arXiv:2106.05504 [pdf]

doi 10.1021/acs.jpcc.1c03010

Significantly Enhanced Performance of Nanofluidic Osmotic Power Generation by Slip** Surfaces of Nanopores

Authors: Long Ma, Kabin Lin, Yinghua Qiu, Jiakun Zhuang, Xuan An, Zhishan Yuan, Chuanzhen Huang

Abstract: High-performance osmotic energy conversion (OEC) with perm-selective porous membrane requires both high ionic selectivity and permeability simultaneously. Here, hydrodynamic slip is considered on surfaces of nanopores to break the tradeoff between ionic selectivity and permeability, because it decreases the viscous friction at solid-liquid interfaces which can promote ionic diffusion during OEC. T… ▽ More High-performance osmotic energy conversion (OEC) with perm-selective porous membrane requires both high ionic selectivity and permeability simultaneously. Here, hydrodynamic slip is considered on surfaces of nanopores to break the tradeoff between ionic selectivity and permeability, because it decreases the viscous friction at solid-liquid interfaces which can promote ionic diffusion during OEC. Taking advantage of simulations, influences from individual slip** surfaces on the OEC performance have been investigated, i.e. the slip** inner surface (surfaceinner) and exterior surfaces on the low- and high-concentration sides (surfaceL and surfaceH). Results show that the slip** surfaceL is crucial for high-performance OEC. For nanopores with various lengths, the slip** surfaceL simultaneously increases both ionic permeability and selectivity of nanopores, which results in both significantly enhanced electric power and energy conversion efficiency. While for nanopores longer than 30 nm, the slip** surfaceinner plays a dominant role in the increase of electric power, which induces a considerable decrease in energy conversion efficiency due to enhanced transport of both cations and anions. Considering the difficulty in hydrodynamic slip modification to the surfaceinner of nanopores, the surface modification to the surfaceL may be a better choice to achieve high-performance OEC. Our results provide feasible guidance to the design of porous membranes for high-performance osmotic energy harvesting. △ Less

Submitted 10 June, 2021; originally announced June 2021.

Comments: 25 pages, 7 figures

Journal ref: The Journal of Physical Chemistry C, 2021

arXiv:2105.08267 [pdf, other]

EchoCP: An Echocardiography Dataset in Contrast Transthoracic Echocardiography for Patent Foramen Ovale Diagnosis

Authors: Tianchen Wang, Zhihe Li, Mei** Huang, Jian Zhuang, Shanshan Bi, Jiawei Zhang, Yiyu Shi, Hongwen Fei, Xiaowei Xu

Abstract: Patent foramen ovale (PFO) is a potential separation between the septum, primum and septum secundum located in the anterosuperior portion of the atrial septum. PFO is one of the main factors causing cryptogenic stroke which is the fifth leading cause of death in the United States. For PFO diagnosis, contrast transthoracic echocardiography (cTTE) is preferred as being a more robust method compared… ▽ More Patent foramen ovale (PFO) is a potential separation between the septum, primum and septum secundum located in the anterosuperior portion of the atrial septum. PFO is one of the main factors causing cryptogenic stroke which is the fifth leading cause of death in the United States. For PFO diagnosis, contrast transthoracic echocardiography (cTTE) is preferred as being a more robust method compared with others. However, the current PFO diagnosis through cTTE is extremely slow as it is proceeded manually by sonographers on echocardiography videos. Currently there is no publicly available dataset for this important topic in the community. In this paper, we present EchoCP, as the first echocardiography dataset in cTTE targeting PFO diagnosis. EchoCP consists of 30 patients with both rest and Valsalva maneuver videos which covers various PFO grades. We further establish an automated baseline method for PFO diagnosis based on the state-of-the-art cardiac chamber segmentation technique, which achieves 0.89 average mean Dice score, but only 0.60/0.67 mean accuracies for PFO diagnosis, leaving large room for improvement. We hope that the challenging EchoCP dataset can stimulate further research and lead to innovative and generic solutions that would have an impact in multiple domains. Our dataset is released. △ Less

Submitted 15 September, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

Comments: MICCAI2021

Showing 51–100 of 463 results for author: Zhuang, J