Search | arXiv e-print repository

CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer

Authors: Hualian Sheng, Sijia Cai, Na Zhao, Bing Deng, Qiao Liang, Min-Jian Zhao, Jieping Ye

Abstract: The field of 3D object detection from point clouds is rapidly advancing in computer vision, aiming to accurately and efficiently detect and localize objects in three-dimensional space. Current 3D detectors commonly fall short in terms of flexibility and scalability, with ample room for advancements in performance. In this paper, our objective is to address these limitations by introducing two frameworks for 3D object detection with minimal hand-crafted design. Firstly, we propose CT3D, which sequentially performs raw-point-based embedding, a standard Transformer encoder, and a channel-wise decoder for point features within each proposal. Secondly, we present an enhanced network called CT3D++, which incorporates geometric and semantic fusion-based embedding to extract more valuable and comprehensive proposal-aware information. Additionally, CT3D++ utilizes a point-to-key bidirectional encoder for more efficient feature encoding with reduced computational cost. By replacing the corresponding components of CT3D with these novel modules, CT3D++ achieves state-of-the-art performance on both the KITTI dataset and the large-scale Waymo Open Dataset. The source code for our frameworks will be made accessible at https://github.com/hlsheng1/CT3D-plusplus.

Submitted 12 June, 2024; originally announced June 2024.

Comments: 19 pages, 8 figures

arXiv:2406.04875 [pdf, other]

3DRealCar: An In-the-wild RGB-D Car Dataset with 360-degree Views

Authors: Xiaobiao Du, Haiyang Sun, Shuyun Wang, Zhuojie Wu, Hongwei Sheng, Jiaying Ying, Ming Lu, Tianqing Zhu, Kun Zhan, Xin Yu

Abstract: 3D cars are commonly used in self-driving systems, virtual/augmented reality, and games. However, existing 3D car datasets are either synthetic or low-quality, presenting a significant gap toward the high-quality real-world 3D car datasets and limiting their applications in practical scenarios. In this paper, we propose the first large-scale 3D real car dataset, termed 3DRealCar, offering three distinctive features. (1) High-Volume: 2,500 cars are meticulously scanned by 3D scanners, obtaining car images and point clouds with real-world dimensions; (2) High-Quality: Each car is captured in an average of 200 dense, high-resolution 360-degree RGB-D views, enabling high-fidelity 3D reconstruction; (3) High-Diversity: The dataset contains various cars from over 100 brands, collected under three distinct lighting conditions, including reflective, standard, and dark. Additionally, we offer detailed car parsing maps for each instance to promote research in car parsing tasks. Moreover, we remove background point clouds and standardize the car orientation to a unified axis for the reconstruction only on cars without background and controllable rendering. We benchmark 3D reconstruction results with state-of-the-art methods across each lighting condition in 3DRealCar. Extensive experiments demonstrate that the standard lighting condition part of 3DRealCar can be used to produce a large number of high-quality 3D cars, improving various 2D and 3D tasks related to cars. Notably, our dataset brings insight into the fact that recent 3D reconstruction methods face challenges in reconstructing high-quality 3D cars under reflective and dark lighting conditions. Our dataset is available at https://xiaobiaodu.github.io/3drealcar/.

Submitted 7 June, 2024; originally announced June 2024.

Comments: Project Page: https://xiaobiaodu.github.io/3drealcar

arXiv:2405.10681 [pdf, other]

Know in AdVance: Linear-Complexity Forecasting of Ad Campaign Performance with Evolving User Interest

Authors: XiaoYu Wang, YongHui Guo, Hui Sheng, Peili Lv, Chi Zhou, Wei Huang, ShiQin Ta, Dongbo Huang, Xiuping Yang, Lan Xu, Hao Zhou, Yusheng Ji

Abstract: Real-time Bidding (RTB) advertisers wish to know in advance the expected cost and yield of ad campaigns to avoid trial-and-error expenses. However, Campaign Performance Forecasting (CPF), a sequence modeling task involving tens of thousands of ad auctions, poses challenges of evolving user interest, auction representation, and long context, making coarse-grained and static-modeling methods sub-optimal. We propose AdVance, a time-aware framework that integrates local auction-level and global campaign-level modeling. User preference and fatigue are disentangled using a time-positioned sequence of clicked items and a concise vector of all displayed items. Cross-attention, conditioned on the fatigue vector, captures the dynamics of user interest toward each candidate ad. Bidders compete with each other, presenting a complete graph similar to the self-attention mechanism. Hence, we employ a Transformer Encoder to compress each auction into embedding by solving auxiliary tasks. These sequential embeddings are then summarized by a conditional state space model (SSM) to comprehend long-range dependencies while maintaining global linear complexity. Considering the irregular time intervals between auctions, we make SSM's parameters dependent on the current auction embedding and the time interval. We further condition SSM's global predictions on the accumulation of local results. Extensive evaluations and ablation studies demonstrate its superiority over state-of-the-art methods. AdVance has been deployed on the Tencent Advertising platform, and A/B tests show a remarkable 4.5% uplift in Average Revenue per User (ARPU).

Submitted 17 May, 2024; originally announced May 2024.

Comments: 12 pages, 4 figures, accepted at ACM SIGKDD 2024

arXiv:2405.09883 [pdf, other]

RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception

Authors: Xiaosu Zhu, Hualian Sheng, Sijia Cai, Bing Deng, Shaopeng Yang, Qiao Liang, Ken Chen, Lianli Gao, Jingkuan Song, Jieping Ye

Abstract: We introduce RoScenes, the largest multi-view roadside perception dataset, which aims to shed light on the development of vision-centric Bird's Eye View (BEV) approaches for more challenging traffic scenes. The highlights of RoScenes include a significantly large perception area, full scene coverage, and crowded traffic. More specifically, our dataset achieves a surprising 21.13M 3D annotations within 64,000 $m^2$. To relieve the expensive costs of roadside 3D labeling, we present a novel BEV-to-3D joint annotation pipeline to efficiently collect such a large volume of data. After that, we organize a comprehensive study for current BEV methods on RoScenes in terms of effectiveness and efficiency. Tested methods suffer from the vast perception area and variation of sensor layout across scenes, resulting in performance levels falling below expectations. To this end, we propose RoBEV that incorporates feature-guided position embedding for effective 2D-3D feature assignment. With its help, our method outperforms state-of-the-art by a large margin without extra computational overhead on the validation set. Our dataset and devkit will be made available at https://github.com/xiaosu-zhu/RoScenes.

Submitted 19 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

Comments: Technical report. 32 pages, 21 figures, 13 tables. https://github.com/xiaosu-zhu/RoScenes

arXiv:2404.18372 [pdf, other]

Integrable semi-discretization for a modified Camassa-Holm equation with cubic nonlinearity

Authors: Bao-Feng Feng, Heng-Chun Hu, Han-Han Sheng, Wei Yin, Guo-Fu Yu

Abstract: In the present paper, an integrable semi-discretization of the modified Camassa-Holm (mCH) equation with cubic nonlinearity is presented. The key points of the construction are based on the discrete Kadomtsev-Petviashvili (KP) equation and appropriate definition of discrete reciprocal transformations. First, we demonstrate that these bilinear equations and their determinant solutions can be derived from the discrete KP equation through Miwa transformation and some reductions. Then, by scrutinizing the reduction process, we obtain a set of semi-discrete bilinear equations and their general soliton solutions in the Gram-type determinant form. Finally, we obtain an integrable semi-discrete analog of the mCH equation by introducing dependent variables and discrete reciprocal transformation. It is also shown that the semi-discrete mCH equation converges to the continuous one in the continuum limit.

Submitted 28 April, 2024; originally announced April 2024.

arXiv:2403.19169 [pdf, ps, other]

Static Manifolds with Boundary and Rigidity of Scalar Curvature and Mean Curvature

Authors: Hongyi Sheng

Abstract: On a compact manifold with boundary, the map consisting of the scalar curvature in the interior and the mean curvature on the boundary is a local surjection at generic metrics. Moreover, this result may be localized to compact subdomains in an arbitrary Riemannian manifold with boundary. The non-generic case (also called non-generic domains) corresponds to static manifolds with boundary. We discuss their geometric properties, which also work as the necessary conditions of non-generic metrics. In space forms and the Schwarzschild manifold, we classify simple non-generic domains (with only one boundary component) and show their connection with rigidity theorems and the Schwarzschild photon sphere.

Submitted 28 March, 2024; originally announced March 2024.

arXiv:2402.08619 [pdf, ps, other]

Localized Deformation of the Scalar Curvature and the Mean Curvature

Authors: Hongyi Sheng

Abstract: On a compact manifold with boundary, the map consisting of the scalar curvature in the interior and the mean curvature on the boundary is a local surjection at generic metrics. We prove that this result may be localized to compact subdomains in an arbitrary Riemannian manifold with boundary. This result is a generalization of Corvino's result about localized scalar curvature deformations; however, the existence part needs to be handled delicately since the problem is non-variational. We also briefly discuss generic conditions that guarantee localized deformations, and some related geometric properties.

Submitted 8 March, 2024; v1 submitted 12 January, 2024; originally announced February 2024.

arXiv:2402.06499 [pdf, other]

BarlowTwins-CXR: Enhancing Chest X-Ray abnormality localization in heterogeneous data with cross-domain self-supervised learning

Authors: Haoyue Sheng, Linrui Ma, Jean-Francois Samson, Dianbo Liu

Abstract: Background: Chest X-ray imaging-based abnormality localization, essential in diagnosing various diseases, faces significant clinical challenges due to complex interpretations and the growing workload of radiologists. While recent advances in deep learning offer promising solutions, there is still a critical issue of domain inconsistency in cross-domain transfer learning, which hampers the efficiency and accuracy of diagnostic processes. This study aims to address the domain inconsistency problem and improve autonomic abnormality localization performance of heterogeneous chest X-ray image analysis, by developing a self-supervised learning strategy called "BarlowTwins-CXR". Methods: We utilized two publicly available datasets: the NIH Chest X-ray Dataset and the VinDr-CXR. The BarlowTwins-CXR approach was conducted in a two-stage training process. Initially, self-supervised pre-training was performed using an adjusted Barlow Twins algorithm on the NIH dataset with a ResNet50 backbone pre-trained on ImageNet. This was followed by supervised fine-tuning on the VinDr-CXR dataset using Faster R-CNN with Feature Pyramid Network (FPN). Results: Our experiments showed a significant improvement in model performance with BarlowTwins-CXR. The approach achieved a 3% increase in mAP50 accuracy compared to traditional ImageNet pre-trained models. In addition, the Ablation CAM method revealed enhanced precision in localizing chest abnormalities. Conclusion: BarlowTwins-CXR significantly enhances the efficiency and accuracy of chest X-ray image-based abnormality localization, outperforming traditional transfer learning methods and effectively overcoming domain inconsistency in cross-domain scenarios. Our experiment results demonstrate the potential of using self-supervised learning to improve the generalizability of models in medical settings with limited amounts of heterogeneous data.

Submitted 9 February, 2024; originally announced February 2024.

Comments: 15 pages, 7 figures, 3 tables

ACM Class: I.2.1; J.3; I.4.9

arXiv:2401.01222 [pdf, other]

Excitonic Instability in Ta2Pd3Te5 monolayer

Authors: Jingyu Yao, Haohao Sheng, Ruihan Zhang, Rongtian Pang, Jin-Jian Zhou, Quansheng Wu, Hongming Weng, Xi Dai, Zhong Fang, Zhijun Wang

Abstract: By systematic theoretical calculations, we have revealed an excitonic insulator (EI) in a van der Waals (vdW) layered compound Ta2Pd3Te5. The interlayer binding energy in the vdW layered compound is 19.6 meV/Å$^2$. The computed phonon spectrum suggests that the monolayer is dynamically stable without lattice distortion. The monolayer can be obtained by exfoliation or molecular-beam epitaxy. First-principles calculations show that the monolayer is a nearly zero-gap semiconductor with the modified Becke-Johnson functional. Due to the like symmetry of the band-edge states, the 2D polarization $\alpha_{2D}$ would be finite as the band gap goes to zero, allowing for the EI state in the compound. Using the first-principles many-body perturbation theory, the $GW$-BSE calculation reveals that the exciton binding energy $E_b$ is larger than the single-particle band gap $E_g$, indicating the excitonic instability. Our findings suggest that the Ta2Pd3Te5 monolayer is an excitonic insulator without structural distortion.

Submitted 8 May, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

Comments: 6 pages, 4 figures

arXiv:2312.15570 [pdf]

Relativistic artificial molecules with tunable coupling and orbitals

Authors: Xiao-Feng Zhou, Yu-Chen Zhuang, Mo-Han Zhang, Hao Sheng, Qing-Feng Sun, Lin He

Abstract: In a molecule formed by two atoms, the energy difference between bonding and antibonding orbitals should depend on the distance between the two atoms. However, exploring molecular orbitals of two natural atoms with tunable distance has remained an outstanding experimental challenge. Graphene quantum dots (GQDs) can be viewed as relativistic artificial atoms, therefore offering a unique platform to study molecular physics. Here, through scanning tunneling microscope (STM), we create and directly visualize the formation process of relativistic artificial molecules based on two coupled GQDs with tunable distance. Our study indicates that the energy difference between the bonding and antibonding orbitals of the lowest quasibound state increases linearly with the inverse distance of the two GQDs due to the relativistic nature of the artificial molecule. For quasibound states with higher orbital momenta, the coupling between these states leads to half-energy spacing of the confined states because the length of the molecular-like orbit is about twice that of the atomic-like orbit. Evolution from ring-like whispering-gallery modes in the artificial atoms to figure-eight orbitals in the artificial molecules is directly imaged. The ability to resolve the coupling and orbitals of the relativistic artificial molecule at the nanoscale level yields insights into the behavior of quantum-relativistic matter.

Submitted 24 December, 2023; originally announced December 2023.

arXiv:2312.14455 [pdf, other]

doi 10.1103/PhysRevX.14.011046

Evidence for an Excitonic Insulator State in Ta$_2$Pd$_3$Te$_5$

Authors: Jierui Huang, Bei Jiang, Jingyu Yao, Dayu Yan, Xincheng Lei, Jiacheng Gao, Zhaopeng Guo, Feng Jin, Yupeng Li, Zhenyu Yuan, Congcong Chai, Haohao Sheng, Mojun Pan, Famin Chen, Junde Liu, Shunye Gao, Gexing Qu, Bo Liu, Zhicheng Jiang, Zhengtai Liu, Xiaoyan Ma, Shiming Zhou, Yaobo Huang, Chenxia Yun, Qingming Zhang, et al. (8 additional authors not shown)

Abstract: The excitonic insulator (EI) is an exotic ground state of narrow-gap semiconductors and semimetals arising from spontaneous condensation of electron-hole pairs bound by attractive Coulomb interaction. Despite research on EIs dating back to half a century ago, their existence in real materials remains a subject of ongoing debate. In this study, through systematic experimental and theoretical investigations, we provide evidence for the existence of an EI ground state in a van der Waals compound Ta$_2$Pd$_3$Te$_5$. Density-functional-theory calculations suggest that it is a semimetal with a small band overlap, whereas various experiments exhibit an insulating ground state with a clear band gap. Upon incorporating electron-hole Coulomb interaction into our calculations, we obtain an EI phase where the electronic symmetry breaking opens a many-body gap. Angle-resolved photoemission spectroscopy measurements exhibit that the band gap is closed with a significant change in the dispersions as the number of thermally excited charge carriers becomes sufficiently large in both equilibrium and nonequilibrium states. Structural measurements reveal a slight breaking of crystal symmetry with exceptionally small lattice distortion in the insulating state, which cannot account for the significant gap opening. Therefore, we attribute the insulating ground state with a gap opening in Ta$_2$Pd$_3$Te$_5$ to exciton condensation, where the coupling to the symmetry-breaking electronic state induces a subtle change in the crystal structure.

Submitted 14 March, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

Comments: 10 pages, 5 figures

Journal ref: Phys. Rev. X 14, 011046, 2024

arXiv:2312.13045 [pdf, ps, other]

Feasibility Conditions for Mobile LiFi

Authors: Shuai Ma, Haihong Sheng, Junchang Sun, Hang Li, Xiaodong Liu, Chen Qiu, Majid Safari, Naofal Al-Dhahir, Shiyin Li

Abstract: Light fidelity (LiFi) is a potential key technology for future 6G networks. However, its feasibility of supporting mobile communications has not been fundamentally discussed. In this paper, we investigate the time-varying channel characteristics of mobile LiFi based on measured mobile phone rotation and movement data. Specifically, we define LiFi channel coherence time to evaluate the correlation of the channel timing sequence. Then, we derive the expression of the LiFi transmission rate based on M-ary pulse-amplitude modulation (M-PAM). The derived rate expression indicates that mobile LiFi communication is feasible by using at least two photodiodes (PDs) with different orientations. Further, we propose two channel estimation schemes and a LiFi channel tracking scheme to improve the communication performance. Finally, our experimental results show that the channel coherence time is on the order of tens of milliseconds, which indicates a relatively stable channel. In addition, based on the measured data, better communication performance can be realized in the multiple-input multiple-output (MIMO) scenario with a rate of 36 Mbit/s, compared to other scenarios. The results also show that the proposed channel estimation and tracking schemes are effective in designing mobile LiFi systems.

Submitted 20 December, 2023; originally announced December 2023.

arXiv:2312.08729 [pdf, other]

doi 10.1088/0256-307X/40/12/127101

VASP2KP: kp models and Lande g-factors from ab initio calculations

Authors: Sheng Zhang, Haohao Sheng, Zhi-Da Song, Chenhao Liang, Yi Jiang, Song Sun, Quansheng Wu, Hongming Weng, Zhong Fang, Xi Dai, Zhijun Wang

Abstract: The $k\cdot p$ method is significant in condensed matter physics for the compact and analytical Hamiltonian. In the presence of magnetic field, it is described by the effective Zeeman's coupling Hamiltonian with Landé $g$-factors. Here, we develop an open-source package VASP2KP (including two parts: vasp2mat and mat2kp) to compute $k\cdot p$ parameters and Landé $g$-factors directly from the wavefunctions provided by the density functional theory (DFT) as implemented in Vienna ab initio Simulation Package (VASP). First, we develop a VASP patch vasp2mat to compute matrix representations of the generalized momentum operator $\mathbf{\hat{\pi}}=\mathbf{\hat{p}}+\frac{1}{2mc^2}\left(\mathbf{\hat{s}}\times\nabla V(\mathbf{r})\right)$, spin operator $\mathbf{\hat{s}}$, time reversal operator $\hat{T}$ and crystalline symmetry operators $\hat{R}$ on the DFT wavefunctions. Second, we develop a Python code mat2kp to obtain the unitary transformation $U$ that rotates the degenerate DFT basis towards the standard basis, and then automatically compute the $k\cdot p$ parameters and $g$-factors. The theory and the methodology behind VASP2KP are described in detail. The matrix elements of the operators are derived comprehensively and computed correctly within the projector augmented wave method. We apply this package to some materials, e.g., Bi$_2$Se$_3$, Na$_3$Bi, Te, InAs and 1H-TMD monolayers. The obtained effective model's dispersions are in good agreement with the DFT data around the specific wave vector, and the $g$-factors are consistent with experimental data. The VASP2KP package is available at https://github.com/zjwang11/VASP2KP.

Submitted 14 December, 2023; originally announced December 2023.

Journal ref: Chin. Phys. Lett. 40, 127101 (2023)

arXiv:2310.05425 [pdf, other]

Divide and Ensemble: Progressively Learning for the Unknown

Authors: Hu Zhang, Xin Shen, Heming Du, Huiqiang Chen, Chen Liu, Hongwei Sheng, Qingzheng Xu, MD Wahiduzzaman Khan, Qingtao Yu, Tianqing Zhu, Scott Chapman, Zi Huang, Xin Yu

Abstract: In the wheat nutrient deficiencies classification challenge, we present the DividE and EnseMble (DEEM) method for progressive test data predictions. We find that (1) test images are provided in the challenge; (2) samples are equipped with their collection dates; (3) the samples of different dates show notable discrepancies. Based on the findings, we partition the dataset into discrete groups by the dates and train models on each divided group. We then adopt the pseudo-labeling approach to label the test data and incorporate those with high confidence into the training set. In pseudo-labeling, we leverage an ensemble of models with different architectures to enhance the reliability of predictions. The pseudo-labeling and ensembled model training are iteratively conducted until all test samples are labeled. Finally, the separated models for each group are unified to obtain the model for the whole dataset. Our method achieves an average of 93.6% Top-1 test accuracy (94.0% on WW2020 and 93.2% on WR2021) and wins 1st place in the Deep Nutrient Deficiency Challenge (https://cvppa2023.github.io/challenges/).

Submitted 9 October, 2023; originally announced October 2023.

arXiv:2309.02791 [pdf, other]

Seismic Foundation Model (SFM): a new generation deep learning model in geophysics

Authors: Hanlin Sheng, Xinming Wu, Xu Si, Jintao Li, Sibo Zhang, Xudong Duan

Abstract: While computer science has seen remarkable advancements in foundation models, they remain underexplored in geoscience. Addressing this gap, we introduce a workflow to develop geophysical foundation models, including data preparation, model pre-training, and adaptation to downstream tasks. From 192 globally collected 3-D seismic volumes, we create a carefully curated dataset of 2,286,422 2-D seismic images. Fully using these unlabeled images, we employ self-supervised learning to pre-train a Transformer-based Seismic Foundation Model (SFM) for producing all-purpose seismic features that work across various tasks and surveys. Through experiments on seismic facies classification, geobody identification, interpolation, denoising, and inversion, our pre-trained model demonstrates versatility, generalization, scalability, and superior performance over baseline models. In conclusion, we provide a foundation model and vast dataset to advance AI in geophysics, addressing challenges (poor generalization, lacking labels, and repetitive training for task-specified models) of applying AI in geophysics and paving the way for future innovations in geoscience.

Submitted 15 December, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

Comments: 27 pages, 9 figures, and 4 tables

arXiv:2309.02320 [pdf, other]

SeisCLIP: A seismology foundation model pre-trained by multi-modal data for multi-purpose seismic feature extraction

Authors: Xu Si, Xinming Wu, Hanlin Sheng, Jun Zhu, Zefeng Li

Abstract: Training specific deep learning models for particular tasks is common across various domains within seismology. However, this approach encounters two limitations: inadequate labeled data for certain tasks and limited generalization across regions. To address these challenges, we develop SeisCLIP, a seismology foundation model trained through contrastive learning from multi-modal data. It consists of a transformer encoder for extracting crucial features from time-frequency seismic spectrum and an MLP encoder for integrating the phase and source information of the same event. These encoders are jointly pre-trained on a vast dataset and the spectrum encoder is subsequently fine-tuned on smaller datasets for various downstream tasks. Notably, SeisCLIP's performance surpasses that of baseline methods in event classification, localization, and focal mechanism analysis tasks, employing distinct datasets from different regions. In conclusion, SeisCLIP holds significant potential as a foundational model in the field of seismology, paving the way for innovative directions in foundation-model-based seismology research.

Submitted 5 September, 2023; originally announced September 2023.

Comments: 27 pages, 9 figures, 4 tables

arXiv:2308.12055 [pdf, other]

Majorana corner modes in unconventional monolayers of 1T-PtSe2 family

Authors: Haohao Sheng, Yue Xie, Quansheng Wu, Hongming Weng, Xi Dai, B. Andrei Bernevig, Zhong Fang, Zhijun Wang

Abstract: In this work, we propose that Majorana zero modes can be realized at the corners of a topologically trivial insulator with unconventionality. We demonstrate that 1T-PtSe$_2$ is a symmetry indicator-free (SI-free) unconventional insulator, originating from orbital hybridization between Pt $d$ and Se $p_{x,y}$ states. The new kind of SI-free unconventionality has no symmetry eigenvalue indication. Instead, it is diagnosed directly by the Wannier charge centers by using the one-dimensional Wilson loop method. The obstructed edge states exhibit strong anisotropy and large Rashba splitting. By introducing superconducting proximity and external magnetic field, the Majorana corner modes can be obtained in 1T-PtSe$_2$ monolayer. In the end, we construct a two-Bernevig-Hughes-Zhang model with anisotropy to capture the Majorana physics.

Submitted 14 December, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

arXiv:2307.10497 [pdf, other]

Integrable discretizations for a generalized sine-Gordon equation and the reductions to the sine-Gordon equation and the short pulse equation

Authors: Han-Han Sheng, Bao-Feng Feng, Guo-Fu Yu

Abstract: In this paper, we propose fully discrete analogues of a generalized sine-Gordon (gsG) equation $u_{t x}=\left(1+ν\partial_x^2\right) \sin u$. The bilinear equations of the discrete KP hierarchy and the proper definition of discrete hodograph transformations are the keys to the construction. Then we derive semi-discrete analogues of the gsG equation from the fully discrete gsG equation by taking the temporal parameter $b\rightarrow0$. In particular, one fully discrete gsG equation is reduced to a semi-discrete gsG equation in the case of $ν=-1$ (Feng et al., Numer. Algorithms, 2023). Furthermore, $N$-soliton solutions to the semi- and fully discrete analogues of the gsG equation in the determinant form are constructed. Dynamics of one- and two-soliton solutions for the discrete gsG equations are discussed with plots. We also investigate the reductions to the sine-Gordon (sG) equation and the short pulse (SP) equation. By introducing an important parameter $c$, we demonstrate that the gsG equation reduces to the sG equation and the SP equation, and the discrete gsG equation reduces to the discrete sG equation and the discrete SP equation, respectively, in the appropriate scaling limit. The limiting forms of the $N$-soliton solutions to the gsG equation also correspond to those of the sG equation and the SP equation.

Submitted 19 July, 2023; originally announced July 2023.

arXiv:2307.06577 [pdf, other]

RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel Segmentation

Authors: MD Wahiduzzaman Khan, Hongwei Sheng, Hu Zhang, Heming Du, Sen Wang, Minas Theodore Coroneo, Farshid Hajati, Sahar Shariflou, Michael Kalloniatis, Jack Phu, Ashish Agar, Zi Huang, Mojtaba Golzan, Xin Yu

Abstract: Retinal vessel segmentation is generally grounded in image-based datasets collected with bench-top devices. The static images naturally lose the dynamic characteristics of retina fluctuation, resulting in diminished dataset richness, and the usage of bench-top devices further restricts dataset scalability due to its limited accessibility. Considering these limitations, we introduce the first video-based retinal dataset by employing handheld devices for data acquisition. The dataset comprises 635 smartphone-based fundus videos collected from four different clinics, involving 415 patients from 50 to 75 years old. It delivers comprehensive and precise annotations of retinal structures in both spatial and temporal dimensions, aiming to advance the landscape of vasculature segmentation. Specifically, the dataset provides three levels of spatial annotations: binary vessel masks for overall retinal structure delineation, general vein-artery masks for distinguishing the vein and artery, and fine-grained vein-artery masks for further characterizing the granularities of each artery and vein. In addition, the dataset offers temporal annotations that capture the vessel pulsation characteristics, assisting in detecting ocular diseases that require fine-grained recognition of hemodynamic fluctuation. In application, our dataset exhibits a significant domain shift with respect to data captured by bench-top devices, thus posing great challenges to existing methods.
In the experiments, we provide evaluation metrics and benchmark results on our dataset, reflecting both the potential and challenges it offers for vessel segmentation tasks. We hope this challenging dataset would significantly contribute to the development of eye disease diagnosis and early prevention.

Submitted 13 July, 2023; originally announced July 2023.

arXiv:2306.14397 [pdf, other]

Discriminating Human-authored from ChatGPT-Generated Code Via Discernable Feature Analysis

Authors: Li Ke, Hong Sheng, Fu Cai, Zhang Yunhe, Liu Ming

Abstract: The ubiquitous adoption of Large Language Generation Models (LLMs) in programming has underscored the importance of differentiating between human-written code and code generated by intelligent models. This paper specifically aims to distinguish code generated by ChatGPT from that authored by humans. Our investigation reveals disparities in programming style, technical level, and readability between these two sources. Consequently, we develop a discriminative feature set for differentiation and evaluate its efficacy through ablation experiments. Additionally, we devise a dataset cleansing technique, which employs temporal and spatial segmentation, to mitigate the dearth of datasets and to secure high-caliber, uncontaminated datasets. To further enrich data resources, we employ "code transformation," "feature transformation," and "feature customization" techniques, generating an extensive dataset comprising 10,000 lines of ChatGPT-generated code. The salient contributions of our research include: proposing a discriminative feature set yielding high accuracy in differentiating ChatGPT-generated code from human-authored code in binary classification tasks; devising methods for generating extensive ChatGPT-generated codes; and introducing a dataset cleansing strategy that extracts immaculate, high-grade code datasets from open-source repositories, thus achieving exceptional accuracy in code authorship attribution tasks.

Submitted 4 July, 2023; v1 submitted 25 June, 2023; originally announced June 2023.

Comments: 11 pages, 8 figures, 3 tables

arXiv:2306.14176 [pdf, other]

Sentence-level Event Detection without Triggers via Prompt Learning and Machine Reading Comprehension

Authors: Tongtao Ling, Lei Chen, Huangxu Sheng, Zicheng Cai, Hai-Lin Liu

Abstract: The traditional way of sentence-level event detection involves two important subtasks: trigger identification and trigger classification, where the identified event trigger words are used to classify event types from sentences. However, trigger classification highly depends on abundant annotated trigger words and the accuracy of trigger identification. In a real scenario, annotating trigger words is time-consuming and laborious. For this reason, we propose a trigger-free event detection model, which transforms event detection into a two-tower model based on machine reading comprehension and prompt learning. Compared to existing trigger-based and trigger-free methods, experimental studies on two event detection benchmark datasets (ACE2005 and MAVEN) have shown that the proposed approach can achieve competitive performance.

Submitted 25 June, 2023; originally announced June 2023.

Comments: 14 pages, accepted by ADMA 2023

arXiv:2306.08478 [pdf, other]

Interfering Josephson diode effect and magnetochiral anisotropy in Ta2Pd3Te5 asymmetric edge interferometer

Authors: Yupeng Li, Dayu Yan, Yu Hong, Haohao Sheng, Anqi Wang, Ziwei Dou, Xingchen Guo, Xiaofan Shi, Zikang Su, Zhaozheng Lyu, Tian Qian, Guangtong Liu, Fanming Qu, Kun Jiang, Zhijun Wang, Youguo Shi, Zhu-An Xu, Jiangping Hu, Li Lu, Jie Shen

Abstract: Edge states in topological systems have attracted great interest due to their robustness and linear dispersions. Here a superconducting-proximitized edge interferometer is engineered on a topological insulator Ta2Pd3Te5 with asymmetric edges to realize the interfering Josephson diode effect (JDE), which offers many advantages, such as an efficiency as high as 73% at tiny applied magnetic fields with an ultra-low switching power of around a picowatt, and a giant interfering magnetochiral anisotropy with a maximal coefficient $\gamma = 1.2 \times 10^{9}$ T$^{-1}$A$^{-1}$. As an important element to induce such JDE, the second-order harmonic in the current-phase relation is also experimentally confirmed by half-integer Shapiro steps. This edge interferometer offers a novel and effective method to enhance the overall performance of JDE and magnetochiral anisotropy, and holds great potential for applications in future superconducting quantum devices.

Submitted 2 June, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

Comments: 29 pages, 21 figures

arXiv:2306.08347 [pdf, other]

doi 10.1038/s41524-024-01210-z

Superconductivity in unconventional metals

Authors: Zhilong Yang, Haohao Sheng, Zhaopeng Guo, Ruihan Zhang, Quansheng Wu, Hongming Weng, Zhong Fang, Zhijun Wang

Abstract: Based on first-principles calculations, we demonstrate that 1H/2H-phase transition metal dichalcogenides MX2 (M=Nb,Ta; X=S,Se,Te) are unconventional metals, which have an empty-site band of $A_1'@1e$ elementary band representation at the Fermi level. The computed phonon dispersions indicate the stability of the system at high temperatures, while the presence of the soft phonon mode suggests a phase transition to the charge density wave state at low temperatures. Based on the Bardeen-Cooper-Schrieffer theory and computed electron-phonon coupling, our calculations show that the superconductivity (SC) in NbSe2 is mainly attributed to the soft phonon mode due to the half filling of the empty-site band. Accordingly, the SC has been predicted in unconventional metals TaNS monolayer and 2H-TaN2 bulk with computed $T_C=$ 10 K and 26 K respectively. These results demonstrate that the unconventional metals with partial filling of the empty-site band offer an attractive platform to search for superconductors.

Submitted 18 March, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

Comments: 5 pages, 5 figures, 42 references

Journal ref: npj Comput Mater 10, 25 (2024)

arXiv:2305.09453 [pdf, other]

doi 10.1007/s11433-023-2271-y

Unconventional phonon spectra and obstructed edge phonon modes

Authors: Ruihan Zhang, Haohao Sheng, Junze Deng, Zhong Fang, Zhilong Yang, Zhijun Wang

Abstract: Based on the elementary band representations (EBR), some topologically trivial materials are classified as unconventional ones (obstructed atomic limit), where the EBR decomposition of electronic states is not consistent with the atomic valence-electron band representations. In this work, we identify that the unconventional nature can also exist in phonon spectra, where the EBR decomposition of the phonon modes is not consistent with atomic vibration band representations (ABR). The unconventionality has two types: type I is on an empty site; type II is on an atom site with non-atomic vibration orbitals. Our detailed calculations show that black phosphorus (BP) and 1H-MoSe2 have both unconventional phonon spectra and unconventional electronic band structures. BP has the type-I unconventional phonon spectrum, while 1H-MoSe2 has the type-II one. The obstructed phonon modes are obtained for two types of unconventional phonon spectra.

Submitted 18 March, 2024; v1 submitted 16 May, 2023; originally announced May 2023.

Comments: 5 pages, 5 figures

Journal ref: Sci. China-Phys. Mech. Astron. 67, 246811 (2024)

arXiv:2305.06043 [pdf, other]

Autonomous Stabilization of Retinal Videos for Streamlining Assessment of Spontaneous Venous Pulsations

Authors: Hongwei Sheng, Xin Yu, Feiyu Wang, MD Wahiduzzaman Khan, Hexuan Weng, Sahar Shariflou, S. Mojtaba Golzan

Abstract: Spontaneous retinal Venous Pulsations (SVP) are rhythmic changes in the caliber of the central retinal vein and are observed in the optic disc region (ODR) of the retina. Their absence is a critical indicator of various ocular or neurological abnormalities. Recent advances in imaging technology have enabled the development of portable smartphone-based devices for observing the retina and assessing SVPs. However, the quality of smartphone-based retinal videos is often poor due to noise and image jitter, which in turn can severely obstruct the observation of SVPs. In this work, we developed a fully automated retinal video stabilization method that enables the examination of SVPs captured by various mobile devices. Specifically, we first propose an ODR Spatio-Temporal Localization (ODR-STL) module to localize visible ODR and remove noisy and jittering frames. Then, we introduce a Noise-Aware Template Matching (NATM) module to stabilize high-quality video segments at a fixed position in the field of view. After the processing, the SVPs can be easily observed in the stabilized videos, significantly facilitating user observations. Furthermore, our method is cost-effective and has been tested in both subjective and objective evaluations. Both of the evaluations support its effectiveness in facilitating the observation of SVPs. This can improve the timely diagnosis and treatment of associated diseases, making it a valuable tool for eye health professionals.

Submitted 10 May, 2023; originally announced May 2023.

Comments: EMBC, 4 pages, 6 figures

arXiv:2304.04584 [pdf, other]

Extension of ELM suppression window using n=4 RMPs in EAST

Authors: P. Xie, Y. Sun, Q. Ma, S. Gu, Y. Q. Liu, M. Jia, A. Loarte, X. Wu, Y. Chang, T. Jia, T. Zhang, Z. Zhou, Q. Zang, B. Lyu, S. Fu, H. Sheng, C. Ye, H. Yang, H. H. Wang, EAST Contributors

Abstract: The q95 window for Type-I Edge Localized Modes (ELMs) suppression using n=4 even parity Resonant Magnetic Perturbations (RMPs) has been significantly expanded to a range from 3.9 to 4.8, which is demonstrated to be reliable and repeatable in EAST over the last two years. This window is significantly wider than the previous one, which is around q95=3.7±0.1, and is achieved using n=4 odd parity RMPs. Here, n represents the toroidal mode number of the applied RMPs and q95 is the safety factor at the 95% normalized poloidal magnetic flux. During ELM suppression, there is only a slight drop in the stored energy (<=10%). The comparison of pedestal density profiles suggests that ELM suppression is achieved when the pedestal gradient is kept lower than a threshold. This wide q95 window for ELM suppression is consistent with the prediction made by MARS-F modeling prior to the experiment, in which it is located at one of the resonant q95 windows for plasma response. The Chirikov parameter taking into account plasma response near the pedestal top, which measures the plasma edge stochasticity, significantly increases when q95 exceeds 4, mainly due to denser neighboring rational surfaces. Modeling of plasma response by the MARS-F code shows a strong coupling between resonant and non-resonant components across the pedestal region, which is characteristic of the kink-peeling like response observed during RMP-ELM suppression in previous studies on EAST.
These promising results show the reliability of ELM suppression using the n=4 RMPs and expand the physical understanding of the ELM suppression mechanism.

Submitted 10 April, 2023; originally announced April 2023.

Comments: 25 pages, 11 figures

arXiv:2303.14343 [pdf, other]

doi 10.1103/PhysRevB.108.104109

Ferroelectric metals in 1T/1T'-phase transition metal dichalcogenide MTe2 bilayers (M = Pt, Pd, and Ni)

Authors: Haohao Sheng, Zhong Fang, Zhijun Wang

Abstract: Ferroelectricity and metallicity cannot coexist due to the screening effect of conducting electrons, and a large number of stable monolayers with 1T/1T$^{\prime}$ phase lack spontaneous polarization due to inversion symmetry. In this work, we have constructed the $π$-bilayer structures for transition metal dichalcogenides ($M$Te$_2,M =$ Pt, Pd, and Ni) with van der Waals stacking, where two monolayers are related by $C_{2z}$ rotation, and have demonstrated that these $π$ bilayers are typical ferroelectric metals (FEMs). The $π$-bilayer structure widely exists in nature, such as 1T$^{\prime}$/T$_d$-TMD, $α$-Bi$_4$Br$_4$. The computed vertical polarizations of PtTe$_2$ and MoTe$_2$ $π$ bilayers are 0.46 and 0.25 pC/m, respectively. We show that the switching of polarization can be realized through interlayer sliding, which only requires crossing a low energy barrier. The interlayer charge transfer is the source of both vertical polarization and metallicity, and these properties are closely related to the spatially extended Te-$p_z$ orbital. Finally, we reveal that electron doping can significantly adjust the vertical polarization of these FEMs in both magnitude and direction. Our findings introduce a class of FEMs, which have potential applications in functional nanodevices such as ferroelectric tunneling junction and nonvolatile ferroelectric memory.

Submitted 22 September, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

Journal ref: Physical Review B 108, 104109 (2023)

arXiv:2303.05041 [pdf]

doi 10.1038/s41467-024-47103-z

Magnetic field filtering of the hinge supercurrent in unconventional metal NiTe$_2$-based Josephson junctions

Authors: Tian Le, Ruihan Zhang, Changcun Li, Ruiyang Jiang, Haohao Sheng, Linfeng Tu, Xuewei Cao, Zhaozheng Lyu, Jie Shen, Guangtong Liu, Fucai Liu, Zhijun Wang, Li Lu, Fanming Qu

Abstract: Topological materials with boundary (surface/edge/hinge) states have attracted tremendous research interest. Besides, unconventional (obstructed atomic) materials have recently drawn lots of attention owing to their obstructed boundary states. Experimentally, Josephson junctions (JJs) constructed on materials with boundary states produce the peculiar boundary supercurrent, which was utilized as a powerful diagnostic approach. Here, we report the observations of conspicuous hinge supercurrent in NiTe$_2$-based JJs. Particularly, applying an in-plane magnetic field along the Josephson current could rapidly suppress the bulk supercurrent and retain the nearly pure hinge supercurrent, namely the magnetic field filtering of supercurrent. Further systematic comparative analysis and theoretical calculations demonstrate the existence of unconventional nature and obstructed hinge states in NiTe$_2$. Our results revealed the unique hinge states in unconventional metal NiTe$_2$, and demonstrated in-plane magnetic field as an efficient method to filter out the futile bulk contributions and thereby to highlight the hinge states hidden in topological/unconventional materials.

Submitted 9 March, 2023; originally announced March 2023.

Comments: 20+6 pages, 6+5 figures

Journal ref: Nature Communications 15, 2785 (2024)

arXiv:2301.01842 [pdf, other]

Detecting Neighborhood Gentrification at Scale via Street-level Visual Data

Authors: Tianyuan Huang, Timothy Dai, Zhecheng Wang, Hesu Yoon, Hao Sheng, Andrew Y. Ng, Ram Rajagopal, Jackelyn Hwang

Abstract: Neighborhood gentrification plays a significant role in shaping the social and economic well-being of both individuals and communities at large. While some efforts have been made to detect gentrification in cities, existing approaches rely mainly on estimated measures from survey data, require substantial work of human labeling, and are limited in characterizing the neighborhood as a whole. We propose a novel approach to detecting neighborhood gentrification at a large scale based on the physical appearance of neighborhoods by incorporating historical street-level visual data. We show the effectiveness of the proposed method by comparing results from our approach with gentrification measures from previous literature and case studies. Our approach has the potential to supplement existing indicators of gentrification and become a valid resource for urban researchers and policy makers.

Submitted 4 January, 2023; originally announced January 2023.

arXiv:2210.17163 [pdf, other]

HHLPy: Practical Verification of Hybrid Systems using Hoare Logic

Authors: Huanhuan Sheng, Alexander Bentkamp, Bohua Zhan

Abstract: We present a tool for verification of hybrid systems expressed in the sequential fragment of HCSP (Hybrid Communicating Sequential Processes). The tool permits annotating HCSP programs with pre- and postconditions, invariants, and proof rules for reasoning about ordinary differential equations. Verification conditions are generated from the annotations following the rules of hybrid Hoare logic. We designed labeling and highlighting mechanisms to distinguish and visualize different verification conditions. The tool is implemented in Python and has a web-based user interface. We evaluated the effectiveness of the tool on translations of Simulink/Stateflow models and on KeYmaera X benchmarks.

Submitted 21 February, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

arXiv:2210.04504 [pdf, other]

Sampling of Correlated Bandlimited Continuous Signals by Joint Time-vertex Graph Fourier Transform

Authors: Zhongyi Ni, Feng Ji, Hang Sheng, Hui Feng, Bo Hu

Abstract: When sampling multiple signals, the correlation between the signals can be exploited to reduce the overall number of samples. In this paper, we study the sampling theory of multiple correlated signals, using correlation to sample them at the lowest sampling rate. Based on the correlation between signal sources, we model multiple continuous-time signals as continuous time-vertex graph signals. The graph signals are projected onto orthogonal bases to remove spatial correlation and reduce dimensions by graph Fourier transform. When the bandwidths of the original signals and the reduced dimension signals are given, we prove the minimum sampling rate required for recovery of the original signals, and propose a feasible sampling scheme.

Submitted 10 October, 2022; originally announced October 2022.

arXiv:2208.14027 [pdf, ps, other]

Optimal Probabilistic Constellation Shaping for Covert Communications

Authors: Shuai Ma, Yunqi Zhang, Haihong Sheng, Hang Li, Jia Shi, Long Yang, Youlong Wu, Naofal Al-Dhahir, Shiyin Li

Abstract: In this paper, we investigate the optimal probabilistic constellation shaping design for covert communication systems from a practical view. Different from conventional covert communications with equiprobable constellations modulation, we propose nonequiprobable constellations modulation schemes to further enhance the covert rate. Specifically, we derive covert rate expressions for practical discrete constellation inputs for the first time. Then, we study the covert rate maximization problem by jointly optimizing the constellation distribution and power allocation. In particular, an approximate gradient descent method is proposed for obtaining the optimal probabilistic constellation shaping. To strike a balance between the computational complexity and the transmission performance, we further develop a framework that maximizes a lower bound on the achievable rate where the optimal probabilistic constellation shaping problem can be solved efficiently using the Frank-Wolfe method. Extensive numerical results show that the optimized probabilistic constellation shaping strategies provide significant gains in the achievable covert rate over the state-of-the-art schemes.

Submitted 30 August, 2022; originally announced August 2022.

arXiv:2208.07212 [pdf, other]

doi 10.34133/research.0042

Large spin Hall conductivity and excellent hydrogen evolution reaction activity in unconventional PtTe1.75 monolayer

Authors: Dexi Shao, Junze Deng, Haohao Sheng, Ruihan Zhang, Hongming Weng, Zhong Fang, Xing-Qiu Chen, Yan Sun, Zhijun Wang

Abstract: Two-dimensional (2D) materials have gained lots of attention due to the potential applications. In this work, we propose that based on first-principles calculations, the (2$\times$2) patterned PtTe$_2$ monolayer with kagome lattice formed by the well-ordered Te vacancy (PtTe$_{1.75}$) hosts large spin Hall conductivity (SHC) and excellent hydrogen evolution reaction (HER) activity. The unconventional nature relies on the $A1@1b$ band representation (BR) of the highest valence band without SOC. The large SHC comes from the Rashba spin-orbit coupling (SOC) in the noncentrosymmetric structure induced by the Te vacancy. Even though it has a metallic SOC band structure, the $\mathbb Z_2$ invariant is well defined due to the existence of the direct band gap and is computed to be nontrivial. The calculated SHC is as large as 1.25$\times 10^3 \frac{\hbar}{e} (Ω~cm)^{-1}$ at the Fermi level ($E_F$). By tuning the chemical potential from $E_F-0.3$ to $E_F+0.3$ eV, it varies rapidly and monotonically from $-1.2\times 10^3$ to 3.1$\times 10^3 \frac{\hbar}{e} (Ω~cm)^{-1}$. In addition, we also find the Te vacancy in the patterned monolayer can induce excellent HER activity. Our results not only offer a new idea to search 2D materials with large SHC, i.e., by introducing inversion-symmetry breaking vacancies in large SOC systems, but also provide a feasible system with tunable SHC (by applying gate voltage) and excellent HER activity.

Submitted 15 August, 2022; originally announced August 2022.

Journal ref: Research 6, 0042 (2023)

arXiv:2208.05878 [pdf, ps, other]

Covert Beamforming Design for Integrated Radar Sensing and Communication Systems

Authors: Shuai Ma, Haihong Sheng, Ruixin Yang, Hang Li, Youlong Wu, Chao Shen, Naofal Al-Dhahir, Shiyin Li

Abstract: We propose covert beamforming design frameworks for integrated radar sensing and communication (IRSC) systems, where the radar can covertly communicate with legitimate users under the cover of the probing waveforms without being detected by the eavesdropper. Specifically, by jointly designing the target detection beamformer and communication beamformer, we aim to maximize the radar detection mutual information (MI) (or the communication rate) subject to the covert constraint, the communication rate constraint (or the radar detection MI constraint), and the total power constraint. For the perfect eavesdropper's channel state information (CSI) scenario, we transform the covert beamforming design problems into a series of convex subproblems, by exploiting semidefinite relaxation, which can be solved via the bisection search method. Considering the high complexity of iterative optimization, we further propose a single-iterative covert beamformer design scheme based on the zero-forcing criterion. For the imperfect eavesdropper's CSI scenario, we develop a relaxation and restriction method to tackle the robust covert beamforming design problems. Simulation results demonstrate the effectiveness of the proposed covert beamforming schemes for perfect and imperfect CSI scenarios.

Submitted 11 August, 2022; originally announced August 2022.

arXiv:2207.09332 [pdf, other]

Rethinking IoU-based Optimization for Single-stage 3D Object Detection

Authors: Hualian Sheng, Sijia Cai, Na Zhao, Bing Deng, Jianqiang Huang, Xian-Sheng Hua, Min-Jian Zhao, Gim Hee Lee

Abstract: Since Intersection-over-Union (IoU) based optimization maintains the consistency of the final IoU prediction metric and losses, it has been widely used in both regression and classification branches of single-stage 2D object detectors. Recently, several 3D object detection methods adopt IoU-based optimization and directly replace the 2D IoU with 3D IoU. However, such a direct computation in 3D is very costly due to the complex implementation and inefficient backward operations. Moreover, 3D IoU-based optimization is sub-optimal as it is sensitive to rotation and thus can cause training instability and detection performance deterioration. In this paper, we propose a novel Rotation-Decoupled IoU (RDIoU) method that can mitigate the rotation-sensitivity issue, and produce more efficient optimization objectives compared with 3D IoU during the training stage. Specifically, our RDIoU simplifies the complex interactions of regression parameters by decoupling the rotation variable as an independent term, yet preserving the geometry of 3D IoU. By incorporating RDIoU into both the regression and classification branches, the network is encouraged to learn more precise bounding boxes and concurrently overcome the misalignment issue between classification and regression. Extensive experiments on the benchmark KITTI and Waymo Open Dataset validate that our RDIoU method can bring substantial improvement for the single-stage 3D object detection.

Submitted 20 July, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

Comments: Accepted by ECCV2022. The code is available at https://github.com/hlsheng1/RDIoU

arXiv:2205.00593 [pdf, other]

PFNN-2: A Domain Decomposed Penalty-Free Neural Network Method for Solving Partial Differential Equations

Authors: Hailong Sheng, Chao Yang

Abstract: A new penalty-free neural network method, PFNN-2, is presented for solving partial differential equations, which is a subsequent improvement of our previously proposed PFNN method [1]. PFNN-2 inherits all advantages of PFNN in handling the smoothness constraints and essential boundary conditions of self-adjoint problems with complex geometries, and extends the application to a broader range of non-self-adjoint time-dependent differential equations. In addition, PFNN-2 introduces an overlapping domain decomposition strategy to substantially improve the training efficiency without sacrificing accuracy. Experimental results on a series of partial differential equations are reported, which demonstrate that PFNN-2 can outperform state-of-the-art neural network methods in various aspects such as numerical accuracy, convergence speed, and parallel scalability.

Submitted 1 May, 2022; originally announced May 2022.

arXiv:2201.12693 [pdf, other]

Extracting Built Environment Features for Planning Research with Computer Vision: A Review and Discussion of State-of-the-Art Approaches

Authors: Meiqing Li, Hao Sheng

Abstract: This is an extended abstract for a presentation at The 17th International Conference on CUPUM - Computational Urban Planning and Urban Management in June 2021. This study presents an interdisciplinary synthesis of the state-of-the-art approaches in computer vision technologies to extract built environment features that could improve the robustness of empirical research in planning. We discussed the findings from the review of studies in both planning and computer science.

Submitted 21 March, 2022; v1 submitted 29 January, 2022; originally announced January 2022.

Comments: CUPUM 2021 (The 17th International Conference on Computational Urban Planning and Urban Management)

arXiv:2111.10171 [pdf]

Structural Origin of Boson Peak in Glasses

Authors: Yuan Tian, Xiaozhe Shen, Qingyang Gao, Zhen Lu, Jie Yang, Qiang Zheng, Christopher Florencio Aleman, Duan Luo, Alexander Hume Reid, Bin Xu, Michael Falk, Howard Sheng, Jianming Cao, Xijie Wang, Mingwei Chen

Abstract: Boson peak, the excess low energy excitations in the terahertz regime, is one of the most unique features of disordered systems and has been linked to many anomalous properties of glass materials. The nature and structural origin of the boson peak remain elusive and have been debated for more than a half century mainly due to the lack of real-time and real-space experimental insights of the dynamic phenomenon. In this work we employed femtosecond MeV ultrafast electron diffraction to characterize the atomic dynamics of metallic glasses in real time. The experiment reveals collective atomic oscillations, presented in elastic electron scattering and atomic pair distribution functions, within the boson peak frequency range of 1.0-1.8 THz in both reciprocal and real space. It was found that the oscillation frequency has reciprocal dependence on interatomic pair distances and the corresponding wave velocity experimentally affirms the transverse acoustic wave nature of the boson peak. The observed strong correlation between THz acoustic vibrations and coherent electron scattering provides compelling evidence that the boson peak originates from the collective transverse vibrational modes of structurally ordered atoms in the disordered system.

Submitted 19 November, 2021; originally announced November 2021.

arXiv:2110.15876 [pdf, ps, other]

An integrable semi-discretization of the modified Camassa-Holm equation with linear dispersion term

Authors: Han-Han Sheng, Guo-Fu Yu, Bao-Feng Feng

Abstract: In the present paper, we are concerned with integrable discretization of a modified Camassa-Holm equation with linear dispersion term. The key of the construction is the semi-discrete analogue for a set of bilinear equations of the modified Camassa-Holm equation. Firstly, we show that these bilinear equations and their determinant solutions either in Gram-type or Casorati-type can be reduced from the discrete KP equation through Miwa transformation. Then, by scrutinizing the reduction process, we obtain a set of semi-discrete bilinear equations and their general soliton solution in Gram-type or Casorati-type determinant form. Finally, by defining dependent variables and discrete hodograph transformations, we are able to derive an integrable semi-discrete analogue of the modified Camassa-Holm equation. It is also shown that the semi-discrete modified Camassa-Holm equation converges to the continuous one in the continuum limit.

Submitted 29 October, 2021; originally announced October 2021.

Comments: 26 pages, 2 figures

MSC Class: 39A36; 35Q51; 35C08

arXiv:2108.12028 [pdf]

doi 10.1016/j.jallcom.2020.157266

Effects of minor alloying on the mechanical properties of Al based metallic glasses

Authors: Vrishank Jambur, Chaiyapat Tangpatjaroen, Jianqi Xi, Jirameth Tarnsangpradit, Meng Gao, Howard Sheng, John Perepezko, Izabela Szlufarska

Abstract: Minor alloying is widely used to control mechanical properties of metallic glasses (MGs). The present understanding of how a small amount of alloying element changes strength is that the additions lead to more efficient packing of atoms and increased local topological order, which then increases the barrier for shear transformations and the resistance to plastic deformation. Here, we discover that minor alloying can improve the strength of MGs by increasing the chemical bond strength alone and show that this strengthening is distinct from changes in topological order. The results were obtained using Al-Sm based MGs minor alloyed with transition metals (TMs). The addition of TMs led to an increase in the hardness of the MGs which, however, could not be explained based on changes in the topological ordering in the structure. Instead we found that it was the strong bonding between TM and Al atoms which led to a higher resistance to shear transformation that resulted in higher strength and hardness, while the topology around the TM atoms had no influence on their mechanical response. This finding demonstrates that the effects of topology and chemistry on mechanical properties of MGs are independent of each other and that they should be understood as separate, sometimes competing mechanisms of strengthening. This understanding lays a foundation for design of MGs with improved mechanical properties.

Submitted 26 August, 2021; originally announced August 2021.

Journal ref: Journal of Alloys and Compounds, vol. 854, p. 157266, Feb. 2021

arXiv:2108.10723 [pdf, other]

Improving 3D Object Detection with Channel-wise Transformer

Authors: Hualian Sheng, Sijia Cai, Yuan Liu, Bing Deng, Jianqiang Huang, Xian-Sheng Hua, Min-Jian Zhao

Abstract: Though 3D object detection from point clouds has achieved rapid progress in recent years, the lack of flexible and high-performance proposal refinement remains a great hurdle for existing state-of-the-art two-stage detectors. Previous works on refining 3D proposals have relied on human-designed components such as keypoints sampling, set abstraction and multi-scale feature fusion to produce powerful 3D object representations. Such methods, however, have limited ability to capture rich contextual dependencies among points. In this paper, we leverage the high-quality region proposal network and a Channel-wise Transformer architecture to constitute our two-stage 3D object detection framework (CT3D) with minimal hand-crafted design. The proposed CT3D simultaneously performs proposal-aware embedding and channel-wise context aggregation for the point features within each proposal. Specifically, CT3D uses proposal's keypoints for spatial contextual modelling and learns attention propagation in the encoding module, mapping the proposal to point embeddings. Next, a new channel-wise decoding module enriches the query-key interaction via channel-wise re-weighting to effectively merge multi-level contexts, which contributes to more accurate object predictions. Extensive experiments demonstrate that our CT3D method has superior performance and excellent scalability. Remarkably, CT3D achieves the AP of 81.77% in the moderate car category on the KITTI test 3D detection benchmark, outperforming state-of-the-art 3D detectors.

Submitted 14 September, 2021; v1 submitted 22 August, 2021; originally announced August 2021.

Comments: Accepted by ICCV2021

arXiv:2106.06515 [pdf, other]

Probability Paths and the Structure of Predictions over Time

Authors: Zhiyuan Jerry Lin, Hao Sheng, Sharad Goel

Abstract: In settings ranging from weather forecasts to political prognostications to financial projections, probability estimates of future binary outcomes often evolve over time. For example, the estimated likelihood of rain on a specific day changes by the hour as new information becomes available. Given a collection of such probability paths, we introduce a Bayesian framework -- which we call the Gaussian latent information martingale, or GLIM -- for modeling the structure of dynamic predictions over time. Suppose, for example, that the likelihood of rain in a week is 50%, and consider two hypothetical scenarios. In the first, one expects the forecast to be equally likely to become either 25% or 75% tomorrow; in the second, one expects the forecast to stay constant for the next several days. A time-sensitive decision-maker might select a course of action immediately in the latter scenario, but may postpone their decision in the former, knowing that new information is imminent. We model these trajectories by assuming predictions update according to a latent process of information flow, which is inferred from historical data. In contrast to general methods for time series analysis, this approach preserves important properties of probability paths such as the martingale structure and appropriate amount of volatility and better quantifies future uncertainties around probability paths. We show that GLIM outperforms three popular baseline methods, producing better estimated posterior probability path distributions measured by three different metrics. By elucidating the dynamic structure of predictions over time, we hope to help individuals make more informed choices.

Submitted 4 November, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

Journal ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

arXiv:2105.04398 [pdf, ps, other]

When Rooks Miss: Probability through Chess

Authors: Steven J. Miller, Haoyu Sheng, Daniel Turek

Abstract: A famous (and hard) chess problem asks what is the maximum number of safe squares possible in placing $n$ queens on an $n\times n$ board. We examine related problems from placing $n$ rooks. We prove that as $n\to\infty$, the probability rapidly tends to 1 that the fraction of safe squares from a random placement converges to $1/e^2$. Our interest in the problem is showing how to view the involved algebra to obtain the simple, closed form limiting fraction. In particular, we see the power of many of the key concepts in probability: binary indicator variables, linearity of expectation, variances and covariances, Chebyshev's inequality, and Stirling's formula.

Submitted 10 May, 2021; originally announced May 2021.

Comments: Version 1.0, 11 pages, 2 figures

MSC Class: 60-01 (primary); 05-01 (secondary)

Journal ref: College Math. J. 52 (2021), no. 2, 82--93
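
The $1/e^2$ limit quoted in the abstract above can be checked numerically. The following is a minimal Monte Carlo sketch, not the authors' method: it assumes the $n$ rooks occupy $n$ distinct squares chosen uniformly at random, and that a square is "safe" when no rook shares its row or column (the abstract does not spell out either convention). The function names `safe_fraction` and `mean_safe_fraction` are hypothetical.

```python
import math
import random


def safe_fraction(n, rng):
    """Fraction of squares on an n x n board attacked by no rook.

    Assumption: n rooks sit on n distinct squares drawn uniformly at random.
    """
    squares = rng.sample(range(n * n), n)      # n distinct squares, indexed 0..n^2-1
    rows = {s // n for s in squares}           # rows containing at least one rook
    cols = {s % n for s in squares}            # columns containing at least one rook
    # A square is safe exactly when both its row and its column are rook-free.
    free_rows = n - len(rows)
    free_cols = n - len(cols)
    return free_rows * free_cols / (n * n)


def mean_safe_fraction(n, trials, seed=0):
    """Average the safe fraction over independent random placements."""
    rng = random.Random(seed)
    return sum(safe_fraction(n, rng) for _ in range(trials)) / trials


if __name__ == "__main__":
    estimate = mean_safe_fraction(n=200, trials=200)
    print(f"simulated: {estimate:.4f}   1/e^2: {math.exp(-2):.4f}")
```

Under these assumptions, each row is rook-free with probability close to $1/e$, and likewise each column, which is why the product of free rows and free columns approaches $n^2/e^2$.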

arXiv:2105.02489 [pdf, other]

Learning Neighborhood Representation from Multi-Modal Multi-Graph: Image, Text, Mobility Graph and Beyond

Authors: Tianyuan Huang, Zhecheng Wang, Hao Sheng, Andrew Y. Ng, Ram Rajagopal

Abstract: Recent urbanization has coincided with the enrichment of geotagged data, such as street view and point-of-interest (POI). Region embedding enhanced by the richer data modalities has enabled researchers and city administrators to understand the built environment, socioeconomics, and the dynamics of cities better. While some efforts have been made to simultaneously use multi-modal inputs, existing methods can be improved by incorporating different measures of 'proximity' in the same embedding space - leveraging not only the data that characterizes the regions (e.g., street view, local businesses pattern) but also those that depict the relationship between regions (e.g., trips, road network). To this end, we propose a novel approach to integrate multi-modal geotagged inputs as either node or edge features of a multi-graph based on their relations with the neighborhood region (e.g., tiles, census block, ZIP code region, etc.). We then learn the neighborhood representation based on a contrastive-sampling scheme from the multi-graph. Specifically, we use street view images and POI features to characterize neighborhoods (nodes) and use human mobility to characterize the relationship between neighborhoods (directed edges). We show the effectiveness of the proposed methods with quantitative downstream tasks as well as qualitative analysis of the embedding space: The embedding we trained outperforms the ones using only unimodal data as regional inputs.

Submitted 6 May, 2021; originally announced May 2021.

arXiv:2105.01764 [pdf, other]

Surveilling Surveillance: Estimating the Prevalence of Surveillance Cameras with Street View Data

Authors: Hao Sheng, Keniel Yao, Sharad Goel

Abstract: The use of video surveillance in public spaces -- both by government agencies and by private citizens -- has attracted considerable attention in recent years, particularly in light of rapid advances in face-recognition technology. But it has been difficult to systematically measure the prevalence and placement of cameras, hampering efforts to assess the implications of surveillance on privacy and public safety. Here, we combine computer vision, human verification, and statistical analysis to estimate the spatial distribution of surveillance cameras. Specifically, we build a camera detection model and apply it to 1.6 million street view images sampled from 10 large U.S. cities and 6 other major cities around the world, with positive model detections verified by human experts. After adjusting for the estimated recall of our model, and accounting for the spatial coverage of our sampled images, we are able to estimate the density of surveillance cameras visible from the road. Across the 16 cities we consider, the estimated number of surveillance cameras per linear kilometer ranges from 0.2 (in Los Angeles) to 0.9 (in Seoul). In a detailed analysis of the 10 U.S. cities, we find that cameras are concentrated in commercial, industrial, and mixed zones, and in neighborhoods with higher shares of non-white residents -- a pattern that persists even after adjusting for land use. These results help inform ongoing discussions on the use of surveillance technology, including its potential disparate impacts on communities of color.

Submitted 30 August, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

Comments: We now credit Turtiainen et al. (2020) both for creating a state-of-the-art camera detection model and for suggesting that computer vision could, in theory, be applied to street view data to map surveillance cameras. Also, we discovered a coding error in our image sampling strategy that corrupted our analysis of camera density over time. We have now removed the results of that analysis

arXiv:2104.08511 [pdf, other]

doi 10.1063/5.0062540

Plasma response to resonant magnetic perturbations near rotation zero-crossing in low torque plasmas

Authors: Pengcheng Xie, Youwen Sun, Yueqiang Liu, Shuai Gu, Qun Ma, Cheng Ye, Xuemin Wu, Hui Sheng

Abstract: Plasma response to resonant magnetic perturbations (RMPs) near the pedestal top is crucial for accessing edge localized modes (ELMs) suppression in tokamaks. Since radial location of rotation zero-crossing plays a key role in determining the threshold for field penetration of RMP, plasma response may be different in low input torque plasmas. In this work, the linear MHD code MARS-F is applied to reveal the dependence of plasma response to RMP on rotation zero-crossing by a scan of rotation profiles based on an EAST equilibrium. It is shown that the plasma response is enhanced when zero-crossing occurs near rational surfaces. The dependence of plasma response on the location of rotation zero-crossing is well fitted by a double Gaussian, indicating two effects in this enhancement. One is induced by rotation screening effect shown as a wide base (with a width around 10-20 krad/s), and the other is related to resistive singular layer effect characterized by a localized peak (with a width around 3-4 krad/s). The width of the peak scales with the resistive singular layer width. The plasma displacement suggests the response is tearing like when zero-crossing is within the singular layer, while it is kink like when zero-crossing is far from the layer. The enhancement of magnetic islands width at the peak is only around a factor of two, when the absolute value of local rotation is not larger than 10-20 krad/s. It is further confirmed in a modeling of plasma response in an EAST ELM suppression discharge. Though there is a zero-crossing in $E\times B$ rotation but not in electron perpendicular rotation, no significant difference in plasma response is obtained using these two rotation profiles. This suggests that the rotation near pedestal top should not be far away from zero but may not be necessary to have zero-crossing for accessing ELM suppression.

Submitted 2 July, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

Comments: 9 pages, 9 figures, submitted to Physics of Plasmas

arXiv:2101.09645 [pdf, other]

Multi-Task Time Series Forecasting With Shared Attention

Authors: Zekai Chen, Jiaze E, Xiao Zhang, Hao Sheng, Xiuzheng Cheng

Abstract: Time series forecasting is a key component in many industrial and business decision processes and recurrent neural network (RNN) based models have achieved impressive progress on various time series forecasting tasks. However, most of the existing methods focus on single-task forecasting problems by learning separately based on limited supervised objectives, which often suffer from insufficient training instances. As the Transformer architecture and other attention-based models have demonstrated their great capability of capturing long term dependency, we propose two self-attention based sharing schemes for multi-task time series forecasting which can train jointly across multiple tasks. We augment a sequence of paralleled Transformer encoders with an external public multi-head attention function, which is updated by all data of all tasks. Experiments on a number of real-world multi-task time series forecasting tasks show that our proposed architectures can not only outperform the state-of-the-art single-task forecasting baselines but also outperform the RNN-based multi-task forecasting method.

Submitted 23 January, 2021; originally announced January 2021.

Comments: Accepted by ICDMW 2020

arXiv:2012.11162 [pdf, ps, other]

On the Ky Fan $k$-norm of the $LI$-matrix of graphs

Authors: Zhen Lin, Lianying Miao, Guanglong Yu, Han Sheng

Abstract: Let $A(G)$ and $D(G)$ be the adjacency matrix and the degree diagonal matrix of a graph $G$, respectively. Then $L(G)=D(G)-A(G)$ is called the Laplacian matrix of the graph $G$. Let $G$ be a graph with $n$ vertices and $m$ edges. Then the $LI$-matrix of $G$ is defined as $LI(G)=L(G)-\frac{2m}{n}I_n$, where $I_n$ is the identity matrix. In this paper, we are interested in extremal properties of the Ky Fan $k$-norm of the $LI$-matrix of graphs, which is closely related to the well known problems and results in spectral graph theory, such as the Laplacian spectral radius, the Laplacian spread, the sum of the $k$ largest Laplacian eigenvalues, the Laplacian energy, and other parameters. Some bounds on the Ky Fan $k$-norm of the $LI$-matrix of graphs are given, and the extremal graphs are partly characterized. In addition, upper and lower bounds on the Ky Fan $k$-norm of $LI$-matrix of trees, unicyclic graphs and bicyclic graphs are determined, and the corresponding extremal graphs are characterized.

Submitted 24 May, 2022; v1 submitted 21 December, 2020; originally announced December 2020.

Comments: 23 pages, 3 figures

MSC Class: 05C50

ACM Class: F.2.2
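
As a small illustration of the definitions in the abstract above, here is a worked example for the complete graph $K_4$. The choice of graph is an assumption made for illustration and does not come from the paper. It uses the standard definition of the Ky Fan $k$-norm as the sum of the $k$ largest singular values, which for a symmetric matrix are the absolute values of its eigenvalues.

```latex
% Worked example; G = K_4 is chosen here for illustration only.
\begin{align*}
n &= 4, \qquad m = 6, \qquad \tfrac{2m}{n} = 3, \\
\text{eigenvalues of } L(K_4) &: \quad 0,\ 4,\ 4,\ 4, \\
\text{eigenvalues of } LI(K_4) = L(K_4) - 3 I_4 &: \quad -3,\ 1,\ 1,\ 1, \\
\text{singular values } \sigma_1 \ge \sigma_2 \ge \sigma_3 \ge \sigma_4 &: \quad 3,\ 1,\ 1,\ 1, \\
\|LI(K_4)\|_{(k)} = \sum_{i=1}^{k} \sigma_i &= 3,\ 4,\ 5,\ 6 \qquad (k = 1, 2, 3, 4).
\end{align*}
```

For $k = n$ the norm equals $\sum_i |\mu_i - 2m/n|$, the Laplacian energy (here 6), and for $k = 1$ it is the spectral norm of $LI(G)$, consistent with the connections listed in the abstract.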

arXiv:2012.05917 [pdf, other]

doi 10.1103/PhysRevB.103.115145

Quantum Spin Hall Effect in Ta$_2$M$_3$Te$_5$ (M = Pd, Ni)

Authors: Zhaopeng Guo, Dayu Yan, Haohao Sheng, Simin Nie, Youguo Shi, Zhijun Wang

Abstract: Quantum spin Hall (QSH) effect with great promise for the potential application in spintronics and quantum computing has attracted extensive research interest from both theoretical and experimental researchers. Here, we predict monolayer Ta$_2$Pd$_3$Te$_5$ can be a QSH insulator based on first-principles calculations. The interlayer binding energy in the layered van der Waals compound Ta$_2$Pd$_3$Te$_5$ is 19.6 meV/A$^2$; thus, its monolayer/thin-film structures could be readily obtained by exfoliation. The band inversion near the Fermi level ($E_F$) is an intrinsic characteristic, which happens between Ta-$5d$ and Pd-$4d$ orbitals without spin-orbit coupling (SOC). The SOC effect opens a global gap and makes the system a QSH insulator. With the $d$-$d$ band-inverted feature, the nontrivial topology in monolayer Ta$_2$Pd$_3$Te$_5$ is characterized by the time-reversal topological invariant $\mathbb Z_2=1$, which is computed by the one-dimensional (1D) Wilson loop method as implemented in our first-principles calculations. The helical edge modes are also obtained using surface Green's function method. Our calculations show that the QSH state in Ta$_2M_3$Te$_5$ ($M=$ Pd, Ni) can be tuned by external strain. These monolayers and thin films provide feasible platforms for realizing QSH effect as well as related devices.

Submitted 10 December, 2020; originally announced December 2020.

Comments: 11 pages, 12 figures, 1 table

Journal ref: Phys. Rev. B 103, 115145 (2021)

arXiv:2012.01186 [pdf, other]

AGenT Zero: Zero-shot Automatic Multiple-Choice Question Generation for Skill Assessments

Authors: Eric Li, **gyi Su, Hao Sheng, Lawrence Wai

Abstract: Multiple-choice questions (MCQs) offer the most promising avenue for skill evaluation in the era of virtual education and job recruiting, where traditional performance-based alternatives such as projects and essays have become less viable, and grading resources are constrained. The automated generation of MCQs would allow assessment creation at scale. Recent advances in natural language processing have given rise to many complex question generation methods. However, the few methods that produce deployable results in specific domains require a large amount of domain-specific training data that can be very costly to acquire. Our work provides an initial foray into MCQ generation under high data-acquisition cost scenarios by strategically emphasizing paraphrasing the question context (compared to the task). In addition to maintaining semantic similarity between the question-answer pairs, our pipeline, which we call AGenT Zero, consists of only pre-trained models and requires no fine-tuning, minimizing data acquisition costs for question generation. AGenT Zero successfully outperforms other pre-trained methods in fluency and semantic similarity. Additionally, with some small changes, our assessment pipeline can be generalized to a broader question and answer space, including short answer or fill in the blank questions.

Submitted 18 December, 2020; v1 submitted 24 November, 2020; originally announced December 2020.

Comments: AAAI 2021 Workshop on AI Education/TIPCE 2021

Showing 1–50 of 343 results for author: Sheng, H