doi 10.2140/agt.2023.23.3835

The group of quasi-isometries of the real line cannot act effectively on the line

Abstract: We prove that the group $\mathrm{QI}^{+}(\mathbb{R})$ of orientation-preserving quasi-isometries of the real line is a left-orderable, non-simple group, which cannot act effectively on the real line $\mathbb{R}$.

Submitted 24 June, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

Comments: Minor changes; final version, to appear in Algebraic and Geometric Topology

Journal ref: Algebr. Geom. Topol. 23 (2023) 3835-3847

arXiv:2202.01614 [pdf, other]

The RoyalFlush System of Speech Recognition for M2MeT Challenge

Authors: Shuaishuai Ye, Peiyao Wang, Shunfei Chen, Xinhui Hu, Xinkang Xu

Abstract: This paper describes our RoyalFlush system for the multi-speaker automatic speech recognition (ASR) track of the M2MeT challenge. We adopted a serialized output training (SOT) based multi-speaker ASR system with large-scale simulation data. First, we investigated a set of front-end methods, including multi-channel weighted prediction error (WPE), beamforming, speech separation, and speech enhancement, to process the training, validation, and test sets; based on the experimental results, we selected only WPE and beamforming as our front-end methods. Second, we invested heavily in data augmentation for multi-speaker ASR, mainly adding noise and reverberation, overlapped speech simulation, multi-channel speech simulation, speed perturbation, and front-end processing, which brought a large performance improvement. Finally, to exploit the complementary strengths of different model architectures, we trained a standard Conformer-based joint CTC/Attention model (Conformer) and a U2++ ASR model with a bidirectional attention decoder, a modification of Conformer, and fused their results. Compared with the official baseline system, our system achieved a 12.22% absolute Character Error Rate (CER) reduction on the validation set and 12.11% on the test set.

Submitted 24 February, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

arXiv:2112.08606 [pdf]

An Empirical Study on Transfer Learning for Privilege Review

Authors: Haozhen Zhao, Shi Ye, Jingchao Yang

Abstract: Protecting privileged communications and data from inadvertent disclosure is a paramount task in US legal practice. Traditionally, counsel rely on keyword searching and manual review to identify privileged documents in cases. As data volumes increase, this approach becomes less and less defensible in terms of cost. Machine learning methods have been used to identify privileged documents. Given the generalizable nature of privilege in legal cases, we hypothesize that transfer learning can capitalize on knowledge learned from existing labeled data to identify privileged documents without requiring new labeled training data. In this paper, we study both traditional machine learning models and deep learning models based on BERT for privilege document classification tasks in legal document review, and we examine the effectiveness of transfer learning for privilege models on three real-world datasets with privilege labels. Our results show that the BERT model outperforms the industry-standard logistic regression algorithm and that transfer learning models can achieve decent performance on datasets in the same or close domains.

Submitted 15 December, 2021; originally announced December 2021.

Comments: 2021 IEEE International Conference on Big Data (Big Data)

arXiv:2112.08359 [pdf, other]

3D Question Answering

Authors: Shuquan Ye, Dongdong Chen, Songfang Han, Jing Liao

Abstract: Visual Question Answering (VQA) has witnessed tremendous progress in recent years. However, most efforts focus only on 2D image question answering tasks. In this paper, we present the first attempt at extending VQA to the 3D domain, which can facilitate artificial intelligence's perception of 3D real-world scenarios. Unlike image-based VQA, 3D Question Answering (3DQA) takes a color point cloud as input and requires both appearance and 3D geometry comprehension to answer 3D-related questions. To this end, we propose a novel transformer-based 3DQA framework, "3DQA-TR", which consists of two encoders for exploiting the appearance and geometry information, respectively. The multi-modal information of appearance, geometry, and the linguistic question can finally attend to each other via a 3D-Linguistic Bert to predict the target answers. To verify the effectiveness of our proposed 3DQA framework, we further develop the first 3DQA dataset, "ScanQA", which builds on the ScanNet dataset and contains $\sim$6K questions and $\sim$30K answers for $806$ scenes. Extensive experiments on this dataset demonstrate the clear superiority of our proposed 3DQA framework over existing VQA frameworks, and the effectiveness of our major designs. Our code and dataset will be made publicly available to facilitate research in this direction.

Submitted 28 November, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

Comments: To Appear at IEEE Transactions on Visualization and Computer Graphics (TVCG) 2022

arXiv:2111.14866 [pdf, other]

doi 10.3847/1538-4357/ac5895

Gravitational Microlensing Rates in Milky Way Globular Clusters

Authors: Fulya Kıroğlu, Newlin C. Weatherford, Kyle Kremer, Claire S. Ye, Giacomo Fragione, Frederic A. Rasio

Abstract: Many recent observational and theoretical studies suggest that globular clusters (GCs) host compact object populations large enough to play dominant roles in their overall dynamical evolution. Yet direct detection, particularly of black holes and neutron stars, remains rare and limited to special cases, such as when these objects reside in close binaries with bright companions. Here we examine the potential of microlensing detections to further constrain these dark populations. Based on state-of-the-art GC models from the CMC Cluster Catalog, we estimate the microlensing event rates for black holes, neutron stars, white dwarfs, and, for comparison, also for M dwarfs in Milky Way GCs, as well as the effects of different initial conditions on these rates. Among compact objects, we find that white dwarfs dominate the microlensing rates, simply because they largely dominate by numbers. We show that microlensing detections are in general more likely in GCs with higher initial densities, especially in clusters that undergo core collapse. We also estimate microlensing rates in the specific cases of M22 and 47 Tuc using our best-fitting models for these GCs. Because their positions on the sky lie near the rich stellar backgrounds of the Galactic bulge and the Small Magellanic Cloud, respectively, these clusters are among the Galactic GCs best-suited for dedicated microlensing surveys. The upcoming 10-year survey with the Rubin Observatory may be ideal for detecting lensing events in GCs.

Submitted 27 February, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

Comments: v2: 17 pages, 5 figures

arXiv:2111.13049 [pdf, other]

doi 10.3847/1538-4365/ac283a

A catalogue of 323 cataclysmic variables from LAMOST DR6

Authors: Yongkang Sun, Zhenghao Cheng, Shuo Ye, Ruobin Ding, Yijiang Peng, Jiawen Zhang, Zhenyan Huo, Wenyuan Cui, Xiaofeng Wang, Jianrong Shi, Jie Lin, Chengyuan Wu, Linlin Li, Shuai Feng, Yang Yu, Xiaoran Ma, Xin Li, Cheng Liu, Ziping Zhang, Zhenzhen Shao

Abstract: In this work, we present a catalog of cataclysmic variables (CVs) identified from the Sixth Data Release (DR6) of the Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST). To single out the CV spectra, we introduce a novel machine-learning algorithm called UMAP to screen out a total of 169,509 H$\alpha$-emission spectra, and obtain a classification accuracy of over 99.6$\%$ on the cross-validation set. We then apply the template matching program PyHammer v2.0 to the LAMOST spectra to obtain the optimal spectral type with metallicity, which helps us identify the chromospherically active stars and potential binary stars among the 169,509 spectra. After visually inspecting all the spectra, we identify 323 CV candidates from the LAMOST database, of which 52 are new. We further discuss the new CV candidates by subtype based on their spectral features, including five DN-subtype objects during outbursts, five NL-subtype objects, and four magnetic CVs (three of AM Her type and one of IP type). We also find two CVs that were previously identified by photometry, and confirm their previous classification with the LAMOST spectra.

Submitted 25 November, 2021; originally announced November 2021.

arXiv:2110.10195 [pdf, other]

Operator-induced structural variable selection for identifying materials genes

Authors: Shengbin Ye, Thomas P. Senftle, Meng Li

Abstract: In the emerging field of materials informatics, a fundamental task is to identify physicochemically meaningful descriptors, or materials genes, which are engineered from primary features and a set of elementary algebraic operators through compositions. Standard practice directly analyzes the high-dimensional candidate predictor space in a linear model; statistical analyses are then substantially hampered by the daunting challenge posed by the astronomically large number of correlated predictors with limited sample size. We formulate this problem as variable selection with operator-induced structure (OIS) and propose a new method to achieve unconventional dimension reduction by utilizing the geometry embedded in OIS. Although the model remains linear, we iterate nonparametric variable selection for effective dimension reduction. This enables variable selection based on ab initio primary features, leading to a method that is orders of magnitude faster than existing methods, with improved accuracy. To select the nonparametric module, we discuss a desired performance criterion that is uniquely induced by variable selection with OIS; in particular, we propose to employ a Bayesian Additive Regression Trees (BART)-based variable selection method. Numerical studies show superiority of the proposed method, which continues to exhibit robust performance when the input dimension is out of reach of existing methods. Our analysis of single-atom catalysis identifies physical descriptors that explain the binding energy of metal-support pairs with high explanatory power, leading to interpretable insights to guide the prevention of a notorious problem called sintering and aid catalysis design.

Submitted 10 November, 2023; v1 submitted 19 October, 2021; originally announced October 2021.

arXiv:2110.05495 [pdf, other]

doi 10.3847/1538-4357/ac5b0b

Compact Object Modeling in the Globular Cluster 47 Tucanae

Authors: Claire S. Ye, Kyle Kremer, Carl L. Rodriguez, Nicholas Z. Rui, Newlin C. Weatherford, Sourav Chatterjee, Giacomo Fragione, Frederic A. Rasio

Abstract: The globular cluster 47 Tucanae (47 Tuc) is one of the most massive star clusters in the Milky Way and is exceptionally rich in exotic stellar populations. For several decades it has been a favorite target of observers, and yet it is computationally very challenging to model because of its large number of stars ($N\gtrsim 10^6$) and high density. Here we present detailed and self-consistent 47 Tuc models computed with the Cluster Monte Carlo code (CMC). The models include all relevant dynamical interactions coupled to stellar and binary evolution, and reproduce various observations, including the surface brightness and velocity dispersion profiles, pulsar accelerations, and numbers of compact objects. We show that the present properties of 47 Tuc are best reproduced by adopting an initial stellar mass function that is both bottom-heavy and top-light relative to standard assumptions (as in, e.g., Kroupa 2001), and an initial Elson profile (Elson et al. 1987) that is overfilling the cluster's tidal radius. We include new prescriptions in CMC for the formation of binaries through giant star collisions and tidal captures, and we show that these mechanisms play a crucial role in the formation of neutron star binaries and millisecond pulsars in 47 Tuc; our best-fit model contains $\sim 50$ millisecond pulsars, $70\%$ of which are formed through giant collisions and tidal captures. Our models also suggest that 47 Tuc presently contains up to $\sim 200$ stellar-mass black holes, $\sim 5$ binary black holes, $\sim 15$ low-mass X-ray binaries, and $\sim 300$ cataclysmic variables.

Submitted 6 June, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

Comments: 23 pages, 15 figures, 4 tables, published at ApJ

arXiv:2110.03215 [pdf, other]

Towards Continual Knowledge Learning of Language Models

Authors: Joel Jang, Seonghyeon Ye, Sohee Yang, Joongbo Shin, Janghoon Han, Gyeonghun Kim, Stanley Jungkyu Choi, Minjoon Seo

Abstract: Large Language Models (LMs) are known to encode world knowledge in their parameters as they pretrain on a vast amount of web corpus, which is often utilized for performing knowledge-dependent downstream tasks such as question answering, fact-checking, and open dialogue. In real-world scenarios, the world knowledge stored in the LMs can quickly become outdated as the world changes, but it is non-trivial to avoid catastrophic forgetting and reliably acquire new knowledge while preserving invariant knowledge. To push the community towards better maintenance of ever-changing LMs, we formulate a new continual learning (CL) problem called Continual Knowledge Learning (CKL). We construct a new benchmark and metric to quantify the retention of time-invariant world knowledge, the update of outdated knowledge, and the acquisition of new knowledge. We adopt applicable recent methods from literature to create several strong baselines. Through extensive experiments, we find that CKL exhibits unique challenges that are not addressed in previous CL setups, where parameter expansion is necessary to reliably retain and learn knowledge simultaneously. By highlighting the critical causes of knowledge forgetting, we show that CKL is a challenging and important problem that helps us better understand and train ever-changing LMs. The benchmark datasets, evaluation script, and baseline code to reproduce our results are available at https://github.com/joeljang/continual-knowledge-learning.

Submitted 24 May, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

Comments: published at ICLR 2022

arXiv:2110.02401 [pdf, other]

2D score based estimation of heterogeneous treatment effects

Authors: Steven Siwei Ye, Yanzhen Chen, Oscar Hernan Madrid Padilla

Abstract: Statisticians show growing interest in estimating and analyzing heterogeneity in causal effects in observational studies. However, there usually exists a trade-off between accuracy and interpretability when developing a desirable estimator for treatment effects, especially when there are a large number of features in estimation. To address this issue, we propose a score-based framework for estimating the Conditional Average Treatment Effect (CATE) function in this paper. The framework integrates two components: (i) leveraging the joint use of propensity and prognostic scores in a matching algorithm to obtain a proxy of the heterogeneous treatment effects for each observation, and (ii) utilizing non-parametric regression trees to construct an estimator for the CATE function conditioning on the two scores. The method naturally stratifies treatment effects into subgroups over a 2D grid whose axes are the propensity and prognostic scores. We conduct benchmark experiments on multiple simulated datasets and demonstrate clear advantages of the proposed estimator over state-of-the-art methods. We also evaluate empirical performance in real-life settings, using two observational datasets from a clinical trial and a complex social survey, and interpret policy implications following the numerical results.

Submitted 23 June, 2023; v1 submitted 5 October, 2021; originally announced October 2021.

arXiv:2109.05941 [pdf, other]

Efficient Contrastive Learning via Novel Data Augmentation and Curriculum Learning

Authors: Seonghyeon Ye, Jiseon Kim, Alice Oh

Abstract: We introduce EfficientCL, a memory-efficient continual pretraining method that applies contrastive learning with novel data augmentation and curriculum learning. For data augmentation, we stack two types of operation sequentially: cutoff and PCA jittering. As pretraining proceeds, we apply curriculum learning by incrementing the augmentation degree for each difficulty step. After data augmentation is finished, contrastive learning is applied on projected embeddings of original and augmented examples. When finetuned on the GLUE benchmark, our model outperforms baseline models, especially for sentence-level tasks. Additionally, this improvement is achieved with only 70% of the computational memory used by the baseline model.

Submitted 18 October, 2021; v1 submitted 10 September, 2021; originally announced September 2021.

Comments: EMNLP 2021

arXiv:2108.10161 [pdf]

doi 10.3847/1538-4357/ac0af1

Statistical Study on Spatial Distribution and Polarization of Saturn Narrowband Emissions

Authors: Siyuan Wu, Shengyi Ye, Georg Fischer, Jian Wang, Minyi Long, John D. Menietti, Baptiste Cecconi, William S. Kurth

Abstract: The spatial distribution and polarization of Saturn narrowband (NB) emissions have been studied by using Cassini Radio and Plasma Wave Sciences data and goniopolarimetric data obtained through an inversion algorithm with a preset source located at the center of Saturn. From 2004 January 1 to 2017 September 12, NB emissions were selected automatically by a computer program and rechecked manually. The spatial distribution shows a preference for high latitude and intensity peaks in the region within 6 Saturn radii for both 5 and 20 kHz NB emissions. 5 kHz NB emissions also show a local time preference roughly in the 18:00-22:00 sector. The Enceladus plasma torus makes it difficult for NB emissions to propagate to the low latitude regions outside the plasma torus. The extent of the low latitude regions where 5 and 20 kHz NB emissions were never observed is consistent with the corresponding plasma torus density contour in the meridional plane. 20 kHz NB emissions show high circular polarization, while 5 kHz NB emissions are less circularly polarized, with |V|<0.6 for the majority of cases. Cases of 5 kHz NB emissions with high circular polarization are more frequently observed at high latitude, especially at the northern and southern edges of the Enceladus plasma torus.

Submitted 16 August, 2021; originally announced August 2021.

Comments: accepted for publication in The Astrophysical Journal

arXiv:2107.14230 [pdf, other]

Learning with Noisy Labels for Robust Point Cloud Segmentation

Authors: Shuquan Ye, Dongdong Chen, Songfang Han, Jing Liao

Abstract: Point cloud segmentation is a fundamental task in 3D. Despite recent progress on point cloud segmentation with the power of deep networks, current deep learning methods based on the clean label assumptions may fail with noisy labels. Yet, object class labels are often mislabeled in real-world point cloud datasets. In this work, we take the lead in solving this issue by proposing a novel Point Noise-Adaptive Learning (PNAL) framework. Compared to existing noise-robust methods on image tasks, our PNAL is noise-rate blind, to cope with the spatially variant noise rate problem specific to point clouds. Specifically, we propose a novel point-wise confidence selection to obtain reliable labels based on the historical predictions of each point. A novel cluster-wise label correction is proposed with a voting strategy to generate the best possible label taking the neighbor point correlations into consideration. We conduct extensive experiments to demonstrate the effectiveness of PNAL on both synthetic and real-world noisy datasets. In particular, even with $60\%$ symmetric noisy labels, our proposed method produces much better results than its baseline counterpart without PNAL and is comparable to the ideal upper bound trained on a completely clean dataset. Moreover, we fully re-labeled the validation set of a popular but noisy real-world scene dataset ScanNetV2 to make it clean, for rigorous experiment and future research. Our code and data are available at https://shuquanye.com/PNAL_website/.

Submitted 5 August, 2021; v1 submitted 29 July, 2021; originally announced July 2021.

Comments: Typos fixed. ICCV 2021 Oral, Relabeled ScanNetV2 and code are available at https://shuquanye.com/PNAL_website/

arXiv:2107.13717 [pdf, other]

Maximize the Foot Clearance for a Hopping Robotic Leg Considering Motor Saturation

Authors: Juntong Su, Bingchen Jin, Shusheng Ye, Lecheng Ruan, Caiming Sun, Ning Ding, Yili Fu, Jianwen Luo

Abstract: A hopping leg, whether in legged animals or humans, usually behaves like a spring during periodic hopping. Hopping like a spring is efficient and does not require complicated control algorithms. Position and force control are the two main methods to realize such spring-like behaviour. Position control usually consumes torque resources to ensure position accuracy and compensate for tracking errors. In comparison, the force control strategy is able to maintain high elasticity. Currently, both position and force control reduce the motor saturation ratio as well as the bandwidth of the control system, and thus attenuate the performance of the actuator. To augment the performance, this letter proposes a motor saturation strategy based on force control to maximize the output torque of the actuator and realize continuous hopping motion with natural dynamics. The proposed strategy is able to maximize the saturation ratio of the motor and thus maximize the foot clearance of the single leg. The dynamics of the two-mass model are utilized to increase the force bandwidth and the performance of the actuator. A single leg with two degrees of freedom is designed as the experiment platform. The actuator consists of a powerful electric motor, a harmonic gear, and an encoder. The effectiveness of this method is verified through simulations and experiments using a robotic leg actuated by powerful high-reduction-ratio actuators.

Submitted 29 July, 2021; v1 submitted 28 July, 2021; originally announced July 2021.

arXiv:2107.05539 [pdf]

doi 10.1088/1748-0221/16/09/P09012

Towards GaAs Thin-Film Tracking Detectors

Authors: Victor Rangel-Kuoppa, Sheng Ye, Yasir J Noori, William Holmkvist, Robert J Young, Daniel Muenstermann

Abstract: Silicon-based tracking detectors have been used in several important applications, such as in cancer therapy using particle beams, and for the discovery of new elementary particles at the Large Hadron Collider at CERN. III-V semiconductor materials are an attractive alternative to silicon for this application, as they have some superior physical properties. They could meet the demands for fast timing detectors allowing time-of-flight measurements with ps resolution while being radiation tolerant and cost-efficient. As a material with a larger density, higher atomic number Z and much higher electron mobility than silicon, GaAs exhibits faster signal collection and a larger signal per μm of sensor thickness. In this work, we report on the fabrication of n-in-n GaAs thin-film devices intended to serve next-generation high-energy particle tracking detectors. Molecular beam epitaxy (MBE) was used to grow high-quality GaAs films with doping levels sufficiently low to achieve full depletion for detectors with an active thickness of 10 μm. The signal collection speed of the detector structures was assessed using the transient current technique (TCT). To elucidate the structural properties of the detector, Kelvin probe force microscopy (KPFM) was used, which confirmed the formation of the junction in the detector and revealed residual doping in the intrinsic layer. Our results suggest that GaAs thin films are suitable candidates to achieve thin and radiation-tolerant tracking detectors.

Submitted 13 July, 2021; v1 submitted 12 July, 2021; originally announced July 2021.

Comments: 15 pages, 12 figures

arXiv:2107.03642 [pdf]

Image restoration quality assessment based on regional differential information entropy

Authors: Zhiyu Wang, Jiayan Zhuang, Ningyuan Xu, Sichao Ye, Jiangjian Xiao, Chengbin Peng

Abstract: With the development of image recovery models, especially those based on adversarial and perceptual losses, the detailed texture portions of images are being recovered more naturally. However, these restored images are similar but not identical in detail texture to their reference images. With traditional image quality assessment methods, results with better subjective perceived quality often score lower in objective scoring. Assessment methods thus suffer from inconsistencies between subjective and objective results. This paper proposes a regional differential information entropy (RDIE) method for image quality assessment to address this problem. This approach allows better assessment of similar but not identical textural details and achieves good agreement with perceived quality. Neural networks are used to reshape the process of calculating information entropy, improving the speed and efficiency of the operation. Experiments conducted with this study's image quality assessment dataset and the PIPAL dataset show that the proposed RDIE method yields a high degree of agreement with people's average opinion scores compared to other image quality assessment metrics, indicating that RDIE can better quantify the perceived quality of images.

Submitted 26 November, 2022; v1 submitted 8 July, 2021; originally announced July 2021.

Comments: 14 pages, 8 figures, 5 tables

arXiv:2106.02643 [pdf, other]

doi 10.3847/1538-4365/ac2edf

Modeling Dense Star Clusters in the Milky Way and Beyond with the Cluster Monte Carlo Code

Authors: Carl L. Rodriguez, Newlin C. Weatherford, Scott C. Coughlin, Pau Amaro Seoane, Katelyn Breivik, Sourav Chatterjee, Giacomo Fragione, Fulya Kıroğlu, Kyle Kremer, Nicholas Z. Rui, Claire S. Ye, Michael Zevin, Frederic A. Rasio

Abstract: We describe the public release of the Cluster Monte Carlo Code (CMC), a parallel, star-by-star $N$-body code for modeling dense star clusters. CMC treats collisional stellar dynamics using Hénon's method, where the cumulative effect of many two-body encounters is statistically reproduced as a single effective encounter between nearest-neighbor particles on a relaxation timescale. The star-by-star approach allows for the inclusion of additional physics, including strong gravitational three- and four-body encounters, two-body tidal and gravitational-wave captures, mass loss in arbitrary galactic tidal fields, and stellar evolution for both single and binary stars. The public release of CMC is pinned directly to the COSMIC population synthesis code, allowing dynamical star cluster simulations and population synthesis studies to be performed using identical assumptions about the stellar physics and initial conditions. As a demonstration, we present two examples of star cluster modeling: first, we perform the largest ($N = 10^8$) star-by-star $N$-body simulation of a Plummer sphere evolving to core collapse, reproducing the expected self-similar density profile over more than 15 orders of magnitude; second, we generate realistic models for typical globular clusters, and we show that their dynamical evolution can produce significant numbers of black hole mergers with masses greater than those produced from isolated binary evolution (such as GW190521, a recently reported merger with component masses in the pulsational pair-instability mass gap).

Submitted 11 October, 2021; v1 submitted 4 June, 2021; originally announced June 2021.

Comments: Code is available at https://clustermontecarlo.github.io/; 25 pages, 8 figures; matches version accepted by ApJS

arXiv:2105.07926 [pdf, other]

Towards Robust Vision Transformer

Authors: Xiaofeng Mao, Gege Qi, Yuefeng Chen, Xiaodan Li, Ranjie Duan, Shaokai Ye, Yuan He, Hui Xue

Abstract: Recent advances on Vision Transformer (ViT) and its improved variants have shown that self-attention-based networks surpass traditional Convolutional Neural Networks (CNNs) in most vision tasks. However, existing ViTs focus on the standard accuracy and computation cost, lacking the investigation of the intrinsic influence on model robustness and generalization. In this work, we conduct systematic evaluation on components of ViTs in terms of their impact on robustness to adversarial examples, common corruptions and distribution shifts. We find some components can be harmful to robustness. By using and combining robust components as building blocks of ViTs, we propose Robust Vision Transformer (RVT), which is a new vision transformer and has superior performance with strong robustness. We further propose two new plug-and-play techniques called position-aware attention scaling and patch-wise augmentation to augment our RVT, which we abbreviate as RVT*. The experimental results on ImageNet and six robustness benchmarks show the advanced robustness and generalization ability of RVT compared with previous ViTs and state-of-the-art CNNs. Furthermore, RVT-S* also achieves Top-1 rank on multiple robustness leaderboards including ImageNet-C and ImageNet-Sketch. The code will be available at https://github.com/alibaba/easyrobust.

Submitted 23 May, 2022; v1 submitted 17 May, 2021; originally announced May 2021.

Comments: Accepted to CVPR 2022, https://github.com/alibaba/easyrobust

arXiv:2104.14559 [pdf, other]

Exemplar-Based 3D Portrait Stylization

Authors: Fangzhou Han, Shuquan Ye, Mingming He, Menglei Chai, Jing Liao

Abstract: Exemplar-based portrait stylization is widely attractive and highly desired. Despite recent successes, it remains challenging, especially when considering both texture and geometric styles. In this paper, we present the first framework for one-shot 3D portrait style transfer, which can generate 3D face models with both the geometry exaggerated and the texture stylized while preserving the identity from the original content. It requires only one arbitrary style image instead of a large set of training examples for a particular style, provides geometry and texture outputs that are fully parameterized and disentangled, and enables further graphics applications with the 3D representations. The framework consists of two stages. In the first geometric style transfer stage, we use facial landmark translation to capture the coarse geometry style and guide the deformation of the dense 3D face geometry. In the second texture style transfer stage, we focus on performing style transfer on the canonical texture by adopting a differentiable renderer to optimize the texture in a multi-view framework. Experiments show that our method achieves robustly good results on different artistic styles and outperforms existing methods. We also demonstrate the advantages of our method via various 2D and 3D graphics applications. Project page is https://halfjoe.github.io/projs/3DPS/index.html.

Submitted 29 April, 2021; originally announced April 2021.

Comments: Project page: https://halfjoe.github.io/projs/3DPS/index.html

arXiv:2104.11751 [pdf, other]

doi 10.3847/1538-4357/ac06d4

White Dwarf Subsystems in Core-Collapsed Globular Clusters

Authors: Kyle Kremer, Nicholas Z. Rui, Newlin C. Weatherford, Sourav Chatterjee, Giacomo Fragione, Frederic A. Rasio, Carl L. Rodriguez, Claire S. Ye

Abstract: Numerical and observational evidence suggests that massive white dwarfs dominate the innermost regions of core-collapsed globular clusters by both number and total mass. Using NGC 6397 as a test case, we constrain the features of white dwarf populations in core-collapsed clusters, both at present day and throughout their lifetimes. The dynamics of these white dwarf subsystems have a number of astrophysical implications. We demonstrate that the collapse of globular cluster cores is ultimately halted by the dynamical burning of white dwarf binaries. We predict core-collapsed clusters in the local universe yield a white dwarf merger rate of $\mathcal{O}(10)\,\rm{Gpc}^{-3}\,\rm{yr}^{-1}$, roughly $0.1-1\%$ of the observed Type Ia supernova rate. We show that prior to merger, inspiraling white dwarf binaries will be observable as gravitational wave sources at milli- and decihertz frequencies. Over $90\%$ of these mergers have a total mass greater than the Chandrasekhar limit. If the merger/collision remnants are not destroyed completely in an explosive transient, we argue the remnants may be observed in core-collapsed clusters as either young neutron stars/pulsars/magnetars (in the event of accretion-induced collapse) or as young massive white dwarfs offset from the standard white dwarf cooling sequence. Finally, we show collisions between white dwarfs and main sequence stars, which may be detectable as bright transients, occur at a rate of $\mathcal{O}(100)\,\rm{Gpc}^{-3}\,\rm{yr}^{-1}$ in the local universe. We find that these collisions lead to depletion of blue straggler stars and main sequence star binaries in the centers of core-collapsed clusters.

Submitted 21 May, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

Comments: Submitted to ApJ, 26 pages, 12 figures, 3 tables. Comments welcome

arXiv:2104.05439 [pdf, other]

Tensor Network for Supervised Learning at Finite Temperature

Authors: Haoxiang Lin, Shuqian Ye, Xi Zhu

Abstract: The large variation of datasets is a huge barrier for image classification tasks. In this paper, we embrace this observation and introduce the finite temperature tensor network (FTTN), which imports thermal perturbation into the matrix product states framework by placing all images in an environment with constant temperature, in analogy to energy-based learning. A tensor network is chosen since it is the best platform to introduce thermal fluctuation. Unlike traditional network structures, which directly take the summation of individual losses as the loss function, FTTN regards the loss as a thermal average computed from the entanglement with the environment. The temperature-like parameter can be automatically optimized, which gives each database an individual temperature. FTTN obtains improvement in both test accuracy and convergence speed on several datasets. The non-zero temperature automatically separates similar features, avoiding the wrong classification seen in previous architectures. The thermal fluctuation may give a better improvement in other frameworks, and we may also use the temperature of the database to improve the training effect.

Submitted 9 April, 2021; originally announced April 2021.

Comments: Video and slide are available on https://tensorworkshop.github.io/2020/program.html

arXiv:2104.00848 [pdf, other]

SDAN: Squared Deformable Alignment Network for Learning Misaligned Optical Zoom

Authors: Kangfu Mei, Shenglong Ye, Rui Huang

Abstract: Deep Neural Network (DNN) based super-resolution algorithms have greatly improved the quality of the generated images. However, these algorithms often yield significant artifacts when dealing with real-world super-resolution problems due to the difficulty in learning misaligned optical zoom. In this paper, we introduce a Squared Deformable Alignment Network (SDAN) to address this issue. Our network learns squared per-point offsets for convolutional kernels, and then aligns features in corrected convolutional windows based on the offsets. So the misalignment will be minimized by the extracted aligned features. Different from the per-point offsets used in the vanilla Deformable Convolutional Network (DCN), our proposed squared offsets not only accelerate the offset learning but also improve the generation quality with fewer parameters. Besides, we further propose an efficient cross packing attention layer to boost the accuracy of the learned offsets. It leverages the packing and unpacking operations to enlarge the receptive field of the offset learning and to enhance the ability of extracting the spatial connection between the low-resolution images and the referenced images. Comprehensive experiments show the superiority of our method over other state-of-the-art methods in both computational efficiency and realistic details.

Submitted 25 November, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

Comments: ICME21. Code is available at https://github.com/MKFMIKU/SDAN

arXiv:2103.14950 [pdf, other]

The AI Settlement Generation Challenge in Minecraft: First Year Report

Authors: Christoph Salge, Michael Cerny Green, Rodrigo Canaan, Filip Skwarski, Rafael Fritsch, Adrian Brightmoore, Shaofang Ye, Changxing Cao, Julian Togelius

Abstract: This article outlines what we learned from the first year of the AI Settlement Generation Competition in Minecraft, a competition about producing AI programs that can generate interesting settlements in Minecraft for an unseen map. This challenge seeks to focus research into adaptive and holistic procedural content generation. Generating Minecraft towns and villages given existing maps is a suitable task for this, as it requires the generated content to be adaptive, functional, evocative and aesthetic at the same time. Here, we present the results from the first iteration of the competition. We discuss the evaluation methodology, present the different technical approaches by the competitors, and outline the open problems.

Submitted 27 March, 2021; originally announced March 2021.

Comments: 14 pages, 9 figures, published in KI-Künstliche Intelligenz

Journal ref: KI-Künstliche Intelligenz 2020

arXiv:2103.14528 [pdf, other]

Model-based Reconstruction with Learning: From Unsupervised to Supervised and Beyond

Authors: Zhishen Huang, Siqi Ye, Michael T. McCann, Saiprasad Ravishankar

Abstract: Many techniques have been proposed for image reconstruction in medical imaging that aim to recover high-quality images especially from limited or corrupted measurements. Model-based reconstruction methods have been particularly popular (e.g., in magnetic resonance imaging and tomographic modalities) and exploit models of the imaging system's physics together with statistical models of measurements, noise and often relatively simple object priors or regularizers. For example, sparsity or low-rankness based regularizers have been widely used for image reconstruction from limited data such as in compressed sensing. Learning-based approaches for image reconstruction have garnered much attention in recent years and have shown promise across biomedical imaging applications. These methods include synthesis dictionary learning, sparsifying transform learning, and different forms of deep learning involving complex neural networks. We briefly discuss classical model-based reconstruction methods and then review reconstruction methods at the intersection of model-based and learning-based paradigms in detail. This review includes many recent methods based on unsupervised learning, and supervised learning, as well as a framework to combine multiple types of learned models together.

Submitted 26 March, 2021; originally announced March 2021.

arXiv:2103.06504 [pdf, other]

Adversarial Laser Beam: Effective Physical-World Attack to DNNs in a Blink

Authors: Ranjie Duan, Xiaofeng Mao, A. K. Qin, Yun Yang, Yuefeng Chen, Shaokai Ye, Yuan He

Abstract: Though it is well known that the performance of deep neural networks (DNNs) degrades under certain light conditions, there exists no study on the threats of light beams emitted from some physical source as adversarial attacker on DNNs in a real-world scenario. In this work, we show by simply using a laser beam that DNNs are easily fooled. To this end, we propose a novel attack method called Adversarial Laser Beam ($AdvLB$), which enables manipulation of laser beam's physical parameters to perform adversarial attack. Experiments demonstrate the effectiveness of our proposed approach in both digital- and physical-settings. We further empirically analyze the evaluation results and reveal that the proposed laser beam attack may lead to some interesting prediction errors of the state-of-the-art DNNs. We envisage that the proposed $AdvLB$ method enriches the current family of adversarial attacks and builds the foundation for future robustness studies for light.

Submitted 11 March, 2021; originally announced March 2021.

Comments: Accepted to CVPR2021

arXiv:2103.06273 [pdf, other]

No Black Holes in NGC 6397

Authors: Nicholas Z. Rui, Newlin Weatherford, Kyle Kremer, Sourav Chatterjee, Giacomo Fragione, Frederic A. Rasio, Carl L. Rodriguez, Claire S. Ye

Abstract: Recently, Vitral & Mamon (2021) detected a central concentration of dark objects in the core-collapsed globular cluster NGC 6397, which could be interpreted as a subcluster of stellar-mass black holes. However, it is well established theoretically that any significant number of black holes in the cluster would provide strong dynamical heating and is fundamentally inconsistent with this cluster's core-collapsed profile. Claims of intermediate-mass black holes in core-collapsed clusters should similarly be treated with suspicion, for reasons that have been understood theoretically for many decades. Instead, the central dark population in NGC 6397 is exactly accounted for by a compact subsystem of white dwarfs, as we demonstrate here by inspection of a previously published model that provides a good fit to this cluster. These central subclusters of heavy white dwarfs are in fact a generic feature of core-collapsed clusters, while central black hole subclusters are present in all non-collapsed clusters.

Submitted 10 March, 2021; originally announced March 2021.

Comments: 3 pages, 1 figure, to submit to AAS journals

arXiv:2103.06094 [pdf]

doi 10.1038/s41567-022-01534-x

Particle-hole asymmetric superconducting coherence peaks in overdoped cuprates

Authors: Changwei Zou, Zhenqi Hao, Xiangyu Luo, Shusen Ye, Qiang Gao, Xintong Li, Miao Xu, Peng Cai, Chengtian Lin, Xingjiang Zhou, Dung-Hai Lee, Yayu Wang

Abstract: To elucidate the superconductor to metal transition at the end of the superconducting dome, the overdoped regime has recently moved to the center stage of cuprate research. Here, we use scanning tunneling microscopy to investigate the atomic-scale electronic structure of overdoped trilayer Bi-2223 and bilayer Bi-2212 cuprates. At low energies the spectroscopic maps are well described by dispersive quasiparticle interference patterns. However, as the bias increases to the superconducting coherence peak energy, a virtually non-dispersive pattern with $\sqrt{2}\times\sqrt{2}$ periodicity emerges. Remarkably, the position of the coherence peaks exhibits evident particle-hole asymmetry which also modulates with the same period. We propose that this is an extreme quasiparticle interference phenomenon, caused by pairing-breaking scattering between flat anti-nodal Bogoliubov bands, which is ultimately responsible for the superconductor to metal transition.

Submitted 10 March, 2021; originally announced March 2021.

Comments: 15 pages, 4 figures

arXiv:2103.05033 [pdf, other]

doi 10.3847/1538-4357/abed49

Matching Globular Cluster Models to Observations

Authors: Nicholas Z. Rui, Kyle Kremer, Newlin C. Weatherford, Sourav Chatterjee, Frederic A. Rasio, Carl L. Rodriguez, Claire S. Ye

Abstract: As ancient, gravitationally bound stellar populations, globular clusters are abundant, vibrant laboratories characterized by high frequencies of dynamical interactions coupled to complex stellar evolution. Using surface brightness and velocity dispersion profiles from the literature, we fit $59$ Milky Way globular clusters to dynamical models from the CMC Cluster Catalog. Without doing any interpolation, and without any directed effort to fit any particular cluster, $26$ globular clusters are well-matched by at least one of our models. We discuss in particular the core-collapsed clusters NGC 6293, NGC 6397, NGC 6681, and NGC 6624, and the non-core-collapsed clusters NGC 288, NGC 4372, and NGC 5897. As NGC 6624 lacks well-fitting snapshots on the main CMC Cluster Catalog, we run six additional models in order to refine the fit. We calculate metrics for mass segregation, explore the production of compact object sources such as millisecond pulsars, cataclysmic variables, low-mass X-ray binaries, and stellar-mass black holes, finding reasonable agreement with observations. Additionally, closely mimicking observational cuts, we extract the binary fraction from our models, finding good agreement except in the dense core regions of core-collapsed clusters. Accompanying this paper are a number of Python methods for examining the publicly accessible CMC Cluster Catalog, as well as any other models generated using CMC.

Submitted 8 March, 2021; originally announced March 2021.

Comments: 22 pages, 11 figures, 5 tables; accepted to ApJ

arXiv:2103.02927 [pdf, other]

QAIR: Practical Query-efficient Black-Box Attacks for Image Retrieval

Authors: Xiaodan Li, Jinfeng Li, Yuefeng Chen, Shaokai Ye, Yuan He, Shuhui Wang, Hang Su, Hui Xue

Abstract: We study the query-based attack against image retrieval to evaluate its robustness against adversarial examples under the black-box setting, where the adversary only has query access to the top-k ranked unlabeled images from the database. Compared with query attacks in image classification, which produce adversaries according to the returned labels or confidence score, the challenge becomes even more prominent due to the difficulty in quantifying the attack effectiveness on the partial retrieved list. In this paper, we make the first attempt in Query-based Attack against Image Retrieval (QAIR), to completely subvert the top-k retrieval results. Specifically, a new relevance-based loss is designed to quantify the attack effects by measuring the set similarity on the top-k retrieval results before and after attacks and guide the gradient optimization. To further boost the attack efficiency, a recursive model stealing method is proposed to acquire transferable priors on the target model and generate the prior-guided gradients. Comprehensive experiments show that the proposed attack achieves a high attack success rate with few queries against the image retrieval systems under the black-box setting. The attack evaluations on a real-world visual search engine show that it successfully deceives a commercial system such as Bing Visual Search with a 98% attack success rate using only 33 queries on average.

Submitted 23 March, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

arXiv:2102.08799 [pdf]

Imaging the atomic-scale electronic states induced by a pair of hole dopants in Ca2CuO2Cl2 Mott insulator

Authors: Haiwei Li, Shusen Ye, Jianfa Zhao, Changqing Jin, Yayu Wang

Abstract: We use scanning tunneling microscopy to visualize the atomic-scale electronic states induced by a pair of hole dopants in the Ca2CuO2Cl2 parent Mott insulator of cuprates. We find that when the two dopants approach each other, the transfer of spectral weight from the high energy Hubbard band to the low energy in-gap state creates a broad peak and a nearly V-shaped gap around the Fermi level. The peak position shows a sudden drop at a distance of around 4 a0 and then remains almost constant. The in-gap states exhibit peculiar spatial distributions depending on the configuration of the two dopants relative to the underlying Cu lattice. These results shed important new light on the evolution of low energy electronic states when a few holes are doped into parent cuprates.

Submitted 17 February, 2021; originally announced February 2021.

Comments: 12 pages, 4 figures

Journal ref: Science Bulletin 66, 1395-1400 (2021)

arXiv:2102.04317 [pdf, other]

Meta-PU: An Arbitrary-Scale Upsampling Network for Point Cloud

Authors: Shuquan Ye, Dongdong Chen, Songfang Han, Ziyu Wan, Jing Liao

Abstract: Point cloud upsampling is vital for the quality of the mesh in three-dimensional reconstruction. Recent research on point cloud upsampling has achieved great success due to the development of deep learning. However, the existing methods regard point cloud upsampling of different scale factors as independent tasks. Thus, the methods need to train a specific model for each scale factor, which is both inefficient and impractical for storage and computation in real applications. To address this limitation, in this work, we propose a novel method called "Meta-PU", the first to support point cloud upsampling of arbitrary scale factors with a single model. In the Meta-PU method, besides the backbone network consisting of residual graph convolution (RGC) blocks, a meta-subnetwork is learned to adjust the weights of the RGC blocks dynamically, and a farthest sampling block is adopted to sample different numbers of points. Together, these two blocks enable our Meta-PU to continuously upsample the point cloud with arbitrary scale factors by using only a single model. In addition, the experiments reveal that training on multiple scales simultaneously is mutually beneficial. Thus, Meta-PU even outperforms the existing methods trained for a specific scale factor only.

Submitted 8 February, 2021; originally announced February 2021.

Comments: To appear at TVCG

arXiv:2102.02995 [pdf]

Application of Deep Learning in Recognizing Bates Numbers and Confidentiality Stamping from Images

Authors: Christian J. Mahoney, Katie Jensen, Fusheng Wei, Haozhen Zhao, Han Qin, Shi Ye

Abstract: In eDiscovery, it is critical to ensure that each page produced in legal proceedings conforms with the requirements of court or government agency production requests. Errors in productions could have severe consequences in a case, putting a party in an adverse position. The volume of pages produced continues to increase, and tremendous time and effort have been spent to ensure quality control of document productions. This has historically been a manual and laborious process. This paper demonstrates a novel automated production quality control application which leverages deep learning-based image recognition technology to extract Bates Number and Confidentiality Stamping from legal case production images and validate their correctness. Effectiveness of the method is verified with an experiment using real-world production data.

Submitted 4 February, 2021; originally announced February 2021.

Comments: 2020 IEEE International Conference on Big Data (Big Data)

arXiv:2102.01897 [pdf, other]

Automatic Segmentation of Organs-at-Risk from Head-and-Neck CT using Separable Convolutional Neural Network with Hard-Region-Weighted Loss

Authors: Wenhui Lei, Haochen Mei, Zhengwentai Sun, Shan Ye, Ran Gu, Huan Wang, Rui Huang, Shichuan Zhang, Shaoting Zhang, Guotai Wang

Abstract: Nasopharyngeal Carcinoma (NPC) is a leading form of Head-and-Neck (HAN) cancer in the Arctic, China, Southeast Asia, and the Middle East/North Africa. Accurate segmentation of Organs-at-Risk (OAR) from Computed Tomography (CT) images with uncertainty information is critical for effective planning of radiation therapy for NPC treatment. Despite the state-of-the-art performance achieved by Convolutional Neural Networks (CNNs) for automatic segmentation of OARs, existing methods do not provide uncertainty estimation of the segmentation results for treatment planning, and their accuracy is still limited by several factors, including the low contrast of soft tissues in CT, highly imbalanced sizes of OARs and large inter-slice spacing. To address these problems, we propose a novel framework for accurate OAR segmentation with reliable uncertainty estimation. First, we propose a Segmental Linear Function (SLF) to transform the intensity of CT images to make multiple organs more distinguishable than existing methods based on a simple window width/level that often gives a better visibility of one organ while hiding the others. Second, to deal with the large inter-slice spacing, we introduce a novel 2.5D network (named 3D-SepNet) specially designed for dealing with clinical HAN CT scans with anisotropic spacing. Third, existing hardness-aware loss functions often deal with class-level hardness, but our proposed attention to hard voxels (ATH) uses a voxel-level hardness strategy, which is better suited to hard regions even when the corresponding class is easy. Our code is now available at https://github.com/HiLab-git/SepNet.

Submitted 3 February, 2021; originally announced February 2021.

Comments: Accepted by Neurocomputing

arXiv:2101.11254 [pdf, other]

Automatic Segmentation of Gross Target Volume of Nasopharynx Cancer using Ensemble of Multiscale Deep Neural Networks with Spatial Attention

Authors: Haochen Mei, Wenhui Lei, Ran Gu, Shan Ye, Zhengwentai Sun, Shichuan Zhang, Guotai Wang

Abstract: Radiotherapy is the main treatment modality for nasopharynx cancer. Delineation of Gross Target Volume (GTV) from medical images such as CT and MRI images is a prerequisite for radiotherapy. As manual delineation is time-consuming and laborious, automatic segmentation of GTV has the potential to improve this process. Currently, most of the deep learning-based automatic delineation methods of GTV are mainly performed on medical images like CT images. However, this is challenged by the low contrast between the pathology regions and surrounding soft tissues, the small target region, and the anisotropic resolution of clinical CT images. To deal with these problems, we propose a 2.5D Convolutional Neural Network (CNN) to handle the difference between in-plane and through-plane resolution. Furthermore, we propose a spatial attention module to enable the network to focus on small targets, and use channel attention to further improve the segmentation performance. Moreover, we use a multi-scale sampling method for training so that the networks can learn features at different scales, which are combined with a multi-model ensemble method to improve the robustness of segmentation results. We also estimate the uncertainty of segmentation results based on our model ensemble, which is of great importance for indicating the reliability of automatic segmentation results for radiotherapy planning.

Submitted 27 January, 2021; originally announced January 2021.

arXiv:2101.08902 [pdf, ps, other]

Length functions on groups and rigidity

Authors: Shengkui Ye

Abstract: Let $G$ be a group. A function $l:G\rightarrow [0,\infty)$ is called a length function if (1) $l(g^{n})=|n|l(g)$ for any $g\in G$ and $n\in \mathbb{Z}$; (2) $l(hgh^{-1})=l(g)$ for any $h,g\in G$; and (3) $l(ab)\leq l(a)+l(b)$ for commuting elements $a,b$. Such length functions exist in many branches of mathematics, mainly as stable word lengths, stable norms, smooth measure-theoretic entropy, translation lengths on $\mathrm{CAT}(0)$ spaces and Gromov $\delta$-hyperbolic spaces, stable norms of quasi-cocycles, rotation numbers of circle homeomorphisms, dynamical degrees of birational maps and so on. We study length functions on Lie groups, Gromov hyperbolic groups, arithmetic subgroups, matrix groups over rings and Cremona groups. As applications, we prove that every group homomorphism from an arithmetic subgroup of a simple algebraic $\mathbb{Q}$-group of $\mathbb{Q}$-rank at least $2$, or a finite-index subgroup of the elementary group $E_{n}(R)$ $(n\geq 3)$ over an associative ring, or the Cremona group $\mathrm{Bir}(P_{\mathbb{C}}^{2})$ to any group $G$ having a purely positive length function must have its image finite. Here $G$ can be the outer automorphism group $\mathrm{Out}(F_{n})$ of free groups, the mapping class group $\mathrm{MCG}(\Sigma_{g})$, $\mathrm{CAT}(0)$ groups or Gromov hyperbolic groups, or the group $\mathrm{Diff}(\Sigma,\omega)$ of diffeomorphisms of a hyperbolic closed surface preserving an area form $\omega$.

Submitted 10 January, 2023; v1 submitted 21 January, 2021; originally announced January 2021.

Comments: some typos are corrected. Final version, to appear in the Beyond Hyperbolicity/ Artin Groups, CAT(0) geometry and related topics Conference Proceedings

arXiv:2101.07793 [pdf, other]

doi 10.3847/2515-5172/abdf54

The Observed Rate of Binary Black Hole Mergers can be Entirely Explained by Globular Clusters

Authors: Carl L. Rodriguez, Kyle Kremer, Sourav Chatterjee, Giacomo Fragione, Abraham Loeb, Frederic A. Rasio, Newlin C. Weatherford, Claire S. Ye

Abstract: Since the first signal in 2015, the gravitational-wave detections of merging binary black holes (BBHs) by the LIGO and Virgo collaborations (LVC) have completely transformed our understanding of the lives and deaths of compact object binaries, and have motivated an enormous amount of theoretical work on the astrophysical origin of these objects. We show that the phenomenological fit to the redshift-dependent merger rate of BBHs from Abbott et al. (2020) is consistent with a purely dynamical origin for these objects, and that the current merger rate of BBHs from the LVC could be explained entirely with globular clusters alone. While this does not prove that globular clusters are the dominant formation channel, we emphasize that many formation scenarios could contribute a significant fraction of the current LVC rate, and that any analysis that assumes a single (or dominant) mechanism for producing BBH mergers is implicitly using a specious astrophysical prior.

Submitted 27 January, 2021; v1 submitted 20 January, 2021; originally announced January 2021.

Comments: 4 pages, one figure, matches version in RNAAS

Journal ref: Res. Notes AAS 5 19 (2021)

arXiv:2101.06598 [pdf]

doi 10.1103/PhysRevX.11.011007

Evolution of Charge and Pair Density Modulations in Overdoped Bi2Sr2CuO6+δ

Authors: Xintong Li, Changwei Zou, Ying Ding, Hongtao Yan, Shusen Ye, Haiwei Li, Zhenqi Hao, Lin Zhao, Xingjiang Zhou, Yayu Wang

Abstract: One of the central issues concerning the mechanism of high temperature superconductivity in cuprates is the nature of the ubiquitous charge order and its implications for superconductivity. Here we use scanning tunneling microscopy to investigate the evolution of charge order from the optimally doped to strongly overdoped Bi2Sr2CuO6+δ cuprates. We find that with increasing hole concentration, the long-range checkerboard order gradually evolves into short-range glassy patterns consisting of diluted charge puddles. Each charge puddle has a unidirectional nematic internal structure, and exhibits clear pair density modulations as revealed by the spatial variations of superconducting coherence peak and gap depth. Both the charge puddles and the nematicity vanish completely in the strongly overdoped non-superconducting regime, when another type of short-range order with $\sqrt{2}\times\sqrt{2}$ periodicity emerges. These results shed important new light on the intricate interplay between the intertwined orders and the superconducting phase of cuprates.

Submitted 17 January, 2021; originally announced January 2021.

Comments: 23 pages, 7 figures

Journal ref: Physical Review X 11, 011007 (2021)

arXiv:2101.02217 [pdf, other]

doi 10.3847/2041-8213/abd79c

Black Hole Mergers from Star Clusters with Top-Heavy Initial Mass Functions

Authors: Newlin C. Weatherford, Giacomo Fragione, Kyle Kremer, Sourav Chatterjee, Claire S. Ye, Carl L. Rodriguez, Frederic A. Rasio

Abstract: Recent observations of globular clusters (GCs) provide evidence that the stellar initial mass function (IMF) may not be universal, suggesting specifically that the IMF grows increasingly top-heavy with decreasing metallicity and increasing gas density. Non-canonical IMFs can greatly affect the evolution of GCs, mainly because the high end determines how many black holes (BHs) form. Here we compute a new set of GC models, varying the IMF within observational uncertainties. We find that GCs with top-heavy IMFs lose most of their mass within a few Gyr through stellar winds and tidal stripping. Heating of the cluster through BH mass segregation greatly enhances this process. We show that, as they approach complete dissolution, GCs with top-heavy IMFs can evolve into 'dark clusters' consisting of mostly BHs by mass. In addition to producing more BHs, GCs with top-heavy IMFs also produce many more binary BH (BBH) mergers. Even though these clusters are short-lived, mergers of ejected BBHs continue at a rate comparable to, or greater than, what is found for long-lived GCs with canonical IMFs. Therefore these clusters, although they are no longer visible today, could still contribute significantly to the local BBH merger rate detectable by LIGO/Virgo, especially for sources with higher component masses well into the BH mass gap. We also report that one of our GC models with a top-heavy IMF produces dozens of intermediate-mass black holes (IMBHs) with masses $M>100\,{\rm M_\odot}$, including one with $M>500\,{\rm M_\odot}$. Ultimately, additional gravitational wave observations will provide strong constraints on the stellar IMF in old GCs and the formation of IMBHs at high redshift.

Submitted 6 January, 2021; originally announced January 2021.

Comments: 8 pages, 4 figures, 1 table; accepted for publication to ApJL

arXiv:2012.10739 [pdf, other]

Digital Reconstruction of Elmina Castle for Mobile Virtual Reality via Point-based Detail Transfer

Authors: Sifan Ye, Ting Wu, Michael Jarvis, Yuhao Zhu

Abstract: Reconstructing 3D models from large, dense point clouds is critical to enable Virtual Reality (VR) as a platform for entertainment, education, and heritage preservation. Existing 3D reconstruction systems inevitably make trade-offs between three conflicting goals: the efficiency of reconstruction (e.g., time and memory requirements), the visual quality of the constructed scene, and the rendering speed on the VR device. This paper proposes a reconstruction system that simultaneously meets all three goals. The key idea is to avoid the resource-demanding process of reconstructing a high-polygon mesh altogether. Instead, we propose to directly transfer details from the original point cloud to a low-polygon mesh, which significantly reduces the reconstruction time and cost, preserves the scene details, and enables real-time rendering on mobile VR devices. While our technique is general, we demonstrate it in reconstructing cultural heritage sites. For the first time, we digitally reconstruct Elmina Castle, a UNESCO World Heritage site in Ghana, from billions of laser-scanned points. The reconstruction process executes on low-end desktop systems without requiring high processing power, making it accessible to the broad community. The reconstructed scenes render on Oculus Go at 60 FPS, providing a real-time VR experience with high visual quality. Our project is part of the Digital Elmina effort (http://digitalelmina.org/) between University of Rochester and University of Ghana.

Submitted 20 December, 2021; v1 submitted 19 December, 2020; originally announced December 2020.

arXiv:2012.10497 [pdf, other]

doi 10.3847/2041-8213/abdf5b

Intermediate-mass Black Holes from High Massive-star Binary Fractions in Young Star Clusters

Authors: Elena González, Kyle Kremer, Sourav Chatterjee, Giacomo Fragione, Carl L. Rodriguez, Newlin C. Weatherford, Claire S. Ye, Frederic A. Rasio

Abstract: Black holes formed in dense star clusters, where dynamical interactions are frequent, may have fundamentally different properties than those formed through isolated stellar evolution. Theoretical models for single star evolution predict a gap in the black hole mass spectrum from roughly $40-120\,M_{\odot}$ caused by (pulsational) pair-instability supernovae. Motivated by the recent LIGO/Virgo event GW190521, we investigate whether black holes with masses within or in excess of this "upper-mass gap" can be formed dynamically in young star clusters through strong interactions of massive stars in binaries. We perform a set of $N$-body simulations using the CMC cluster-dynamics code to study the effects of the high-mass binary fraction on the formation and collision histories of the most massive stars and their remnants. We find that typical young star clusters with low metallicities and high binary fractions in massive stars can form several black holes in the upper-mass gap and often form at least one intermediate-mass black hole. These results provide strong evidence that dynamical interactions in young star clusters naturally lead to the formation of more massive black hole remnants.

Submitted 21 January, 2021; v1 submitted 18 December, 2020; originally announced December 2020.

Comments: 10 pages, 2 figures, 1 table. Accepted for publication in ApJ Letters. Comments welcome

arXiv:2012.04206 [pdf]

doi 10.1103/PhysRevLett.125.237005

Anomalous doping evolution of superconductivity and quasiparticle interference in Bi2Sr2Ca2Cu3O10+δ trilayer cuprates

Authors: Zhenqi Hao, Changwei Zou, Xiangyu Luo, Yu Ji, Miao Xu, Shusen Ye, Xingjiang Zhou, Chengtian Lin, Yayu Wang

Abstract: We use scanning tunneling microscopy to investigate Bi2Sr2Ca2Cu3O10+δ trilayer cuprates from the optimally doped to overdoped regime. We find that the two distinct superconducting gaps from the inner and outer CuO2 planes both decrease rapidly with doping, in sharp contrast to the nearly constant Tc. Spectroscopic imaging reveals the absence of quasiparticle interference in the antinodal region of overdoped samples, showing an opposite trend to that in single- and double-layer compounds. We propose that the existence of two types of inequivalent CuO2 planes and the intricate interaction between them are responsible for these highly anomalous observations in trilayer cuprates.

Submitted 7 December, 2020; originally announced December 2020.

Comments: 13 pages, 4 figures

Journal ref: Phys. Rev. Lett. 125, 237005 (2020)

arXiv:2012.02796 [pdf, other]

doi 10.3847/1538-4357/abeb14

Fast Optical Transients from Stellar-Mass Black Hole Tidal Disruption Events in Young Star Clusters

Authors: Kyle Kremer, Wenbin Lu, Anthony L. Piro, Sourav Chatterjee, Frederic A. Rasio, Claire S. Ye

Abstract: Observational evidence suggests that the majority of stars may have been born in stellar clusters or associations. Within these dense environments, dynamical interactions lead to high rates of close stellar encounters. A variety of recent observational and theoretical indications suggest stellar-mass black holes may be present and play an active dynamical role in stellar clusters of all masses. In this study, we explore the tidal disruption of main sequence stars by stellar-mass black holes in young star clusters. We compute a suite of over 3000 independent $N$-body simulations that cover a range in cluster mass, metallicity, and half-mass radii. We find stellar-mass black hole tidal disruption events (TDEs) occur at an overall rate of up to roughly $200\,\rm{Gpc}^{-3}\,\rm{yr}^{-1}$ in young stellar clusters in the local universe. These TDEs are expected to have several characteristic features, namely fast rise times of order a day, peak X-ray luminosities of at least $10^{44}\,\rm{erg\,s}^{-1}$, and bright optical luminosities (roughly $10^{41}-10^{44}\,\rm{erg\,s}^{-1}$) associated with reprocessing by a disk wind. In particular, we show these events share many features in common with the emerging class of Fast Blue Optical Transients.

Submitted 23 March, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

Comments: 17 pages, 5 figures, 3 tables. Accepted for publication in ApJ

arXiv:2012.01758 [pdf, other]

Non-parametric Quantile Regression via the K-NN Fused Lasso

Authors: Steven Siwei Ye, Oscar Hernan Madrid Padilla

Abstract: Quantile regression is a statistical method for estimating conditional quantiles of a response variable. In addition, for mean estimation, it is well known that quantile regression is more robust to outliers than $l_2$-based methods. By using the fused lasso penalty over a $K$-nearest neighbors graph, we propose an adaptive quantile estimator in a non-parametric setup. We show that the estimator attains the optimal rate of $n^{-1/d}$ up to a logarithmic factor, under mild assumptions on the data generation mechanism of the $d$-dimensional data. We develop algorithms to compute the estimator and discuss methodology for model selection. Numerical experiments on simulated and real data demonstrate clear advantages of the proposed estimator over state-of-the-art methods.

Submitted 17 August, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

Journal ref: Journal of Machine Learning Research, Vol. 22, No. 111, 1-38, 2021
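The robustness remark in the abstract above can be illustrated with the standard check (pinball) loss that underlies quantile regression. The sketch below is only that textbook loss applied to a one-dimensional sample; it is not the authors' K-NN fused lasso estimator, and the function names `pinball_loss` and `sample_quantile_by_loss` are hypothetical, chosen for this illustration.

```python
def pinball_loss(residual: float, tau: float) -> float:
    """Check loss rho_tau(r) = r * (tau - 1[r < 0]) for a quantile level tau in (0, 1)."""
    indicator = 1.0 if residual < 0 else 0.0
    return residual * (tau - indicator)


def sample_quantile_by_loss(values, tau):
    """Return the sample value that minimises the total pinball loss (brute force)."""
    return min(values, key=lambda q: sum(pinball_loss(v - q, tau) for v in values))


if __name__ == "__main__":
    data = [1.0, 2.0, 3.0, 4.0, 100.0]  # one large outlier
    median = sample_quantile_by_loss(data, tau=0.5)
    mean = sum(data) / len(data)  # minimiser of the squared (l2) loss
    print("median via pinball loss:", median)
    print("mean via squared loss:", mean)
```

With `tau = 0.5` the pinball loss is half the absolute error, so its minimiser is the sample median, which the single outlier barely moves, whereas the squared-loss minimiser (the mean) is pulled far towards it.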

arXiv:2010.02761 [pdf, other]

doi 10.1109/TMI.2021.3095310

Unified Supervised-Unsupervised (SUPER) Learning for X-ray CT Image Reconstruction

Authors: Siqi Ye, Zhipeng Li, Michael T. McCann, Yong Long, Saiprasad Ravishankar

Abstract: Traditional model-based image reconstruction (MBIR) methods combine forward and noise models with simple object priors. Recent machine learning methods for image reconstruction typically involve supervised learning or unsupervised learning, both of which have their advantages and disadvantages. In this work, we propose a unified supervised-unsupervised (SUPER) learning framework for X-ray computed tomography (CT) image reconstruction. The proposed learning formulation combines both unsupervised learning-based priors (or even simple analytical priors) together with (supervised) deep network-based priors in a unified MBIR framework based on a fixed point iteration analysis. The proposed training algorithm is also an approximate scheme for a bilevel supervised training optimization problem, wherein the network-based regularizer in the lower-level MBIR problem is optimized using an upper-level reconstruction loss. The training problem is optimized by alternating between updating the network weights and iteratively updating the reconstructions based on those weights. We demonstrate the learned SUPER models' efficacy for low-dose CT image reconstruction, for which we use the NIH AAPM Mayo Clinic Low Dose CT Grand Challenge dataset for training and testing. In our experiments, we studied different combinations of supervised deep network priors and unsupervised learning-based or analytical priors. Both numerical and visual results show the superiority of the proposed unified SUPER methods over standalone supervised learning-based methods, iterative MBIR methods, and variations of SUPER obtained via ablation studies. We also show that the proposed algorithm converges rapidly in practice.

Submitted 8 April, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

Comments: 18 pages, 21 figures, submitted journal paper

Journal ref: IEEE Transactions on Medical Imaging, vol. 40, no. 11, pp. 2986-3001, Nov. 2021

arXiv:2009.13079 [pdf, other]

The Geometric Unscented Kalman Filter

Authors: Chengling Fang, Jiang Liu, Songqing Ye, Ju Zhang

Abstract: Many filters have been proposed in recent decades for the nonlinear state estimation problem. The linearization-based extended Kalman filter (EKF) is widely applied to nonlinear industrial systems. As EKF is limited in accuracy and reliability, sequential Monte-Carlo methods or particle filters (PF) can obtain superior accuracy at the cost of a huge number of random samples. The unscented Kalman filter (UKF) can achieve adequate accuracy more efficiently by using deterministic samples, but its weights may be negative, which might cause instability problems. For Gaussian filters, the cubature Kalman filter (CKF) and Gauss-Hermite filter (GHF) employ cubature and Gauss-Hermite rules, respectively, to approximate statistical information of random variables and exhibit impressive performance in practical problems. Inspired by this work, this paper presents a new nonlinear estimation scheme named the geometric unscented Kalman filter (GUF). The GUF chooses the filtering framework of CKF for updating data and develops a geometric unscented sampling (GUS) strategy for approximating random variables. The main feature of GUS is selecting uniformly distributed samples according to the probability and geometric location, similar to UKF and CKF, and having positive weights like PF. In this way, GUF can maintain adequate accuracy like GHF with reasonable efficiency and good stability. The GUF does not suffer from the exponential increase of sample size as for PF, or from failure to converge resulting from non-positive weights as for high-order CKF and UKF.

Submitted 28 September, 2020; originally announced September 2020.

Comments: This article presents a new sampling method and puts forth a new Kalman filter. It contains 16 figures
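The negative-weight issue mentioned in the abstract above can be seen in the classical unscented transform, where the centre sigma point has weight κ/(n+κ) and the remaining 2n points each have weight 1/(2(n+κ)). The sketch below assumes that classical formulation and the common heuristic κ = 3 − n; it does not implement the geometric unscented sampling (GUS) strategy of the paper, and the function name `classical_ut_weights` is hypothetical.

```python
def classical_ut_weights(n: int, kappa: float) -> list:
    """Weights of the 2n + 1 sigma points in the classical unscented transform.

    The first entry is the centre-point weight kappa / (n + kappa); the other
    2n entries are each 1 / (2 * (n + kappa)).
    """
    denom = n + kappa
    if denom <= 0:
        raise ValueError("n + kappa must be positive")
    centre = kappa / denom
    outer = 1.0 / (2.0 * denom)
    return [centre] + [outer] * (2 * n)


if __name__ == "__main__":
    for n in (2, 3, 5, 10):
        kappa = 3.0 - n  # common heuristic; an assumption, not taken from the paper
        w = classical_ut_weights(n, kappa)
        print(f"n={n:2d} kappa={kappa:5.1f} centre weight={w[0]: .4f} sum={sum(w):.4f}")
```

For state dimension n above 3 this heuristic makes the centre weight negative even though the weights still sum to one. Setting κ = 0 removes the centre weight and leaves 2n equal weights of 1/(2n), which matches the third-degree cubature rule used by the CKF.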

arXiv:2009.08468 [pdf, other]

doi 10.3847/1538-4357/abba25

Black Hole Mergers from Hierarchical Triples in Dense Star Clusters

Authors: Miguel A. S. Martinez, Giacomo Fragione, Kyle Kremer, Sourav Chatterjee, Carl L. Rodriguez, Johan Samsing, Claire S. Ye, Newlin C. Weatherford, Michael Zevin, Smadar Naoz, Frederic A. Rasio

Abstract: Hierarchical triples are expected to be produced by the frequent binary-mediated interactions in the cores of globular clusters. In some of these triples, the tertiary companion can drive the inner binary to merger following large eccentricity oscillations, as a result of the eccentric Kozai-Lidov mechanism. In this paper, we study the dynamics and merger rates of black hole (BH) hierarchical triples, formed via binary-binary encounters in the CMC Cluster Catalog, a suite of cluster simulations with present-day properties representative of the Milky Way's globular clusters. We compare the properties of the mergers from triples to the other merger channels in dense star clusters, and show that triple systems do not produce significant differences in terms of mass and effective spin distribution. However, they represent an important pathway for forming eccentric mergers, which could be detected by LIGO-Virgo/KAGRA (LVK), and future missions such as LISA and DECIGO. We derive a conservative lower limit for the merger rate from this channel of $0.35$ Gpc$^{-3}$yr$^{-1}$ in the local Universe and up to $\sim9\%$ of these events may have a detectable eccentricity at LVK design sensitivity. Additionally, we find that triple systems could play an important role in retaining second-generation BHs, which can later merge again in the core of the host cluster.

Submitted 21 September, 2020; v1 submitted 17 September, 2020; originally announced September 2020.

Comments: 21 Pages, 11 Figures, 2 Tables, Accepted for publication in ApJ

arXiv:2008.07787 [pdf, other]

Tdcgan: Temporal Dilated Convolutional Generative Adversarial Network for End-to-end Speech Enhancement

Authors: Shuaishuai Ye, Xinhui Hu, Xinkang Xu

Abstract: In this paper, in order to further deal with the performance degradation caused by ignoring the phase information in conventional speech enhancement systems, we proposed a temporal dilated convolutional generative adversarial network (TDCGAN) in the end-to-end based speech enhancement architecture. For the first time, we introduced the temporal dilated convolutional network with depthwise separable convolutions into the GAN structure so that the receptive field can be greatly increased without increasing the number of parameters. We were also the first to explore the effect of a signal-to-noise ratio (SNR) penalty item, used as regularization of the generator's loss function, on improving the SNR of enhanced speech. The experimental results demonstrated that our proposed method outperformed the state-of-the-art end-to-end GAN-based speech enhancement. Moreover, compared with previous GAN-based methods, the proposed TDCGAN could greatly decrease the number of parameters. As expected, the work also demonstrated that the SNR penalty item as regularization was more effective than $L1$ in improving the SNR of enhanced speech.

Submitted 30 September, 2020; v1 submitted 18 August, 2020; originally announced August 2020.

arXiv:2007.11605 [pdf, other]

doi 10.3847/1538-4357/aba89b

Demographics of triple systems in dense star clusters

Authors: Giacomo Fragione, Miguel A. S. Martinez, Kyle Kremer, Sourav Chatterjee, Carl L. Rodriguez, Claire S. Ye, Newlin C. Weatherford, Smadar Naoz, Frederic A. Rasio

Abstract: Depending on the stellar type, more than $\sim 50\%$ and $\sim 15\%$ of stars in the field have at least one and two stellar companions, respectively. Hierarchical systems can be assembled dynamically in dense star clusters, as a result of few-body encounters among stars and/or compact remnants in the cluster core. In this paper, we present the demographics of stellar and compact-object triples formed via binary-binary encounters in the CMC Cluster Catalog, a suite of cluster simulations with present-day properties representative of the globular clusters (GCs) observed in the Milky Way. We show how the initial properties of the host cluster set the typical orbital parameters and formation times of the formed triples. We find that a cluster typically assembles hundreds of triples with at least one black hole (BH) in the inner binary, while only clusters with sufficiently small virial radii are efficient in producing triples with no BHs, as a result of the BH-burning process. We show that a typical GC is expected to host tens of triples with at least one luminous component at present day. We discuss how the Lidov-Kozai mechanism can drive the inner binary of the formed triples to high eccentricities, whenever it takes place before the triple is dynamically reprocessed by encountering another cluster member. Some of these systems can reach sufficiently large eccentricities to form a variety of transients and sources, such as blue stragglers, X-ray binaries, Type Ia Supernovae, Thorne-Zytkow objects, and LIGO/Virgo sources.

Submitted 22 July, 2020; originally announced July 2020.

Comments: 28 pages, 12 figures, 2 tables, accepted by ApJ

arXiv:2006.16791 [pdf, other]

Local Causal Structure Learning and its Discovery Between Type 2 Diabetes and Bone Mineral Density

Authors: Wei Wang, Gangqiang Hu, Bo Yuan, Shandong Ye, Chao Chen, YaYun Cui, Xi Zhang, Liting Qian

Abstract: Type 2 diabetes (T2DM), one of the most prevalent chronic diseases, affects the glucose metabolism of the human body, which decreases the quality of life and places a heavy burden on social medical care. Patients with T2DM are more likely to suffer bone fragility fractures, as diabetes affects bone mineral density (BMD). However, discovering the determinant factors of BMD by medical means is expensive and time-consuming. In this paper, we propose a novel algorithm, Prior-Knowledge-driven local Causal structure Learning (PKCL), to discover the underlying causal mechanism between BMD and its factors from clinical data. Since data are limited but prior knowledge is plentiful in medicine, PKCL makes full use of the prior knowledge to mine the local causal structure for the target relationship. Combining the medical prior knowledge with the discovered causal relationships, PKCL can achieve more reliable results without long-standing medical statistical experiments. Extensive experiments are conducted on a newly provided clinical data set. The experimental results of PKCL on these data are shown to correspond closely with existing medical knowledge, which demonstrates the superiority and effectiveness of PKCL. To illustrate the importance of prior knowledge, the result of the algorithm without prior knowledge is also investigated.

Submitted 27 June, 2020; originally announced June 2020.

arXiv:2006.10771 [pdf, other]

doi 10.3847/1538-4357/abb945

Populating the upper black hole mass gap through stellar collisions in young star clusters

Authors: Kyle Kremer, Mario Spera, Devin Becker, Sourav Chatterjee, Ugo N. Di Carlo, Giacomo Fragione, Carl L. Rodriguez, Claire S. Ye, Frederic A. Rasio

Abstract: Theoretical modeling of massive stars predicts a gap in the black hole (BH) mass function above $\sim 40-50\,M_{\odot}$ for BHs formed through single star evolution, arising from (pulsational) pair-instability supernovae. However, in dense star clusters, dynamical channels may exist that allow construction of BHs with masses in excess of those allowed from single star evolution. The detection of BHs in this so-called "upper-mass gap" would provide strong evidence for the dynamical processing of BHs prior to their eventual merger. Here, we explore in detail the formation of BHs with masses within or above the pair-instability gap through collisions of young massive stars in dense star clusters. We run a suite of 68 independent cluster simulations, exploring a variety of physical assumptions pertaining to growth through stellar collisions, including primordial cluster mass segregation and the efficiency of envelope stripping during collisions. We find that as many as $\sim20\%$ of all BH progenitors undergo one or more collisions prior to stellar collapse and up to $\sim1\%$ of all BHs reside within or above the pair-instability gap through the effects of these collisions. We show that these BHs readily go on to merge with other BHs in the cluster, creating a population of massive BH mergers at a rate that may compete with the "multiple-generation" merger channel described in other analyses. This has clear relevance for the formation of very massive BH binaries as recently detected by LIGO/Virgo in GW190521. Finally, we describe how stellar collisions in clusters may provide a unique pathway to pair-instability supernovae and briefly discuss the expected rate of these events and other electromagnetic transients.

Submitted 15 September, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

Comments: 25 pages, 6 figures, accepted for publication in ApJ. Comments welcome

Showing 101–150 of 256 results for author: Ye, S