Search | arXiv e-print repository

Burn After Reading: Online Adaptation for Cross-domain Streaming Data

Authors: Luyu Yang, Mingfei Gao, Zeyuan Chen, Ran Xu, Abhinav Shrivastava, Chetan Ramaiah

Abstract: In the context of online privacy, many methods propose complex privacy and security preserving measures to protect sensitive data. In this paper, we argue that: not storing any sensitive data is the best form of security. Thus we propose an online framework that "burns after reading", i.e. each online sample is immediately deleted after it is processed. Meanwhile, we tackle the inevitable distribu… ▽ More In the context of online privacy, many methods propose complex privacy and security preserving measures to protect sensitive data. In this paper, we argue that: not storing any sensitive data is the best form of security. Thus we propose an online framework that "burns after reading", i.e. each online sample is immediately deleted after it is processed. Meanwhile, we tackle the inevitable distribution shift between the labeled public data and unlabeled private data as a problem of unsupervised domain adaptation. Specifically, we propose a novel algorithm that aims at the most fundamental challenge of the online adaptation setting--the lack of diverse source-target data pairs. Therefore, we design a Cross-Domain Bootstrap** approach, called CroDoBo, to increase the combined diversity across domains. Further, to fully exploit the valuable discrepancies among the diverse combinations, we employ the training strategy of multiple learners with co-supervision. CroDoBo achieves state-of-the-art online performance on four domain adaptation benchmarks. △ Less

Submitted 8 December, 2021; originally announced December 2021.

arXiv:2112.03815 [pdf]

Accurate parameter estimation using scan-specific unsupervised deep learning for relaxometry and MR fingerprinting

Authors: Mengze Gao, Huihui Ye, Tae Hyung Kim, Zi**g Zhang, Seohee So, Berkin Bilgic

Abstract: We propose an unsupervised convolutional neural network (CNN) for relaxation parameter estimation. This network incorporates signal relaxation and Bloch simulations while taking advantage of residual learning and spatial relations across neighboring voxels. Quantification accuracy and robustness to noise is shown to be significantly improved compared to standard parameter estimation methods in num… ▽ More We propose an unsupervised convolutional neural network (CNN) for relaxation parameter estimation. This network incorporates signal relaxation and Bloch simulations while taking advantage of residual learning and spatial relations across neighboring voxels. Quantification accuracy and robustness to noise is shown to be significantly improved compared to standard parameter estimation methods in numerical simulations and in vivo data for multi-echo T2 and T2* map**. The combination of the proposed network with subspace modeling and MR fingerprinting (MRF) from highly undersampled data permits high quality T1 and T2 map**. △ Less

Submitted 12 December, 2021; v1 submitted 7 December, 2021; originally announced December 2021.

Comments: 7 pages, 5 figures, submitted to International Society for Magnetic Resonance in Medicine 2022

arXiv:2111.15636 [pdf]

Generating gapless land surface temperature with a high spatio-temporal resolution by fusing multi-source satellite-observed and model-simulated data

Authors: Jun Ma, Huanfeng Shen, Penghai Wu, **gan Wu, Meiling Gao, Chunlei Meng

Abstract: Land surface temperature (LST) is a key parameter when monitoring land surface processes. However, cloud contamination and the tradeoff between the spatial and temporal resolutions greatly impede the access to high-quality thermal infrared (TIR) remote sensing data. Despite the massive efforts made to solve these dilemmas, it is still difficult to generate LST estimates with concurrent spatial com… ▽ More Land surface temperature (LST) is a key parameter when monitoring land surface processes. However, cloud contamination and the tradeoff between the spatial and temporal resolutions greatly impede the access to high-quality thermal infrared (TIR) remote sensing data. Despite the massive efforts made to solve these dilemmas, it is still difficult to generate LST estimates with concurrent spatial completeness and a high spatio-temporal resolution. Land surface models (LSMs) can be used to simulate gapless LST with a high temporal resolution, but this usually comes with a low spatial resolution. In this paper, we present an integrated temperature fusion framework for satellite-observed and LSM-simulated LST data to map gapless LST at a 60-m spatial resolution and half-hourly temporal resolution. The global linear model (GloLM) model and the diurnal land surface temperature cycle (DTC) model are respectively performed as preprocessing steps for sensor and temporal normalization between the different LST data. The Landsat LST, Moderate Resolution Imaging Spectroradiometer (MODIS) LST, and Community Land Model Version 5.0 (CLM 5.0)-simulated LST are then fused using a filter-based spatio-temporal integrated fusion model. Evaluations were implemented in an urban-dominated region (the city of Wuhan in China) and a natural-dominated region (the Heihe River Basin in China), in terms of accuracy, spatial variability, and diurnal temporal dynamics. Results indicate that the fused LST is highly consistent with actual Landsat LST data (in situ LST measurements), in terms of a Pearson correlation coefficient of 0.94 (0.97-0.99), a mean absolute error of 0.71-0.98 K (0.82-3.17 K), and a root-mean-square error of 0.97-1.26 K (1.09-3.97 K). △ Less

Submitted 28 November, 2021; originally announced November 2021.

arXiv:2111.12386 [pdf, other]

One to Transfer All: A Universal Transfer Framework for Vision Foundation Model with Few Data

Authors: Yujie Wang, Junqin Huang, Mengya Gao, Yichao Wu, Zhenfei Yin, Ding Liang, Junjie Yan

Abstract: The foundation model is not the last chapter of the model production pipeline. Transferring with few data in a general way to thousands of downstream tasks is becoming a trend of the foundation model's application. In this paper, we proposed a universal transfer framework: One to Transfer All (OTA) to transfer any Vision Foundation Model (VFM) to any downstream tasks with few downstream data. We f… ▽ More The foundation model is not the last chapter of the model production pipeline. Transferring with few data in a general way to thousands of downstream tasks is becoming a trend of the foundation model's application. In this paper, we proposed a universal transfer framework: One to Transfer All (OTA) to transfer any Vision Foundation Model (VFM) to any downstream tasks with few downstream data. We first transfer a VFM to a task-specific model by Image Re-representation Fine-tuning (IRF) then distilling knowledge from a task-specific model to a deployed model with data produced by Downstream Image-Guided Generation (DIGG). OTA has no dependency on upstream data, VFM, and downstream tasks when transferring. It also provides a way for VFM researchers to release their upstream information for better transferring but not leaking data due to privacy requirements. Massive experiments validate the effectiveness and superiority of our methods in few data setting. Our code will be released. △ Less

Submitted 24 November, 2021; originally announced November 2021.

Comments: Technical Report

arXiv:2111.09452 [pdf, other]

Open Vocabulary Object Detection with Pseudo Bounding-Box Labels

Authors: Mingfei Gao, Chen Xing, Juan Carlos Niebles, Junnan Li, Ran Xu, Wenhao Liu, Caiming Xiong

Abstract: Despite great progress in object detection, most existing methods work only on a limited set of object categories, due to the tremendous human effort needed for bounding-box annotations of training data. To alleviate the problem, recent open vocabulary and zero-shot detection methods attempt to detect novel object categories beyond those seen during training. They achieve this goal by training on… ▽ More Despite great progress in object detection, most existing methods work only on a limited set of object categories, due to the tremendous human effort needed for bounding-box annotations of training data. To alleviate the problem, recent open vocabulary and zero-shot detection methods attempt to detect novel object categories beyond those seen during training. They achieve this goal by training on a pre-defined base categories to induce generalization to novel objects. However, their potential is still constrained by the small set of base categories available for training. To enlarge the set of base classes, we propose a method to automatically generate pseudo bounding-box annotations of diverse objects from large-scale image-caption pairs. Our method leverages the localization ability of pre-trained vision-language models to generate pseudo bounding-box labels and then directly uses them for training object detectors. Experimental results show that our method outperforms the state-of-the-art open vocabulary detector by 8% AP on COCO novel categories, by 6.3% AP on PASCAL VOC, by 2.3% AP on Objects365 and by 2.8% AP on LVIS. Code is available at https://github.com/salesforce/PB-OVD. △ Less

Submitted 13 July, 2022; v1 submitted 17 November, 2021; originally announced November 2021.

Comments: ECCV 2022

arXiv:2111.08687 [pdf, other]

INTERN: A New Learning Paradigm Towards General Vision

Authors: **g Shao, Siyu Chen, Yangguang Li, Kun Wang, Zhenfei Yin, Yinan He, Jianing Teng, Qinghong Sun, Mengya Gao, Jihao Liu, Gengshi Huang, Guanglu Song, Yichao Wu, Yuming Huang, Fenggang Liu, Huan Peng, Shuo Qin, Chengyu Wang, Yujie Wang, Conghui He, Ding Liang, Yu Liu, Fengwei Yu, Junjie Yan, Dahua Lin , et al. (2 additional authors not shown)

Abstract: Enormous waves of technological innovations over the past several years, marked by the advances in AI technologies, are profoundly resha** the industry and the society. However, down the road, a key challenge awaits us, that is, our capability of meeting rapidly-growing scenario-specific demands is severely limited by the cost of acquiring a commensurate amount of training data. This difficult s… ▽ More Enormous waves of technological innovations over the past several years, marked by the advances in AI technologies, are profoundly resha** the industry and the society. However, down the road, a key challenge awaits us, that is, our capability of meeting rapidly-growing scenario-specific demands is severely limited by the cost of acquiring a commensurate amount of training data. This difficult situation is in essence due to limitations of the mainstream learning paradigm: we need to train a new model for each new scenario, based on a large quantity of well-annotated data and commonly from scratch. In tackling this fundamental problem, we move beyond and develop a new learning paradigm named INTERN. By learning with supervisory signals from multiple sources in multiple stages, the model being trained will develop strong generalizability. We evaluate our model on 26 well-known datasets that cover four categories of tasks in computer vision. In most cases, our models, adapted with only 10% of the training data in the target domain, outperform the counterparts trained with the full set of data, often by a significant margin. This is an important step towards a promising prospect where such a model with general vision capability can dramatically reduce our reliance on data, thus expediting the adoption of AI technologies. Furthermore, revolving around our new paradigm, we also introduce a new data system, a new architecture, and a new benchmark, which, together, form a general vision ecosystem to support its future development in an open and inclusive manner. See project website at https://opengvlab.shlab.org.cn . △ Less

Submitted 24 February, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

arXiv:2111.00699 [pdf, other]

Principles towards Real-Time Simulation of Material Point Method on Modern GPUs

Authors: Yun Fei, Yuhan Huang, Ming Gao

Abstract: Physics-based simulation has been actively employed in generating offline visual effects in the film and animation industry. However, the computations required for high-quality scenarios are generally immense, deterring its adoption in real-time applications, e.g., virtual production, avatar live-streaming, and cloud gaming. We summarize the principles that can accelerate the computation pipeline… ▽ More Physics-based simulation has been actively employed in generating offline visual effects in the film and animation industry. However, the computations required for high-quality scenarios are generally immense, deterring its adoption in real-time applications, e.g., virtual production, avatar live-streaming, and cloud gaming. We summarize the principles that can accelerate the computation pipeline on single-GPU and multi-GPU platforms through extensive investigation and comprehension of modern GPU architecture. We further demonstrate the effectiveness of these principles by applying them to the material point method to build up our framework, which achieves $1.7\times$--$8.6\times$ speedup on a single GPU and $2.5\times$--$14.8\times$ on four GPUs compared to the state-of-the-art. Our pipeline is specifically designed for real-time applications (i.e., scenarios with small to medium particles) and achieves significant multi-GPU efficiency. We demonstrate our pipeline by simulating a snow scenario with 1.33M particles and a fountain scenario with 143K particles in real-time (on average, 68.5 and 55.9 frame-per-second, respectively) on four NVIDIA Tesla V100 GPUs interconnected with NVLinks. △ Less

Submitted 1 November, 2021; originally announced November 2021.

ACM Class: I.3.1; I.3.7

arXiv:2111.00105 [pdf, other]

doi 10.1038/s41467-022-30966-5

Probing material absorption and optical nonlinearity of integrated photonic materials

Authors: Maodong Gao, Qi-Fan Yang, Qing-Xin Ji, Heming Wang, Lue Wu, Boqiang Shen, Junqiu Liu, Guanhao Huang, Lin Chang, Weiqiang Xie, Su-Peng Yu, Scott B. Papp, John E. Bowers, Tobias J. Kippenberg, Kerry J. Vahala

Abstract: Optical microresonators with high quality ($Q$) factors are essential to a wide range of integrated photonic devices. Steady efforts have been directed towards increasing microresonator $Q$ factors across a variety of platforms. With success in reducing microfabrication process-related optical loss as a limitation of $Q$, the ultimate attainable $Q$, as determined solely by the constituent microre… ▽ More Optical microresonators with high quality ($Q$) factors are essential to a wide range of integrated photonic devices. Steady efforts have been directed towards increasing microresonator $Q$ factors across a variety of platforms. With success in reducing microfabrication process-related optical loss as a limitation of $Q$, the ultimate attainable $Q$, as determined solely by the constituent microresonator material absorption, has come into focus. Here, we report measurements of the material-limited $Q$ factors in several photonic material platforms. High-$Q$ microresonators are fabricated from thin films of SiO$_2$, Si$_3$N$_4$, Al$_{0.2}$Ga$_{0.8}$As and Ta$_2$O$_5$. By using cavity-enhanced photothermal spectroscopy, the material-limited $Q$ is determined. The method simultaneously measures the Kerr nonlinearity in each material and reveals how material nonlinearity and ultimate $Q$ vary in a complementary fashion across photonic materials. Besides guiding microresonator design and material development in four material platforms, the results help establish performance limits in future photonic integrated systems. △ Less

Submitted 29 October, 2021; originally announced November 2021.

Comments: Maodong Gao, Qi-Fan Yang and Qing-Xin Ji contributed equally to this work. 9 pages, 4 figures, 1 table

Journal ref: Nature Communications 13, 3323 (2022)

arXiv:2110.08477 [pdf, other]

FedMM: Saddle Point Optimization for Federated Adversarial Domain Adaptation

Authors: Yan Shen, Jian Du, Han Zhao, Benyu Zhang, Zhanghexuan Ji, Mingchen Gao

Abstract: Federated adversary domain adaptation is a unique distributed minimax training task due to the prevalence of label imbalance among clients, with each client only seeing a subset of the classes of labels required to train a global model. To tackle this problem, we propose a distributed minimax optimizer referred to as FedMM, designed specifically for the federated adversary domain adaptation proble… ▽ More Federated adversary domain adaptation is a unique distributed minimax training task due to the prevalence of label imbalance among clients, with each client only seeing a subset of the classes of labels required to train a global model. To tackle this problem, we propose a distributed minimax optimizer referred to as FedMM, designed specifically for the federated adversary domain adaptation problem. It works well even in the extreme case where each client has different label classes and some clients only have unsupervised tasks. We prove that FedMM ensures convergence to a stationary point with domain-shifted unsupervised data. On a variety of benchmark datasets, extensive experiments show that FedMM consistently achieves either significant communication savings or significant accuracy improvements over federated optimizers based on the gradient descent ascent (GDA) algorithm. When training from scratch, for example, it outperforms other GDA based federated average methods by around $20\%$ in accuracy over the same communication rounds; and it consistently outperforms when training from pre-trained models with an accuracy improvement from $5.4\%$ to $9\%$ for different networks. △ Less

Submitted 15 November, 2021; v1 submitted 16 October, 2021; originally announced October 2021.

Comments: 34 pages

arXiv:2110.06082 [pdf, other]

Efficient Bayesian network structure learning via local Markov boundary search

Authors: Ming Gao, Bryon Aragam

Abstract: We analyze the complexity of learning directed acyclic graphical models from observational data in general settings without specific distributional assumptions. Our approach is information-theoretic and uses a local Markov boundary search procedure in order to recursively construct ancestral sets in the underlying graphical model. Perhaps surprisingly, we show that for certain graph ensembles, a s… ▽ More We analyze the complexity of learning directed acyclic graphical models from observational data in general settings without specific distributional assumptions. Our approach is information-theoretic and uses a local Markov boundary search procedure in order to recursively construct ancestral sets in the underlying graphical model. Perhaps surprisingly, we show that for certain graph ensembles, a simple forward greedy search algorithm (i.e. without a backward pruning phase) suffices to learn the Markov boundary of each node. This substantially improves the sample complexity, which we show is at most polynomial in the number of nodes. This is then applied to learn the entire graph under a novel identifiability condition that generalizes existing conditions from the literature. As a matter of independent interest, we establish finite-sample guarantees for the problem of recovering Markov boundaries from data. Moreover, we apply our results to the special case of polytrees, for which the assumptions simplify, and provide explicit conditions under which polytrees are identifiable and learnable in polynomial time. We further illustrate the performance of the algorithm, which is easy to implement, in a simulation study. Our approach is general, works for discrete or continuous distributions without distributional assumptions, and as such sheds light on the minimal assumptions required to efficiently learn the structure of directed graphical models from data. △ Less

Submitted 21 November, 2021; v1 submitted 12 October, 2021; originally announced October 2021.

Comments: 31 pages, 3 figures, to appear in NeurIPS 2021

arXiv:2110.04719 [pdf, other]

Structure learning in polynomial time: Greedy algorithms, Bregman information, and exponential families

Authors: Goutham Rajendran, Bohdan Kivva, Ming Gao, Bryon Aragam

Abstract: Greedy algorithms have long been a workhorse for learning graphical models, and more broadly for learning statistical models with sparse structure. In the context of learning directed acyclic graphs, greedy algorithms are popular despite their worst-case exponential runtime. In practice, however, they are very efficient. We provide new insight into this phenomenon by studying a general greedy scor… ▽ More Greedy algorithms have long been a workhorse for learning graphical models, and more broadly for learning statistical models with sparse structure. In the context of learning directed acyclic graphs, greedy algorithms are popular despite their worst-case exponential runtime. In practice, however, they are very efficient. We provide new insight into this phenomenon by studying a general greedy score-based algorithm for learning DAGs. Unlike edge-greedy algorithms such as the popular GES and hill-climbing algorithms, our approach is vertex-greedy and requires at most a polynomial number of score evaluations. We then show how recent polynomial-time algorithms for learning DAG models are a special case of this algorithm, thereby illustrating how these order-based algorithms can be rigourously interpreted as score-based algorithms. This observation suggests new score functions and optimality conditions based on the duality between Bregman divergences and exponential families, which we explore in detail. Explicit sample and computational complexity bounds are derived. Finally, we provide extensive experiments suggesting that this algorithm indeed optimizes the score in a variety of settings. △ Less

Submitted 28 October, 2021; v1 submitted 10 October, 2021; originally announced October 2021.

Comments: Accepted to NeurIPS 2021; 27 pages, 9 figures

arXiv:2110.04413 [pdf, other]

Robustness Evaluation of Transformer-based Form Field Extractors via Form Attacks

Authors: Le Xue, Mingfei Gao, Zeyuan Chen, Caiming Xiong, Ran Xu

Abstract: We propose a novel framework to evaluate the robustness of transformer-based form field extraction methods via form attacks. We introduce 14 novel form transformations to evaluate the vulnerability of the state-of-the-art field extractors against form attacks from both OCR level and form level, including OCR location/order rearrangement, form background manipulation and form field-value augmentati… ▽ More We propose a novel framework to evaluate the robustness of transformer-based form field extraction methods via form attacks. We introduce 14 novel form transformations to evaluate the vulnerability of the state-of-the-art field extractors against form attacks from both OCR level and form level, including OCR location/order rearrangement, form background manipulation and form field-value augmentation. We conduct robustness evaluation using real invoices and receipts, and perform comprehensive research analysis. Experimental results suggest that the evaluated models are very susceptible to form perturbations such as the variation of field-values (~15% drop in F1 score), the disarrangement of input text order(~15% drop in F1 score) and the disruption of the neighboring words of field-values(~10% drop in F1 score). Guided by the analysis, we make recommendations to improve the design of field extractors and the process of data collection. △ Less

Submitted 8 October, 2021; originally announced October 2021.

arXiv:2110.04282 [pdf, other]

Field Extraction from Forms with Unlabeled Data

Authors: Mingfei Gao, Zeyuan Chen, Nikhil Naik, Kazuma Hashimoto, Caiming Xiong, Ran Xu

Abstract: We propose a novel framework to conduct field extraction from forms with unlabeled data. To bootstrap the training process, we develop a rule-based method for mining noisy pseudo-labels from unlabeled forms. Using the supervisory signal from the pseudo-labels, we extract a discriminative token representation from a transformer-based model by modeling the interaction between text in the form. To pr… ▽ More We propose a novel framework to conduct field extraction from forms with unlabeled data. To bootstrap the training process, we develop a rule-based method for mining noisy pseudo-labels from unlabeled forms. Using the supervisory signal from the pseudo-labels, we extract a discriminative token representation from a transformer-based model by modeling the interaction between text in the form. To prevent the model from overfitting to label noise, we introduce a refinement module based on a progressive pseudo-label ensemble. Experimental results demonstrate the effectiveness of our framework. △ Less

Submitted 11 April, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

Comments: Spa-NLP@ACL2022

arXiv:2110.00123 [pdf, other]

doi 10.3847/2041-8213/ac2de6

Observations of Forbush Decreases of cosmic ray electrons and positrons with the Dark Matter Particle Explorer

Authors: Francesca Alemanno, Qi An, Philipp Azzarello, Felicia Carla Tiziana Barbato, Paolo Bernardini, XiaoJun Bi, MingSheng Cai, Elisabetta Casilli, Enrico Catanzani, ** Chang, DengYi Chen, JunLing Chen, ZhanFang Chen, MingYang Cui, TianShu Cui, YuXing Cui, HaoTing Dai, Antonio De Benedittis, Ivan De Mitri, Francesco de Palma, Maksym Deliyergiyev, Margherita Di Santo, Qi Ding, TieKuang Dong, ZhenXing Dong , et al. (124 additional authors not shown)

Abstract: The Forbush Decrease (FD) represents the rapid decrease of the intensities of charged particles accompanied with the coronal mass ejections (CMEs) or high-speed streams from coronal holes. It has been mainly explored with ground-based neutron monitors network which indirectly measure the integrated intensities of all species of cosmic rays by counting secondary neutrons produced from interaction b… ▽ More The Forbush Decrease (FD) represents the rapid decrease of the intensities of charged particles accompanied with the coronal mass ejections (CMEs) or high-speed streams from coronal holes. It has been mainly explored with ground-based neutron monitors network which indirectly measure the integrated intensities of all species of cosmic rays by counting secondary neutrons produced from interaction between atmosphere atoms and cosmic rays. The space-based experiments can resolve the species of particles but the energy ranges are limited by the relative small acceptances except for the most abundant particles like protons and helium. Therefore, the FD of cosmic ray electrons and positrons have just been investigated by the PAMELA experiment in the low energy range ($<5$ GeV) with limited statistics. In this paper, we study the FD event occurred in September, 2017, with the electron and positron data recorded by the Dark Matter Particle Explorer. The evolution of the FDs from 2 GeV to 20 GeV with a time resolution of 6 hours are given. We observe two solar energetic particle events in the time profile of the intensity of cosmic rays, the earlier and weak one has not been shown in the neutron monitor data. Furthermore, both the amplitude and recovery time of fluxes of electrons and positrons show clear energy-dependence, which is important in probing the disturbances of the interplanetary environment by the coronal mass ejections. △ Less

Submitted 30 September, 2021; originally announced October 2021.

Comments: This article is dedicated to the 72nd anniversary of People's Republic of China

arXiv:2109.14259 [pdf, other]

Hierarchical Character Tagger for Short Text Spelling Error Correction

Authors: Mengyi Gao, Canran Xu, Peng Shi

Abstract: State-of-the-art approaches to spelling error correction problem include Transformer-based Seq2Seq models, which require large training sets and suffer from slow inference time; and sequence labeling models based on Transformer encoders like BERT, which involve token-level label space and therefore a large pre-defined vocabulary dictionary. In this paper we present a Hierarchical Character Tagger… ▽ More State-of-the-art approaches to spelling error correction problem include Transformer-based Seq2Seq models, which require large training sets and suffer from slow inference time; and sequence labeling models based on Transformer encoders like BERT, which involve token-level label space and therefore a large pre-defined vocabulary dictionary. In this paper we present a Hierarchical Character Tagger model, or HCTagger, for short text spelling error correction. We use a pre-trained language model at the character level as a text encoder, and then predict character-level edits to transform the original text into its error-free form with a much smaller label space. For decoding, we propose a hierarchical multi-task approach to alleviate the issue of long-tail label distribution without introducing extra model parameters. Experiments on two public misspelling correction datasets demonstrate that HCTagger is an accurate and much faster approach than many existing models. △ Less

Submitted 29 September, 2021; originally announced September 2021.

Comments: To appear in WNUT 2021 workshop, 8 pages, 2 figures

arXiv:2109.14120 [pdf, other]

Meta Learning on a Sequence of Imbalanced Domains with Difficulty Awareness

Authors: Zhenyi Wang, Tiehang Duan, Le Fang, Qiuling Suo, Mingchen Gao

Abstract: Recognizing new objects by learning from a few labeled examples in an evolving environment is crucial to obtain excellent generalization ability for real-world machine learning systems. A typical setting across current meta learning algorithms assumes a stationary task distribution during meta training. In this paper, we explore a more practical and challenging setting where task distribution chan… ▽ More Recognizing new objects by learning from a few labeled examples in an evolving environment is crucial to obtain excellent generalization ability for real-world machine learning systems. A typical setting across current meta learning algorithms assumes a stationary task distribution during meta training. In this paper, we explore a more practical and challenging setting where task distribution changes over time with domain shift. Particularly, we consider realistic scenarios where task distribution is highly imbalanced with domain labels unavailable in nature. We propose a kernel-based method for domain change detection and a difficulty-aware memory management mechanism that jointly considers the imbalanced domain size and domain importance to learn across domains continuously. Furthermore, we introduce an efficient adaptive task sampling method during meta training, which significantly reduces task gradient variance with theoretical guarantees. Finally, we propose a challenging benchmark with imbalanced domain sequences and varied domain difficulty. We have performed extensive evaluations on the proposed benchmark, demonstrating the effectiveness of our method. We made our code publicly available. △ Less

Submitted 28 September, 2021; originally announced September 2021.

Comments: ICCV 2021

arXiv:2109.07041 [pdf, ps, other]

Coalition Game based User Association for mmWave Mobile Relay Systems in Rail Traffic Scenarios

Authors: Chen Chen, Yong Niu, Shiwen Mao, Xiaodan Zhang, Zhu Han, Bo Ai, Meilin Gao, Huahua Xiao, Ning Wang

Abstract: Rail transportation, especially, high-speed rails (HSR), is an important infrastructure for the development of national economy and the promotion of passenger experience. Due to the large bandwidth, millimeter wave (mmWave) communication is regarded as a promising technology to meet the demand of high data rates. However, since mmWave communication has the characteristic of high attenuation, mobil… ▽ More Rail transportation, especially, high-speed rails (HSR), is an important infrastructure for the development of national economy and the promotion of passenger experience. Due to the large bandwidth, millimeter wave (mmWave) communication is regarded as a promising technology to meet the demand of high data rates. However, since mmWave communication has the characteristic of high attenuation, mobile relay (MR) is considered in this paper. Also, full-duplex (FD) communications have been proposed to improve the spectral efficiency. However, because of the high speed, as well as the problem of penetration loss, passengers on the train have a poor quality of service. Consequently, an effective user association scheme for HSR in mmWave band is necessary. In this paper, we investigate the user association optimization problem in mmWave mobilerelay systems where the MRs operate in the FD mode. To maximize the system capacity, we propose a cooperative user association approach based on coalition formation game, and develop a coalition formation algorithm to solve the challenging NP-hard problem. We also prove the convergence and Nashstable property of the proposed algorithm. Extensive simulations are done to show the system performance of the proposed scheme under various network settings. It is demonstrated that the proposed distributed low complexity scheme achieves a nearoptimal performance and outperforms two baseline schemes in terms of average system throughput. △ Less

Submitted 14 September, 2021; originally announced September 2021.

Comments: 11 pages, 11 figures

Journal ref: IEEE Transactions on Vehicular Technology, 2021

arXiv:2109.05598 [pdf, other]

Neural network based order parameter for phase transitions and its applications in high-entropy alloys

Authors: Junqi Yin, Zongrui Pei, Michael Gao

Abstract: Phase transition is one of the most important phenomena in nature and plays a central role in materials design. All phase transitions are characterized by suitable order parameters, including the order-disorder phase transition. However, finding a representative order parameter for complex systems is nontrivial, such as for high-entropy alloys. Given variational autoencoder's (VAE) strength of red… ▽ More Phase transition is one of the most important phenomena in nature and plays a central role in materials design. All phase transitions are characterized by suitable order parameters, including the order-disorder phase transition. However, finding a representative order parameter for complex systems is nontrivial, such as for high-entropy alloys. Given variational autoencoder's (VAE) strength of reducing high dimensional data into few principal components, here we coin a new concept of "VAE order parameter". We propose that the Manhattan distance in the VAE latent space can serve as a generic order parameter for order-disorder phase transitions. The physical properties of the order parameter are quantitatively interpreted and demonstrated by multiple refractory high-entropy alloys. Assisted by it, a generally applicable alloy design concept is proposed by mimicking the nature mixing of elements. Our physically interpretable "VAE order parameter" lays the foundation for the understanding of and alloy design by chemical ordering. △ Less

Submitted 12 September, 2021; originally announced September 2021.

arXiv:2109.04993 [pdf, other]

LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation

Authors: Mohammad Abuzar Shaikh, Zhanghexuan Ji, Dana Moukheiber, Yan Shen, Sargur Srihari, Mingchen Gao

Abstract: Pre-training visual and textual representations from large-scale image-text pairs is becoming a standard approach for many downstream vision-language tasks. The transformer-based models learn inter and intra-modal attention through a list of self-supervised learning tasks. This paper proposes LAViTeR, a novel architecture for visual and textual representation learning. The main module, Visual Text… ▽ More Pre-training visual and textual representations from large-scale image-text pairs is becoming a standard approach for many downstream vision-language tasks. The transformer-based models learn inter and intra-modal attention through a list of self-supervised learning tasks. This paper proposes LAViTeR, a novel architecture for visual and textual representation learning. The main module, Visual Textual Alignment (VTA) will be assisted by two auxiliary tasks, GAN-based image synthesis and Image Captioning. We also propose a new evaluation metric measuring the similarity between the learnt visual and textual embedding. The experimental results on two public datasets, CUB and MS-COCO, demonstrate superior visual and textual representation alignment in the joint feature embedding space △ Less

Submitted 19 October, 2021; v1 submitted 4 September, 2021; originally announced September 2021.

Comments: 14 pages, 10 Figures, 5 Tables

arXiv:2109.04200 [pdf, other]

doi 10.1145/3459637.3482426

Double-Scale Self-Supervised Hypergraph Learning for Group Recommendation

Authors: Junwei Zhang, Min Gao, Junliang Yu, Lei Guo, Jundong Li, Hongzhi Yin

Abstract: With the prevalence of social media, there has recently been a proliferation of recommenders that shift their focus from individual modeling to group recommendation. Since the group preference is a mixture of various predilections from group members, the fundamental challenge of group recommendation is to model the correlations among members. Existing methods mostly adopt heuristic or attention-ba… ▽ More With the prevalence of social media, there has recently been a proliferation of recommenders that shift their focus from individual modeling to group recommendation. Since the group preference is a mixture of various predilections from group members, the fundamental challenge of group recommendation is to model the correlations among members. Existing methods mostly adopt heuristic or attention-based preference aggregation strategies to synthesize group preferences. However, these models mainly focus on the pairwise connections of users and ignore the complex high-order interactions within and beyond groups. Besides, group recommendation suffers seriously from the problem of data sparsity due to severely sparse group-item interactions. In this paper, we propose a self-supervised hypergraph learning framework for group recommendation to achieve two goals: (1) capturing the intra- and inter-group interactions among users; (2) alleviating the data sparsity issue with the raw data itself. Technically, for (1), a hierarchical hypergraph convolutional network based on the user- and group-level hypergraphs is developed to model the complex tuplewise correlations among users within and beyond groups. For (2), we design a double-scale node dropout strategy to create self-supervision signals that can regularize user representations with different granularities against the sparsity issue. The experimental analysis on multiple benchmark datasets demonstrates the superiority of the proposed model and also elucidates the rationality of the hypergraph modeling and the double-scale self-supervision. △ Less

Submitted 20 March, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

Comments: 11 pages, 6 figures, CIKM 2021

arXiv:2109.01949 [pdf, other]

Improving Joint Learning of Chest X-Ray and Radiology Report by Word Region Alignment

Authors: Zhanghexuan Ji, Mohammad Abuzar Shaikh, Dana Moukheiber, Sargur Srihari, Yifan Peng, Mingchen Gao

Abstract: Self-supervised learning provides an opportunity to explore unlabeled chest X-rays and their associated free-text reports accumulated in clinical routine without manual supervision. This paper proposes a Joint Image Text Representation Learning Network (JoImTeRNet) for pre-training on chest X-ray images and their radiology reports. The model was pre-trained on both the global image-sentence level… ▽ More Self-supervised learning provides an opportunity to explore unlabeled chest X-rays and their associated free-text reports accumulated in clinical routine without manual supervision. This paper proposes a Joint Image Text Representation Learning Network (JoImTeRNet) for pre-training on chest X-ray images and their radiology reports. The model was pre-trained on both the global image-sentence level and the local image region-word level for visual-textual matching. Both are bidirectionally constrained on Cross-Entropy based and ranking-based Triplet Matching Losses. The region-word matching is calculated using the attention mechanism without direct supervision about their map**. The pre-trained multi-modal representation learning paves the way for downstream tasks concerning image and/or text encoding. We demonstrate the representation learning quality by cross-modality retrievals and multi-label classifications on two datasets: OpenI-IU and MIMIC-CXR △ Less

Submitted 4 September, 2021; originally announced September 2021.

Comments: 10 Pages, 1 Figure, 3 Tables, Accepted in 12th Machine Learning in Medical Imaging (MLMI 2021) workshop

arXiv:2109.00622 [pdf, other]

An End-to-End learnable Flow Regularized Model for Brain Tumor Segmentation

Authors: Yan Shen, Zhanghexuan Ji, Mingchen Gao

Abstract: Many segmentation tasks for biomedical images can be modeled as the minimization of an energy function and solved by a class of max-flow and min-cut optimization algorithms. However, the segmentation accuracy is sensitive to the contrasting of semantic features of different segmenting objects, as the traditional energy function usually uses hand-crafted features in their energy functions. To addre… ▽ More Many segmentation tasks for biomedical images can be modeled as the minimization of an energy function and solved by a class of max-flow and min-cut optimization algorithms. However, the segmentation accuracy is sensitive to the contrasting of semantic features of different segmenting objects, as the traditional energy function usually uses hand-crafted features in their energy functions. To address these limitations, we propose to incorporate end-to-end trainable neural network features into the energy functions. Our deep neural network features are extracted from the down-sampling and up-sampling layers with skip-connections of a U-net. In the inference stage, the learned features are fed into the energy functions. And the segmentations are solved in a primal-dual form by ADMM solvers. In the training stage, we train our neural networks by optimizing the energy function in the primal form with regularizations on the min-cut and flow-conservation functions, which are derived from the optimal conditions in the dual form. We evaluate our methods, both qualitatively and quantitatively, in a brain tumor segmentation task. As the energy minimization model achieves a balance on sensitivity and smooth boundaries, we would show how our segmentation contours evolve actively through iterations as ensemble references for doctor diagnosis. △ Less

Submitted 1 September, 2021; originally announced September 2021.

Comments: Accepted by 2020 11TH International Conference on Machine Learning in Medical Imaging (MLMI 2020)

arXiv:2109.00235 [pdf]

Quality assurance test and Failure Analysis of SiPM Arrays of GECAM Satellites

Authors: D. L. Zhang, M. Gao, X. L. Sun, X. Q. Li, Z. H. An, X. Y. Wen, C. Cai, Z. Chang, G. Chen, C. Chen, Y. Y. Du, R. Gao, K. Gong, D. Y. Guo, J. J. He, D. J. Hou, Y. G. Li, C. Y. Li, G. Li, L. Li, X. F. Li, M. S. Li, X. H. Liang, X. J. Liu, Y. Q. Liu , et al. (23 additional authors not shown)

Abstract: The Gravitational wave high-energy Electromagnetic Counterpart All-sky Monitor (GECAM) satellite consists of two small satellites. Each GECAM payload contains 25 gamma ray detectors (GRD) and 8 charged particle detectors (CPD). GRD is the main detector which can detect gamma-rays and particles and localize the Gamma-Ray Bursts (GRB),while CPD is used to help GRD to discriminate gamma-ray bursts an… ▽ More The Gravitational wave high-energy Electromagnetic Counterpart All-sky Monitor (GECAM) satellite consists of two small satellites. Each GECAM payload contains 25 gamma ray detectors (GRD) and 8 charged particle detectors (CPD). GRD is the main detector which can detect gamma-rays and particles and localize the Gamma-Ray Bursts (GRB),while CPD is used to help GRD to discriminate gamma-ray bursts and charged particle bursts. The GRD makes use of lanthanum bromide (LaBr3) crystal readout by SiPM. As the all available SiPM devices belong to commercial grade, quality assurance tests need to be performed in accordance with the aerospace specifications. In this paper, we present the results of quality assurance tests, especially a detailed mechanism analysis of failed devices during the development of GECAM. This paper also summarizes the application experience of commercial-grade SiPM devices in aerospace payloads, and provides suggestions for forthcoming SiPM space applications. △ Less

Submitted 9 December, 2021; v1 submitted 1 September, 2021; originally announced September 2021.

Comments: 13 pages, 23 figures

Journal ref: RDTM-D-21-00057R4.2021.9.1

arXiv:2108.12028 [pdf]

doi 10.1016/j.jallcom.2020.157266

Effects of minor alloying on the mechanical properties of Al based metallic glasses

Authors: Vrishank Jambur, Chaiyapat Tangpatjaroen, Jianqi Xi, Jirameth Tarnsangpradit, Meng Gao, Howard Sheng, John Perepezko, Izabela Szlufarska

Abstract: Minor alloying is widely used to control mechanical properties of metallic glasses (MGs). The present understanding of how a small amount of alloying element changes strength is that the additions lead to more efficient packing of atoms and increased local topological order, which then increases the barrier for shear transformations and the resistance to plastic deformation. Here, we discover that… ▽ More Minor alloying is widely used to control mechanical properties of metallic glasses (MGs). The present understanding of how a small amount of alloying element changes strength is that the additions lead to more efficient packing of atoms and increased local topological order, which then increases the barrier for shear transformations and the resistance to plastic deformation. Here, we discover that minor alloying can improve the strength of MGs by increasing the chemical bond strength alone and show that this strengthening is distinct from changes in topological order. The results were obtained using Al-Sm based MGs minor alloyed with transition metals (TMs). The addition of TMs led to an increase in the hardness of the MGs which, however, could not be explained based on changes in the topological ordering in the structure. Instead we found that it was the strong bonding between TM and Al atoms which led to a higher resistance to shear transformation that resulted in higher strength and hardness, while the topology around the TM atoms had no influence on their mechanical response. This finding demonstrates that the effects of topology and chemistry on mechanical properties of MGs are independent of each other and that they should be understood as separate, sometimes competing mechanisms of strengthening. This understanding lays a foundation for design of MGs with improved mechanical properties. △ Less

Submitted 26 August, 2021; originally announced August 2021.

Journal ref: Journal of Alloys and Compounds, vol. 854, p. 157266, Feb. 2021

arXiv:2108.04682 [pdf, other]

ChemiRise: a data-driven retrosynthesis engine

Authors: Xiangyan Sun, Ke Liu, Yuquan Lin, Lingjie Wu, Haoming Xing, Minghong Gao, Ji Liu, Suocheng Tan, Zekun Ni, Qi Han, Junqiu Wu, Jie Fan

Abstract: We have developed an end-to-end, retrosynthesis system, named ChemiRise, that can propose complete retrosynthesis routes for organic compounds rapidly and reliably. The system was trained on a processed patent database of over 3 million organic reactions. Experimental reactions were atom-mapped, clustered, and extracted into reaction templates. We then trained a graph convolutional neural network-… ▽ More We have developed an end-to-end, retrosynthesis system, named ChemiRise, that can propose complete retrosynthesis routes for organic compounds rapidly and reliably. The system was trained on a processed patent database of over 3 million organic reactions. Experimental reactions were atom-mapped, clustered, and extracted into reaction templates. We then trained a graph convolutional neural network-based one-step reaction proposer using template embeddings and developed a guiding algorithm on the directed acyclic graph (DAG) of chemical compounds to find the best candidate to explore. The atom-map** algorithm and the one-step reaction proposer were benchmarked against previous studies and showed better results. The final product was demonstrated by retrosynthesis routes reviewed and rated by human experts, showing satisfying functionality and a potential productivity boost in real-life use cases. △ Less

Submitted 9 August, 2021; originally announced August 2021.

arXiv:2108.03026 [pdf]

The Influence of Age and Gender Information on the Diagnosis of Diabetic Retinopathy: Based on Neural Networks

Authors: Long Bai, Sihang Chen, Mingyang Gao, Leila Abdelrahman, Manal Al Ghamdi, Mohamed Abdel-Mottaleb

Abstract: This paper proposes the importance of age and gender information in the diagnosis of diabetic retinopathy. We utilized Deep Residual Neural Networks (ResNet) and Densely Connected Convolutional Networks (DenseNet), which are proven effective on image classification problems and the diagnosis of diabetic retinopathy using the retinal fundus images. We used the ensemble of several classical networks… ▽ More This paper proposes the importance of age and gender information in the diagnosis of diabetic retinopathy. We utilized Deep Residual Neural Networks (ResNet) and Densely Connected Convolutional Networks (DenseNet), which are proven effective on image classification problems and the diagnosis of diabetic retinopathy using the retinal fundus images. We used the ensemble of several classical networks and decentralized the training so that the network was simple and avoided overfitting. To observe whether the age and gender information could help enhance the performance, we added the information before the dense layer and compared the results with the results that did not add age and gender information. We found that the test accuracy of the network with age and gender information was 2.67% higher than that of the network without age and gender information. Meanwhile, compared with gender information, age information had a better help for the results. △ Less

Submitted 6 August, 2021; originally announced August 2021.

Comments: 4 pages, 4 figures, Accepted in 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, IEEE EMBC 2021

arXiv:2108.02057 [pdf, other]

doi 10.1038/s41467-022-31529-4

Manipulating polariton condensates by Rashba-Dresselhaus coupling at room temperature

Authors: Yao Li, Xuekai Ma, Xiaokun Zhai, Meini Gao, Haitao Dai, Stefan Schumacher, Tingge Gao

Abstract: The spin-orbit coupling plays an important role in the spin Hall effect and the topological insulators. In addition, the spin-orbit coupled Bose-Einstein condensates show remarkable quantum many-body phase transition. In this work we tune the exciton polariton condensate by virtue of the Rashba-Dresselhaus (RD) spin-orbit coupling in a liquid-crystal filled microcavity where perovskite CsPbBr3 mic… ▽ More The spin-orbit coupling plays an important role in the spin Hall effect and the topological insulators. In addition, the spin-orbit coupled Bose-Einstein condensates show remarkable quantum many-body phase transition. In this work we tune the exciton polariton condensate by virtue of the Rashba-Dresselhaus (RD) spin-orbit coupling in a liquid-crystal filled microcavity where perovskite CsPbBr3 microplates act as the gain material at room temperature. We realize an artificial gauge field on the CsPbBr3 exciton polariton condensate, which splits the condensates with opposite spins in both momentum and real spaces. Our work paves the way to manipulate the exciton polariton condensate with a synthetic gauge field based on the RD spin-orbit coupling at room temperature. △ Less

Submitted 29 September, 2023; v1 submitted 4 August, 2021; originally announced August 2021.

Journal ref: Nature Communications 13,3785(2022)

arXiv:2107.12262 [pdf, other]

Meta-Learning Adversarial Domain Adaptation Network for Few-Shot Text Classification

Authors: ChengCheng Han, Zeqiu Fan, Dongxiang Zhang, Minghui Qiu, Ming Gao, Aoying Zhou

Abstract: Meta-learning has emerged as a trending technique to tackle few-shot text classification and achieved state-of-the-art performance. However, existing solutions heavily rely on the exploitation of lexical features and their distributional signatures on training data, while neglecting to strengthen the model's ability to adapt to new tasks. In this paper, we propose a novel meta-learning framework i… ▽ More Meta-learning has emerged as a trending technique to tackle few-shot text classification and achieved state-of-the-art performance. However, existing solutions heavily rely on the exploitation of lexical features and their distributional signatures on training data, while neglecting to strengthen the model's ability to adapt to new tasks. In this paper, we propose a novel meta-learning framework integrated with an adversarial domain adaptation network, aiming to improve the adaptive ability of the model and generate high-quality text embedding for new classes. Extensive experiments are conducted on four benchmark datasets and our method demonstrates clear superiority over the state-of-the-art models in all the datasets. In particular, the accuracy of 1-shot and 5-shot classification on the dataset of 20 Newsgroups is boosted from 52.1% to 59.6%, and from 68.3% to 77.8%, respectively. △ Less

Submitted 26 July, 2021; originally announced July 2021.

arXiv:2107.10457 [pdf, other]

Ready for Emerging Threats to Recommender Systems? A Graph Convolution-based Generative Shilling Attack

Authors: Fan Wu, Min Gao, Junliang Yu, Zongwei Wang, Kecheng Liu, Xu Wange

Abstract: To explore the robustness of recommender systems, researchers have proposed various shilling attack models and analyzed their adverse effects. Primitive attacks are highly feasible but less effective due to simplistic handcrafted rules, while upgraded attacks are more powerful but costly and difficult to deploy because they require more knowledge from recommendations. In this paper, we explore a n… ▽ More To explore the robustness of recommender systems, researchers have proposed various shilling attack models and analyzed their adverse effects. Primitive attacks are highly feasible but less effective due to simplistic handcrafted rules, while upgraded attacks are more powerful but costly and difficult to deploy because they require more knowledge from recommendations. In this paper, we explore a novel shilling attack called Graph cOnvolution-based generative shilling ATtack (GOAT) to balance the attacks' feasibility and effectiveness. GOAT adopts the primitive attacks' paradigm that assigns items for fake users by sampling and the upgraded attacks' paradigm that generates fake ratings by a deep learning-based model. It deploys a generative adversarial network (GAN) that learns the real rating distribution to generate fake ratings. Additionally, the generator combines a tailored graph convolution structure that leverages the correlations between co-rated items to smoothen the fake ratings and enhance their authenticity. The extensive experiments on two public datasets evaluate GOAT's performance from multiple perspectives. Our study of the GOAT demonstrates technical feasibility for building a more powerful and intelligent attack model with a much-reduced cost, enables analysis the threat of such an attack and guides for investigating necessary prevention measures. △ Less

Submitted 22 July, 2021; originally announced July 2021.

Comments: 16 pages, 21 figures, Information Sciences - Journal - Elsevier

arXiv:2107.08260 [pdf]

doi 10.1142/S0129183121501655

Coverage-dependent magnetic and electronic properties of graphene with Co adatoms

Authors: Min Gao, Jun Hu

Abstract: Decorating two-dimensional materials with transition-metal adatoms is an effective way to bring about new physical properties that are intriguing for applications in electronics and spintronics devices. Here, we systematically studied the coverage-dependent magnetic and electronic properties of graphene decorated by Co adatoms, based on first-principles calculations. We found that the if the Co co… ▽ More Decorating two-dimensional materials with transition-metal adatoms is an effective way to bring about new physical properties that are intriguing for applications in electronics and spintronics devices. Here, we systematically studied the coverage-dependent magnetic and electronic properties of graphene decorated by Co adatoms, based on first-principles calculations. We found that the if the Co coverage is larger than 1/3 ML, the Co atoms will aggregate to form a Co monolayer and then a van der Waals bilayer system between the Co monolayer and graphene forms. When the Co coverage is <= 1/3 ML, the Co adatom is spin polarized with spin moment varying from 1.1 ~ 1.4 μB. The d(xz/yz) and d(xy/x2-y2 ) orbitals of Co hybridize significantly with the π bands of graphene, which generates a series of new bands in the energy range from -2 eV to 1 eV with respect to Dirac point of graphene. In most cases, the new bands near the Fermi level lead to topological states characterized by the quantum anomalous Hall effect. △ Less

Submitted 17 July, 2021; originally announced July 2021.

arXiv:2107.06702 [pdf, other]

doi 10.1063/5.0093908

Tilting flat bands in an empty microcavity

Authors: Ying Gao, Yao Li, Xuekai Ma, Meini Gao, Haitao Dai, Stefan Schumacher, Tingge Gao

Abstract: Recently microcavities with anisotropic materials are shown to be able to create novel bands with non-zero local Berry curvature. The anisotropic refractive index of the cavity layer is believed to be critical in opening an energy gap at the tilted Dirac points. In this work, we show that an anticrossing between a cavity mode and a Bragg mode can also form within an empty microcavity without any b… ▽ More Recently microcavities with anisotropic materials are shown to be able to create novel bands with non-zero local Berry curvature. The anisotropic refractive index of the cavity layer is believed to be critical in opening an energy gap at the tilted Dirac points. In this work, we show that an anticrossing between a cavity mode and a Bragg mode can also form within an empty microcavity without any birefringent materials. Flat bands are observed within the energy gap due to the particular refractive index distribution of the sample. The intrinsic TE-TM splitting and XY splitting induce the squeezing of the cavity modes in momentum space, so that the flat bands are spin-dependently tilted. Our results pave the way to investigate the spin orbit coupling of photons in a simple microcavity without anisotropic cavity layers. △ Less

Submitted 14 July, 2021; originally announced July 2021.

arXiv:2106.14662 [pdf, other]

Improving Uncertainty Calibration of Deep Neural Networks via Truth Discovery and Geometric Optimization

Authors: Chunwei Ma, Ziyun Huang, Jiayi Xian, Mingchen Gao, **hui Xu

Abstract: Deep Neural Networks (DNNs), despite their tremendous success in recent years, could still cast doubts on their predictions due to the intrinsic uncertainty associated with their learning process. Ensemble techniques and post-hoc calibrations are two types of approaches that have individually shown promise in improving the uncertainty calibration of DNNs. However, the synergistic effect of the two… ▽ More Deep Neural Networks (DNNs), despite their tremendous success in recent years, could still cast doubts on their predictions due to the intrinsic uncertainty associated with their learning process. Ensemble techniques and post-hoc calibrations are two types of approaches that have individually shown promise in improving the uncertainty calibration of DNNs. However, the synergistic effect of the two types of methods has not been well explored. In this paper, we propose a truth discovery framework to integrate ensemble-based and post-hoc calibration methods. Using the geometric variance of the ensemble candidates as a good indicator for sample uncertainty, we design an accuracy-preserving truth estimator with provably no accuracy drop. Furthermore, we show that post-hoc calibration can also be enhanced by truth discovery-regularized optimization. On large-scale datasets including CIFAR and ImageNet, our method shows consistent improvement against state-of-the-art calibration approaches on both histogram-based and kernel density-based evaluation metrics. Our codes are available at https://github.com/horsepurve/truly-uncertain. △ Less

Submitted 1 March, 2022; v1 submitted 25 June, 2021; originally announced June 2021.

Comments: Accepted for publication at 37th Conference on Uncertainty in Artificial Intelligence (UAI 2021); https://proceedings.mlr.press/v161/ma21a.html

arXiv:2106.12371 [pdf, ps, other]

doi 10.1103/PhysRevB.102.195140

First-principles study of the robust superconducting state of NbTi alloys under ultrahigh pressures

Authors: Jian-Feng Zhang, Miao Gao, Kai Liu, Zhong-Yi Lu

Abstract: A recent experiment reported that robust superconductivity appears in NbTi alloys under ultrahigh pressures with an almost constant superconducting $T_c$ of ~19 K from 120 to 261.7 GPa [J. Guo et al., Adv. Mater. 31, 1807240 (2019)], which is very rare among the known superconductors. We investigate the origin of this novel superconducting behavior in NbTi alloys based on density functional theory… ▽ More A recent experiment reported that robust superconductivity appears in NbTi alloys under ultrahigh pressures with an almost constant superconducting $T_c$ of ~19 K from 120 to 261.7 GPa [J. Guo et al., Adv. Mater. 31, 1807240 (2019)], which is very rare among the known superconductors. We investigate the origin of this novel superconducting behavior in NbTi alloys based on density functional theory and density functional perturbation theory calculations. Our results indicate that the pressure tends to transform NbTi alloys from a random phase to a uniformly ordered crystal phase, and the exotic robust superconductivity of NbTi alloys can still be understood in the framework of BCS theory. The Nb element in NbTi alloys plays a dominant role in the superconductivity at low pressure, while the NbTi crystal with an alternative and uniform Nb and Ti atomic arrangement may be responsible for the stable superconductivity under high pressures. The robust superconducting transition temperature of NbTi under ultrahigh pressure can be explained by a synergistic effect of the enhanced phonon frequency, the modestly reduced total electron-phonon coupling, and the pressure-dependent screened Coulomb repulsion. △ Less

Submitted 23 June, 2021; originally announced June 2021.

Comments: 7 pages, 5 figures, 1 table

Journal ref: Phys. Rev. B 102, 195140 (2020)

arXiv:2106.12365 [pdf, ps, other]

doi 10.1103/PhysRevB.101.155139

First-principles study on the electron-phonon coupling and magnetoresistance of LaBi under pressure

Authors: Jian-Feng Zhang, Peng-Jie Guo, Miao Gao, Kai Liu, Zhong-Yi Lu

Abstract: The extremely large magnetoresistance (XMR) material LaBi was reported to become superconducting under pressure accompanying with suppressed magnetoresistance. However, the underlying mechanism is unclear. By using first-principles electronic structure calculations in combination with a semiclassical model, we have studied the electron-phonon coupling and magnetoresistance of LaBi in the pressure… ▽ More The extremely large magnetoresistance (XMR) material LaBi was reported to become superconducting under pressure accompanying with suppressed magnetoresistance. However, the underlying mechanism is unclear. By using first-principles electronic structure calculations in combination with a semiclassical model, we have studied the electron-phonon coupling and magnetoresistance of LaBi in the pressure range from 0 to 18 GPa. Our calculations show that LaBi undergoes a structural phase transition from a face-centered cubic lattice to a primitive tetragonal lattice at $\sim$7 GPa, verifying previous experimental results. Meanwhile, LaBi remains topologically nontrivial across the structural transition. Under all pressures that we have studied, the phonon-mediated mechanism based on the weak electron-phonon coupling cannot account for the observed superconductivity in LaBi, and the calculated magnetoresistance for LaBi does not show a suppression. The distinct difference between our calculations and experimental observations suggests either the existence of extra Bi impurities in the real LaBi compound or the possibility of other unknown mechanism. △ Less

Submitted 23 June, 2021; originally announced June 2021.

Comments: 7 pages, 5 figures, 1 table

Journal ref: Phys. Rev. B 101, 155139 (2020)

arXiv:2106.12284 [pdf, ps, other]

doi 10.1002/mp.15799

Bayesian Statistics Guided Label Refurbishment Mechanism: Mitigating Label Noise in Medical Image Classification

Authors: Mengdi Gao, Ximeng Feng, Mufeng Geng, Zhe Jiang, Lei Zhu, Xiangxi Meng, Chuanqing Zhou, Qiushi Ren, Yanye Lu

Abstract: Purpose: Deep neural networks (DNNs) have been widely applied in medical image classification, benefiting from its powerful map** capability among medical images. However, these existing deep learning-based methods depend on an enormous amount of carefully labeled images. Meanwhile, noise is inevitably introduced in the labeling process, degrading the performance of models. Hence, it's significa… ▽ More Purpose: Deep neural networks (DNNs) have been widely applied in medical image classification, benefiting from its powerful map** capability among medical images. However, these existing deep learning-based methods depend on an enormous amount of carefully labeled images. Meanwhile, noise is inevitably introduced in the labeling process, degrading the performance of models. Hence, it's significant to devise robust training strategies to mitigate label noise in the medical image classification tasks. Methods: In this work, we propose a novel Bayesian statistics guided label refurbishment mechanism (BLRM) for DNNs to prevent overfitting noisy images. BLRM utilizes maximum a posteriori probability (MAP) in the Bayesian statistics and the exponentially time-weighted technique to selectively correct the labels of noisy images. The training images are purified gradually with the training epochs when BLRM is activated, further improving classification performance. Results: Comprehensive experiments on both synthetic noisy images (public OCT & Messidor datasets) and real-world noisy images (ANIMAL-10N) demonstrate that BLRM refurbishes the noisy labels selectively, curbing the adverse effects of noisy data. Also, the anti-noise BLRM integrated with DNNs are effective at different noise ratio and are independent of backbone DNN architectures. In addition, BLRM is superior to state-of-the-art comparative methods of anti-noise. Conclusions: These investigations indicate that the proposed BLRM is well capable of mitigating label noise in medical image classification tasks. △ Less

Submitted 11 June, 2022; v1 submitted 23 June, 2021; originally announced June 2021.

Comments: 10 pages, 11 figures

arXiv:2106.10658 [pdf, other]

Exploring Semantic Relationships for Unpaired Image Captioning

Authors: Fenglin Liu, Meng Gao, Tianhao Zhang, Yuexian Zou

Abstract: Recently, image captioning has aroused great interest in both academic and industrial worlds. Most existing systems are built upon large-scale datasets consisting of image-sentence pairs, which, however, are time-consuming to construct. In addition, even for the most advanced image captioning systems, it is still difficult to realize deep image understanding. In this work, we achieve unpaired imag… ▽ More Recently, image captioning has aroused great interest in both academic and industrial worlds. Most existing systems are built upon large-scale datasets consisting of image-sentence pairs, which, however, are time-consuming to construct. In addition, even for the most advanced image captioning systems, it is still difficult to realize deep image understanding. In this work, we achieve unpaired image captioning by bridging the vision and the language domains with high-level semantic information. The motivation stems from the fact that the semantic concepts with the same modality can be extracted from both images and descriptions. To further improve the quality of captions generated by the model, we propose the Semantic Relationship Explorer, which explores the relationships between semantic concepts for better understanding of the image. Extensive experiments on MSCOCO dataset show that we can generate desirable captions without paired datasets. Furthermore, the proposed approach boosts five strong baselines under the paired setting, where the most significant improvement in CIDEr score reaches 8%, demonstrating that it is effective and generalizes well to a wide range of models. △ Less

Submitted 17 August, 2021; v1 submitted 20 June, 2021; originally announced June 2021.

arXiv:2106.09785 [pdf, other]

Efficient Self-supervised Vision Transformers for Representation Learning

Authors: Chunyuan Li, Jianwei Yang, Pengchuan Zhang, Mei Gao, Bin Xiao, Xiyang Dai, Lu Yuan, Jianfeng Gao

Abstract: This paper investigates two techniques for develo** efficient self-supervised vision transformers (EsViT) for visual representation learning. First, we show through a comprehensive empirical study that multi-stage architectures with sparse self-attentions can significantly reduce modeling complexity but with a cost of losing the ability to capture fine-grained correspondences between image regio… ▽ More This paper investigates two techniques for develo** efficient self-supervised vision transformers (EsViT) for visual representation learning. First, we show through a comprehensive empirical study that multi-stage architectures with sparse self-attentions can significantly reduce modeling complexity but with a cost of losing the ability to capture fine-grained correspondences between image regions. Second, we propose a new pre-training task of region matching which allows the model to capture fine-grained region dependencies and as a result significantly improves the quality of the learned vision representations. Our results show that combining the two techniques, EsViT achieves 81.3% top-1 on the ImageNet linear probe evaluation, outperforming prior arts with around an order magnitude of higher throughput. When transferring to downstream linear classification tasks, EsViT outperforms its supervised counterpart on 17 out of 18 datasets. The code and models are publicly available: https://github.com/microsoft/esvit △ Less

Submitted 6 July, 2022; v1 submitted 17 June, 2021; originally announced June 2021.

Comments: ICLR 2022; Code: https://github.com/microsoft/esvit

arXiv:2106.07322 [pdf, ps, other]

doi 10.1103/PhysRevB.104.L100504

Phonon-mediated high-temperature superconductivity in ternary borohydride KB$_2$H$_8$ around 12 GPa

Authors: Miao Gao, Xun-Wang Yan, Zhong-Yi Lu, Tao Xiang

Abstract: Discovery of high-temperature superconductivity in hydrogen-rich compounds has fuelled the enthusiasm for finding materials with more promising superconducting properties among hydrides. However, the ultrahigh pressure needed to synthesize and maintain high-temperature hydrogen-rich superconductors hinders the experimental investigation of these materials. For practical applications, it is also hi… ▽ More Discovery of high-temperature superconductivity in hydrogen-rich compounds has fuelled the enthusiasm for finding materials with more promising superconducting properties among hydrides. However, the ultrahigh pressure needed to synthesize and maintain high-temperature hydrogen-rich superconductors hinders the experimental investigation of these materials. For practical applications, it is also highly desired to find more hydrogen-rich materials that superconduct at high temperatures but under relatively lower pressures. Based on first-principles density functional theory, we calculate the electronic and phonon band structures for a ternary borohydride formed by intercalating BH$_4$ tetrahedrons into a face-centered-cubic potassium lattice, KB$_2$H$_8$. Remarkably, we find that this material is dynamically stable and one of its $sp^3$-hybridized $σ$-bonding bands is metallized (i.e. partially filled) above a moderate high pressure. This metallized $σ$-bonding band couples strongly with phonons, giving rise to a strong superconducting pairing potential. By solving the anisotropic Eliashberg equations, we predict that the superconducting transition temperature of this compound is 134-146 K around 12 GPa. △ Less

Submitted 14 June, 2021; originally announced June 2021.

Comments: 5 pages, 4 figures

Journal ref: Phys. Rev. B 104, 100504 (2021)

arXiv:2106.07240 [pdf, ps, other]

Mitigating Biases in Toxic Language Detection through Invariant Rationalization

Authors: Yung-Sung Chuang, Mingye Gao, Hongyin Luo, James Glass, Hung-yi Lee, Yun-Nung Chen, Shang-Wen Li

Abstract: Automatic detection of toxic language plays an essential role in protecting social media users, especially minority groups, from verbal abuse. However, biases toward some attributes, including gender, race, and dialect, exist in most training datasets for toxicity detection. The biases make the learned models unfair and can even exacerbate the marginalization of people. Considering that current de… ▽ More Automatic detection of toxic language plays an essential role in protecting social media users, especially minority groups, from verbal abuse. However, biases toward some attributes, including gender, race, and dialect, exist in most training datasets for toxicity detection. The biases make the learned models unfair and can even exacerbate the marginalization of people. Considering that current debiasing methods for general natural language understanding tasks cannot effectively mitigate the biases in the toxicity detectors, we propose to use invariant rationalization (InvRat), a game-theoretic framework consisting of a rationale generator and a predictor, to rule out the spurious correlation of certain syntactic patterns (e.g., identity mentions, dialect) to toxicity labels. We empirically show that our method yields lower false positive rate in both lexical and dialectal attributes than previous debiasing methods. △ Less

Submitted 14 June, 2021; originally announced June 2021.

Comments: The 5th Workshop on Online Abuse and Harms at ACL 2021

arXiv:2106.03569 [pdf, other]

Socially-Aware Self-Supervised Tri-Training for Recommendation

Authors: Junliang Yu, Hongzhi Yin, Min Gao, Xin Xia, Xiangliang Zhang, Nguyen Quoc Viet Hung

Abstract: Self-supervised learning (SSL), which can automatically generate ground-truth samples from raw data, holds vast potential to improve recommender systems. Most existing SSL-based methods perturb the raw data graph with uniform node/edge dropout to generate new data views and then conduct the self-discrimination based contrastive learning over different views to learn generalizable representations.… ▽ More Self-supervised learning (SSL), which can automatically generate ground-truth samples from raw data, holds vast potential to improve recommender systems. Most existing SSL-based methods perturb the raw data graph with uniform node/edge dropout to generate new data views and then conduct the self-discrimination based contrastive learning over different views to learn generalizable representations. Under this scheme, only a bijective map** is built between nodes in two different views, which means that the self-supervision signals from other nodes are being neglected. Due to the widely observed homophily in recommender systems, we argue that the supervisory signals from other nodes are also highly likely to benefit the representation learning for recommendation. To capture these signals, a general socially-aware SSL framework that integrates tri-training is proposed in this paper. Technically, our framework first augments the user data views with the user social information. And then under the regime of tri-training for multi-view encoding, the framework builds three graph encoders (one for recommendation) upon the augmented views and iteratively improves each encoder with self-supervision signals from other users, generated by the other two encoders. Since the tri-training operates on the augmented views of the same data sources for self-supervision signals, we name it self-supervised tri-training. Extensive experiments on multiple real-world datasets consistently validate the effectiveness of the self-supervised tri-training framework for improving recommendation. The code is released at https://github.com/Coder-Yu/QRec. △ Less

Submitted 26 August, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

Comments: 9 pages, accepted by KDD'21

arXiv:2105.09073 [pdf, other]

doi 10.1103/PhysRevLett.126.201102

Measurement of the cosmic ray helium energy spectrum from 70 GeV to 80 TeV with the DAMPE space mission

Authors: F. Alemanno, Q. An, P. Azzarello, F. C. T. Barbato, P. Bernardini, X. J. Bi, M. S. Cai, E. Catanzani, J. Chang, D. Y. Chen, J. L. Chen, Z. F. Chen, M. Y. Cui, T. S. Cui, Y. X. Cui, H. T. Dai, A. D'Amone, A. De Benedittis, I. De Mitri, F. de Palma, M. Deliyergiyev, M. Di Santo, T. K. Dong, Z. X. Dong, G. Donvito , et al. (120 additional authors not shown)

Abstract: The measurement of the energy spectrum of cosmic ray helium nuclei from 70 GeV to 80 TeV using 4.5 years of data recorded by the DArk Matter Particle Explorer (DAMPE) is reported in this work. A hardening of the spectrum is observed at an energy of about 1.3 TeV, similar to previous observations. In addition, a spectral softening at about 34 TeV is revealed for the first time with large statistics… ▽ More The measurement of the energy spectrum of cosmic ray helium nuclei from 70 GeV to 80 TeV using 4.5 years of data recorded by the DArk Matter Particle Explorer (DAMPE) is reported in this work. A hardening of the spectrum is observed at an energy of about 1.3 TeV, similar to previous observations. In addition, a spectral softening at about 34 TeV is revealed for the first time with large statistics and well controlled systematic uncertainties, with an overall significance of $4.3σ$. The DAMPE spectral measurements of both cosmic protons and helium nuclei suggest a particle charge dependent softening energy, although with current uncertainties a dependence on the number of nucleons cannot be ruled out. △ Less

Submitted 21 May, 2021; v1 submitted 19 May, 2021; originally announced May 2021.

Comments: 11 pages, 13 figures, published in Phys. Rev. Lett. Add one more digit for first three columns in Table S2

Journal ref: Phys. Rev. Lett. 126, 201102 (2021)

arXiv:2105.08491 [pdf, other]

Ab-initio free energies of liquid metal alloys: application to the phase diagrams of Li-Na and K-Na

Authors: Yang Huang, Michael Widom, Michael C. Gao

Abstract: Comparison of free energies between different phases and different compositions underlies the prediction of alloy phase diagrams. To allow direct comparison, consistent reference points for the energies or enthalpies are required, and the entropy must be placed on an absolute scale, yielding absolute free energies. Here we derive absolute free energies of liquids from ab-initio molecular dynamics… ▽ More Comparison of free energies between different phases and different compositions underlies the prediction of alloy phase diagrams. To allow direct comparison, consistent reference points for the energies or enthalpies are required, and the entropy must be placed on an absolute scale, yielding absolute free energies. Here we derive absolute free energies of liquids from ab-initio molecular dynamics (AIMD) by combining the directly simulated enthalpies with an entropy derived from simulated densities and pair correlation functions. As an example of the power of this method we calculate the phase diagrams of two binary alkali metal alloys, Li-Na and K-Na, revealing a critical point and liquid-liquid phase separation in the former case, and a deep eutectic in the latter. Good agreement with experimental data demonstrates the power of this simple method. △ Less

Submitted 18 May, 2021; originally announced May 2021.

arXiv:2105.03358 [pdf, other]

Soft-Attention Improves Skin Cancer Classification Performance

Authors: Soumyya Kanti Datta, Mohammad Abuzar Shaikh, Sargur N. Srihari, Mingchen Gao

Abstract: In clinical applications, neural networks must focus on and highlight the most important parts of an input image. Soft-Attention mechanism enables a neural network toachieve this goal. This paper investigates the effectiveness of Soft-Attention in deep neural architectures. The central aim of Soft-Attention is to boost the value of important features and suppress the noise-inducing features. We co… ▽ More In clinical applications, neural networks must focus on and highlight the most important parts of an input image. Soft-Attention mechanism enables a neural network toachieve this goal. This paper investigates the effectiveness of Soft-Attention in deep neural architectures. The central aim of Soft-Attention is to boost the value of important features and suppress the noise-inducing features. We compare the performance of VGG, ResNet, InceptionResNetv2 and DenseNet architectures with and without the Soft-Attention mechanism, while classifying skin lesions. The original network when coupled with Soft-Attention outperforms the baseline[16] by 4.7% while achieving a precision of 93.7% on HAM10000 dataset [25]. Additionally, Soft-Attention coupling improves the sensitivity score by 3.8% compared to baseline[31] and achieves 91.6% on ISIC-2017 dataset [2]. The code is publicly available at github. △ Less

Submitted 4 June, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

Comments: 8 pages, 9 figures, 4 tables

arXiv:2105.01541 [pdf]

Apparel Recommender System based on Bilateral image shape features

Authors: Yichi Lu, Mingtian Gao, Ryosuke Saga

Abstract: Probabilistic matrix factorization (PMF) is a well-known model of recommender systems. With the development of image recognition technology, some PMF recommender systems that combine images have emerged. Some of these systems use the image shape features of the recommended products to achieve better results compared to those of the traditional PMF. However, in the existing methods, no PMF recommen… ▽ More Probabilistic matrix factorization (PMF) is a well-known model of recommender systems. With the development of image recognition technology, some PMF recommender systems that combine images have emerged. Some of these systems use the image shape features of the recommended products to achieve better results compared to those of the traditional PMF. However, in the existing methods, no PMF recommender system can combine the image features of products previously purchased by customers and of recommended products. Thus, this study proposes a novel probabilistic model that integrates double convolutional neural networks (CNNs) into PMF. For apparel goods, two trained CNNs from the image shape features of users and items are combined, and the latent variables of users and items are optimized based on the vectorized features of CNNs and ratings. Extensive experiments show that our model predicts outcome more accurately than do other recommender models. △ Less

Submitted 4 May, 2021; originally announced May 2021.

arXiv:2104.00086 [pdf, other]

An Online Survey on the Perception of Mediated Social Touch Interaction and Device Design

Authors: Carine Rognon, Taylor Bunge, Meiyuzi Gao, Chip Connor, Benjamin Stephens-Fripp, Casey Brown, Ali Israr

Abstract: Social touch is essential for our social interactions, communication, and well-being. It has been shown to reduce anxiety and loneliness; and is a key channel to transmit emotions for which words are not sufficient, such as love, sympathy, reassurance, etc. However, direct physical contact is not always possible due to being remotely located, interacting in a virtual environment, or as a result of… ▽ More Social touch is essential for our social interactions, communication, and well-being. It has been shown to reduce anxiety and loneliness; and is a key channel to transmit emotions for which words are not sufficient, such as love, sympathy, reassurance, etc. However, direct physical contact is not always possible due to being remotely located, interacting in a virtual environment, or as a result of a health issue. Mediated social touch enables physical interactions, despite the distance, by transmitting the haptic cues that constitute social touch through devices. As this technology is fairly new, the users' needs and their expectations on a device design and its features are unclear, as well as who would use this technology, and in which conditions. To better understand these aspects of the mediated interaction, we conducted an online survey on 258 respondents located in the USA. Results give insights on the type of interactions and device features that the US population would like to use. △ Less

Submitted 31 March, 2021; originally announced April 2021.

arXiv:2103.16844 [pdf, other]

Fixing the Teacher-Student Knowledge Discrepancy in Distillation

Authors: Jiangfan Han, Mengya Gao, Yujie Wang, Quanquan Li, Hongsheng Li, Xiaogang Wang

Abstract: Training a small student network with the guidance of a larger teacher network is an effective way to promote the performance of the student. Despite the different types, the guided knowledge used to distill is always kept unchanged for different teacher and student pairs in previous knowledge distillation methods. However, we find that teacher and student models with different networks or trained… ▽ More Training a small student network with the guidance of a larger teacher network is an effective way to promote the performance of the student. Despite the different types, the guided knowledge used to distill is always kept unchanged for different teacher and student pairs in previous knowledge distillation methods. However, we find that teacher and student models with different networks or trained from different initialization could have distinct feature representations among different channels. (e.g. the high activated channel for different categories). We name this incongruous representation of channels as teacher-student knowledge discrepancy in the distillation process. Ignoring the knowledge discrepancy problem of teacher and student models will make the learning of student from teacher more difficult. To solve this problem, in this paper, we propose a novel student-dependent distillation method, knowledge consistent distillation, which makes teacher's knowledge more consistent with the student and provides the best suitable knowledge to different student networks for distillation. Extensive experiments on different datasets (CIFAR100, ImageNet, COCO) and tasks (image classification, object detection) reveal the widely existing knowledge discrepancy problem between teachers and students and demonstrate the effectiveness of our proposed method. Our method is very flexible that can be easily combined with other state-of-the-art approaches. △ Less

Submitted 31 March, 2021; originally announced March 2021.

arXiv:2103.15846 [pdf, other]

doi 10.1103/PhysRevD.104.053005

Differential distributions for Single Top Quark Production at the LHeC

Authors: Meisen Gao, Jun Gao

Abstract: We present a phenomenological study of the single top (anti-)quark production with leptonic decays at the Large Hadron electron Collider (LHeC) at the next-to-leading-order (NLO) in QCD. We focus on various differential distributions in a fiducial region. The NLO corrections can reduce the fiducial cross section by 14%. We find the NLO predictions exhibit strong stability under scale variations fo… ▽ More We present a phenomenological study of the single top (anti-)quark production with leptonic decays at the Large Hadron electron Collider (LHeC) at the next-to-leading-order (NLO) in QCD. We focus on various differential distributions in a fiducial region. The NLO corrections can reduce the fiducial cross section by 14%. We find the NLO predictions exhibit strong stability under scale variations for most observables considered while the scale variations at the leading-order (LO) dominated in the theoretical uncertainties. We propose a method of determining the top-quark mass using the measurement of the average transverse momentum of the charged lepton. The scale variations at the NLO induce a theoretical uncertainty of about 1.3 GeV of the extracted top-quark mass. The statistical error of the extracted top-quark mass amounts to 1.1 GeV. We also investigate the impact of the QCD corrections and the scale variations in searches of the anomalous Wtb couplings. △ Less

Submitted 9 September, 2021; v1 submitted 29 March, 2021; originally announced March 2021.

Comments: 13 pages, 12 figures, 8 tables; Published version

Journal ref: Phys. Rev. D 104, 053005 (2021)

arXiv:2103.07449 [pdf, other]

Cooperative Self-training of Machine Reading Comprehension

Authors: Hongyin Luo, Shang-Wen Li, Mingye Gao, Seunghak Yu, James Glass

Abstract: Pretrained language models have significantly improved the performance of downstream language understanding tasks, including extractive question answering, by providing high-quality contextualized word embeddings. However, training question answering models still requires large amounts of annotated data for specific domains. In this work, we propose a cooperative self-training framework, RGX, for… ▽ More Pretrained language models have significantly improved the performance of downstream language understanding tasks, including extractive question answering, by providing high-quality contextualized word embeddings. However, training question answering models still requires large amounts of annotated data for specific domains. In this work, we propose a cooperative self-training framework, RGX, for automatically generating more non-trivial question-answer pairs to improve model performance. RGX is built upon a masked answer extraction task with an interactive learning environment containing an answer entity Recognizer, a question Generator, and an answer eXtractor. Given a passage with a masked entity, the generator generates a question around the entity, and the extractor is trained to extract the masked entity with the generated question and raw texts. The framework allows the training of question generation and answering models on any text corpora without annotation. Experiment results show that RGX outperforms the state-of-the-art (SOTA) pretrained language models and transfer learning approaches on standard question-answering benchmarks, and yields the new SOTA performance under given model size and transfer learning settings. △ Less

Submitted 27 June, 2022; v1 submitted 12 March, 2021; originally announced March 2021.

Comments: NAACL 2022

arXiv:2103.03500 [pdf, other]

doi 10.1145/3503222.3507733

ShEF: Shielded Enclaves for Cloud FPGAs

Authors: Mark Zhao, Mingyu Gao, Christos Kozyrakis

Abstract: FPGAs are now used in public clouds to accelerate a wide range of applications, including many that operate on sensitive data such as financial and medical records. We present ShEF, a trusted execution environment (TEE) for cloud-based reconfigurable accelerators. ShEF is independent from CPU-based TEEs and allows secure execution under a threat model where the adversary can control all software r… ▽ More FPGAs are now used in public clouds to accelerate a wide range of applications, including many that operate on sensitive data such as financial and medical records. We present ShEF, a trusted execution environment (TEE) for cloud-based reconfigurable accelerators. ShEF is independent from CPU-based TEEs and allows secure execution under a threat model where the adversary can control all software running on the CPU connected to the FPGA, has physical access to the FPGA, and can compromise the FPGA interface logic of the cloud provider. ShEF provides a secure boot and remote attestation process that relies solely on existing FPGA mechanisms for root of trust. It also includes a Shield component that provides secure access to data while the accelerator is in use. The Shield is highly customizable and extensible, allowing users to craft a bespoke security solution that fits their accelerator's memory access patterns, bandwidth, and security requirements at minimum performance and area overheads. We describe a prototype implementation of ShEF for existing cloud FPGAs, map ShEF to a performant and secure storage application, and measure the performance benefits of customizable security using five additional accelerators. △ Less

Submitted 27 January, 2022; v1 submitted 5 March, 2021; originally announced March 2021.

arXiv:2102.07957 [pdf, other]

doi 10.1073/pnas.2104425118

Vibrational relaxation dynamics in layered perovskite quantum wells

Authors: Li Na Quan, Yoonjae Park, Peijun Guo, Mengyu Gao, Jianbo **, Jianmei Huang, Jason K. Copper, Adam Schwartzberg, Richard Schaller, David T. Limmer, Peidong Yang

Abstract: Organic-inorganic layered perovskites are two-dimensional quantum wells with layers of lead-halide octahedra stacked between organic ligand barriers. The combination of their dielectric confinement and ionic sublattice results in excitonic excitations with substantial binding energies that are strongly coupled to the surrounding soft, polar lattice. However, the ligand environment in layered perov… ▽ More Organic-inorganic layered perovskites are two-dimensional quantum wells with layers of lead-halide octahedra stacked between organic ligand barriers. The combination of their dielectric confinement and ionic sublattice results in excitonic excitations with substantial binding energies that are strongly coupled to the surrounding soft, polar lattice. However, the ligand environment in layered perovskites can significantly alter their optical properties due to the complex dynamic disorder of soft perovskite lattice. Here, we observe the dynamic disorder through phonon dephasing lifetimes initiated by ultrafast photoexcitation employing high-resolution resonant impulsive stimulated Raman spectroscopy of a variety of ligand substitutions. We demonstrate that vibrational relaxation in layered perovskite formed from flexible alkyl-amines as organic barriers is fast and relatively independent of the lattice temperature. Relaxation in aromatic amine based layered perovskite is slower, though still fast relative to pure inorganic lead bromide lattices, with a rate that is temperature dependent. Using molecular dynamics simulations, we explain the fast rates of relaxation by quantifying the large anharmonic coupling of the optical modes with the ligand layers and rationalize the temperature independence due to their amorphous packing. This work provides a molecular and time-domain depiction of the relaxation of nascent optical excitations and opens opportunities to understand how they couple to the complex layered perovskite lattice, elucidating design principles for optoelectronic devices. △ Less

Submitted 15 February, 2021; originally announced February 2021.

Comments: 7 pages, 4 figures, SI

Showing 251–300 of 514 results for author: Gao, M