-
Universally Optimal Verification of Entangled States with Nondemolition Measurements
Authors:
Ye-Chao Liu,
Jiangwei Shang,
Rui Han,
Xiangdong Zhang
Abstract:
The efficient and reliable characterization of quantum states plays a vital role in most, if not all, quantum information processing tasks. In this work, we present a universally optimal protocol for verifying entangled states by employing the so-called quantum nondemolition measurements, such that the verification efficiency is equivalent to that of the optimal global strategy. Instead of being p…
▽ More
The efficient and reliable characterization of quantum states plays a vital role in most, if not all, quantum information processing tasks. In this work, we present a universally optimal protocol for verifying entangled states by employing the so-called quantum nondemolition measurements, such that the verification efficiency is equivalent to that of the optimal global strategy. Instead of being probabilistic as the standard verification strategies, our protocol is constructed sequentially, which is thus more favorable for experimental realizations. In addition, the target states are preserved in the protocol after each measurement, so can be reused in any subsequent tasks. We demonstrate the power of our protocol for the optimal verification of Bell states, arbitrary two-qubit pure states, and stabilizer states. We also prove that our protocol is able to perform tasks including fidelity estimation and state preparation.
△ Less
Submitted 4 March, 2021; v1 submitted 3 May, 2020;
originally announced May 2020.
-
User-Guided Aspect Classification for Domain-Specific Texts
Authors:
Peiran Li,
Fang Guo,
**gbo Shang
Abstract:
Aspect classification, identifying aspects of text segments, facilitates numerous applications, such as sentiment analysis and review summarization. To alleviate the human effort on annotating massive texts, in this paper, we study the problem of classifying aspects based on only a few user-provided seed words for pre-defined aspects. The major challenge lies in how to handle the noisy misc aspect…
▽ More
Aspect classification, identifying aspects of text segments, facilitates numerous applications, such as sentiment analysis and review summarization. To alleviate the human effort on annotating massive texts, in this paper, we study the problem of classifying aspects based on only a few user-provided seed words for pre-defined aspects. The major challenge lies in how to handle the noisy misc aspect, which is designed for texts without any pre-defined aspects. Even domain experts have difficulties to nominate seed words for the misc aspect, making existing seed-driven text classification methods not applicable. We propose a novel framework, ARYA, which enables mutual enhancements between pre-defined aspects and the misc aspect via iterative classifier training and seed updating. Specifically, it trains a classifier for pre-defined aspects and then leverages it to induce the supervision for the misc aspect. The prediction results of the misc aspect are later utilized to filter out noisy seed words for pre-defined aspects. Experiments in two domains demonstrate the superior performance of our proposed framework, as well as the necessity and importance of properly modeling the misc aspect.
△ Less
Submitted 29 April, 2020;
originally announced April 2020.
-
Empower Entity Set Expansion via Language Model Probing
Authors:
Yunyi Zhang,
Jiaming Shen,
**gbo Shang,
Jiawei Han
Abstract:
Entity set expansion, aiming at expanding a small seed entity set with new entities belonging to the same semantic class, is a critical task that benefits many downstream NLP and IR applications, such as question answering, query understanding, and taxonomy construction. Existing set expansion methods bootstrap the seed entity set by adaptively selecting context features and extracting new entitie…
▽ More
Entity set expansion, aiming at expanding a small seed entity set with new entities belonging to the same semantic class, is a critical task that benefits many downstream NLP and IR applications, such as question answering, query understanding, and taxonomy construction. Existing set expansion methods bootstrap the seed entity set by adaptively selecting context features and extracting new entities. A key challenge for entity set expansion is to avoid selecting ambiguous context features which will shift the class semantics and lead to accumulative errors in later iterations. In this study, we propose a novel iterative set expansion framework that leverages automatically generated class names to address the semantic drift issue. In each iteration, we select one positive and several negative class names by probing a pre-trained language model, and further score each candidate entity based on selected class names. Experiments on two datasets show that our framework generates high-quality class names and outperforms previous state-of-the-art methods significantly.
△ Less
Submitted 29 June, 2020; v1 submitted 28 April, 2020;
originally announced April 2020.
-
Joint Semantic Segmentation and Boundary Detection using Iterative Pyramid Contexts
Authors:
Mingmin Zhen,
**glu Wang,
Lei Zhou,
Shiwei Li,
Tianwei Shen,
Jiaxiang Shang,
Tian Fang,
Quan Long
Abstract:
In this paper, we present a joint multi-task learning framework for semantic segmentation and boundary detection. The critical component in the framework is the iterative pyramid context module (PCM), which couples two tasks and stores the shared latent semantics to interact between the two tasks. For semantic boundary detection, we propose the novel spatial gradient fusion to suppress nonsemantic…
▽ More
In this paper, we present a joint multi-task learning framework for semantic segmentation and boundary detection. The critical component in the framework is the iterative pyramid context module (PCM), which couples two tasks and stores the shared latent semantics to interact between the two tasks. For semantic boundary detection, we propose the novel spatial gradient fusion to suppress nonsemantic edges. As semantic boundary detection is the dual task of semantic segmentation, we introduce a loss function with boundary consistency constraint to improve the boundary pixel accuracy for semantic segmentation. Our extensive experiments demonstrate superior performance over state-of-the-art works, not only in semantic segmentation but also in semantic boundary detection. In particular, a mean IoU score of 81:8% on Cityscapes test set is achieved without using coarse data or any external data for semantic segmentation. For semantic boundary detection, we improve over previous state-of-the-art works by 9.9% in terms of AP and 6:8% in terms of MF(ODS).
△ Less
Submitted 16 April, 2020;
originally announced April 2020.
-
Verification of phased Dicke states
Authors:
Zihao Li,
Yun-Guang Han,
Hao-Feng Sun,
Jiangwei Shang,
Huangjun Zhu
Abstract:
Dicke states are typical examples of quantum states with genuine multipartite entanglement. They are valuable resources in many quantum information processing tasks, including multiparty quantum communication and quantum metrology. Phased Dicke states are a generalization of Dicke states and include antisymmetric basis states as a special example. These states are useful in atomic and molecular ph…
▽ More
Dicke states are typical examples of quantum states with genuine multipartite entanglement. They are valuable resources in many quantum information processing tasks, including multiparty quantum communication and quantum metrology. Phased Dicke states are a generalization of Dicke states and include antisymmetric basis states as a special example. These states are useful in atomic and molecular physics besides quantum information processing. Here we propose practical and efficient protocols based on adaptive local projective measurements for verifying all phased Dicke states, including $W$ states and qudit Dicke states. To verify any $n$-partite phased Dicke state within infidelity $ε$ and significance level $δ$, the number of tests required is only $O(nε^{-1}\lnδ^{-1})$, which is linear in $n$ and is exponentially more efficient than traditional tomographic approaches. In the case of $W$ states, the number of tests can be further reduced to $O(\sqrt{n}\,ε^{-1}\lnδ^{-1})$. Moreover, we construct an optimal protocol for any antisymmetric basis state; the number of tests required decreases (rather than increases) monotonically with $n$. This is the only optimal protocol known for multipartite nonstabilizer states.
△ Less
Submitted 10 February, 2021; v1 submitted 15 April, 2020;
originally announced April 2020.
-
Investigation and Application of Fitting Models for Centering Algorithms in Astrometry
Authors:
F. R. Lin,
Q. Y. Peng,
Z. J. Zheng,
B. F. Guo,
Y. J. Shang
Abstract:
To determine the precise positions of stars in CCD frames, various centering algorithms have been proposed for astrometry. The effective point spread function (ePSF) and the Gaussian centering algorithms are two representative centering algorithms. This paper compares in detail and investigates these two centering algorithms in performing data reduction. Specifically, synthetic star images in diff…
▽ More
To determine the precise positions of stars in CCD frames, various centering algorithms have been proposed for astrometry. The effective point spread function (ePSF) and the Gaussian centering algorithms are two representative centering algorithms. This paper compares in detail and investigates these two centering algorithms in performing data reduction. Specifically, synthetic star images in different conditions (i.e. profiles, fluxes, backgrounds and full width at half maximums) are generated and processed. We find that the difference in precision between the two algorithms is related to the profiles of the star images. Therefore, the precision comparison results using an ideal Gaussian-profile star image cannot be extended to other more specific experimental scenarios. Based on the simulation results, the most appropriate algorithm can be selected according to the image characteristics of observations, and the loss of precision of other algorithms can be estimated. The conclusions are verified using observations captured by the 1-m and 2.4-m telescopes at Yunnan Observatory.
△ Less
Submitted 24 December, 2021; v1 submitted 13 April, 2020;
originally announced April 2020.
-
AI Online Filters to Real World Image Recognition
Authors:
Hai Xiao,
** Shang,
Mengyuan Huang
Abstract:
Deep artificial neural networks, trained with labeled data sets are widely used in numerous vision and robotics applications today. In terms of AI, these are called reflex models, referring to the fact that they do not self-evolve or actively adapt to environmental changes. As demand for intelligent robot control expands to many high level tasks, reinforcement learning and state based models play…
▽ More
Deep artificial neural networks, trained with labeled data sets are widely used in numerous vision and robotics applications today. In terms of AI, these are called reflex models, referring to the fact that they do not self-evolve or actively adapt to environmental changes. As demand for intelligent robot control expands to many high level tasks, reinforcement learning and state based models play an increasingly important role. Herein, in computer vision and robotics domain, we study a novel approach to add reinforcement controls onto the image recognition reflex models to attain better overall performance, specifically to a wider environment range beyond what is expected of the task reflex models. Follow a common infrastructure with environment sensing and AI based modeling of self-adaptive agents, we implement multiple types of AI control agents. To the end, we provide comparative results of these agents with baseline, and an insightful analysis of their benefit to improve overall image recognition performance in real world.
△ Less
Submitted 11 February, 2020;
originally announced February 2020.
-
Experimental optimal orienteering via parallel and antiparallel spins
Authors:
Jun-Feng Tang,
Zhibo Hou,
Jiangwei Shang,
Huangjun Zhu,
Guo-Yong Xiang,
Chuan-Feng Li,
Guang-Can Guo
Abstract:
Antiparallel spins are superior in orienteering to parallel spins. This intriguing phenomenon is tied to entanglement associated with quantum measurements rather than quantum states. Using photonic systems, we experimentally realize the optimal orienteering protocols based on parallel spins and antiparallel spins, respectively. The optimal entangling measurements for decoding the direction informa…
▽ More
Antiparallel spins are superior in orienteering to parallel spins. This intriguing phenomenon is tied to entanglement associated with quantum measurements rather than quantum states. Using photonic systems, we experimentally realize the optimal orienteering protocols based on parallel spins and antiparallel spins, respectively. The optimal entangling measurements for decoding the direction information from parallel spins and antiparallel spins are realized using photonic quantum walks, which is a useful idea that is of wide interest in quantum information processing and foundational studies. Our experiments clearly demonstrate the advantage of antiparallel spins over parallel spins in orienteering. In addition, entangling measurements can extract more information than local measurements even if no entanglement is present in the quantum states.
△ Less
Submitted 17 February, 2020;
originally announced February 2020.
-
Opportunities and Challenges of Deep Learning Methods for Electrocardiogram Data: A Systematic Review
Authors:
Shenda Hong,
Yuxi Zhou,
Junyuan Shang,
Cao Xiao,
Jimeng Sun
Abstract:
Background:The electrocardiogram (ECG) is one of the most commonly used diagnostic tools in medicine and healthcare. Deep learning methods have achieved promising results on predictive healthcare tasks using ECG signals. Objective:This paper presents a systematic review of deep learning methods for ECG data from both modeling and application perspectives. Methods:We extracted papers that applied d…
▽ More
Background:The electrocardiogram (ECG) is one of the most commonly used diagnostic tools in medicine and healthcare. Deep learning methods have achieved promising results on predictive healthcare tasks using ECG signals. Objective:This paper presents a systematic review of deep learning methods for ECG data from both modeling and application perspectives. Methods:We extracted papers that applied deep learning (deep neural network) models to ECG data that were published between Jan. 1st of 2010 and Feb. 29th of 2020 from Google Scholar, PubMed, and the DBLP. We then analyzed each article according to three factors: tasks, models, and data. Finally, we discuss open challenges and unsolved problems in this area. Results: The total number of papers extracted was 191. Among these papers, 108 were published after 2019. Different deep learning architectures have been used in various ECG analytics tasks, such as disease detection/classification, annotation/localization, sleep staging, biometric human identification, and denoising. Conclusion: The number of works on deep learning for ECG data has grown explosively in recent years. Such works have achieved accuracy comparable to that of traditional feature-based approaches and ensembles of multiple approaches can achieve even better results. Specifically, we found that a hybrid architecture of a convolutional neural network and recurrent neural network ensemble using expert features yields the best results. However, there are some new challenges and problems related to interpretability, scalability, and efficiency that must be addressed. Furthermore, it is also worth investigating new applications from the perspectives of datasets and methods. Significance: This paper summarizes existing deep learning research using ECG data from multiple perspectives and highlights existing challenges and problems to identify potential future research directions.
△ Less
Submitted 30 April, 2020; v1 submitted 27 December, 2019;
originally announced January 2020.
-
Reversible Gas Sensing by Ferroelectric Switch and 2D Molecule Multiferroics in In2Se3 Monolayer
Authors:
Xiao Tang,
**g Shang,
Yuantong Gu,
Aijun Du,
Liangzhi Kou
Abstract:
Two-dimensional ferroelectrics are important quantum materials which have found novel application in nonvolatile memories, however, the effects of reversible polarization on chemical reactions and interaction with environments are rarely studied despite of its importance. Here, based on the first-principles calculations, we found distinct gas adsorption behaviors on the surfaces of ferroelectric I…
▽ More
Two-dimensional ferroelectrics are important quantum materials which have found novel application in nonvolatile memories, however, the effects of reversible polarization on chemical reactions and interaction with environments are rarely studied despite of its importance. Here, based on the first-principles calculations, we found distinct gas adsorption behaviors on the surfaces of ferroelectric In2Se3 layer and the reversible gas caption and release controlled by ferroelectric switch. We rationalize the novel phenomena to the synergistic effect of the different electrostatic potential and electron transfer induced by band alignments between frontier molecular orbitals of gas and band edge states of substrate. Excitingly, the adsorption of paramagnetic gas molecules such as NO and NO2 can induce surface magnetism, which is also sensitive to ferroelectric polarization direction of In2Se3, indicating the application of In2Se3 as threshold magnetic sensors or switcher. Furthermore, it is suggested two NO molecules prefer to ferromagnetically couple with each other, the Curie temperature is polarization dependent which can reach up to 50K, leading to the long-sought 2D molecule multiferroics. The ferroelectric controllable adsorption behavior and molecule multiferroic feature will find extensive application in gas caption, selective catalytic reduction and spintronic device.
△ Less
Submitted 3 December, 2019;
originally announced December 2019.
-
Multiferroic Decorated Fe2O3 Monolayer Predicted from First Principles
Authors:
**g Shang,
Chun Li,
Aijun Du,
Ting Liao,
Yuantong Gu,
Yandong Ma,
Liangzhi Kou,
Changfeng Chen
Abstract:
Two-dimensional (2D) multiferroics exhibit cross-control capacity between magnetic and electric responses in reduced spatial domain, making them well suited for next-generation nanoscale devices; however, progress has been slow in develo** materials with required characteristic properties. Here we identify by first-principles calculations robust 2D multiferroic behaviors in decorated Fe2O3 monol…
▽ More
Two-dimensional (2D) multiferroics exhibit cross-control capacity between magnetic and electric responses in reduced spatial domain, making them well suited for next-generation nanoscale devices; however, progress has been slow in develo** materials with required characteristic properties. Here we identify by first-principles calculations robust 2D multiferroic behaviors in decorated Fe2O3 monolayer, showcasing N@Fe2O3 as a prototypical case, where ferroelectricity and ferromagnetism stem from the same origin, namely Fe d-orbit splitting induced by the Jahn-Teller distortion and associated crystal field changes. The resulting ferromagnetic and ferroelectric polarization can be effectively reversed and regulated by applied electric field or strain, offering efficient functionality. These findings establish strong materials phenomena and elucidate underlying physics mechanism in a family of truly 2D multiferroics that are highly promising for advanced device applications.
△ Less
Submitted 13 May, 2020; v1 submitted 27 November, 2019;
originally announced November 2019.
-
Efficient verification of quantum processes
Authors:
Ye-Chao Liu,
Jiangwei Shang,
Xiao-Dong Yu,
Xiangdong Zhang
Abstract:
Quantum processes, such as quantum circuits, quantum memories, and quantum channels, are essential ingredients in almost all quantum information processing tasks. However, the characterization of these processes remains a daunting task due to the exponentially increasing amount of resources required by traditional methods. Here, by first proposing the concept of quantum process verification, we es…
▽ More
Quantum processes, such as quantum circuits, quantum memories, and quantum channels, are essential ingredients in almost all quantum information processing tasks. However, the characterization of these processes remains a daunting task due to the exponentially increasing amount of resources required by traditional methods. Here, by first proposing the concept of quantum process verification, we establish two efficient and practical protocols for verifying quantum processes which can provide an exponential improvement over the standard quantum process tomography and a quadratic improvement over the method of direct fidelity estimation. The efficacy of our protocols is illustrated with the verification of various quantum gates as well as the processes of well-known quantum circuits. Moreover, our protocols are readily applicable with current experimental techniques since only local measurements are required. In addition, we show that our protocols for verifying quantum processes can be easily adapted to verify quantum measurements.
△ Less
Submitted 15 April, 2020; v1 submitted 30 October, 2019;
originally announced October 2019.
-
SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble
Authors:
Jiaming Shen,
Zeqiu Wu,
Dongming Lei,
**gbo Shang,
Xiang Ren,
Jiawei Han
Abstract:
Corpus-based set expansion (i.e., finding the "complete" set of entities belonging to the same semantic class, based on a given corpus and a tiny set of seeds) is a critical task in knowledge discovery. It may facilitate numerous downstream applications, such as information extraction, taxonomy induction, question answering, and web search. To discover new entities in an expanded set, previous app…
▽ More
Corpus-based set expansion (i.e., finding the "complete" set of entities belonging to the same semantic class, based on a given corpus and a tiny set of seeds) is a critical task in knowledge discovery. It may facilitate numerous downstream applications, such as information extraction, taxonomy induction, question answering, and web search. To discover new entities in an expanded set, previous approaches either make one-time entity ranking based on distributional similarity, or resort to iterative pattern-based bootstrap**. The core challenge for these methods is how to deal with noisy context features derived from free-text corpora, which may lead to entity intrusion and semantic drifting. In this study, we propose a novel framework, SetExpan, which tackles this problem, with two techniques: (1) a context feature selection method that selects clean context features for calculating entity-entity distributional similarity, and (2) a ranking-based unsupervised ensemble method for expanding entity set based on denoised context features. Experiments on three datasets show that SetExpan is robust and outperforms previous state-of-the-art methods in terms of mean average precision.
△ Less
Submitted 17 October, 2019;
originally announced October 2019.
-
FUSE: Multi-Faceted Set Expansion by Coherent Clustering of Skip-grams
Authors:
Wanzheng Zhu,
Hongyu Gong,
Jiaming Shen,
Chao Zhang,
**gbo Shang,
Suma Bhat,
Jiawei Han
Abstract:
Set expansion aims to expand a small set of seed entities into a complete set of relevant entities. Most existing approaches assume the input seed set is unambiguous and completely ignore the multi-faceted semantics of seed entities. As a result, given the seed set {"Canon", "Sony", "Nikon"}, previous models return one mixed set of entities that are either Camera Brands or Japanese Companies. In t…
▽ More
Set expansion aims to expand a small set of seed entities into a complete set of relevant entities. Most existing approaches assume the input seed set is unambiguous and completely ignore the multi-faceted semantics of seed entities. As a result, given the seed set {"Canon", "Sony", "Nikon"}, previous models return one mixed set of entities that are either Camera Brands or Japanese Companies. In this paper, we study the task of multi-faceted set expansion, which aims to capture all semantic facets in the seed set and return multiple sets of entities, one for each semantic facet. We propose an unsupervised framework, FUSE, which consists of three major components: (1) facet discovery module: identifies all semantic facets of each seed entity by extracting and clustering its skip-grams, and (2) facet fusion module: discovers shared semantic facets of the entire seed set by an optimization formulation, and (3) entity expansion module: expands each semantic facet by utilizing a masked language model with pre-trained BERT models. Extensive experiments demonstrate that FUSE can accurately identify multiple semantic facets of the seed set and generate quality entities for each facet.
△ Less
Submitted 18 June, 2020; v1 submitted 9 October, 2019;
originally announced October 2019.
-
GENN: Predicting Correlated Drug-drug Interactions with Graph Energy Neural Networks
Authors:
Tengfei Ma,
Junyuan Shang,
Cao Xiao,
Jimeng Sun
Abstract:
Gaining more comprehensive knowledge about drug-drug interactions (DDIs) is one of the most important tasks in drug development and medical practice. Recently graph neural networks have achieved great success in this task by modeling drugs as nodes and drug-drug interactions as links and casting DDI predictions as link prediction problems. However, correlations between link labels (e.g., DDI types…
▽ More
Gaining more comprehensive knowledge about drug-drug interactions (DDIs) is one of the most important tasks in drug development and medical practice. Recently graph neural networks have achieved great success in this task by modeling drugs as nodes and drug-drug interactions as links and casting DDI predictions as link prediction problems. However, correlations between link labels (e.g., DDI types) were rarely considered in existing works. We propose the graph energy neural network (GENN) to explicitly model link type correlations. We formulate the DDI prediction task as a structure prediction problem and introduce a new energy-based model where the energy function is defined by graph neural networks. Experiments on two real-world DDI datasets demonstrated that GENN is superior to many baselines without consideration of link type correlations and achieved $13.77\%$ and $5.01\%$ PR-AUC improvement on the two datasets, respectively. We also present a case study in which \mname can better capture meaningful DDI correlations compared with baseline models.
△ Less
Submitted 7 October, 2019; v1 submitted 4 October, 2019;
originally announced October 2019.
-
CubeNet: Multi-Facet Hierarchical Heterogeneous Network Construction, Analysis, and Mining
Authors:
Carl Yang,
Dai Teng,
Siyang Liu,
Sayantani Basu,
Jieyu Zhang,
Jiaming Shen,
Chao Zhang,
**gbo Shang,
Lance Kaplan,
Timothy Harratty,
Jiawei Han
Abstract:
Due to the ever-increasing size of data, construction, analysis and mining of universal massive networks are becoming forbidden and meaningless. In this work, we outline a novel framework called CubeNet, which systematically constructs and organizes real-world networks into different but correlated semantic cells, to support various downstream network analysis and mining tasks with better flexibil…
▽ More
Due to the ever-increasing size of data, construction, analysis and mining of universal massive networks are becoming forbidden and meaningless. In this work, we outline a novel framework called CubeNet, which systematically constructs and organizes real-world networks into different but correlated semantic cells, to support various downstream network analysis and mining tasks with better flexibility, deeper insights and higher efficiency. Particular, we promote our recent research on text and network mining with novel concepts and techniques to (1) construct four real-world large-scale multi-facet hierarchical heterogeneous networks; (2) enable insightful OLAP-style network analysis; (3) facilitate localized and contextual network mining. Although some functions have been covered individually in our previous work, a systematic and efficient realization of an organic system has not been studied, while some functions are still our on-going research tasks. By integrating them, CubeNet may not only showcase the utility of our recent research, but also inspire and stimulate future research on effective, insightful and scalable knowledge discovery under this novel framework.
△ Less
Submitted 28 September, 2019;
originally announced October 2019.
-
CrossWeigh: Training Named Entity Tagger from Imperfect Annotations
Authors:
Zihan Wang,
**gbo Shang,
Liyuan Liu,
Lihao Lu,
Jiacheng Liu,
Jiawei Han
Abstract:
Everyone makes mistakes. So do human annotators when curating labels for named entity recognition (NER). Such label mistakes might hurt model training and interfere model comparison. In this study, we dive deep into one of the widely-adopted NER benchmark datasets, CoNLL03 NER. We are able to identify label mistakes in about 5.38% test sentences, which is a significant ratio considering that the s…
▽ More
Everyone makes mistakes. So do human annotators when curating labels for named entity recognition (NER). Such label mistakes might hurt model training and interfere model comparison. In this study, we dive deep into one of the widely-adopted NER benchmark datasets, CoNLL03 NER. We are able to identify label mistakes in about 5.38% test sentences, which is a significant ratio considering that the state-of-the-art test F1 score is already around 93%. Therefore, we manually correct these label mistakes and form a cleaner test set. Our re-evaluation of popular models on this corrected test set leads to more accurate assessments, compared to those on the original test set. More importantly, we propose a simple yet effective framework, CrossWeigh, to handle label mistakes during NER model training. Specifically, it partitions the training data into several folds and train independent NER models to identify potential mistakes in each fold. Then it adjusts the weights of training data accordingly to train the final NER model. Extensive experiments demonstrate significant improvements of plugging various NER models into our proposed framework on three datasets. All implementations and corrected test set are available at our Github repo: https://github.com/ZihanWangKi/CrossWeigh.
△ Less
Submitted 3 September, 2019;
originally announced September 2019.
-
K-margin-based Residual-Convolution-Recurrent Neural Network for Atrial Fibrillation Detection
Authors:
Yuxi Zhou,
Shenda Hong,
Junyuan Shang,
Meng Wu,
Qingyun Wang,
Hongyan Li,
Junqing Xie
Abstract:
Atrial Fibrillation (AF) is an abnormal heart rhythm which can trigger cardiac arrest and sudden death. Nevertheless, its interpretation is mostly done by medical experts due to high error rates of computerized interpretation. One study found that only about 66% of AF were correctly recognized from noisy ECGs. This is in part due to insufficient training data, class skewness, as well as semantical…
▽ More
Atrial Fibrillation (AF) is an abnormal heart rhythm which can trigger cardiac arrest and sudden death. Nevertheless, its interpretation is mostly done by medical experts due to high error rates of computerized interpretation. One study found that only about 66% of AF were correctly recognized from noisy ECGs. This is in part due to insufficient training data, class skewness, as well as semantical ambiguities caused by noisy segments in an ECG record. In this paper, we propose a K-margin-based Residual-Convolution-Recurrent neural network (K-margin-based RCR-net) for AF detection from noisy ECGs. In detail, a skewness-driven dynamic augmentation method is employed to handle the problems of data inadequacy and class imbalance. A novel RCR-net is proposed to automatically extract both long-term rhythm-level and local heartbeat-level characters. Finally, we present a K-margin-based diagnosis model to automatically focus on the most important parts of an ECG record and handle noise by naturally exploiting expected consistency among the segments associated for each record. The experimental results demonstrate that the proposed method with 0.8125 F1NAOP score outperforms all state-of-the-art deep learning methods for AF detection task by 6.8%.
△ Less
Submitted 9 August, 2019;
originally announced August 2019.
-
Raw-to-End Name Entity Recognition in Social Media
Authors:
Liyuan Liu,
Zihan Wang,
**gbo Shang,
Dandong Yin,
Heng Ji,
Xiang Ren,
Shaowen Wang,
Jiawei Han
Abstract:
Taking word sequences as the input, typical named entity recognition (NER) models neglect errors from pre-processing (e.g., tokenization). However, these errors can influence the model performance greatly, especially for noisy texts like tweets. Here, we introduce Neural-Char-CRF, a raw-to-end framework that is more robust to pre-processing errors. It takes raw character sequences as inputs and ma…
▽ More
Taking word sequences as the input, typical named entity recognition (NER) models neglect errors from pre-processing (e.g., tokenization). However, these errors can influence the model performance greatly, especially for noisy texts like tweets. Here, we introduce Neural-Char-CRF, a raw-to-end framework that is more robust to pre-processing errors. It takes raw character sequences as inputs and makes end-to-end predictions. Word embedding and contextualized representation models are further tailored to capture textual signals for each character instead of each word. Our model neither requires the conversion from character sequences to word sequences, nor assumes tokenizer can correctly detect all word boundaries. Moreover, we observe our model performance remains unchanged after replacing tokenization with string matching, which demonstrates its potential to be tokenization-free. Extensive experimental results on two public datasets demonstrate the superiority of our proposed method over the state of the art. The implementations and datasets are made available at: https://github.com/LiyuanLucasLiu/Raw-to-End.
△ Less
Submitted 14 August, 2019;
originally announced August 2019.
-
Discrete unified gas kinetic scheme for nonlinear convection-diffusion equations
Authors:
**long Shang,
Zhenhua Chai,
Huili Wang,
Baochang Shi
Abstract:
In this paper, we develop a discrete unified gas kinetic scheme (DUGKS) for general nonlinear convection-diffusion equation (NCDE), and show that the NCDE can be recovered correctly from the present model through the Chapman-Enskog analysis. We then test the present DUGKS through some classic convection-diffusion equations, and find that the numerical results are in good agreement with analytical…
▽ More
In this paper, we develop a discrete unified gas kinetic scheme (DUGKS) for general nonlinear convection-diffusion equation (NCDE), and show that the NCDE can be recovered correctly from the present model through the Chapman-Enskog analysis. We then test the present DUGKS through some classic convection-diffusion equations, and find that the numerical results are in good agreement with analytical solutions and the DUGKS model has a second-order convergence rate. Finally, as a finite-volume method, DUGKS can also adopt the non-uniform mesh. Besides, we performed some comparisons among the DUGKS, finite-volume lattice Boltzmann model (FV-LBM), single-relaxation-time lattice Boltzmann model (SLBM) and multiple-relaxation-time lattice Boltzmann model (MRT-LBM). The results show that the DUGKS model is more accurate than FV-LBM, more stable than SLBM, and almost has the same accuracy as the MRT-LBM. Besides, the using of non-uniform mesh may make DUGKS model more flexible.
△ Less
Submitted 16 June, 2019;
originally announced June 2019.
-
Pre-training of Graph Augmented Transformers for Medication Recommendation
Authors:
Junyuan Shang,
Tengfei Ma,
Cao Xiao,
Jimeng Sun
Abstract:
Medication recommendation is an important healthcare application. It is commonly formulated as a temporal prediction task. Hence, most existing works only utilize longitudinal electronic health records (EHRs) from a small number of patients with multiple visits ignoring a large number of patients with a single visit (selection bias). Moreover, important hierarchical knowledge such as diagnosis hie…
▽ More
Medication recommendation is an important healthcare application. It is commonly formulated as a temporal prediction task. Hence, most existing works only utilize longitudinal electronic health records (EHRs) from a small number of patients with multiple visits ignoring a large number of patients with a single visit (selection bias). Moreover, important hierarchical knowledge such as diagnosis hierarchy is not leveraged in the representation learning process. To address these challenges, we propose G-BERT, a new model to combine the power of Graph Neural Networks (GNNs) and BERT (Bidirectional Encoder Representations from Transformers) for medical code representation and medication recommendation. We use GNNs to represent the internal hierarchical structures of medical codes. Then we integrate the GNN representation into a transformer-based visit encoder and pre-train it on EHR data from patients only with a single visit. The pre-trained visit encoder and representation are then fine-tuned for downstream predictive tasks on longitudinal EHRs from patients with multiple visits. G-BERT is the first to bring the language model pre-training schema into the healthcare domain and it achieved state-of-the-art performance on the medication recommendation task.
△ Less
Submitted 26 November, 2019; v1 submitted 2 June, 2019;
originally announced June 2019.
-
Robust Principal Component Analysis for Modal Decomposition of Corrupt Fluid Flows
Authors:
Isabel Scherl,
Benjamin Strom,
Jessica K. Shang,
Owen Williams,
Brian L. Polagye,
Steven L. Brunton
Abstract:
Modal analysis techniques are used to identify patterns and develop reduced-order models in a variety of fluid applications. However, experimentally acquired flow fields may be corrupted with incorrect and missing entries, which may degrade modal decomposition. Here we use robust principal component analysis (RPCA) to improve the quality of flow field data by leveraging global coherent structures…
▽ More
Modal analysis techniques are used to identify patterns and develop reduced-order models in a variety of fluid applications. However, experimentally acquired flow fields may be corrupted with incorrect and missing entries, which may degrade modal decomposition. Here we use robust principal component analysis (RPCA) to improve the quality of flow field data by leveraging global coherent structures to identify and replace spurious data points. RPCA is a robust variant of principal component analysis (PCA), also known as proper orthogonal decomposition (POD) in fluids, that decomposes a data matrix into the sum of a low-rank matrix containing coherent structures and a sparse matrix of outliers and corrupt entries. We apply RPCA filtering to a range of fluid simulations and experiments of varying complexities and assess the accuracy of low-rank structure recovery. First, we analyze direct numerical simulations of flow past a circular cylinder at Reynolds number 100 with artificial outliers, alongside similar PIV measurements at Reynolds number 413. Next, we apply RPCA filtering to a turbulent channel flow simulation from the Johns Hopkins Turbulence database, demonstrating that dominant coherent structures are preserved in the low-rank matrix. Finally, we investigate PIV measurements behind a two-bladed cross-flow turbine that exhibits both broadband and coherent phenomena. In all cases, we find that RPCA filtering extracts dominant coherent structures and identifies and fills in incorrect or missing measurements. The performance is particularly striking when flow fields are analyzed using dynamic mode decomposition, which is sensitive to noise and outliers.
△ Less
Submitted 13 December, 2019; v1 submitted 16 May, 2019;
originally announced May 2019.
-
Proper error bars for self-calibrating quantum tomography
Authors:
Jun Yan Sim,
Jiangwei Shang,
Hui Khoon Ng,
Berthold-Georg Englert
Abstract:
Self-calibrating quantum state tomography aims at reconstructing the unknown quantum state and certain properties of the measurement devices from the same data. Since the estimates of the state and device parameters come from the same data, one should employ a joint estimation scheme, including the construction and reporting of joint state-device error regions to quantify uncertainty. We explain h…
▽ More
Self-calibrating quantum state tomography aims at reconstructing the unknown quantum state and certain properties of the measurement devices from the same data. Since the estimates of the state and device parameters come from the same data, one should employ a joint estimation scheme, including the construction and reporting of joint state-device error regions to quantify uncertainty. We explain how to do this naturally within the framework of optimal error regions. As an illustrative example, we apply our procedure to the double-crosshair measurement of the BB84 scenario in quantum cryptography and so reconstruct the state and estimate the detection efficiencies simultaneously and reliably. We also discuss the practical situation of a satellite-based quantum key distribution scheme, for which self-calibration and proper treatment of the data are necessities.
△ Less
Submitted 6 September, 2019; v1 submitted 25 April, 2019;
originally announced April 2019.
-
Computer-aided study of double extensions of restricted Lie superalgebras preserving the non-degenerate closed 2-forms in characteristic 2
Authors:
Sofiane Bouarroudj,
Dimitry Leites,
** Shang
Abstract:
A Lie (super)algebra with a non-degenerate invariant symmetric bilinear form $B$ is called a nis-(super)algebra. The double extension $\mathfrak{g}$ of a nis-(super)algebra $\mathfrak{a}$ is the result of simultaneous adding to $\mathfrak{a}$ a central element and a derivation so that $\mathfrak{g}$ is a nis-algebra. Loop algebras with values in simple complex Lie algebras are most known among the…
▽ More
A Lie (super)algebra with a non-degenerate invariant symmetric bilinear form $B$ is called a nis-(super)algebra. The double extension $\mathfrak{g}$ of a nis-(super)algebra $\mathfrak{a}$ is the result of simultaneous adding to $\mathfrak{a}$ a central element and a derivation so that $\mathfrak{g}$ is a nis-algebra. Loop algebras with values in simple complex Lie algebras are most known among the Lie (super)algebras suitable to be doubly extended. In characteristic 2 the notion of double extension acquires specific features.
Restricted Lie (super)algebras are among the most interesting modular Lie superalgebras. In characteristic 2, using Grozman's Mathematica-based package SuperLie, we list double extensions of restricted Lie superalgebras preserving the non-degenerate closed 2-forms with constant coefficients. The results are proved for the number of indeterminates ranging from 4 to 7 - sufficient to conjecture the pattern for larger numbers. Considering multigradings allowed us to accelerate computations up to 100 times.
△ Less
Submitted 21 April, 2019;
originally announced April 2019.
-
The roots of exceptional modular Lie superalgebras with Cartan matrix
Authors:
Sofiane Bouarroudj,
Dimitry Leites,
Alexander Lozhechnyk,
** Shang
Abstract:
For each of the exceptional Lie superalgebras with indecomposable Cartan matrix, we give the explicit list of its roots of and the corresponding Chevalley basis for one of the inequivalent Cartan matrices, the one corresponding to the greatest number of mutually orthogonal isotropic odd simple roots.
Our main tools: Grozman's Mathematica-based code SuperLie, and Python.
For each of the exceptional Lie superalgebras with indecomposable Cartan matrix, we give the explicit list of its roots of and the corresponding Chevalley basis for one of the inequivalent Cartan matrices, the one corresponding to the greatest number of mutually orthogonal isotropic odd simple roots.
Our main tools: Grozman's Mathematica-based code SuperLie, and Python.
△ Less
Submitted 21 April, 2019;
originally announced April 2019.
-
Efficient verification of Dicke states
Authors:
Ye-Chao Liu,
Xiao-Dong Yu,
Jiangwei Shang,
Huangjun Zhu,
Xiangdong Zhang
Abstract:
Among various multipartite entangled states, Dicke states stand out because their entanglement is maximally persistent and robust under particle losses. Although much attention has been attracted for their potential applications in quantum information processing and foundational studies, the characterization of Dicke states remains as a challenging task in experiments. Here, we propose efficient a…
▽ More
Among various multipartite entangled states, Dicke states stand out because their entanglement is maximally persistent and robust under particle losses. Although much attention has been attracted for their potential applications in quantum information processing and foundational studies, the characterization of Dicke states remains as a challenging task in experiments. Here, we propose efficient and practical protocols for verifying arbitrary $n$-qubit Dicke states in both adaptive and nonadaptive ways. Our protocols require only two distinct settings based on Pauli measurements besides permutations of the qubits. To achieve infidelity $ε$ and confidence level $1-δ$, the total number of tests required is only $O(nε^{-1}\lnδ^{-1})$. This performance is exponentially more efficient than all previous protocols based on local measurements, including quantum state tomography and direct fidelity estimation, and is comparable to the best global strategy. Our protocols are readily applicable with current experimental techniques and are able to verify Dicke states of hundreds of qubits.
△ Less
Submitted 10 October, 2019; v1 submitted 3 April, 2019;
originally announced April 2019.
-
Direct photoluminescence probing of ferromagnetism in monolayer two-dimensional CrBr3
Authors:
Zhaowei Zhang,
**gzhi Shang,
Chongyun Jiang,
Abdullah Rasmita,
Weibo Gao,
Ting Yu
Abstract:
Atomically thin magnets are the key element to build up spintronics based on two-dimensional materials. The surface nature of two-dimensional ferromagnet opens up opportunities to improve the device performance efficiently. Here, we report the intrinsic ferromagnetism in atomically thin monolayer CrBr3, directly probed by polarization resolved magneto-photoluminescence. The spontaneous magnetizati…
▽ More
Atomically thin magnets are the key element to build up spintronics based on two-dimensional materials. The surface nature of two-dimensional ferromagnet opens up opportunities to improve the device performance efficiently. Here, we report the intrinsic ferromagnetism in atomically thin monolayer CrBr3, directly probed by polarization resolved magneto-photoluminescence. The spontaneous magnetization persists in monolayer CrBr3 with a Curie temperature of 34 K. The development of magnons by the thermal excitation is in line with the spin-wave theory. We attribute the layer-number dependent hysteresis loops in thick layers to the magnetic domain structures. As a stable monolayer material in air, CrBr3 provides a convenient platform for fundamental physics and pushes the potential applications of the two-dimensional ferromagnetism.
△ Less
Submitted 20 February, 2019;
originally announced February 2019.
-
Optimal verification of general bipartite pure states
Authors:
Xiao-Dong Yu,
Jiangwei Shang,
Otfried Gühne
Abstract:
The efficient and reliable verification of quantum states plays a crucial role in various quantum information processing tasks. We consider the task of verifying entangled states using one-way and two-way classical communication and completely characterize the optimal strategies via convex optimization. We solve these optimization problems using both analytical and numerical methods, and the optim…
▽ More
The efficient and reliable verification of quantum states plays a crucial role in various quantum information processing tasks. We consider the task of verifying entangled states using one-way and two-way classical communication and completely characterize the optimal strategies via convex optimization. We solve these optimization problems using both analytical and numerical methods, and the optimal strategies can be constructed for any bipartite pure state. Compared with the nonadaptive approach, our adaptive strategies significantly improve the efficiency of quantum state verification. Moreover, these strategies are experimentally feasible, as only few local projective measurements are required.
△ Less
Submitted 6 December, 2019; v1 submitted 28 January, 2019;
originally announced January 2019.
-
Quantifying quantum resources with conic programming
Authors:
Roope Uola,
Tristan Kraft,
Jiangwei Shang,
Xiao-Dong Yu,
Otfried Gühne
Abstract:
Resource theories can be used to formalize the quantification and manipulation of resources in quantum information processing such as entanglement, asymmetry and coherence of quantum states, and incompatibility of quantum measurements. Given a certain state or measurement, one can ask whether there is a task in which it performs better than any resourceless state or measurement. Using conic progra…
▽ More
Resource theories can be used to formalize the quantification and manipulation of resources in quantum information processing such as entanglement, asymmetry and coherence of quantum states, and incompatibility of quantum measurements. Given a certain state or measurement, one can ask whether there is a task in which it performs better than any resourceless state or measurement. Using conic programming, we prove that any general robustness measure (with respect to a convex set of free states or measurements) can be seen as a quantifier of such outperformance in some discrimination task. We apply the technique to various examples, e.g. joint measurability, POVMs simulable by projective measurements, and state assemblages preparable with a given Schmidt number.
△ Less
Submitted 4 April, 2019; v1 submitted 21 December, 2018;
originally announced December 2018.
-
Real-Time Transmission Mechanism Design for Wireless IoT Sensors with Energy Harvesting under Power Saving Mode
Authors:
** Shang,
Muhammad Junaid Farooq,
Quanyan Zhu
Abstract:
The Internet of things (IoT) comprises of wireless sensors and actuators connected via access points to the Internet. Often, the sensing devices are remotely deployed with limited battery power and are equipped with energy harvesting equipment. These devices transmit real-time data to the base station (BS), which is used in applications such as anomaly detection. Under sufficient power availabilit…
▽ More
The Internet of things (IoT) comprises of wireless sensors and actuators connected via access points to the Internet. Often, the sensing devices are remotely deployed with limited battery power and are equipped with energy harvesting equipment. These devices transmit real-time data to the base station (BS), which is used in applications such as anomaly detection. Under sufficient power availability, wireless transmissions from sensors can be scheduled at regular time intervals to maintain real-time data acquisition. However, once the battery is significantly depleted, the devices enter into power saving mode and need to be more selective in transmitting information to the BS. Transmitting a particular piece of sensed data consumes power while discarding it may result in loss of utility at the BS. The goal is to design an optimal dynamic policy which enables the device to decide whether to transmit or to discard a piece of sensing data particularly under the power saving mode. This will enable the sensor to prolong its operation while causing minimum loss of utility to the application. We develop an analytical framework to capture the utility of the IoT sensor transmissions and leverage dynamic programming based approach to derive an optimal real-time transmission policy that is based on the statistics of information arrival, the likelihood of harvested energy, and designed lifetime of the sensors. Numerical results show that if the statistics of future data valuation are accurately predicted, there is a significant increase in utility obtained at the BS as well as the battery lifetime.
△ Less
Submitted 8 April, 2019; v1 submitted 6 December, 2018;
originally announced December 2018.
-
A Light-weight Vibrational Motor Powered Recoil Robot that Hops Rapidly Across Granular Media
Authors:
Alice C. Quillen,
Randal C. Nelson,
Hesam Askari,
Kathryn Chotkowski,
Esteban Wright,
Jessica K. Shang
Abstract:
A 1 cm coin vibrational motor fixed to the center of a 4 cm square foam platform moves rapidly across granular media (poppy seeds, millet, corn meal) at a speed of up to 30 cm/s, or about 5 body lengths/s. Fast speeds are achieved with dimensionless acceleration number, similar to a Froude number, up to 50, allowing the light-weight 1.4 g mechanism to remain above the substrate, levitated and prop…
▽ More
A 1 cm coin vibrational motor fixed to the center of a 4 cm square foam platform moves rapidly across granular media (poppy seeds, millet, corn meal) at a speed of up to 30 cm/s, or about 5 body lengths/s. Fast speeds are achieved with dimensionless acceleration number, similar to a Froude number, up to 50, allowing the light-weight 1.4 g mechanism to remain above the substrate, levitated and propelled by its kicks off the surface. The mechanism is low cost and moves without any external moving parts. With 2 s exposures we photograph the trajectory of the mechanism using an LED blocked except for a pin-hole and fixed to the mechanism. Trajectories can exhibit period doubling phenomena similar to a ball bouncing on a vibrating table top. A two dimensional numerical model gives similar trajectories, though a vertical drag force is required to keep the mechanism height low. We attribute the vertical drag force to aerodynamic suction from air flow below the mechanism base and through the granular substrate. Our numerical model suggests that speed is maximized when the mechanism is prevented from jum** high off the surface. In this way the mechanism resembles a gallo** or jum** animal whose body remains nearly at the same height above the ground during its gait.
△ Less
Submitted 23 September, 2018;
originally announced October 2018.
-
Learning Named Entity Tagger using Domain-Specific Dictionary
Authors:
**gbo Shang,
Liyuan Liu,
Xiang Ren,
Xiaotao Gu,
Teng Ren,
Jiawei Han
Abstract:
Recent advances in deep neural models allow us to build reliable named entity recognition (NER) systems without handcrafting features. However, such methods require large amounts of manually-labeled training data. There have been efforts on replacing human annotations with distant supervision (in conjunction with external dictionaries), but the generated noisy labels pose significant challenges on…
▽ More
Recent advances in deep neural models allow us to build reliable named entity recognition (NER) systems without handcrafting features. However, such methods require large amounts of manually-labeled training data. There have been efforts on replacing human annotations with distant supervision (in conjunction with external dictionaries), but the generated noisy labels pose significant challenges on learning effective neural models. Here we propose two neural models to suit noisy distant supervision from the dictionary. First, under the traditional sequence labeling framework, we propose a revised fuzzy CRF layer to handle tokens with multiple possible labels. After identifying the nature of noisy labels in distant supervision, we go beyond the traditional framework and propose a novel, more effective neural model AutoNER with a new Tie or Break scheme. In addition, we discuss how to refine distant supervision for better NER performance. Extensive experiments on three benchmark datasets demonstrate that AutoNER achieves the best performance when only using dictionaries with no additional human effort, and delivers competitive results with state-of-the-art supervised benchmarks.
△ Less
Submitted 10 September, 2018;
originally announced September 2018.
-
GAMENet: Graph Augmented MEmory Networks for Recommending Medication Combination
Authors:
Junyuan Shang,
Cao Xiao,
Tengfei Ma,
Hongyan Li,
Jimeng Sun
Abstract:
Recent progress in deep learning is revolutionizing the healthcare domain including providing solutions to medication recommendations, especially recommending medication combination for patients with complex health conditions. Existing approaches either do not customize based on patient health history, or ignore existing knowledge on drug-drug interactions (DDI) that might lead to adverse outcomes…
▽ More
Recent progress in deep learning is revolutionizing the healthcare domain including providing solutions to medication recommendations, especially recommending medication combination for patients with complex health conditions. Existing approaches either do not customize based on patient health history, or ignore existing knowledge on drug-drug interactions (DDI) that might lead to adverse outcomes. To fill this gap, we propose the Graph Augmented Memory Networks (GAMENet), which integrates the drug-drug interactions knowledge graph by a memory module implemented as a graph convolutional networks, and models longitudinal patient records as the query. It is trained end-to-end to provide safe and personalized recommendation of medication combination. We demonstrate the effectiveness and safety of GAMENet by comparing with several state-of-the-art methods on real EHR data. GAMENet outperformed all baselines in all effectiveness measures, and also achieved 3.60% DDI rate reduction from existing EHR data.
△ Less
Submitted 6 March, 2019; v1 submitted 6 September, 2018;
originally announced September 2018.
-
Origin of $sp$-electron magnetism in Graphitic Carbon Nitride
Authors:
Wei Xu,
** Shang,
Jie-Xiang Yu,
J. G. Che
Abstract:
Based on first principles calculations, this study reveals that magnetism in otherwise non-magnetic materials can originate from the partial occupation of antibonding states. Since the antibonding wavefunctions are spatially antisymmetric, the spin wavefunctions should be symmteric according to the exchange antisymmetric principle of quantum mechanics. We demonstrate that this phenomenon can be ob…
▽ More
Based on first principles calculations, this study reveals that magnetism in otherwise non-magnetic materials can originate from the partial occupation of antibonding states. Since the antibonding wavefunctions are spatially antisymmetric, the spin wavefunctions should be symmteric according to the exchange antisymmetric principle of quantum mechanics. We demonstrate that this phenomenon can be observed in a graphitic carbon nitride material, $g$-C$_4$N$_3$, which can be experimentally synthesized and seen as a honeycomb structure with a vacancy. Three dangling bonds of N atoms pointing to the vacancy site interact with each other to form one bonding and two antibonding states. As the two antibonding states are near the Fermi level, and electrons should partially occupy the antibonding states in spin polarization, this leads to 1~$μ_B$ magnetic moment.
△ Less
Submitted 10 August, 2018;
originally announced August 2018.
-
Coherence Depletion in Quantum Algorithms
Authors:
Ye-Chao Liu,
Jiangwei Shang,
Xiangdong Zhang
Abstract:
Besides the superior efficiency compared to their classical counterparts, quantum algorithms known so far are basically task-dependent, and scarcely any common features are shared between them. In this work, however, we show that the depletion of quantum coherence turns out to be a common phenomenon in these algorithms. For all the quantum algorithms that we investigated including Grover's algorit…
▽ More
Besides the superior efficiency compared to their classical counterparts, quantum algorithms known so far are basically task-dependent, and scarcely any common features are shared between them. In this work, however, we show that the depletion of quantum coherence turns out to be a common phenomenon in these algorithms. For all the quantum algorithms that we investigated including Grover's algorithm, Deutsch-Jozsa algorithm and Shor's algorithm, quantum coherence of the system states reduces to the minimum along with the successful execution of the respective processes. Notably, a similar conclusion cannot be drawn using other quantitative measures such as quantum entanglement. Thus, we expect that coherence depletion as a common feature can be useful for devising new quantum algorithms in the future.
△ Less
Submitted 7 March, 2019; v1 submitted 7 August, 2018;
originally announced August 2018.
-
Evolution of X-Ray Properties of MAXI J1535-571: Analysis with the TCAF Solution
Authors:
J. -R. Shang,
D. Debnath,
D. Chatterjee,
A. Jana,
S. K. Chakrabarti,
H. -K. Chang,
Y. -X. Yap,
C. -L. Chiu
Abstract:
We present spectral and timing properties of the newly discovered X-ray transient source, MAXI J1535-571, which is believed to be a Galactic X-ray binary containing a black hole candidate (BHC) as the primary object. After its discovery on 2017 Sep. 2, it has been monitored regularly in multi-wavelength bands by several satellites. We use archival data of Swift (XRT and BAT) and MAXI (GSC) satelli…
▽ More
We present spectral and timing properties of the newly discovered X-ray transient source, MAXI J1535-571, which is believed to be a Galactic X-ray binary containing a black hole candidate (BHC) as the primary object. After its discovery on 2017 Sep. 2, it has been monitored regularly in multi-wavelength bands by several satellites. We use archival data of Swift (XRT and BAT) and MAXI (GSC) satellite instruments to study accretion flow dynamics of the source during the outburst. During its outburst, the source became very bright in the sky with a maximum observed flux of $5$~Crab in the $2-10$~keV GSC band. Similar to other transient BHCs, it also shows signatures of low frequency quasi-periodic oscillations (QPOs) during the outburst. Spectral data of different instruments are fitted with the transonic flow solution based two-component advective flow (TCAF) model fits file to find the direct accretion flow parameters. Evolution of spectral states and their transitions are understood from the model fitted physical flow parameters and nature of QPOs. We also estimate probable mass of the black hole from our spectral analysis as $7.9-9.9~M_\odot$ or $8.9\pm1.0~M_\odot$.
△ Less
Submitted 7 May, 2019; v1 submitted 19 June, 2018;
originally announced June 2018.
-
Enhanced entanglement criterion via symmetric informationally complete measurements
Authors:
Jiangwei Shang,
Ali Asadian,
Huangjun Zhu,
Otfried Gühne
Abstract:
We show that a special type of measurements, called symmetric informationally complete positive operator-valued measures (SIC POVMs), provide a stronger entanglement detection criterion than the computable cross-norm or realignment criterion based on local orthogonal observables. As an illustration, we demonstrate the enhanced entanglement detection power in simple systems of qubit and qutrit pair…
▽ More
We show that a special type of measurements, called symmetric informationally complete positive operator-valued measures (SIC POVMs), provide a stronger entanglement detection criterion than the computable cross-norm or realignment criterion based on local orthogonal observables. As an illustration, we demonstrate the enhanced entanglement detection power in simple systems of qubit and qutrit pairs. This observation highlights the significance of SIC POVMs for entanglement detection.
△ Less
Submitted 10 August, 2018; v1 submitted 10 May, 2018;
originally announced May 2018.
-
Entity Set Search of Scientific Literature: An Unsupervised Ranking Approach
Authors:
Jiaming Shen,
**feng Xiao,
Xinwei He,
**gbo Shang,
Saurabh Sinha,
Jiawei Han
Abstract:
Literature search is critical for any scientific research. Different from Web or general domain search, a large portion of queries in scientific literature search are entity-set queries, that is, multiple entities of possibly different types. Entity-set queries reflect user's need for finding documents that contain multiple entities and reveal inter-entity relationships and thus pose non-trivial c…
▽ More
Literature search is critical for any scientific research. Different from Web or general domain search, a large portion of queries in scientific literature search are entity-set queries, that is, multiple entities of possibly different types. Entity-set queries reflect user's need for finding documents that contain multiple entities and reveal inter-entity relationships and thus pose non-trivial challenges to existing search algorithms that model each entity separately. However, entity-set queries are usually sparse (i.e., not so repetitive), which makes ineffective many supervised ranking models that rely heavily on associated click history. To address these challenges, we introduce SetRank, an unsupervised ranking framework that models inter-entity relationships and captures entity type information. Furthermore, we develop a novel unsupervised model selection algorithm, based on the technique of weighted rank aggregation, to automatically choose the parameter settings in SetRank without resorting to a labeled validation set. We evaluate our proposed unsupervised approach using datasets from TREC Genomics Tracks and Semantic Scholar's query log. The experiments demonstrate that SetRank significantly outperforms the baseline unsupervised models, especially on entity-set queries, and our model selection algorithm effectively chooses suitable parameter settings.
△ Less
Submitted 29 April, 2018;
originally announced April 2018.
-
Integrating Local Context and Global Cohesiveness for Open Information Extraction
Authors:
Qi Zhu,
Xiang Ren,
**gbo Shang,
Yu Zhang,
Ahmed El-Kishky,
Jiawei Han
Abstract:
Extracting entities and their relations from text is an important task for understanding massive text corpora. Open information extraction (IE) systems mine relation tuples (i.e., entity arguments and a predicate string to describe their relation) from sentences. These relation tuples are not confined to a predefined schema for the relations of interests. However, current Open IE systems focus on…
▽ More
Extracting entities and their relations from text is an important task for understanding massive text corpora. Open information extraction (IE) systems mine relation tuples (i.e., entity arguments and a predicate string to describe their relation) from sentences. These relation tuples are not confined to a predefined schema for the relations of interests. However, current Open IE systems focus on modeling local context information in a sentence to extract relation tuples, while ignoring the fact that global statistics in a large corpus can be collectively leveraged to identify high-quality sentence-level extractions. In this paper, we propose a novel Open IE system, called ReMine, which integrates local context signals and global structural signals in a unified, distant-supervision framework. Leveraging facts from external knowledge bases as supervision, the new system can be applied to many different domains to facilitate sentence-level tuple extractions using corpus-level statistics. Our system operates by solving a joint optimization problem to unify (1) segmenting entity/relation phrases in individual sentences based on local context; and (2) measuring the quality of tuples extracted from individual sentences with a translating-based objective. Learning the two subtasks jointly helps correct errors produced in each subtask so that they can mutually enhance each other. Experiments on two real-world corpora from different domains demonstrate the effectiveness, generality, and robustness of ReMine when compared to state-of-the-art open IE systems.
△ Less
Submitted 1 December, 2018; v1 submitted 26 April, 2018;
originally announced April 2018.
-
Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling
Authors:
Liyuan Liu,
Xiang Ren,
**gbo Shang,
Jian Peng,
Jiawei Han
Abstract:
Many efforts have been made to facilitate natural language processing tasks with pre-trained language models (LMs), and brought significant improvements to various applications. To fully leverage the nearly unlimited corpora and capture linguistic information of multifarious levels, large-size LMs are required; but for a specific task, only parts of these information are useful. Such large-sized L…
▽ More
Many efforts have been made to facilitate natural language processing tasks with pre-trained language models (LMs), and brought significant improvements to various applications. To fully leverage the nearly unlimited corpora and capture linguistic information of multifarious levels, large-size LMs are required; but for a specific task, only parts of these information are useful. Such large-sized LMs, even in the inference stage, may cause heavy computation workloads, making them too time-consuming for large-scale applications. Here we propose to compress bulky LMs while preserving useful information with regard to a specific task. As different layers of the model keep different information, we develop a layer selection method for model pruning using sparsity-inducing regularization. By introducing the dense connectivity, we can detach any layer without affecting others, and stretch shallow and wide LMs to be deep and narrow. In model training, LMs are learned with layer-wise dropouts for better robustness. Experiments on two benchmark datasets demonstrate the effectiveness of our method.
△ Less
Submitted 10 September, 2018; v1 submitted 20 April, 2018;
originally announced April 2018.
-
Bound entangled states fit for robust experimental verification
Authors:
Gael Sentís,
Johannes N. Greiner,
Jiangwei Shang,
Jens Siewert,
Matthias Kleinmann
Abstract:
Preparing and certifying bound entangled states in the laboratory is an intrinsically hard task, due to both the fact that they typically form narrow regions in the state space, and that a certificate requires a tomographic reconstruction of the density matrix. Indeed, the previous experiments that have reported the preparation of a bound entangled state relied on such tomographic reconstruction t…
▽ More
Preparing and certifying bound entangled states in the laboratory is an intrinsically hard task, due to both the fact that they typically form narrow regions in the state space, and that a certificate requires a tomographic reconstruction of the density matrix. Indeed, the previous experiments that have reported the preparation of a bound entangled state relied on such tomographic reconstruction techniques. However, the reliability of these results crucially depends on the extra assumption of an unbiased reconstruction. We propose an alternative method for certifying the bound entangled character of a quantum state that leads to a rigorous claim within a desired statistical significance, while bypassing a full reconstruction of the state. The method is comprised by a search for bound entangled states that are robust for experimental verification, and a hypothesis test tailored for the detection of bound entanglement that is naturally equipped with a measure of statistical significance. We apply our method to families of states of $3\times 3$ and $4\times 4$ systems, and find that the experimental certification of bound entangled states is well within reach.
△ Less
Submitted 14 December, 2018; v1 submitted 20 April, 2018;
originally announced April 2018.
-
Investigating Rumor News Using Agreement-Aware Search
Authors:
**gbo Shang,
Tianhang Sun,
Jiaming Shen,
Xingbang Liu,
Anja Gruenheid,
Flip Korn,
Adam Lelkes,
Cong Yu,
Jiawei Han
Abstract:
Recent years have witnessed a widespread increase of rumor news generated by humans and machines. Therefore, tools for investigating rumor news have become an urgent necessity. One useful function of such tools is to see ways a specific topic or event is represented by presenting different points of view from multiple sources.
In this paper, we propose Maester, a novel agreement-aware search fra…
▽ More
Recent years have witnessed a widespread increase of rumor news generated by humans and machines. Therefore, tools for investigating rumor news have become an urgent necessity. One useful function of such tools is to see ways a specific topic or event is represented by presenting different points of view from multiple sources.
In this paper, we propose Maester, a novel agreement-aware search framework for investigating rumor news. Given an investigative question, Maester will retrieve related articles to that question, assign and display top articles from agree, disagree, and discuss categories to users. Splitting the results into these three categories provides the user a holistic view towards the investigative question. We build Maester based on the following two key observations: (1) relatedness can commonly be determined by keywords and entities occurring in both questions and articles, and (2) the level of agreement between the investigative question and the related news article can often be decided by a few key sentences. Accordingly, we use gradient boosting tree models with keyword/entity matching features for relatedness detection, and leverage recurrent neural network to infer the level of agreement. Our experiments on the Fake News Challenge (FNC) dataset demonstrate up to an order of magnitude improvement of Maester over the original FNC winning solution, for agreement-aware search.
△ Less
Submitted 16 September, 2018; v1 submitted 20 February, 2018;
originally announced February 2018.
-
Contrast Subgraph Mining from Coherent Cores
Authors:
**gbo Shang,
Xiyao Shi,
Meng Jiang,
Liyuan Liu,
Timothy Hanratty,
Jiawei Han
Abstract:
Graph pattern mining methods can extract informative and useful patterns from large-scale graphs and capture underlying principles through the overwhelmed information. Contrast analysis serves as a keystone in various fields and has demonstrated its effectiveness in mining valuable information. However, it has been long overlooked in graph pattern mining. Therefore, in this paper, we introduce the…
▽ More
Graph pattern mining methods can extract informative and useful patterns from large-scale graphs and capture underlying principles through the overwhelmed information. Contrast analysis serves as a keystone in various fields and has demonstrated its effectiveness in mining valuable information. However, it has been long overlooked in graph pattern mining. Therefore, in this paper, we introduce the concept of contrast subgraph, that is, a subset of nodes that have significantly different edges or edge weights in two given graphs of the same node set. The major challenge comes from the gap between the contrast and the informativeness. Because of the widely existing noise edges in real-world graphs, the contrast may lead to subgraphs of pure noise. To avoid such meaningless subgraphs, we leverage the similarity as the cornerstone of the contrast. Specifically, we first identify a coherent core, which is a small subset of nodes with similar edge structures in the two graphs, and then induce contrast subgraphs from the coherent cores. Moreover, we design a general family of coherence and contrast metrics and derive a polynomial-time algorithm to efficiently extract contrast subgraphs. Extensive experiments verify the necessity of introducing coherent cores as well as the effectiveness and efficiency of our algorithm. Real-world applications demonstrate the tremendous potentials of contrast subgraph mining.
△ Less
Submitted 16 February, 2018;
originally announced February 2018.
-
Cross-type Biomedical Named Entity Recognition with Deep Multi-Task Learning
Authors:
Xuan Wang,
Yu Zhang,
Xiang Ren,
Yuhao Zhang,
Marinka Zitnik,
**gbo Shang,
Curtis Langlotz,
Jiawei Han
Abstract:
Motivation: State-of-the-art biomedical named entity recognition (BioNER) systems often require handcrafted features specific to each entity type, such as genes, chemicals and diseases. Although recent studies explored using neural network models for BioNER to free experts from manual feature engineering, the performance remains limited by the available training data for each entity type. Results:…
▽ More
Motivation: State-of-the-art biomedical named entity recognition (BioNER) systems often require handcrafted features specific to each entity type, such as genes, chemicals and diseases. Although recent studies explored using neural network models for BioNER to free experts from manual feature engineering, the performance remains limited by the available training data for each entity type. Results: We propose a multi-task learning framework for BioNER to collectively use the training data of different types of entities and improve the performance on each of them. In experiments on 15 benchmark BioNER datasets, our multi-task model achieves substantially better performance compared with state-of-the-art BioNER systems and baseline neural sequence labeling models. Further analysis shows that the large performance gains come from sharing character- and word-level information among relevant biomedical entities across differently labeled corpora.
△ Less
Submitted 7 October, 2018; v1 submitted 29 January, 2018;
originally announced January 2018.
-
Deterministic realization of collective measurements via photonic quantum walks
Authors:
Zhibo Hou,
Jun-Feng Tang,
Jiangwei Shang,
Huangjun Zhu,
Jian Li,
Yuan Yuan,
Kang-Da Wu,
Guo-Yong Xiang,
Chuan-Feng Li,
Guang-Can Guo
Abstract:
Collective measurements on identically prepared quantum systems can extract more information than local measurements, thereby enhancing information-processing efficiency. Although this nonclassical phenomenon has been known for two decades, it has remained a challenging task to demonstrate the advantage of collective measurements in experiments. Here we introduce a general recipe for performing de…
▽ More
Collective measurements on identically prepared quantum systems can extract more information than local measurements, thereby enhancing information-processing efficiency. Although this nonclassical phenomenon has been known for two decades, it has remained a challenging task to demonstrate the advantage of collective measurements in experiments. Here we introduce a general recipe for performing deterministic collective measurements on two identically prepared qubits based on quantum walks. Using photonic quantum walks, we realize experimentally an optimized collective measurement with fidelity 0.9946 without post selection. As an application, we achieve the highest tomographic efficiency in qubit state tomography to date. Our work offers an effective recipe for beating the precision limit of local measurements in quantum state tomography and metrology. In addition, our study opens an avenue for harvesting the power of collective measurements in quantum information processing and for exploring the intriguing physics behind this power.
△ Less
Submitted 17 April, 2018; v1 submitted 27 October, 2017;
originally announced October 2017.
-
An Attention-based Collaboration Framework for Multi-View Network Representation Learning
Authors:
Meng Qu,
Jian Tang,
**gbo Shang,
Xiang Ren,
Ming Zhang,
Jiawei Han
Abstract:
Learning distributed node representations in networks has been attracting increasing attention recently due to its effectiveness in a variety of applications. Existing approaches usually study networks with a single type of proximity between nodes, which defines a single view of a network. However, in reality there usually exists multiple types of proximities between nodes, yielding networks with…
▽ More
Learning distributed node representations in networks has been attracting increasing attention recently due to its effectiveness in a variety of applications. Existing approaches usually study networks with a single type of proximity between nodes, which defines a single view of a network. However, in reality there usually exists multiple types of proximities between nodes, yielding networks with multiple views. This paper studies learning node representations for networks with multiple views, which aims to infer robust node representations across different views. We propose a multi-view representation learning approach, which promotes the collaboration of different views and lets them vote for the robust representations. During the voting process, an attention mechanism is introduced, which enables each node to focus on the most informative views. Experimental results on real-world networks show that the proposed approach outperforms existing state-of-the-art approaches for network representation learning with a single view and other competitive approaches with multiple views.
△ Less
Submitted 19 September, 2017;
originally announced September 2017.
-
Empower Sequence Labeling with Task-Aware Neural Language Model
Authors:
Liyuan Liu,
**gbo Shang,
Frank F. Xu,
Xiang Ren,
Huan Gui,
Jian Peng,
Jiawei Han
Abstract:
Linguistic sequence labeling is a general modeling approach that encompasses a variety of problems, such as part-of-speech tagging and named entity recognition. Recent advances in neural networks (NNs) make it possible to build reliable models without handcrafted features. However, in many cases, it is hard to obtain sufficient annotations to train these models. In this study, we develop a novel n…
▽ More
Linguistic sequence labeling is a general modeling approach that encompasses a variety of problems, such as part-of-speech tagging and named entity recognition. Recent advances in neural networks (NNs) make it possible to build reliable models without handcrafted features. However, in many cases, it is hard to obtain sufficient annotations to train these models. In this study, we develop a novel neural framework to extract abundant knowledge hidden in raw texts to empower the sequence labeling task. Besides word-level knowledge contained in pre-trained word embeddings, character-aware neural language models are incorporated to extract character-level knowledge. Transfer learning techniques are further adopted to mediate different components and guide the language model towards the key knowledge. Comparing to previous methods, these task-specific knowledge allows us to adopt a more concise model and conduct more efficient training. Different from most transfer learning methods, the proposed framework does not rely on any additional supervision. It extracts knowledge from self-contained order information of training sequences. Extensive experiments on benchmark datasets demonstrate the effectiveness of leveraging character-level knowledge and the efficiency of co-training. For example, on the CoNLL03 NER task, model training completes in about 6 hours on a single GPU, reaching F1 score of 91.71$\pm$0.10 without using any extra annotation.
△ Less
Submitted 23 November, 2017; v1 submitted 12 September, 2017;
originally announced September 2017.
-
Convex optimization over classes of multiparticle entanglement
Authors:
Jiangwei Shang,
Otfried Gühne
Abstract:
A well-known strategy to characterize multiparticle entanglement utilizes the notion of stochastic local operations and classical communication (SLOCC), but characterizing the resulting entanglement classes is difficult. Given a multiparticle quantum state, we first show that Gilbert's algorithm can be adapted to prove separability or membership in a certain entanglement class. We then present two…
▽ More
A well-known strategy to characterize multiparticle entanglement utilizes the notion of stochastic local operations and classical communication (SLOCC), but characterizing the resulting entanglement classes is difficult. Given a multiparticle quantum state, we first show that Gilbert's algorithm can be adapted to prove separability or membership in a certain entanglement class. We then present two algorithms for convex optimization over SLOCC classes. The first algorithm uses a simple gradient approach, while the other one employs the accelerated projected-gradient method. For demonstration, the algorithms are applied to the likelihood-ratio test using experimental data on bound entanglement of a noisy four-photon Smolin state [Phys. Rev. Lett. 105, 130501 (2010)].
△ Less
Submitted 1 February, 2018; v1 submitted 10 July, 2017;
originally announced July 2017.
-
Transformation thermal convection: Cloaking, concentrating, and camouflage
Authors:
Gaole Dai,
** Shang,
Ji** Huang
Abstract:
Heat can generally transfer via thermal conduction, thermal radiation, and thermal convection. All the existing theories of transformation thermotics and optics can treat thermal conduction and thermal radiation, respectively. Unfortunately, thermal convection has never been touched in transformation theories due to the lack of a suitable theory, thus limiting applications associated with heat tra…
▽ More
Heat can generally transfer via thermal conduction, thermal radiation, and thermal convection. All the existing theories of transformation thermotics and optics can treat thermal conduction and thermal radiation, respectively. Unfortunately, thermal convection has never been touched in transformation theories due to the lack of a suitable theory, thus limiting applications associated with heat transfer through fluids (liquid or gas). Here, we develop, for the first time, a general theory of transformation thermal convection by considering the convection-diffusion equation, the Navier-Stokes equation, and the Darcy law. By introducing porous media, we get a set of coupled equations kee** their forms under coordinate transformation. As model applications, the theory helps to show the effects of cloaking, concentrating, and camouflage. Our finite element simulations confirm the theoretical findings. This work offers a general transformation theory for thermal convection, thus revealing some novel behaviors of thermal convection; it not only provides new hints on how to control heat transfer by combining thermal conduction, thermal radiation, and thermal convection, but also benefits the study of mass diffusion and other related fields that contain a set of equations and need to transform velocities at the same time.
△ Less
Submitted 31 May, 2017;
originally announced May 2017.
-
Compositional Human Pose Regression
Authors:
Xiao Sun,
Jiaxiang Shang,
Shuang Liang,
Yichen Wei
Abstract:
Regression based methods are not performing as well as detection based methods for human pose estimation. A central problem is that the structural information in the pose is not well exploited in the previous regression methods. In this work, we propose a structure-aware regression approach. It adopts a reparameterized pose representation using bones instead of joints. It exploits the joint connec…
▽ More
Regression based methods are not performing as well as detection based methods for human pose estimation. A central problem is that the structural information in the pose is not well exploited in the previous regression methods. In this work, we propose a structure-aware regression approach. It adopts a reparameterized pose representation using bones instead of joints. It exploits the joint connection structure to define a compositional loss function that encodes the long range interactions in the pose. It is simple, effective, and general for both 2D and 3D pose estimation in a unified setting. Comprehensive evaluation validates the effectiveness of our approach. It significantly advances the state-of-the-art on Human3.6M and is competitive with state-of-the-art results on MPII.
△ Less
Submitted 1 August, 2017; v1 submitted 1 April, 2017;
originally announced April 2017.