Search | arXiv e-print repository

arXiv:2005.01106 [pdf, other]

Universally Optimal Verification of Entangled States with Nondemolition Measurements

Authors: Ye-Chao Liu, Jiangwei Shang, Rui Han, Xiangdong Zhang

Abstract: The efficient and reliable characterization of quantum states plays a vital role in most, if not all, quantum information processing tasks. In this work, we present a universally optimal protocol for verifying entangled states by employing the so-called quantum nondemolition measurements, such that the verification efficiency is equivalent to that of the optimal global strategy. Unlike the standard verification strategies, which are probabilistic, our protocol is constructed sequentially and is thus more favorable for experimental realizations. In addition, the target states are preserved in the protocol after each measurement, so they can be reused in any subsequent tasks. We demonstrate the power of our protocol for the optimal verification of Bell states, arbitrary two-qubit pure states, and stabilizer states. We also prove that our protocol is able to perform tasks including fidelity estimation and state preparation.

Submitted 4 March, 2021; v1 submitted 3 May, 2020; originally announced May 2020.

Comments: 6+6 pages, 1 figure, close to the published version

Journal ref: Phys. Rev. Lett. 126, 090504 (2021)

arXiv:2004.14555 [pdf, other]

User-Guided Aspect Classification for Domain-Specific Texts

Authors: Peiran Li, Fang Guo, Jingbo Shang

Abstract: Aspect classification, identifying aspects of text segments, facilitates numerous applications, such as sentiment analysis and review summarization. To alleviate the human effort of annotating massive texts, in this paper, we study the problem of classifying aspects based on only a few user-provided seed words for pre-defined aspects. The major challenge lies in how to handle the noisy misc aspect, which is designed for texts without any pre-defined aspects. Even domain experts have difficulty nominating seed words for the misc aspect, which makes existing seed-driven text classification methods inapplicable. We propose a novel framework, ARYA, which enables mutual enhancements between pre-defined aspects and the misc aspect via iterative classifier training and seed updating. Specifically, it trains a classifier for pre-defined aspects and then leverages it to induce the supervision for the misc aspect. The prediction results of the misc aspect are later utilized to filter out noisy seed words for pre-defined aspects. Experiments in two domains demonstrate the superior performance of our proposed framework, as well as the necessity and importance of properly modeling the misc aspect.

Submitted 29 April, 2020; originally announced April 2020.

arXiv:2004.13897 [pdf, other]

Empower Entity Set Expansion via Language Model Probing

Authors: Yunyi Zhang, Jiaming Shen, Jingbo Shang, Jiawei Han

Abstract: Entity set expansion, aiming at expanding a small seed entity set with new entities belonging to the same semantic class, is a critical task that benefits many downstream NLP and IR applications, such as question answering, query understanding, and taxonomy construction. Existing set expansion methods bootstrap the seed entity set by adaptively selecting context features and extracting new entities. A key challenge for entity set expansion is to avoid selecting ambiguous context features which will shift the class semantics and lead to accumulative errors in later iterations. In this study, we propose a novel iterative set expansion framework that leverages automatically generated class names to address the semantic drift issue. In each iteration, we select one positive and several negative class names by probing a pre-trained language model, and further score each candidate entity based on selected class names. Experiments on two datasets show that our framework generates high-quality class names and outperforms previous state-of-the-art methods significantly.

Submitted 29 June, 2020; v1 submitted 28 April, 2020; originally announced April 2020.

Comments: ACL 2020

arXiv:2004.07684 [pdf, other]

Joint Semantic Segmentation and Boundary Detection using Iterative Pyramid Contexts

Authors: Mingmin Zhen, Jinglu Wang, Lei Zhou, Shiwei Li, Tianwei Shen, Jiaxiang Shang, Tian Fang, Quan Long

Abstract: In this paper, we present a joint multi-task learning framework for semantic segmentation and boundary detection. The critical component in the framework is the iterative pyramid context module (PCM), which couples two tasks and stores the shared latent semantics to interact between the two tasks. For semantic boundary detection, we propose the novel spatial gradient fusion to suppress nonsemantic edges. As semantic boundary detection is the dual task of semantic segmentation, we introduce a loss function with boundary consistency constraint to improve the boundary pixel accuracy for semantic segmentation. Our extensive experiments demonstrate superior performance over state-of-the-art works, not only in semantic segmentation but also in semantic boundary detection. In particular, a mean IoU score of 81.8% on Cityscapes test set is achieved without using coarse data or any external data for semantic segmentation. For semantic boundary detection, we improve over previous state-of-the-art works by 9.9% in terms of AP and 6.8% in terms of MF(ODS).

Submitted 16 April, 2020; originally announced April 2020.

arXiv:2004.06873 [pdf, other]

doi 10.1103/PhysRevA.103.022601

Verification of phased Dicke states

Authors: Zihao Li, Yun-Guang Han, Hao-Feng Sun, Jiangwei Shang, Huangjun Zhu

Abstract: Dicke states are typical examples of quantum states with genuine multipartite entanglement. They are valuable resources in many quantum information processing tasks, including multiparty quantum communication and quantum metrology. Phased Dicke states are a generalization of Dicke states and include antisymmetric basis states as a special example. These states are useful in atomic and molecular physics besides quantum information processing. Here we propose practical and efficient protocols based on adaptive local projective measurements for verifying all phased Dicke states, including $W$ states and qudit Dicke states. To verify any $n$-partite phased Dicke state within infidelity $\epsilon$ and significance level $\delta$, the number of tests required is only $O(n\epsilon^{-1}\ln\delta^{-1})$, which is linear in $n$ and is exponentially more efficient than traditional tomographic approaches. In the case of $W$ states, the number of tests can be further reduced to $O(\sqrt{n}\,\epsilon^{-1}\ln\delta^{-1})$. Moreover, we construct an optimal protocol for any antisymmetric basis state; the number of tests required decreases (rather than increases) monotonically with $n$. This is the only optimal protocol known for multipartite nonstabilizer states.

Submitted 10 February, 2021; v1 submitted 15 April, 2020; originally announced April 2020.

Comments: 14+12 pages, 4 figures, and 1 table; published in PRA

Journal ref: Phys. Rev. A 103, 022601 (2021)

arXiv:2004.06253 [pdf, ps, other]

doi 10.1007/s10509-021-03965-z

Investigation and Application of Fitting Models for Centering Algorithms in Astrometry

Authors: F. R. Lin, Q. Y. Peng, Z. J. Zheng, B. F. Guo, Y. J. Shang

Abstract: To determine the precise positions of stars in CCD frames, various centering algorithms have been proposed for astrometry. The effective point spread function (ePSF) and the Gaussian centering algorithms are two representative centering algorithms. This paper investigates and compares in detail these two centering algorithms in performing data reduction. Specifically, synthetic star images in different conditions (i.e. profiles, fluxes, backgrounds and full widths at half maximum) are generated and processed. We find that the difference in precision between the two algorithms is related to the profiles of the star images. Therefore, the precision comparison results using an ideal Gaussian-profile star image cannot be extended to other more specific experimental scenarios. Based on the simulation results, the most appropriate algorithm can be selected according to the image characteristics of observations, and the loss of precision of other algorithms can be estimated. The conclusions are verified using observations captured by the 1-m and 2.4-m telescopes at Yunnan Observatory.

Submitted 24 December, 2021; v1 submitted 13 April, 2020; originally announced April 2020.

arXiv:2002.08242 [pdf, other]

AI Online Filters to Real World Image Recognition

Authors: Hai Xiao, Jin Shang, Mengyuan Huang

Abstract: Deep artificial neural networks trained with labeled data sets are widely used in numerous vision and robotics applications today. In terms of AI, these are called reflex models, referring to the fact that they do not self-evolve or actively adapt to environmental changes. As demand for intelligent robot control expands to many high level tasks, reinforcement learning and state based models play an increasingly important role. Herein, in the computer vision and robotics domain, we study a novel approach to add reinforcement controls onto the image recognition reflex models to attain better overall performance, specifically over a wider environment range beyond what is expected of the task reflex models. Following a common infrastructure with environment sensing and AI based modeling of self-adaptive agents, we implement multiple types of AI control agents. To this end, we provide comparative results of these agents against a baseline, and an insightful analysis of their benefit in improving overall image recognition performance in the real world.

Submitted 11 February, 2020; originally announced February 2020.

arXiv:2002.07364 [pdf, other]

doi 10.1103/PhysRevLett.124.060502

Experimental optimal orienteering via parallel and antiparallel spins

Authors: Jun-Feng Tang, Zhibo Hou, Jiangwei Shang, Huangjun Zhu, Guo-Yong Xiang, Chuan-Feng Li, Guang-Can Guo

Abstract: Antiparallel spins are superior in orienteering to parallel spins. This intriguing phenomenon is tied to entanglement associated with quantum measurements rather than quantum states. Using photonic systems, we experimentally realize the optimal orienteering protocols based on parallel spins and antiparallel spins, respectively. The optimal entangling measurements for decoding the direction information from parallel spins and antiparallel spins are realized using photonic quantum walks, which is a useful idea that is of wide interest in quantum information processing and foundational studies. Our experiments clearly demonstrate the advantage of antiparallel spins over parallel spins in orienteering. In addition, entangling measurements can extract more information than local measurements even if no entanglement is present in the quantum states.

Submitted 17 February, 2020; originally announced February 2020.

Journal ref: Physical Review Letters 124, 060502 (2020)

arXiv:2001.01550 [pdf, other]

Opportunities and Challenges of Deep Learning Methods for Electrocardiogram Data: A Systematic Review

Authors: Shenda Hong, Yuxi Zhou, Junyuan Shang, Cao Xiao, Jimeng Sun

Abstract: Background: The electrocardiogram (ECG) is one of the most commonly used diagnostic tools in medicine and healthcare. Deep learning methods have achieved promising results on predictive healthcare tasks using ECG signals. Objective: This paper presents a systematic review of deep learning methods for ECG data from both modeling and application perspectives. Methods: We extracted papers that applied deep learning (deep neural network) models to ECG data that were published between Jan. 1st of 2010 and Feb. 29th of 2020 from Google Scholar, PubMed, and the DBLP. We then analyzed each article according to three factors: tasks, models, and data. Finally, we discuss open challenges and unsolved problems in this area. Results: The total number of papers extracted was 191. Among these papers, 108 were published after 2019. Different deep learning architectures have been used in various ECG analytics tasks, such as disease detection/classification, annotation/localization, sleep staging, biometric human identification, and denoising. Conclusion: The number of works on deep learning for ECG data has grown explosively in recent years. Such works have achieved accuracy comparable to that of traditional feature-based approaches and ensembles of multiple approaches can achieve even better results. Specifically, we found that a hybrid architecture of a convolutional neural network and recurrent neural network ensemble using expert features yields the best results. However, there are some new challenges and problems related to interpretability, scalability, and efficiency that must be addressed. Furthermore, it is also worth investigating new applications from the perspectives of datasets and methods. Significance: This paper summarizes existing deep learning research using ECG data from multiple perspectives and highlights existing challenges and problems to identify potential future research directions.

Submitted 30 April, 2020; v1 submitted 27 December, 2019; originally announced January 2020.

Comments: Accepted by Computers in Biology and Medicine

arXiv:1912.01312 [pdf]

doi 10.1039/D0TA00854K

Reversible Gas Sensing by Ferroelectric Switch and 2D Molecule Multiferroics in In2Se3 Monolayer

Authors: Xiao Tang, Jing Shang, Yuantong Gu, Aijun Du, Liangzhi Kou

Abstract: Two-dimensional ferroelectrics are important quantum materials which have found novel application in nonvolatile memories; however, the effects of reversible polarization on chemical reactions and interaction with environments are rarely studied despite their importance. Here, based on first-principles calculations, we found distinct gas adsorption behaviors on the surfaces of a ferroelectric In2Se3 layer, and reversible gas capture and release controlled by ferroelectric switching. We attribute these novel phenomena to the synergistic effect of the different electrostatic potential and electron transfer induced by band alignments between frontier molecular orbitals of the gas and band edge states of the substrate. Excitingly, the adsorption of paramagnetic gas molecules such as NO and NO2 can induce surface magnetism, which is also sensitive to the ferroelectric polarization direction of In2Se3, indicating the application of In2Se3 as threshold magnetic sensors or switches. Furthermore, the results suggest that two NO molecules prefer to couple ferromagnetically with each other, with a polarization-dependent Curie temperature that can reach up to 50 K, leading to the long-sought 2D molecule multiferroics. The ferroelectrically controllable adsorption behavior and molecule multiferroic feature will find extensive application in gas capture, selective catalytic reduction and spintronic devices.

Submitted 3 December, 2019; originally announced December 2019.

Journal ref: J. Mater. Chem. A, 2020

arXiv:1911.12088 [pdf]

Multiferroic Decorated Fe2O3 Monolayer Predicted from First Principles

Authors: Jing Shang, Chun Li, Aijun Du, Ting Liao, Yuantong Gu, Yandong Ma, Liangzhi Kou, Changfeng Chen

Abstract: Two-dimensional (2D) multiferroics exhibit cross-control capacity between magnetic and electric responses in reduced spatial domain, making them well suited for next-generation nanoscale devices; however, progress has been slow in developing materials with required characteristic properties. Here we identify by first-principles calculations robust 2D multiferroic behaviors in decorated Fe2O3 monolayer, showcasing N@Fe2O3 as a prototypical case, where ferroelectricity and ferromagnetism stem from the same origin, namely Fe d-orbit splitting induced by the Jahn-Teller distortion and associated crystal field changes. The resulting ferromagnetic and ferroelectric polarization can be effectively reversed and regulated by applied electric field or strain, offering efficient functionality. These findings establish strong materials phenomena and elucidate underlying physics mechanism in a family of truly 2D multiferroics that are highly promising for advanced device applications.

Submitted 13 May, 2020; v1 submitted 27 November, 2019; originally announced November 2019.

Comments: 4 figures

arXiv:1910.13730 [pdf, other]

doi 10.1103/PhysRevA.101.042315

Efficient verification of quantum processes

Authors: Ye-Chao Liu, Jiangwei Shang, Xiao-Dong Yu, Xiangdong Zhang

Abstract: Quantum processes, such as quantum circuits, quantum memories, and quantum channels, are essential ingredients in almost all quantum information processing tasks. However, the characterization of these processes remains a daunting task due to the exponentially increasing amount of resources required by traditional methods. Here, by first proposing the concept of quantum process verification, we establish two efficient and practical protocols for verifying quantum processes which can provide an exponential improvement over the standard quantum process tomography and a quadratic improvement over the method of direct fidelity estimation. The efficacy of our protocols is illustrated with the verification of various quantum gates as well as the processes of well-known quantum circuits. Moreover, our protocols are readily applicable with current experimental techniques since only local measurements are required. In addition, we show that our protocols for verifying quantum processes can be easily adapted to verify quantum measurements.

Submitted 15 April, 2020; v1 submitted 30 October, 2019; originally announced October 2019.

Comments: 8 pages, 1 figure

Journal ref: Phys. Rev. A 101, 042315 (2020)

arXiv:1910.08192 [pdf, other]

SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble

Authors: Jiaming Shen, Zeqiu Wu, Dongming Lei, Jingbo Shang, Xiang Ren, Jiawei Han

Abstract: Corpus-based set expansion (i.e., finding the "complete" set of entities belonging to the same semantic class, based on a given corpus and a tiny set of seeds) is a critical task in knowledge discovery. It may facilitate numerous downstream applications, such as information extraction, taxonomy induction, question answering, and web search. To discover new entities in an expanded set, previous approaches either make one-time entity ranking based on distributional similarity, or resort to iterative pattern-based bootstrapping. The core challenge for these methods is how to deal with noisy context features derived from free-text corpora, which may lead to entity intrusion and semantic drifting. In this study, we propose a novel framework, SetExpan, which tackles this problem with two techniques: (1) a context feature selection method that selects clean context features for calculating entity-entity distributional similarity, and (2) a ranking-based unsupervised ensemble method for expanding entity set based on denoised context features. Experiments on three datasets show that SetExpan is robust and outperforms previous state-of-the-art methods in terms of mean average precision.

Submitted 17 October, 2019; originally announced October 2019.

Comments: ECMLPKDD 2017 accepted

arXiv:1910.04345 [pdf, other]

FUSE: Multi-Faceted Set Expansion by Coherent Clustering of Skip-grams

Authors: Wanzheng Zhu, Hongyu Gong, Jiaming Shen, Chao Zhang, Jingbo Shang, Suma Bhat, Jiawei Han

Abstract: Set expansion aims to expand a small set of seed entities into a complete set of relevant entities. Most existing approaches assume the input seed set is unambiguous and completely ignore the multi-faceted semantics of seed entities. As a result, given the seed set {"Canon", "Sony", "Nikon"}, previous models return one mixed set of entities that are either Camera Brands or Japanese Companies. In this paper, we study the task of multi-faceted set expansion, which aims to capture all semantic facets in the seed set and return multiple sets of entities, one for each semantic facet. We propose an unsupervised framework, FUSE, which consists of three major components: (1) a facet discovery module, which identifies all semantic facets of each seed entity by extracting and clustering its skip-grams; (2) a facet fusion module, which discovers shared semantic facets of the entire seed set by an optimization formulation; and (3) an entity expansion module, which expands each semantic facet by utilizing a masked language model with pre-trained BERT models. Extensive experiments demonstrate that FUSE can accurately identify multiple semantic facets of the seed set and generate quality entities for each facet.

Submitted 18 June, 2020; v1 submitted 9 October, 2019; originally announced October 2019.

arXiv:1910.02107 [pdf, other]

GENN: Predicting Correlated Drug-drug Interactions with Graph Energy Neural Networks

Authors: Tengfei Ma, Junyuan Shang, Cao Xiao, Jimeng Sun

Abstract: Gaining more comprehensive knowledge about drug-drug interactions (DDIs) is one of the most important tasks in drug development and medical practice. Recently, graph neural networks have achieved great success in this task by modeling drugs as nodes and drug-drug interactions as links and casting DDI predictions as link prediction problems. However, correlations between link labels (e.g., DDI types) were rarely considered in existing works. We propose the graph energy neural network (GENN) to explicitly model link type correlations. We formulate the DDI prediction task as a structure prediction problem and introduce a new energy-based model where the energy function is defined by graph neural networks. Experiments on two real-world DDI datasets demonstrated that GENN is superior to many baselines without consideration of link type correlations and achieved $13.77\%$ and $5.01\%$ PR-AUC improvement on the two datasets, respectively. We also present a case study in which GENN can better capture meaningful DDI correlations compared with baseline models.

Submitted 7 October, 2019; v1 submitted 4 October, 2019; originally announced October 2019.

arXiv:1910.01451 [pdf, other]

CubeNet: Multi-Facet Hierarchical Heterogeneous Network Construction, Analysis, and Mining

Authors: Carl Yang, Dai Teng, Siyang Liu, Sayantani Basu, Jieyu Zhang, Jiaming Shen, Chao Zhang, Jingbo Shang, Lance Kaplan, Timothy Harratty, Jiawei Han

Abstract: Due to the ever-increasing size of data, construction, analysis and mining of universal massive networks are becoming prohibitive and meaningless. In this work, we outline a novel framework called CubeNet, which systematically constructs and organizes real-world networks into different but correlated semantic cells, to support various downstream network analysis and mining tasks with better flexibility, deeper insights and higher efficiency. In particular, we promote our recent research on text and network mining with novel concepts and techniques to (1) construct four real-world large-scale multi-facet hierarchical heterogeneous networks; (2) enable insightful OLAP-style network analysis; (3) facilitate localized and contextual network mining. Although some functions have been covered individually in our previous work, a systematic and efficient realization of an organic system has not been studied, while some functions are still our on-going research tasks. By integrating them, CubeNet may not only showcase the utility of our recent research, but also inspire and stimulate future research on effective, insightful and scalable knowledge discovery under this novel framework.

Submitted 28 September, 2019; originally announced October 2019.

Comments: Published at KDD 2019 as a demo paper

arXiv:1909.01441 [pdf, other]

CrossWeigh: Training Named Entity Tagger from Imperfect Annotations

Authors: Zihan Wang, Jingbo Shang, Liyuan Liu, Lihao Lu, Jiacheng Liu, Jiawei Han

Abstract: Everyone makes mistakes. So do human annotators when curating labels for named entity recognition (NER). Such label mistakes might hurt model training and interfere with model comparison. In this study, we dive deep into one of the widely-adopted NER benchmark datasets, CoNLL03 NER. We are able to identify label mistakes in about 5.38% of test sentences, which is a significant ratio considering that the state-of-the-art test F1 score is already around 93%. Therefore, we manually correct these label mistakes and form a cleaner test set. Our re-evaluation of popular models on this corrected test set leads to more accurate assessments, compared to those on the original test set. More importantly, we propose a simple yet effective framework, CrossWeigh, to handle label mistakes during NER model training. Specifically, it partitions the training data into several folds and trains independent NER models to identify potential mistakes in each fold. Then it adjusts the weights of training data accordingly to train the final NER model. Extensive experiments demonstrate significant improvements of plugging various NER models into our proposed framework on three datasets. All implementations and the corrected test set are available at our Github repo: https://github.com/ZihanWangKi/CrossWeigh.

Submitted 3 September, 2019; originally announced September 2019.

arXiv:1908.06857 [pdf, other]

K-margin-based Residual-Convolution-Recurrent Neural Network for Atrial Fibrillation Detection

Authors: Yuxi Zhou, Shenda Hong, Junyuan Shang, Meng Wu, Qingyun Wang, Hongyan Li, Junqing Xie

Abstract: Atrial Fibrillation (AF) is an abnormal heart rhythm which can trigger cardiac arrest and sudden death. Nevertheless, its interpretation is mostly done by medical experts due to high error rates of computerized interpretation. One study found that only about 66% of AF were correctly recognized from noisy ECGs. This is in part due to insufficient training data, class skewness, as well as semantical ambiguities caused by noisy segments in an ECG record. In this paper, we propose a K-margin-based Residual-Convolution-Recurrent neural network (K-margin-based RCR-net) for AF detection from noisy ECGs. In detail, a skewness-driven dynamic augmentation method is employed to handle the problems of data inadequacy and class imbalance. A novel RCR-net is proposed to automatically extract both long-term rhythm-level and local heartbeat-level characters. Finally, we present a K-margin-based diagnosis model to automatically focus on the most important parts of an ECG record and handle noise by naturally exploiting expected consistency among the segments associated for each record. The experimental results demonstrate that the proposed method with 0.8125 F1NAOP score outperforms all state-of-the-art deep learning methods for AF detection task by 6.8%.

Submitted 9 August, 2019; originally announced August 2019.

Comments: IJCAI 2019

arXiv:1908.05344 [pdf, other]

Raw-to-End Name Entity Recognition in Social Media

Authors: Liyuan Liu, Zihan Wang, Jingbo Shang, Dandong Yin, Heng Ji, Xiang Ren, Shaowen Wang, Jiawei Han

Abstract: Taking word sequences as the input, typical named entity recognition (NER) models neglect errors from pre-processing (e.g., tokenization). However, these errors can influence the model performance greatly, especially for noisy texts like tweets. Here, we introduce Neural-Char-CRF, a raw-to-end framework that is more robust to pre-processing errors. It takes raw character sequences as inputs and makes end-to-end predictions. Word embedding and contextualized representation models are further tailored to capture textual signals for each character instead of each word. Our model neither requires the conversion from character sequences to word sequences, nor assumes tokenizer can correctly detect all word boundaries. Moreover, we observe our model performance remains unchanged after replacing tokenization with string matching, which demonstrates its potential to be tokenization-free. Extensive experimental results on two public datasets demonstrate the superiority of our proposed method over the state of the art. The implementations and datasets are made available at: https://github.com/LiyuanLucasLiu/Raw-to-End.

Submitted 14 August, 2019; originally announced August 2019.

arXiv:1906.11036 [pdf, other]

doi 10.1103/PhysRevE.101.023306

Discrete unified gas kinetic scheme for nonlinear convection-diffusion equations

Authors: Jinlong Shang, Zhenhua Chai, Huili Wang, Baochang Shi

Abstract: In this paper, we develop a discrete unified gas kinetic scheme (DUGKS) for general nonlinear convection-diffusion equation (NCDE), and show that the NCDE can be recovered correctly from the present model through the Chapman-Enskog analysis. We then test the present DUGKS through some classic convection-diffusion equations, and find that the numerical results are in good agreement with analytical solutions and the DUGKS model has a second-order convergence rate. Finally, as a finite-volume method, DUGKS can also adopt the non-uniform mesh. Besides, we performed some comparisons among the DUGKS, finite-volume lattice Boltzmann model (FV-LBM), single-relaxation-time lattice Boltzmann model (SLBM) and multiple-relaxation-time lattice Boltzmann model (MRT-LBM). The results show that the DUGKS model is more accurate than FV-LBM, more stable than SLBM, and has almost the same accuracy as the MRT-LBM. Besides, the use of a non-uniform mesh may make the DUGKS model more flexible.

Submitted 16 June, 2019; originally announced June 2019.

Journal ref: Phys. Rev. E 101, 023306 (2020)

arXiv:1906.00346 [pdf, other]

Pre-training of Graph Augmented Transformers for Medication Recommendation

Authors: Junyuan Shang, Tengfei Ma, Cao Xiao, Jimeng Sun

Abstract: Medication recommendation is an important healthcare application. It is commonly formulated as a temporal prediction task. Hence, most existing works only utilize longitudinal electronic health records (EHRs) from a small number of patients with multiple visits ignoring a large number of patients with a single visit (selection bias). Moreover, important hierarchical knowledge such as diagnosis hierarchy is not leveraged in the representation learning process. To address these challenges, we propose G-BERT, a new model to combine the power of Graph Neural Networks (GNNs) and BERT (Bidirectional Encoder Representations from Transformers) for medical code representation and medication recommendation. We use GNNs to represent the internal hierarchical structures of medical codes. Then we integrate the GNN representation into a transformer-based visit encoder and pre-train it on EHR data from patients only with a single visit. The pre-trained visit encoder and representation are then fine-tuned for downstream predictive tasks on longitudinal EHRs from patients with multiple visits. G-BERT is the first to bring the language model pre-training schema into the healthcare domain and it achieved state-of-the-art performance on the medication recommendation task.

Submitted 26 November, 2019; v1 submitted 2 June, 2019; originally announced June 2019.

Comments: IJCAI2019; fix some undefined problems; provide more intuitive figures

arXiv:1905.07062 [pdf, other]

Robust Principal Component Analysis for Modal Decomposition of Corrupt Fluid Flows

Authors: Isabel Scherl, Benjamin Strom, Jessica K. Shang, Owen Williams, Brian L. Polagye, Steven L. Brunton

Abstract: Modal analysis techniques are used to identify patterns and develop reduced-order models in a variety of fluid applications. However, experimentally acquired flow fields may be corrupted with incorrect and missing entries, which may degrade modal decomposition. Here we use robust principal component analysis (RPCA) to improve the quality of flow field data by leveraging global coherent structures to identify and replace spurious data points. RPCA is a robust variant of principal component analysis (PCA), also known as proper orthogonal decomposition (POD) in fluids, that decomposes a data matrix into the sum of a low-rank matrix containing coherent structures and a sparse matrix of outliers and corrupt entries. We apply RPCA filtering to a range of fluid simulations and experiments of varying complexities and assess the accuracy of low-rank structure recovery. First, we analyze direct numerical simulations of flow past a circular cylinder at Reynolds number 100 with artificial outliers, alongside similar PIV measurements at Reynolds number 413. Next, we apply RPCA filtering to a turbulent channel flow simulation from the Johns Hopkins Turbulence database, demonstrating that dominant coherent structures are preserved in the low-rank matrix. Finally, we investigate PIV measurements behind a two-bladed cross-flow turbine that exhibits both broadband and coherent phenomena. In all cases, we find that RPCA filtering extracts dominant coherent structures and identifies and fills in incorrect or missing measurements. The performance is particularly striking when flow fields are analyzed using dynamic mode decomposition, which is sensitive to noise and outliers.

Submitted 13 December, 2019; v1 submitted 16 May, 2019; originally announced May 2019.

arXiv:1904.11202 [pdf, other]

doi 10.1103/PhysRevA.100.022333

Proper error bars for self-calibrating quantum tomography

Authors: Jun Yan Sim, Jiangwei Shang, Hui Khoon Ng, Berthold-Georg Englert

Abstract: Self-calibrating quantum state tomography aims at reconstructing the unknown quantum state and certain properties of the measurement devices from the same data. Since the estimates of the state and device parameters come from the same data, one should employ a joint estimation scheme, including the construction and reporting of joint state-device error regions to quantify uncertainty. We explain how to do this naturally within the framework of optimal error regions. As an illustrative example, we apply our procedure to the double-crosshair measurement of the BB84 scenario in quantum cryptography and so reconstruct the state and estimate the detection efficiencies simultaneously and reliably. We also discuss the practical situation of a satellite-based quantum key distribution scheme, for which self-calibration and proper treatment of the data are necessities.

Submitted 6 September, 2019; v1 submitted 25 April, 2019; originally announced April 2019.

Comments: 10 pages, 7 figures, 2 tables

Journal ref: Phys. Rev. A 100, 022333 (2019)

arXiv:1904.09579 [pdf, ps, other]

Computer-aided study of double extensions of restricted Lie superalgebras preserving the non-degenerate closed 2-forms in characteristic 2

Authors: Sofiane Bouarroudj, Dimitry Leites, Jin Shang

Abstract: A Lie (super)algebra with a non-degenerate invariant symmetric bilinear form $B$ is called a nis-(super)algebra. The double extension $\mathfrak{g}$ of a nis-(super)algebra $\mathfrak{a}$ is the result of simultaneously adding to $\mathfrak{a}$ a central element and a derivation so that $\mathfrak{g}$ is a nis-algebra. Loop algebras with values in simple complex Lie algebras are the best known among the Lie (super)algebras suitable to be doubly extended. In characteristic 2 the notion of double extension acquires specific features. Restricted Lie (super)algebras are among the most interesting modular Lie superalgebras. In characteristic 2, using Grozman's Mathematica-based package SuperLie, we list double extensions of restricted Lie superalgebras preserving the non-degenerate closed 2-forms with constant coefficients. The results are proved for the number of indeterminates ranging from 4 to 7 - sufficient to conjecture the pattern for larger numbers. Considering multigradings allowed us to accelerate computations up to 100 times.

Submitted 21 April, 2019; originally announced April 2019.

Comments: 18 pages

arXiv:1904.09578 [pdf, ps, other]

The roots of exceptional modular Lie superalgebras with Cartan matrix

Authors: Sofiane Bouarroudj, Dimitry Leites, Alexander Lozhechnyk, Jin Shang

Abstract: For each of the exceptional Lie superalgebras with indecomposable Cartan matrix, we give the explicit list of its roots and the corresponding Chevalley basis for one of the inequivalent Cartan matrices, the one corresponding to the greatest number of mutually orthogonal isotropic odd simple roots. Our main tools: Grozman's Mathematica-based code SuperLie, and Python.

Submitted 21 April, 2019; originally announced April 2019.

Comments: 35 pages

arXiv:1904.01979 [pdf, other]

doi 10.1103/PhysRevApplied.12.044020

Efficient verification of Dicke states

Authors: Ye-Chao Liu, Xiao-Dong Yu, Jiangwei Shang, Huangjun Zhu, Xiangdong Zhang

Abstract: Among various multipartite entangled states, Dicke states stand out because their entanglement is maximally persistent and robust under particle losses. Although much attention has been attracted for their potential applications in quantum information processing and foundational studies, the characterization of Dicke states remains as a challenging task in experiments. Here, we propose efficient and practical protocols for verifying arbitrary $n$-qubit Dicke states in both adaptive and nonadaptive ways. Our protocols require only two distinct settings based on Pauli measurements besides permutations of the qubits. To achieve infidelity $ε$ and confidence level $1-δ$, the total number of tests required is only $O(nε^{-1}\lnδ^{-1})$. This performance is exponentially more efficient than all previous protocols based on local measurements, including quantum state tomography and direct fidelity estimation, and is comparable to the best global strategy. Our protocols are readily applicable with current experimental techniques and are able to verify Dicke states of hundreds of qubits.

Submitted 10 October, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

Comments: 11 pages, 4 figures, 1 table, close to the published version

Journal ref: Phys. Rev. Applied 12, 044020 (2019)

arXiv:1902.07446 [pdf]

doi 10.1021/acs.nanolett.9b00553

Direct photoluminescence probing of ferromagnetism in monolayer two-dimensional CrBr3

Authors: Zhaowei Zhang, Jingzhi Shang, Chongyun Jiang, Abdullah Rasmita, Weibo Gao, Ting Yu

Abstract: Atomically thin magnets are the key element to build up spintronics based on two-dimensional materials. The surface nature of two-dimensional ferromagnet opens up opportunities to improve the device performance efficiently. Here, we report the intrinsic ferromagnetism in atomically thin monolayer CrBr3, directly probed by polarization resolved magneto-photoluminescence. The spontaneous magnetization persists in monolayer CrBr3 with a Curie temperature of 34 K. The development of magnons by the thermal excitation is in line with the spin-wave theory. We attribute the layer-number dependent hysteresis loops in thick layers to the magnetic domain structures. As a stable monolayer material in air, CrBr3 provides a convenient platform for fundamental physics and pushes the potential applications of the two-dimensional ferromagnetism.

Submitted 20 February, 2019; originally announced February 2019.

Comments: 27 pages, 10 figures

arXiv:1901.09856 [pdf, other]

doi 10.1038/s41534-019-0226-z

Optimal verification of general bipartite pure states

Authors: Xiao-Dong Yu, Jiangwei Shang, Otfried Gühne

Abstract: The efficient and reliable verification of quantum states plays a crucial role in various quantum information processing tasks. We consider the task of verifying entangled states using one-way and two-way classical communication and completely characterize the optimal strategies via convex optimization. We solve these optimization problems using both analytical and numerical methods, and the optimal strategies can be constructed for any bipartite pure state. Compared with the nonadaptive approach, our adaptive strategies significantly improve the efficiency of quantum state verification. Moreover, these strategies are experimentally feasible, as only few local projective measurements are required.

Submitted 6 December, 2019; v1 submitted 28 January, 2019; originally announced January 2019.

Comments: 7 pages, 2 figures

Journal ref: npj Quantum Information 5, 112 (2019)

arXiv:1812.09216 [pdf, other]

doi 10.1103/PhysRevLett.122.130404

Quantifying quantum resources with conic programming

Authors: Roope Uola, Tristan Kraft, Jiangwei Shang, Xiao-Dong Yu, Otfried Gühne

Abstract: Resource theories can be used to formalize the quantification and manipulation of resources in quantum information processing such as entanglement, asymmetry and coherence of quantum states, and incompatibility of quantum measurements. Given a certain state or measurement, one can ask whether there is a task in which it performs better than any resourceless state or measurement. Using conic programming, we prove that any general robustness measure (with respect to a convex set of free states or measurements) can be seen as a quantifier of such outperformance in some discrimination task. We apply the technique to various examples, e.g. joint measurability, POVMs simulable by projective measurements, and state assemblages preparable with a given Schmidt number.

Submitted 4 April, 2019; v1 submitted 21 December, 2018; originally announced December 2018.

Comments: 8 pages, 1 figure, v2: small changes, final version

Journal ref: Phys. Rev. Lett. 122, 130404 (2019)

arXiv:1812.02615 [pdf, other]

Real-Time Transmission Mechanism Design for Wireless IoT Sensors with Energy Harvesting under Power Saving Mode

Authors: ** Shang, Muhammad Junaid Farooq, Quanyan Zhu

Abstract: The Internet of things (IoT) comprises wireless sensors and actuators connected via access points to the Internet. Often, the sensing devices are remotely deployed with limited battery power and are equipped with energy harvesting equipment. These devices transmit real-time data to the base station (BS), which is used in applications such as anomaly detection. Under sufficient power availability, wireless transmissions from sensors can be scheduled at regular time intervals to maintain real-time data acquisition. However, once the battery is significantly depleted, the devices enter into power saving mode and need to be more selective in transmitting information to the BS. Transmitting a particular piece of sensed data consumes power while discarding it may result in loss of utility at the BS. The goal is to design an optimal dynamic policy which enables the device to decide whether to transmit or to discard a piece of sensing data particularly under the power saving mode. This will enable the sensor to prolong its operation while causing minimum loss of utility to the application. We develop an analytical framework to capture the utility of the IoT sensor transmissions and leverage dynamic programming based approach to derive an optimal real-time transmission policy that is based on the statistics of information arrival, the likelihood of harvested energy, and designed lifetime of the sensors. Numerical results show that if the statistics of future data valuation are accurately predicted, there is a significant increase in utility obtained at the BS as well as the battery lifetime.

Submitted 8 April, 2019; v1 submitted 6 December, 2018; originally announced December 2018.

arXiv:1810.01496 [pdf, other]

A Light-weight Vibrational Motor Powered Recoil Robot that Hops Rapidly Across Granular Media

Authors: Alice C. Quillen, Randal C. Nelson, Hesam Askari, Kathryn Chotkowski, Esteban Wright, Jessica K. Shang

Abstract: A 1 cm coin vibrational motor fixed to the center of a 4 cm square foam platform moves rapidly across granular media (poppy seeds, millet, corn meal) at a speed of up to 30 cm/s, or about 5 body lengths/s. Fast speeds are achieved with dimensionless acceleration number, similar to a Froude number, up to 50, allowing the light-weight 1.4 g mechanism to remain above the substrate, levitated and propelled by its kicks off the surface. The mechanism is low cost and moves without any external moving parts. With 2 s exposures we photograph the trajectory of the mechanism using an LED blocked except for a pin-hole and fixed to the mechanism. Trajectories can exhibit period doubling phenomena similar to a ball bouncing on a vibrating table top. A two dimensional numerical model gives similar trajectories, though a vertical drag force is required to keep the mechanism height low. We attribute the vertical drag force to aerodynamic suction from air flow below the mechanism base and through the granular substrate. Our numerical model suggests that speed is maximized when the mechanism is prevented from jumping high off the surface. In this way the mechanism resembles a galloping or jumping animal whose body remains nearly at the same height above the ground during its gait.

Submitted 23 September, 2018; originally announced October 2018.

arXiv:1809.03599 [pdf, other]

Learning Named Entity Tagger using Domain-Specific Dictionary

Authors: Jingbo Shang, Liyuan Liu, Xiang Ren, Xiaotao Gu, Teng Ren, Jiawei Han

Abstract: Recent advances in deep neural models allow us to build reliable named entity recognition (NER) systems without handcrafting features. However, such methods require large amounts of manually-labeled training data. There have been efforts on replacing human annotations with distant supervision (in conjunction with external dictionaries), but the generated noisy labels pose significant challenges on learning effective neural models. Here we propose two neural models to suit noisy distant supervision from the dictionary. First, under the traditional sequence labeling framework, we propose a revised fuzzy CRF layer to handle tokens with multiple possible labels. After identifying the nature of noisy labels in distant supervision, we go beyond the traditional framework and propose a novel, more effective neural model AutoNER with a new Tie or Break scheme. In addition, we discuss how to refine distant supervision for better NER performance. Extensive experiments on three benchmark datasets demonstrate that AutoNER achieves the best performance when only using dictionaries with no additional human effort, and delivers competitive results with state-of-the-art supervised benchmarks.

Submitted 10 September, 2018; originally announced September 2018.

arXiv:1809.01852 [pdf, other]

GAMENet: Graph Augmented MEmory Networks for Recommending Medication Combination

Authors: Junyuan Shang, Cao Xiao, Tengfei Ma, Hongyan Li, Jimeng Sun

Abstract: Recent progress in deep learning is revolutionizing the healthcare domain including providing solutions to medication recommendations, especially recommending medication combination for patients with complex health conditions. Existing approaches either do not customize based on patient health history, or ignore existing knowledge on drug-drug interactions (DDI) that might lead to adverse outcomes. To fill this gap, we propose the Graph Augmented Memory Networks (GAMENet), which integrates the drug-drug interactions knowledge graph by a memory module implemented as a graph convolutional network, and models longitudinal patient records as the query. It is trained end-to-end to provide safe and personalized recommendation of medication combination. We demonstrate the effectiveness and safety of GAMENet by comparing with several state-of-the-art methods on real EHR data. GAMENet outperformed all baselines in all effectiveness measures, and also achieved 3.60% DDI rate reduction from existing EHR data.

Submitted 6 March, 2019; v1 submitted 6 September, 2018; originally announced September 2018.

Comments: AAAI 2019; change the template and fix some typos

arXiv:1808.03505 [pdf, ps, other]

doi 10.1016/j.jmmm.2018.10.095

Origin of $sp$-electron magnetism in Graphitic Carbon Nitride

Authors: Wei Xu, ** Shang, Jie-Xiang Yu, J. G. Che

Abstract: Based on first principles calculations, this study reveals that magnetism in otherwise non-magnetic materials can originate from the partial occupation of antibonding states. Since the antibonding wavefunctions are spatially antisymmetric, the spin wavefunctions should be symmetric according to the exchange antisymmetric principle of quantum mechanics. We demonstrate that this phenomenon can be observed in a graphitic carbon nitride material, $g$-C$_4$N$_3$, which can be experimentally synthesized and seen as a honeycomb structure with a vacancy. Three dangling bonds of N atoms pointing to the vacancy site interact with each other to form one bonding and two antibonding states. As the two antibonding states are near the Fermi level, and electrons should partially occupy the antibonding states in spin polarization, this leads to 1~$μ_B$ magnetic moment.

Submitted 10 August, 2018; originally announced August 2018.

Comments: four pages, three figures

arXiv:1808.02222 [pdf, other]

doi 10.3390/e21030260

Coherence Depletion in Quantum Algorithms

Authors: Ye-Chao Liu, Jiangwei Shang, Xiangdong Zhang

Abstract: Besides the superior efficiency compared to their classical counterparts, quantum algorithms known so far are basically task-dependent, and scarcely any common features are shared between them. In this work, however, we show that the depletion of quantum coherence turns out to be a common phenomenon in these algorithms. For all the quantum algorithms that we investigated including Grover's algorithm, Deutsch-Jozsa algorithm and Shor's algorithm, quantum coherence of the system states reduces to the minimum along with the successful execution of the respective processes. Notably, a similar conclusion cannot be drawn using other quantitative measures such as quantum entanglement. Thus, we expect that coherence depletion as a common feature can be useful for devising new quantum algorithms in the future.

Submitted 7 March, 2019; v1 submitted 7 August, 2018; originally announced August 2018.

Comments: final version, title changed; 11 pages, 2 figures, 44 references

Journal ref: Entropy 21, 260 (2019)

arXiv:1806.07147 [pdf, ps, other]

doi 10.3847/1538-4357/ab0c1e

Evolution of X-Ray Properties of MAXI J1535-571: Analysis with the TCAF Solution

Authors: J. -R. Shang, D. Debnath, D. Chatterjee, A. Jana, S. K. Chakrabarti, H. -K. Chang, Y. -X. Yap, C. -L. Chiu

Abstract: We present spectral and timing properties of the newly discovered X-ray transient source, MAXI J1535-571, which is believed to be a Galactic X-ray binary containing a black hole candidate (BHC) as the primary object. After its discovery on 2017 Sep. 2, it has been monitored regularly in multi-wavelength bands by several satellites. We use archival data of Swift (XRT and BAT) and MAXI (GSC) satellite instruments to study accretion flow dynamics of the source during the outburst. During its outburst, the source became very bright in the sky with a maximum observed flux of $5$~Crab in the $2-10$~keV GSC band. Similar to other transient BHCs, it also shows signatures of low frequency quasi-periodic oscillations (QPOs) during the outburst. Spectral data of different instruments are fitted with the transonic flow solution based two-component advective flow (TCAF) model fits file to find the direct accretion flow parameters. Evolution of spectral states and their transitions are understood from the model fitted physical flow parameters and nature of QPOs. We also estimate probable mass of the black hole from our spectral analysis as $7.9-9.9~M_\odot$ or $8.9\pm1.0~M_\odot$.

Submitted 7 May, 2019; v1 submitted 19 June, 2018; originally announced June 2018.

Comments: 14 pages, 6 figures, 2 tables

Journal ref: 2019ApJ...875....4S

arXiv:1805.03955 [pdf, other]

doi 10.1103/PhysRevA.98.022309

Enhanced entanglement criterion via symmetric informationally complete measurements

Authors: Jiangwei Shang, Ali Asadian, Huangjun Zhu, Otfried Gühne

Abstract: We show that a special type of measurements, called symmetric informationally complete positive operator-valued measures (SIC POVMs), provide a stronger entanglement detection criterion than the computable cross-norm or realignment criterion based on local orthogonal observables. As an illustration, we demonstrate the enhanced entanglement detection power in simple systems of qubit and qutrit pairs. This observation highlights the significance of SIC POVMs for entanglement detection.

Submitted 10 August, 2018; v1 submitted 10 May, 2018; originally announced May 2018.

Comments: final version, 7 pages, 6 figures, 32 references; published in PRA as an Editors' Suggestion

Journal ref: Phys. Rev. A 98, 022309 (2018)

arXiv:1804.10877 [pdf, other]

Entity Set Search of Scientific Literature: An Unsupervised Ranking Approach

Authors: Jiaming Shen, Jinfeng Xiao, Xinwei He, Jingbo Shang, Saurabh Sinha, Jiawei Han

Abstract: Literature search is critical for any scientific research. Different from Web or general domain search, a large portion of queries in scientific literature search are entity-set queries, that is, multiple entities of possibly different types. Entity-set queries reflect users' need for finding documents that contain multiple entities and reveal inter-entity relationships and thus pose non-trivial challenges to existing search algorithms that model each entity separately. However, entity-set queries are usually sparse (i.e., not so repetitive), which makes many supervised ranking models that rely heavily on associated click history ineffective. To address these challenges, we introduce SetRank, an unsupervised ranking framework that models inter-entity relationships and captures entity type information. Furthermore, we develop a novel unsupervised model selection algorithm, based on the technique of weighted rank aggregation, to automatically choose the parameter settings in SetRank without resorting to a labeled validation set. We evaluate our proposed unsupervised approach using datasets from TREC Genomics Tracks and Semantic Scholar's query log. The experiments demonstrate that SetRank significantly outperforms the baseline unsupervised models, especially on entity-set queries, and our model selection algorithm effectively chooses suitable parameter settings.

Submitted 29 April, 2018; originally announced April 2018.

Comments: SIGIR 2018 Full Paper

arXiv:1804.09931 [pdf, other]

Integrating Local Context and Global Cohesiveness for Open Information Extraction

Authors: Qi Zhu, Xiang Ren, Jingbo Shang, Yu Zhang, Ahmed El-Kishky, Jiawei Han

Abstract: Extracting entities and their relations from text is an important task for understanding massive text corpora. Open information extraction (IE) systems mine relation tuples (i.e., entity arguments and a predicate string to describe their relation) from sentences. These relation tuples are not confined to a predefined schema for the relations of interest. However, current Open IE systems focus on modeling local context information in a sentence to extract relation tuples, while ignoring the fact that global statistics in a large corpus can be collectively leveraged to identify high-quality sentence-level extractions. In this paper, we propose a novel Open IE system, called ReMine, which integrates local context signals and global structural signals in a unified, distant-supervision framework. Leveraging facts from external knowledge bases as supervision, the new system can be applied to many different domains to facilitate sentence-level tuple extractions using corpus-level statistics. Our system operates by solving a joint optimization problem to unify (1) segmenting entity/relation phrases in individual sentences based on local context; and (2) measuring the quality of tuples extracted from individual sentences with a translating-based objective. Learning the two subtasks jointly helps correct errors produced in each subtask so that they can mutually enhance each other. Experiments on two real-world corpora from different domains demonstrate the effectiveness, generality, and robustness of ReMine when compared to state-of-the-art open IE systems.

Submitted 1 December, 2018; v1 submitted 26 April, 2018; originally announced April 2018.

Comments: 8 pages + 1 page reference. Accepted to WSDM 2019

arXiv:1804.07827 [pdf, other]

Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling

Authors: Liyuan Liu, Xiang Ren, Jingbo Shang, Jian Peng, Jiawei Han

Abstract: Many efforts have been made to facilitate natural language processing tasks with pre-trained language models (LMs), and these have brought significant improvements to various applications. To fully leverage the nearly unlimited corpora and capture linguistic information of multifarious levels, large-size LMs are required; but for a specific task, only part of this information is useful. Such large-sized LMs, even in the inference stage, may cause heavy computation workloads, making them too time-consuming for large-scale applications. Here we propose to compress bulky LMs while preserving useful information with regard to a specific task. As different layers of the model keep different information, we develop a layer selection method for model pruning using sparsity-inducing regularization. By introducing the dense connectivity, we can detach any layer without affecting others, and stretch shallow and wide LMs to be deep and narrow. In model training, LMs are learned with layer-wise dropouts for better robustness. Experiments on two benchmark datasets demonstrate the effectiveness of our method.

Submitted 10 September, 2018; v1 submitted 20 April, 2018; originally announced April 2018.

Comments: EMNLP 2018

arXiv:1804.07562 [pdf, other]

doi 10.22331/q-2018-12-18-113

Bound entangled states fit for robust experimental verification

Authors: Gael Sentís, Johannes N. Greiner, Jiangwei Shang, Jens Siewert, Matthias Kleinmann

Abstract: Preparing and certifying bound entangled states in the laboratory is an intrinsically hard task, due to both the fact that they typically form narrow regions in the state space, and that a certificate requires a tomographic reconstruction of the density matrix. Indeed, the previous experiments that have reported the preparation of a bound entangled state relied on such tomographic reconstruction techniques. However, the reliability of these results crucially depends on the extra assumption of an unbiased reconstruction. We propose an alternative method for certifying the bound entangled character of a quantum state that leads to a rigorous claim within a desired statistical significance, while bypassing a full reconstruction of the state. The method comprises a search for bound entangled states that are robust for experimental verification, and a hypothesis test tailored for the detection of bound entanglement that is naturally equipped with a measure of statistical significance. We apply our method to families of states of $3\times 3$ and $4\times 4$ systems, and find that the experimental certification of bound entangled states is well within reach.

Submitted 14 December, 2018; v1 submitted 20 April, 2018; originally announced April 2018.

Comments: Accepted version in Quantum

Journal ref: Quantum 2, 113 (2018)

arXiv:1802.07398 [pdf, other]

doi 10.1145/3269206.3272020

Investigating Rumor News Using Agreement-Aware Search

Authors: Jingbo Shang, Tianhang Sun, Jiaming Shen, Xingbang Liu, Anja Gruenheid, Flip Korn, Adam Lelkes, Cong Yu, Jiawei Han

Abstract: Recent years have witnessed a widespread increase of rumor news generated by humans and machines. Therefore, tools for investigating rumor news have become an urgent necessity. One useful function of such tools is to see ways a specific topic or event is represented by presenting different points of view from multiple sources. In this paper, we propose Maester, a novel agreement-aware search framework for investigating rumor news. Given an investigative question, Maester will retrieve articles related to that question, and assign and display top articles from agree, disagree, and discuss categories to users. Splitting the results into these three categories provides the user a holistic view towards the investigative question. We build Maester based on the following two key observations: (1) relatedness can commonly be determined by keywords and entities occurring in both questions and articles, and (2) the level of agreement between the investigative question and the related news article can often be decided by a few key sentences. Accordingly, we use gradient boosting tree models with keyword/entity matching features for relatedness detection, and leverage a recurrent neural network to infer the level of agreement. Our experiments on the Fake News Challenge (FNC) dataset demonstrate up to an order of magnitude improvement of Maester over the original FNC winning solution, for agreement-aware search.

Submitted 16 September, 2018; v1 submitted 20 February, 2018; originally announced February 2018.

arXiv:1802.06189 [pdf, other]

Contrast Subgraph Mining from Coherent Cores

Authors: Jingbo Shang, Xiyao Shi, Meng Jiang, Liyuan Liu, Timothy Hanratty, Jiawei Han

Abstract: Graph pattern mining methods can extract informative and useful patterns from large-scale graphs and capture underlying principles through the overwhelming information. Contrast analysis serves as a keystone in various fields and has demonstrated its effectiveness in mining valuable information. However, it has long been overlooked in graph pattern mining. Therefore, in this paper, we introduce the concept of contrast subgraph, that is, a subset of nodes that have significantly different edges or edge weights in two given graphs of the same node set. The major challenge comes from the gap between the contrast and the informativeness. Because of the widely existing noise edges in real-world graphs, the contrast may lead to subgraphs of pure noise. To avoid such meaningless subgraphs, we leverage the similarity as the cornerstone of the contrast. Specifically, we first identify a coherent core, which is a small subset of nodes with similar edge structures in the two graphs, and then induce contrast subgraphs from the coherent cores. Moreover, we design a general family of coherence and contrast metrics and derive a polynomial-time algorithm to efficiently extract contrast subgraphs. Extensive experiments verify the necessity of introducing coherent cores as well as the effectiveness and efficiency of our algorithm. Real-world applications demonstrate the tremendous potential of contrast subgraph mining.

Submitted 16 February, 2018; originally announced February 2018.

arXiv:1801.09851 [pdf, other]

Cross-type Biomedical Named Entity Recognition with Deep Multi-Task Learning

Authors: Xuan Wang, Yu Zhang, Xiang Ren, Yuhao Zhang, Marinka Zitnik, Jingbo Shang, Curtis Langlotz, Jiawei Han

Abstract: Motivation: State-of-the-art biomedical named entity recognition (BioNER) systems often require handcrafted features specific to each entity type, such as genes, chemicals and diseases. Although recent studies explored using neural network models for BioNER to free experts from manual feature engineering, the performance remains limited by the available training data for each entity type. Results: We propose a multi-task learning framework for BioNER to collectively use the training data of different types of entities and improve the performance on each of them. In experiments on 15 benchmark BioNER datasets, our multi-task model achieves substantially better performance compared with state-of-the-art BioNER systems and baseline neural sequence labeling models. Further analysis shows that the large performance gains come from sharing character- and word-level information among relevant biomedical entities across differently labeled corpora.

Submitted 7 October, 2018; v1 submitted 29 January, 2018; originally announced January 2018.

Comments: 7 pages, 4 figures

arXiv:1710.10045 [pdf, other]

doi 10.1038/s41467-018-03849-x

Deterministic realization of collective measurements via photonic quantum walks

Authors: Zhibo Hou, Jun-Feng Tang, Jiangwei Shang, Huangjun Zhu, Jian Li, Yuan Yuan, Kang-Da Wu, Guo-Yong Xiang, Chuan-Feng Li, Guang-Can Guo

Abstract: Collective measurements on identically prepared quantum systems can extract more information than local measurements, thereby enhancing information-processing efficiency. Although this nonclassical phenomenon has been known for two decades, it has remained a challenging task to demonstrate the advantage of collective measurements in experiments. Here we introduce a general recipe for performing deterministic collective measurements on two identically prepared qubits based on quantum walks. Using photonic quantum walks, we realize experimentally an optimized collective measurement with fidelity 0.9946 without post selection. As an application, we achieve the highest tomographic efficiency in qubit state tomography to date. Our work offers an effective recipe for beating the precision limit of local measurements in quantum state tomography and metrology. In addition, our study opens an avenue for harvesting the power of collective measurements in quantum information processing and for exploring the intriguing physics behind this power.

Submitted 17 April, 2018; v1 submitted 27 October, 2017; originally announced October 2017.

Comments: Close to the published version

Journal ref: Nature Communications 9 (1), 1414 (2018)

arXiv:1709.06636 [pdf, other]

An Attention-based Collaboration Framework for Multi-View Network Representation Learning

Authors: Meng Qu, Jian Tang, Jingbo Shang, Xiang Ren, Ming Zhang, Jiawei Han

Abstract: Learning distributed node representations in networks has been attracting increasing attention recently due to its effectiveness in a variety of applications. Existing approaches usually study networks with a single type of proximity between nodes, which defines a single view of a network. However, in reality there usually exist multiple types of proximities between nodes, yielding networks with multiple views. This paper studies learning node representations for networks with multiple views, which aims to infer robust node representations across different views. We propose a multi-view representation learning approach, which promotes the collaboration of different views and lets them vote for the robust representations. During the voting process, an attention mechanism is introduced, which enables each node to focus on the most informative views. Experimental results on real-world networks show that the proposed approach outperforms existing state-of-the-art approaches for network representation learning with a single view and other competitive approaches with multiple views.

Submitted 19 September, 2017; originally announced September 2017.

Comments: CIKM 2017

arXiv:1709.04109 [pdf, other]

Empower Sequence Labeling with Task-Aware Neural Language Model

Authors: Liyuan Liu, Jingbo Shang, Frank F. Xu, Xiang Ren, Huan Gui, Jian Peng, Jiawei Han

Abstract: Linguistic sequence labeling is a general modeling approach that encompasses a variety of problems, such as part-of-speech tagging and named entity recognition. Recent advances in neural networks (NNs) make it possible to build reliable models without handcrafted features. However, in many cases, it is hard to obtain sufficient annotations to train these models. In this study, we develop a novel neural framework to extract abundant knowledge hidden in raw texts to empower the sequence labeling task. Besides word-level knowledge contained in pre-trained word embeddings, character-aware neural language models are incorporated to extract character-level knowledge. Transfer learning techniques are further adopted to mediate different components and guide the language model towards the key knowledge. Compared to previous methods, this task-specific knowledge allows us to adopt a more concise model and conduct more efficient training. Different from most transfer learning methods, the proposed framework does not rely on any additional supervision. It extracts knowledge from self-contained order information of training sequences. Extensive experiments on benchmark datasets demonstrate the effectiveness of leveraging character-level knowledge and the efficiency of co-training. For example, on the CoNLL03 NER task, model training completes in about 6 hours on a single GPU, reaching an F1 score of 91.71$\pm$0.10 without using any extra annotation.

Submitted 23 November, 2017; v1 submitted 12 September, 2017; originally announced September 2017.

Comments: AAAI 2018

arXiv:1707.02958 [pdf, other]

doi 10.1103/PhysRevLett.120.050506

Convex optimization over classes of multiparticle entanglement

Authors: Jiangwei Shang, Otfried Gühne

Abstract: A well-known strategy to characterize multiparticle entanglement utilizes the notion of stochastic local operations and classical communication (SLOCC), but characterizing the resulting entanglement classes is difficult. Given a multiparticle quantum state, we first show that Gilbert's algorithm can be adapted to prove separability or membership in a certain entanglement class. We then present two algorithms for convex optimization over SLOCC classes. The first algorithm uses a simple gradient approach, while the other one employs the accelerated projected-gradient method. For demonstration, the algorithms are applied to the likelihood-ratio test using experimental data on bound entanglement of a noisy four-photon Smolin state [Phys. Rev. Lett. 105, 130501 (2010)].

Submitted 1 February, 2018; v1 submitted 10 July, 2017; originally announced July 2017.

Comments: 10 pages, 9 figures, 1 table, 44 references, close to the published version

Journal ref: Phys. Rev. Lett. 120, 050506 (2018)

arXiv:1705.11062 [pdf, ps, other]

doi 10.1103/PhysRevE.97.022129

Transformation thermal convection: Cloaking, concentrating, and camouflage

Authors: Gaole Dai, Jin Shang, Jiping Huang

Abstract: Heat can generally transfer via thermal conduction, thermal radiation, and thermal convection. All the existing theories of transformation thermotics and optics can treat thermal conduction and thermal radiation, respectively. Unfortunately, thermal convection has never been touched in transformation theories due to the lack of a suitable theory, thus limiting applications associated with heat transfer through fluids (liquid or gas). Here, we develop, for the first time, a general theory of transformation thermal convection by considering the convection-diffusion equation, the Navier-Stokes equation, and the Darcy law. By introducing porous media, we get a set of coupled equations keeping their forms under coordinate transformation. As model applications, the theory helps to show the effects of cloaking, concentrating, and camouflage. Our finite element simulations confirm the theoretical findings. This work offers a general transformation theory for thermal convection, thus revealing some novel behaviors of thermal convection; it not only provides new hints on how to control heat transfer by combining thermal conduction, thermal radiation, and thermal convection, but also benefits the study of mass diffusion and other related fields that contain a set of equations and need to transform velocities at the same time.

Submitted 31 May, 2017; originally announced May 2017.

Comments: 17 pages, 6 figures

Journal ref: Phys. Rev. E 97, 022129 (2018)

arXiv:1704.00159 [pdf, other]

Compositional Human Pose Regression

Authors: Xiao Sun, Jiaxiang Shang, Shuang Liang, Yichen Wei

Abstract: Regression based methods are not performing as well as detection based methods for human pose estimation. A central problem is that the structural information in the pose is not well exploited in the previous regression methods. In this work, we propose a structure-aware regression approach. It adopts a reparameterized pose representation using bones instead of joints. It exploits the joint connection structure to define a compositional loss function that encodes the long range interactions in the pose. It is simple, effective, and general for both 2D and 3D pose estimation in a unified setting. Comprehensive evaluation validates the effectiveness of our approach. It significantly advances the state-of-the-art on Human3.6M and is competitive with state-of-the-art results on MPII.

Submitted 1 August, 2017; v1 submitted 1 April, 2017; originally announced April 2017.

Comments: Accepted by International Conference on Computer Vision (ICCV) 2017

Showing 201–250 of 282 results for author: Shang, J