-
Collective Quantum Entanglement in Molecular Cavity Optomechanics
Authors:
Jian Huang,
Dangyuan Lei,
Girish S. Agarwal,
Zhedong Zhang
Abstract:
We propose an optomechanical scheme for reaching quantum entanglement in vibration polaritons. The system involves $N$ molecules, whose vibrations can be fairly entangled with plasmonic cavities. We find that the vibration-photon entanglement can exist at room temperature and is robust against thermal noise. We further demonstrate the quantum entanglement between the vibrational modes through the…
▽ More
We propose an optomechanical scheme for reaching quantum entanglement in vibration polaritons. The system involves $N$ molecules, whose vibrations can be fairly entangled with plasmonic cavities. We find that the vibration-photon entanglement can exist at room temperature and is robust against thermal noise. We further demonstrate the quantum entanglement between the vibrational modes through the plasmonic cavities, which shows a delocalized nature and an incredible enhancement with the number of molecules. The underlying mechanism for the entanglement is attributed to the strong vibration-cavity coupling which possesses collectivity. Our results provide a molecular optomechanical scheme which offers a promising platform for the study of noise-free quantum resources and macroscopic quantum phenomena.
△ Less
Submitted 25 May, 2024; v1 submitted 20 May, 2024;
originally announced May 2024.
-
HDR Imaging for Dynamic Scenes with Events
Authors:
Li Xiaopeng,
Zeng Zhaoyuan,
Fan Cien,
Zhao Chen,
Deng Lei,
Yu Lei
Abstract:
High dynamic range imaging (HDRI) for real-world dynamic scenes is challenging because moving objects may lead to hybrid degradation of low dynamic range and motion blur. Existing event-based approaches only focus on a separate task, while cascading HDRI and motion deblurring would lead to sub-optimal solutions, and unavailable ground-truth sharp HDR images aggravate the predicament. To address th…
▽ More
High dynamic range imaging (HDRI) for real-world dynamic scenes is challenging because moving objects may lead to hybrid degradation of low dynamic range and motion blur. Existing event-based approaches only focus on a separate task, while cascading HDRI and motion deblurring would lead to sub-optimal solutions, and unavailable ground-truth sharp HDR images aggravate the predicament. To address these challenges, we propose an Event-based HDRI framework within a Self-supervised learning paradigm, i.e., Self-EHDRI, which generalizes HDRI performance in real-world dynamic scenarios. Specifically, a self-supervised learning strategy is carried out by learning cross-domain conversions from blurry LDR images to sharp LDR images, which enables sharp HDR images to be accessible in the intermediate process even though ground-truth sharp HDR images are missing. Then, we formulate the event-based HDRI and motion deblurring model and conduct a unified network to recover the intermediate sharp HDR results, where both the high dynamic range and high temporal resolution of events are leveraged simultaneously for compensation. We construct large-scale synthetic and real-world datasets to evaluate the effectiveness of our method. Comprehensive experiments demonstrate that the proposed Self-EHDRI outperforms state-of-the-art approaches by a large margin. The codes, datasets, and results are available at https://lxp-whu.github.io/Self-EHDRI.
△ Less
Submitted 4 April, 2024;
originally announced April 2024.
-
What is the focus of XAI in UI design? Prioritizing UI design principles for enhancing XAI user experience
Authors:
Dian Lei,
Yao He,
Jianyou Zeng
Abstract:
With the widespread application of artificial intelligence(AI), the explainable AI (XAI) field has undergone a notable resurgence. In this background, the importance of user experience in XAI has become increasingly prominent. Simultaneously, the user interface (UI) serves as a crucial link between XAI and users. However, despite the existence of UI design principles for XAI, there is a lack of pr…
▽ More
With the widespread application of artificial intelligence(AI), the explainable AI (XAI) field has undergone a notable resurgence. In this background, the importance of user experience in XAI has become increasingly prominent. Simultaneously, the user interface (UI) serves as a crucial link between XAI and users. However, despite the existence of UI design principles for XAI, there is a lack of prioritization based on their significance. This will lead practitioners to have a vague understanding of different design principles, making it difficult to allocate design space reasonably and emphasize design focal points. This paper aims to prioritize four design principles, providing clear guidance for UI design in XAI. Initially, we conducted a lightweight summary to derive five user experience standards for non-expert users in XAI. Subsequently, we developed four corresponding webpage prototypes for the four design principles. Nineteen participants then interacted with these prototypes, providing ratings based on five user experience standards, and We calculated the weights of the design principles. Our findings indicate that, for non-expert users, "sensitivity" is the optimal UI design principle (weight = 0.3296), followed by "flexibility" (weight = 0.3014). Finally, we engage in further discussion and summarization of our research results, and present future works and limitations.
△ Less
Submitted 10 June, 2024; v1 submitted 21 February, 2024;
originally announced February 2024.
-
All-optical correlated noisy channel and its application in recovering quantum coherence
Authors:
Dan Lei,
Disheng Guo,
Jun Xin,
Xiao-Ming Lu
Abstract:
Attenuation and amplification are the most common processes for optical communications. Amplification can be used to compensate the attenuation of the complex amplitude of an optical field, but is unable to recover the coherence lost, provided that the attenuation channel and the amplification channel are independent. In this work, we show that the quantum coherence of an optical filed can be rega…
▽ More
Attenuation and amplification are the most common processes for optical communications. Amplification can be used to compensate the attenuation of the complex amplitude of an optical field, but is unable to recover the coherence lost, provided that the attenuation channel and the amplification channel are independent. In this work, we show that the quantum coherence of an optical filed can be regained if the attenuation channel and the amplification channel share correlated noise. We propose an all-optical correlated noisy channel relying on four-wave mixing process and demonstrate its capability of recovering quantum coherence within continuous-variable systems. We quantitatively investigate the coherence recovery phenomena for coherent states and two-mode squeezed states. Moreover, we analyze the effect of other photon losses that are independent with the recovery channel on the performance of recovering coherence. Different from correlated noisy channels previously proposed based on electro-optic conversions, the correlated noisy channel in our protocol is all-optical and thus owns larger operational bandwidths.
△ Less
Submitted 22 December, 2023; v1 submitted 24 October, 2023;
originally announced October 2023.
-
Chain of Natural Language Inference for Reducing Large Language Model Ungrounded Hallucinations
Authors:
Deren Lei,
Yaxi Li,
Mengya Hu,
Mingyu Wang,
Vincent Yun,
Emily Ching,
Eslam Kamal
Abstract:
Large language models (LLMs) can generate fluent natural language texts when given relevant documents as background context. This ability has attracted considerable interest in develo** industry applications of LLMs. However, LLMs are prone to generate hallucinations that are not supported by the provided sources. In this paper, we propose a hierarchical framework to detect and mitigate such ung…
▽ More
Large language models (LLMs) can generate fluent natural language texts when given relevant documents as background context. This ability has attracted considerable interest in develo** industry applications of LLMs. However, LLMs are prone to generate hallucinations that are not supported by the provided sources. In this paper, we propose a hierarchical framework to detect and mitigate such ungrounded hallucination. Our framework uses Chain of Natural Language Inference (CoNLI) for hallucination detection and hallucination reduction via post-editing. Our approach achieves state-of-the-art performance on hallucination detection and enhances text quality through rewrite, using LLMs without any fine-tuning or domain-specific prompt engineering. We show that this simple plug-and-play framework can serve as an effective choice for hallucination detection and reduction, achieving competitive performance across various contexts.
△ Less
Submitted 9 October, 2023; v1 submitted 5 October, 2023;
originally announced October 2023.
-
Characterising User Transfer Amid Industrial Resource Variation: A Bayesian Nonparametric Approach
Authors:
Dongxu Lei,
Xiaotian Lin,
Xinghu Yu,
Zhan Li,
Weichao Sun,
Jianbin Qiu,
Songlin Zhuang,
Huijun Gao
Abstract:
In a multitude of industrial fields, a key objective entails optimising resource management whilst satisfying user requirements. Resource management by industrial practitioners can result in a passive transfer of user loads across resource providers, a phenomenon whose accurate characterisation is both challenging and crucial. This research reveals the existence of user clusters, which capture mac…
▽ More
In a multitude of industrial fields, a key objective entails optimising resource management whilst satisfying user requirements. Resource management by industrial practitioners can result in a passive transfer of user loads across resource providers, a phenomenon whose accurate characterisation is both challenging and crucial. This research reveals the existence of user clusters, which capture macro-level user transfer patterns amid resource variation. We then propose CLUSTER, an interpretable hierarchical Bayesian nonparametric model capable of automating cluster identification, and thereby predicting user transfer in response to resource variation. Furthermore, CLUSTER facilitates uncertainty quantification for further reliable decision-making. Our method enables privacy protection by functioning independently of personally identifiable information. Experiments with simulated and real-world data from the communications industry reveal a pronounced alignment between prediction results and empirical observations across a spectrum of resource management scenarios. This research establishes a solid groundwork for advancing resource management strategy development.
△ Less
Submitted 25 September, 2023;
originally announced September 2023.
-
p-sylowizers and p-nilpotency of finite groups
Authors:
Yaxin Gao,
Xianhua Li,
Donglin Lei
Abstract:
In this paper, we investigate the structure of finite group G by assuming that the intersections between p-sylowizers of some p-subgroups of G and $O^p(G)$ are S-permutable in G. We obtain some criterions for p-nilpotency of a finite group.
In this paper, we investigate the structure of finite group G by assuming that the intersections between p-sylowizers of some p-subgroups of G and $O^p(G)$ are S-permutable in G. We obtain some criterions for p-nilpotency of a finite group.
△ Less
Submitted 23 June, 2023;
originally announced June 2023.
-
Multidimensional Coherent Spectroscopy of Molecular Polaritons: Langevin Approach
Authors:
Zhedong Zhang,
Xiaoyu Nie,
Dangyuan Lei,
Shaul Mukame
Abstract:
We present a microscopic theory for nonlinear optical spectroscopy of N molecules in an optical cavity. A quantum Langevin analytical expression is derived for the time- and frequency-resolved signals accounting for arbitrary numbers of vibrational excitations. We identify clear signatures of the polariton-polaron interaction from multidimensional projections of the signal, e.g., pathways and time…
▽ More
We present a microscopic theory for nonlinear optical spectroscopy of N molecules in an optical cavity. A quantum Langevin analytical expression is derived for the time- and frequency-resolved signals accounting for arbitrary numbers of vibrational excitations. We identify clear signatures of the polariton-polaron interaction from multidimensional projections of the signal, e.g., pathways and timescales. Cooperative dynamics of cavity polaritons against intramolecular vibrations is revealed, along with a cross talk between long-range coherence and vibronic coupling that may lead to localization effects. Our results further characterize the polaritonic coherence and the population transfer that is slower.
△ Less
Submitted 24 October, 2022;
originally announced October 2022.
-
Flat extensions of groups and limit varieties of ai-semirings
Authors:
Miaomiao Ren,
Marcel Jackson,
Xianzhong Zhao,
Donglin Lei
Abstract:
The present paper is a continuation of \cite{jrz} and is devoted to the study of limit varieties of additively idempotent semirings. A limit variety is a nonfinitely based variety whose proper subvarieties are all finitely based. We present concrete constructions for one infinite family of limit additively idempotent semiring varieties, and one further ad hoc example. Each of these examples can be…
▽ More
The present paper is a continuation of \cite{jrz} and is devoted to the study of limit varieties of additively idempotent semirings. A limit variety is a nonfinitely based variety whose proper subvarieties are all finitely based. We present concrete constructions for one infinite family of limit additively idempotent semiring varieties, and one further ad hoc example. Each of these examples can be generated by a finite flat semiring, with the infinite family arising by a way of a complete characterisation of limit varieties that can be generated by the flat extension of a finite group. We also demonstrate the existence of other examples of limit varieties of additively idempotent semirings, including one further continuum-sized family, each with no finite generator, and two further ad hoc examples. While an explicit description of these latter examples is not given, one of the examples is proved to contain only trivial flat semirings.
△ Less
Submitted 29 August, 2022;
originally announced August 2022.
-
Directional Dipole Dice Enabled by Anisotropic Chirality
Authors:
Yuqiong Cheng,
Kayode Adedotun Oyesina,
Bo Xue,
Dangyuan Lei,
Alex M. H. Wong,
Shubo Wang
Abstract:
Directional radiation and scattering play an essential role in light manipulation for various applications in integrated nanophotonics, antenna and metasurface designs, quantum optics, etc. The most elemental system with this property is the class of directional dipoles, including the circular dipole, Huygens dipole, and Janus dipole. A unified realization of all three dipole types and a mechanism…
▽ More
Directional radiation and scattering play an essential role in light manipulation for various applications in integrated nanophotonics, antenna and metasurface designs, quantum optics, etc. The most elemental system with this property is the class of directional dipoles, including the circular dipole, Huygens dipole, and Janus dipole. A unified realization of all three dipole types and a mechanism to freely switch among them are previously unreported, yet highly desirable for develo** compact and multifunctional directional sources. Here, we theoretically and experimentally demonstrate that the synergy of chirality and anisotropy can give rise to all three directional dipoles in one structure at the same frequency under linearly polarized plane wave excitations. This mechanism enables a simple helix particle to serve as a directional dipole dice (DDD), achieving selective manipulation of optical directionality via different "faces" of the particle. We employ three "faces" of the DDD to realize face-multiplexed routing of guided waves in three orthogonal directions with the directionality determined by spin, power flow, and reactive power, respectively. This construction of the complete directionality space can enable the unprecedented high-dimensional control of both near-field and far-field directionality with broad applications in photonic integrated circuits, quantum information processing, and subwavelength-resolution imaging.
△ Less
Submitted 14 January, 2023; v1 submitted 17 July, 2022;
originally announced August 2022.
-
Super-resolution multicolor fluorescence microscopy enabled by an apochromatic super-oscillatory lens with extended depth-of-focus
Authors:
Wenli Li,
Pei He,
Yulong Fan,
Yangtao Du,
Bo Gao,
Zhiqin Chu,
Chengxu An,
Dangyuan Lei,
Weizheng Yuan,
Yiting Yu
Abstract:
Multicolor super-resolution imaging remains an intractable challenge for both far-field and near-field based super-resolution techniques. Planar super-oscillatory lens (SOL), a far-field subwavelength-focusing diffractive lens device, holds great potential for achieving sub-diffraction-limit imaging at multiple wavelengths. However, conventional SOL devices suffer from a numerical aperture (NA) re…
▽ More
Multicolor super-resolution imaging remains an intractable challenge for both far-field and near-field based super-resolution techniques. Planar super-oscillatory lens (SOL), a far-field subwavelength-focusing diffractive lens device, holds great potential for achieving sub-diffraction-limit imaging at multiple wavelengths. However, conventional SOL devices suffer from a numerical aperture (NA) related intrinsic tradeoff among the depth of focus (DoF), chromatic dispersion and focus spot size, being an essential characteristics of common diffractive optical elements. Typically, the limited DoF and significant chromatism associated with high NA can lead to unfavorable degradation of image quality although increasing NA imporves the resolution. Here, we apply a multi-objective genetic algorithm (GA) optimization approach to design an apochromatic binary-phase SOL that generates axially jointed multifoci concurrently having prolonged DoF, customized working distance (WD) and suppressed side-lobes yet minimized main-lobe size, optimizing the aforementioned NA-dependent tradeoff. Experimental implementation of this GA-optimized SOL demonstrates simultaneous focusing of blue, green and red light beams into an optical needle half of the incident wavelength in diameter at 428 um WD, resulting in an ultimate resolution better than one third of the incident wavelength in the lateral dimension. By integrating this apochromatic SOL device with a commercial fluorescence microscope, we employ the optical needle to perform, for the first time, three-dimensional super-resolution multicolor fluorescence imaging of the unseen fine structure of neurons at one go. The present study provides not only a practical route to far-field multicolor super-resolution imaging but also a viable approach for constructing imaging systems avoiding complex sample positioning and unfavorable photobleaching.
△ Less
Submitted 5 June, 2022;
originally announced June 2022.
-
Quantum Fluctuations and Coherence of a Molecular Polariton Condensate
Authors:
Zhedong Zhang,
Shixuan Zhao,
Dangyuan Lei
Abstract:
A full quantum theory beyond the mean-field regime is developed for an exciton polariton condensate, to gain a complete understanding of quantum fluctuations. We find analytical solution for the polariton density matrix, showing the polariton nonlinearity causing fast relaxation correlated with the pump so as to yield the condensation at threshold. Increasing the pump intensity, a nonequilibrium p…
▽ More
A full quantum theory beyond the mean-field regime is developed for an exciton polariton condensate, to gain a complete understanding of quantum fluctuations. We find analytical solution for the polariton density matrix, showing the polariton nonlinearity causing fast relaxation correlated with the pump so as to yield the condensation at threshold. Increasing the pump intensity, a nonequilibrium phase transition towards the condensation of lower polaritons emerges, with a statistics transiting from a thermal, through a super-Poissonian and to a nonclassical distribution beyond the understanding at the level of off-diagonal long-range order. The results signify the role of dark states for polariton fluctuations, and lead to a nonclassical counting statistics of emitted photons, which elaborates the role of the key parameters, e.g., pump, detuning and temperature.
△ Less
Submitted 28 April, 2022;
originally announced April 2022.
-
Intrinsic Superflat Bands in General Twisted Bilayer Systems
Authors:
Hongfei Wang,
Shaojie Ma,
Shuang Zhang,
Dangyuan Lei
Abstract:
Twisted bilayer systems with discrete magic angles, such as twisted bilayer graphene featuring moiré superlattices, provide a versatile platform for exploring novel physical properties. Here, we discover a class of superflat bands in general twisted bilayer systems beyond the low-energy physics of magic-angle twisted counterparts. By considering continuous lattice dislocation, we obtain intrinsic…
▽ More
Twisted bilayer systems with discrete magic angles, such as twisted bilayer graphene featuring moiré superlattices, provide a versatile platform for exploring novel physical properties. Here, we discover a class of superflat bands in general twisted bilayer systems beyond the low-energy physics of magic-angle twisted counterparts. By considering continuous lattice dislocation, we obtain intrinsic localized states, which are spectrally isolated at lowest and highest energies and spatially centered around the AA stacked region, governed by the macroscopic effective energy potential well. Such localized states exhibit negligible inter-cell coupling and support the formation of superflat bands in a wide and continuous parameter space, which can be mimicked using a twisted bilayer nanophotonic system. Our finding suggests that general twisted bilayer systems can realize continuously tunable superflat bands and the corresponding localized states for various photonic, phononic and mechanical waves.
△ Less
Submitted 2 January, 2022;
originally announced January 2022.
-
Phyllotaxis-inspired Nanosieves with Multiplexed Orbital Angular Momentum
Authors:
Zhongwei **,
David Janoschka,
Junhong Deng,
Lin Ge,
Pascal Dreher,
Bettina Frank,
Guangwei Hu,
**cheng Ni,
Yuanjie Yang,
**g Li,
Changyuan Yu,
Dangyuan Lei,
Guixin Li,
Shumin Xiao1,
Shengtao Mei,
Harald Giessen,
Frank Meyer zu Heringdorf,
Cheng-Wei Qiu
Abstract:
Nanophotonic platforms such as metasurfaces, achieving arbitrary phase profiles within ultrathin thickness, emerge as miniaturized, ultracompact and kaleidoscopic optical vortex generators. However, it is often required to segment or interleave independent subarray metasurfaces to multiplex optical vortices in a single nano device, which in turn affects the compactness and channel capacity of the…
▽ More
Nanophotonic platforms such as metasurfaces, achieving arbitrary phase profiles within ultrathin thickness, emerge as miniaturized, ultracompact and kaleidoscopic optical vortex generators. However, it is often required to segment or interleave independent subarray metasurfaces to multiplex optical vortices in a single nano device, which in turn affects the compactness and channel capacity of the device. Here, inspired by phyllotaxis patterns in pine cones and sunflowers, we theoretically prove and experimentally report that multiple optical vortices can be produced in a single compact phyllotaxis nanosieve, both in free space and on a chip, where one metaatom may contribute to many vortices simultaneously. The time resolved dynamics of on chip interference wavefronts between multiple plasmonic vortices was revealed by ultrafast time-resolved photoemission electron microscopy. Our nature inspired optical vortex generator would facilitate various vortex related optical applications, including structured wavefront sha**, free space and plasmonic vortices, and high capacity information metaphotonics.
△ Less
Submitted 4 September, 2021;
originally announced September 2021.
-
Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation
Authors:
Yuning Mao,
Wenchang Ma,
Deren Lei,
Jiawei Han,
Xiang Ren
Abstract:
Prior studies on text-to-text generation typically assume that the model could figure out what to attend to in the input and what to include in the output via seq2seq learning, with only the parallel training data and no additional guidance. However, it remains unclear whether current models can preserve important concepts in the source input, as seq2seq learning does not have explicit focus on th…
▽ More
Prior studies on text-to-text generation typically assume that the model could figure out what to attend to in the input and what to include in the output via seq2seq learning, with only the parallel training data and no additional guidance. However, it remains unclear whether current models can preserve important concepts in the source input, as seq2seq learning does not have explicit focus on the concepts and commonly used evaluation metrics also treat concepts equally important as other tokens. In this paper, we present a systematic analysis that studies whether current seq2seq models, especially pre-trained language models, are good enough for preserving important input concepts and to what extent explicitly guiding generation with the concepts as lexical constraints is beneficial. We answer the above questions by conducting extensive analytical experiments on four representative text-to-text generation tasks. Based on the observations, we then propose a simple yet effective framework to automatically extract, denoise, and enforce important input concepts as lexical constraints. This new method performs comparably or better than its unconstrained counterpart on automatic metrics, demonstrates higher coverage for concept preservation, and receives better ratings in the human evaluation. Our code is available at https://github.com/morningmoni/EDE.
△ Less
Submitted 2 September, 2021; v1 submitted 18 April, 2021;
originally announced April 2021.
-
Learning to Reason in Round-based Games: Multi-task Sequence Generation for Purchasing Decision Making in First-person Shooters
Authors:
Yilei Zeng,
Deren Lei,
Beichen Li,
Gangrong Jiang,
Emilio Ferrara,
Michael Zyda
Abstract:
Sequential reasoning is a complex human ability, with extensive previous research focusing on gaming AI in a single continuous game, round-based decision makings extending to a sequence of games remain less explored. Counter-Strike: Global Offensive (CS:GO), as a round-based game with abundant expert demonstrations, provides an excellent environment for multi-player round-based sequential reasonin…
▽ More
Sequential reasoning is a complex human ability, with extensive previous research focusing on gaming AI in a single continuous game, round-based decision makings extending to a sequence of games remain less explored. Counter-Strike: Global Offensive (CS:GO), as a round-based game with abundant expert demonstrations, provides an excellent environment for multi-player round-based sequential reasoning. In this work, we propose a Sequence Reasoner with Round Attribute Encoder and Multi-Task Decoder to interpret the strategies behind the round-based purchasing decisions. We adopt few-shot learning to sample multiple rounds in a match, and modified model agnostic meta-learning algorithm Reptile for the meta-learning loop. We formulate each round as a multi-task sequence generation problem. Our state representations combine action encoder, team encoder, player features, round attribute encoder, and economy encoders to help our agent learn to reason under this specific multi-player round-based scenario. A complete ablation study and comparison with the greedy approach certify the effectiveness of our model. Our research will open doors for interpretable AI for understanding episodic and long-term purchasing strategies beyond the gaming community.
△ Less
Submitted 12 August, 2020;
originally announced August 2020.
-
Learning Collaborative Agents with Rule Guidance for Knowledge Graph Reasoning
Authors:
Deren Lei,
Gangrong Jiang,
Xiaotao Gu,
Kexuan Sun,
Yuning Mao,
Xiang Ren
Abstract:
Walk-based models have shown their advantages in knowledge graph (KG) reasoning by achieving decent performance while providing interpretable decisions. However, the sparse reward signals offered by the KG during traversal are often insufficient to guide a sophisticated walk-based reinforcement learning (RL) model. An alternate approach is to use traditional symbolic methods (e.g., rule induction)…
▽ More
Walk-based models have shown their advantages in knowledge graph (KG) reasoning by achieving decent performance while providing interpretable decisions. However, the sparse reward signals offered by the KG during traversal are often insufficient to guide a sophisticated walk-based reinforcement learning (RL) model. An alternate approach is to use traditional symbolic methods (e.g., rule induction), which achieve good performance but can be hard to generalize due to the limitation of symbolic representation. In this paper, we propose RuleGuider, which leverages high-quality rules generated by symbolic-based methods to provide reward supervision for walk-based agents. Experiments on benchmark datasets show that RuleGuider improves the performance of walk-based models without losing interpretability.
△ Less
Submitted 6 October, 2020; v1 submitted 1 May, 2020;
originally announced May 2020.
-
HiExpan: Task-Guided Taxonomy Construction by Hierarchical Tree Expansion
Authors:
Jiaming Shen,
Zeqiu Wu,
Dongming Lei,
Chao Zhang,
Xiang Ren,
Michelle T. Vanni,
Brian M. Sadler,
Jiawei Han
Abstract:
Taxonomies are of great value to many knowledge-rich applications. As the manual taxonomy curation costs enormous human effects, automatic taxonomy construction is in great demand. However, most existing automatic taxonomy construction methods can only build hypernymy taxonomies wherein each edge is limited to expressing the "is-a" relation. Such a restriction limits their applicability to more di…
▽ More
Taxonomies are of great value to many knowledge-rich applications. As the manual taxonomy curation costs enormous human effects, automatic taxonomy construction is in great demand. However, most existing automatic taxonomy construction methods can only build hypernymy taxonomies wherein each edge is limited to expressing the "is-a" relation. Such a restriction limits their applicability to more diverse real-world tasks where the parent-child may carry different relations. In this paper, we aim to construct a task-guided taxonomy from a domain-specific corpus and allow users to input a "seed" taxonomy, serving as the task guidance. We propose an expansion-based taxonomy construction framework, namely HiExpan, which automatically generates key term list from the corpus and iteratively grows the seed taxonomy. Specifically, HiExpan views all children under each taxonomy node forming a coherent set and builds the taxonomy by recursively expanding all these sets. Furthermore, HiExpan incorporates a weakly-supervised relation extraction module to extract the initial children of a newly-expanded node and adjusts the taxonomy tree by optimizing its global structure. Our experiments on three real datasets from different domains demonstrate the effectiveness of HiExpan for building task-guided taxonomies.
△ Less
Submitted 17 October, 2019;
originally announced October 2019.
-
SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble
Authors:
Jiaming Shen,
Zeqiu Wu,
Dongming Lei,
**gbo Shang,
Xiang Ren,
Jiawei Han
Abstract:
Corpus-based set expansion (i.e., finding the "complete" set of entities belonging to the same semantic class, based on a given corpus and a tiny set of seeds) is a critical task in knowledge discovery. It may facilitate numerous downstream applications, such as information extraction, taxonomy induction, question answering, and web search. To discover new entities in an expanded set, previous app…
▽ More
Corpus-based set expansion (i.e., finding the "complete" set of entities belonging to the same semantic class, based on a given corpus and a tiny set of seeds) is a critical task in knowledge discovery. It may facilitate numerous downstream applications, such as information extraction, taxonomy induction, question answering, and web search. To discover new entities in an expanded set, previous approaches either make one-time entity ranking based on distributional similarity, or resort to iterative pattern-based bootstrap**. The core challenge for these methods is how to deal with noisy context features derived from free-text corpora, which may lead to entity intrusion and semantic drifting. In this study, we propose a novel framework, SetExpan, which tackles this problem, with two techniques: (1) a context feature selection method that selects clean context features for calculating entity-entity distributional similarity, and (2) a ranking-based unsupervised ensemble method for expanding entity set based on denoised context features. Experiments on three datasets show that SetExpan is robust and outperforms previous state-of-the-art methods in terms of mean average precision.
△ Less
Submitted 17 October, 2019;
originally announced October 2019.
-
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstractive Summarization
Authors:
Siyao Li,
Deren Lei,
Pengda Qin,
William Yang Wang
Abstract:
Deep reinforcement learning (RL) has been a commonly-used strategy for the abstractive summarization task to address both the exposure bias and non-differentiable task issues. However, the conventional reward Rouge-L simply looks for exact n-grams matches between candidates and annotated references, which inevitably makes the generated sentences repetitive and incoherent. In this paper, instead of…
▽ More
Deep reinforcement learning (RL) has been a commonly-used strategy for the abstractive summarization task to address both the exposure bias and non-differentiable task issues. However, the conventional reward Rouge-L simply looks for exact n-grams matches between candidates and annotated references, which inevitably makes the generated sentences repetitive and incoherent. In this paper, instead of Rouge-L, we explore the practicability of utilizing the distributional semantics to measure the matching degrees. With distributional semantics, sentence-level evaluation can be obtained, and semantically-correct phrases can also be generated without being limited to the surface form of the reference sentences. Human judgments on Gigaword and CNN/Daily Mail datasets show that our proposed distributional semantics reward (DSR) has distinct superiority in capturing the lexical and compositional diversity of natural language.
△ Less
Submitted 10 September, 2019; v1 submitted 31 August, 2019;
originally announced September 2019.
-
Imposing Label-Relational Inductive Bias for Extremely Fine-Grained Entity Ty**
Authors:
Wenhan Xiong,
Jiawei Wu,
Deren Lei,
Mo Yu,
Shiyu Chang,
Xiaoxiao Guo,
William Yang Wang
Abstract:
Existing entity ty** systems usually exploit the type hierarchy provided by knowledge base (KB) schema to model label correlations and thus improve the overall performance. Such techniques, however, are not directly applicable to more open and practical scenarios where the type set is not restricted by KB schema and includes a vast number of free-form types. To model the underly-ing label correl…
▽ More
Existing entity ty** systems usually exploit the type hierarchy provided by knowledge base (KB) schema to model label correlations and thus improve the overall performance. Such techniques, however, are not directly applicable to more open and practical scenarios where the type set is not restricted by KB schema and includes a vast number of free-form types. To model the underly-ing label correlations without access to manually annotated label structures, we introduce a novel label-relational inductive bias, represented by a graph propagation layer that effectively encodes both global label co-occurrence statistics and word-level similarities.On a large dataset with over 10,000 free-form types, the graph-enhanced model equipped with an attention-based matching module is able to achieve a much higher recall score while maintaining a high-level precision. Specifically, it achieves a 15.3% relative F1 improvement and also less inconsistency in the outputs. We further show that a simple modification of our proposed graph layer can also improve the performance on a conventional and widely-tested dataset that only includes KB-schema types.
△ Less
Submitted 6 March, 2019;
originally announced March 2019.
-
Implicit Regularization of Stochastic Gradient Descent in Natural Language Processing: Observations and Implications
Authors:
Deren Lei,
Zichen Sun,
Yijun Xiao,
William Yang Wang
Abstract:
Deep neural networks with remarkably strong generalization performances are usually over-parameterized. Despite explicit regularization strategies are used for practitioners to avoid over-fitting, the impacts are often small. Some theoretical studies have analyzed the implicit regularization effect of stochastic gradient descent (SGD) on simple machine learning models with certain assumptions. How…
▽ More
Deep neural networks with remarkably strong generalization performances are usually over-parameterized. Despite explicit regularization strategies are used for practitioners to avoid over-fitting, the impacts are often small. Some theoretical studies have analyzed the implicit regularization effect of stochastic gradient descent (SGD) on simple machine learning models with certain assumptions. However, how it behaves practically in state-of-the-art models and real-world datasets is still unknown. To bridge this gap, we study the role of SGD implicit regularization in deep learning systems. We show pure SGD tends to converge to minimas that have better generalization performances in multiple natural language processing (NLP) tasks. This phenomenon coexists with dropout, an explicit regularizer. In addition, neural network's finite learning capability does not impact the intrinsic nature of SGD's implicit regularization effect. Specifically, under limited training samples or with certain corrupted labels, the implicit regularization effect remains strong. We further analyze the stability by varying the weight initialization range. We corroborate these experimental findings with a decision boundary visualization using a 3-layer neural network for interpretation. Altogether, our work enables a deepened understanding on how implicit regularization affects the deep learning model and sheds light on the future study of the over-parameterized model's generalization ability.
△ Less
Submitted 1 November, 2018;
originally announced November 2018.
-
Opening the black box of deep learning
Authors:
Dian Lei,
Xiaoxiao Chen,
Jianfei Zhao
Abstract:
The great success of deep learning shows that its technology contains profound truth, and understanding its internal mechanism not only has important implications for the development of its technology and effective application in various fields, but also provides meaningful insights into the understanding of human brain mechanism. At present, most of the theoretical research on deep learning is ba…
▽ More
The great success of deep learning shows that its technology contains profound truth, and understanding its internal mechanism not only has important implications for the development of its technology and effective application in various fields, but also provides meaningful insights into the understanding of human brain mechanism. At present, most of the theoretical research on deep learning is based on mathematics. This dissertation proposes that the neural network of deep learning is a physical system, examines deep learning from three different perspectives: microscopic, macroscopic, and physical world views, answers multiple theoretical puzzles in deep learning by using physics principles. For example, from the perspective of quantum mechanics and statistical physics, this dissertation presents the calculation methods for convolution calculation, pooling, normalization, and Restricted Boltzmann Machine, as well as the selection of cost functions, explains why deep learning must be deep, what characteristics are learned in deep learning, why Convolutional Neural Networks do not have to be trained layer by layer, and the limitations of deep learning, etc., and proposes the theoretical direction and basis for the further development of deep learning now and in the future. The brilliance of physics flashes in deep learning, we try to establish the deep learning technology based on the scientific theory of physics.
△ Less
Submitted 21 May, 2018;
originally announced May 2018.
-
Attosecond streaking of photoelectron emission from disordered solids
Authors:
W. A. Okell,
T. Witting,
D. Fabris,
C. A. Arrell,
J. Hengster,
S. Ibrahimkutty,
A. Seiler,
M. Barthelmess,
S. Stankov,
D. Y. Lei,
Y. Sonnefraud,
M. Rahmani,
Th. Uphues,
S. A. Maier,
J. P. Marangos,
J. W. G. Tisch
Abstract:
Attosecond streaking of photoelectrons emitted by extreme ultraviolet light has begun to reveal how electrons behave during their transport within simple crystalline solids. Many sample types within nanoplasmonics, thin-film physics, and semiconductor physics, however, do not have a simple single crystal structure. The electron dynamics which underpin the optical response of plasmonic nanostructur…
▽ More
Attosecond streaking of photoelectrons emitted by extreme ultraviolet light has begun to reveal how electrons behave during their transport within simple crystalline solids. Many sample types within nanoplasmonics, thin-film physics, and semiconductor physics, however, do not have a simple single crystal structure. The electron dynamics which underpin the optical response of plasmonic nanostructures and wide-bandgap semiconductors happen on an attosecond timescale. Measuring these dynamics using attosecond streaking will enable such systems to be specially tailored for applications in areas such as ultrafast opto-electronics. We show that streaking can be extended to this very general type of sample by presenting streaking measurements on an amorphous film of the wide-bandgap semiconductor tungsten trioxide, and on polycrystalline gold, a material that forms the basis of many nanoplasmonic devices. Our measurements reveal the near-field temporal structure at the sample surface, and photoelectron wavepacket temporal broadening consistent with a spread of electron transport times to the surface.
△ Less
Submitted 21 October, 2014;
originally announced October 2014.
-
Experimental demonstration of light capsule embracing super-sized darkness inside via anti-resolution
Authors:
Chao Wan,
Kun Huang,
Tiancheng Han,
Eunice Leong,
Weiqiang Ding,
Tat-Soon Yeo,
Xia Yu,
**ghua Teng,
Dang Yuan Lei,
Stefan A. Maier,
Shuang Zhang,
Cheng-Wei Qiu
Abstract:
We theoretically and experimentally demonstrate the focusing of macroscopic 3D darkness surrounded by all light in free space. The object staying in the darkness is similar to staying in an empty light capsule because light just bypasses it by resorting to destructive interference. Its functionality of controlling the direction of energy flux of light macroscopically is fascinating, similar in som…
▽ More
We theoretically and experimentally demonstrate the focusing of macroscopic 3D darkness surrounded by all light in free space. The object staying in the darkness is similar to staying in an empty light capsule because light just bypasses it by resorting to destructive interference. Its functionality of controlling the direction of energy flux of light macroscopically is fascinating, similar in some sense to the transformation-based cloaking effect. Binary-optical system exhibiting anti-resolution (AR) is designed and fabricated, by which electromagnetic energy flux avoids and bends smoothly around a nearly perfect darkness region. AR remains an unexplored topic hitherto, in contrast to the super-resolution for realizing high spatial resolution. This novel scheme replies on smearing out the PSF and thus poses less stringent limitations upon the object's size and position since the created dark (zero-field) area reach 8 orders of magnitude larger than the square of wavelength in size. It functions very well at arbitrarily polarized beams in three dimensions, which is also frequency-scalable in the whole electromagnetic spectrum.
△ Less
Submitted 9 December, 2013; v1 submitted 29 November, 2013;
originally announced December 2013.
-
One-loop QCD and electroweak corrections to $t\bar{t}Z^0$ production at an $e^+e^-$ linear collider
Authors:
Dai Lei,
Ma Wen-Gan,
Zhang Ren-You,
Guo Lei,
Wang Shao-Ming
Abstract:
We study the impact of the full ${\cal O}(α_{s})$ QCD and ${\cal O}(α_{ew})$ electroweak (EW) radiative corrections to the $e^+e^- \to t \bar t Z^0$ process in the standard model (SM), and investigate the dependence of the lowest-order(LO), one-loop QCD and EW corrected cross sections on colliding energy $\sqrt{s}$. The LO, QCD and EW corrected spectrums of the invariant mass of $t\bar t$-pair a…
▽ More
We study the impact of the full ${\cal O}(α_{s})$ QCD and ${\cal O}(α_{ew})$ electroweak (EW) radiative corrections to the $e^+e^- \to t \bar t Z^0$ process in the standard model (SM), and investigate the dependence of the lowest-order(LO), one-loop QCD and EW corrected cross sections on colliding energy $\sqrt{s}$. The LO, QCD and EW corrected spectrums of the invariant mass of $t\bar t$-pair and the distributions of the transverse momenta of final top-quark and $Z^0$-boson are presented. The numerical results show that the one-loop QCD correction enhances the LO cross section, but the EW one-loop correction generally suppresses the LO cross section with our chosen parameters. In the case of $m_H=120 GeV$, the QCD relative corrections can reach 43.16% when $\sqrt{s} = 500 GeV$, while the EW relative corrections have the values of -9.24%, -4.36% and -5.81%, when $\sqrt{s} = 500 GeV$, $800 GeV$, 1.2 TeV$, respectively.
△ Less
Submitted 26 December, 2009; v1 submitted 23 October, 2008;
originally announced October 2008.