-
WuYun: Exploring hierarchical skeleton-guided melody generation using knowledge-enhanced deep learning
Authors:
Kejun Zhang,
Xinda Wu,
Tieyao Zhang,
Zhijie Huang,
Xu Tan,
Qihao Liang,
Songruoyao Wu,
Lingyun Sun
Abstract:
Although deep learning has revolutionized music generation, existing methods for structured melody generation follow an end-to-end left-to-right note-by-note generative paradigm and treat each note equally. Here, we present WuYun, a knowledge-enhanced deep learning architecture for improving the structure of generated melodies, which first generates the most structurally important notes to constru…
▽ More
Although deep learning has revolutionized music generation, existing methods for structured melody generation follow an end-to-end left-to-right note-by-note generative paradigm and treat each note equally. Here, we present WuYun, a knowledge-enhanced deep learning architecture for improving the structure of generated melodies, which first generates the most structurally important notes to construct a melodic skeleton and subsequently infills it with dynamically decorative notes into a full-fledged melody. Specifically, we use music domain knowledge to extract melodic skeletons and employ sequence learning to reconstruct them, which serve as additional knowledge to provide auxiliary guidance for the melody generation process. We demonstrate that WuYun can generate melodies with better long-term structure and musicality and outperforms other state-of-the-art methods by 0.51 on average on all subjective evaluation metrics. Our study provides a multidisciplinary lens to design melodic hierarchical structures and bridge the gap between data-driven and knowledge-based approaches for numerous music generation tasks.
△ Less
Submitted 14 March, 2023; v1 submitted 11 January, 2023;
originally announced January 2023.
-
Circularly Polarized Lasing from a Microcavity Filled with Achiral Single-Crystalline Microribbons
Authors:
Qian Liang,
Xuekai Ma,
Teng Long,
Jiannian Yao,
Qing Liao,
Hongbing Fu
Abstract:
Organic circularly polarized (CP) lasers have received increasing attention due to their future photoelectric applications. Here, we demonstrate a CP laser from a pure organic crystal-filled microcavity without any chiral molecules or chiral structures. Benefited from the giant anisotropy and excellent laser gain of organic crystals, optical Rashba-Dresselhaus spin-orbit coupling effect can be ind…
▽ More
Organic circularly polarized (CP) lasers have received increasing attention due to their future photoelectric applications. Here, we demonstrate a CP laser from a pure organic crystal-filled microcavity without any chiral molecules or chiral structures. Benefited from the giant anisotropy and excellent laser gain of organic crystals, optical Rashba-Dresselhaus spin-orbit coupling effect can be induced and is conductive to the CP laser in such microcavities. The maximum dissymmetry factor of the CP lasing with opposite helicities reached, is as high as 1.2. Our strategy may provide a new idea for the design of CP lasers towards future 3D laser displays, information storage and other fields.
△ Less
Submitted 23 November, 2022;
originally announced November 2022.
-
Ecovisor: A Virtual Energy System for Carbon-Efficient Applications
Authors:
Abel Souza,
Noman Bashir,
Jorge Murillo,
Walid Hanafy,
Qianlin Liang,
David Irwin,
Prashant Shenoy
Abstract:
Cloud platforms' rapid growth is raising significant concerns about their carbon emissions. To reduce emissions, future cloud platforms will need to increase their reliance on renewable energy sources, such as solar and wind, which have zero emissions but are highly unreliable. Unfortunately, today's energy systems effectively mask this unreliability in hardware, which prevents applications from o…
▽ More
Cloud platforms' rapid growth is raising significant concerns about their carbon emissions. To reduce emissions, future cloud platforms will need to increase their reliance on renewable energy sources, such as solar and wind, which have zero emissions but are highly unreliable. Unfortunately, today's energy systems effectively mask this unreliability in hardware, which prevents applications from optimizing their carbon-efficiency, or work done per kilogram of carbon emitted. To address this problem, we design an "ecovisor", which virtualizes the energy system and exposes software-defined control of it to applications. An ecovisor enables each application to handle clean energy's unreliability in software based on its own specific requirements. We implement a small-scale ecovisor prototype that virtualizes a physical energy system to enable software-based application-level i) visibility into variable grid carbon-intensity and renewable generation and ii) control of server power usage and battery charging/discharging. We evaluate the ecovisor approach by showing how multiple applications can concurrently exercise their virtual energy system in different ways to better optimize carbon-efficiency based on their specific requirements compared to a general system-wide policy.
△ Less
Submitted 10 October, 2022;
originally announced October 2022.
-
On the EFT of Conformal Symmetry Breaking
Authors:
Kurt Hinterbichler,
Qiuyue Liang,
Mark Trodden
Abstract:
Conformal symmetry can be spontaneously broken due to the presence of a defect or other background, which gives a symmetry-breaking vacuum expectation value (VEV) to some scalar operators. We study the effective field theory of fluctuations around these backgrounds, showing that it organizes as an expansion in powers of the inverse of the VEV, and computing some of the leading corrections. We focu…
▽ More
Conformal symmetry can be spontaneously broken due to the presence of a defect or other background, which gives a symmetry-breaking vacuum expectation value (VEV) to some scalar operators. We study the effective field theory of fluctuations around these backgrounds, showing that it organizes as an expansion in powers of the inverse of the VEV, and computing some of the leading corrections. We focus on the case of space-like defects in a four-dimensional Lorentzian theory relevant to the pseudo-conformal universe scenario, although the conclusions extend to other kinds of defects and to the breaking of conformal symmetry to Poincaré symmetry.
△ Less
Submitted 3 October, 2022;
originally announced October 2022.
-
SongDriver: Real-time Music Accompaniment Generation without Logical Latency nor Exposure Bias
Authors:
Zihao Wang,
Qihao Liang,
Kejun Zhang,
Yuxing Wang,
Chen Zhang,
Pengfei Yu,
Yongsheng Feng,
Wenbo Liu,
Yikai Wang,
Yuntai Bao,
Yiheng Yang
Abstract:
Real-time music accompaniment generation has a wide range of applications in the music industry, such as music education and live performances. However, automatic real-time music accompaniment generation is still understudied and often faces a trade-off between logical latency and exposure bias. In this paper, we propose SongDriver, a real-time music accompaniment generation system without logical…
▽ More
Real-time music accompaniment generation has a wide range of applications in the music industry, such as music education and live performances. However, automatic real-time music accompaniment generation is still understudied and often faces a trade-off between logical latency and exposure bias. In this paper, we propose SongDriver, a real-time music accompaniment generation system without logical latency nor exposure bias. Specifically, SongDriver divides one accompaniment generation task into two phases: 1) The arrangement phase, where a Transformer model first arranges chords for input melodies in real-time, and caches the chords for the next phase instead of playing them out. 2) The prediction phase, where a CRF model generates playable multi-track accompaniments for the coming melodies based on previously cached chords. With this two-phase strategy, SongDriver directly generates the accompaniment for the upcoming melody, achieving zero logical latency. Furthermore, when predicting chords for a timestep, SongDriver refers to the cached chords from the first phase rather than its previous predictions, which avoids the exposure bias problem. Since the input length is often constrained under real-time conditions, another potential problem is the loss of long-term sequential information. To make up for this disadvantage, we extract four musical features from a long-term music piece before the current time step as global information. In the experiment, we train SongDriver on some open-source datasets and an original àiSong Dataset built from Chinese-style modern pop music scores. The results show that SongDriver outperforms existing SOTA (state-of-the-art) models on both objective and subjective metrics, meanwhile significantly reducing the physical latency.
△ Less
Submitted 13 October, 2022; v1 submitted 13 September, 2022;
originally announced September 2022.
-
A Language Agnostic Multilingual Streaming On-Device ASR System
Authors:
Bo Li,
Tara N. Sainath,
Ruoming Pang,
Shuo-yiin Chang,
Qiumin Xu,
Trevor Strohman,
Vince Chen,
Qiao Liang,
Heguang Liu,
Yanzhang He,
Parisa Haghani,
Sameer Bidichandani
Abstract:
On-device end-to-end (E2E) models have shown improvements over a conventional model on English Voice Search tasks in both quality and latency. E2E models have also shown promising results for multilingual automatic speech recognition (ASR). In this paper, we extend our previous capacity solution to streaming applications and present a streaming multilingual E2E ASR system that runs fully on device…
▽ More
On-device end-to-end (E2E) models have shown improvements over a conventional model on English Voice Search tasks in both quality and latency. E2E models have also shown promising results for multilingual automatic speech recognition (ASR). In this paper, we extend our previous capacity solution to streaming applications and present a streaming multilingual E2E ASR system that runs fully on device with comparable quality and latency to individual monolingual models. To achieve that, we propose an Encoder Endpointer model and an End-of-Utterance (EOU) Joint Layer for a better quality and latency trade-off. Our system is built in a language agnostic manner allowing it to natively support intersentential code switching in real time. To address the feasibility concerns on large models, we conducted on-device profiling and replaced the time consuming LSTM decoder with the recently developed Embedding decoder. With these changes, we managed to run such a system on a mobile device in less than real time.
△ Less
Submitted 29 August, 2022;
originally announced August 2022.
-
Streaming Intended Query Detection using E2E Modeling for Continued Conversation
Authors:
Shuo-yiin Chang,
Guru Prakash,
Zelin Wu,
Qiao Liang,
Tara N. Sainath,
Bo Li,
Adam Stambler,
Shyam Upadhyay,
Manaal Faruqui,
Trevor Strohman
Abstract:
In voice-enabled applications, a predetermined hotword isusually used to activate a device in order to attend to the query.However, speaking queries followed by a hotword each timeintroduces a cognitive burden in continued conversations. Toavoid repeating a hotword, we propose a streaming end-to-end(E2E) intended query detector that identifies the utterancesdirected towards the device and filters…
▽ More
In voice-enabled applications, a predetermined hotword isusually used to activate a device in order to attend to the query.However, speaking queries followed by a hotword each timeintroduces a cognitive burden in continued conversations. Toavoid repeating a hotword, we propose a streaming end-to-end(E2E) intended query detector that identifies the utterancesdirected towards the device and filters out other utterancesnot directed towards device. The proposed approach incor-porates the intended query detector into the E2E model thatalready folds different components of the speech recognitionpipeline into one neural network.The E2E modeling onspeech decoding and intended query detection also allows us todeclare a quick intended query detection based on early partialrecognition result, which is important to decrease latencyand make the system responsive. We demonstrate that theproposed E2E approach yields a 22% relative improvement onequal error rate (EER) for the detection accuracy and 600 mslatency improvement compared with an independent intendedquery detector. In our experiment, the proposed model detectswhether the user is talking to the device with a 8.7% EERwithin 1.4 seconds of median latency after user starts speaking.
△ Less
Submitted 28 August, 2022;
originally announced August 2022.
-
Turn-Taking Prediction for Natural Conversational Speech
Authors:
Shuo-yiin Chang,
Bo Li,
Tara N. Sainath,
Chao Zhang,
Trevor Strohman,
Qiao Liang,
Yanzhang He
Abstract:
While a streaming voice assistant system has been used in many applications, this system typically focuses on unnatural, one-shot interactions assuming input from a single voice query without hesitation or disfluency. However, a common conversational utterance often involves multiple queries with turn-taking, in addition to disfluencies. These disfluencies include pausing to think, hesitations, wo…
▽ More
While a streaming voice assistant system has been used in many applications, this system typically focuses on unnatural, one-shot interactions assuming input from a single voice query without hesitation or disfluency. However, a common conversational utterance often involves multiple queries with turn-taking, in addition to disfluencies. These disfluencies include pausing to think, hesitations, word lengthening, filled pauses and repeated phrases. This makes doing speech recognition with conversational speech, including one with multiple queries, a challenging task. To better model the conversational interaction, it is critical to discriminate disfluencies and end of query in order to allow the user to hold the floor for disfluencies while having the system respond as quickly as possible when the user has finished speaking. In this paper, we present a turntaking predictor built on top of the end-to-end (E2E) speech recognizer. Our best system is obtained by jointly optimizing for ASR task and detecting when the user is paused to think or finished speaking. The proposed approach demonstrates over 97% recall rate and 85% precision rate on predicting true turn-taking with only 100 ms latency on a test set designed with 4 types of disfluencies inserted in conversational utterances.
△ Less
Submitted 28 August, 2022;
originally announced August 2022.
-
Applying the Estimand and Target Trial frameworks to external control analyses using observational data: a case study in the solid tumor setting
Authors:
Letizia Polito,
Qixing Liang,
Navdeep Pal,
Philani Mpofu,
Ahmed Sawas,
Olivier Humblet,
Kaspar Rufibach,
Dominik Heinzmann
Abstract:
In causal inference, the correct formulation of the scientific question of interest is a crucial step. Here we apply the estimand framework to a comparison of the outcomes of patient-level clinical trials and observational data to help structure the clinical question. In addition, we complement the estimand framework with the target trial framework to address specific issues in defining the estima…
▽ More
In causal inference, the correct formulation of the scientific question of interest is a crucial step. Here we apply the estimand framework to a comparison of the outcomes of patient-level clinical trials and observational data to help structure the clinical question. In addition, we complement the estimand framework with the target trial framework to address specific issues in defining the estimand attributes using observational data and discuss synergies and differences of the two frameworks. Whereas the estimand framework proves useful to address the challenge that in clinical trials and routine clinical practice patients may switch to subsequent systemic therapies after the initially assigned systematic treatment, the target trial framework supports addressing challenges around baseline confounding and the index date. We apply the combined framework to compare long-term outcomes of a pooled set of three previously reported randomized phase 3 trials studying patients with metastatic non-small cell lung cancer receiving front-line chemotherapy (randomized clinical trial cohort) and similar patients treated with front-line chemotherapy as part of routine clinical care (observational comparative cohort). We illustrate the process to define the estimand attributes and select the estimator to estimate the estimand of interest while accounting for key baseline confounders, index date, and receipt of subsequent therapies. The proposed combined framework provides more clarity on the causal contrast of interest and the estimator to adopt and thus facilitates design and interpretation of the analyses.
△ Less
Submitted 13 August, 2022;
originally announced August 2022.
-
Aharonov-Bohm Caging and Inverse Anderson transition in Ultracold Atoms
Authors:
Hang Li,
Zhaoli Dong,
Stefano Longhi,
Qian Liang,
Dizhou Xie,
Bo Yan
Abstract:
Aharonov-Bohm (AB) caging, a special flat-band localization mechanism, has spurred great interest in different areas of physics. AB caging can be harnessed to explore the rich and exotic physics of quantum transport in flatband systems, where geometric frustration, disorder and correlations act in a synergetic and distinct way than in ordinary dispersive band systems. In contrast to the ordinary A…
▽ More
Aharonov-Bohm (AB) caging, a special flat-band localization mechanism, has spurred great interest in different areas of physics. AB caging can be harnessed to explore the rich and exotic physics of quantum transport in flatband systems, where geometric frustration, disorder and correlations act in a synergetic and distinct way than in ordinary dispersive band systems. In contrast to the ordinary Anderson localization, where disorder induces localization and prevents transport, in flat band systems disorder can induce mobility, a phenomenon dubbed inverse Anderson transition. Here, we report on the experimental realization of the AB cage using a synthehtic lattice in the momentum space of ultracold atoms with tailored gauge fields, demonstrate the geometric localization due to the flat band and the inverse Anderson transition when correlated binary disorder is added to the system. Our experimental platform in a many-body environment provides a fashiinating quantum simulator where the interplay between engineered gauge fields, localization, and topological properties of flat band systems can be finely explored.
△ Less
Submitted 4 August, 2022;
originally announced August 2022.
-
KD-SCFNet: Towards More Accurate and Efficient Salient Object Detection via Knowledge Distillation
Authors:
** Zhang,
Qiuwei Liang,
Yanjiao Shi
Abstract:
Most existing salient object detection (SOD) models are difficult to apply due to the complex and huge model structures. Although some lightweight models are proposed, the accuracy is barely satisfactory. In this paper, we design a novel semantics-guided contextual fusion network (SCFNet) that focuses on the interactive fusion of multi-level features for accurate and efficient salient object detec…
▽ More
Most existing salient object detection (SOD) models are difficult to apply due to the complex and huge model structures. Although some lightweight models are proposed, the accuracy is barely satisfactory. In this paper, we design a novel semantics-guided contextual fusion network (SCFNet) that focuses on the interactive fusion of multi-level features for accurate and efficient salient object detection. Furthermore, we apply knowledge distillation to SOD task and provide a sizeable dataset KD-SOD80K. In detail, we transfer the rich knowledge from a seasoned teacher to the untrained SCFNet through unlabeled images, enabling SCFNet to learn a strong generalization ability to detect salient objects more accurately. The knowledge distillation based SCFNet (KDSCFNet) achieves comparable accuracy to the state-of-the-art heavyweight methods with less than 1M parameters and 174 FPS real-time detection speed. Extensive experiments demonstrate the robustness and effectiveness of the proposed distillation method and SOD framework. Code and data: https://github.com/zhang**CV/KD-SCFNet.
△ Less
Submitted 21 November, 2022; v1 submitted 3 August, 2022;
originally announced August 2022.
-
BrainCog: A Spiking Neural Network based Brain-inspired Cognitive Intelligence Engine for Brain-inspired AI and Brain Simulation
Authors:
Yi Zeng,
Dongcheng Zhao,
Feifei Zhao,
Guobin Shen,
Yiting Dong,
Enmeng Lu,
Qian Zhang,
Yinqian Sun,
Qian Liang,
Yuxuan Zhao,
Zhuoya Zhao,
Hongjian Fang,
Yuwei Wang,
Yang Li,
Xin Liu,
Chengcheng Du,
Qingqun Kong,
Zizhe Ruan,
Weida Bi
Abstract:
Spiking neural networks (SNNs) have attracted extensive attentions in Brain-inspired Artificial Intelligence and computational neuroscience. They can be used to simulate biological information processing in the brain at multiple scales. More importantly, SNNs serve as an appropriate level of abstraction to bring inspirations from brain and cognition to Artificial Intelligence. In this paper, we pr…
▽ More
Spiking neural networks (SNNs) have attracted extensive attentions in Brain-inspired Artificial Intelligence and computational neuroscience. They can be used to simulate biological information processing in the brain at multiple scales. More importantly, SNNs serve as an appropriate level of abstraction to bring inspirations from brain and cognition to Artificial Intelligence. In this paper, we present the Brain-inspired Cognitive Intelligence Engine (BrainCog) for creating brain-inspired AI and brain simulation models. BrainCog incorporates different types of spiking neuron models, learning rules, brain areas, etc., as essential modules provided by the platform. Based on these easy-to-use modules, BrainCog supports various brain-inspired cognitive functions, including Perception and Learning, Decision Making, Knowledge Representation and Reasoning, Motor Control, and Social Cognition. These brain-inspired AI models have been effectively validated on various supervised, unsupervised, and reinforcement learning tasks, and they can be used to enable AI models to be with multiple brain-inspired cognitive functions. For brain simulation, BrainCog realizes the function simulation of decision-making, working memory, the structure simulation of the Neural Circuit, and whole brain structure simulation of Mouse brain, Macaque brain, and Human brain. An AI engine named BORN is developed based on BrainCog, and it demonstrates how the components of BrainCog can be integrated and used to build AI models and applications. To enable the scientific quest to decode the nature of biological intelligence and create AI, BrainCog aims to provide essential and easy-to-use building blocks, and infrastructural support to develop brain-inspired spiking neural network based AI, and to simulate the cognitive brains at multiple scales. The online repository of BrainCog can be found at https://github.com/braincog-x.
△ Less
Submitted 11 July, 2023; v1 submitted 18 July, 2022;
originally announced July 2022.
-
TRIE++: Towards End-to-End Information Extraction from Visually Rich Documents
Authors:
Zhanzhan Cheng,
Peng Zhang,
Can Li,
Qiao Liang,
Yunlu Xu,
Pengfei Li,
Shiliang Pu,
Yi Niu,
Fei Wu
Abstract:
Recently, automatically extracting information from visually rich documents (e.g., tickets and resumes) has become a hot and vital research topic due to its widespread commercial value. Most existing methods divide this task into two subparts: the text reading part for obtaining the plain text from the original document images and the information extraction part for extracting key contents. These…
▽ More
Recently, automatically extracting information from visually rich documents (e.g., tickets and resumes) has become a hot and vital research topic due to its widespread commercial value. Most existing methods divide this task into two subparts: the text reading part for obtaining the plain text from the original document images and the information extraction part for extracting key contents. These methods mainly focus on improving the second, while neglecting that the two parts are highly correlated. This paper proposes a unified end-to-end information extraction framework from visually rich documents, where text reading and information extraction can reinforce each other via a well-designed multi-modal context block. Specifically, the text reading part provides multi-modal features like visual, textual and layout features. The multi-modal context block is developed to fuse the generated multi-modal features and even the prior knowledge from the pre-trained language model for better semantic representation. The information extraction part is responsible for generating key contents with the fused context features. The framework can be trained in an end-to-end trainable manner, achieving global optimization. What is more, we define and group visually rich documents into four categories across two dimensions, the layout and text type. For each document category, we provide or recommend the corresponding benchmarks, experimental settings and strong baselines for remedying the problem that this research area lacks the uniform evaluation standard. Extensive experiments on four kinds of benchmarks (from fixed layout to variable layout, from full-structured text to semi-unstructured text) are reported, demonstrating the proposed method's effectiveness. Data, source code and models are available.
△ Less
Submitted 14 July, 2022;
originally announced July 2022.
-
Efficient motional-mode characterization for high-fidelity trapped-ion quantum computing
Authors:
Mingyu Kang,
Qiyao Liang,
Ming Li,
Yunseong Nam
Abstract:
To achieve high-fidelity operations on a large-scale quantum computer, the parameters of the physical system must be efficiently characterized with high accuracy. For trapped ions, the entanglement between qubits are mediated by the motional modes of the ion chain, and thus characterizing the motional-mode parameters becomes essential. In this paper, we develop and explore physical models that acc…
▽ More
To achieve high-fidelity operations on a large-scale quantum computer, the parameters of the physical system must be efficiently characterized with high accuracy. For trapped ions, the entanglement between qubits are mediated by the motional modes of the ion chain, and thus characterizing the motional-mode parameters becomes essential. In this paper, we develop and explore physical models that accurately predict both magnitude and sign of the Lamb-Dicke parameters when the modes are probed {\it in parallel}. We further devise an advanced characterization protocol that shortens the characterization time by more than an order of magnitude, when compared to that of the conventional method that only uses mode spectroscopy. We discuss potential ramifications of our results to the development of a scalable trapped-ion quantum computer, viewed through the lens of system-level resource trade offs.
△ Less
Submitted 23 January, 2023; v1 submitted 8 June, 2022;
originally announced June 2022.
-
Collision-induced C_60 rovibrational relaxation probed by state-resolved nonlinear spectroscopy
Authors:
Lee R. Liu,
P. Bryan Changala,
Marissa L. Weichman,
Qizhong Liang,
Jutta Toscano,
Jacek Klos,
Svetlana Kotochigova,
David J. Nesbitt,
Jun Ye
Abstract:
Quantum state-resolved spectroscopy was recently achieved for C60 molecules when cooled by buffer gas collisions and probed with a midinfrared frequency comb. This rovibrational quantum state resolution for the largest molecule on record is facilitated by the remarkable symmetry and rigidity of C60, which also present new opportunities and challenges to explore energy transfer between quantum stat…
▽ More
Quantum state-resolved spectroscopy was recently achieved for C60 molecules when cooled by buffer gas collisions and probed with a midinfrared frequency comb. This rovibrational quantum state resolution for the largest molecule on record is facilitated by the remarkable symmetry and rigidity of C60, which also present new opportunities and challenges to explore energy transfer between quantum states in this many-atom system. Here we combine state-specific optical pum**, buffer gas collisions, and ultrasensitive intracavity nonlinear spectroscopy to initiate and probe the rotation-vibration energy transfer and relaxation. This approach provides the first detailed characterization of C60 collisional energy transfer for a variety of collision partners, and determines the rotational and vibrational inelastic collision cross sections. These results compare well with our theoretical modeling of the collisions, and establish a route towards quantum state control of a new class of unprecedentedly large molecules.
△ Less
Submitted 3 October, 2022; v1 submitted 6 June, 2022;
originally announced June 2022.
-
Catalytic growth of ultralong graphene nanoribbons on insulating substrates
Authors:
Bosai Lyu,
Jiajun Chen,
Shuo Lou,
Can Li,
Lu Qiu,
Wengen Ouyang,
**gxu Xie,
Izaac Mitchell,
Tongyao Wu,
Aolin Deng,
Cheng Hu,
Xianliang Zhou,
Peiyue Shen,
Saiqun Ma,
Zhenghan Wu,
Kenji Watanabe,
Takashi Taniguchi,
Xiaoqun Wang,
Qi Liang,
**feng Jia,
Michael Urbakh,
Oded Hod,
Feng Ding,
Shiyong Wang,
Zhiwen Shi
Abstract:
Graphene nanoribbons (GNRs) with widths of a few nanometres are promising candidates for future nano-electronic applications due to their structurally tunable bandgaps, ultrahigh carrier mobilities, and exceptional stability. However, the direct growth of micrometre-long GNRs on insulating substrates, which is essential for the fabrication of nano-electronic devices, remains an immense challenge.…
▽ More
Graphene nanoribbons (GNRs) with widths of a few nanometres are promising candidates for future nano-electronic applications due to their structurally tunable bandgaps, ultrahigh carrier mobilities, and exceptional stability. However, the direct growth of micrometre-long GNRs on insulating substrates, which is essential for the fabrication of nano-electronic devices, remains an immense challenge. Here, we report the epitaxial growth of GNRs on an insulating hexagonal boron nitride (h-BN) substrate through nanoparticle-catalysed chemical vapor deposition (CVD). Ultra-narrow GNRs with lengths of up to 10 μm are synthesized. Remarkably, the as-grown GNRs are crystallographically aligned with the h-BN substrate, forming one-dimensional (1D) moiré superlattices. Scanning tunnelling microscopy reveals an average width of 2 nm and a typical bandgap of ~1 eV for similar GNRs grown on conducting graphite substrates. Fully atomistic computational simulations support the experimental results and reveal a competition between the formation of GNRs and carbon nanotubes (CNTs) during the nucleation stage, and van der Waals sliding of the GNRs on the h-BN substrate throughout the growth stage. Our study provides a scalable, single-step method for growing micrometre-long narrow GNRs on insulating substrates, thus opening a route to explore the performance of high-quality GNR devices and the fundamental physics of 1D moiré superlattices.
△ Less
Submitted 27 May, 2022;
originally announced May 2022.
-
Divisible Codes for Quantum Computation
Authors:
**gzhen Hu,
Qingzhong Liang,
Robert Calderbank
Abstract:
Divisible codes are defined by the property that codeword weights share a common divisor greater than one. They are used to design signals for communications and sensing, and this paper explores how they can be used to protect quantum information as it is transformed by logical gates. Given a CSS code $\mathcal{C}$, we derive conditions that are both necessary and sufficient for a transversal diag…
▽ More
Divisible codes are defined by the property that codeword weights share a common divisor greater than one. They are used to design signals for communications and sensing, and this paper explores how they can be used to protect quantum information as it is transformed by logical gates. Given a CSS code $\mathcal{C}$, we derive conditions that are both necessary and sufficient for a transversal diagonal physical operator $U_Z$ to preserve $\mathcal{C}$ and induce $U_L$. The group of $Z$-stabilizers in a CSS code $\mathcal{C}$ is determined by the dual of a classical $[n, k_1]$ binary code $\mathcal{C}_1$, and the group of $X$-stabilizers is determined by a classical $[n, k_2]$ binary code $\mathcal{C}_2$ that is contained in $\mathcal{C}_1$. The requirement that a diagonal physical operator $U_Z$ fixes a CSS code $\mathcal{C}$ leads to constraints on the congruence of weights in cosets of $\mathcal{C}_2$. These constraints are a perfect fit to divisible codes, and represent an opportunity to take advantage of the extensive literature on classical codes with two or three weights. We construct new families of CSS codes using cosets of the first order Reed Muller code defined by quadratic forms. We provide a simple alternative to the standard method of deriving the coset weight distributions (based on Dickson normal form) that may be of independent interest. Finally, we develop an approach to circumventing the Eastin-Knill Theorem which states that no QECC can implement a universal set of logical gates through transversal gates alone. The essential idea is to design stabilizer codes in layers, with $N_1$ inner qubits and $N_2$ outer qubits, and to assemble a universal set of fault tolerant gates on the inner qubits.
△ Less
Submitted 27 April, 2022;
originally announced April 2022.
-
A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Authors:
Shao** Ding,
Weiran Wang,
Ding Zhao,
Tara N. Sainath,
Yanzhang He,
Robert David,
Rami Botros,
Xin Wang,
Rina Panigrahy,
Qiao Liang,
Dongseong Hwang,
Ian McGraw,
Rohit Prabhavalkar,
Trevor Strohman
Abstract:
In this paper, we propose a dynamic cascaded encoder Automatic Speech Recognition (ASR) model, which unifies models for different deployment scenarios. Moreover, the model can significantly reduce model size and power consumption without loss of quality. Namely, with the dynamic cascaded encoder model, we explore three techniques to maximally boost the performance of each model size: 1) Use separa…
▽ More
In this paper, we propose a dynamic cascaded encoder Automatic Speech Recognition (ASR) model, which unifies models for different deployment scenarios. Moreover, the model can significantly reduce model size and power consumption without loss of quality. Namely, with the dynamic cascaded encoder model, we explore three techniques to maximally boost the performance of each model size: 1) Use separate decoders for each sub-model while sharing the encoders; 2) Use funnel-pooling to improve the encoder efficiency; 3) Balance the size of causal and non-causal encoders to improve quality and fit deployment constraints. Overall, the proposed large-medium model has 30% smaller size and reduces power consumption by 33%, compared to the baseline cascaded encoder model. The triple-size model that unifies the large, medium, and small models achieves 37% total size reduction with minimal quality loss, while substantially reducing the engineering efforts of having separate models.
△ Less
Submitted 24 June, 2022; v1 submitted 13 April, 2022;
originally announced April 2022.
-
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Authors:
Shao** Ding,
Rajeev Rikhye,
Qiao Liang,
Yanzhang He,
Quan Wang,
Arun Narayanan,
Tom O'Malley,
Ian McGraw
Abstract:
Personalization of on-device speech recognition (ASR) has seen explosive growth in recent years, largely due to the increasing popularity of personal assistant features on mobile devices and smart home speakers. In this work, we present Personal VAD 2.0, a personalized voice activity detector that detects the voice activity of a target speaker, as part of a streaming on-device ASR system. Although…
▽ More
Personalization of on-device speech recognition (ASR) has seen explosive growth in recent years, largely due to the increasing popularity of personal assistant features on mobile devices and smart home speakers. In this work, we present Personal VAD 2.0, a personalized voice activity detector that detects the voice activity of a target speaker, as part of a streaming on-device ASR system. Although previous proof-of-concept studies have validated the effectiveness of Personal VAD, there are still several critical challenges to address before this model can be used in production: first, the quality must be satisfactory in both enrollment and enrollment-less scenarios; second, it should operate in a streaming fashion; and finally, the model size should be small enough to fit a limited latency and CPU/Memory budget. To meet the multi-faceted requirements, we propose a series of novel designs: 1) advanced speaker embedding modulation methods; 2) a new training paradigm to generalize to enrollment-less conditions; 3) architecture and runtime optimizations for latency and resource restrictions. Extensive experiments on a realistic speech recognition system demonstrated the state-of-the-art performance of our proposed method.
△ Less
Submitted 24 June, 2022; v1 submitted 7 April, 2022;
originally announced April 2022.
-
A Two-Level Block Preconditioned Jacobi-Davidson Method for Multiple and Clustered Eigenvalues of Elliptic Operators
Authors:
Qigang Liang,
Wei Wang,
Xuejun Xu
Abstract:
In this paper, we propose a two-level block preconditioned Jacobi-Davidson (BPJD) method for efficiently solving discrete eigenvalue problems resulting from finite element approximations of $2m$th ($m = 1, 2$) order symmetric elliptic eigenvalue problems. Our method works effectively to compute the first several eigenpairs, including both multiple and clustered eigenvalues with corresponding eigen…
▽ More
In this paper, we propose a two-level block preconditioned Jacobi-Davidson (BPJD) method for efficiently solving discrete eigenvalue problems resulting from finite element approximations of $2m$th ($m = 1, 2$) order symmetric elliptic eigenvalue problems. Our method works effectively to compute the first several eigenpairs, including both multiple and clustered eigenvalues with corresponding eigenfunctions, particularly. The method is highly parallelizable by constructing a new and efficient preconditioner using an overlap** domain decomposition (DD). It only requires computing a couple of small scale parallel subproblems and a quite small scale eigenvalue problem per iteration. Our theoretical analysis reveals that the convergence rate of the method is bounded by $c(H)(1-C\frac{δ^{2m-1}}{H^{2m-1}})^{2}$, where $H$ is the diameter of subdomains and $δ$ is the overlap** size among subdomains. The constant $C$ is independent of the mesh size $h$ and the internal gaps among the target eigenvalues, demonstrating that our method is optimal and cluster robust. Meanwhile, the $H$-dependent constant $c(H)$ decreases monotonically to $1$, as $H \to 0$, which means that more subdomains lead to the better convergence rate. Numerical results supporting our theory are given.
△ Less
Submitted 11 April, 2023; v1 submitted 11 March, 2022;
originally announced March 2022.
-
Closing the Gap between Single-User and Multi-User VoiceFilter-Lite
Authors:
Rajeev Rikhye,
Quan Wang,
Qiao Liang,
Yanzhang He,
Ian McGraw
Abstract:
VoiceFilter-Lite is a speaker-conditioned voice separation model that plays a crucial role in improving speech recognition and speaker verification by suppressing overlap** speech from non-target speakers. However, one limitation of VoiceFilter-Lite, and other speaker-conditioned speech models in general, is that these models are usually limited to a single target speaker. This is undesirable as…
▽ More
VoiceFilter-Lite is a speaker-conditioned voice separation model that plays a crucial role in improving speech recognition and speaker verification by suppressing overlap** speech from non-target speakers. However, one limitation of VoiceFilter-Lite, and other speaker-conditioned speech models in general, is that these models are usually limited to a single target speaker. This is undesirable as most smart home devices now support multiple enrolled users. In order to extend the benefits of personalization to multiple users, we previously developed an attention-based speaker selection mechanism and applied it to VoiceFilter-Lite. However, the original multi-user VoiceFilter-Lite model suffers from significant performance degradation compared with single-user models. In this paper, we devised a series of experiments to improve the multi-user VoiceFilter-Lite model. By incorporating a dual learning rate schedule and by using feature-wise linear modulation (FiLM) to condition the model with the attended speaker embedding, we successfully closed the performance gap between multi-user and single-user VoiceFilter-Lite models on single-speaker evaluations. At the same time, the new model can also be easily extended to support any number of users, and significantly outperforms our previously published model on multi-speaker evaluations.
△ Less
Submitted 26 April, 2022; v1 submitted 24 February, 2022;
originally announced February 2022.
-
Saturated absorption spectroscopy of buffer-gas-cooled Barium monofluoride molecules
Authors:
Wenhao Bu,
Yuhe Zhang,
Qian Liang,
Tao Chen,
Bo Yan
Abstract:
We report an experimental investigation on the Doppler-free saturated absorption spectroscopy of buffer-gas-cooled Barium monofluoride (BaF) molecules in a 4~K cryogenic cell. The obtained spectra with a resolution of 19~MHz, much smaller than previously observed in absorption spectroscopy, clearly resolve the hyperfine transitions. Moreover, we use these high-resolution spectra to fit the hyperfi…
▽ More
We report an experimental investigation on the Doppler-free saturated absorption spectroscopy of buffer-gas-cooled Barium monofluoride (BaF) molecules in a 4~K cryogenic cell. The obtained spectra with a resolution of 19~MHz, much smaller than previously observed in absorption spectroscopy, clearly resolve the hyperfine transitions. Moreover, we use these high-resolution spectra to fit the hyperfine splittings of excited $A(v=0)$ state and find the hyperfine splitting of the laser-cooling-relevant $A^2Π_{1/2}(v=0, J=1/2, +)$ state is about 18 MHz, much higher than the previous theoretically predicted value. This provides important missing information for laser cooling of BaF molecules.
△ Less
Submitted 13 February, 2023; v1 submitted 4 February, 2022;
originally announced February 2022.
-
Breath analysis by ultra-sensitive broadband laser spectroscopy detects SARS-CoV-2 infection
Authors:
Qizhong Liang,
Ya-Chu Chan,
Jutta Toscano,
Kristen K. Bjorkman,
Leslie A. Leinwand,
Roy Parker,
Eva S. Nozik,
David J. Nesbitt,
Jun Ye
Abstract:
Rapid testing is essential to fighting pandemics such as COVID-19, the disease caused by the SARS-CoV-2 virus. Exhaled human breath contains multiple volatile molecules providing powerful potential for non-invasive diagnosis of diverse medical conditions. We investigated breath detection of SARS-CoV-2 infection using cavity-enhanced direct frequency comb spectroscopy (CE-DFCS), a state-of-the-art…
▽ More
Rapid testing is essential to fighting pandemics such as COVID-19, the disease caused by the SARS-CoV-2 virus. Exhaled human breath contains multiple volatile molecules providing powerful potential for non-invasive diagnosis of diverse medical conditions. We investigated breath detection of SARS-CoV-2 infection using cavity-enhanced direct frequency comb spectroscopy (CE-DFCS), a state-of-the-art laser spectroscopic technique capable of a real-time massive collection of broadband molecular absorption features at ro-vibrational quantum state resolution and at parts-per-trillion volume detection sensitivity. Using a total of 170 individual breath samples (83 positive and 87 negative with SARS-CoV-2 based on Reverse Transcription Polymerase Chain Reaction tests), we report excellent discrimination capability for SARS-CoV-2 infection with an area under the Receiver-Operating-Characteristics curve of 0.849(4). Our results support the development of CE-DFCS as an alternative, rapid, non-invasive test for COVID-19 and highlight its remarkable potential for optical diagnoses of diverse biological conditions and disease states.
△ Less
Submitted 13 February, 2023; v1 submitted 4 February, 2022;
originally announced February 2022.
-
Double Copy for Massive Scalar Field Theories
Authors:
Mariana Carrillo González,
Qiuyue Liang,
Mark Trodden
Abstract:
We explore extensions of the double copy to massive theories and find a new cubic theory with a local double copy. We consider the nonlinear sigma model and the special galileon theory, massless versions of which are known to be related through the double copy. We show that by performing a Kaluza-Klein reduction of these theories from five dimensions down to four, a double copy relation exists bet…
▽ More
We explore extensions of the double copy to massive theories and find a new cubic theory with a local double copy. We consider the nonlinear sigma model and the special galileon theory, massless versions of which are known to be related through the double copy. We show that by performing a Kaluza-Klein reduction of these theories from five dimensions down to four, a double copy relation exists between the resulting massive four-dimensional scalar field theories. This requires the vanishing contribution of new galileon terms arising in high dimensions. We further explore if other interactions that do not arise from a dimensional reduction of the nonlinear sigma model could be double copied and find a new cubic interaction which satisfies the BCJ relations up to 5-point amplitudes.
△ Less
Submitted 21 June, 2022; v1 submitted 1 February, 2022;
originally announced February 2022.
-
Observation of Non-Hermitian Skin Effect and Topology in Ultracold Atoms
Authors:
Qian Liang,
Dizhou Xie,
Zhaoli Dong,
Haowei Li,
Hang Li,
Bryce Gadway,
Wei Yi,
Bo Yan
Abstract:
The non-Hermitian skin effect (NHSE), the accumulation of eigen wavefunctions at boundaries of open systems, underlies a variety of exotic properties that defy conventional wisdom. While NHSE and its intriguing impact on band topology and dynamics have been observed in classical or photonic systems, their demonstration in a quantum many-body setting remains elusive. Here we report the experimental…
▽ More
The non-Hermitian skin effect (NHSE), the accumulation of eigen wavefunctions at boundaries of open systems, underlies a variety of exotic properties that defy conventional wisdom. While NHSE and its intriguing impact on band topology and dynamics have been observed in classical or photonic systems, their demonstration in a quantum many-body setting remains elusive. Here we report the experimental realization of a dissipative Aharonov-Bohm chain -- a non-Hermitian topological model with NHSE -- in the momentum space of a two-component Bose-Einstein condensate. We identify unique signatures of NHSE in the condensate dynamics, and perform Bragg spectroscopy to resolve topological edge states against a background of localized bulk states. Our work sets the stage for further investigation on the interplay of many-body statistics and interactions with NHSE, and is a significant step forward in the quantum control and simulation of non-Hermitian physics.
△ Less
Submitted 13 August, 2022; v1 submitted 24 January, 2022;
originally announced January 2022.
-
Model-driven Cluster Resource Management for AI Workloads in Edge Clouds
Authors:
Qianlin Liang,
Walid A. Hanafy,
Ahmed Ali-Eldin,
Prashant Shenoy
Abstract:
Since emerging edge applications such as Internet of Things (IoT) analytics and augmented reality have tight latency constraints, hardware AI accelerators have been recently proposed to speed up deep neural network (DNN) inference run by these applications. Resource-constrained edge servers and accelerators tend to be multiplexed across multiple IoT applications, introducing the potential for perf…
▽ More
Since emerging edge applications such as Internet of Things (IoT) analytics and augmented reality have tight latency constraints, hardware AI accelerators have been recently proposed to speed up deep neural network (DNN) inference run by these applications. Resource-constrained edge servers and accelerators tend to be multiplexed across multiple IoT applications, introducing the potential for performance interference between latency-sensitive workloads. In this paper, we design analytic models to capture the performance of DNN inference workloads on shared edge accelerators, such as GPU and edgeTPU, under different multiplexing and concurrency behaviors. After validating our models using extensive experiments, we use them to design various cluster resource management algorithms to intelligently manage multiple applications on edge accelerators while respecting their latency constraints. We implement a prototype of our system in Kubernetes and show that our system can host 2.3X more DNN applications in heterogeneous multi-tenant edge clusters with no latency violations when compared to traditional knapsack hosting algorithms.
△ Less
Submitted 18 January, 2022;
originally announced January 2022.
-
Doppler cooling of buffer-gas-cooled Barium monofluoride molecules
Authors:
Yuhe Zhang,
Zixuan Zeng,
Qian Liang,
Wenhao Bu,
Bo Yan
Abstract:
We demonstrate one-dimensional Doppler cooling of a beam of buffer-gas cooled Barium monofluoride (BaF) molecules. The dependences of the cooling efficiency with the laser detuning, the bias filed and the laser intensity are carefully measured. We numerical simulate our experiment with a Monte Carlo method, and find the theoretic predictions consists with our experimental data. This result represe…
▽ More
We demonstrate one-dimensional Doppler cooling of a beam of buffer-gas cooled Barium monofluoride (BaF) molecules. The dependences of the cooling efficiency with the laser detuning, the bias filed and the laser intensity are carefully measured. We numerical simulate our experiment with a Monte Carlo method, and find the theoretic predictions consists with our experimental data. This result represents a key step towards further cooling and trap** of BaF molecules.
△ Less
Submitted 12 January, 2022;
originally announced January 2022.
-
Diffraction of strongly interacting molecular Bose-Einstein condensate from standing wave light pulses
Authors:
Qi Liang,
Chen Li,
Sebastian Erne,
Pradyumna Paranjape,
RuGway Wu,
Jörg Schmiedmayer
Abstract:
We study the effects of strong inter-particle interaction on diffraction of a Bose-Einstein condensate of $^6Li_2$ molecules from a periodic potential created by pulses of a far detuned optical standing wave. For short pulses we observe the standard Kapitza-Dirac diffraction, with the contrast of the diffraction pattern strongly reduced for very large interactions due to interaction dependent loss…
▽ More
We study the effects of strong inter-particle interaction on diffraction of a Bose-Einstein condensate of $^6Li_2$ molecules from a periodic potential created by pulses of a far detuned optical standing wave. For short pulses we observe the standard Kapitza-Dirac diffraction, with the contrast of the diffraction pattern strongly reduced for very large interactions due to interaction dependent loss processes. For longer pulses diffraction shows the characteristic for matter waves im**ing on an array of tubes and coherent channeling transport. We observe a slowing down of the time evolution governing the population of the momentum modes caused by the strong atom interaction. A simple physical explanation of that slowing down is the phase shift caused by the self-interaction of the forming matter wave patterns inside the standing light wave. Simple 1D mean field simulations qualitatively capture the phenomenon, however to quantitatively reproduce the experimental results the molecular scattering length has to be multiplied by factor of 4.2. In addition, two contributions to interaction-dependent degradation of the coherent diffraction patterns were identified: (i) in-trap loss of molecules during the lattice pulse, which involves dissociation of Feshbach molecules into free atoms, as confirmed by radio-frequency spectroscopy and (ii) collisions between different momentum modes during separation. This was confirmed by interferometrically recombining the diffracted momenta into the zero-momentum peak, which consequently removed the scattering background.
△ Less
Submitted 31 March, 2022; v1 submitted 5 January, 2022;
originally announced January 2022.
-
Image Denoising with Control over Deep Network Hallucination
Authors:
Qiyuan Liang,
Florian Cassayre,
Haley Owsianko,
Majed El Helou,
Sabine Süsstrunk
Abstract:
Deep image denoisers achieve state-of-the-art results but with a hidden cost. As witnessed in recent literature, these deep networks are capable of overfitting their training distributions, causing inaccurate hallucinations to be added to the output and generalizing poorly to varying data. For better control and interpretability over a deep denoiser, we propose a novel framework exploiting a denoi…
▽ More
Deep image denoisers achieve state-of-the-art results but with a hidden cost. As witnessed in recent literature, these deep networks are capable of overfitting their training distributions, causing inaccurate hallucinations to be added to the output and generalizing poorly to varying data. For better control and interpretability over a deep denoiser, we propose a novel framework exploiting a denoising network. We call it controllable confidence-based image denoising (CCID). In this framework, we exploit the outputs of a deep denoising network alongside an image convolved with a reliable filter. Such a filter can be a simple convolution kernel which does not risk adding hallucinated information. We propose to fuse the two components with a frequency-domain approach that takes into account the reliability of the deep network outputs. With our framework, the user can control the fusion of the two components in the frequency domain. We also provide a user-friendly map estimating spatially the confidence in the output that potentially contains network hallucination. Results show that our CCID not only provides more interpretability and control, but can even outperform both the quantitative performance of the deep denoiser and that of the reliable filter, especially when the test data diverge from the training data.
△ Less
Submitted 2 January, 2022;
originally announced January 2022.
-
Rail Vehicle Localization and Map** with LiDAR-Vision-Inertial-GNSS Fusion
Authors:
Yusheng Wang,
Weiwei Song,
Yidong Lou,
Yi Zhang,
Fei Huang,
Zhiyong Tu,
Qiangsheng Liang
Abstract:
In this paper, we present a global navigation satellite system (GNSS) aided LiDAR-visual-inertial scheme, RailLoMer-V, for accurate and robust rail vehicle localization and map**. RailLoMer-V is formulated atop a factor graph and consists of two subsystems: an odometer assisted LiDAR-inertial system (OLIS) and an odometer integrated Visual-inertial system (OVIS). Both the subsystem exploits the…
▽ More
In this paper, we present a global navigation satellite system (GNSS) aided LiDAR-visual-inertial scheme, RailLoMer-V, for accurate and robust rail vehicle localization and map**. RailLoMer-V is formulated atop a factor graph and consists of two subsystems: an odometer assisted LiDAR-inertial system (OLIS) and an odometer integrated Visual-inertial system (OVIS). Both the subsystem exploits the typical geometry structure on the railroads. The plane constraints from extracted rail tracks are used to complement the rotation and vertical errors in OLIS. Besides, the line features and vanishing points are leveraged to constrain rotation drifts in OVIS. The proposed framework is extensively evaluated on datasets over 800 km, gathered for more than a year on both general-speed and high-speed railways, day and night. Taking advantage of the tightly-coupled integration of all measurements from individual sensors, our framework is accurate to long-during tasks and robust enough to grievously degenerated scenarios (railway tunnels). In addition, the real-time performance can be achieved with an onboard computer.
△ Less
Submitted 15 December, 2021;
originally announced December 2021.
-
Climbing the Diagonal Clifford Hierarchy
Authors:
**gzhen Hu,
Qingzhong Liang,
Robert Calderbank
Abstract:
Magic state distillation and the Shor factoring algorithm make essential use of logical diagonal gates. We introduce a method of synthesizing CSS codes that realize a target logical diagonal gate at some level $l$ in the Clifford hierarchy. The method combines three basic operations: concatenation, removal of $Z$-stabilizers, and addition of $X$-stabilizers. It explicitly tracks the logical gate i…
▽ More
Magic state distillation and the Shor factoring algorithm make essential use of logical diagonal gates. We introduce a method of synthesizing CSS codes that realize a target logical diagonal gate at some level $l$ in the Clifford hierarchy. The method combines three basic operations: concatenation, removal of $Z$-stabilizers, and addition of $X$-stabilizers. It explicitly tracks the logical gate induced by a diagonal physical gate that preserves a CSS code. The first step is concatenation, where the input is a CSS code and a physical diagonal gate at level $l$ inducing a logical diagonal gate at the same level. The output is a new code for which a physical diagonal gate at level $l+1$ induces the original logical gate. The next step is judicious removal of $Z$-stabilizers to increase the level of the induced logical operator. We identify three ways of climbing the logical Clifford hierarchy from level $l$ to level $l+1$, each built on a recursive relation on the Pauli coefficients of the induced logical operators. Removal of $Z$-stabilizers may reduce distance, and the purpose of the third basic operation, addition of $X$-stabilizers, is to compensate for such losses. For the coherent noise model, we describe how to switch between computation and storage of intermediate results in a decoherence-free subspace by simply applying Pauli $X$ matrices. The approach to logical gate synthesis taken in prior work focuses on the code states, and results in sufficient conditions for a CSS code to be fixed by a transversal $Z$-rotation. In contrast, we derive necessary and sufficient conditions by analyzing the action of a transversal diagonal gate on the stabilizer group that determines the code. The power of our approach is demonstrated by two proofs of concept: the $[[2^{l+1}-2,2,2]]$ triorthogonal code family, and the $[[2^m,\binom{m}{r},2^{\min\{r,m-r\}}]]$ quantum Reed-Muller code family.
△ Less
Submitted 27 October, 2021; v1 submitted 22 October, 2021;
originally announced October 2021.
-
Designing the Quantum Channels Induced by Diagonal Gates
Authors:
**gzhen Hu,
Qingzhong Liang,
Robert Calderbank
Abstract:
The challenge of quantum computing is to combine error resilience with universal computation. Diagonal gates such as the transversal $T$ gate play an important role in implementing a universal set of quantum operations. This paper introduces a framework that describes the process of preparing a code state, applying a diagonal physical gate, measuring a code syndrome, and applying a Pauli correctio…
▽ More
The challenge of quantum computing is to combine error resilience with universal computation. Diagonal gates such as the transversal $T$ gate play an important role in implementing a universal set of quantum operations. This paper introduces a framework that describes the process of preparing a code state, applying a diagonal physical gate, measuring a code syndrome, and applying a Pauli correction that may depend on the measured syndrome (the average logical channel induced by an arbitrary diagonal gate). It focuses on CSS codes, and describes the interaction of code states and physical gates in terms of generator coefficients determined by the induced logical operator. The interaction of code states and diagonal gates depends very strongly on the signs of $Z$-stabilizers in the CSS code, and the proposed generator coefficient framework explicitly includes this degree of freedom. The paper derives necessary and sufficient conditions for an arbitrary diagonal gate to preserve the code space of a stabilizer code, and provides an explicit expression of the induced logical operator. When the diagonal gate is a quadratic form diagonal gate (introduced by Rengaswamy et al.), the conditions can be expressed in terms of divisibility of weights in the two classical codes that determine the CSS code. These codes find application in magic state distillation and elsewhere. When all the signs are positive, the paper characterizes all possible CSS codes, invariant under transversal $Z$-rotation through $π/2^l$, that are constructed from classical Reed-Muller codes by deriving the necessary and sufficient constraints on $l$. The generator coefficient framework extends to arbitrary stabilizer codes but there is nothing to be gained by considering the more general class of non-degenerate stabilizer codes.
△ Less
Submitted 6 September, 2022; v1 submitted 28 September, 2021;
originally announced September 2021.
-
Lyra: A Benchmark for Turducken-Style Code Generation
Authors:
Qingyuan Liang,
Zeyu Sun,
Qihao Zhu,
Wenjie Zhang,
Lian Yu,
Yingfei Xiong,
Lu Zhang
Abstract:
Recently, neural techniques have been used to generate source code automatically. While promising for declarative languages, these approaches achieve much poorer performance on datasets for imperative languages. Since a declarative language is typically embedded in an imperative language (i.e., the turducken-style programming) in real-world software development, the promising results on declarativ…
▽ More
Recently, neural techniques have been used to generate source code automatically. While promising for declarative languages, these approaches achieve much poorer performance on datasets for imperative languages. Since a declarative language is typically embedded in an imperative language (i.e., the turducken-style programming) in real-world software development, the promising results on declarative languages can hardly lead to significant reduction of manual software development efforts. In this paper, we define a new code generation task: given a natural language comment, this task aims to generate a program in a base imperative language with an embedded declarative language. To our knowledge, this is the first turducken-style code generation task. For this task, we present Lyra: a dataset in Python with embedded SQL. This dataset contains 2,000 carefully annotated database manipulation programs from real-world projects. Each program is paired with both a Chinese comment and an English comment. In our experiment, we adopted Transformer, BERT-style, and GPT-style models as baselines. In the best setting, the generation performance of GPT-style models is better than others, where the AST exact matching accuracy is 24% and 25.5% when using Chinese and English comments, respectively. Therefore, we believe that Lyra provides a new challenge for code generation. Yet, overcoming this challenge may significantly boost the applicability of code generation techniques for real-world software development.
△ Less
Submitted 24 July, 2022; v1 submitted 27 August, 2021;
originally announced August 2021.
-
A Photonics-based superheterodyne RF reception approach
Authors:
Guangyu Gao,
Qijun Liang,
Ziyu Liu,
Huanfa Peng,
Qiang Zhao,
Nai** Liu
Abstract:
A novel photonics-based RF reception approach is proposed as a competitive solution to meet the current challenges of photonic-based approaches and to realize high performances at the same time. The proposed approach adopts the superheterodyne configuration by a combination manner of electronic techniques and photonic techniques, including the ultrawideband generation of optical LO, the two-stage…
▽ More
A novel photonics-based RF reception approach is proposed as a competitive solution to meet the current challenges of photonic-based approaches and to realize high performances at the same time. The proposed approach adopts the superheterodyne configuration by a combination manner of electronic techniques and photonic techniques, including the ultrawideband generation of optical LO, the two-stage photonic superheterodyne frequency conversion and the real-time IF compensation. An engineering prototype has been developed and its performance has been evaluated in the laboratory environment. The experiment results preliminarily verify the feasibility of the proposed approach and its engineering potential. The typical performances are as follows: 0.1 GHz~ 45GHz operation spectrum range (>40 GHz), 900 MHz instantaneous bandwidth, 101 dBHz2/3 SFDR and 130 dBHz LDR, image rejections of ~80 dB for 1st frequency conversion and >90 dB for 2nd frequency conversion.
△ Less
Submitted 9 August, 2021;
originally announced August 2021.
-
Detecting the Stochastic Gravitational Wave Background from Massive Gravity with Pulsar Timing Arrays
Authors:
Qiuyue Liang,
Mark Trodden
Abstract:
We explore the potential of Pulsar Timing Arrays (PTAs) such as NANOGrav, EPTA, and PPTA to detect the Stochastic Gravitational Wave Background (SGWB) in theories of massive gravity. In General Relativity, the function describing the dependence of the correlation between the arrival times of signals from two pulsars on the angle between them is known as the Hellings-Downs curve. We compute the ana…
▽ More
We explore the potential of Pulsar Timing Arrays (PTAs) such as NANOGrav, EPTA, and PPTA to detect the Stochastic Gravitational Wave Background (SGWB) in theories of massive gravity. In General Relativity, the function describing the dependence of the correlation between the arrival times of signals from two pulsars on the angle between them is known as the Hellings-Downs curve. We compute the analogous overlap reduction function for massive gravity, including the additional polarization states and the correction due to the mass of the graviton, and compare the result with the Hellings-Downs curve. The primary result is a complete analytical form for the analog Hellings-Downs curve, providing a starting point for future numerical studies aimed at a detailed comparison between PTA data and the predictions of massive gravity. We study both the massless limit and the stationary limit as checks on our calculation, and discuss how our formalism also allows us to study the impact of massive spin-2 dark matter candidates on data from PTAs.
△ Less
Submitted 17 September, 2021; v1 submitted 11 August, 2021;
originally announced August 2021.
-
Optical vortex coronagraph imaging of a laser-induced plasma filament
Authors:
Qingqing Liang,
Xia Huang,
Yanfei Mou,
Shaodong Zhou,
Wenxing Zhang,
Jieyu Gui,
Grover A. Swartzlander,
JR.,
Qingqing Cheng,
Yi Liu
Abstract:
A high contrast imaging technique based on an optical vortex coronagraph (OVC) is used to measure the spatial phase profile induced by an air plasma generated by a femtosecond laser pulse. The sensitivity of the OVC method significantly surpassed both in-line holographic and direct imaging methods based on air plasma fluorescence. The estimated phase sensitivity of 0.046 waves provides opportuniti…
▽ More
A high contrast imaging technique based on an optical vortex coronagraph (OVC) is used to measure the spatial phase profile induced by an air plasma generated by a femtosecond laser pulse. The sensitivity of the OVC method significantly surpassed both in-line holographic and direct imaging methods based on air plasma fluorescence. The estimated phase sensitivity of 0.046 waves provides opportunities for OVC applications in areas such as bioimaging, material characterization, as well as plasma diagnostics.
△ Less
Submitted 3 August, 2021;
originally announced August 2021.
-
Core shell NaErF4 at NaYF4 upconversion nanoparticles qualify a NIR speckle wavemeter by a visible CCD
Authors:
Tianliang Wang,
Yi Li,
Long Yan,
Qin Liang,
Xu Wang,
**chao Tao,
**g Yang,
Yanqing Qiu,
Yanlong Meng,
Bangning Mao,
Shilong Zhao,
Pengwei Zhou,
Bo Zhou
Abstract:
Speckle patterns have been widely confirmed that can be utilized to reconstruct the wavelength information. In order to achieve higher resolution, a varies of optical diffusing waveguides have been investigated with a focus on their wavelength sensitivity. However, it has been a challenge to reach the balance among cost, volumes, resolution, and stability. In this work, we designed a compact cylin…
▽ More
Speckle patterns have been widely confirmed that can be utilized to reconstruct the wavelength information. In order to achieve higher resolution, a varies of optical diffusing waveguides have been investigated with a focus on their wavelength sensitivity. However, it has been a challenge to reach the balance among cost, volumes, resolution, and stability. In this work, we designed a compact cylindrical random scattering waveguide (CRSW) as the light diffuser only by mixing TiO2 particles and ultra-violate adhesive. The speckle patterns are generated by the light multiple scattering in the CRSW. Importantly, the thin layer of upconversion nanoparticles (UCNPs) were sprayed on the end face of the CRSW. This allows the near infrared (NIR) light to be converted to the visible light, breaking the imaging limitation of visible cameras in the NIR range. We further designed a convolution neural network (CNN) to recognize the wavelength of the speckle patterns with good robustness and excellent ability of transfer learning, resulting in the achievement of a high resolution of 20 kHz ( 0.16 fm) at around 1550 nm with temperature resistance of 2 celsius. Our results provide a low-cost, compact, and simple NIR wavemeter in particular with the ultra high resolution and good temperature stability.
△ Less
Submitted 19 July, 2021;
originally announced July 2021.
-
Statistical Estimation and Nonlinear Filtering in Environmental Pollution
Authors:
Qizhu Liang,
Jie Xiong,
Xingqiu Zhao
Abstract:
This paper studies a nonlinear filtering problem over an infinite time interval. The signal to be estimated is driven by a stochastic partial differential equation involves unknown parameters. Based on discrete observation, strongly consistent estimators of the parameters are derived at first. With the optimal filter given by Bayes formula, the uniqueness of invariant measure for the signal-filter…
▽ More
This paper studies a nonlinear filtering problem over an infinite time interval. The signal to be estimated is driven by a stochastic partial differential equation involves unknown parameters. Based on discrete observation, strongly consistent estimators of the parameters are derived at first. With the optimal filter given by Bayes formula, the uniqueness of invariant measure for the signal-filter pair has been verified. The paper then establishes approximation to the optimal filter, showing that the pathwise average distance, per unit time, of the computed approximating filter from the optimal filter converges to zero in probability. Simulation results are presented at last.
△ Less
Submitted 10 July, 2021;
originally announced July 2021.
-
Multi-user VoiceFilter-Lite via Attentive Speaker Embedding
Authors:
Rajeev Rikhye,
Quan Wang,
Qiao Liang,
Yanzhang He,
Ian McGraw
Abstract:
In this paper, we propose a solution to allow speaker conditioned speech models, such as VoiceFilter-Lite, to support an arbitrary number of enrolled users in a single pass. This is achieved by using an attention mechanism on multiple speaker embeddings to compute a single attentive embedding, which is then used as a side input to the model. We implemented multi-user VoiceFilter-Lite and evaluated…
▽ More
In this paper, we propose a solution to allow speaker conditioned speech models, such as VoiceFilter-Lite, to support an arbitrary number of enrolled users in a single pass. This is achieved by using an attention mechanism on multiple speaker embeddings to compute a single attentive embedding, which is then used as a side input to the model. We implemented multi-user VoiceFilter-Lite and evaluated it for three tasks: (1) a streaming automatic speech recognition (ASR) task; (2) a text-independent speaker verification task; and (3) a personalized keyphrase detection task, where ASR has to detect keyphrases from multiple enrolled users in a noisy environment. Our experiments show that, with up to four enrolled users, multi-user VoiceFilter-Lite is able to significantly reduce speech recognition and speaker verification errors when there is overlap** speech, without affecting performance under other acoustic conditions. This attentive speaker embedding approach can also be easily applied to other speaker-conditioned models such as personal VAD and personalized ASR.
△ Less
Submitted 8 November, 2021; v1 submitted 2 July, 2021;
originally announced July 2021.
-
Semi-supervised Optimal Transport with Self-paced Ensemble for Cross-hospital Sepsis Early Detection
Authors:
Ruiqing Ding,
Yu Zhou,
Jie Xu,
Yan Xie,
Qiqiang Liang,
He Ren,
Yixuan Wang,
Yanlin Chen,
Leye Wang,
Man Huang
Abstract:
The utilization of computer technology to solve problems in medical scenarios has attracted considerable attention in recent years, which still has great potential and space for exploration. Among them, machine learning has been widely used in the prediction, diagnosis and even treatment of Sepsis. However, state-of-the-art methods require large amounts of labeled medical data for supervised learn…
▽ More
The utilization of computer technology to solve problems in medical scenarios has attracted considerable attention in recent years, which still has great potential and space for exploration. Among them, machine learning has been widely used in the prediction, diagnosis and even treatment of Sepsis. However, state-of-the-art methods require large amounts of labeled medical data for supervised learning. In real-world applications, the lack of labeled data will cause enormous obstacles if one hospital wants to deploy a new Sepsis detection system. Different from the supervised learning setting, we need to use known information (e.g., from another hospital with rich labeled data) to help build a model with acceptable performance, i.e., transfer learning. In this paper, we propose a semi-supervised optimal transport with self-paced ensemble framework for Sepsis early detection, called SPSSOT, to transfer knowledge from the other that has rich labeled data. In SPSSOT, we first extract the same clinical indicators from the source domain (e.g., hospital with rich labeled data) and the target domain (e.g., hospital with little labeled data), then we combine the semi-supervised domain adaptation based on optimal transport theory with self-paced under-sampling to avoid a negative transfer possibly caused by covariate shift and class imbalance. On the whole, SPSSOT is an end-to-end transfer learning method for Sepsis early detection which can automatically select suitable samples from two domains respectively according to the number of iterations and align feature space of two domains. Extensive experiments on two open clinical datasets demonstrate that comparing with other methods, our proposed SPSSOT, can significantly improve the AUC values with only 1% labeled data in the target domain in two transfer learning scenarios, MIMIC $rightarrow$ Challenge and Challenge $rightarrow$ MIMIC.
△ Less
Submitted 18 June, 2021;
originally announced June 2021.
-
Controllable Confidence-Based Image Denoising
Authors:
Haley Owsianko,
Florian Cassayre,
Qiyuan Liang
Abstract:
Image denoising is a classic restoration problem. Yet, current deep learning methods are subject to the problems of generalization and interpretability. To mitigate these problems, in this project, we present a framework that is capable of controllable, confidence-based noise removal. The framework is based on the fusion between two different denoised images, both derived from the same noisy input…
▽ More
Image denoising is a classic restoration problem. Yet, current deep learning methods are subject to the problems of generalization and interpretability. To mitigate these problems, in this project, we present a framework that is capable of controllable, confidence-based noise removal. The framework is based on the fusion between two different denoised images, both derived from the same noisy input. One of the two is denoised using generic algorithms (e.g. Gaussian), which make few assumptions on the input images, therefore, generalize in all scenarios. The other is denoised using deep learning, performing well on seen datasets. We introduce a set of techniques to fuse the two components smoothly in the frequency domain. Beyond that, we estimate the confidence of a deep learning denoiser to allow users to interpret the output, and provide a fusion strategy that safeguards them against out-of-distribution inputs. Through experiments, we demonstrate the effectiveness of the proposed framework in different use cases.
△ Less
Submitted 17 June, 2021;
originally announced June 2021.
-
Third-order many-body expansion of OSV-MP2 wavefunction for low-order scaling analytical gradient computation
Authors:
Qiujiang Liang,
Jun Yang
Abstract:
We present a many-body expansion (MBE) formulation and implementation for efficient computation of analytical energy gradients from OSV-MP2 theory based on our earlier work (Zhou et al. J. Chem. Theory Comput. 2020, 16, 196-210). The third-order MBE(3) expansion of OSV-MP2 wavefunction was developed to adopt the orbital-specific clustering and long-range termination schemes, which avoids term-by-t…
▽ More
We present a many-body expansion (MBE) formulation and implementation for efficient computation of analytical energy gradients from OSV-MP2 theory based on our earlier work (Zhou et al. J. Chem. Theory Comput. 2020, 16, 196-210). The third-order MBE(3) expansion of OSV-MP2 wavefunction was developed to adopt the orbital-specific clustering and long-range termination schemes, which avoids term-by-term differentiations of the MBE energy bodies. We achieve better efficiency by exploiting the algorithmic sparsity that allows to prune out insignificant fitting integrals and OSV relaxations. With these approximations, the present implementation is benchmarked on a range of molecules that show an economic scaling in the linear and quadratic regimes for computing MBE(3)-OSV-MP2 amplitude and gradient equations, respectively, and yields normal accuracy comparable to the original OSV-MP2 results. The MPI-3-based parallelism through shared memory one-sided communication is further developed for improving parallel scalability and memory accessibility by sorting the MBE(3) orbital clusters into independent tasks that are distributed on multiple processes across many nodes, supporting both global and local data locations in which selected MBE(3)-OSV-MP2 intermediates of different sizes are distinguished and accordingly placed. The accuracy and efficiency level of our MBE(3)-OSV-MP2 analytical gradient implementation is finally illustrated in two applications: we show that the subtle coordination structure differences of mechanically interlocked Cu-catenane complexes can be distinguished when tuning ligand lengths; and the porphycene molecular dynamics reveals the emergence of the vibrational signature arising from softened N-H stretching associated with hydrogen transfer, using an MP2 level of electron correlation and classical nuclei for the first time.
△ Less
Submitted 10 June, 2021;
originally announced June 2021.
-
Benchmarking the Performance of Bayesian Optimization across Multiple Experimental Materials Science Domains
Authors:
Qiaohao Liang,
Aldair E. Gongora,
Zekun Ren,
Armi Tiihonen,
Zhe Liu,
Shi**g Sun,
James R. Deneault,
Daniil Bash,
Flore Mekki-Berrada,
Saif A. Khan,
Kedar Hippalgaonkar,
Benji Maruyama,
Keith A. Brown,
John Fisher III,
Tonio Buonassisi
Abstract:
In the field of machine learning (ML) for materials optimization, active learning algorithms, such as Bayesian Optimization (BO), have been leveraged for guiding autonomous and high-throughput experimentation systems. However, very few studies have evaluated the efficiency of BO as a general optimization algorithm across a broad range of experimental materials science domains. In this work, we eva…
▽ More
In the field of machine learning (ML) for materials optimization, active learning algorithms, such as Bayesian Optimization (BO), have been leveraged for guiding autonomous and high-throughput experimentation systems. However, very few studies have evaluated the efficiency of BO as a general optimization algorithm across a broad range of experimental materials science domains. In this work, we evaluate the performance of BO algorithms with a collection of surrogate model and acquisition function pairs across five diverse experimental materials systems, namely carbon nanotube polymer blends, silver nanoparticles, lead-halide perovskites, as well as additively manufactured polymer structures and shapes. By defining acceleration and enhancement metrics for general materials optimization objectives, we find that for surrogate model selection, Gaussian Process (GP) with anisotropic kernels (automatic relevance detection, ARD) and Random Forests (RF) have comparable performance and both outperform the commonly used GP without ARD. We discuss the implicit distributional assumptions of RF and GP, and the benefits of using GP with anisotropic kernels in detail. We provide practical insights for experimentalists on surrogate model selection of BO during materials optimization campaigns.
△ Less
Submitted 23 May, 2021;
originally announced June 2021.
-
Predicting antimicrobial activity of conjugated oligoelectrolyte molecules via machine learning
Authors:
Armi Tiihonen,
Sarah J. Cox-Vazquez,
Qiaohao Liang,
Mohamed Ragab,
Zekun Ren,
Noor Titan Putri Hartono,
Zhe Liu,
Shi**g Sun,
Cheng Zhou,
Nathan C. Incandela,
Jakkarin Limwongyut,
Alex S. Moreland,
Senthilnath Jayavelu,
Guillermo C. Bazan,
Tonio Buonassisi
Abstract:
New antibiotics are needed to battle growing antibiotic resistance, but the development process from hit, to lead, and ultimately to a useful drug, takes decades. Although progress in molecular property prediction using machine-learning methods has opened up new pathways for aiding the antibiotics development process, many existing solutions rely on large datasets and finding structural similariti…
▽ More
New antibiotics are needed to battle growing antibiotic resistance, but the development process from hit, to lead, and ultimately to a useful drug, takes decades. Although progress in molecular property prediction using machine-learning methods has opened up new pathways for aiding the antibiotics development process, many existing solutions rely on large datasets and finding structural similarities to existing antibiotics. Challenges remain in modelling of unconventional antibiotics classes that are drawing increasing research attention. In response, we developed an antimicrobial activity prediction model for conjugated oligoelectrolyte molecules, a new class of antibiotics that lacks extensive prior structure-activity relationship studies. Our approach enables us to predict minimum inhibitory concentration for E. coli K12, with 21 molecular descriptors selected by recursive elimination from a set of 5,305 descriptors. This predictive model achieves an R2 of 0.65 with no prior knowledge of the underlying mechanism. We find the molecular representation optimum for the domain is the key to good predictions of antimicrobial activity. In the case of conjugated oligoelectrolytes, a representation reflecting the 3-dimensional shape of the molecules is most critical. Although it is demonstrated with a specific example of conjugated oligoelectrolytes, our proposed approach for creating the predictive model can be readily adapted to other novel antibiotic candidate domains.
△ Less
Submitted 30 November, 2021; v1 submitted 21 May, 2021;
originally announced May 2021.
-
Pyramid Fusion Dark Channel Prior for Single Image Dehazing
Authors:
Qiyuan Liang,
Bin Zhu,
Chong-Wah Ngo
Abstract:
In this paper, we propose the pyramid fusion dark channel prior (PF-DCP) for single image dehazing. Based on the well-known Dark Channel Prior (DCP), we introduce an easy yet effective approach PF-DCP by employing the DCP algorithm at a pyramid of multi-scale images to alleviate the problem of patch size selection. In this case, we obtain the final transmission map by fusing transmission maps at e…
▽ More
In this paper, we propose the pyramid fusion dark channel prior (PF-DCP) for single image dehazing. Based on the well-known Dark Channel Prior (DCP), we introduce an easy yet effective approach PF-DCP by employing the DCP algorithm at a pyramid of multi-scale images to alleviate the problem of patch size selection. In this case, we obtain the final transmission map by fusing transmission maps at each level to recover a high-quality haze-free image. Experiments on RESIDE SOTS show that PF-DCP not only outperforms the traditional prior-based methods with a large margin but also achieves comparable or even better results of state-of-art deep learning approaches. Furthermore, the visual quality is also greatly improved with much fewer color distortions and halo artifacts.
△ Less
Submitted 21 May, 2021;
originally announced May 2021.
-
Personalized Keyphrase Detection using Speaker and Environment Information
Authors:
Rajeev Rikhye,
Quan Wang,
Qiao Liang,
Yanzhang He,
Ding Zhao,
Yiteng,
Huang,
Arun Narayanan,
Ian McGraw
Abstract:
In this paper, we introduce a streaming keyphrase detection system that can be easily customized to accurately detect any phrase composed of words from a large vocabulary. The system is implemented with an end-to-end trained automatic speech recognition (ASR) model and a text-independent speaker verification model. To address the challenge of detecting these keyphrases under various noisy conditio…
▽ More
In this paper, we introduce a streaming keyphrase detection system that can be easily customized to accurately detect any phrase composed of words from a large vocabulary. The system is implemented with an end-to-end trained automatic speech recognition (ASR) model and a text-independent speaker verification model. To address the challenge of detecting these keyphrases under various noisy conditions, a speaker separation model is added to the feature frontend of the speaker verification model, and an adaptive noise cancellation (ANC) algorithm is included to exploit cross-microphone noise coherence. Our experiments show that the text-independent speaker verification model largely reduces the false triggering rate of the keyphrase detection, while the speaker separation model and adaptive noise cancellation largely reduce false rejections.
△ Less
Submitted 15 June, 2021; v1 submitted 28 April, 2021;
originally announced April 2021.
-
Unusual normal and superconducting state properties observed in hydrothermal Fe1-xSe flakes
Authors:
Shaobo Liu,
Sheng Ma,
Zhaosheng Wang,
Wei Hu,
Zian Li,
Qimei Liang,
Hong Wang,
Yuhang Zhang,
Zouyouwei Lu,
Jie Yuan,
Kui **,
Jian-Qi Li,
Li Pi,
Li Yu,
Fang Zhou,
Xiaoli Dong,
Zhongxian Zhao
Abstract:
The electronic and superconducting properties of Fe1-xSe single-crystal flakes grown hydrothermally are studied by the transport measurements under zero and high magnetic fields up to 38.5 T. The results contrast sharply with those previously reported for nematically ordered FeSe by chemical-vapor-transport (CVT) growth. No signature of the electronic nematicity, but an evident metal-to-nonmetal c…
▽ More
The electronic and superconducting properties of Fe1-xSe single-crystal flakes grown hydrothermally are studied by the transport measurements under zero and high magnetic fields up to 38.5 T. The results contrast sharply with those previously reported for nematically ordered FeSe by chemical-vapor-transport (CVT) growth. No signature of the electronic nematicity, but an evident metal-to-nonmetal crossover with increasing temperature, is detected in the normal state of the present hydrothermal samples. Interestingly, a higher superconducting critical temperature Tc of 13.2 K is observed compared to a suppressed Tc of 9 K in the presence of the nematicity in the CVT FeSe. Moreover, the upper critical field in the zero-temperature limit is found to be isotropic with respect to the field direction and to reach a higher value of ~42 T, which breaks the Pauli limit by a factor of 1.8.
△ Less
Submitted 25 April, 2021;
originally announced April 2021.
-
Fano resonance enabled infrared nano-imaging of local strain in bilayer graphene
Authors:
**g Du,
Bosai Lyu,
Wanfei Shan,
Jiajun Chen,
Xianliang Zhou,
**gxu Xie,
Aolin Deng,
Cheng Hu,
Qi Liang,
Guibai Xie,
Xiaojun Li,
Weidong Luo,
Zhiwen Shi
Abstract:
Detection of local strain at the nanometer scale with high sensitivity remains challenging. Here we report near-field infrared nano-imaging of local strains in bilayer graphene through probing strain-induced shifts of phonon frequency. As a non-polar crystal, intrinsic bilayer graphene possesses little infrared response at its transverse optical (TO) phonon frequency. The reported optical detectio…
▽ More
Detection of local strain at the nanometer scale with high sensitivity remains challenging. Here we report near-field infrared nano-imaging of local strains in bilayer graphene through probing strain-induced shifts of phonon frequency. As a non-polar crystal, intrinsic bilayer graphene possesses little infrared response at its transverse optical (TO) phonon frequency. The reported optical detection of local strain is enabled by applying a vertical electrical field that breaks the symmetry of the two graphene layers and introduces finite electrical dipole moment to graphene phonon. The activated phonon further interacts with continuum electronic transitions, and generates a strong Fano resonance. The resulted Fano resonance features a very sharp near-field infrared scattering peak, which leads to an extraordinary sensitivity of ~0.002% for the strain detection. Our studies demonstrate the first nano-scale near-field Fano resonance, provide a new way to probe local strains with high sensitivity in non-polar crystals, and open exciting possibilities for studying strain-induced rich phenomena.
△ Less
Submitted 19 April, 2021;
originally announced April 2021.
-
A Robust Model for Trust Evaluation during Interactions between Agents in a Sociable Environment
Authors:
Qin Liang,
Minjie Zhang,
Fenghui Ren,
Takayuki Ito
Abstract:
Trust evaluation is an important topic in both research and applications in sociable environments. This paper presents a model for trust evaluation between agents by the combination of direct trust, indirect trust through neighbouring links and the reputation of an agent in the environment (i.e. social network) to provide the robust evaluation. Our approach is typology independent from social netw…
▽ More
Trust evaluation is an important topic in both research and applications in sociable environments. This paper presents a model for trust evaluation between agents by the combination of direct trust, indirect trust through neighbouring links and the reputation of an agent in the environment (i.e. social network) to provide the robust evaluation. Our approach is typology independent from social network structures and in a decentralized manner without a central controller, so it can be used in broad domains.
△ Less
Submitted 17 April, 2021;
originally announced April 2021.
-
Batch Optimization of Frequency-Modulated Pulses for Robust Two-qubit Gates in Ion Chains
Authors:
Mingyu Kang,
Qiyao Liang,
Bichen Zhang,
Shilin Huang,
Ye Wang,
Chao Fang,
Jungsang Kim,
Kenneth R. Brown
Abstract:
Two-qubit gates in trapped-ion quantum computers are generated by applying spin-dependent forces that temporarily entangle the internal state of the ion with its motion. Laser pulses are carefully designed to generate a maximally entangling gate between the ions while minimizing any residual entanglement between the motion and the ion. The quality of the gates suffers when the actual experimental…
▽ More
Two-qubit gates in trapped-ion quantum computers are generated by applying spin-dependent forces that temporarily entangle the internal state of the ion with its motion. Laser pulses are carefully designed to generate a maximally entangling gate between the ions while minimizing any residual entanglement between the motion and the ion. The quality of the gates suffers when the actual experimental parameters differ from the ideal case. Here, we improve the robustness of frequency-modulated Mølmer-Sørensen gates to motional mode-frequency offsets by optimizing the average performance over a range of systematic errors using batch optimization. We then compare this method with frequency-modulated gates optimized for ideal parameters that include an analytic robustness condition. Numerical simulations show good performance up to 12 ions, and the method is experimentally demonstrated on a two-ion chain.
△ Less
Submitted 20 December, 2021; v1 submitted 14 April, 2021;
originally announced April 2021.