Search | arXiv e-print repository

SiRi: A Simple Selective Retraining Mechanism for Transformer-based Visual Grounding

Authors: Mengxue Qu, Yu Wu, Wu Liu, Qiqi Gong, Xiaodan Liang, Olga Russakovsky, Yao Zhao, Yunchao Wei

Abstract: In this paper, we investigate how to achieve better visual grounding with modern vision-language transformers, and propose a simple yet powerful Selective Retraining (SiRi) mechanism for this challenging task. Particularly, SiRi conveys a significant principle to the research of visual grounding, i.e., a better initialized vision-language encoder would help the model converge to a better local minimum, advancing the performance accordingly. Specifically, we continually update the parameters of the encoder as training goes on, while periodically re-initializing the rest of the parameters to compel the model to be better optimized based on an enhanced encoder. SiRi can significantly outperform previous approaches on three popular benchmarks. Specifically, our method achieves 83.04% Top1 accuracy on RefCOCO+ testA, outperforming the state-of-the-art approaches (training from scratch) by more than 10.21%. Additionally, we reveal that SiRi performs surprisingly well even with limited training data. We also extend it to transformer-based visual grounding models and other vision-language tasks to verify the validity.

Submitted 27 July, 2022; originally announced July 2022.

Comments: 21 pages (including Supplementary Materials); Accepted to ECCV 2022
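
The schedule described in the SiRi abstract (keep updating the encoder, periodically re-initialize everything else) can be illustrated with a toy loop. This is a minimal sketch and not the authors' implementation: the two parameter groups, the `train_one_round` stand-in, and the initializer are hypothetical placeholders chosen so the control flow runs without a deep learning framework.

```python
import random

def init_params(n, rng):
    """Fresh random initialization for a group of n parameters (hypothetical)."""
    return [rng.uniform(-0.1, 0.1) for _ in range(n)]

def train_one_round(model, steps=10, lr=0.05):
    """Stand-in for real training: nudges every parameter toward 1.0."""
    for _ in range(steps):
        for group in model.values():
            for i, w in enumerate(group):
                group[i] = w + lr * (1.0 - w)

def selective_retraining(rounds=3, seed=0):
    """Train for several rounds, resetting the non-encoder group between rounds."""
    rng = random.Random(seed)
    model = {"encoder": init_params(4, rng), "rest": init_params(4, rng)}
    for r in range(rounds):
        train_one_round(model)
        if r < rounds - 1:
            # Keep the trained encoder and re-initialize everything else,
            # so the next round starts from an improved encoder.
            model["rest"] = init_params(4, rng)
    return model
```

In this toy setting the encoder accumulates the updates from every round, while the remaining parameters only ever see the updates from the final round.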

arXiv:2207.08976 [pdf, other]

Gain-gain and gain-lossless PT-symmetry broken from PT-phase diagram

Authors: Qi Zhang, Yun Ma, Qi Liu, Xinchen Zhang, Yali Jia, Limin Tong, Qihuang Gong, Ying Gu

Abstract: Parity-time (PT) symmetry and its breaking in micro/nano photonic structures have been investigated extensively, as they bring new opportunities to control the flow of light based on non-Hermitian optics. Previous studies have focused on PT-symmetry breaking in loss-loss or gain-loss coupling systems. Here, we theoretically predict gain-gain and gain-lossless PT-symmetry breaking from the phase diagram, where the boundaries between the PT-symmetric and PT-broken phases can be clearly defined in the full parameter space including gain, lossless and loss. For specific micro/nano photonic structures, such as coupled waveguides, we give the transmission matrices of each phase space, which can be used for beam splitting. Taking coupled waveguides as an example, we obtain periodic energy exchange in the PT-symmetric phase and exponential gain or loss in the PT-broken phase, which are consistent with the phase diagram. This scenario, which gives a full view of PT symmetry and its breaking, will not only deepen the understanding of fundamental physics but also promote breakthroughs in photonic applications such as optical routers and beam splitters.

Submitted 18 July, 2022; originally announced July 2022.
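
The phase boundary mentioned in the abstract above can be illustrated with the textbook two-mode coupled-mode matrix. This is a sketch under stated assumptions and not the paper's model: the matrix form [[beta + i*gamma1, kappa], [kappa, beta + i*gamma2]], the sign convention (positive gamma means gain, zero means lossless, negative means loss), and the function names are all assumptions made for illustration.

```python
import cmath

def supermodes(kappa, gamma1, gamma2, beta=0.0):
    """Eigenvalues of [[beta + i*gamma1, kappa], [kappa, beta + i*gamma2]]."""
    mean = beta + 1j * (gamma1 + gamma2) / 2
    split = cmath.sqrt(kappa**2 - ((gamma1 - gamma2) / 2) ** 2)
    return mean + split, mean - split

def phase(kappa, gamma1, gamma2, tol=1e-12):
    """Classify the regime from the sign of the term under the square root."""
    disc = kappa**2 - ((gamma1 - gamma2) / 2) ** 2
    if abs(disc) <= tol:
        return "exceptional point"
    # disc > 0: both supermodes share the same gain and differ only in the
    # real part, which gives periodic energy exchange.
    # disc < 0: the supermodes have different gain, which gives exponential
    # growth or decay of one relative to the other.
    return "PT-symmetric" if disc > 0 else "PT-broken"
```

In this simplified picture only the gain difference enters the boundary, kappa = |gamma1 - gamma2| / 2, so a gain-gain pair such as `phase(1.0, 0.5, 0.5)` or a gain-lossless pair such as `phase(1.0, 3.0, 0.0)` can be classified in the same way as the familiar gain-loss case.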

arXiv:2207.02644 [pdf, other]

doi 10.1109/JSTQE.2020.3025737

Advances in silicon quantum photonics

Authors: Jeremy C. Adcock, Jueming Bao, Yulin Chi, Xiaojiong Chen, Davide Bacco, Qihuang Gong, Leif K. Oxenløwe, Jianwei Wang, Yunhong Ding

Abstract: Quantum technology is poised to enable a step change in human capability for computing, communications and sensing. Photons are indispensable as carriers of quantum information - they travel at the fastest possible speed and are readily protected from decoherence. However, the system requires thousands of near-transparent components with ultra-low-latency control. For quantum technology to be implemented, a new paradigm photonic system is required: one with in-built coherence, stability, the ability to define arbitrary circuits, and a path to manufacturability. Silicon photonics has unparalleled density and component performance, which, with CMOS compatible fabrication, place it in a strong position for a scalable quantum photonics platform. This paper is a progress report on silicon quantum photonics, focused on developments in the past five years. We provide an introduction to silicon quantum photonic components and the challenges in the field, summarise the current state-of-the-art and identify outstanding technical challenges, as well as promising avenues of future research. We also resolve a conflict in the definition of Hong-Ou-Mandel interference visibility in integrated quantum photonic experiments, needed for fair comparison of photon quality across different platforms. Our aim is the development of scalability on the platform, to which end we point the way to ever-closer integration, toward silicon quantum photonic systems-on-a-chip.

Submitted 6 July, 2022; originally announced July 2022.

Journal ref: IEEE Journal of Selected Topics in Quantum Electronics 27.2 (2020): 1-24

arXiv:2205.07405 [pdf]

Multiple-Photon Resonance Enabled Quantum Interference in Emission Spectroscopy of N_2^+

Authors: Xiang Zhang, Qi Lu, Yalei Zhu, Jing Zhao, Rostyslav Danylo, Mingwei Lei, Hongbing Jiang, Chengyin Wu, Zhedong Zhang, Aurélien Houard, Vladimir Tikhonchuk, André Mysyrowicz, Qihuang Gong, Songlin Zhuang, Zengxiu Zhao, Yi Liu

Abstract: Quantum interference occurs frequently in the interaction of laser radiation with materials, leading to a series of fascinating effects such as lasing without inversion, electromagnetically induced transparency, Fano resonance, etc. Such quantum interference effects are mostly enabled by single-photon resonance with transitions in the matter, regardless of how many optical frequencies are involved. Here, we demonstrate quantum interference driven by multiple photons in the emission spectroscopy of nitrogen ions that are resonantly pumped by ultrafast infrared laser pulses. In the spectral domain, Fano resonance is observed in the emission spectrum, where a laser-assisted dynamic Stark effect creates the continuum. In the time domain, the fast-evolving emission is measured, revealing the nature of free-induction decay (FID) arising from quantum radiation and molecular cooperativity. These findings clarify the mechanism of coherent emission of nitrogen ions pumped with MIR pump laser and are likely to be universal. The present work opens a route to explore the important role of quantum interference during the interaction of intense laser pulses with materials near multiple photon resonance.

Submitted 15 May, 2022; originally announced May 2022.

Comments: 20 pages, 8 figures
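
For readers unfamiliar with the Fano resonance named in the abstract above, the standard Fano lineshape can be written in a few lines. This is the generic textbook profile, not a fit to the paper's spectra; the function names and the reduced-energy definition 2(E - E0)/Gamma are the usual conventions, assumed here for illustration.

```python
def reduced_energy(energy, resonance_energy, width):
    """Dimensionless detuning eps = 2 (E - E0) / Gamma."""
    return 2.0 * (energy - resonance_energy) / width

def fano_profile(eps, q):
    """Normalized Fano lineshape (q + eps)^2 / (1 + eps^2).

    q is the asymmetry parameter. The profile vanishes at eps = -q and
    peaks at eps = 1/q with the value 1 + q^2.
    """
    return (q + eps) ** 2 / (1.0 + eps ** 2)
```

The zero at eps = -q next to the peak at eps = 1/q is what produces the characteristic asymmetric line, in contrast to a symmetric Lorentzian.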

arXiv:2205.00394 [pdf]

doi 10.1109/OJCSYS.2022.3205863

Neural Network Optimal Feedback Control with Guaranteed Local Stability

Authors: Tenavi Nakamura-Zimmerer, Qi Gong, Wei Kang

Abstract: Recent research shows that supervised learning can be an effective tool for designing near-optimal feedback controllers for high-dimensional nonlinear dynamic systems. But the behavior of neural network controllers is still not well understood. In particular, some neural networks with high test accuracy can fail to even locally stabilize the dynamic system. To address this challenge we propose several novel neural network architectures, which we show guarantee local asymptotic stability while retaining the approximation capacity to learn the optimal feedback policy semi-globally. The proposed architectures are compared against standard neural network feedback controllers through numerical simulations of two high-dimensional nonlinear optimal control problems: stabilization of an unstable Burgers-type partial differential equation, and altitude and course tracking for an unmanned aerial vehicle. The simulations demonstrate that standard neural networks can fail to stabilize the dynamics even when trained well, while the proposed architectures are always at least locally stabilizing and can achieve near-optimal performance.

Submitted 6 October, 2022; v1 submitted 1 May, 2022; originally announced May 2022.

Comments: arXiv admin note: text overlap with arXiv:2109.07466

Journal ref: IEEE Open Journal of Control Systems, 1 (2022) 210-222

arXiv:2204.11552 [pdf, ps, other]

doi 10.1103/PhysRevLett.128.200401

Experimental demonstration of remotely creating Wigner negativity via quantum steering

Authors: Shuheng Liu, Dongmei Han, Na Wang, Yu Xiang, Fengxiao Sun, Meihong Wang, Zhongzhong Qin, Qihuang Gong, Xiaolong Su, Qiongyi He

Abstract: Non-Gaussian states with Wigner negativity are of particular interest in quantum technology due to their potential applications in quantum computing and quantum metrology. However, how to create such states at a remote location remains a challenge, which is important for efficiently distributing quantum resources between distant nodes in a network. Here, we experimentally prepare an optical non-Gaussian state with a negative Wigner function at a remote node via a local non-Gaussian operation and a shared Gaussian entangled state exhibiting quantum steering. By performing photon subtraction on one mode, Wigner negativity is created in the remote target mode. We show that the Wigner negativity is sensitive to loss on the target mode, but robust to loss on the mode performing photon subtraction. This experiment confirms the connection between the remotely created Wigner negativity and quantum steering. As an application, we show that the generated non-Gaussian state exhibits metrological power in quantum phase estimation.

Submitted 25 April, 2022; originally announced April 2022.

Comments: Phys. Rev. Lett. (Accepted)

Journal ref: Phys. Rev. Lett. 128, 200401 (2022)

arXiv:2204.01715 [pdf]

BigDL 2.0: Seamless Scaling of AI Pipelines from Laptops to Distributed Cluster

Authors: Jason Dai, Ding Ding, Dongjie Shi, Shengsheng Huang, Jiao Wang, Xin Qiu, Kai Huang, Guoqiong Song, Yang Wang, Qiyuan Gong, Jiaming Song, Shan Yu, Le Zheng, Yina Chen, Junwei Deng, Ge Song

Abstract: Most AI projects start with a Python notebook running on a single laptop; however, one usually needs to go through a mountain of pains to scale it to handle larger datasets (for both experimentation and production deployment). These usually entail many manual and error-prone steps for the data scientists to fully take advantage of the available hardware resources (e.g., SIMD instructions, multi-processing, quantization, memory allocation optimization, data partitioning, distributed computing, etc.). To address this challenge, we have open sourced BigDL 2.0 at https://github.com/intel-analytics/BigDL/ under Apache 2.0 license (combining the original BigDL and Analytics Zoo projects); using BigDL 2.0, users can simply build conventional Python notebooks on their laptops (with possible AutoML support), which can then be transparently accelerated on a single node (with up to 9.6x speedup in our experiments), and seamlessly scaled out to a large cluster (across several hundred servers in real-world use cases). BigDL 2.0 has already been adopted by many real-world users (such as Mastercard, Burger King, Inspur, etc.) in production.

Submitted 19 April, 2022; v1 submitted 2 April, 2022; originally announced April 2022.

Comments: Accepted by CVPR 2022 (Demo Track)

arXiv:2203.14192 [pdf, ps, other]

doi 10.1103/PhysRevA.104.L021102

Laser-induced electron Fresnel diffraction by XUV pulses at extreme intensity

Authors: Lei Geng, Hao Liang, K. Krajewska, Liang-You Peng, Qihuang Gong

Abstract: Ionization of atoms and molecules in laser fields can lead to various interesting interference structures in the photoelectron spectrum. For the case of a super-intense extreme ultraviolet laser pulse, we identify a novel petal-like interference structure in the electron momentum distribution along the direction of the laser field propagation. We show that this structure is quite general and can be attributed to the Fresnel diffraction of the electronic wavepacket by the nucleus. Our results are demonstrated by numerically solving the time-dependent Schrödinger equation for atomic hydrogen beyond the dipole approximation. By building an analytical model, we find that the electron displacement determines the aforementioned interference pattern. In addition, we establish the physical picture of laser-induced electron Fresnel diffraction, which is reinforced by both quantum and semiclassical models.

Submitted 26 March, 2022; originally announced March 2022.

arXiv:2201.05541 [pdf, other]

ViT2Hash: Unsupervised Information-Preserving Hashing

Authors: Qinkang Gong, Liangdao Wang, Hanjiang Lai, Yan Pan, Jian Yin

Abstract: Unsupervised image hashing, which maps images into binary codes without supervision, is a compressor with a high compression rate. Hence, how to preserve meaningful information of the original data is a critical problem. Inspired by the large-scale vision pre-training model, known as ViT, which has shown significant progress for learning visual representations, in this paper, we propose a simple information-preserving compressor to finetune the ViT model for the target unsupervised hashing task. Specifically, from pixels to continuous features, we first propose a feature-preserving module, using the corrupted image as input to reconstruct the original feature from the pre-trained ViT model and the complete image, so that the feature extractor can focus on preserving the meaningful information of the original data. Secondly, from continuous features to hash codes, we propose a hashing-preserving module, which aims to keep the semantic information from the pre-trained ViT model by using the proposed Kullback-Leibler divergence loss. Besides, the quantization loss and the similarity loss are added to minimize the quantization error. Our method is very simple and achieves a significantly higher MAP on three benchmark image datasets.

Submitted 14 January, 2022; originally announced January 2022.
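
Two of the loss terms named in the ViT2Hash abstract have common generic forms in the deep hashing literature, sketched below. This is an illustration under assumptions and not the paper's code: sign binarization, a mean squared quantization error, and a plain discrete Kullback-Leibler divergence are standard choices, and all function names here are hypothetical.

```python
import math

def binarize(features):
    """Map continuous features to a {-1, +1} hash code by sign (0 maps to +1)."""
    return [1 if x >= 0 else -1 for x in features]

def quantization_loss(features):
    """Mean squared gap between the continuous features and their binary code."""
    codes = binarize(features)
    return sum((x - b) ** 2 for x, b in zip(features, codes)) / len(features)

def kl_divergence(p, q, eps=1e-12):
    """Discrete KL(p || q) for two probability vectors of equal length."""
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0
    )
```

A small quantization loss means the continuous features already sit close to their binary codes, so little information is lost when they are thresholded.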

arXiv:2112.11679 [pdf, other]

Ghost-dil-NetVLAD: A Lightweight Neural Network for Visual Place Recognition

Authors: Qingyuan Gong, Yu Liu, Liqiang Zhang, Renhe Liu

Abstract: Visual place recognition (VPR) is a challenging task with an imbalance between enormous computational cost and high recognition performance. Thanks to the practical feature extraction ability of lightweight convolutional neural networks (CNNs) and the trainability of the vector of locally aggregated descriptors (VLAD) layer, we propose a lightweight weakly supervised end-to-end neural network consisting of a front-end perception model called GhostCNN and a learnable VLAD layer as a back-end. GhostCNN is based on Ghost modules that are lightweight CNN-based architectures. They can generate redundant feature maps using linear operations instead of the traditional convolution process, making a good trade-off between computation resources and recognition accuracy. To enhance our proposed lightweight model further, we add dilated convolutions to the Ghost module to get features containing more spatial semantic information, improving accuracy. Finally, extensive experiments conducted on a commonly used public benchmark and our private dataset validate that the proposed neural network reduces the FLOPs and parameters of VGG16-NetVLAD by 99.04% and 80.16%, respectively. Besides, both models achieve similar accuracy.

Submitted 16 April, 2024; v1 submitted 22 December, 2021; originally announced December 2021.

arXiv:2110.14893 [pdf, other]

doi 10.1103/PhysRevA.105.053518

Ground-state cooling of multiple near-degenerate mechanical modes

Authors: Jin-Yu Liu, Wenjing Liu, Da Xu, Jia-Chen Shi, Qihuang Gong, Yun-Feng Xiao

Abstract: We propose a general and experimentally feasible approach to realize simultaneous ground-state cooling of an arbitrary number of near-degenerate, or even fully degenerate mechanical modes, overcoming the limit imposed by the formation of mechanical dark modes. Multiple optical modes are employed to provide different dissipation channels that prevent complete destructive interference of the cooling pathway, thus eliminating the dark modes. The cooling rate and limit are explicitly specified, in which the distinguishability of the optical modes to the mechanical modes is found to be critical for an efficient cooling process. In a realistic multi-mode optomechanical system, ground-state cooling of all mechanical modes is demonstrated by sequentially introducing optical drives, proving the feasibility and scalability of the proposed scheme. The work may provide new insights in preparing and manipulating multiple quantum states in macroscopic systems.

Submitted 28 October, 2021; originally announced October 2021.

arXiv:2110.08997 [pdf, ps, other]

Spectrally multiplexed and ultrabright entangled photon pairs in a lithium niobate microresonator

Authors: Bo-Yu Xu, Li-Kun Chen, Jintian Lin, Lan-Tian Feng, Rui Niu, Zhi-Yuan Zhou, Renhong Gao, Chun-Hua Dong, Guang-Can Guo, Qihuang Gong, Ya Cheng, Yun-Feng Xiao, Xi-Feng Ren

Abstract: On-chip bright quantum sources with multiplexing ability are in extremely high demand for integrated quantum networks with unprecedented scalability and complexity. Here, we demonstrate an ultrabright and broadband biphoton quantum source generated in a lithium niobate microresonator system. Without introducing the conventional domain poling, the on-chip microdisk produces entangled photon pairs covering a broad bandwidth promised by natural phase matching in spontaneous parametric down conversion. Experimentally, the multiplexed photon pairs are characterized by $30\ \rm nm$ bandwidth limited by the filtering system, which can be further enlarged. Meanwhile, the generation rate reaches $5.13\ {\rm MHz}/\upmu \rm W$ with a coincidence-to-accidental ratio up to $804$. Besides, the quantum source manifests prominent purity with heralded single photon correlation $g_H^{(2)}(0)=0.0098\pm0.0021$ and energy-time entanglement with excellent interference visibility of $96.5\%\pm1.9\%$. Such quantum sources at the telecommunication band pave the way for high-dimensional entanglement and future integrated quantum information systems.

Submitted 17 October, 2021; originally announced October 2021.

Comments: 8 pages, 4 figures

arXiv:2109.14506 [pdf, other]

doi 10.1103/PhysRevLett.128.073901

Vibrational Kerr solitons in an optomechanical microresonator

Authors: Jia-Chen Shi, Qing-Xin Ji, Qi-Tao Cao, Yan Yu, Wenjing Liu, Qihuang Gong, Yun-Feng Xiao

Abstract: Soliton microcombs based on Kerr nonlinearity in microresonators have been a prominent miniaturized coherent light source. Here, for the first time, we demonstrate the existence of Kerr solitons in an optomechanical microresonator, for which a nonlinear model is built by incorporating a single mechanical mode and multiple optical modes. Interestingly, an exotic vibrational Kerr soliton state is found, which is modulated by a self-sustained mechanical oscillation. Besides, the soliton provides extra mechanical gain through the optical spring effect, and results in phonon lasing with a red-detuned pump. Various nonlinear dynamics are also observed, including limit cycle, higher periodicity, and transient chaos. This work provides guidance for not only exploring many-body nonlinear interactions, but also promoting precision measurements by featuring the superiority of both frequency combs and optomechanics.

Submitted 29 September, 2021; originally announced September 2021.

arXiv:2109.07466 [pdf, ps, other]

doi 10.23919/ACC53348.2022.9867619

Neural network optimal feedback control with enhanced closed loop stability

Authors: Tenavi Nakamura-Zimmerer, Qi Gong, Wei Kang

Abstract: Recent research has shown that supervised learning can be an effective tool for designing optimal feedback controllers for high-dimensional nonlinear dynamic systems. But the behavior of these neural network (NN) controllers is still not well understood. In this paper we use numerical simulations to demonstrate that typical test accuracy metrics do not effectively capture the ability of an NN controller to stabilize a system. In particular, some NNs with high test accuracy can fail to stabilize the dynamics. To address this we propose two NN architectures which locally approximate a linear quadratic regulator (LQR). Numerical simulations confirm our intuition that the proposed architectures reliably produce stabilizing feedback controllers without sacrificing optimality. In addition, we introduce a preliminary theoretical result describing some stability properties of such NN-controlled systems.

Submitted 17 November, 2021; v1 submitted 15 September, 2021; originally announced September 2021.

Report number: American Control Conference (2022) 2373-2378

arXiv:2108.05095 [pdf, other]

doi 10.1103/PhysRevLett.127.087203

Remote generation of magnon Schrödinger cat state via magnon-photon entanglement

Authors: Feng-Xiao Sun, Sha-Sha Zheng, Yang Xiao, Qihuang Gong, Qiongyi He, Ke Xia

Abstract: A magnon cat state represents a macroscopic quantum superposition of collective magnetic excitations of a large number of spins that not only provides fundamental tests of macroscopic quantum effects but also finds applications in quantum metrology and quantum computation. In particular, remote generation and manipulation of Schrödinger cat states are particularly interesting for the development of long-distance and large-scale quantum information processing. Here, we propose an approach to remotely prepare magnon even/odd cat states by performing local non-Gaussian operations on the optical mode that is entangled with the magnon mode through pulsed optomagnonic interaction. By evaluating key properties of the resulting cat states, we show that for experimentally feasible parameters they are generated with both high fidelity and nonclassicality, and with a size large enough to be useful for quantum technologies. Furthermore, the effects of experimental imperfections such as the error of projective measurements and dark counts when performing single-photon operations are discussed, where the lifetime of the created magnon cat states is expected to be $t\sim1\,\mu$s.

Submitted 11 August, 2021; originally announced August 2021.

Comments: 14 pages, 10 figures

Journal ref: Phys. Rev. Lett. 127, 087203 (2021)

arXiv:2108.04205 [pdf, other]

Defense Against Adversarial Swarms with Parameter Uncertainty

Authors: Claire Walton, Isaac Kaminer, Qi Gong, Abram H. Clark, Theodoros Tsatsanifos

Abstract: This paper addresses the problem of optimal defense of a High Value Unit against a large-scale swarm attack. We show that the problem can be cast in the framework of uncertain parameter optimal control and derive a consistency result for the dual problem of this framework. We show that the dual can be computed numerically and apply these numerical results to derive optimal defender strategies against a 100-agent swarm attack.

Submitted 9 August, 2021; originally announced August 2021.

arXiv:2108.02311 [pdf, other]

Modeling and Control of Large-Scale Adversarial Swarm Engagements

Authors: Theodoros Tsatsanifos, Abram H. Clark, Claire Walton, Isaac Kaminer, Qi Gong

Abstract: We theoretically and numerically study the problem of optimal control of large-scale autonomous systems under explicitly adversarial conditions, including probabilistic destruction of agents during the simulation. Large-scale autonomous systems often include an adversarial component, where different agents or groups of agents explicitly compete with one another. An important component of these systems that is not included in current theory or modeling frameworks is random destruction of agents in time. In this case, the modeling and optimal control framework should consider the attrition of agents as well as their position. We propose and test three numerical modeling schemes, where survival probabilities of all agents are smoothly and continuously decreased in time, based on the relative positions of all agents during the simulation. In particular, we apply these schemes to the case of agents defending a high-value unit from an attacking swarm. We show that these models can be successfully used to model this situation, provided that attrition and spatial dynamics are coupled. Our results have relevance to an entire class of adversarial autonomy situations, where the positions of agents and their survival probabilities are both important.

Submitted 4 August, 2021; originally announced August 2021.
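
The idea of a survival probability that decreases smoothly in time according to relative position, as described in the abstract above, can be sketched with a toy attrition law. This is an illustrative assumption and not one of the paper's three schemes: the Gaussian distance dependence, the parameters `lam` and `sigma`, and the function names are all hypothetical.

```python
import math

def attrition_rate(distance, lam=1.0, sigma=1.0):
    """Hypothetical hazard rate: largest at zero distance, fading with range."""
    return lam * math.exp(-distance**2 / (2 * sigma**2))

def survival_over_time(distances, dt=0.1, lam=1.0, sigma=1.0):
    """Integrate dP/dt = -rate(distance) * P, one time step per distance sample."""
    p = 1.0
    history = [p]
    for d in distances:
        # Exact update for a rate held constant over the step dt.
        p *= math.exp(-attrition_rate(d, lam, sigma) * dt)
        history.append(p)
    return history
```

Because the rate depends on distance, the survival curve is tied to the trajectory: an agent that stays far from its opponent keeps a survival probability near one, while an agent at close range loses it quickly.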

arXiv:2107.08230 [pdf]

Plasmon-Exciton Coupling Effect on Plasmon Damping

Authors: Lulu Ye, Weidong Zhang, Aiqin Hu, Hai Lin, Jinglin Tang, Yunkun Wang, Chenxinyu Pan, Pan Wang, Xin Guo, Limin Tong, Yunan Gao, Qihuang Gong, Guowei Lu

Abstract: Plasmon decay via the surface or interface is a critical process for practical energy conversion and plasmonic catalysis. However, the relationship between plasmon damping and the coupling between the plasmon and 2D materials is still unclear. The spectral splitting due to plasmon-exciton interaction impedes the conventional single-particle method to evaluate the plasmon damping rate by the spectral linewidth directly. Here, we investigated the interaction between a single gold nanorod (GNR) and 2D materials using the single-particle spectroscopy method assisted with an in situ nanomanipulation technique by comparing scattering intensity and linewidth together. Our approach allows us to indisputably identify that the plasmon-exciton coupling in the GNR-WSe2 hybrid would induce plasmon damping. We can also isolate the contribution between the charge transfer channel and resonant energy transfer channel for the plasmon decay in the GNR-graphene hybrid by comparing that with thin hBN layers as an intermediate medium to block the charge transfer. We find that the contact layer between the GNR and 2D materials contributes most of the interfacial plasmon damping. These findings contribute to a deep understanding of interfacial excitonic effects on the plasmon and 2D materials hybrid.

Submitted 17 July, 2021; originally announced July 2021.

arXiv:2106.11509 [pdf]

doi 10.1088/1674-1056/ac0daa

Controlled plasmon-enhanced fluorescence by spherical microcavity

Authors: Jingyi Zhao, Weidong Zhang, Te Wen, Lulu Ye, Hai Lin, Jinglin Tang, Qihuang Gong, Guowei Lu

Abstract: A surrounding electromagnetic environment can engineer spontaneous emissions from quantum emitters through the Purcell effect. For instance, a plasmonic antenna can efficiently confine an electromagnetic field and enhance the fluorescent process. In this study, we demonstrate that a photonic microcavity can modulate plasmon-enhanced fluorescence by engineering the local electromagnetic environment. Consequently, we constructed a plasmon-enhanced emitter (PE-emitter), which comprised a nanorod and a nanodiamond, using the nanomanipulation technique. Furthermore, we controlled a polystyrene sphere approaching the PE-emitter and investigated in situ the associated fluorescent spectrum and lifetime. The emission of PE-emitter can be enhanced resonantly at the photonic modes as compared to that within the free spectral range. The spectral shape modulated by photonic modes is independent of the separation between the PS sphere and PE-emitter. The band integral of the fluorescence decay rate can be enhanced or suppressed after the PS sphere couples to the PE-emitters, depending on the coupling strength between the plasmonic antenna and the photonic cavity. These findings can be utilized in sensing and imaging applications.

Submitted 21 June, 2021; originally announced June 2021.

Comments: 4 figures. 14 pages. Accepted by Chinese physics B

MSC Class: 42.82.Fv; 73.20.Mf;

arXiv:2106.10162 [pdf]

ChemiQ: A Chemistry Simulator for Quantum Computer

Authors: Qingchun Wang, Huan-Yu Liu, Qing-Song Li, Jianyu Zhao, Qiankun Gong, Ye Li, Yu-Chun Wu, Guo-Ping Guo

Abstract: Quantum computing, an innovative computing system carrying prominent processing rate, is meant to be the solutions to problems in many fields. Among these realms, the most intuitive application is to help chemical researchers correctly describe strong correlation and complex systems, which are the great challenge in current chemistry simulation. In this paper, we will present a standalone quantum simulation tool for chemistry, ChemiQ, which is designed to assist people in carrying out chemical research or molecular calculation on real or virtual quantum computers. Under the idea of modular programming in C++ language, the software is designed as a full-stack tool without third-party physics or chemistry application packages. It provides services as follows: visually construct molecular structure, quickly simulate ground-state energy, scan molecular potential energy curve by distance or angle, study chemical reaction, and return calculation results graphically after analysis.

Submitted 28 December, 2022; v1 submitted 18 June, 2021; originally announced June 2021.

Comments: software,7 pages, 5 figures

arXiv:2105.12764 [pdf, other]

Scalable Multigrid-based Hierarchical Scientific Data Refactoring on GPUs

Authors: Jieyang Chen, Lipeng Wan, Xin Liang, Ben Whitney, Qing Liu, Qian Gong, David Pugmire, Nicholas Thompson, Jong Youl Choi, Matthew Wolf, Todd Munson, Ian Foster, Scott Klasky

Abstract: Rapid growth in scientific data and a widening gap between computational speed and I/O bandwidth makes it increasingly infeasible to store and share all data produced by scientific simulations. Instead, we need methods for reducing data volumes: ideally, methods that can scale data volumes adaptively so as to enable negotiation of performance and fidelity tradeoffs in different situations. Multigrid-based hierarchical data representations hold promise as a solution to this problem, allowing for flexible conversion between different fidelities so that, for example, data can be created at high fidelity and then transferred or stored at lower fidelity via logically simple and mathematically sound operations. However, the effective use of such representations has been hindered until now by the relatively high costs of creating, accessing, reducing, and otherwise operating on such representations. We describe here highly optimized data refactoring kernels for GPU accelerators that enable efficient creation and manipulation of data in multigrid-based hierarchical forms. We demonstrate that our optimized design can achieve up to 264 TB/s aggregated data refactoring throughput -- 92% of theoretical peak -- on 1024 nodes of the Summit supercomputer. We showcase our optimized design by applying it to a large-scale scientific visualization workflow and the MGARD lossy compression software.

Submitted 26 May, 2021; originally announced May 2021.

Comments: arXiv admin note: text overlap with arXiv:2007.04457

arXiv:2105.05580 [pdf, other]

doi 10.1038/s41467-021-22887-6

A generalised multipath delayed-choice experiment on a large-scale quantum nanophotonic chip

Authors: Xiaojiong Chen, Yaohao Deng, Shuheng Liu, Tanumoy Pramanik, Jun Mao, Jueming Bao, Chonghao Zhai, Tianxiang Dai, Huihong Yuan, Jiajie Guo, Shao-Ming Fei, Marcus Huber, Bo Tang, Yan Yang, Zhihua Li, Qiongyi He, Qihuang Gong, Jianwei Wang

Abstract: Famous double-slit or double-path experiments, implemented in a Young's or Mach-Zehnder interferometer, have confirmed the dual nature of quantum matter. When a stream of photons, neutrons, atoms, or molecules passes through two slits, either wave-like interference fringes build up on a screen, or particle-like which-path distribution can be ascertained. These quantum objects exhibit both wave and particle properties but exclusively, depending on the way they are measured. In an equivalent Mach-Zehnder configuration, the object displays either wave or particle nature in the presence or absence of a beamsplitter, respectively, that represents the choice of which-measurement. Wheeler further proposed a gedanken experiment, in which the choice of which-measurement is delayed, i.e. determined after the object has already entered the interferometer, so as to exclude the possibility of predicting which-measurement it will confront. The delayed-choice experiments have enabled significant demonstrations of genuine two-path duality of different quantum objects. Recently, a quantum controlled version of delayed-choice was proposed by Ionicioiu and Terno, by introducing a quantum-controlled beamsplitter that is in a coherent superposition of presence and absence. It represents a controllable experiment platform that can not only reveal wave and particle characters, but also their superposition. Moreover, a quantitative description of two-slit duality relation was initialized in Wootters and Zurek's seminal work and formalized by Greenberger et al. as $D^2+V^2 \leq 1$, where $D$ is the distinguishability of which-path information, and $V$ is the contrast visibility of interference. In this regard, getting which-path information exclusively reduces the interference visibility, and vice versa. This double-path duality relation has been tested in pioneer experiments and recently in delayed-choice measurements.

Submitted 12 May, 2021; originally announced May 2021.

Comments: 9 pages, 5 figures

Journal ref: Nat. Commun. 12, 2712 (2021)

arXiv:2104.00451 [pdf, other]

doi 10.1038/s41534-022-00533-3

Quantification of Wigner Negativity Remotely Generated via Einstein-Podolsky-Rosen Steering

Authors: Yu Xiang, Shuheng Liu, Jiajie Guo, Qihuang Gong, Nicolas Treps, Qiongyi He, Mattia Walschaers

Abstract: Wigner negativity, as a well-known indicator of nonclassicality, plays an essential role in quantum computing and simulation using continuous-variable systems. Recently, it has been proven that Einstein-Podolsky-Rosen steering is a prerequisite to generate Wigner negativity between two remote modes. Motivated by the demand of real-world quantum network, here we investigate the shareability of generated Wigner negativity in the multipartite scenario from a quantitative perspective. By establishing a monogamy relation akin to the generalized Coffman-Kundu-Wootters inequality, we show that the amount of Wigner negativity cannot be freely distributed among different modes. Moreover, for photon subtraction -- one of the main experimentally realized non-Gaussian operations -- we provide a general method to quantify the remotely generated Wigner negativity. With this method, we find that there is no direct quantitative relation between the Gaussian steerability and the amount of generated Wigner negativity. Our results pave the way for exploiting Wigner negativity as a valuable resource for numerous quantum information protocols based on non-Gaussian scenario.

Submitted 5 April, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

Journal ref: npj Quantum Information 8, 21 (2022)

arXiv:2103.04671 [pdf]

Local phase delay effect on the asymmetric spectroscopy of plasmon-exciton coupling systems

Authors: Aiqin Hu, Weidong Zhang, Lulu Ye, Ying Gu, Zhaohang Xue, Hai Lin, Jinglin Tang, Qihuang Gong, Guowei Lu

Abstract: The phase delay of a local electric field, being well-known in plasmonic nanostructures, has seldom been investigated to modulate the plasmon-exciton interaction. Here, with the single-particle spectroscopy method, we experimentally investigate the phase effect in plasmon-exciton coupling systems consisting of monolayer WSe2 and an individual gold nanorod. The local plasmon phase delay is tuned by adopting various nanorods with different resonant energies respective to the exciton. We find that the local plasmon phase delay between the excitons and the plasmonic modes is as equally essential as the amplitude. The phase delay modulates the plasmon-exciton coupling considerably, resulting in an asymmetric spectral line-shape due to the interference behavior. There is an excellent agreement for the phase delay between the numerically calculated near-field phase distribution and the experimental results. The local phase delay can act as an effective way to modulate the properties of plexcitonic coupling at the nanoscale, which may have potential applications in nanoscale sensing, solar energy devices, and enhancing nonlinear processes.

Submitted 8 March, 2021; originally announced March 2021.

arXiv:2101.01422 [pdf, other]

doi 10.1103/PhysRevLett.125.260506

Deterministic distribution of multipartite entanglement and steering in a quantum network by separable states

Authors: Meihong Wang, Yu Xiang, Haijun Kang, Dongmei Han, Yang Liu, Qiongyi He, Qihuang Gong, Xiaolong Su, Kunchi Peng

Abstract: As two valuable quantum resources, Einstein-Podolsky-Rosen entanglement and steering play important roles in quantum-enhanced communication protocols. Distributing such quantum resources among multiple remote users in a network is a crucial precondition underlying various quantum tasks. We experimentally demonstrate the deterministic distribution of two- and three-mode Gaussian entanglement and steering by transmitting separable states in a network consisting of a quantum server and multiple users. In our experiment, entangled states are not prepared solely by the quantum server, but are created among independent users during the distribution process. More specifically, the quantum server prepares separable squeezed states and applies classical displacements on them before spreading out, and users simply perform local beam-splitter operations and homodyne measurements after they receive separable states. We show that the distributed Gaussian entanglement and steerability are robust against channel loss. Furthermore, one-way Gaussian steering is achieved among users that is useful for further directional or highly asymmetric quantum information processing.

Submitted 5 January, 2021; originally announced January 2021.

Journal ref: Phys. Rev. Lett. 125, 260506 (2020)

arXiv:2012.01698 [pdf, other]

Neural Network Approximations of Compositional Functions With Applications to Dynamical Systems

Authors: Wei Kang, Qi Gong

Abstract: As demonstrated in many areas of real-life applications, neural networks have the capability of dealing with high dimensional data. In the fields of optimal control and dynamical systems, the same capability was studied and verified in many published results in recent years. Towards the goal of revealing the underlying reason why neural networks are capable of solving some high dimensional problems, we develop an algebraic framework and an approximation theory for compositional functions and their neural network approximations. The theoretical foundation is developed in a way so that it supports the error analysis for not only functions as input-output relations, but also numerical algorithms. This capability is critical because it enables the analysis of approximation errors for problems for which analytic solutions are not available, such as differential equations and optimal control. We identify a set of key features of compositional functions and the relationship between the features and the complexity of neural networks. In addition to function approximations, we prove several formulae of error upper bounds for neural networks that approximate the solutions to differential equations, optimization, and optimal control.

Submitted 2 December, 2020; originally announced December 2020.

Comments: 40 pages, 18 figures

arXiv:2011.06111 [pdf]

doi 10.1007/s11207-020-01751-8

The Balloon-borne Investigation of Temperature and Speed of Electrons in the corona (BITSE): Mission Description and Preliminary Results

Authors: N. Gopalswamy, J. Newmark, S. Yashiro, P. Mäkelä, N. Reginald, N. Thakur, Q. Gong, Y-H. Kim, K-S. Cho, S-H. Choi, J-H. Baek, S-C. Bong, H-S. Yang, J-Y. Park, J-H. Kim, Y-D. Park, J. -O. Lee, R. -S. Kim, E. -K. Lim

Abstract: We report on the Balloon-borne Investigation of Temperature and Speed of Electrons in the corona (BITSE) mission launched recently to observe the solar corona from about 3 Rs to 15 Rs at four wavelengths (393.5, 405.0, 398.7, and 423.4 nm). The BITSE instrument is an externally occulted single stage coronagraph developed at NASA's Goddard Space Flight Center in collaboration with the Korea Astronomy and Space Science Institute (KASI). BITSE used a polarization camera that provided polarization and total brightness images of size 1024 x 1024 pixels. The Wallops Arc Second Pointing (WASP) system developed at NASA's Wallops Flight Facility (WFF) was used for Sun-pointing. The coronagraph and WASP were mounted on a gondola provided by WFF and launched from the Fort Sumner, New Mexico station of Columbia Scientific Balloon Facility (CSBF) on September 18, 2019. BITSE obtained 17,060 coronal images at a float altitude of about 128,000 feet (39 km) over a period of about 4 hrs. BITSE flight software was based on NASA's core Flight System, which was designed to help develop flight quality software. We used EVTM (Ethernet Via Telemetry) to download science data during operations; all images were stored onboard using flash storage. At the end of the mission, all data were recovered and analyzed. Preliminary analysis shows that BITSE imaged the solar minimum corona with the equatorial streamers on the east and west limbs. The narrow streamers observed by BITSE are in good agreement with the geometric properties obtained by SOHO coronagraphs in the overlapping physical domain. In spite of the small signal-to-noise ratio (about 14) we were able to obtain the temperature and flow speed of the western streamer region in the range 4 to 7 Rs: for the equatorial streamer on the west limb, we obtained a temperature of 1.0 +/- 0.3 MK and a flow speed of about 260 km/s with a large uncertainty interval.

Submitted 12 December, 2020; v1 submitted 11 November, 2020; originally announced November 2020.

Comments: 40 pages, 25 figures, 4 tables, 3 electronic supplements, to appear in Solar Physics

arXiv:2011.01426 [pdf, other]

Nesting and Degeneracy of Mie Resonances of Dielectric Cavities within Zero-Index Materials

Authors: Xueke Duan, Haoxiang Chen, Yun Ma, Zhiyuan Qian, Qi Zhang, Yun Lai, Ruwen Peng, Qihuang Gong, Ying Gu

Abstract: Resonances in optical cavities have been used to manipulate light propagation, enhance light-matter interaction, modulate quantum states, and so on. However, in traditional cavities, the permittivity contrast in and out the cavity is not so high. Recently, zero-index materials (ZIMs) with unique properties and specific applications have attracted great interest. By putting optical cavity into ZIMs, the extreme circumstance with infinite permittivity contrast can be obtained. Here, we theoretically study Mie resonances of dielectric cavities embedded in ZIMs with $\varepsilon \approx 0$, or $\mu \approx 0$, or $(\varepsilon,\mu) \approx 0$. Owing to ultrahigh contrast ratio of $\varepsilon$ or $\mu$ in and out the cavities, with fixed wavelength, a series of Mie resonances with the same angular mode number $l$ but with different cavity radii are obtained; more interestingly, its $2^l$-TM (TE) and $2^{l+1}$-TE (TM) modes have the same resonant solution for the cavity in $\varepsilon \approx 0$ ($\mu \approx 0$) material, and the resonance degeneracy also occurs between $2^l$-TM mode and $2^l$-TE mode for $(\varepsilon,\mu) \approx 0$ material. We further use resonance degeneracy to modulate the Purcell effect of quantum emitter inside the cavity. The results of resonance nesting and degeneracy will provide an additional view or freedom to enhance the performance of cavity behaviors.

Submitted 2 November, 2020; originally announced November 2020.

Comments: 6 pages, 4 figures

arXiv:2009.05686 [pdf]

doi 10.1109/LCSYS.2020.3034415

QRnet: optimal regulator design with LQR-augmented neural networks

Authors: Tenavi Nakamura-Zimmerer, Qi Gong, Wei Kang

Abstract: In this paper we propose a new computational method for designing optimal regulators for high-dimensional nonlinear systems. The proposed approach leverages physics-informed machine learning to solve high-dimensional Hamilton-Jacobi-Bellman equations arising in optimal feedback control. Concretely, we augment linear quadratic regulators with neural networks to handle nonlinearities. We train the augmented models on data generated without discretizing the state space, enabling application to high-dimensional problems. We use the proposed method to design a candidate optimal regulator for an unstable Burgers' equation, and through this example, demonstrate improved robustness and accuracy compared to existing neural network formulations.

Submitted 16 November, 2020; v1 submitted 11 September, 2020; originally announced September 2020.

Comments: Added IEEE accepted manuscript with copyright notice

Journal ref: IEEE Control Systems Letters 5 (2021) 1303-1308

arXiv:2008.12510 [pdf, other]

Gain-assisted chiral soliton microcombs

Authors: Teng Tan, Hao-Jing Chen, Zhongye Yuan, Yan Yu, Qi-Tao Cao, Ning An, Qihuang Gong, Chee Wei Wong, Yunjiang Rao, Yun-Feng Xiao, Baicheng Yao

Abstract: The emerging microresonator-based frequency combs revolutionize a broad range of applications from optical communications to astronomical calibration. Despite their significant merits, low energy efficiency and the lack of all-optical dynamical control severely hinder the transfer of microcomb system to real-world applications. Here, by introducing active lasing medium into the soliton microcomb, for the first time, we experimentally achieve the chiral soliton with agile on-off switch and tunable dual-comb generation in a packaged microresonator. It is found that such a microresonator enables a soliton slingshot effect, the rapid soliton formation arising from the extra energy accumulation induced by inter-modal couplings. Moreover, tuning the erbium gain can generate versatile multi-soliton states, and extend the soliton operation window to a remarkable range over 18 GHz detuning. Finally, the gain-assisted chirality of counterpropagating soliton is demonstrated, which enables an unprecedented fast on-off switching of soliton microcombs. The non-trivial chiral soliton formation with active controllability inspires new paradigms of miniature optical frequency combs and brings the fast tunable soliton tools within reach.

Submitted 28 August, 2020; originally announced August 2020.

Comments: 11 pages, 8 figures

arXiv:2008.06495 [pdf, other]

Joint Policy Search for Multi-agent Collaboration with Imperfect Information

Authors: Yuandong Tian, Qucheng Gong, Tina Jiang

Abstract: To learn good joint policies for multi-agent collaboration with imperfect information remains a fundamental challenge. While for two-player zero-sum games, coordinate-ascent approaches (optimizing one agent's policy at a time, e.g., self-play) work with guarantees, in multi-agent cooperative setting they often converge to sub-optimal Nash equilibrium. On the other hand, directly modeling joint policy changes in imperfect information game is nontrivial due to complicated interplay of policies (e.g., upstream updates affect downstream state reachability). In this paper, we show global changes of game values can be decomposed to policy changes localized at each information set, with a novel term named policy-change density. Based on this, we propose Joint Policy Search (JPS) that iteratively improves joint policies of collaborative agents in imperfect information games, without re-evaluating the entire game. On multi-agent collaborative tabular games, JPS is proven to never worsen performance and can improve solutions provided by unilateral approaches (e.g., CFR), outperforming algorithms designed for collaborative policy learning (e.g., BAD). Furthermore, for real-world games, JPS has an online form that naturally links with gradient updates. We test it on Contract Bridge, a 4-player imperfect-information game where a team of $2$ collaborates to compete against the other. In its bidding phase, players bid in turn to find a good contract through a limited information channel. Based on a strong baseline agent that bids competitive bridge purely through domain-agnostic self-play, JPS improves collaboration of team players and outperforms WBridge5, a championship-winning software, by $+0.63$ IMPs (International Matching Points) per board over 1k games, substantially better than previous SoTA ($+0.41$ IMPs/b) under Double-Dummy evaluation.

Submitted 5 December, 2020; v1 submitted 14 August, 2020; originally announced August 2020.

Comments: Minor fix of the algorithm block

arXiv:2007.13544 [pdf, other]

Combining Deep Reinforcement Learning and Search for Imperfect-Information Games

Authors: Noam Brown, Anton Bakhtin, Adam Lerer, Qucheng Gong

Abstract: The combination of deep reinforcement learning and search at both training and test time is a powerful paradigm that has led to a number of successes in single-agent settings and perfect-information games, best exemplified by AlphaZero. However, prior algorithms of this form cannot cope with imperfect-information games. This paper presents ReBeL, a general framework for self-play reinforcement learning and search that provably converges to a Nash equilibrium in any two-player zero-sum game. In the simpler setting of perfect-information games, ReBeL reduces to an algorithm similar to AlphaZero. Results in two different imperfect-information games show ReBeL converges to an approximate Nash equilibrium. We also show ReBeL achieves superhuman performance in heads-up no-limit Texas hold'em poker, while using far less domain knowledge than any prior poker AI.

Submitted 28 November, 2020; v1 submitted 27 July, 2020; originally announced July 2020.

arXiv:2005.11471 [pdf, other]

Enhanced entanglement and asymmetric EPR steering between magnons

Authors: Sha-Sha Zheng, Feng-Xiao Sun, Huai-Yang Yuan, Zbigniew Ficek, Qi-Huang Gong, Qiong-Yi He

Abstract: The generation and manipulation of strong entanglement and Einstein-Podolsky-Rosen (EPR) steering in macroscopic systems are outstanding challenges in modern physics. Especially, the observation of asymmetric EPR steering is important for both its fundamental role in interpreting the nature of quantum mechanics and its application as resource for the tasks where the levels of trust at different parties are highly asymmetric. Here, we study the entanglement and EPR steering between two macroscopic magnons in a hybrid ferrimagnet-light system. In the absence of light, the two types of magnons on the two sublattices can be entangled, but no quantum steering occurs when they are damped with the same rates. In the presence of the cavity field, the entanglement can be significantly enhanced, and strong two-way asymmetric quantum steering appears between two magnons with equal dissipation. This is very different from the conventional protocols to produce asymmetric steering by imposing additional unbalanced losses or noises on the two parties at the cost of reducing steerability. The essential physics is well understood by the unbalanced population of acoustic and optical magnons under the cooling effect of cavity photons. Our finding may provide a novel platform to manipulate the quantum steering and the detection of bi-party steering provides a knob to probe the magnetic damping on each sublattice of a magnet.

Submitted 23 May, 2020; originally announced May 2020.

arXiv:2001.09832 [pdf, other]

Polygames: Improved Zero Learning

Authors: Tristan Cazenave, Yen-Chi Chen, Guan-Wei Chen, Shi-Yu Chen, Xian-Dong Chiu, Julien Dehos, Maria Elsa, Qucheng Gong, Hengyuan Hu, Vasil Khalidov, Cheng-Ling Li, Hsin-I Lin, Yu-Jin Lin, Xavier Martinet, Vegard Mella, Jeremy Rapin, Baptiste Roziere, Gabriel Synnaeve, Fabien Teytaud, Olivier Teytaud, Shi-Cheng Ye, Yi-Jun Ye, Shi-Jim Yen, Sergey Zagoruyko

Abstract: Since DeepMind's AlphaZero, Zero learning quickly became the state-of-the-art method for many board games. It can be improved using a fully convolutional structure (no fully connected layer). Using such an architecture plus global pooling, we can create bots independent of the board size. The training can be made more robust by keeping track of the best checkpoints during the training and by training against them. Using these features, we release Polygames, our framework for Zero learning, with its library of games and its checkpoints. We won against strong humans at the game of Hex in 19x19, which was often said to be intractable for zero learning; and in Havannah. We also won several first places at the TAAI competitions.

Submitted 27 January, 2020; originally announced January 2020.

arXiv:1912.01328 [pdf, other]

Trimming Mobile Applications for Bandwidth-Challenged Networks in Developing Regions

Authors: Qinge Xie, Qingyuan Gong, Xinlei He, Yang Chen, Xin Wang, Haitao Zheng, Ben Y. Zhao

Abstract: Despite continuous efforts to build and update network infrastructure, mobile devices in developing regions continue to be constrained by limited bandwidth. Unfortunately, this coincides with a period of unprecedented growth in the size of mobile applications. Thus it is becoming prohibitively expensive for users in developing regions to download and update mobile apps critical to their economic and educational development. Unchecked, these trends can further contribute to a large and growing global digital divide. Our goal is to better understand the source of this rapid growth in mobile app code size, whether it is reflective of new functionality, and identify steps that can be taken to make existing mobile apps more friendly to bandwidth-constrained mobile networks. We hypothesize that much of this growth in mobile apps is due to poor resource/code management, and does not reflect proportional increases in functionality. Our hypothesis is partially validated by mini-programs, apps with extremely small footprints gaining popularity in Chinese mobile networks. Here, we use functionally equivalent pairs of mini-programs and Android apps to identify potential sources of "bloat," inefficient uses of code or resources that contribute to large package sizes. We analyze a large sample of popular Android apps and quantify instances of code and resource bloat. We develop techniques for automated code and resource trimming, and successfully validate them on a large set of Android apps. We hope our results will lead to continued efforts to streamline mobile apps, making them easier to access and maintain for users in developing regions.

Submitted 8 December, 2019; v1 submitted 3 December, 2019; originally announced December 2019.

Comments: 12 pages, 8 figures

arXiv:1912.00492 [pdf, ps, other]

doi 10.1016/j.physd.2021.132955

Algorithms of Data Development For Deep Learning and Feedback Design

Authors: Wei Kang, Qi Gong, Tenavi Nakamura-Zimmerer

Abstract: Recent research reveals that deep learning is an effective way of solving high dimensional Hamilton-Jacobi-Bellman equations. The resulting feedback control law in the form of a neural network is computationally efficient for real-time applications of optimal control. A critical part of this design method is to generate data for training the neural network and validating its accuracy. In this paper, we provide a survey of existing algorithms that can be used to generate data. All the algorithms surveyed in this paper are causality-free, i.e., the solution at a point is computed without using the value of the function at any other points. At the end of the paper, an illustrative example of optimal feedback design using deep learning is given.

Submitted 28 January, 2020; v1 submitted 1 December, 2019; originally announced December 2019.

Comments: 15 pages, 1 figure

Journal ref: Physica D: Nonlinear Phenomena 425 (2021) 132955

arXiv:1911.09311 [pdf, ps, other]

Density Propagation with Characteristics-based Deep Learning

Authors: Tenavi Nakamura-Zimmerer, Daniele Venturi, Qi Gong, Wei Kang

Abstract: Uncertainty propagation in nonlinear dynamic systems remains an outstanding problem in scientific computing and control. Numerous approaches have been developed, but are limited in their capability to tackle problems with more than a few uncertain variables or require large amounts of simulation data. In this paper, we propose a data-driven method for approximating joint probability density functions (PDFs) of nonlinear dynamic systems with initial condition and parameter uncertainty. Our approach leverages the power of deep learning to deal with high-dimensional inputs, but we overcome the need for huge quantities of training data by encoding PDF evolution equations directly into the optimization problem. We demonstrate the potential of the proposed method by applying it to evaluate the robustness of a feedback controller for a six-dimensional rigid body with parameter uncertainty.

Submitted 21 November, 2019; originally announced November 2019.

Comments: This work has been submitted to IFAC for possible publication

arXiv:1911.07839 [pdf, other]

doi 10.1038/s41567-019-0727-x

Chip-to-chip quantum teleportation and multi-photon entanglement in silicon

Authors: Daniel Llewellyn, Yunhong Ding, Imad I. Faruque, Stefano Paesani, Davide Bacco, Raffaele Santagati, Yan-Jun Qian, Yan Li, Yun-Feng Xiao, Marcus Huber, Mehul Malik, Gary F. Sinclair, Xiaoqi Zhou, Karsten Rottwitt, Jeremy L. O'Brien, John G. Rarity, Qihuang Gong, Leif K. Oxenlowe, Jianwei Wang, Mark G. Thompson

Abstract: Exploiting semiconductor fabrication techniques, natural carriers of quantum information such as atoms, electrons, and photons can be embedded in scalable integrated devices. Integrated optics provides a versatile platform for large-scale quantum information processing and transceiving with photons. Scaling up the integrated devices for quantum applications requires high-performance single-photon generation and photonic qubit-qubit entangling operations. However, previous demonstrations report major challenges in producing multiple bright, pure and identical single-photons, and entangling multiple photonic qubits with high fidelity. Another notable challenge is to noiselessly interface multiphoton sources and multiqubit operators in a single device. Here we demonstrate on-chip genuine multipartite entanglement and quantum teleportation in silicon, by coherently controlling an integrated network of microresonator nonlinear single-photon sources and linear-optic multiqubit entangling circuits. The microresonators are engineered to locally enhance the nonlinearity, producing multiple frequency-uncorrelated and indistinguishable single-photons, without requiring any spectral filtering. The multiqubit states are processed in a programmable linear circuit facilitating Bell-projection and fusion operation in a measurement-based manner. We benchmark key functionalities, such as intra-/inter-chip teleportation of quantum states, and generation of four-photon Greenberger-Horne-Zeilinger entangled states. The production, control, and transceiving of states are all achieved in micrometer-scale silicon chips, fabricated by complementary metal-oxide-semiconductor processes. Our work lays the groundwork for scalable on-chip multiphoton technologies for quantum computing and communication.

Submitted 9 February, 2020; v1 submitted 15 November, 2019; originally announced November 2019.

Journal ref: Nat. Phys. 16, 148-153 (2020)

arXiv:1910.14222 [pdf, other]

doi 10.1103/PhysRevLett.126.023901

Topologically Enabled Ultralarge Purcell Enhancement Robust to Photon Scattering

Authors: Zhiyuan Qian, Zhichao Li, He Hao, Lingxiao Shan, Qihuang Gong, Ying Gu

Abstract: Micro/nanoscale single photon source is a building block of on-chip quantum information devices. Owing to their ultrasmall optical mode volume, plasmon structures can provide large Purcell enhancement; however, scattering and absorption are two barriers that prevent them from being used in practice. To overcome these barriers, we propose the topological photonic structure containing resonant plasmon nanoantenna, where nanoantenna provides large Purcell enhancement while topological photonic crystal guides all scattering light into its edge state. Through the optical mode design, the rate of single photons emitted into the edge state reaches more than $10^4γ_0$, simultaneously accompanied by an obvious reduction of absorption. This kind of nonscattering large Purcell enhancement will provide new insight for on-chip quantum light sources such as a single photon source and nanolaser.

Submitted 30 October, 2019; originally announced October 2019.

Comments: 6 pages, 4 figures

Journal ref: Phys. Rev. Lett. 126, 023901 (2021)

arXiv:1910.01298 [pdf, other]

doi 10.1103/PhysRevA.101.043807

Dynamics of transient cat-states in degenerate parametric oscillation with and without nonlinear Kerr interactions

Authors: R. Y. Teh, F. -X. Sun, R. E. S. Polkinghorne, Q. Y. He, Q. Gong, P. D. Drummond, M. D. Reid

Abstract: A cat-state is formed as the steady-state solution for the signal mode of an ideal, degenerate parametric oscillator, in the limit of negligible single-photon signal loss. In the presence of the signal loss, this is no longer true over timescales much longer than the damping time. However, for sufficient parametric nonlinearity, a cat-state can exist as a transient state. In this paper, we study the dynamics of the creation and decoherence of cat-states in degenerate parametric oscillation, both with and without the effect of a Kerr nonlinearity that applies to recent superconducting-circuit experiments generating cat-states in microwave cavities. We determine the time of formation and the lifetime of a cat-state in terms of three dimensionless parameters $λ$, $g$ and $χ$. These relate to the driving strength, the parametric nonlinearity, and the Kerr nonlinearity, respectively. We find that the Kerr nonlinearity has little effect on the threshold parametric nonlinearity ($g>1$) required for the formation of cat-states, and does not significantly alter the decoherence time of the cat-state, but can reduce the time of formation. The quality of the cat-state increases with the value $g$, and can also be improved by the Kerr nonlinearity. To verify the existence and quality of the cat-state, we consider several signatures, including interference fringes and negativity, and show how they can be computed. We simulate a superconducting-circuit experiment using published experimental parameters and find good agreement with experimental results, indicating that a nonclassical cat-like state with a small Wigner negativity is generated in the experiment. A stronger nonlinearity would lead to a cat-state with convincing cat-state signatures. Finally, we explore the feasibility of creating large cat-states with a coherent amplitude of 20, corresponding to 400 photons.

Submitted 30 November, 2020; v1 submitted 3 October, 2019; originally announced October 2019.

Journal ref: Phys. Rev. A 101, 043807 (2020)

arXiv:1909.07960 [pdf, other]

doi 10.1016/j.jcp.2020.109710

A new scalable algorithm for computational optimal control under uncertainty

Authors: Panos Lambrianides, Qi Gong, Daniele Venturi

Abstract: We address the design and synthesis of optimal control strategies for high-dimensional stochastic dynamical systems. Such systems may be deterministic nonlinear systems evolving from random initial states, or systems driven by random parameters or processes. The objective is to provide a validated new computational capability for optimal control which will be achieved more efficiently than current state-of-the-art methods. The new framework utilizes direct single or multi-shooting discretization, and is based on efficient vectorized gradient computation with adaptable memory management. The algorithm is demonstrated to be scalable to high-dimensional nonlinear control systems with random initial condition and unknown parameters.

Submitted 17 September, 2019; originally announced September 2019.

Comments: 23 pages, 17 figures

arXiv:1908.02356 [pdf, other]

The Mid-InfraRed Exo-planet CLimate Explorer MIRECLE: Exploring the Nearest M-Earths Through Ultra-Stable Mid-IR Transit and Phase-Curve Spectroscopy

Authors: Johannes Staguhn, Avi Mandell, Kevin Stevenson, Prabal Saxena, Ravi Kopparapu, Dale Fixsen, Elmer Sharp, Michael DiPirro, Claudia Knez, Eric Wolf, Kristin Sotzen, Kathleen Mandt, Qian Gong, Geronimo Villanueva

Abstract: This White Paper presents a mission concept called MIRECLE - the Mid-InfraRed Exoplanet CLimate Explorer. With a moderately sized aperture of 2 meters, broad wavelength coverage (4 - 25 um), and next generation instruments, MIRECLE will be capable of efficiently characterizing a statistically significant sample of terrestrial planets, many of which will be in their host stars' habitable zones. Spectroscopic characterization of terrestrial atmospheres will provide constraints for the distribution of planets with tenuous vs. substantial atmospheres, on the inner and outer edges of the habitable zone, and climate models to assess the potential for habitability. For the few brightest targets, the detection of specific combinations of molecules would provide evidence of biosignatures. For all other targets, this comprehensive survey would filter out the airless, desiccated, or lifeless worlds, thus providing a subset of potentially habitable worlds ready for in-depth atmospheric characterization using a larger aperture telescope.

Submitted 6 August, 2019; originally announced August 2019.

Comments: Astro2020: Decadal Survey on Astronomy and Astrophysics APC White Paper

arXiv:1907.05317 [pdf, ps, other]

doi 10.1137/19M1288802

Adaptive Deep Learning for High-Dimensional Hamilton-Jacobi-Bellman Equations

Authors: Tenavi Nakamura-Zimmerer, Qi Gong, Wei Kang

Abstract: Computing optimal feedback controls for nonlinear systems generally requires solving Hamilton-Jacobi-Bellman (HJB) equations, which are notoriously difficult when the state dimension is large. Existing strategies for high-dimensional problems often rely on specific, restrictive problem structures, or are valid only locally around some nominal trajectory. In this paper, we propose a data-driven method to approximate semi-global solutions to HJB equations for general high-dimensional nonlinear systems and compute candidate optimal feedback controls in real-time. To accomplish this, we model solutions to HJB equations with neural networks (NNs) trained on data generated without discretizing the state space. Training is made more effective and data-efficient by leveraging the known physics of the problem and using the partially-trained NN to aid in adaptive data generation. We demonstrate the effectiveness of our method by learning solutions to HJB equations corresponding to the attitude control of a six-dimensional nonlinear rigid body, and nonlinear systems of dimension up to 30 arising from the stabilization of a Burgers'-type partial differential equation. The trained NNs are then used for real-time feedback control of these systems.

Submitted 8 February, 2021; v1 submitted 11 July, 2019; originally announced July 2019.

Comments: Added section on validation error computation. Updated convergence test formula and associated results

Journal ref: SIAM Journal on Scientific Computing 43 (2021) A1221-A1247

arXiv:1907.03323 [pdf, other]

Synchronization and temporal nonreciprocity of optical microresonators via spontaneous symmetry breaking

Authors: Da Xu, Zi-Zhao Han, Yu-Kun Lu, Qihuang Gong, Cheng-Wei Qiu, Gang Chen, Yun-Feng Xiao

Abstract: Synchronization is of importance in both fundamental and applied physics, but its demonstration at the micro/nanoscale is mainly limited to low-frequency oscillations like mechanical resonators. Here, we report the synchronization of two coupled optical microresonators, in which the high-frequency resonances in the optical domain are aligned with reduced noise. It is found that two types of synchronization emerge with either the first- or second-order transition, both presenting a process of spontaneous symmetry breaking. In the second-order regime, the synchronization happens with an invariant topological character number and a larger detuning than that of the first-order case. Furthermore, an unconventional hysteresis behavior is revealed for a time-dependent coupling strength, breaking the static limitation and the temporal reciprocity. The synchronization of optical microresonators offers great potential in reconfigurable simulations of many-body physics and scalable photonic devices on a chip.

Submitted 7 July, 2019; originally announced July 2019.

arXiv:1906.04898 [pdf, other]

Hierarchical Taxonomy-Aware and Attentional Graph Capsule RCNNs for Large-Scale Multi-Label Text Classification

Authors: Hao Peng, Jianxin Li, Qiran Gong, Senzhang Wang, Lifang He, Bo Li, Lihong Wang, Philip S. Yu

Abstract: CNNs, RNNs, GCNs, and CapsNets have shown significant insights in representation learning and are widely used in various text mining tasks such as large-scale multi-label text classification. However, most existing deep models for multi-label text classification consider either the non-consecutive and long-distance semantics or the sequential semantics, but how to consider them both coherently is less studied. In addition, most existing methods treat output labels as independent methods, but ignore the hierarchical relations among them, leading to useful semantic information loss. In this paper, we propose a novel hierarchical taxonomy-aware and attentional graph capsule recurrent CNNs framework for large-scale multi-label text classification. Specifically, we first propose to model each document as a word order preserved graph-of-words and normalize it as a corresponding words-matrix representation which preserves both the non-consecutive, long-distance and local sequential semantics. Then the words-matrix is input to the proposed attentional graph capsule recurrent CNNs for more effectively learning the semantic features. To leverage the hierarchical relations among the class labels, we propose a hierarchical taxonomy embedding method to learn their representations, and define a novel weighted margin loss by incorporating the label representation similarity. Extensive evaluations on three datasets show that our model significantly improves the performance of large-scale multi-label text classification by comparing with state-of-the-art approaches.

Submitted 9 June, 2019; originally announced June 2019.

arXiv:1906.04580 [pdf, other]

Fine-grained Event Categorization with Heterogeneous Graph Convolutional Networks

Authors: Hao Peng, Jianxin Li, Qiran Gong, Yangqiu Song, Yuanxing Ning, Kunfeng Lai, Philip S. Yu

Abstract: Events are happening in real-world and real-time, which can be planned and organized occasions involving multiple people and objects. Social media platforms publish a lot of text messages containing public events with comprehensive topics. However, mining social events is challenging due to the heterogeneous event elements in texts and explicit and implicit social network structures. In this paper, we design an event meta-schema to characterize the semantic relatedness of social events and build an event-based heterogeneous information network (HIN) integrating information from external knowledge base, and propose a novel Pair-wise Popularity Graph Convolutional Network (PP-GCN) based fine-grained social event categorization model. We propose a Knowledgeable meta-paths Instances based social Event Similarity (KIES) between events and build a weighted adjacent matrix as input to the PP-GCN model. Comprehensive experiments on real data collections are conducted to compare various social event detection and clustering tasks. Experimental results demonstrate that our proposed framework outperforms other alternative social event categorization techniques.

Submitted 9 June, 2019; originally announced June 2019.

Comments: Accepted by IJCAI'19 (International Joint Conference on Artificial Intelligence)

arXiv:1906.03586 [pdf, other]

Dynamic Network Embedding via Incremental Skip-gram with Negative Sampling

Authors: Hao Peng, Jianxin Li, Hao Yan, Qiran Gong, Senzhang Wang, Lin Liu, Lihong Wang, Xiang Ren

Abstract: Network representation learning, as an approach to learn low dimensional representations of vertices, has attracted considerable research attention recently. It has been proven extremely useful in many machine learning tasks over large graphs. Most existing methods focus on learning the structural representations of vertices in a static network, but cannot guarantee an accurate and efficient embedding in a dynamic network scenario. To address this issue, we present an efficient incremental skip-gram algorithm with negative sampling for dynamic network embedding, and provide a set of theoretical analyses to characterize the performance guarantee. Specifically, we first partition a dynamic network into the updated, including addition/deletion of links and vertices, and the retained networks over time. Then we factorize the objective function of network embedding into the added, vanished and retained parts of the network. Next we provide a new stochastic gradient-based method, guided by the partitions of the network, to update the nodes and the parameter vectors. The proposed algorithm is proven to yield an objective function value with a bounded difference to that of the original objective function. Experimental results show that our proposal can significantly reduce the training time while preserving comparable performance. We also demonstrate the correctness of the theoretical analysis and the practical usefulness of the dynamic network embedding. We perform extensive experiments on multiple real-world large network datasets over multi-label classification and link prediction tasks to evaluate the effectiveness and efficiency of the proposed framework, and up to 22 times speedup has been achieved.

Submitted 9 June, 2019; originally announced June 2019.

Comments: Accepted by China Science Information Science. arXiv admin note: text overlap with arXiv:1811.05932 by other authors

arXiv:1906.00744 [pdf, other]

Hierarchical Decision Making by Generating and Following Natural Language Instructions

Authors: Hengyuan Hu, Denis Yarats, Qucheng Gong, Yuandong Tian, Mike Lewis

Abstract: We explore using latent natural language instructions as an expressive and compositional representation of complex actions for hierarchical decision making. Rather than directly selecting micro-actions, our agent first generates a latent plan in natural language, which is then executed by a separate model. We introduce a challenging real-time strategy game environment in which the actions of a large number of units must be coordinated across long time scales. We gather a dataset of 76 thousand pairs of instructions and executions from human play, and train instructor and executor models. Experiments show that models using natural language as a latent variable significantly outperform models that directly imitate human actions. The compositional structure of language proves crucial to its effectiveness for action representation. We also release our code, models and data.

Submitted 2 October, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

arXiv:1905.13405 [pdf, other]

Luck Matters: Understanding Training Dynamics of Deep ReLU Networks

Authors: Yuandong Tian, Tina Jiang, Qucheng Gong, Ari Morcos

Abstract: We analyze the dynamics of training deep ReLU networks and their implications on generalization capability. Using a teacher-student setting, we discovered a novel relationship between the gradient received by hidden student nodes and the activations of teacher nodes for deep ReLU networks. With this relationship and the assumption of small overlapping teacher node activations, we prove that (1) student nodes whose weights are initialized to be close to teacher nodes converge to them at a faster rate, and (2) in over-parameterized regimes and 2-layer case, while a small set of lucky nodes do converge to the teacher nodes, the fan-out weights of other nodes converge to zero. This framework provides insight into multiple puzzling phenomena in deep learning like over-parameterization, implicit regularization, lottery tickets, etc. We verify our assumption by showing that the majority of BatchNorm biases of pre-trained VGG11/16 models are negative. Experiments on (1) random deep teacher networks with Gaussian inputs, (2) teacher network pre-trained on CIFAR-10 and (3) extensive ablation studies validate our multiple theoretical predictions.

Submitted 28 June, 2019; v1 submitted 31 May, 2019; originally announced May 2019.

arXiv:1905.08010 [pdf, other]

doi 10.1103/PhysRevA.100.033827

Schrödinger cats and steady states in subharmonic generation with Kerr nonlinearities

Authors: Feng-Xiao Sun, Qiongyi He, Qihuang Gong, Run Yan Teh, Margaret D. Reid, Peter D. Drummond

Abstract: We discuss general properties of the equilibrium state of parametric down-conversion in superconducting quantum circuits with detunings and Kerr anharmonicities, in the strongly nonlinear regime. By comparing moments of the steady state and those of a Schrödinger cat, we show that true Schrödinger cats cannot survive in the steady state if there is any single-photon loss. A delta-function 'cat-like' steady-state distribution can be formed, but this only exists in the limit of an extremely large nonlinearity. The steady state is a mixed state, which is more complex than a mixture or linear combination of delta-functions, and whose purity is reduced by driving. We expect this general behaviour to occur in other driven, dissipative quantum subharmonic non-equilibrium open systems.

Submitted 9 August, 2019; v1 submitted 20 May, 2019; originally announced May 2019.

Comments: 10 pages, 4 figures

Journal ref: Phys. Rev. A 100, 033827 (2019)

Showing 51–100 of 192 results for author: Gong, Q