-
Supernova 2020wnt: An Atypical Superluminous Supernova with a Hidden Central Engine
Authors:
Samaporn Tinyanont,
Stan E. Woosley,
Kirsty Taggart,
Ryan J. Foley,
Lin Yan,
Ragnhild Lunnan,
Kyle W. Davis,
Charles D. Kilpatrick,
Matthew R. Siebert,
Steve Schulze,
Chris Ashall,
Ting-Wan Chen,
Kishalay De,
Georgios Dimitriadis,
Dillon Z. Dong,
Christoffer Fremling,
Alexander Gagliano,
Saurabh W. Jha,
David O. Jones,
Mansi M. Kasliwal,
Hao-Yu Miao,
Yen-Chen Pan,
Daniel A. Perley,
Vikram Ravi,
César Rojas-Bravo
, et al. (12 additional authors not shown)
Abstract:
We present observations of a peculiar hydrogen- and helium-poor stripped-envelope (SE) supernova (SN) 2020wnt, primarily in the optical and near-infrared (near-IR). Its peak absolute bolometric magnitude of -20.9 mag and a rise time of 69~days are reminiscent of hydrogen-poor superluminous SNe (SLSNe~I), luminous transients potentially powered by spinning-down magnetars. Before the main peak, ther…
▽ More
We present observations of a peculiar hydrogen- and helium-poor stripped-envelope (SE) supernova (SN) 2020wnt, primarily in the optical and near-infrared (near-IR). Its peak absolute bolometric magnitude of -20.9 mag and a rise time of 69~days are reminiscent of hydrogen-poor superluminous SNe (SLSNe~I), luminous transients potentially powered by spinning-down magnetars. Before the main peak, there is a brief peak lasting <10 days post-explosion, likely caused by interaction with circumstellar medium (CSM) ejected ~years before the SN explosion. The optical spectra near peak lack a hot continuum and OII absorptions, which are signs of heating from a central engine; they quantitatively resemble those of radioactivity-powered H/He-poor Type Ic SESNe. At ~1 year after peak, nebular spectra reveal a blue pseudo-continuum and narrow OI recombination lines associated with magnetar heating. Radio observations rule out strong CSM interactions as the dominant energy source at +266 days post peak. Near-IR observations at +200-300 day reveal carbon monoxide and dust formation, which causes a dramatic optical light curve dip. Pair-instability explosion models predict slow light curve and spectral features incompatible with observations. SN 2020wnt is best explained as a magnetar-powered core-collapse explosion of a 28 Msun pre-SN star. The explosion kinetic energy is significantly larger than the magnetar energy at peak, effectively concealing the magnetar-heated inner ejecta until well after peak. SN 2020wnt falls into a continuum between normal SNe Ic and SLSNe I and demonstrates that optical spectra at peak alone cannot rule out the presence of a central engine.
△ Less
Submitted 30 November, 2022;
originally announced December 2022.
-
Deep Synoptic Array science I: discovery of the host galaxy of FRB 20220912A
Authors:
Vikram Ravi,
Morgan Catha,
Ge Chen,
Liam Connor,
Jakob T. Faber,
James W. Lamb,
Gregg Hallinan,
Charlie Harnach,
Greg Hellbourg,
Rick Hobbs,
David Hodge,
Mark Hodges,
Casey Law,
Paul Rasmussen,
Kritti Sharma,
Myles B. Sherman,
Jun Shi,
Dana Simard,
Reynier Squillace,
Sander Weinreb,
David P. Woody,
Nitika Yadlapalli,
Tomas Ahumada,
Dillon Dong,
Christoffer Fremling
, et al. (3 additional authors not shown)
Abstract:
We report the detection and interferometric localization of the repeating fast radio burst (FRB) source FRB 20220912A during commissioning observations with the Deep Synoptic Array (DSA-110). Two bursts were detected from FRB 20220912A, one each on 2022 October 18 and 2022 October 25. The best-fit position is (R.A. J2000, decl. J2000) = (23:09:04.9, +48:42:25.4), with a 90% confidence error ellips…
▽ More
We report the detection and interferometric localization of the repeating fast radio burst (FRB) source FRB 20220912A during commissioning observations with the Deep Synoptic Array (DSA-110). Two bursts were detected from FRB 20220912A, one each on 2022 October 18 and 2022 October 25. The best-fit position is (R.A. J2000, decl. J2000) = (23:09:04.9, +48:42:25.4), with a 90% confidence error ellipse of $\pm2$ arcsec and $\pm1$ arcsec in right ascension and declination respectively. The two bursts have disparate polarization properties and temporal profiles. We find a Faraday rotation measure that is consistent with the low value of $+0.6$ rad m$^{-2}$ reported by CHIME/FRB. The DSA-110 localization overlaps with the galaxy PSO J347.2702+48.7066 at a redshift $z=0.0771$, which we identify as the likely host. PSO J347.2702$+$48.7066 has a stellar mass of approximately $10^{10}M_{\odot}$, modest internal dust extinction, and a star-formation rate likely in excess of $0.1\,M_{\odot}$ yr$^{-1}$. The host-galaxy contribution to the dispersion measure is likely $\lesssim50$ pc cm$^{-3}$. The FRB 20220912A source is therefore likely viewed along a tenuous plasma column through the host galaxy.
△ Less
Submitted 16 November, 2022;
originally announced November 2022.
-
PAD-Net: An Efficient Framework for Dynamic Networks
Authors:
Shwai He,
Liang Ding,
Daize Dong,
Boan Liu,
Fuqiang Yu,
Dacheng Tao
Abstract:
Dynamic networks, e.g., Dynamic Convolution (DY-Conv) and the Mixture of Experts (MoE), have been extensively explored as they can considerably improve the model's representation power with acceptable computational cost. The common practice in implementing dynamic networks is to convert the given static layers into fully dynamic ones where all parameters are dynamic (at least within a single layer…
▽ More
Dynamic networks, e.g., Dynamic Convolution (DY-Conv) and the Mixture of Experts (MoE), have been extensively explored as they can considerably improve the model's representation power with acceptable computational cost. The common practice in implementing dynamic networks is to convert the given static layers into fully dynamic ones where all parameters are dynamic (at least within a single layer) and vary with the input. However, such a fully dynamic setting may cause redundant parameters and high deployment costs, limiting the applicability of dynamic networks to a broader range of tasks and models. The main contributions of our work are challenging the basic commonsense in dynamic networks and proposing a partially dynamic network, namely PAD-Net, to transform the redundant dynamic parameters into static ones. Also, we further design Iterative Mode Partition to partition dynamic and static parameters efficiently. Our method is comprehensively supported by large-scale experiments with two typical advanced dynamic architectures, i.e., DY-Conv and MoE, on both image classification and GLUE benchmarks. Encouragingly, we surpass the fully dynamic networks by $+0.7\%$ top-1 acc with only $30\%$ dynamic parameters for ResNet-50 and $+1.9\%$ average score in language understanding with only $50\%$ dynamic parameters for BERT. Code will be released at: \url{https://github.com/Shwai-He/PAD-Net}.
△ Less
Submitted 31 May, 2023; v1 submitted 10 November, 2022;
originally announced November 2022.
-
Safety-Critical Ergodic Exploration in Cluttered Environments via Control Barrier Functions
Authors:
Cameron Lerch,
Dayi Dong,
Ian Abraham
Abstract:
In this paper, we address the problem of safe trajectory planning for autonomous search and exploration in constrained, cluttered environments. Guaranteeing safe (collision-free) trajectories is a challenging problem that has garnered significant due to its importance in the successful utilization of robots in search and exploration tasks. This work contributes a method that generates guaranteed s…
▽ More
In this paper, we address the problem of safe trajectory planning for autonomous search and exploration in constrained, cluttered environments. Guaranteeing safe (collision-free) trajectories is a challenging problem that has garnered significant due to its importance in the successful utilization of robots in search and exploration tasks. This work contributes a method that generates guaranteed safety-critical search trajectories in a cluttered environment. Our approach integrates safety-critical constraints using discrete control barrier functions (DCBFs) with ergodic trajectory optimization to enable safe exploration. Ergodic trajectory optimization plans continuous exploratory trajectories that guarantee complete coverage of a space. We demonstrate through simulated and experimental results on a drone that our approach is able to generate trajectories that enable safe and effective exploration. Furthermore, we show the efficacy of our approach for safe exploration using real-world single- and multi- drone platforms.
△ Less
Submitted 29 April, 2023; v1 submitted 8 November, 2022;
originally announced November 2022.
-
2D and 3D CT Radiomic Features Performance Comparison in Characterization of Gastric Cancer: A Multi-center Study
Authors:
Lingwei Meng,
Di Dong,
Xin Chen,
Mengjie Fang,
Rongpin Wang,
**g Li,
Zaiyi Liu,
Jie Tian
Abstract:
Objective: Radiomics, an emerging tool for medical image analysis, is potential towards precisely characterizing gastric cancer (GC). Whether using one-slice 2D annotation or whole-volume 3D annotation remains a long-time debate, especially for heterogeneous GC. We comprehensively compared 2D and 3D radiomic features' representation and discrimination capacity regarding GC, via three tasks.
Meth…
▽ More
Objective: Radiomics, an emerging tool for medical image analysis, is potential towards precisely characterizing gastric cancer (GC). Whether using one-slice 2D annotation or whole-volume 3D annotation remains a long-time debate, especially for heterogeneous GC. We comprehensively compared 2D and 3D radiomic features' representation and discrimination capacity regarding GC, via three tasks.
Methods: Four-center 539 GC patients were retrospectively enrolled and divided into the training and validation cohorts. From 2D or 3D regions of interest (ROIs) annotated by radiologists, radiomic features were extracted respectively. Feature selection and model construction procedures were customed for each combination of two modalities (2D or 3D) and three tasks. Subsequently, six machine learning models (Model_2D^LNM, Model_3D^LNM; Model_2D^LVI, Model_3D^LVI; Model_2D^pT, Model_3D^pT) were derived and evaluated to reflect modalities' performances in characterizing GC. Furthermore, we performed an auxiliary experiment to assess modalities' performances when resampling spacing is different.
Results: Regarding three tasks, the yielded areas under the curve (AUCs) were: Model_2D^LNM's 0.712 (95% confidence interval, 0.613-0.811), Model_3D^LNM's 0.680 (0.584-0.775); Model_2D^LVI's 0.677 (0.595-0.761), Model_3D^LVI's 0.615 (0.528-0.703); Model_2D^pT's 0.840 (0.779-0.901), Model_3D^pT's 0.813 (0.747-0.879). Moreover, the auxiliary experiment indicated that Models_2D are statistically more advantageous than Models3D with different resampling spacings.
Conclusion: Models constructed with 2D radiomic features revealed comparable performances with those constructed with 3D features in characterizing GC.
Significance: Our work indicated that time-saving 2D annotation would be the better choice in GC, and provided a related reference to further radiomics-based researches.
△ Less
Submitted 29 October, 2022;
originally announced October 2022.
-
Maximum gaps in one-dimensional hard-core models
Authors:
Dingding Dong,
Nitya Mani
Abstract:
We study the distribution of the maximum gap size in one-dimensional hard-core models. First, we randomly sequentially pack rods of length $2$ onto an interval of length $L$, subject to the hard-core constraint that rods do not overlap. We find that in a saturated packing, with high probability there is no gap of size $2 - o(1/L)$ between adjacent rods, but there are gaps of size at least…
▽ More
We study the distribution of the maximum gap size in one-dimensional hard-core models. First, we randomly sequentially pack rods of length $2$ onto an interval of length $L$, subject to the hard-core constraint that rods do not overlap. We find that in a saturated packing, with high probability there is no gap of size $2 - o(1/L)$ between adjacent rods, but there are gaps of size at least $2 - 1/L^{1-ε}$ for all $ε> 0$.
We subsequently study a variant of the hard-core process, the one-dimensional ghost hard-core model introduced by Torquato and Stillinger. In this model, we randomly sequentially pack rods of length $2$ onto an interval of length $L$, such that placed rods neither overlap with previously placed rods nor previously considered candidate rods. We find that in the infinite time limit, with high probability the maximum gap between adjacent rods is smaller than $\log L$ but at least $(\log L)^{1-ε}$ for all $ε> 0.$
△ Less
Submitted 18 October, 2022;
originally announced October 2022.
-
SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters
Authors:
Shwai He,
Liang Ding,
Daize Dong,
Miao Zhang,
Dacheng Tao
Abstract:
Adapter Tuning, which freezes the pretrained language models (PLMs) and only fine-tunes a few extra modules, becomes an appealing efficient alternative to the full model fine-tuning. Although computationally efficient, the recent Adapters often increase parameters (e.g. bottleneck dimension) for matching the performance of full model fine-tuning, which we argue goes against their original intentio…
▽ More
Adapter Tuning, which freezes the pretrained language models (PLMs) and only fine-tunes a few extra modules, becomes an appealing efficient alternative to the full model fine-tuning. Although computationally efficient, the recent Adapters often increase parameters (e.g. bottleneck dimension) for matching the performance of full model fine-tuning, which we argue goes against their original intention. In this work, we re-examine the parameter-efficiency of Adapters through the lens of network pruning (we name such plug-in concept as \texttt{SparseAdapter}) and find that SparseAdapter can achieve comparable or better performance than standard Adapters when the sparse ratio reaches up to 80\%. Based on our findings, we introduce an easy but effective setting ``\textit{Large-Sparse}'' to improve the model capacity of Adapters under the same parameter budget. Experiments on five competitive Adapters upon three advanced PLMs show that with proper sparse method (e.g. SNIP) and ratio (e.g. 40\%) SparseAdapter can consistently outperform their corresponding counterpart. Encouragingly, with the \textit{Large-Sparse} setting, we can obtain further appealing gains, even outperforming the full fine-tuning by a large margin. Our code will be released at: https://github.com/Shwai-He/SparseAdapter.
△ Less
Submitted 10 November, 2022; v1 submitted 9 October, 2022;
originally announced October 2022.
-
Sign uncertainty principles and low-degree polynomials
Authors:
Henry Cohn,
Dingding Dong,
Felipe Gonçalves
Abstract:
We prove an asymptotically sharp version of the Bourgain-Clozel-Kahane and Cohn-Gonçalves sign uncertainty principles for polynomials of sublinear degree times a Gaussian, as the dimension tends to infinity. In particular, we show that polynomials whose degree is sublinear in the dimension cannot improve asymptotically on those of degree at most three. This question arises naturally in the study o…
▽ More
We prove an asymptotically sharp version of the Bourgain-Clozel-Kahane and Cohn-Gonçalves sign uncertainty principles for polynomials of sublinear degree times a Gaussian, as the dimension tends to infinity. In particular, we show that polynomials whose degree is sublinear in the dimension cannot improve asymptotically on those of degree at most three. This question arises naturally in the study of both linear programming bounds for sphere packing and the spinless modular bootstrap bound for free bosons.
△ Less
Submitted 4 April, 2024; v1 submitted 4 October, 2022;
originally announced October 2022.
-
Nearly all $k$-SAT functions are unate
Authors:
József Balogh,
Dingding Dong,
Bernard Lidický,
Nitya Mani,
Yufei Zhao
Abstract:
We prove that $1-o(1)$ fraction of all $k$-SAT functions on $n$ Boolean variables are unate (i.e., monotone after first negating some variables), for any fixed positive integer $k$ and as $n \to \infty$. This resolves a conjecture by Bollobás, Brightwell, and Leader from 2003.
We prove that $1-o(1)$ fraction of all $k$-SAT functions on $n$ Boolean variables are unate (i.e., monotone after first negating some variables), for any fixed positive integer $k$ and as $n \to \infty$. This resolves a conjecture by Bollobás, Brightwell, and Leader from 2003.
△ Less
Submitted 3 October, 2023; v1 submitted 11 September, 2022;
originally announced September 2022.
-
Connectedness and Cycle Spaces of Friends-and-Strangers Graphs
Authors:
Colin Defant,
David Dong,
Alan Lee,
Michelle Wei
Abstract:
If $X=(V(X),E(X))$ and $Y=(V(Y),E(Y))$ are $n$-vertex graphs, then their friends-and-strangers graph $\mathsf{FS}(X,Y)$ is the graph whose vertices are the bijections from $V(X)$ to $V(Y)$ in which two bijections $σ$ and $σ'$ are adjacent if and only if there is an edge $\{a,b\}\in E(X)$ such that $\{σ(a),σ(b)\}\in E(Y)$ and $σ'=σ\circ (a\,\,b)$, where $(a\,\,b)$ is the permutation of $V(X)$ that…
▽ More
If $X=(V(X),E(X))$ and $Y=(V(Y),E(Y))$ are $n$-vertex graphs, then their friends-and-strangers graph $\mathsf{FS}(X,Y)$ is the graph whose vertices are the bijections from $V(X)$ to $V(Y)$ in which two bijections $σ$ and $σ'$ are adjacent if and only if there is an edge $\{a,b\}\in E(X)$ such that $\{σ(a),σ(b)\}\in E(Y)$ and $σ'=σ\circ (a\,\,b)$, where $(a\,\,b)$ is the permutation of $V(X)$ that swaps $a$ and $b$. We prove general theorems that provide necessary and/or sufficient conditions for $\mathsf{FS}(X,Y)$ to be connected. As a corollary, we obtain a complete characterization of the graphs $Y$ such that $\mathsf{FS}(\mathsf{Dand}_{k,n},Y)$ is connected, where $\mathsf{Dand}_{k,n}$ is a dandelion graph; this substantially generalizes a theorem of the first author and Kravitz in the case $k=3$. For specific choices of $Y$, we characterize the spider graphs $X$ such that $\mathsf{FS}(X,Y)$ is connected. In a different vein, we study the cycle spaces of friends-and-strangers graphs. Naatz proved that if $X$ is a path graph, then the cycle space of $\mathsf{FS}(X,Y)$ is spanned by $4$-cycles and $6$-cycles; we show that the same statement holds when $X$ is a cycle and $Y$ has domination number at least $3$. When $X$ is a cycle and $Y$ has domination number at least $2$, our proof sheds light on how walks in $\mathsf{FS}(X,Y)$ behave under certain Coxeter moves.
△ Less
Submitted 4 September, 2022;
originally announced September 2022.
-
A Multislice computational model for birefringent scattering
Authors:
Shuqi Mu,
Yingtong Shi,
Yintong Song,
Wei Liu,
Wanxue Wei,
Qihuang Gong,
Dashan Dong,
Kebin Shi
Abstract:
Modeling optical field propagation in highly scattering and birefringent medium is of important interest to many photonic research branches. Despite the existence of numerical electromagnetic simulation tools and beam propagation method frameworks, there has been a lack of an analytical model including the full tensor nature of birefringence, which is an essential forward-propagation tool for appl…
▽ More
Modeling optical field propagation in highly scattering and birefringent medium is of important interest to many photonic research branches. Despite the existence of numerical electromagnetic simulation tools and beam propagation method frameworks, there has been a lack of an analytical model including the full tensor nature of birefringence, which is an essential forward-propagation tool for applications requiring efficiently iterative regularization and end-to-end designs. Here, we present an analytical tool for modeling field propagation in a birefringent scattering medium by including a full set of field tensor elements and multiple scattering characteristics. Birefringence-controlled field propagation experiments were successfully carried out to validate the proposed model.
△ Less
Submitted 23 August, 2022;
originally announced August 2022.
-
Quantum bandit with amplitude amplification exploration in an adversarial environment
Authors:
Byung** Cho,
Yu Xiao,
Pan Hui,
Daoyi Dong
Abstract:
The rapid proliferation of learning systems in an arbitrarily changing environment mandates the need for managing tensions between exploration and exploitation. This work proposes a quantum-inspired bandit learning approach for the learning-and-adapting-based offloading problem where a client observes and learns the costs of each task offloaded to the candidate resource providers, e.g., fog nodes.…
▽ More
The rapid proliferation of learning systems in an arbitrarily changing environment mandates the need for managing tensions between exploration and exploitation. This work proposes a quantum-inspired bandit learning approach for the learning-and-adapting-based offloading problem where a client observes and learns the costs of each task offloaded to the candidate resource providers, e.g., fog nodes. In this approach, a new action update strategy and novel probabilistic action selection are adopted, provoked by the amplitude amplification and collapse postulate in quantum computation theory, respectively. We devise a locally linear map** between a quantum-mechanical phase in a quantum domain, e.g., Grover-type search algorithm, and a distilled probability-magnitude in a value-based decision-making domain, e.g., adversarial multi-armed bandit algorithm. The proposed algorithm is generalized, via the devised map**, for better learning weight adjustments on favourable/unfavourable actions and its effectiveness is verified via simulation.
△ Less
Submitted 20 May, 2023; v1 submitted 15 August, 2022;
originally announced August 2022.
-
On the regularization and optimization in quantum detector tomography
Authors:
Shuixin Xiao,
Yuanlong Wang,
Jun Zhang,
Daoyi Dong,
Shota Yokoyama,
Ian R. Petersen,
Hidehiro Yonezawa
Abstract:
Quantum detector tomography (QDT) is a fundamental technique for calibrating quantum devices and performing quantum engineering tasks. In this paper, we utilize regularization to improve the QDT accuracy whenever the probe states are informationally complete or informationally incomplete. In the informationally complete scenario, without regularization, we optimize the resource (probe state) distr…
▽ More
Quantum detector tomography (QDT) is a fundamental technique for calibrating quantum devices and performing quantum engineering tasks. In this paper, we utilize regularization to improve the QDT accuracy whenever the probe states are informationally complete or informationally incomplete. In the informationally complete scenario, without regularization, we optimize the resource (probe state) distribution by converting it to a semidefinite programming problem. Then in both the informationally complete and informationally incomplete scenarios, we discuss different regularization forms and prove the mean squared error scales as $ O(\frac{1}{N}) $ or tends to a constant with $ N $ state copies under the static assumption. We also characterize the ideal best regularization for the identifiable parameters, accounting for both the informationally complete and informationally incomplete scenarios. Numerical examples demonstrate the effectiveness of different regularization forms and a quantum optical experiment test shows that a suitable regularization form can reach a reduced mean squared error.
△ Less
Submitted 10 April, 2023; v1 submitted 21 July, 2022;
originally announced July 2022.
-
Large-scale Knowledge Distillation with Elastic Heterogeneous Computing Resources
Authors:
Ji Liu,
Daxiang Dong,
Xi Wang,
An Qin,
Xingjian Li,
Patrick Valduriez,
De**g Dou,
Dianhai Yu
Abstract:
Although more layers and more parameters generally improve the accuracy of the models, such big models generally have high computational complexity and require big memory, which exceed the capacity of small devices for inference and incurs long training time. In addition, it is difficult to afford long training time and inference time of big models even in high performance servers, as well. As an…
▽ More
Although more layers and more parameters generally improve the accuracy of the models, such big models generally have high computational complexity and require big memory, which exceed the capacity of small devices for inference and incurs long training time. In addition, it is difficult to afford long training time and inference time of big models even in high performance servers, as well. As an efficient approach to compress a large deep model (a teacher model) to a compact model (a student model), knowledge distillation emerges as a promising approach to deal with the big models. Existing knowledge distillation methods cannot exploit the elastic available computing resources and correspond to low efficiency. In this paper, we propose an Elastic Deep Learning framework for knowledge Distillation, i.e., EDL-Dist. The advantages of EDL-Dist are three-fold. First, the inference and the training process is separated. Second, elastic available computing resources can be utilized to improve the efficiency. Third, fault-tolerance of the training and inference processes is supported. We take extensive experimentation to show that the throughput of EDL-Dist is up to 3.125 times faster than the baseline method (online knowledge distillation) while the accuracy is similar or higher.
△ Less
Submitted 14 July, 2022;
originally announced July 2022.
-
A candidate relativistic tidal disruption event at 340 Mpc
Authors:
Jean J. Somalwar,
Vikram Ravi,
Dillon Z. Dong,
Yuyang Chen,
Shari Breen,
Poonam Chandra,
Tracy Clarke,
Kishalay De,
B. M. Gaensler,
Gregg Hallinan,
Sibasish Laha,
Casey Law,
Steven T. Myers,
Tyler Parsotan,
Wendy Peters,
Emil Polisensky
Abstract:
We present observations of an extreme radio flare, VT J024345.70-284040.08, hereafter VT J0243, from the nucleus of a galaxy with evidence for historic Seyfert activity at redshift $z=0.074$. Between NRAO VLA Sky Survey observations in 1993 to VLA Sky Survey observations in 2018, VT J0243 rose from a ${\sim}$GHz radio luminosity of $νL_ν\lesssim 10^{38}$ erg s$^{-1}$ to $νL_ν{\sim}10^{40}$ erg s…
▽ More
We present observations of an extreme radio flare, VT J024345.70-284040.08, hereafter VT J0243, from the nucleus of a galaxy with evidence for historic Seyfert activity at redshift $z=0.074$. Between NRAO VLA Sky Survey observations in 1993 to VLA Sky Survey observations in 2018, VT J0243 rose from a ${\sim}$GHz radio luminosity of $νL_ν\lesssim 10^{38}$ erg s$^{-1}$ to $νL_ν{\sim}10^{40}$ erg s$^{-1}$, and still continues to brighten. The radio spectral energy distribution (SED) evolution is consistent with a nascent jet that has slowed over ${\sim}3000$ days with an average $0.1 < \langle β\rangle < 0.6$. The jet is energetic (${\sim}10^{51-52}$ erg), and had a radius ${\sim}0.7$ pc in Dec. 2021. X-ray observations suggest a persistent or evolving corona, possibly associated with an accretion disk, and IR and optical observations constrain any high-energy counterpart to be sub-Eddington. VT J0243 may be an example of a young, off-axis radio jet from a slowly evolving tidal disruption event. Other more mysterious triggers for the accretion enhancement and jet launching are possible. In either case, VT J0243 is a unique example of a nascent jet, highlighting the unknown connection between supermassive black holes, the properties of their accretion flows, and jet launching.
△ Less
Submitted 6 July, 2022;
originally announced July 2022.
-
Robust optimization for quantum reinforcement learning control using partial observations
Authors:
Chen Jiang,
Yu Pan,
Zheng-Guang Wu,
Qing Gao,
Daoyi Dong
Abstract:
The current quantum reinforcement learning control models often assume that the quantum states are known a priori for control optimization. However, full observation of quantum state is experimentally infeasible due to the exponential scaling of the number of required quantum measurements on the number of qubits. In this paper, we investigate a robust reinforcement learning method using partial ob…
▽ More
The current quantum reinforcement learning control models often assume that the quantum states are known a priori for control optimization. However, full observation of quantum state is experimentally infeasible due to the exponential scaling of the number of required quantum measurements on the number of qubits. In this paper, we investigate a robust reinforcement learning method using partial observations to overcome this difficulty. This control scheme is compatible with near-term quantum devices, where the noise is prevalent and predetermining the dynamics of quantum state is practically impossible. We show that this simplified control scheme can achieve similar or even better performance when compared to the conventional methods relying on full observation. We demonstrate the effectiveness of this scheme on examples of quantum state control and quantum approximate optimization algorithm. It has been shown that high-fidelity state control can be achieved even if the noise amplitude is at the same level as the control amplitude. Besides, an acceptable level of optimization accuracy can be achieved for QAOA with noisy control Hamiltonian. This robust control optimization model can be trained to compensate the uncertainties in practical quantum computing.
△ Less
Submitted 29 June, 2022;
originally announced June 2022.
-
A Flat-Spectrum Radio Transient at 122 Mpc consistent with an Emerging Pulsar Wind Nebula
Authors:
Dillon Dong,
Gregg Hallinan
Abstract:
We report the discovery and follow-up observations of VT 1137-0337: an unusual radio transient found in our systematic search for extragalactic explosions in the VLA Sky Survey (VLASS). VT 1137-0337 is located in the brightest region of a dwarf starburst galaxy (stellar mass $\sim 10^{8.3} M_{\odot}$, star formation rate $\sim 0.5 M_{\odot}$ yr$^{-1}$) at a luminosity distance of 121.6 Mpc. Its 3…
▽ More
We report the discovery and follow-up observations of VT 1137-0337: an unusual radio transient found in our systematic search for extragalactic explosions in the VLA Sky Survey (VLASS). VT 1137-0337 is located in the brightest region of a dwarf starburst galaxy (stellar mass $\sim 10^{8.3} M_{\odot}$, star formation rate $\sim 0.5 M_{\odot}$ yr$^{-1}$) at a luminosity distance of 121.6 Mpc. Its 3 GHz luminosity of $\sim 2.5 \times 10^{28}$ erg s$^{-1}$ Hz$^{-1}$ is comparable to luminous radio supernovae associated with dense circumstellar interaction and relativistic outflows. However, its broadband radio spectrum - a featureless power law $\propto ν^{-0.35 \pm 0.02}$ over a range of $\gtrsim$10$\times$ in frequency and fading at a rate of $\sim$ 5% per year over 4 years - cannot be directly explained by the shock of a stellar explosion. Jets launched by various classes of accreting black holes also struggle to account for VT 1137-0337's combination of observational properties. Instead, we propose that VT 1137-0337 is a $\sim$decades old pulsar wind nebula that has recently emerged from within the free-free opacity of its surrounding supernova ejecta. If the nebula is powered by spindown, the central neutron star should be highly magnetized, with a surface dipole field of $\sim 10^{13} - 10^{14}$ G and a present-day spin period of $\sim 10 - 100$ ms. Alternatively, the nebula may be powered by the release of magnetic energy from a magnetar. Magnetar nebulae have been proposed to explain the persistent radio sources associated with the repeating fast radio bursts FRB 121102 and FRB 190520B. These FRB persistent sources have not previously been observed as transients, but do bear a striking resemblance to VT 1137-0337 in their radio luminosity, spectral index, and host galaxy properties.
△ Less
Submitted 29 March, 2023; v1 submitted 23 June, 2022;
originally announced June 2022.
-
Subspace Phase Retrieval
Authors:
Mengchu Xu,
Dekuan Dong,
Jian Wang
Abstract:
In recent years, phase retrieval has received much attention in statistics, applied mathematics and optical engineering. In this paper, we propose an efficient algorithm, termed Subspace Phase Retrieval (SPR), which can accurately recover an $n$-dimensional $k$-sparse complex-valued signal $\x$ given its $Ω(k^2\log n)$ magnitude-only Gaussian samples if the minimum nonzero entry of $\x$ satisfies…
▽ More
In recent years, phase retrieval has received much attention in statistics, applied mathematics and optical engineering. In this paper, we propose an efficient algorithm, termed Subspace Phase Retrieval (SPR), which can accurately recover an $n$-dimensional $k$-sparse complex-valued signal $\x$ given its $Ω(k^2\log n)$ magnitude-only Gaussian samples if the minimum nonzero entry of $\x$ satisfies $|x_{\min}| = Ω(\|\x\|/\sqrt{k})$. Furthermore, if the energy sum of the most significant $\sqrt{k}$ elements in $\x$ is comparable to $\|\x\|^2$, the SPR algorithm can exactly recover $\x$ with $Ω(k \log n)$ magnitude-only samples, which attains the information-theoretic sampling complexity for sparse phase retrieval. Numerical Experiments demonstrate that the proposed algorithm achieves the state-of-the-art reconstruction performance compared to existing ones.
△ Less
Submitted 7 April, 2024; v1 submitted 6 June, 2022;
originally announced June 2022.
-
On the number of error correcting codes
Authors:
Dingding Dong,
Nitya Mani,
Yufei Zhao
Abstract:
We show that for a fixed $q$, the number of $q$-ary $t$-error correcting codes of length $n$ is at most $2^{(1 + o(1)) H_q(n,t)}$ for all $t \leq (1 - q^{-1})n - C_q\sqrt{n \log n}$ (for sufficiently large constant $C_q$), where $H_q(n, t) = q^n / V_q(n,t)$ is the Hamming bound and $V_q(n,t)$ is the cardinality of the radius $t$ Hamming ball. This proves a conjecture of Balogh, Treglown, and Wagne…
▽ More
We show that for a fixed $q$, the number of $q$-ary $t$-error correcting codes of length $n$ is at most $2^{(1 + o(1)) H_q(n,t)}$ for all $t \leq (1 - q^{-1})n - C_q\sqrt{n \log n}$ (for sufficiently large constant $C_q$), where $H_q(n, t) = q^n / V_q(n,t)$ is the Hamming bound and $V_q(n,t)$ is the cardinality of the radius $t$ Hamming ball. This proves a conjecture of Balogh, Treglown, and Wagner, who showed the result for $t = o(n^{1/3} (\log n)^{-2/3})$.
△ Less
Submitted 24 May, 2022;
originally announced May 2022.
-
A Dirichlet Process Mixture of Robust Task Models for Scalable Lifelong Reinforcement Learning
Authors:
Zhi Wang,
Chunlin Chen,
Daoyi Dong
Abstract:
While reinforcement learning (RL) algorithms are achieving state-of-the-art performance in various challenging tasks, they can easily encounter catastrophic forgetting or interference when faced with lifelong streaming information. In the paper, we propose a scalable lifelong RL method that dynamically expands the network capacity to accommodate new knowledge while preventing past memories from be…
▽ More
While reinforcement learning (RL) algorithms are achieving state-of-the-art performance in various challenging tasks, they can easily encounter catastrophic forgetting or interference when faced with lifelong streaming information. In the paper, we propose a scalable lifelong RL method that dynamically expands the network capacity to accommodate new knowledge while preventing past memories from being perturbed. We use a Dirichlet process mixture to model the non-stationary task distribution, which captures task relatedness by estimating the likelihood of task-to-cluster assignments and clusters the task models in a latent space. We formulate the prior distribution of the mixture as a Chinese restaurant process (CRP) that instantiates new mixture components as needed. The update and expansion of the mixture are governed by the Bayesian non-parametric framework with an expectation maximization (EM) procedure, which dynamically adapts the model complexity without explicit task boundaries or heuristics. Moreover, we use the domain randomization technique to train robust prior parameters for the initialization of each task model in the mixture, thus the resulting model can better generalize and adapt to unseen tasks. With extensive experiments conducted on robot navigation and locomotion domains, we show that our method successfully facilitates scalable lifelong RL and outperforms relevant existing methods.
△ Less
Submitted 22 May, 2022;
originally announced May 2022.
-
Unification of quantum resources in tripartite systems
Authors:
Dong-Dong Dong,
Geng-Biao Wei,
Xue-Ke Song,
Dong Wang,
Liu Ye
Abstract:
In quantum resource theories (QRTs), there exists evidences of intrinsic connections among different measures of quantum resources, including entanglement, coherence, quantum steering, and so on. However, building the relations among different quantum resources is a vital yet challenging task in multipartite quantum systems. Here, we focus on a unified framework of interpreting the interconversion…
▽ More
In quantum resource theories (QRTs), there exists evidences of intrinsic connections among different measures of quantum resources, including entanglement, coherence, quantum steering, and so on. However, building the relations among different quantum resources is a vital yet challenging task in multipartite quantum systems. Here, we focus on a unified framework of interpreting the interconversions among different quantum resources in tripartite systems. In particular, an exact relation between the generalized geometric measure and the genuinely multipartite concurrence are derived for tripartite entanglement states. Then we obtain the tradeoff relation between the first-order coherence and the genuine tripartite entanglement by the genuinely multipartite concurrence and concurrence fill. Furthermore, the tradeoff relation between the maximum steering inequality violation and concurrence fill for an arbitrary three-qubit pure state is found. In addition, we investigate the close relation between the maximum steering inequality violation and the first-order coherence. The results show that these quantum resources are intrinsic related and can be converted to each other in the framework of QRTs, although they are still regarded to be different.
△ Less
Submitted 15 May, 2022;
originally announced May 2022.
-
Efficient Bayesian Policy Reuse with a Scalable Observation Model in Deep Reinforcement Learning
Authors:
**mei Liu,
Zhi Wang,
Chunlin Chen,
Daoyi Dong
Abstract:
Bayesian policy reuse (BPR) is a general policy transfer framework for selecting a source policy from an offline library by inferring the task belief based on some observation signals and a trained observation model. In this paper, we propose an improved BPR method to achieve more efficient policy transfer in deep reinforcement learning (DRL). First, most BPR algorithms use the episodic return as…
▽ More
Bayesian policy reuse (BPR) is a general policy transfer framework for selecting a source policy from an offline library by inferring the task belief based on some observation signals and a trained observation model. In this paper, we propose an improved BPR method to achieve more efficient policy transfer in deep reinforcement learning (DRL). First, most BPR algorithms use the episodic return as the observation signal that contains limited information and cannot be obtained until the end of an episode. Instead, we employ the state transition sample, which is informative and instantaneous, as the observation signal for faster and more accurate task inference. Second, BPR algorithms usually require numerous samples to estimate the probability distribution of the tabular-based observation model, which may be expensive and even infeasible to learn and maintain, especially when using the state transition sample as the signal. Hence, we propose a scalable observation model based on fitting state transition functions of source tasks from only a small number of samples, which can generalize to any signals observed in the target task. Moreover, we extend the offline-mode BPR to the continual learning setting by expanding the scalable observation model in a plug-and-play fashion, which can avoid negative transfer when faced with new unknown tasks. Experimental results show that our method can consistently facilitate faster and more efficient policy transfer.
△ Less
Submitted 13 July, 2023; v1 submitted 16 April, 2022;
originally announced April 2022.
-
SD-Conv: Towards the Parameter-Efficiency of Dynamic Convolution
Authors:
Shwai He,
Chenbo Jiang,
Daize Dong,
Liang Ding
Abstract:
Dynamic convolution achieves better performance for efficient CNNs at the cost of negligible FLOPs increase. However, the performance increase can not match the significantly expanded number of parameters, which is the main bottleneck in real-world applications. Contrastively, mask-based unstructured pruning obtains a lightweight network by removing redundancy in the heavy network. In this paper,…
▽ More
Dynamic convolution achieves better performance for efficient CNNs at the cost of negligible FLOPs increase. However, the performance increase can not match the significantly expanded number of parameters, which is the main bottleneck in real-world applications. Contrastively, mask-based unstructured pruning obtains a lightweight network by removing redundancy in the heavy network. In this paper, we propose a new framework, \textbf{Sparse Dynamic Convolution} (\textsc{SD-Conv}), to naturally integrate these two paths such that it can inherit the advantage of dynamic mechanism and sparsity. We first design a binary mask derived from a learnable threshold to prune static kernels, significantly reducing the parameters and computational cost but achieving higher performance in Imagenet-1K. We further transfer pretrained models into a variety of downstream tasks, showing consistently better results than baselines. We hope our SD-Conv could be an efficient alternative to conventional dynamic convolutions.
△ Less
Submitted 26 May, 2023; v1 submitted 5 April, 2022;
originally announced April 2022.
-
Depthwise Convolution for Multi-Agent Communication with Enhanced Mean-Field Approximation
Authors:
Donghan Xie,
Zhi Wang,
Chunlin Chen,
Daoyi Dong
Abstract:
Multi-agent settings remain a fundamental challenge in the reinforcement learning (RL) domain due to the partial observability and the lack of accurate real-time interactions across agents. In this paper, we propose a new method based on local communication learning to tackle the multi-agent RL (MARL) challenge within a large number of agents coexisting. First, we design a new communication protoc…
▽ More
Multi-agent settings remain a fundamental challenge in the reinforcement learning (RL) domain due to the partial observability and the lack of accurate real-time interactions across agents. In this paper, we propose a new method based on local communication learning to tackle the multi-agent RL (MARL) challenge within a large number of agents coexisting. First, we design a new communication protocol that exploits the ability of depthwise convolution to efficiently extract local relations and learn local communication between neighboring agents. To facilitate multi-agent coordination, we explicitly learn the effect of joint actions by taking the policies of neighboring agents as inputs. Second, we introduce the mean-field approximation into our method to reduce the scale of agent interactions. To more effectively coordinate behaviors of neighboring agents, we enhance the mean-field approximation by a supervised policy rectification network (PRN) for rectifying real-time agent interactions and by a learnable compensation term for correcting the approximation bias. The proposed method enables efficient coordination as well as outperforms several baseline approaches on the adaptive traffic signal control (ATSC) task and the StarCraft II multi-agent challenge (SMAC).
△ Less
Submitted 1 January, 2023; v1 submitted 6 March, 2022;
originally announced March 2022.
-
Generation of Maximally Entangled States by Lyapunov Control Based on Entanglement Measure
Authors:
Yun-Yan Lee,
Daoyi Dong,
Ciann-Dong Yang
Abstract:
Maximally entangled states (MES) are highly valued in quantum information processing. In quantum control, the creation of MES is typically treated as a state transfer problem with a predefined MES as the target. However, this approach is limited by the requirement to predetermine the MES structure. This paper introduces an improved quantum Lyapunov control approach that relies on the quantum entan…
▽ More
Maximally entangled states (MES) are highly valued in quantum information processing. In quantum control, the creation of MES is typically treated as a state transfer problem with a predefined MES as the target. However, this approach is limited by the requirement to predetermine the MES structure. This paper introduces an improved quantum Lyapunov control approach that relies on the quantum entanglement measure to construct the Lyapunov function, instead of using the distance between quantum states. This strategy enables the preparation of any MES, regardless of whether its structure is known beforehand, using a single control scheme. The proposed entanglement control technique is unaffected by the number of entangled subsystems since it targets the entanglement measure as a scalar. Initially applied to bipartite pure states, this method demonstrates its capability to generate Bell states and their equivalents. Subsequent applications to bipartite mixed states and multipartite systems illustrate that the technique can produce MES with unspecified structures.
△ Less
Submitted 17 June, 2024; v1 submitted 28 February, 2022;
originally announced March 2022.
-
Quantum estimation, control and learning: opportunities and challenges
Authors:
Daoyi Dong,
Ian R Petersen
Abstract:
The development of estimation and control theories for quantum systems is a fundamental task for practical quantum technology. This vision article presents a brief introduction to challenging problems and potential opportunities in the emerging areas of quantum estimation, control and learning. The topics cover quantum state estimation, quantum parameter identification, quantum filtering, quantum…
▽ More
The development of estimation and control theories for quantum systems is a fundamental task for practical quantum technology. This vision article presents a brief introduction to challenging problems and potential opportunities in the emerging areas of quantum estimation, control and learning. The topics cover quantum state estimation, quantum parameter identification, quantum filtering, quantum open-loop control, quantum feedback control, machine learning for estimation and control of quantum systems, and quantum machine learning.
△ Less
Submitted 15 January, 2022;
originally announced January 2022.
-
Echo state graph neural networks with analogue random resistor arrays
Authors:
Shaocong Wang,
Yi Li,
Dingchen Wang,
Woyu Zhang,
Xi Chen,
Danian Dong,
Songqi Wang,
Xumeng Zhang,
Peng Lin,
Claudio Gallicchio,
Xiaoxin Xu,
Qi Liu,
Kwang-Ting Cheng,
Zhongrui Wang,
Dashan Shang,
Ming Liu
Abstract:
Recent years have witnessed an unprecedented surge of interest, from social networks to drug discovery, in learning representations of graph-structured data. However, graph neural networks, the machine learning models for handling graph-structured data, face significant challenges when running on conventional digital hardware, including von Neumann bottleneck incurred by physically separated memor…
▽ More
Recent years have witnessed an unprecedented surge of interest, from social networks to drug discovery, in learning representations of graph-structured data. However, graph neural networks, the machine learning models for handling graph-structured data, face significant challenges when running on conventional digital hardware, including von Neumann bottleneck incurred by physically separated memory and processing units, slowdown of Moore's law due to transistor scaling limit, and expensive training cost. Here we present a novel hardware-software co-design, the random resistor array-based echo state graph neural network, which addresses these challenges. The random resistor arrays not only harness low-cost, nanoscale and stackable resistors for highly efficient in-memory computing using simple physical laws, but also leverage the intrinsic stochasticity of dielectric breakdown to implement random projections in hardware for an echo state network that effectively minimizes the training cost thanks to its fixed and random weights. The system demonstrates state-of-the-art performance on both graph classification using the MUTAG and COLLAB datasets and node classification using the CORA dataset, achieving 34.2x, 93.2x, and 570.4x improvement of energy efficiency and 98.27%, 99.46%, and 95.12% reduction of training cost compared to conventional graph learning on digital hardware, respectively, which may pave the way for the next generation AI system for graph learning.
△ Less
Submitted 30 December, 2021;
originally announced December 2021.
-
On how neural networks enhance quantum state tomography with constrained measurements
Authors:
Hailan Ma,
Daoyi Dong,
Ian R. Petersen,
Chang-Jiang Huang,
Guo-Yong Xiang
Abstract:
Quantum state tomography aiming at reconstructing the density matrix of a quantum state plays an important role in various emerging quantum technologies. Inspired by the intuition that machine learning has favorable robustness and generalization, we propose a deep neural networks based quantum state tomography (DNN-QST) approach, which are applied to three measurement-constrained cases, including…
▽ More
Quantum state tomography aiming at reconstructing the density matrix of a quantum state plays an important role in various emerging quantum technologies. Inspired by the intuition that machine learning has favorable robustness and generalization, we propose a deep neural networks based quantum state tomography (DNN-QST) approach, which are applied to three measurement-constrained cases, including few measurement copies and incomplete measurements as well as noisy measurements. Numerical results demonstrate that DNN-QST exhibits a great potential to achieve high fidelity for quantum state tomography with limited measurement resources and can achieve improved estimation when tomographic measurements suffer from noise. In addition, the results for 2-qubit states from quantum optical devices demonstrate the generalization of DNN-QST and its robustness against possible error in the experimental devices.
△ Less
Submitted 2 August, 2023; v1 submitted 17 November, 2021;
originally announced November 2021.
-
Simultaneous estimation of parameters and the state of an optical parametric oscillator system
Authors:
Qi Yu,
Shota Yokoyama,
Daoyi Dong,
David McManus,
Hidehiro Yonezawa
Abstract:
In this paper, we consider the filtering problem of an optical parametric oscillator (OPO). The OPO pump power may fluctuate due to environmental disturbances, resulting in uncertainty in the system modeling. Thus, both the state and the unknown parameter may need to be estimated simultaneously. We formulate this problem using a state-space representation of the OPO dynamics. Under the assumption…
▽ More
In this paper, we consider the filtering problem of an optical parametric oscillator (OPO). The OPO pump power may fluctuate due to environmental disturbances, resulting in uncertainty in the system modeling. Thus, both the state and the unknown parameter may need to be estimated simultaneously. We formulate this problem using a state-space representation of the OPO dynamics. Under the assumption of Gaussianity and proper constraints, the dual Kalman filter method and the joint extended Kalman filter method are employed to simultaneously estimate the system state and the pump power. Numerical examples demonstrate the effectiveness of the proposed algorithms.
△ Less
Submitted 14 November, 2021;
originally announced November 2021.
-
Human factors engineering research on single pilot operations for large commercial aircraft: Status and prospect
Authors:
Wei Xu,
Yong Chen,
Wenjun Dong,
Dayong Dong,
Liezhong Ge
Abstract:
The civil aviation community is actively exploring and develo** the solutions of single pilot operations SPO for large commercial aircraft. Human factors engineering research for SPO has been launched, and the research mainly focuses on three research solutions: flight deck airborne equipment upgrade, flight support from ground stations, and the combined SPO solution of "flight deck airborne equ…
▽ More
The civil aviation community is actively exploring and develo** the solutions of single pilot operations SPO for large commercial aircraft. Human factors engineering research for SPO has been launched, and the research mainly focuses on three research solutions: flight deck airborne equipment upgrade, flight support from ground stations, and the combined SPO solution of "flight deck airborne equipment upgrade, flight support from ground stations". This paper reviews and analyzez the progress of human factors engineering research on SPO. The preliminary research outcome tends to support the combined SPO solution. However, the current human factors engineering research is not comprehensive and cannot provide a complete human factors engineering solution for SPO. For future human factors engineering research, this paper analyzes the key human factors issues on SPO and points out the gaps in the current research and the areas for future work. Finally, this paper puts forward an overall strategy and recommendations for future human factors engineering research on SPO.
△ Less
Submitted 14 October, 2021;
originally announced October 2021.
-
Stabilizing Preparation of Quantum Gaussian States via Continuous Measurement
Authors:
Liying Bao,
Bo Qi,
Daoyi Dong
Abstract:
This paper provides a stabilizing preparation method for quantum Gaussian states by utilizing continuous measurement. The stochastic evolution of the open quantum system is described in terms of the quantum stochastic master equation. We present necessary and sufficient conditions for the system to have a unique stabilizing steady Gaussian state. The conditions are much weaker than those existing…
▽ More
This paper provides a stabilizing preparation method for quantum Gaussian states by utilizing continuous measurement. The stochastic evolution of the open quantum system is described in terms of the quantum stochastic master equation. We present necessary and sufficient conditions for the system to have a unique stabilizing steady Gaussian state. The conditions are much weaker than those existing results presented in the approach of preparing Gaussian states through environment engineering. Parametric conditions of how to prepare an arbitrary pure Gaussian state are provided. This approach provides more degrees of freedom to choose the system Hamiltonian and the system-environment coupling operators, as compared with the case where dissipation-induced approach is employed. The stabilizing conditions for the case of imperfect measurement efficiency are also presented. These results may benefit practical experimental implementation in preparing quantum Gaussian states.
△ Less
Submitted 26 September, 2021;
originally announced September 2021.
-
Exponentially-enhanced Quantum Non-Hermitian Sensing via Optimized Coherent Drive
Authors:
Liying Bao,
Bo Qi,
Daoyi Dong
Abstract:
Distinct non-Hermitian dynamics has demonstrated its advantages in improving measurement precision over traditional sensing protocols. Multi-mode non-Hermitian lattice dynamics can provide exponentially-enhanced quantum sensing where the quantum Fisher information (QFI) per photon increases exponentially with the lattice size. However, somewhat surprisingly, it was also shown that the quintessenti…
▽ More
Distinct non-Hermitian dynamics has demonstrated its advantages in improving measurement precision over traditional sensing protocols. Multi-mode non-Hermitian lattice dynamics can provide exponentially-enhanced quantum sensing where the quantum Fisher information (QFI) per photon increases exponentially with the lattice size. However, somewhat surprisingly, it was also shown that the quintessential non-Hermitian skin effect does not provide any true advantage. In this paper, we demonstrate the importance of optimizing the phase of the coherent drive, and the position of the injection and detection in multi-mode non-Hermitian quantum sensing. The QFI per photon can be exponentially-enhanced or exponentially-reduced depending on parameters of the drive and detection. Specifically, it is demonstrated that for large amplification by choosing appropriate coherent drive parameters, the non-Hermitian skin effect can provide exponentially-enhanced quantum sensing. Moreover, in the regime beyond linear response, skin-effect can also provide a dramatic advantage as compared to the local perturbation, and the proposed protocol is robust in tuning the amplification factor.
△ Less
Submitted 29 December, 2021; v1 submitted 9 September, 2021;
originally announced September 2021.
-
A transient radio source consistent with a merger-triggered core collapse supernova
Authors:
Dillon Z. Dong,
Gregg Hallinan,
Ehud Nakar,
Anna Y. Q. Ho,
Andrew K. Hughes,
Kenta Hotokezaka,
Steve T. Myers,
Kishalay De,
Kunal Mooley,
Vikram Ravi,
Assaf Horesh,
Mansi M. Kasliwal,
Shri R. Kulkarni
Abstract:
A core-collapse supernova occurs when exothermic fusion ceases in the core of a massive star, typically due to exhaustion of nuclear fuel. Theory predicts that fusion could be interrupted earlier, by merging of the star with a compact binary companion. We report a luminous radio transient, VT J121001+495647, found in the Very Large Array Sky Survey. The radio emission is consistent with supernova…
▽ More
A core-collapse supernova occurs when exothermic fusion ceases in the core of a massive star, typically due to exhaustion of nuclear fuel. Theory predicts that fusion could be interrupted earlier, by merging of the star with a compact binary companion. We report a luminous radio transient, VT J121001+495647, found in the Very Large Array Sky Survey. The radio emission is consistent with supernova ejecta colliding with a dense shell of material, potentially ejected by binary interaction in the centuries prior to explosion. We associate the supernova with an archival X-ray transient, which implies a relativistic jet was launched during the explosion. The combination of an early relativistic jet and late-time dense interaction is consistent with expectations for a merger-driven explosion.
△ Less
Submitted 22 September, 2021; v1 submitted 3 September, 2021;
originally announced September 2021.
-
Path Planning for Cellular-Connected UAV: A DRL Solution with Quantum-Inspired Experience Replay
Authors:
Yuanjian Li,
A. Hamid Aghvami,
Daoyi Dong
Abstract:
In cellular-connected unmanned aerial vehicle (UAV) network, a minimization problem on the weighted sum of time cost and expected outage duration is considered. Taking advantage of UAV's adjustable mobility, an intelligent UAV navigation approach is formulated to achieve the aforementioned optimization goal. Specifically, after map** the navigation task into a Markov decision process (MDP), a de…
▽ More
In cellular-connected unmanned aerial vehicle (UAV) network, a minimization problem on the weighted sum of time cost and expected outage duration is considered. Taking advantage of UAV's adjustable mobility, an intelligent UAV navigation approach is formulated to achieve the aforementioned optimization goal. Specifically, after map** the navigation task into a Markov decision process (MDP), a deep reinforcement learning (DRL) solution with novel quantum-inspired experience replay (QiER) framework is proposed to help the UAV find the optimal flying direction within each time slot, and thus the designed trajectory towards the destination can be generated. Via relating experienced transition's importance to its associated quantum bit (qubit) and applying Grover iteration based amplitude amplification technique, the proposed DRL-QiER solution commits a better trade-off between sampling priority and diversity. Compared to several representative baselines, the effectiveness and supremacy of the proposed DRL-QiER solution are demonstrated and validated in numerical results.
△ Less
Submitted 16 September, 2021; v1 submitted 30 August, 2021;
originally announced August 2021.
-
The nascent milliquasar VT J154843.06+220812.6: tidal disruption event or extreme accretion-state change?
Authors:
Jean J. Somalwar,
Vikram Ravi,
Dillon Dong,
Matthew Graham,
Gregg Hallinan,
Casey Law,
Wenbin Lu,
Steven T. Myers
Abstract:
We present detailed multiwavelength follow up of a nuclear radio flare, VT J154843.06+220812.6, hereafter VT J1548. VT J1548 was selected as a ${\sim}1$ mJy radio flare in 3 GHz observations from the VLA Sky Survey (VLASS). It is located in the nucleus of a low mass ($\log M_{\rm BH}/M_\odot \sim6$) host galaxy with weak or no past AGN activity. VT J1548 is associated with a slow rising (multiple…
▽ More
We present detailed multiwavelength follow up of a nuclear radio flare, VT J154843.06+220812.6, hereafter VT J1548. VT J1548 was selected as a ${\sim}1$ mJy radio flare in 3 GHz observations from the VLA Sky Survey (VLASS). It is located in the nucleus of a low mass ($\log M_{\rm BH}/M_\odot \sim6$) host galaxy with weak or no past AGN activity. VT J1548 is associated with a slow rising (multiple year), bright mid IR flare in the WISE survey, peaking at ${\sim}10\%L_{\rm edd.}$. No associated optical transient is detected, although we cannot rule out a short, early optical flare given the limited data available. Constant late time (${\sim}3$ years post-flare) X-ray emission is detected at ${\sim}10^{42}$ erg s$^{-1}$. The radio SED is consistent with synchrotron emission from an outflow incident on an asymmetric medium. A follow-up, optical spectrum shows transient, bright, high-ionization coronal line emission ($[{\rm Fe\,X}]\,λ6375,[{\rm Fe\,XI}]\,λ7894,[{\rm S\,XII}]\,λ7612$). Transient broad H$α$ is also detected but without corresponding broad H$β$ emission, suggesting high nuclear extinction. We interpret this event as either a tidal disruption event or an extreme flare of an active galactic nucleus, in both cases obscured by a dusty torus. Although these individual properties have been observed in previous transients, the combination is unprecedented. This event highlights the importance of searches across all wave bands for assembling a sample of nuclear flares that spans the range of observable properties and possible triggers.
△ Less
Submitted 27 August, 2021;
originally announced August 2021.
-
Residual Tensor Train: A Quantum-inspired Approach for Learning Multiple Multilinear Correlations
Authors:
Yiwei Chen,
Yu Pan,
Daoyi Dong
Abstract:
States of quantum many-body systems are defined in a high-dimensional Hilbert space, where rich and complex interactions among subsystems can be modelled. In machine learning, complex multiple multilinear correlations may also exist within input features. In this paper, we present a quantum-inspired multilinear model, named Residual Tensor Train (ResTT), to capture the multiple multilinear correla…
▽ More
States of quantum many-body systems are defined in a high-dimensional Hilbert space, where rich and complex interactions among subsystems can be modelled. In machine learning, complex multiple multilinear correlations may also exist within input features. In this paper, we present a quantum-inspired multilinear model, named Residual Tensor Train (ResTT), to capture the multiple multilinear correlations of features, from low to high orders, within a single model. ResTT is able to build a robust decision boundary in a high-dimensional space for solving fitting and classification tasks. In particular, we prove that the fully-connected layer and the Volterra series can be taken as special cases of ResTT. Furthermore, we derive the rule for weight initialization that stabilizes the training of ResTT based on a mean-field analysis. We prove that such a rule is much more relaxed than that of TT, which means ResTT can easily address the vanishing and exploding gradient problem that exists in the existing TT models. Numerical experiments demonstrate that ResTT outperforms the state-of-the-art tensor network and benchmark deep learning models on MNIST and Fashion-MNIST datasets. Moreover, ResTT achieves better performance than other statistical methods on two practical examples with limited data which are known to have complex feature interactions.
△ Less
Submitted 1 August, 2022; v1 submitted 19 August, 2021;
originally announced August 2021.
-
Enumerating k-SAT functions
Authors:
Dingding Dong,
Nitya Mani,
Yufei Zhao
Abstract:
How many $k$-SAT functions on $n$ boolean variables are there? What does a typical such function look like? Bollobás, Brightwell, and Leader conjectured that, for each fixed $k \ge 2$, the number of $k$-SAT functions on $n$ variables is $(1+o(1))2^{\binom{n}{k} + n}$, or equivalently: a $1-o(1)$ fraction of all $k$-SAT functions are unate, i.e., monotone after negating some variables. They proved…
▽ More
How many $k$-SAT functions on $n$ boolean variables are there? What does a typical such function look like? Bollobás, Brightwell, and Leader conjectured that, for each fixed $k \ge 2$, the number of $k$-SAT functions on $n$ variables is $(1+o(1))2^{\binom{n}{k} + n}$, or equivalently: a $1-o(1)$ fraction of all $k$-SAT functions are unate, i.e., monotone after negating some variables. They proved a weaker version of the conjecture for $k=2$. The conjecture was confirmed for $k=2$ by Allen and $k=3$ by Ilinca and Kahn.
We show that the problem of enumerating $k$-SAT functions is equivalent to a Turán density problem for partially directed hypergraphs. Our proof uses the hypergraph container method. Furthermore, we confirm the Bollobás--Brightwell--Leader conjecture for $k=4$ by solving the corresponding Turán density problem. Our solution applies a recent result of Füredi and Maleki on the minimum triangular edge density in a graph of given edge density. In an appendix (by Nitya Mani and Edward Yu), we further confirm the $k=5$ case of the conjecture via a brute force computer search.
△ Less
Submitted 25 April, 2022; v1 submitted 19 July, 2021;
originally announced July 2021.
-
Late-Time Evolution and Modeling of the Off-Axis Gamma-ray Burst Candidate FIRST J141918.9+394036
Authors:
K. P. Mooley,
B. Margalit,
C. J. Law,
D. A. Perley,
A. T. Deller,
T. J. W. Lazio,
M. F. Bietenholz,
T. Shimwell,
H. T. Intema,
B. M. Gaensler,
B. D. Metzger,
D. Z. Dong,
G. Hallinan,
E. O. Ofek,
L. Sironi
Abstract:
We present new radio and optical data, including very long baseline interferometry, as well as archival data analysis, for the luminous decades-long radio transient FIRST J141918.9+394036. The radio data reveal a synchrotron self-absorption peak around 0.3 GHz and a radius of around 1.3 mas (0.5 pc) 26 years post-discovery, indicating a blastwave energy $\sim5 \times 10^{50}$ erg. The optical spec…
▽ More
We present new radio and optical data, including very long baseline interferometry, as well as archival data analysis, for the luminous decades-long radio transient FIRST J141918.9+394036. The radio data reveal a synchrotron self-absorption peak around 0.3 GHz and a radius of around 1.3 mas (0.5 pc) 26 years post-discovery, indicating a blastwave energy $\sim5 \times 10^{50}$ erg. The optical spectrum shows a broad [OIII]$λ$4959,5007 emission-line that may indicate collisional-excitation in the host galaxy, but its association with the transient cannot be ruled out. The properties of the host galaxy are suggestive of a massive stellar progenitor that formed at low metallicity. Based on the radio light curve, blastwave velocity, energetics, nature of the host galaxy and transient rates we find that the properties of FIRST J1419+39 are most consistent with long gamma-ray burst (LGRB) afterglows. Other classes of (optically-discovered) stellar explosions as well as neutron star mergers are disfavored, and invoking any exotic scenario may not be necessary. It is therefore likely that FIRST J1419+39 is an off-axis LGRB afterglow (as suggested by Law et al. and Marcote et al.), and under this premise the inverse beaming fraction is found to be $f_b^{-1}\simeq280^{+700}_{-200}$, corresponding to an average jet half-opening angle $<θ_j>\simeq5^{+4}_{-2}$ degrees (68% confidence), consistent with previous estimates. From the volumetric rate we predict that surveys with the VLA, ASKAP and MeerKAT will find a handful of FIRST J1419+39-like events over the coming years.
△ Less
Submitted 23 November, 2021; v1 submitted 9 July, 2021;
originally announced July 2021.
-
CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation
Authors:
Enda Yu,
Dezun Dong,
Yemao Xu,
Shuo Ouyang,
Xiangke Liao
Abstract:
Communication overhead is the key challenge for distributed training. Gradient compression is a widely used approach to reduce communication traffic. When combining with parallel communication mechanism method like pipeline, gradient compression technique can greatly alleviate the impact of communication overhead. However, there exists two problems of gradient compression technique to be solved. F…
▽ More
Communication overhead is the key challenge for distributed training. Gradient compression is a widely used approach to reduce communication traffic. When combining with parallel communication mechanism method like pipeline, gradient compression technique can greatly alleviate the impact of communication overhead. However, there exists two problems of gradient compression technique to be solved. Firstly, gradient compression brings in extra computation cost, which will delay the next training iteration. Secondly, gradient compression usually leads to the decrease of convergence accuracy.
△ Less
Submitted 6 September, 2021; v1 submitted 20 June, 2021;
originally announced June 2021.
-
Optimal and two-step adaptive quantum detector tomography
Authors:
Shuixin Xiao,
Yuanlong Wang,
Daoyi Dong,
Jun Zhang
Abstract:
Quantum detector tomography is a fundamental technique for calibrating quantum devices and performing quantum engineering tasks. In this paper, we design optimal probe states for detector estimation based on the minimum upper bound of the mean squared error (UMSE) and the maximum robustness. We establish the minimum UMSE and the minimum condition number for quantum detectors and provide concrete e…
▽ More
Quantum detector tomography is a fundamental technique for calibrating quantum devices and performing quantum engineering tasks. In this paper, we design optimal probe states for detector estimation based on the minimum upper bound of the mean squared error (UMSE) and the maximum robustness. We establish the minimum UMSE and the minimum condition number for quantum detectors and provide concrete examples that can achieve optimal detector tomography. In order to enhance the estimation precision, we also propose a two-step adaptive detector tomography algorithm to optimize the probe states adaptively based on a modified fidelity index. We present a sufficient condition on when the estimation error of our two-step strategy scales inversely proportional to the number of state copies. Moreover, the superposition of coherent states is used as probe states for quantum detector tomography and the estimation error is analyzed. Numerical results demonstrate the effectiveness of both the proposed optimal and adaptive quantum detector tomography methods.
△ Less
Submitted 11 January, 2022; v1 submitted 17 June, 2021;
originally announced June 2021.
-
JIZHI: A Fast and Cost-Effective Model-As-A-Service System for Web-Scale Online Inference at Baidu
Authors:
Hao Liu,
Qian Gao,
Jiang Li,
Xiaochao Liao,
Hao Xiong,
Guangxing Chen,
Wenlin Wang,
Guobao Yang,
Zhiwei Zha,
Daxiang Dong,
De**g Dou,
Haoyi Xiong
Abstract:
In modern internet industries, deep learning based recommender systems have became an indispensable building block for a wide spectrum of applications, such as search engine, news feed, and short video clips. However, it remains challenging to carry the well-trained deep models for online real-time inference serving, with respect to the time-varying web-scale traffics from billions of users, in a…
▽ More
In modern internet industries, deep learning based recommender systems have became an indispensable building block for a wide spectrum of applications, such as search engine, news feed, and short video clips. However, it remains challenging to carry the well-trained deep models for online real-time inference serving, with respect to the time-varying web-scale traffics from billions of users, in a cost-effective manner. In this work, we present JIZHI - a Model-as-a-Service system - that per second handles hundreds of millions of online inference requests to huge deep models with more than trillions of sparse parameters, for over twenty real-time recommendation services at Baidu, Inc. In JIZHI, the inference workflow of every recommendation request is transformed to a Staged Event-Driven Pipeline (SEDP), where each node in the pipeline refers to a staged computation or I/O intensive task processor. With traffics of real-time inference requests arrived, each modularized processor can be run in a fully asynchronized way and managed separately. Besides, JIZHI introduces heterogeneous and hierarchical storage to further accelerate the online inference process by reducing unnecessary computations and potential data access latency induced by ultra-sparse model parameters. Moreover, an intelligent resource manager has been deployed to maximize the throughput of JIZHI over the shared infrastructure by searching the optimal resource allocation plan from historical logs and fine-tuning the load shedding policies over intermediate system feedback. Extensive experiments have been done to demonstrate the advantages of JIZHI from the perspectives of end-to-end service latency, system-wide throughput, and resource consumption. JIZHI has helped Baidu saved more than ten million US dollars in hardware and utility costs while handling 200% more traffics without sacrificing inference efficiency.
△ Less
Submitted 3 June, 2021;
originally announced June 2021.
-
Fundamental limits for reciprocal and non-reciprocal non-Hermitian quantum sensing
Authors:
Liying Bao,
Bo Qi,
Daoyi Dong,
Franco Nori
Abstract:
Non-Hermitian dynamics has been widely studied to enhance the precision of quantum sensing; and non-reciprocity can be a powerful resource for non-Hermitian quantum sensing, as non-reciprocity allows to arbitrarily exceed the fundamental bound on the measurement rate of any reciprocal sensors. Here we establish fundamental limits on signal-to-noise ratio for reciprocal and non-reciprocal non-Hermi…
▽ More
Non-Hermitian dynamics has been widely studied to enhance the precision of quantum sensing; and non-reciprocity can be a powerful resource for non-Hermitian quantum sensing, as non-reciprocity allows to arbitrarily exceed the fundamental bound on the measurement rate of any reciprocal sensors. Here we establish fundamental limits on signal-to-noise ratio for reciprocal and non-reciprocal non-Hermitian quantum sensing. In particular, for two-mode linear systems with two coherent drives, an approximately attainable uniform bound on the best possible measurement rate per photon is derived for both reciprocal and non-reciprocal sensors. This bound is only related to the coupling coefficients and, in principle, can be made arbitrarily large. Our results thus demonstrate that a conventional reciprocal sensor with two drives can simulate any non-reciprocal sensor. This work also demonstrates a clear signature on how the excitation signals affect the signal-to-noise ratio in non-Hermitian quantum sensing.
△ Less
Submitted 21 April, 2021;
originally announced April 2021.
-
Rule-Based Reinforcement Learning for Efficient Robot Navigation with Space Reduction
Authors:
Yuanyang Zhu,
Zhi Wang,
Chunlin Chen,
Daoyi Dong
Abstract:
For real-world deployments, it is critical to allow robots to navigate in complex environments autonomously. Traditional methods usually maintain an internal map of the environment, and then design several simple rules, in conjunction with a localization and planning approach, to navigate through the internal map. These approaches often involve a variety of assumptions and prior knowledge. In cont…
▽ More
For real-world deployments, it is critical to allow robots to navigate in complex environments autonomously. Traditional methods usually maintain an internal map of the environment, and then design several simple rules, in conjunction with a localization and planning approach, to navigate through the internal map. These approaches often involve a variety of assumptions and prior knowledge. In contrast, recent reinforcement learning (RL) methods can provide a model-free, self-learning mechanism as the robot interacts with an initially unknown environment, but are expensive to deploy in real-world scenarios due to inefficient exploration. In this paper, we focus on efficient navigation with the RL technique and combine the advantages of these two kinds of methods into a rule-based RL (RuRL) algorithm for reducing the sample complexity and cost of time. First, we use the rule of wall-following to generate a closed-loop trajectory. Second, we employ a reduction rule to shrink the trajectory, which in turn effectively reduces the redundant exploration space. Besides, we give the detailed theoretical guarantee that the optimal navigation path is still in the reduced space. Third, in the reduced space, we utilize the Pledge rule to guide the exploration strategy for accelerating the RL process at the early stage. Experiments conducted on real robot navigation problems in hex-grid environments demonstrate that RuRL can achieve improved navigation performance.
△ Less
Submitted 15 April, 2021;
originally announced April 2021.
-
Bayesian adversarial multi-node bandit for optimal smart grid protection against cyber attacks
Authors:
Jianyu Xu,
Bin Liu,
Huadong Mo,
Daoyi Dong
Abstract:
The cybersecurity of smart grids has become one of key problems in develo** reliable modern power and energy systems. This paper introduces a non-stationary adversarial cost with a variation constraint for smart grids and enables us to investigate the problem of optimal smart grid protection against cyber attacks in a relatively practical scenario. In particular, a Bayesian multi-node bandit (MN…
▽ More
The cybersecurity of smart grids has become one of key problems in develo** reliable modern power and energy systems. This paper introduces a non-stationary adversarial cost with a variation constraint for smart grids and enables us to investigate the problem of optimal smart grid protection against cyber attacks in a relatively practical scenario. In particular, a Bayesian multi-node bandit (MNB) model with adversarial costs is constructed and a new regret function is defined for this model. An algorithm called Thompson-Hedge algorithm is presented to solve the problem and the superior performance of the proposed algorithm is proven in terms of the convergence rate of the regret function. The applicability of the algorithm to real smart grid scenarios is verified and the performance of the algorithm is also demonstrated by numerical examples.
△ Less
Submitted 20 February, 2021;
originally announced April 2021.
-
Dark Modes in Non-Markovian Linear Quantum Systems
Authors:
Shikun Zhang,
Daoyi Dong,
Kun Liu
Abstract:
In this note, we are concerned with dark modes in a class of non-Markovian open quantum systems. Based on a microscopic model, a time-convoluted linear quantum stochastic differential equation and an output equation are derived to describe the system dynamics. The definition of dark modes is given building on the input-output structure of the system. Then, we present a necessary and sufficient con…
▽ More
In this note, we are concerned with dark modes in a class of non-Markovian open quantum systems. Based on a microscopic model, a time-convoluted linear quantum stochastic differential equation and an output equation are derived to describe the system dynamics. The definition of dark modes is given building on the input-output structure of the system. Then, we present a necessary and sufficient condition for the existence of dark modes. Also, the problem of dark mode synthesis via Hamiltonian engineering is constructively solved and an example is presented to illustrate our results.
△ Less
Submitted 30 March, 2021;
originally announced March 2021.
-
FIRST J153350.8+272729: the radio afterglow of a decades-old tidal disruption event
Authors:
Vikram Ravi,
Hannah Dykaar,
Jackson Codd,
Ginevra Zaccagnini,
Dillon Dong,
Maria R. Drout,
Bryan M. Gaensler,
Gregg Hallinan,
Casey Law
Abstract:
We present the discovery of the fading radio transient FIRST J153350.8+272729. The source had a maximum observed 5-GHz radio luminosity of $8\times10^{39}$ erg s$^{-1}$ in 1986, but by 2019 had faded by a factor of nearly 400. It is located 0.15 arcsec from the center of a galaxy (SDSS J153350.89+272729) at 147 Mpc, which shows weak Type II Seyfert activity. We show that a tidal disruption event (…
▽ More
We present the discovery of the fading radio transient FIRST J153350.8+272729. The source had a maximum observed 5-GHz radio luminosity of $8\times10^{39}$ erg s$^{-1}$ in 1986, but by 2019 had faded by a factor of nearly 400. It is located 0.15 arcsec from the center of a galaxy (SDSS J153350.89+272729) at 147 Mpc, which shows weak Type II Seyfert activity. We show that a tidal disruption event (TDE) is the preferred scenario for FIRST J153350.8+272729, although it could plausibly be interpreted as the afterglow of a long-duration gamma-ray burst. This is only the second TDE candidate to be first discovered at radio wavelengths. Its luminosity fills a gap between the radio afterglows of sub-relativistic TDEs in the local universe, and relativistic TDEs at high redshifts. The unusual properties of FIRST J153350.8+272729 (ongoing nuclear activity in the host galaxy, high radio luminosity) motivate more extensive TDE searches in untargeted radio surveys.
△ Less
Submitted 10 February, 2021;
originally announced February 2021.
-
Learning Control of Quantum Systems
Authors:
Daoyi Dong
Abstract:
This paper provides a brief introduction to learning control of quantum systems. In particular, the following aspects are outlined, including gradient-based learning for optimal control of quantum systems, evolutionary computation for learning control of quantum systems, learning-based quantum robust control, and reinforcement learning for quantum control.
This paper provides a brief introduction to learning control of quantum systems. In particular, the following aspects are outlined, including gradient-based learning for optimal control of quantum systems, evolutionary computation for learning control of quantum systems, learning-based quantum robust control, and reinforcement learning for quantum control.
△ Less
Submitted 18 January, 2021;
originally announced January 2021.
-
Deep Reinforcement Learning with Quantum-inspired Experience Replay
Authors:
Qing Wei,
Hailan Ma,
Chunlin Chen,
Daoyi Dong
Abstract:
In this paper, a novel training paradigm inspired by quantum computation is proposed for deep reinforcement learning (DRL) with experience replay. In contrast to traditional experience replay mechanism in DRL, the proposed deep reinforcement learning with quantum-inspired experience replay (DRL-QER) adaptively chooses experiences from the replay buffer according to the complexity and the replayed…
▽ More
In this paper, a novel training paradigm inspired by quantum computation is proposed for deep reinforcement learning (DRL) with experience replay. In contrast to traditional experience replay mechanism in DRL, the proposed deep reinforcement learning with quantum-inspired experience replay (DRL-QER) adaptively chooses experiences from the replay buffer according to the complexity and the replayed times of each experience (also called transition), to achieve a balance between exploration and exploitation. In DRL-QER, transitions are first formulated in quantum representations, and then the preparation operation and the depreciation operation are performed on the transitions. In this progress, the preparation operation reflects the relationship between the temporal difference errors (TD-errors) and the importance of the experiences, while the depreciation operation is taken into account to ensure the diversity of the transitions. The experimental results on Atari 2600 games show that DRL-QER outperforms state-of-the-art algorithms such as DRL-PER and DCRL on most of these games with improved training efficiency, and is also applicable to such memory-based DRL approaches as double network and dueling network.
△ Less
Submitted 6 January, 2021;
originally announced January 2021.
-
Expectation Synchronization Synthesis in Non-Markovian Open Quantum Systems
Authors:
Shikun Zhang,
Kun Liu,
Daoyi Dong,
Xiaoxue Feng,
Feng Pan
Abstract:
In this article, we investigate the problem of engineering synchronization in non-Markovian quantum systems. First, a time-convoluted linear quantum stochastic differential equation is derived which describes the Heisenberg evolution of a localized quantum system driven by multiple colored noise inputs. Then, we define quantum expectation synchronization in an augmented system consisting of two su…
▽ More
In this article, we investigate the problem of engineering synchronization in non-Markovian quantum systems. First, a time-convoluted linear quantum stochastic differential equation is derived which describes the Heisenberg evolution of a localized quantum system driven by multiple colored noise inputs. Then, we define quantum expectation synchronization in an augmented system consisting of two subsystems. We prove that, for two homogenous subsystems, synchronization can always be synthesized without designing direct Hamiltonian coupling given that the degree of non-Markovianity is below a certain threshold. System parameters are explicitly designed to achieve quantum synchronization. Also, a numerical example is presented to illustrate our results.
△ Less
Submitted 10 February, 2021; v1 submitted 4 January, 2021;
originally announced January 2021.
-
Curriculum-based Deep Reinforcement Learning for Quantum Control
Authors:
Hailan Ma,
Daoyi Dong,
Steven X. Ding,
Chunlin Chen
Abstract:
Deep reinforcement learning has been recognized as an efficient technique to design optimal strategies for different complex systems without prior knowledge of the control landscape. To achieve a fast and precise control for quantum systems, we propose a novel deep reinforcement learning approach by constructing a curriculum consisting of a set of intermediate tasks defined by a fidelity threshold…
▽ More
Deep reinforcement learning has been recognized as an efficient technique to design optimal strategies for different complex systems without prior knowledge of the control landscape. To achieve a fast and precise control for quantum systems, we propose a novel deep reinforcement learning approach by constructing a curriculum consisting of a set of intermediate tasks defined by a fidelity threshold. Tasks among a curriculum can be statically determined using empirical knowledge or adaptively generated with the learning process. By transferring knowledge between two successive tasks and sequencing tasks according to their difficulties, the proposed curriculum-based deep reinforcement learning (CDRL) method enables the agent to focus on easy tasks in the early stage, then move onto difficult tasks, and eventually approaches the final task. Numerical simulations on closed quantum systems and open quantum systems demonstrate that the proposed method exhibits improved control performance for quantum systems and also provides an efficient way to identify optimal strategies with fewer control pulses.
△ Less
Submitted 2 January, 2021; v1 submitted 30 December, 2020;
originally announced December 2020.