Search | arXiv e-print repository

doi 10.14778/3665844.3665854

DET-LSH: A Locality-Sensitive Hashing Scheme with Dynamic Encoding Tree for Approximate Nearest Neighbor Search

Authors: Jiuqi Wei, Botao Peng, Xiaodong Lee, Themis Palpanas

Abstract: Locality-sensitive hashing (LSH) is a well-known solution for approximate nearest neighbor (ANN) search in high-dimensional spaces due to its robust theoretical guarantee on query accuracy. Traditional LSH-based methods mainly focus on improving the efficiency and accuracy of the query phase by designing different query strategies, but pay little attention to improving the efficiency of the indexi… ▽ More Locality-sensitive hashing (LSH) is a well-known solution for approximate nearest neighbor (ANN) search in high-dimensional spaces due to its robust theoretical guarantee on query accuracy. Traditional LSH-based methods mainly focus on improving the efficiency and accuracy of the query phase by designing different query strategies, but pay little attention to improving the efficiency of the indexing phase. They typically fine-tune existing data-oriented partitioning trees to index data points and support their query strategies. However, their strategy to directly partition the multi-dimensional space is time-consuming, and performance degrades as the space dimensionality increases. In this paper, we design an encoding-based tree called Dynamic Encoding Tree (DE-Tree) to improve the indexing efficiency and support efficient range queries based on Euclidean distance. Based on DE-Tree, we propose a novel LSH scheme called DET-LSH. DET-LSH adopts a novel query strategy, which performs range queries in multiple independent index DE-Trees to reduce the probability of missing exact NN points, thereby improving the query accuracy. Our theoretical studies show that DET-LSH enjoys probabilistic guarantees on query accuracy. Extensive experiments on real-world datasets demonstrate the superiority of DET-LSH over the state-of-the-art LSH-based methods on both efficiency and accuracy. While achieving better query accuracy than competitors, DET-LSH achieves up to 6x speedup in indexing time and 2x speedup in query time over the state-of-the-art LSH-based methods. This paper was published in PVLDB 2024. △ Less

Submitted 16 June, 2024; originally announced June 2024.

Journal ref: PVLDB, 17(9): 2241 - 2254, 2024

arXiv:2405.03173 [pdf, other]

Performance Upper Bound of Grover-Mixer Quantum Alternating Operator Ansatz

Authors: Ningyi Xie, Jiahua Xu, Tie** Chen, Xinwei Lee, Yoshiyuki Saito, Nobuyoshi Asai, Dongsheng Cai

Abstract: The Quantum Alternating Operator Ansatz (QAOA) represents a branch of quantum algorithms for solving combinatorial optimization problems. A specific variant, the Grover-Mixer Quantum Alternating Operator Ansatz (GM-QAOA), ensures uniform amplitude across states that share equivalent objective values. This property makes the algorithm independent of the problem structure, focusing instead on the di… ▽ More The Quantum Alternating Operator Ansatz (QAOA) represents a branch of quantum algorithms for solving combinatorial optimization problems. A specific variant, the Grover-Mixer Quantum Alternating Operator Ansatz (GM-QAOA), ensures uniform amplitude across states that share equivalent objective values. This property makes the algorithm independent of the problem structure, focusing instead on the distribution of objective values within the problem. In this work, we prove the probability upper bound for measuring a computational basis state from a GM-QAOA circuit with a given depth, which is a critical factor in QAOA cost. Using this, we derive the upper bounds for the probability of sampling an optimal solution, and for the approximation ratio of maximum optimization problems, both dependent on the objective value distribution. Through numerical analysis, we link the distribution to the problem size and build the regression models that relate the problem size, QAOA depth, and performance upper bound. Our results suggest that the GM-QAOA provides a quadratic enhancement in sampling probability and requires circuit depth that scales exponentially with problem size to maintain consistent performance. △ Less

Submitted 24 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

Comments: 19 pages, 7 figures, 1 table

arXiv:2405.01833 [pdf, other]

Feed-Forward Probabilistic Error Cancellation with Noisy Recovery Gates

Authors: Leo Kurosawa, Yoshiyuki Saito, Xinwei Lee, Xinjian Yan, Ningyi Xie, Dongsheng Cai, Nobuyoshi Asai

Abstract: Probabilistic Error Cancellation (PEC) aims to improve the accuracy of expectation values for observables.This is accomplished using the probabilistic insertion of recovery gates, which correspond to the inverse of errors.However, the inserted recovery gates also induce errors. Thus, it is difficult to obtain accurate expectation values with PEC since the estimator of PEC has a bias due to noise i… ▽ More Probabilistic Error Cancellation (PEC) aims to improve the accuracy of expectation values for observables.This is accomplished using the probabilistic insertion of recovery gates, which correspond to the inverse of errors.However, the inserted recovery gates also induce errors. Thus, it is difficult to obtain accurate expectation values with PEC since the estimator of PEC has a bias due to noise induced by recovery gates.To address this challenge, we propose an improved version of PEC that considers the noise resulting from gate insertion, called Feed-Forward PEC (FFPEC). FFPEC provides an unbiased estimator of expectation values by cancelling out the noise induced by recovery gates.We demonstrate that FFPEC yields more accurate expectation values compared to conventional PEC method through numerical simulations with bit-flip and depolarizing noises. △ Less

Submitted 2 May, 2024; originally announced May 2024.

arXiv:2404.19497 [pdf, other]

Light Cone Cancellation for Variational Quantum Eigensolver Ansatz

Authors: Xinjian Yan, Xinwei Lee, Ningyi Xie, Yoshiyuki Saito, Leo Kurosawa, Nobuyoshi Asai, Dongsheng Cai, HoongChuin Lau

Abstract: Variational Quantum Algorithms (VQAs) represent a class of algorithms that utilize a hybrid approach, combining classical and quantum computing techniques. In this approach, classical computers serve as optimizers that update circuit parameters to find approximate solutions to complex problems. In this study, we apply a method known as Light Cone Cancellation (LCC) to optimize variational circuits… ▽ More Variational Quantum Algorithms (VQAs) represent a class of algorithms that utilize a hybrid approach, combining classical and quantum computing techniques. In this approach, classical computers serve as optimizers that update circuit parameters to find approximate solutions to complex problems. In this study, we apply a method known as Light Cone Cancellation (LCC) to optimize variational circuits, effectively reducing the required number of qubits and gates for circuit simulation. We then evaluate the performance of LCC one of the VQAs -- the Variational Quantum Eigensolver (VQE) -- to address the Max-Cut problem. Compared with the Quantum Approximate Optimization Algorithm (QAOA), VQE offers greater degrees of freedom at lower circuit depths. By applying LCC to VQE, we can shift the complexity of circuit simulation from the number of qubits to the number of edges in the graph, i.e., from exponential time to polynomial time. This enables us to solve large problems up to 50 vertices, without actually simulating the entire circuit. From our simulation in a 7-qubit and a 27-qubit noisy devices, we show that LCC yields higher approximation ratios than those cases without LCC, implying that the effect of noise is reduced when LCC is applied. △ Less

Submitted 30 April, 2024; originally announced April 2024.

arXiv:2403.03080 [pdf, other]

Demonstrating efficient and robust bosonic state reconstruction via optimized excitation counting

Authors: Tanjung Krisnanda, Clara Yun Fontaine, Adrian Copetudo, Pengtao Song, Kai Xiang Lee, Ni-Ni Huang, Fernando Valadares, Timothy C. H. Liew, Yvonne Y. Gao

Abstract: Quantum state reconstruction is an essential element in quantum information processing. However, efficient and reliable reconstruction of non-trivial quantum states in the presence of hardware imperfections can be challenging. This task is particularly demanding for high-dimensional states encoded in continuous-variable (CV) systems, as many error-prone measurements are needed to cover the relevan… ▽ More Quantum state reconstruction is an essential element in quantum information processing. However, efficient and reliable reconstruction of non-trivial quantum states in the presence of hardware imperfections can be challenging. This task is particularly demanding for high-dimensional states encoded in continuous-variable (CV) systems, as many error-prone measurements are needed to cover the relevant degrees of freedom of the system in phase space. In this work, we introduce an efficient and robust technique for optimized reconstruction based on excitation number sampling (ORENS). We use a standard bosonic circuit quantum electrodynamics (cQED) setup to experimentally demonstrate the robustness of ORENS and show that it outperforms the existing cQED reconstruction techniques such as Wigner and Husimi Q tomography. Our investigation highlights that ORENS is naturally free of parasitic system dynamics and resilient to decoherence effects in the hardware. Finally, ORENS relies only on the ability to accurately measure the excitation number of the state, making it a versatile and accessible tool for a wide range of CV platforms and readily scalable to multimode systems. Thus, our work provides a crucial and valuable primitive for practical quantum information processing using bosonic modes. △ Less

Submitted 25 March, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

Comments: Main text (6 pages with 4 figures) and Appendices (10 pages with 7 figures and 4 tables)

arXiv:2309.13552 [pdf, other]

Iterative Layerwise Training for Quantum Approximate Optimization Algorithm

Authors: Xinwei Lee, Xinjian Yan, Ningyi Xie, Yoshiyuki Saito, Dongsheng Cai, Nobuyoshi Asai

Abstract: The capability of the quantum approximate optimization algorithm (QAOA) in solving the combinatorial optimization problems has been intensively studied in recent years due to its application in the quantum-classical hybrid regime. Despite having difficulties that are innate in the variational quantum algorithms (VQA), such as barren plateaus and the local minima problem, QAOA remains one of the ap… ▽ More The capability of the quantum approximate optimization algorithm (QAOA) in solving the combinatorial optimization problems has been intensively studied in recent years due to its application in the quantum-classical hybrid regime. Despite having difficulties that are innate in the variational quantum algorithms (VQA), such as barren plateaus and the local minima problem, QAOA remains one of the applications that is suitable for the recent noisy intermediate scale quantum (NISQ) devices. Recent works have shown that the performance of QAOA largely depends on the initial parameters, which motivate parameter initialization strategies to obtain good initial points for the optimization of QAOA. On the other hand, optimization strategies focus on the optimization part of QAOA instead of the parameter initialization. Instead of having absolute advantages, these strategies usually impose trade-offs to the performance of the optimization problems. One of such examples is the layerwise optimization strategy, in which the QAOA parameters are optimized layer-by-layer instead of the full optimization. The layerwise strategy costs less in total compared to the full optimization, in exchange of lower approximation ratio. In this work, we propose the iterative layerwise optimization strategy and explore the possibility for the reduction of optimization cost in solving problems with QAOA. Using numerical simulations, we found out that by combining the iterative layerwise with proper initialization strategies, the optimization cost can be significantly reduced in exchange for a minor reduction in the approximation ratio. We also show that in some cases, the approximation ratio given by the iterative layerwise strategy is even higher than that given by the full optimization. △ Less

Submitted 24 September, 2023; originally announced September 2023.

Comments: 9 pages, 3 figures

arXiv:2308.08785 [pdf, other]

A Feasibility-Preserved Quantum Approximate Solver for the Capacitated Vehicle Routing Problem

Authors: Ningyi Xie, Xinwei Lee, Dongsheng Cai, Yoshiyuki Saito, Nobuyoshi Asai, Hoong Chuin Lau

Abstract: The Capacitated Vehicle Routing Problem (CVRP) is an NP-optimization problem (NPO) that arises in various fields including transportation and logistics. The CVRP extends from the Vehicle Routing Problem (VRP), aiming to determine the most efficient plan for a fleet of vehicles to deliver goods to a set of customers, subject to the limited carrying capacity of each vehicle. As the number of possibl… ▽ More The Capacitated Vehicle Routing Problem (CVRP) is an NP-optimization problem (NPO) that arises in various fields including transportation and logistics. The CVRP extends from the Vehicle Routing Problem (VRP), aiming to determine the most efficient plan for a fleet of vehicles to deliver goods to a set of customers, subject to the limited carrying capacity of each vehicle. As the number of possible solutions skyrockets when the number of customers increases, finding the optimal solution remains a significant challenge. Recently, the Quantum Approximate Optimization Algorithm (QAOA), a quantum-classical hybrid algorithm, has exhibited enhanced performance in certain combinatorial optimization problems compared to classical heuristics. However, its ability diminishes notably in solving constrained optimization problems including the CVRP. This limitation primarily arises from the typical approach of encoding the given problems as penalty-inclusive binary optimization problems. In this case, the QAOA faces challenges in sampling solutions satisfying all constraints. Addressing this, our work presents a new binary encoding for the CVRP, with an alternative objective function of minimizing the shortest path that bypasses the vehicle capacity constraint of the CVRP. The search space is further restricted by the constraint-preserving mixing operation. We examine and discuss the effectiveness of the proposed encoding under the framework of the variant of the QAOA, Quantum Alternating Operator Ansatz (AOA), through its application to several illustrative examples. Compared to the typical QAOA approach, the proposed method not only preserves the feasibility but also achieves a significant enhancement in the probability of measuring optimal solutions. △ Less

Submitted 21 April, 2024; v1 submitted 17 August, 2023; originally announced August 2023.

Comments: 10 pages, 10 figures, 1 table

arXiv:2308.03990 [pdf, ps, other]

NEOLAF, an LLM-powered neural-symbolic cognitive architecture

Authors: Richard Jiarui Tong, Cassie Chen Cao, Timothy Xueqian Lee, Guodong Zhao, Ray Wan, Feiyue Wang, Xiangen Hu, Robin Schmucker, **sheng Pan, Julian Quevedo, Yu Lu

Abstract: This paper presents the Never Ending Open Learning Adaptive Framework (NEOLAF), an integrated neural-symbolic cognitive architecture that models and constructs intelligent agents. The NEOLAF framework is a superior approach to constructing intelligent agents than both the pure connectionist and pure symbolic approaches due to its explainability, incremental learning, efficiency, collaborative and… ▽ More This paper presents the Never Ending Open Learning Adaptive Framework (NEOLAF), an integrated neural-symbolic cognitive architecture that models and constructs intelligent agents. The NEOLAF framework is a superior approach to constructing intelligent agents than both the pure connectionist and pure symbolic approaches due to its explainability, incremental learning, efficiency, collaborative and distributed learning, human-in-the-loop enablement, and self-improvement. The paper further presents a compelling experiment where a NEOLAF agent, built as a problem-solving agent, is fed with complex math problems from the open-source MATH dataset. The results demonstrate NEOLAF's superior learning capability and its potential to revolutionize the field of cognitive architectures and self-improving adaptive instructional systems. △ Less

Submitted 7 August, 2023; originally announced August 2023.

arXiv:2307.08620 [pdf, other]

Magnetic polarizability of a charged pion from four-point functions in lattice QCD

Authors: Frank X. Lee, Walter Wilcox, Andrei Alexandru, Chris Culver

Abstract: Electromagnetic dipole polarizabilities are fundamental properties of a hadron that represent its resistance to deformation under external fields. For a charged hadron, the presence of acceleration and Landau levels complicates the isolation of its deformation energy in the conventional background field method. In this work, we explore a general method based on four-point functions in lattice QCD… ▽ More Electromagnetic dipole polarizabilities are fundamental properties of a hadron that represent its resistance to deformation under external fields. For a charged hadron, the presence of acceleration and Landau levels complicates the isolation of its deformation energy in the conventional background field method. In this work, we explore a general method based on four-point functions in lattice QCD that takes into account all photon, quark and gluon interactions. The electric polarizability ($α_E$) has been determined from the method in a previous proof-of-principle simulation. Here we focus on the magnetic polarizability ($β_M$) using the same quenched Wilson action on a $24^3\times 48$ lattice at $β=6.0$ with pion mass from 1100 to 370 MeV. The results from the connected diagrams show a large cancellation between the elastic and inelastic contributions, leading to a relatively small and negative value for $β_M$ consistent with chiral perturbation theory. We also discuss the mechanism for $α_E+β_M$ from combining the two studies. △ Less

Submitted 26 September, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

Comments: 11 pages, 8 figures, 1 table. Version accepted for publication in PRD. arXiv admin note: substantial text overlap with arXiv:2301.05200

arXiv:2306.12144 [pdf, other]

PrivSketch: A Private Sketch-based Frequency Estimation Protocol for Data Streams

Authors: Ying Li, Xiaodong Lee, Botao Peng, Themis Palpanas, **gan Xue

Abstract: Local differential privacy (LDP) has recently become a popular privacy-preserving data collection technique protecting users' privacy. The main problem of data stream collection under LDP is the poor utility due to multi-item collection from a very large domain. This paper proposes PrivSketch, a high-utility frequency estimation protocol taking advantage of sketches, suitable for private data stre… ▽ More Local differential privacy (LDP) has recently become a popular privacy-preserving data collection technique protecting users' privacy. The main problem of data stream collection under LDP is the poor utility due to multi-item collection from a very large domain. This paper proposes PrivSketch, a high-utility frequency estimation protocol taking advantage of sketches, suitable for private data stream collection. Combining the proposed background information and a decode-first collection-side workflow, PrivSketch improves the utility by reducing the errors introduced by the sketching algorithm and the privacy budget utilization when collecting multiple items. We analytically prove the superior accuracy and privacy characteristics of PrivSketch, and also evaluate them experimentally. Our evaluation, with several diverse synthetic and real datasets, demonstrates that PrivSketch is 1-3 orders of magnitude better than the competitors in terms of utility in both frequency estimation and frequent item estimation, while being up to ~100x faster. △ Less

Submitted 21 June, 2023; originally announced June 2023.

arXiv:2306.11706 [pdf, other]

RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation

Authors: Konstantinos Bousmalis, Giulia Vezzani, Dushyant Rao, Coline Devin, Alex X. Lee, Maria Bauza, Todor Davchev, Yuxiang Zhou, Agrim Gupta, Akhil Raju, Antoine Laurens, Claudio Fantacci, Valentin Dalibard, Martina Zambelli, Murilo Martins, Rugile Pevceviciute, Michiel Blokzijl, Misha Denil, Nathan Batchelor, Thomas Lampe, Emilio Parisotto, Konrad Żołna, Scott Reed, Sergio Gómez Colmenarejo, Jon Scholz , et al. (14 additional authors not shown)

Abstract: The ability to leverage heterogeneous robotic experience from different robots and tasks to quickly master novel skills and embodiments has the potential to transform robot learning. Inspired by recent advances in foundation models for vision and language, we propose a multi-embodiment, multi-task generalist agent for robotic manipulation. This agent, named RoboCat, is a visual goal-conditioned de… ▽ More The ability to leverage heterogeneous robotic experience from different robots and tasks to quickly master novel skills and embodiments has the potential to transform robot learning. Inspired by recent advances in foundation models for vision and language, we propose a multi-embodiment, multi-task generalist agent for robotic manipulation. This agent, named RoboCat, is a visual goal-conditioned decision transformer capable of consuming action-labelled visual experience. This data spans a large repertoire of motor control skills from simulated and real robotic arms with varying sets of observations and actions. With RoboCat, we demonstrate the ability to generalise to new tasks and robots, both zero-shot as well as through adaptation using only 100-1000 examples for the target task. We also show how a trained model itself can be used to generate data for subsequent training iterations, thus providing a basic building block for an autonomous improvement loop. We investigate the agent's capabilities, with large-scale evaluations both in simulation and on three different real robot embodiments. We find that as we grow and diversify its training data, RoboCat not only shows signs of cross-task transfer, but also becomes more efficient at adapting to new tasks. △ Less

Submitted 22 December, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

Comments: Transactions on Machine Learning Research (12/2023)

arXiv:2306.06843 [pdf, other]

Recurrent Attention Networks for Long-text Modeling

Authors: Xianming Li, Zongxi Li, Xiaotian Luo, Haoran Xie, Xing Lee, Yingbin Zhao, Fu Lee Wang, Qing Li

Abstract: Self-attention-based models have achieved remarkable progress in short-text mining. However, the quadratic computational complexities restrict their application in long text processing. Prior works have adopted the chunking strategy to divide long documents into chunks and stack a self-attention backbone with the recurrent structure to extract semantic representation. Such an approach disables par… ▽ More Self-attention-based models have achieved remarkable progress in short-text mining. However, the quadratic computational complexities restrict their application in long text processing. Prior works have adopted the chunking strategy to divide long documents into chunks and stack a self-attention backbone with the recurrent structure to extract semantic representation. Such an approach disables parallelization of the attention mechanism, significantly increasing the training cost and raising hardware requirements. Revisiting the self-attention mechanism and the recurrent structure, this paper proposes a novel long-document encoding model, Recurrent Attention Network (RAN), to enable the recurrent operation of self-attention. Combining the advantages from both sides, the well-designed RAN is capable of extracting global semantics in both token-level and document-level representations, making it inherently compatible with both sequential and classification tasks, respectively. Furthermore, RAN is computationally scalable as it supports parallelization on long document processing. Extensive experiments demonstrate the long-text encoding ability of the proposed RAN model on both classification and sequential tasks, showing its potential for a wide range of applications. △ Less

Submitted 11 June, 2023; originally announced June 2023.

arXiv:2305.16727 [pdf, other]

A Novel real-time arrhythmia detection model using YOLOv8

Authors: Guang Jun Nicholas Ang, Aritejh Kr Goil, Henryk Chan, Jieyi Jeric Lew, Xin Chun Lee, Raihan Bin Ahmad Mustaffa, Timotius Jason, Ze Ting Woon, Bingquan Shen

Abstract: In a landscape characterized by heightened connectivity and mobility, coupled with a surge in cardiovascular ailments, the imperative to curtail healthcare expenses through remote monitoring of cardiovascular health has become more pronounced. The accurate detection and classification of cardiac arrhythmias are pivotal for diagnosing individuals with heart irregularities. This study underscores th… ▽ More In a landscape characterized by heightened connectivity and mobility, coupled with a surge in cardiovascular ailments, the imperative to curtail healthcare expenses through remote monitoring of cardiovascular health has become more pronounced. The accurate detection and classification of cardiac arrhythmias are pivotal for diagnosing individuals with heart irregularities. This study underscores the feasibility of employing electrocardiograms (ECG) measurements in the home environment for real-time arrhythmia detection. Presenting a fresh application for arrhythmia detection, this paper leverages the cutting-edge You-Only-Look-Once (YOLO)v8 algorithm to categorize single-lead ECG signals. We introduce a novel loss-modified YOLOv8 model, fine-tuned on the MIT-BIH arrhythmia dataset, enabling real-time continuous monitoring. The obtained results substantiate the efficacy of our approach, with the model attaining an average accuracy of 99.5% and 0.992 mAP@50, and a rapid detection time of 0.002 seconds on an NVIDIA Tesla V100. Our investigation exemplifies the potential of real-time arrhythmia detection, enabling users to visually interpret the model output within the comfort of their homes. Furthermore, this study lays the groundwork for an extension into a real-time explainable AI (XAI) model capable of deployment in the healthcare sector, thereby significantly advancing the realm of healthcare solutions. △ Less

Submitted 7 January, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

arXiv:2305.05532 [pdf, other]

doi 10.1109/ICPHM57936.2023.10194112

An ensemble of convolution-based methods for fault detection using vibration signals

Authors: Xian Yeow Lee, Aman Kumar, Lasitha Vidyaratne, Aniruddha Rajendra Rao, Ahmed Farahat, Chetan Gupta

Abstract: This paper focuses on solving a fault detection problem using multivariate time series of vibration signals collected from planetary gearboxes in a test rig. Various traditional machine learning and deep learning methods have been proposed for multivariate time-series classification, including distance-based, functional data-oriented, feature-driven, and convolution kernel-based methods. Recent st… ▽ More This paper focuses on solving a fault detection problem using multivariate time series of vibration signals collected from planetary gearboxes in a test rig. Various traditional machine learning and deep learning methods have been proposed for multivariate time-series classification, including distance-based, functional data-oriented, feature-driven, and convolution kernel-based methods. Recent studies have shown using convolution kernel-based methods like ROCKET, and 1D convolutional neural networks with ResNet and FCN, have robust performance for multivariate time-series data classification. We propose an ensemble of three convolution kernel-based methods and show its efficacy on this fault detection problem by outperforming other approaches and achieving an accuracy of more than 98.8\%. △ Less

Submitted 4 May, 2023; originally announced May 2023.

Comments: 12 Pages, 9 Figures, 2 Tables. Accepted at ICPHM 2023

Journal ref: 2023 IEEE International Conference on Prognostics and Health Management (ICPHM)

arXiv:2305.02051 [pdf, other]

Contact Edit: Artist Tools for Intuitive Modeling of Hand-Object Interactions

Authors: Arjun S. Lakshmipathy, Nicole Feng, Yu Xi Lee, Moshe Mahler, Nancy S. Pollard

Abstract: Posing high-contact interactions is challenging and time-consuming, with hand-object interactions being especially difficult due to the large number of degrees of freedom (DOF) of the hand and the fact that humans are experts at judging hand poses. This paper addresses this challenge by elevating contact areas to first-class primitives. We provide \textit{end-to-end art-directable} (EAD) tools to… ▽ More Posing high-contact interactions is challenging and time-consuming, with hand-object interactions being especially difficult due to the large number of degrees of freedom (DOF) of the hand and the fact that humans are experts at judging hand poses. This paper addresses this challenge by elevating contact areas to first-class primitives. We provide \textit{end-to-end art-directable} (EAD) tools to model interactions based on contact areas, directly manipulate contact areas, and compute corresponding poses automatically. To make these operations intuitive and fast, we present a novel axis-based contact model that supports real-time approximately isometry-preserving operations on triangulated surfaces, permits movement between surfaces, and is both robust and scalable to large areas. We show that use of our contact model facilitates high quality posing even for unconstrained, high-DOF custom rigs intended for traditional keyframe-based animation pipelines. We additionally evaluate our approach with comparisons to prior art, ablation studies, user studies, qualitative assessments, and extensions to full-body interaction. △ Less

Submitted 18 May, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

arXiv:2304.05960 [pdf, other]

Quantum Multi-Resolution Measurement with application to Quantum Linear Solver

Authors: Yoshiyuki Saito, Xinwei Lee, Dongsheng Cai, Nobuyoshi Asai

Abstract: Quantum computation consists of a quantum state corresponding to a solution, and measurements with some observables. To obtain a solution with an accuracy $ε$, measurements $O(n/ε^2)$ are required, where $n$ is the size of a problem. The cost of these measurements requires a large computing time for an accurate solution. In this paper, we propose a quantum multi-resolution measurement (QMRM), whic… ▽ More Quantum computation consists of a quantum state corresponding to a solution, and measurements with some observables. To obtain a solution with an accuracy $ε$, measurements $O(n/ε^2)$ are required, where $n$ is the size of a problem. The cost of these measurements requires a large computing time for an accurate solution. In this paper, we propose a quantum multi-resolution measurement (QMRM), which is a hybrid quantum-classical algorithm that gives a solution with an accuracy $ε$ in $O(n\log(1/ε))$ measurements using a pair of functions. The QMRM computational cost with an accuracy $ε$ is smaller than $O(n/ε^2)$. We also propose an algorithm entitled QMRM-QLS (quantum linear solver) for solving a linear system of equations using the Harrow-Hassidim-Lloyd (HHL) algorithm as one of the examples. We perform some numerical experiments that QMRM gives solutions to with an accuracy $ε$ in $O(n\log(1/ε))$ measurements. △ Less

Submitted 12 April, 2023; originally announced April 2023.

arXiv:2303.02347 [pdf, other]

MetaGrad: Adaptive Gradient Quantization with Hypernetworks

Authors: Kaixin Xu, Alina Hui Xiu Lee, Ziyuan Zhao, Zhe Wang, Min Wu, Weisi Lin

Abstract: A popular track of network compression approach is Quantization aware Training (QAT), which accelerates the forward pass during the neural network training and inference. However, not much prior efforts have been made to quantize and accelerate the backward pass during training, even though that contributes around half of the training time. This can be partly attributed to the fact that errors of… ▽ More A popular track of network compression approach is Quantization aware Training (QAT), which accelerates the forward pass during the neural network training and inference. However, not much prior efforts have been made to quantize and accelerate the backward pass during training, even though that contributes around half of the training time. This can be partly attributed to the fact that errors of low-precision gradients during backward cannot be amortized by the training objective as in the QAT setting. In this work, we propose to solve this problem by incorporating the gradients into the computation graph of the next training iteration via a hypernetwork. Various experiments on CIFAR-10 dataset with different CNN network architectures demonstrate that our hypernetwork-based approach can effectively reduce the negative effect of gradient quantization noise and successfully quantizes the gradients to INT4 with only 0.64 accuracy drop for VGG-16 on CIFAR-10. △ Less

Submitted 31 October, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

arXiv:2301.05200 [pdf, other]

doi 10.1103/PhysRevD.108.014512

Charged pion electric polarizability from four-point functions in lattice QCD

Authors: Frank X. Lee, Andrei Alexandru, Chris Culver, Walter Wilcox

Abstract: Polarizabilities reveal valuable information on the internal structure of hadrons in terms of charge and current distributions. For neutral hadrons, the standard approach is the background field method. But for a charged hadron, its acceleration under the applied field complicates the isolation of the polarization energy. In this work, we explore an alternative method based on four-point functions… ▽ More Polarizabilities reveal valuable information on the internal structure of hadrons in terms of charge and current distributions. For neutral hadrons, the standard approach is the background field method. But for a charged hadron, its acceleration under the applied field complicates the isolation of the polarization energy. In this work, we explore an alternative method based on four-point functions in lattice QCD. The approach offers a transparent picture on how polarizabilities arise from photon, quark, and gluon interactions. We carry out a proof-of-concept simulation on the electric polarizability of a charged pion, using quenched Wilson action on a $24^3\times 48$ lattice at $β=6.0$ with pion mass from 1100 to 370 MeV. We show in detail the evaluation and analysis of the four-point correlation functions and report results on charge radius and electric polarizability. Our results from connected diagrams suggest that charged pion $α_E$ is due to a cancellation between elastic and inelastic contributions. It would be interesting to see how the cancellation plays out at smaller pion masses in future simulations. △ Less

Submitted 11 July, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

Comments: 24 pages, 13 figures, 2 tables. Version accepted for publication in PRD

arXiv:2211.09513 [pdf]

doi 10.1088/1742-6596/2595/1/012001

Quantum Approximate Optimization Algorithm Parameter Prediction Using a Convolutional Neural Network

Authors: Ningyi Xie, Xinwei Lee, Dongsheng Cai, Yoshiyuki Saito, Nobuyoshi Asai

Abstract: The Quantum approximate optimization algorithm (QAOA) is a quantum-classical hybrid algorithm aiming to produce approximate solutions for combinatorial optimization problems. In the QAOA, the quantum part prepares a quantum parameterized state that encodes the solution, where the parameters are optimized by a classical optimizer. However, it is difficult to find optimal parameters when the quantum… ▽ More The Quantum approximate optimization algorithm (QAOA) is a quantum-classical hybrid algorithm aiming to produce approximate solutions for combinatorial optimization problems. In the QAOA, the quantum part prepares a quantum parameterized state that encodes the solution, where the parameters are optimized by a classical optimizer. However, it is difficult to find optimal parameters when the quantum circuit becomes deeper. Hence, there is numerous active research on the performance and the optimization cost of QAOA. In this work, we build a convolutional neural network to predict parameters of depth QAOA instance by the parameters from the depth QAOA counterpart. We propose two strategies based on this model. First, we recurrently apply the model to generate a set of initial values for a certain depth QAOA. It successfully initiates depth 10 QAOA instances, whereas each model is only trained with the parameters from depths less than 6. Second, the model is applied repetitively until the maximum expected value is reached. An average approximation ratio of 0.9759 for Max-Cut over 264 Erdős-Rényi graphs is obtained, while the optimizer is only adopted for generating the first input of the model. △ Less

Submitted 16 February, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

Comments: 9 pages, 4 figures, 1 tables

Journal ref: J. Phys.: Conf. Ser. 2595, 012001 (2023)

arXiv:2209.11348 [pdf, other]

A Depth-Progressive Initialization Strategy for Quantum Approximate Optimization Algorithm

Authors: Xinwei Lee, Ningyi Xie, Yoshiyuki Saito, Dongsheng Cai, Nobuyoshi Asai

Abstract: The quantum approximate optimization algorithm (QAOA) is known for its capability and universality in solving combinatorial optimization problems on near-term quantum devices. The results yielded by QAOA depend strongly on its initial variational parameters. Hence, parameters selection for QAOA becomes an active area of research as bad initialization might deteriorate the quality of the results, e… ▽ More The quantum approximate optimization algorithm (QAOA) is known for its capability and universality in solving combinatorial optimization problems on near-term quantum devices. The results yielded by QAOA depend strongly on its initial variational parameters. Hence, parameters selection for QAOA becomes an active area of research as bad initialization might deteriorate the quality of the results, especially at great circuit depths. We first discuss on the patterns of optimal parameters in QAOA in two directions: the angle index and the circuit depth. Then, we discuss on the symmetries and periodicity of the expectation that is used to determine the bounds of the search space. Based on the patterns in optimal parameters and the bounds restriction, we propose a strategy which predicts the new initial parameters by taking the difference between previous optimal parameters. Unlike most other strategies, the strategy we propose does not require multiple trials to ensure success. It only requires one prediction when progressing to the next depth. We compare this strategy with our previously proposed strategy and the layerwise strategy on solving the Max-cut problem, in terms of the approximation ratio and the optimization cost. We also address the non-optimality in previous parameters, which is seldom discussed in other works, despite its importance in explaining the behavior of variational quantum algorithms. △ Less

Submitted 27 September, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

Comments: 10 pages, 4 figures

arXiv:2209.10105 [pdf, ps, other]

Distributed Online Non-convex Optimization with Composite Regret

Authors: Zhanhong Jiang, Aditya Balu, Xian Yeow Lee, Young M. Lee, Chinmay Hegde, Soumik Sarkar

Abstract: Regret has been widely adopted as the metric of choice for evaluating the performance of online optimization algorithms for distributed, multi-agent systems. However, data/model variations associated with agents can significantly impact decisions and requires consensus among agents. Moreover, most existing works have focused on develo** approaches for (either strongly or non-strongly) convex los… ▽ More Regret has been widely adopted as the metric of choice for evaluating the performance of online optimization algorithms for distributed, multi-agent systems. However, data/model variations associated with agents can significantly impact decisions and requires consensus among agents. Moreover, most existing works have focused on develo** approaches for (either strongly or non-strongly) convex losses, and very few results have been obtained regarding regret bounds in distributed online optimization for general non-convex losses. To address these two issues, we propose a novel composite regret with a new network regret-based metric to evaluate distributed online optimization algorithms. We concretely define static and dynamic forms of the composite regret. By leveraging the dynamic form of our composite regret, we develop a consensus-based online normalized gradient (CONGD) approach for pseudo-convex losses, and it provably shows a sublinear behavior relating to a regularity term for the path variation of the optimizer. For general non-convex losses, we first shed light on the regret for the setting of distributed online non-convex learning based on recent advances such that no deterministic algorithm can achieve the sublinear regret. We then develop the distributed online non-convex optimization with composite regret (DINOCO) without access to the gradients, depending on an offline optimization oracle. DINOCO is shown to achieve sublinear regret; to our knowledge, this is the first regret bound for general distributed online non-convex learning. △ Less

Submitted 21 September, 2022; originally announced September 2022.

Comments: 41 pages, presented in allerton conference 2022

arXiv:2205.03353 [pdf, other]

How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation

Authors: Alex X. Lee, Coline Devin, Jost Tobias Springenberg, Yuxiang Zhou, Thomas Lampe, Abbas Abdolmaleki, Konstantinos Bousmalis

Abstract: Reinforcement learning (RL) has been shown to be effective at learning control from experience. However, RL typically requires a large amount of online interaction with the environment. This limits its applicability to real-world settings, such as in robotics, where such interaction is expensive. In this work we investigate ways to minimize online interactions in a target task, by reusing a subopt… ▽ More Reinforcement learning (RL) has been shown to be effective at learning control from experience. However, RL typically requires a large amount of online interaction with the environment. This limits its applicability to real-world settings, such as in robotics, where such interaction is expensive. In this work we investigate ways to minimize online interactions in a target task, by reusing a suboptimal policy we might have access to, for example from training on related prior tasks, or in simulation. To this end, we develop two RL algorithms that can speed up training by using not only the action distributions of teacher policies, but also data collected by such policies on the task at hand. We conduct a thorough experimental study of how to use suboptimal teachers on a challenging robotic manipulation benchmark on vision-based stacking with diverse objects. We compare our methods to offline, online, offline-to-online, and kickstarting RL algorithms. By doing so, we find that training on data from both the teacher and student, enables the best performance for limited data budgets. We examine how to best allocate a limited data budget -- on the target task -- between the teacher and the student policy, and report experiments using varying budgets, two teachers with different degrees of suboptimality, and five stacking tasks that require a diverse set of behaviors. Our analysis, both in simulation and in the real world, shows that our approach is the best across data budgets, while standard offline RL from teacher rollouts is surprisingly effective when enough data is given. △ Less

Submitted 6 May, 2022; originally announced May 2022.

arXiv:2203.15629 [pdf, other]

Stochastic Conservative Contextual Linear Bandits

Authors: Jiabin Lin, Xian Yeow Lee, Talukder Jubery, Shana Moothedath, Soumik Sarkar, Baskar Ganapathysubramanian

Abstract: Many physical systems have underlying safety considerations that require that the strategy deployed ensures the satisfaction of a set of constraints. Further, often we have only partial information on the state of the system. We study the problem of safe real-time decision making under uncertainty. In this paper, we formulate a conservative stochastic contextual bandit formulation for real-time de… ▽ More Many physical systems have underlying safety considerations that require that the strategy deployed ensures the satisfaction of a set of constraints. Further, often we have only partial information on the state of the system. We study the problem of safe real-time decision making under uncertainty. In this paper, we formulate a conservative stochastic contextual bandit formulation for real-time decision making when an adversary chooses a distribution on the set of possible contexts and the learner is subject to certain safety/performance constraints. The learner observes only the context distribution and the exact context is unknown, and the goal is to develop an algorithm that selects a sequence of optimal actions to maximize the cumulative reward without violating the safety constraints at any time step. By leveraging the UCB algorithm for this setting, we propose a conservative linear UCB algorithm for stochastic bandits with context distribution. We prove an upper bound on the regret of the algorithm and show that it can be decomposed into three terms: (i) an upper bound for the regret of the standard linear UCB algorithm, (ii) a constant term (independent of time horizon) that accounts for the loss of being conservative in order to satisfy the safety constraint, and (ii) a constant term (independent of time horizon) that accounts for the loss for the contexts being unknown and only the distribution being known. To validate the performance of our approach we perform extensive simulations on synthetic data and on real-world maize data collected through the Genomes to Fields (G2F) initiative. △ Less

Submitted 29 March, 2022; originally announced March 2022.

arXiv:2112.03355 [pdf, other]

doi 10.1103/PhysRevD.105.054020

Pole position of the $a_1(1260)$ resonance in a three-body unitary framework

Authors: Daniel Sadasivan, Andrei Alexandru, Hakan Akdag, Felipe Amorim, Ruairí Brett, Chris Culver, Michael Döring, Frank X. Lee, Maxim Mai

Abstract: Masses, widths, and branching ratios of hadronic resonances are quantified by their pole positions and residues with respect to transition amplitudes on the Riemann sheets of the complex energy-plane. In this study we discuss the analytic structure in the physical energy region of three-body scattering amplitudes on such manifolds. As an application, we determine the pole position of the… ▽ More Masses, widths, and branching ratios of hadronic resonances are quantified by their pole positions and residues with respect to transition amplitudes on the Riemann sheets of the complex energy-plane. In this study we discuss the analytic structure in the physical energy region of three-body scattering amplitudes on such manifolds. As an application, we determine the pole position of the $a_1(1260)$ meson from the ALEPH experiment by allowing for $πρ$ coupled channels in S- and D-wave. We find it to be $\sqrt{s_0}=(1232^{+15+9}_{-0-11}-i266^{+0+15}_{-22-27})~\text{MeV}$. △ Less

Submitted 28 February, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

Comments: 17 pages, 13 figures

arXiv:2112.02813 [pdf, other]

MDPGT: Momentum-based Decentralized Policy Gradient Tracking

Authors: Zhanhong Jiang, Xian Yeow Lee, Sin Yong Tan, Kai Liang Tan, Aditya Balu, Young M. Lee, Chinmay Hegde, Soumik Sarkar

Abstract: We propose a novel policy gradient method for multi-agent reinforcement learning, which leverages two different variance-reduction techniques and does not require large batches over iterations. Specifically, we propose a momentum-based decentralized policy gradient tracking (MDPGT) where a new momentum-based variance reduction technique is used to approximate the local policy gradient surrogate wi… ▽ More We propose a novel policy gradient method for multi-agent reinforcement learning, which leverages two different variance-reduction techniques and does not require large batches over iterations. Specifically, we propose a momentum-based decentralized policy gradient tracking (MDPGT) where a new momentum-based variance reduction technique is used to approximate the local policy gradient surrogate with importance sampling, and an intermediate parameter is adopted to track two consecutive policy gradient surrogates. Moreover, MDPGT provably achieves the best available sample complexity of $\mathcal{O}(N^{-1}ε^{-3})$ for converging to an $ε$-stationary point of the global average of $N$ local performance functions (possibly nonconcave). This outperforms the state-of-the-art sample complexity in decentralized model-free reinforcement learning, and when initialized with a single trajectory, the sample complexity matches those obtained by the existing decentralized policy gradient methods. We further validate the theoretical claim for the Gaussian policy function. When the required error tolerance $ε$ is small enough, MDPGT leads to a linear speed up, which has been previously established in decentralized stochastic optimization, but not for reinforcement learning. Lastly, we provide empirical results on a multi-agent reinforcement learning benchmark environment to support our theoretical findings. △ Less

Submitted 6 December, 2021; originally announced December 2021.

arXiv:2110.06192 [pdf, other]

Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes

Authors: Alex X. Lee, Coline Devin, Yuxiang Zhou, Thomas Lampe, Konstantinos Bousmalis, Jost Tobias Springenberg, Arunkumar Byravan, Abbas Abdolmaleki, Nimrod Gileadi, David Khosid, Claudio Fantacci, Jose Enrique Chen, Akhil Raju, Rae Jeong, Michael Neunert, Antoine Laurens, Stefano Saliceti, Federico Casarini, Martin Riedmiller, Raia Hadsell, Francesco Nori

Abstract: We study the problem of robotic stacking with objects of complex geometry. We propose a challenging and diverse set of such objects that was carefully designed to require strategies beyond a simple "pick-and-place" solution. Our method is a reinforcement learning (RL) approach combined with vision-based interactive policy distillation and simulation-to-reality transfer. Our learned policies can ef… ▽ More We study the problem of robotic stacking with objects of complex geometry. We propose a challenging and diverse set of such objects that was carefully designed to require strategies beyond a simple "pick-and-place" solution. Our method is a reinforcement learning (RL) approach combined with vision-based interactive policy distillation and simulation-to-reality transfer. Our learned policies can efficiently handle multiple object combinations in the real world and exhibit a large variety of stacking skills. In a large experimental study, we investigate what choices matter for learning such general vision-based agents in simulation, and what affects optimal transfer to the real robot. We then leverage data collected by such policies and improve upon them with offline RL. A video and a blog post of our work are provided as supplementary material. △ Less

Submitted 3 November, 2021; v1 submitted 12 October, 2021; originally announced October 2021.

Comments: CoRL 2021. Video: https://dpmd.ai/robotics-stacking-YT . Blog: https://dpmd.ai/robotics-stacking . Code: https://github.com/deepmind/rgb_stacking

arXiv:2110.03750 [pdf, other]

doi 10.22323/1.396.0235

Higher order quantization conditions for two spinless particles

Authors: Frank X. Lee, Andrei Alexandru, Ruairí Brett

Abstract: Lattice QCD calculations of scattering phaseshifts and resonance parameters in the two-body sector are becoming precision studies. Early calculations employed Lüscher's formula for extracting these quantities at lowest order. As the calculations become more ambitious, higher-order relations are required. In this study we derive higher-order quantization conditions and introduce a method to transpa… ▽ More Lattice QCD calculations of scattering phaseshifts and resonance parameters in the two-body sector are becoming precision studies. Early calculations employed Lüscher's formula for extracting these quantities at lowest order. As the calculations become more ambitious, higher-order relations are required. In this study we derive higher-order quantization conditions and introduce a method to transparently cross-check our results. This is an important step given the involved derivations of these formulae. We derive quantization conditions up to $\ell=5$ partial waves in both cubic and elongated geometries, and for states with zero and non-zero total momentum. All 45 quantization conditions we include here (22 in cubic box, 23 in elongated box) pass our cross-check test. △ Less

Submitted 7 October, 2021; originally announced October 2021.

Comments: 10 pages, 3 figures, 1 table, presented at The 38th International Symposium on Lattice Field Theory, LATTICE2021 26th-30th July, 2021 Zoom/Gather@Massachusetts Institute of Technology

arXiv:2110.03148 [pdf, other]

Measuring charged particle polarizabilities on the lattice without background fields

Authors: Walter Wilcox, Frank X. Lee

Abstract: We show how to compute electromagnetic polarizabilities of charged hadrons without the use of background fields in lattice QCD. The low-energy behavior of the Compton scattering amplitude is matched to matrix elements of current-current correlation functions on the lattice. Working in momentum space, formulas for electric polarizability ($α_E$) and magnetic polarizability ($β_M$) are derived for b… ▽ More We show how to compute electromagnetic polarizabilities of charged hadrons without the use of background fields in lattice QCD. The low-energy behavior of the Compton scattering amplitude is matched to matrix elements of current-current correlation functions on the lattice. Working in momentum space, formulas for electric polarizability ($α_E$) and magnetic polarizability ($β_M$) are derived for both charged pion and proton. Lattice four-point correlation functions are constructed from quark and gluon fields to be used in Monte-Carlo simulations. We also draw attention to the potential of four-point functions as a multi-purpose tool for hadron structure. △ Less

Submitted 6 October, 2021; originally announced October 2021.

Comments: 9 pages, 1 figure, presented at the 38th International Symposium on Lattice Field Theory, LATTICE2021, July 36-30, 2021 Zoom/Gather@Massachusetts Institute of Technology

arXiv:2109.12073 [pdf, other]

A Graph Policy Network Approach for Volt-Var Control in Power Distribution Systems

Authors: Xian Yeow Lee, Soumik Sarkar, Yubo Wang

Abstract: Volt-var control (VVC) is the problem of operating power distribution systems within healthy regimes by controlling actuators in power systems. Existing works have mostly adopted the conventional routine of representing the power systems (a graph with tree topology) as vectors to train deep reinforcement learning (RL) policies. We propose a framework that combines RL with graph neural networks and… ▽ More Volt-var control (VVC) is the problem of operating power distribution systems within healthy regimes by controlling actuators in power systems. Existing works have mostly adopted the conventional routine of representing the power systems (a graph with tree topology) as vectors to train deep reinforcement learning (RL) policies. We propose a framework that combines RL with graph neural networks and study the benefits and limitations of graph-based policy in the VVC setting. Our results show that graph-based policies converge to the same rewards asymptotically however at a slower rate when compared to vector representation counterpart. We conduct further analysis on the impact of both observations and actions: on the observation end, we examine the robustness of graph-based policy on two typical data acquisition errors in power systems, namely sensor communication failure and measurement misalignment. On the action end, we show that actuators have various impacts on the system, thus using a graph representation induced by power systems topology may not be the optimal choice. In the end, we conduct a case study to demonstrate that the choice of readout function architecture and graph augmentation can further improve training performance and robustness. △ Less

Submitted 20 June, 2022; v1 submitted 24 September, 2021; originally announced September 2021.

Comments: Presented at NeurIPS 2021 Deep RL Workshop

arXiv:2109.03970 [pdf, other]

PowerGym: A Reinforcement Learning Environment for Volt-Var Control in Power Distribution Systems

Authors: Ting-Han Fan, Xian Yeow Lee, Yubo Wang

Abstract: We introduce PowerGym, an open-source reinforcement learning environment for Volt-Var control in power distribution systems. Following OpenAI Gym APIs, PowerGym targets minimizing power loss and voltage violations under physical networked constraints. PowerGym provides four distribution systems (13Bus, 34Bus, 123Bus, and 8500Node) based on IEEE benchmark systems and design variants for various con… ▽ More We introduce PowerGym, an open-source reinforcement learning environment for Volt-Var control in power distribution systems. Following OpenAI Gym APIs, PowerGym targets minimizing power loss and voltage violations under physical networked constraints. PowerGym provides four distribution systems (13Bus, 34Bus, 123Bus, and 8500Node) based on IEEE benchmark systems and design variants for various control difficulties. To foster generalization, PowerGym offers a detailed customization guide for users working with their distribution systems. As a demonstration, we examine state-of-the-art reinforcement learning algorithms in PowerGym and validate the environment by studying controller behaviors. The repository is available at \url{https://github.com/siemens/powergym}. △ Less

Submitted 14 March, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

Comments: The 4th Annual Learning for Dynamics & Control Conference (L4DC) 2022

arXiv:2108.07744 [pdf, other]

An Iterative Improvement Method for HHL algorithm for Solving Linear System of Equations

Authors: Yoshiyuki Saito, Xinwei Lee, Dongsheng Cai, Nobuyoshi Asai

Abstract: We propose an iterative improvement method for the Harrow-Hassidim-Lloyd (HHL) algorithm to solve a linear system of equations. This is a quantum-classical hybrid algorithm. The accuracy is essential to solve the linear system of equations. However, the accuracy of the HHL algorithm is limited by the number of quantum bits used to express the eigenvalues of the matrix. Our iterative method improve… ▽ More We propose an iterative improvement method for the Harrow-Hassidim-Lloyd (HHL) algorithm to solve a linear system of equations. This is a quantum-classical hybrid algorithm. The accuracy is essential to solve the linear system of equations. However, the accuracy of the HHL algorithm is limited by the number of quantum bits used to express the eigenvalues of the matrix. Our iterative method improves the accuracy of the HHL solutions, and gives higher accuracy which surpasses the accuracy limited by the number of quantum bits. In practical HHL algorithm, a huge number of measurements is required to obtain good accuracy, even if we provide a sufficient number of quantum bits for the eigenvalue expression, since the solution is statistically processed from the measurements. Our improved iterative method can reduce the number of measurements. Moreover, the sign information for each eigenstate of the solution is lost once the measurement is made, although the sign is significant. Therefore, the naïve iterative method of the HHL algorithm may slow down, especially, when the solution includes wrong signs. In this paper, we propose and evaluate an improved iterative method for the HHL algorithm that is robust against the sign information loss, in terms of the number of iterations and the computational accuracy. △ Less

Submitted 17 August, 2021; originally announced August 2021.

Comments: 7 pages, 7 figures

arXiv:2108.05288 [pdf, other]

doi 10.1109/QCE52317.2021.00016

Parameters Fixing Strategy for Quantum Approximate Optimization Algorithm

Authors: Xinwei Lee, Yoshiyuki Saito, Dongsheng Cai, Nobuyoshi Asai

Abstract: The quantum approximate optimization algorithm (QAOA) has numerous promising applications in solving the combinatorial optimization problems on near-term Noisy Intermediate Scalable Quantum (NISQ) devices. QAOA has a quantum-classical hybrid structure. Its quantum part consists of a parameterized alternating operator ansatz, and its classical part comprises an optimization algorithm, which optimiz… ▽ More The quantum approximate optimization algorithm (QAOA) has numerous promising applications in solving the combinatorial optimization problems on near-term Noisy Intermediate Scalable Quantum (NISQ) devices. QAOA has a quantum-classical hybrid structure. Its quantum part consists of a parameterized alternating operator ansatz, and its classical part comprises an optimization algorithm, which optimizes the parameters to maximize the expectation value of the problem Hamiltonian. This expectation value depends highly on the parameters, this implies that a set of good parameters leads to an accurate solution. However, at large circuit depth of QAOA, it is difficult to achieve global optimization due to the multiple occurrences of local minima or maxima. In this paper, we propose a parameters fixing strategy which gives high approximation ratio on average, even at large circuit depths, by initializing QAOA with the optimal parameters obtained from the previous depths. We test our strategy on the Max-cut problem of certain classes of graphs such as the 3-regular graphs and the Erdös-Rényi graphs. △ Less

Submitted 11 August, 2021; originally announced August 2021.

Comments: 7 pages, 5 figures, accepted in the IEEE International Conference on Quantum Computing and Engineering

arXiv:2107.04430 [pdf, other]

doi 10.1103/PhysRevD.105.054517

Higher order finite volume quantization conditions for two spinless particles

Authors: Frank X. Lee, Andrei Alexandru, Ruairí Brett

Abstract: Lattice QCD calculations of scattering phaseshifts and resonance parameters in the two-body sector are becoming precision studies. Early calculations employed Lüscher's formula for extracting these quantities at lowest order. As the calculations become more ambitious, higher-order relations are required. In this study we present a way to validate the higher-order quantization conditions. This is a… ▽ More Lattice QCD calculations of scattering phaseshifts and resonance parameters in the two-body sector are becoming precision studies. Early calculations employed Lüscher's formula for extracting these quantities at lowest order. As the calculations become more ambitious, higher-order relations are required. In this study we present a way to validate the higher-order quantization conditions. This is an important step given the involved derivations of these formulae. We derive and validate quantization conditions up to $\ell=5$ partial waves in both cubic and elongated geometries, and for states zero and non-zero total momentum. For all 45 quantization conditions we considered (22 in cubic box, 23 in elongated box) we find perfect agreement. △ Less

Submitted 19 October, 2022; v1 submitted 9 July, 2021; originally announced July 2021.

Comments: 55 pages, 9 figures, 45 tables, 1 supplement. This update matches the published version in PRD

Journal ref: Phys.Rev.D 105 (2022) 5, 054517

arXiv:2107.03973 [pdf, other]

doi 10.1103/PhysRevLett.127.222001

Three-body dynamics of the $a_1(1260)$ resonance from lattice QCD

Authors: Maxim Mai, Andrei Alexandru, Ruairí Brett, Chris Culver, Michael Döring, Frank X. Lee, Daniel Sadasivan

Abstract: Resonant hadronic systems often exhibit a complicated decay pattern in which three-body dynamics play a relevant or even dominant role. In this work we focus on the $a_1(1260)$ resonance. For the first time, the pole position and branching ratios of a three-body resonance are calculated from lattice QCD using one-, two-, and three-meson interpolators and a three-body finite-volume formalism extend… ▽ More Resonant hadronic systems often exhibit a complicated decay pattern in which three-body dynamics play a relevant or even dominant role. In this work we focus on the $a_1(1260)$ resonance. For the first time, the pole position and branching ratios of a three-body resonance are calculated from lattice QCD using one-, two-, and three-meson interpolators and a three-body finite-volume formalism extended to spin and coupled channels. This marks a new milestone for ab-initio studies of ordinary resonances along with hybrid and exotic hadrons involving three-body dynamics. △ Less

Submitted 8 July, 2021; originally announced July 2021.

Comments: 14 pages, 7 figures

arXiv:2106.02557 [pdf, other]

doi 10.1103/PhysRevD.104.034506

Towards charged hadron polarizabilities from four-point functions in lattice QCD

Authors: Walter Wilcox, Frank X. Lee

Abstract: We show how to compute electromagnetic polarizabilities of charged hadrons using four-point functions in lattice QCD. The low-energy behavior of Compton scattering amplitude is matched to matrix elements of current-current correlation functions on the lattice. Working in momentum space, formulas for electric polarizability ($α_E$) and magnetic polarizability ($β_M$) are derived for both charged pi… ▽ More We show how to compute electromagnetic polarizabilities of charged hadrons using four-point functions in lattice QCD. The low-energy behavior of Compton scattering amplitude is matched to matrix elements of current-current correlation functions on the lattice. Working in momentum space, formulas for electric polarizability ($α_E$) and magnetic polarizability ($β_M$) are derived for both charged pion and proton. Lattice four-point correlation functions are constructed from quark and gluon fields to be used in Monte-Carlo simulations. The content of the functions is assessed in detail and specific prescriptions are given to isolate the polarizabilities. The connected quark-line diagrams can be done today as a small lattice project. The disconnected diagrams are more challenging but are within reach of dedicated resources for medium to large lattice projects. We also draw attention to the potential of four-point functions as a multi-purpose tool for hadron structure. △ Less

Submitted 30 December, 2021; v1 submitted 4 June, 2021; originally announced June 2021.

Comments: 13 pages, 3 figures. This version provides some corrections and clarifications to V2

Report number: BU-HEPP-21-01

Journal ref: Phys. Rev. D 104, 034506 (2021)

arXiv:2105.06906 [pdf, other]

doi 10.1103/PhysRevD.104.014510

Charged pion electric polarizability from lattice QCD

Authors: Hossein Niyazi, Andrei Alexandru, Frank X. Lee, Michael Lujan

Abstract: We present a calculation of the charged pion electric polarizability using the background field method. To extract the mass-shift induced by the electric field for the accelerated charged particle we fit the lattice QCD correlators using correlators derived from an effective model. The methodology outlined in this study (boundary conditions, fitting procedure, etc.) is designed to ensure that the… ▽ More We present a calculation of the charged pion electric polarizability using the background field method. To extract the mass-shift induced by the electric field for the accelerated charged particle we fit the lattice QCD correlators using correlators derived from an effective model. The methodology outlined in this study (boundary conditions, fitting procedure, etc.) is designed to ensure that the results are invariant under gauge transformations of the background field. We apply the method to four $N_f=2$ dynamical ensembles to extract $α_{π^\pm}$ at pion mass of $315$ MeV. △ Less

Submitted 14 May, 2021; originally announced May 2021.

Journal ref: Phys. Rev. D 104, 014510 (2021)

arXiv:2104.05154 [pdf, other]

Machine Learning Approach to Uncovering Residential Energy Consumption Patterns Based on Socioeconomic and Smart Meter Data

Authors: Wenjun Tang, Hao Wang, Xian-Long Lee, Hong-Tzer Yang

Abstract: The smart meter data analysis contributes to better planning and operations for the power system. This study aims to identify the drivers of residential energy consumption patterns from the socioeconomic perspective based on the consumption and demographic data using machine learning. We model consumption patterns by representative loads and reveal the relationship between load patterns and socioe… ▽ More The smart meter data analysis contributes to better planning and operations for the power system. This study aims to identify the drivers of residential energy consumption patterns from the socioeconomic perspective based on the consumption and demographic data using machine learning. We model consumption patterns by representative loads and reveal the relationship between load patterns and socioeconomic characteristics. Specifically, we analyze the real-world smart meter data and extract load patterns by clustering in a robust way. We further identify the influencing socioeconomic attributes on load patterns to improve our method's interpretability. The relationship between consumers' load patterns and selected socioeconomic features is characterized via machine learning models. The findings are as follows. (1) Twelve load clusters, consisting of six for weekdays and six for weekends, exhibit a diverse pattern of lifestyle and a difference between weekdays and weekends. (2) Among various socioeconomic features, age and education level are suggested to influence the load patterns. (3) Our proposed analytical model using feature selection and machine learning is proved to be more effective than XGBoost and conventional neural network model in map** the relationship between load patterns and socioeconomic features. △ Less

Submitted 31 October, 2021; v1 submitted 11 April, 2021; originally announced April 2021.

Journal ref: Energy 2021

arXiv:2101.06144 [pdf, other]

doi 10.1103/PhysRevD.104.014501

Three-body interactions from the finite-volume QCD spectrum

Authors: Ruairí Brett, Chris Culver, Maxim Mai, Andrei Alexandru, Michael Döring, Frank X. Lee

Abstract: We perform a fit of the finite-volume QCD spectrum of three pions at maximal isospin to constrain the three-body force. We use the unitarity-based relativistic three-particle quantization condition, with the GWUQCD spectrum obtained at 315 MeV and 220 MeV pion mass in two-flavor QCD. For the heavier pion mass we find that the data is consistent with a constant contact term close to zero, whereas f… ▽ More We perform a fit of the finite-volume QCD spectrum of three pions at maximal isospin to constrain the three-body force. We use the unitarity-based relativistic three-particle quantization condition, with the GWUQCD spectrum obtained at 315 MeV and 220 MeV pion mass in two-flavor QCD. For the heavier pion mass we find that the data is consistent with a constant contact term close to zero, whereas for the lighter mass we see a statistically significant energy dependence in tension with the prediction of leading order ChPT. Our results also suggest that with enough three-body energy levels, the two-body amplitude could be constrained. △ Less

Submitted 21 June, 2021; v1 submitted 15 January, 2021; originally announced January 2021.

Comments: 17 pages, 6 figures. Updated to match published version

Journal ref: Phys. Rev. D 104, 014501 (2021)

arXiv:2011.07114 [pdf, other]

Query-based Targeted Action-Space Adversarial Policies on Deep Reinforcement Learning Agents

Authors: Xian Yeow Lee, Yasaman Esfandiari, Kai Liang Tan, Soumik Sarkar

Abstract: Advances in computing resources have resulted in the increasing complexity of cyber-physical systems (CPS). As the complexity of CPS evolved, the focus has shifted from traditional control methods to deep reinforcement learning-based (DRL) methods for control of these systems. This is due to the difficulty of obtaining accurate models of complex CPS for traditional control. However, to securely de… ▽ More Advances in computing resources have resulted in the increasing complexity of cyber-physical systems (CPS). As the complexity of CPS evolved, the focus has shifted from traditional control methods to deep reinforcement learning-based (DRL) methods for control of these systems. This is due to the difficulty of obtaining accurate models of complex CPS for traditional control. However, to securely deploy DRL in production, it is essential to examine the weaknesses of DRL-based controllers (policies) towards malicious attacks from all angles. In this work, we investigate targeted attacks in the action-space domain, also commonly known as actuation attacks in CPS literature, which perturbs the outputs of a controller. We show that a query-based black-box attack model that generates optimal perturbations with respect to an adversarial goal can be formulated as another reinforcement learning problem. Thus, such an adversarial policy can be trained using conventional DRL methods. Experimental results showed that adversarial policies that only observe the nominal policy's output generate stronger attacks than adversarial policies that observe the nominal policy's input and output. Further analysis reveals that nominal policies whose outputs are frequently at the boundaries of the action space are naturally more robust towards adversarial policies. Lastly, we propose the use of adversarial training with transfer learning to induce robust behaviors into the nominal policy, which decreases the rate of successful targeted attacks by 50%. △ Less

Submitted 20 February, 2021; v1 submitted 13 November, 2020; originally announced November 2020.

arXiv:2010.12559 [pdf, other]

Capturing missing physics in climate model parameterizations using neural differential equations

Authors: Ali Ramadhan, John Marshall, Andre Souza, Xin Kai Lee, Ulyana Piterbarg, Adeline Hillier, Gregory LeClaire Wagner, Christopher Rackauckas, Chris Hill, Jean-Michel Campin, Raffaele Ferrari

Abstract: We explore how neural differential equations (NDEs) may be trained on highly resolved fluid-dynamical models of unresolved scales providing an ideal framework for data-driven parameterizations in climate models. NDEs overcome some of the limitations of traditional neural networks (NNs) in fluid dynamical applications in that they can readily incorporate conservation laws and boundary conditions an… ▽ More We explore how neural differential equations (NDEs) may be trained on highly resolved fluid-dynamical models of unresolved scales providing an ideal framework for data-driven parameterizations in climate models. NDEs overcome some of the limitations of traditional neural networks (NNs) in fluid dynamical applications in that they can readily incorporate conservation laws and boundary conditions and are stable when integrated over time. We advocate a method that employs a 'residual' approach, in which the NN is used to improve upon an existing parameterization through the representation of residual fluxes which are not captured by the base parameterization. This reduces the amount of training required and providing a method for capturing up-gradient and nonlocal fluxes. As an illustrative example, we consider the parameterization of free convection of the oceanic boundary layer triggered by buoyancy loss at the surface. We demonstrate that a simple parameterization of the process - convective adjustment - can be improved upon by training a NDE against highly resolved explicit models, to capture entrainment fluxes at the base of the well-mixed layer, fluxes that convective adjustment itself cannot represent. The augmented parameterization outperforms existing commonly used parameterizations such as the K-Profile Parameterization (KPP). We showcase that the NDE performs well independent of the time-stepper and that an online training approach using differentiable simulation via the Julia scientific machine learning software stack improves accuracy by an order-of-magnitude. We conclude that NDEs provide an exciting route forward to the development of representations of sub-grid-scale processes for climate science, opening up myriad new opportunities. △ Less

Submitted 6 March, 2023; v1 submitted 23 October, 2020; originally announced October 2020.

Comments: 47 pages, 10 figures, 2 tables, 7 appendices

arXiv:2009.12358 [pdf, other]

doi 10.1103/PhysRevD.102.114523

Finite-volume energy spectrum of the $K^-K^-K^-$ system

Authors: Andrei Alexandru, Ruairí Brett, Chris Culver, Michael Döring, Dehua Guo, Frank X. Lee, Maxim Mai

Abstract: The dynamics of multi-kaon systems are of relevance for several areas of nuclear physics. However, even the simplest systems, two and three kaons, are hard to prepare and study experimentally. Here we show how to extract this information using first-principle lattice QCD results. We (1) extend the relativistic three-body quantization condition to the strangeness sector, predicting for the first ti… ▽ More The dynamics of multi-kaon systems are of relevance for several areas of nuclear physics. However, even the simplest systems, two and three kaons, are hard to prepare and study experimentally. Here we show how to extract this information using first-principle lattice QCD results. We (1) extend the relativistic three-body quantization condition to the strangeness sector, predicting for the first time the excited level finite-volume spectrum of three kaon systems at maximal isospin, and (2) present a first lattice QCD calculation of the excited levels of this system in a finite box. We compare our predictions with the lattice results reported here and with previous ground state calculations and find very good agreement. △ Less

Submitted 2 October, 2020; v1 submitted 25 September, 2020; originally announced September 2020.

Comments: 9 pages, 3 figures; v2 -- typos corrected

Journal ref: Phys. Rev. D 102, 114523 (2020)

arXiv:2008.13022 [pdf, other]

doi 10.1103/PhysRevD.102.094506

Setting the scale for nHYP fermions with the Lüscher-Weisz gauge action

Authors: Hossein Niyazi, Andrei Alexandru, Frank X. Lee, Ruairí Brett

Abstract: Lattice QCD calculations using gauge smearing for fermion kernels are computationally efficient. Hypercubic blocking (nHYP smearing) has been shown to reduce scaling errors. In this work we use an improved action for $N_f=2$ QCD, based on the Lüscher-Weisz gauge action and clover-improved Wilson fermions with nHYP smeared gauge links. We perform a parameter scan in the region with lattice spacing… ▽ More Lattice QCD calculations using gauge smearing for fermion kernels are computationally efficient. Hypercubic blocking (nHYP smearing) has been shown to reduce scaling errors. In this work we use an improved action for $N_f=2$ QCD, based on the Lüscher-Weisz gauge action and clover-improved Wilson fermions with nHYP smeared gauge links. We perform a parameter scan in the region with lattice spacing between $0.066 \mathop{\hbox{fm}}$ and $0.115 \mathop{\hbox{fm}}$ and pion mass between $207 \mathop{\hbox{MeV}}$ and $834 \mathop{\hbox{MeV}}$. We determine the lattice spacing and pion mass as a function of the bare coupling parameters ($β$ and $κ$). The results are obtained from twenty-two ensembles on a $24^3\times 48$ lattice to percent level in statistical accuracy. The finite-volume effects for these ensemble are at the sub-percent level. From these measurements we produce easy-to-use parameterizations to help tune simulations with this action. The lattice spacing is fixed using a mass-independent procedure, by matching observables in the chiral limit. We also provide a parameterization for the chiral extrapolation which is universal and should hold for all discretizations of $N_f=2$ QCD. △ Less

Submitted 29 August, 2020; originally announced August 2020.

Journal ref: Phys. Rev. D 102, 094506 (2020)

arXiv:2007.07176 [pdf, other]

Robustifying Reinforcement Learning Agents via Action Space Adversarial Training

Authors: Kai Liang Tan, Yasaman Esfandiari, Xian Yeow Lee, Aakanksha, Soumik Sarkar

Abstract: Adoption of machine learning (ML)-enabled cyber-physical systems (CPS) are becoming prevalent in various sectors of modern society such as transportation, industrial, and power grids. Recent studies in deep reinforcement learning (DRL) have demonstrated its benefits in a large variety of data-driven decisions and control applications. As reliance on ML-enabled systems grows, it is imperative to st… ▽ More Adoption of machine learning (ML)-enabled cyber-physical systems (CPS) are becoming prevalent in various sectors of modern society such as transportation, industrial, and power grids. Recent studies in deep reinforcement learning (DRL) have demonstrated its benefits in a large variety of data-driven decisions and control applications. As reliance on ML-enabled systems grows, it is imperative to study the performance of these systems under malicious state and actuator attacks. Traditional control systems employ resilient/fault-tolerant controllers that counter these attacks by correcting the system via error observations. However, in some applications, a resilient controller may not be sufficient to avoid a catastrophic failure. Ideally, a robust approach is more useful in these scenarios where a system is inherently robust (by design) to adversarial attacks. While robust control has a long history of development, robust ML is an emerging research area that has already demonstrated its relevance and urgency. However, the majority of robust ML research has focused on perception tasks and not on decision and control tasks, although the ML (specifically RL) models used for control applications are equally vulnerable to adversarial attacks. In this paper, we show that a well-performing DRL agent that is initially susceptible to action space perturbations (e.g. actuator attacks) can be robustified against similar perturbations through adversarial training. △ Less

Submitted 14 July, 2020; originally announced July 2020.

Comments: Accepted for publication in American Control Conference 2020, 6 Pages

arXiv:2005.06883 [pdf, ps, other]

On mean and/or variance mixtures of normal distributions

Authors: Sharon X. Lee, Geoffrey J. McLachlan

Abstract: Parametric distributions are an important part of statistics. There is now a voluminous literature on different fascinating formulations of flexible distributions. We present a selective and brief overview of a small subset of these distributions, focusing on those that are obtained by scaling the mean and/or covariance matrix of the (multivariate) normal distribution with some scaling variable(s)… ▽ More Parametric distributions are an important part of statistics. There is now a voluminous literature on different fascinating formulations of flexible distributions. We present a selective and brief overview of a small subset of these distributions, focusing on those that are obtained by scaling the mean and/or covariance matrix of the (multivariate) normal distribution with some scaling variable(s). Namely, we consider the families of mean mixture, variance mixture, and mean-variance mixture of normal distributions. Its basic properties, some notable special/limiting cases, and parameter estimation methods are also described. △ Less

Submitted 14 May, 2020; originally announced May 2020.

Comments: 10 pages, 0 figures

arXiv:2005.06848 [pdf, ps, other]

Multi-Node EM Algorithm for Finite Mixture Models

Authors: Sharon X. Lee, Geoffrey J. McLachlan, Kaleb L. Leemaqz

Abstract: Finite mixture models are powerful tools for modelling and analyzing heterogeneous data. Parameter estimation is typically carried out using maximum likelihood estimation via the Expectation-Maximization (EM) algorithm. Recently, the adoption of flexible distributions as component densities has become increasingly popular. Often, the EM algorithm for these models involves complicated expressions t… ▽ More Finite mixture models are powerful tools for modelling and analyzing heterogeneous data. Parameter estimation is typically carried out using maximum likelihood estimation via the Expectation-Maximization (EM) algorithm. Recently, the adoption of flexible distributions as component densities has become increasingly popular. Often, the EM algorithm for these models involves complicated expressions that are time-consuming to evaluate numerically. In this paper, we describe a parallel implementation of the EM-algorithm suitable for both single-threaded and multi-threaded processors and for both single machine and multiple-node systems. Numerical experiments are performed to demonstrate the potential performance gain n different settings. Comparison is also made across two commonly used platforms - R and MATLAB. For illustration, a fairly general mixture model is used in the comparison. △ Less

Submitted 14 May, 2020; originally announced May 2020.

Comments: 12 Pages,1 figure

arXiv:1911.02635 [pdf, other]

doi 10.1103/PhysRevD.101.054511

Roper State from Overlap Fermions

Authors: Mingyang Sun, Ying Chen, Gen Wang, Andrei Alexandru, Shao-**g Dong, Terrence Draper, Jacob Fallica, Ming Gong, Frank X. Lee, Anyi Li, Jian Liang, Keh-Fei Liu, Nilmani Mathur, Yi-Bo Yang

Abstract: The Roper state is extracted with valence overlap fermions on a $2+1$-flavor domain-wall fermion lattice (spacing $a = 0.114$ fm and $m_π = 330$ MeV) using both the Sequential Empirical Bayes (SEB) method and the variational method. The results are consistent, provided that a large smearing-size interpolation operator is included in the variational calculation to have better overlap with the lowes… ▽ More The Roper state is extracted with valence overlap fermions on a $2+1$-flavor domain-wall fermion lattice (spacing $a = 0.114$ fm and $m_π = 330$ MeV) using both the Sequential Empirical Bayes (SEB) method and the variational method. The results are consistent, provided that a large smearing-size interpolation operator is included in the variational calculation to have better overlap with the lowest radial excitation. Similar calculations carried out for an anisotropic clover lattice with similar parameters find the Roper $\approx 280$ MeV higher than that of the overlap fermion. The fact that the prediction of the Roper state by overlap fermions is consistently lower than those of clover fermions, chirally improved fermions, and twisted-mass fermions over a wide range of pion masses has been dubbed a "Roper puzzle." To understand the origin of this difference, we study the hairpin $Z$-diagram in the isovector scalar meson ($a_0$) correlator in the quenched approximation. Comparing the $a_0$ correlators for clover and overlap fermions, at a pion mass of 290 MeV, we find that the spectral weight of the ghost state with clover fermions is smaller than that of the overlap at $a = 0.12$ fm and $0.09$ fm, whereas the whole $a_0$ correlators of clover and overlap at $a = 0.06$ fm coincide within errors. This suggests that chiral symmetry is restored for clover at $a \le 0.06$ fm and that the Roper should come down at and below this $a$. We conclude that this work supports a resolution of the "Roper puzzle" due to $Z$-graph type chiral dynamics. This entails coupling to higher components in the Fock space (e.g. $Nπ$, $Nππ$ states) to induce the effective flavor-spin interaction between quarks as prescribed in the chiral quark model, resulting in the parity-reversal pattern as observed in the experimental excited states of $N, Δ$ and $Λ$. △ Less

Submitted 30 March, 2020; v1 submitted 6 November, 2019; originally announced November 2019.

Comments: 15 pages, 16 figures, revised manuscript accepted for publication in Phys. Rev. D

Report number: INT-PUB-19-005

Journal ref: Phys. Rev. D 101, 054511 (2020)

arXiv:1909.02583 [pdf, other]

Spatiotemporally Constrained Action Space Attacks on Deep Reinforcement Learning Agents

Authors: Xian Yeow Lee, Sambit Ghadai, Kai Liang Tan, Chinmay Hegde, Soumik Sarkar

Abstract: Robustness of Deep Reinforcement Learning (DRL) algorithms towards adversarial attacks in real world applications such as those deployed in cyber-physical systems (CPS) are of increasing concern. Numerous studies have investigated the mechanisms of attacks on the RL agent's state space. Nonetheless, attacks on the RL agent's action space (AS) (corresponding to actuators in engineering systems) are… ▽ More Robustness of Deep Reinforcement Learning (DRL) algorithms towards adversarial attacks in real world applications such as those deployed in cyber-physical systems (CPS) are of increasing concern. Numerous studies have investigated the mechanisms of attacks on the RL agent's state space. Nonetheless, attacks on the RL agent's action space (AS) (corresponding to actuators in engineering systems) are equally perverse; such attacks are relatively less studied in the ML literature. In this work, we first frame the problem as an optimization problem of minimizing the cumulative reward of an RL agent with decoupled constraints as the budget of attack. We propose a white-box Myopic Action Space (MAS) attack algorithm that distributes the attacks across the action space dimensions. Next, we reformulate the optimization problem above with the same objective function, but with a temporally coupled constraint on the attack budget to take into account the approximated dynamics of the agent. This leads to the white-box Look-ahead Action Space (LAS) attack algorithm that distributes the attacks across the action and temporal dimensions. Our results shows that using the same amount of resources, the LAS attack deteriorates the agent's performance significantly more than the MAS attack. This reveals the possibility that with limited resource, an adversary can utilize the agent's dynamics to malevolently craft attacks that causes the agent to fail. Additionally, we leverage these attack strategies as a possible tool to gain insights on the potential vulnerabilities of DRL agents. △ Less

Submitted 18 November, 2019; v1 submitted 5 September, 2019; originally announced September 2019.

Comments: Version 2 with supplementary materials

arXiv:1908.01847 [pdf, other]

doi 10.1103/PhysRevD.100.114514

A cross-channel study of pion scattering from lattice QCD

Authors: Maxim Mai, Chris Culver, Andrei Alexandru, Michael Döring, Frank X. Lee

Abstract: We use a chiral model for pion interactions, in the inverse amplitude formalism, to perform a simultaneous analysis of lattice QCD results for pion-pion scattering in all three isospin channels. The input is the finite-volume two-pion spectrum computed using lattice QCD from six ensembles on lattices elongated in one of the spatial dimensions. A two-flavor dynamical lattice QCD action is used with… ▽ More We use a chiral model for pion interactions, in the inverse amplitude formalism, to perform a simultaneous analysis of lattice QCD results for pion-pion scattering in all three isospin channels. The input is the finite-volume two-pion spectrum computed using lattice QCD from six ensembles on lattices elongated in one of the spatial dimensions. A two-flavor dynamical lattice QCD action is used with two quark masses corresponding to a pion mass of 315 MeV and 224 MeV. The spectrum in the elastic region is subjected to a global fit which takes into account full correlations across isospin, pion mass and decay constant. The parameters from the fit are used to perform a chiral extrapolation to the physical point. The cross-channel fit results in a more precise determination of the parameters of the model when compared with single channel fits. We obtain $m_πa_0^{I=0}=0.2132(9)$, and $m_πa_0^{I=2}=0.0433(2)$ as well as $m_σ=443(3)-i221(6)$ MeV and $m_ρ=724(4)-i67(1)$ MeV. Several aspects of scale setting and consistency with previous analyses of lattice QCD results are discussed as well. △ Less

Submitted 5 August, 2019; originally announced August 2019.

Comments: 11 pages, 3 figures

Journal ref: Phys. Rev. D 100, 114514 (2019)

arXiv:1907.05695 [pdf]

Leveraging Socioeconomic Information and Deep Learning for Residential Load Pattern Prediction

Authors: Wen-Jun Tang, Xian-Long Lee, Hao Wang, Hong-Tzer Yang

Abstract: Advanced metering infrastructure systems record a high volume of residential load data, opening up an opportunity for utilities to understand consumer energy consumption behaviors. Existing studies have focused on load profiling and prediction, but neglected the role of socioeconomic characteristics of consumers in their energy consumption behaviors. In this paper, we develop a prediction model us… ▽ More Advanced metering infrastructure systems record a high volume of residential load data, opening up an opportunity for utilities to understand consumer energy consumption behaviors. Existing studies have focused on load profiling and prediction, but neglected the role of socioeconomic characteristics of consumers in their energy consumption behaviors. In this paper, we develop a prediction model using deep neural networks to predict load patterns of consumers based on their socioeconomic information. We analyze load patterns using the K-means clustering method and use an entropy-based feature selection method to select the key socioeconomic characteristics that affect consumers' load patterns. Our prediction method with feature selection achieves a higher prediction accuracy compared with the benchmark schemes, e.g. 80% reduction in the prediction error. △ Less

Submitted 2 July, 2019; originally announced July 2019.

arXiv:1907.00953 [pdf, other]

Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model

Authors: Alex X. Lee, Anusha Nagabandi, Pieter Abbeel, Sergey Levine

Abstract: Deep reinforcement learning (RL) algorithms can use high-capacity deep networks to learn directly from image observations. However, these high-dimensional observation spaces present a number of challenges in practice, since the policy must now solve two problems: representation learning and task learning. In this work, we tackle these two problems separately, by explicitly learning latent represen… ▽ More Deep reinforcement learning (RL) algorithms can use high-capacity deep networks to learn directly from image observations. However, these high-dimensional observation spaces present a number of challenges in practice, since the policy must now solve two problems: representation learning and task learning. In this work, we tackle these two problems separately, by explicitly learning latent representations that can accelerate reinforcement learning from images. We propose the stochastic latent actor-critic (SLAC) algorithm: a sample-efficient and high-performing RL algorithm for learning policies for complex continuous control tasks directly from high-dimensional image inputs. SLAC provides a novel and principled approach for unifying stochastic sequential models and RL into a single method, by learning a compact latent representation and then performing RL in the model's learned latent space. Our experimental evaluation demonstrates that our method outperforms both model-free and model-based alternatives in terms of final performance and sample efficiency, on a range of difficult image-based control tasks. Our code and videos of our results are available at our website. △ Less

Submitted 26 October, 2020; v1 submitted 1 July, 2019; originally announced July 2019.

Comments: Project website: https://alexlee-gk.github.io/slac/

Showing 1–50 of 192 results for author: Lee, X