-
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees
Authors:
Toshinori Kitamura,
Tadashi Kozuno,
Masahiro Kato,
Yuki Ichihara,
Soichiro Nishimori,
Akiyoshi Sannai,
Sho Sonoda,
Wataru Kumagai,
Yutaka Matsuo
Abstract:
We study a primal-dual (PD) reinforcement learning (RL) algorithm for online constrained Markov decision processes (CMDPs). Despite its widespread practical use, the existing theoretical literature on PD-RL algorithms for this problem only provides sublinear regret guarantees and fails to ensure convergence to optimal policies. In this paper, we introduce a novel policy gradient PD algorithm with…
▽ More
We study a primal-dual (PD) reinforcement learning (RL) algorithm for online constrained Markov decision processes (CMDPs). Despite its widespread practical use, the existing theoretical literature on PD-RL algorithms for this problem only provides sublinear regret guarantees and fails to ensure convergence to optimal policies. In this paper, we introduce a novel policy gradient PD algorithm with uniform probably approximate correctness (Uniform-PAC) guarantees, simultaneously ensuring convergence to optimal policies, sublinear regret, and polynomial sample complexity for any target accuracy. Notably, this represents the first Uniform-PAC algorithm for the online CMDP problem. In addition to the theoretical guarantees, we empirically demonstrate in a simple CMDP that our algorithm converges to optimal policies, while baseline algorithms exhibit oscillatory performance and constraint violation.
△ Less
Submitted 1 July, 2024; v1 submitted 31 January, 2024;
originally announced January 2024.
-
Towards Autonomous Hypothesis Verification via Language Models with Minimal Guidance
Authors:
Shiro Takagi,
Ryutaro Yamauchi,
Wataru Kumagai
Abstract:
Research automation efforts usually employ AI as a tool to automate specific tasks within the research process. To create an AI that truly conduct research themselves, it must independently generate hypotheses, design verification plans, and execute verification. Therefore, we investigated if an AI itself could autonomously generate and verify hypothesis for a toy machine learning research problem…
▽ More
Research automation efforts usually employ AI as a tool to automate specific tasks within the research process. To create an AI that truly conduct research themselves, it must independently generate hypotheses, design verification plans, and execute verification. Therefore, we investigated if an AI itself could autonomously generate and verify hypothesis for a toy machine learning research problem. We prompted GPT-4 to generate hypotheses and Python code for hypothesis verification with limited methodological guidance. Our findings suggest that, in some instances, GPT-4 can autonomously generate and validate hypotheses without detailed guidance. While this is a promising result, we also found that none of the verifications were flawless, and there remain significant challenges in achieving autonomous, human-level research using only generic instructions. These findings underscore the need for continued exploration to develop a general and autonomous AI researcher.
△ Less
Submitted 16 November, 2023;
originally announced November 2023.
-
LPML: LLM-Prompting Markup Language for Mathematical Reasoning
Authors:
Ryutaro Yamauchi,
Sho Sonoda,
Akiyoshi Sannai,
Wataru Kumagai
Abstract:
In utilizing large language models (LLMs) for mathematical reasoning, addressing the errors in the reasoning and calculation present in the generated text by LLMs is a crucial challenge. In this paper, we propose a novel framework that integrates the Chain-of-Thought (CoT) method with an external tool (Python REPL). We discovered that by prompting LLMs to generate structured text in XML-like marku…
▽ More
In utilizing large language models (LLMs) for mathematical reasoning, addressing the errors in the reasoning and calculation present in the generated text by LLMs is a crucial challenge. In this paper, we propose a novel framework that integrates the Chain-of-Thought (CoT) method with an external tool (Python REPL). We discovered that by prompting LLMs to generate structured text in XML-like markup language, we could seamlessly integrate CoT and the external tool and control the undesired behaviors of LLMs. With our approach, LLMs can utilize Python computation to rectify errors within CoT. We applied our method to ChatGPT (GPT-3.5) to solve challenging mathematical problems and demonstrated that combining CoT and Python REPL through the markup language enhances the reasoning capability of LLMs. Our approach enables LLMs to write the markup language and perform advanced mathematical reasoning using only zero-shot prompting.
△ Less
Submitted 11 October, 2023; v1 submitted 20 September, 2023;
originally announced September 2023.
-
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
Authors:
Toshinori Kitamura,
Tadashi Kozuno,
Yunhao Tang,
Nino Vieillard,
Michal Valko,
Wenhao Yang,
**cheng Mei,
Pierre Ménard,
Mohammad Gheshlaghi Azar,
Rémi Munos,
Olivier Pietquin,
Matthieu Geist,
Csaba Szepesvári,
Wataru Kumagai,
Yutaka Matsuo
Abstract:
Mirror descent value iteration (MDVI), an abstraction of Kullback-Leibler (KL) and entropy-regularized reinforcement learning (RL), has served as the basis for recent high-performing practical RL algorithms. However, despite the use of function approximation in practice, the theoretical understanding of MDVI has been limited to tabular Markov decision processes (MDPs). We study MDVI with linear fu…
▽ More
Mirror descent value iteration (MDVI), an abstraction of Kullback-Leibler (KL) and entropy-regularized reinforcement learning (RL), has served as the basis for recent high-performing practical RL algorithms. However, despite the use of function approximation in practice, the theoretical understanding of MDVI has been limited to tabular Markov decision processes (MDPs). We study MDVI with linear function approximation through its sample complexity required to identify an $\varepsilon$-optimal policy with probability $1-δ$ under the settings of an infinite-horizon linear MDP, generative model, and G-optimal design. We demonstrate that least-squares regression weighted by the variance of an estimated optimal value function of the next state is crucial to achieving minimax optimality. Based on this observation, we present Variance-Weighted Least-Squares MDVI (VWLS-MDVI), the first theoretical algorithm that achieves nearly minimax optimal sample complexity for infinite-horizon linear MDPs. Furthermore, we propose a practical VWLS algorithm for value-based deep RL, Deep Variance Weighting (DVW). Our experiments demonstrate that DVW improves the performance of popular value-based deep RL algorithms on a set of MinAtar benchmarks.
△ Less
Submitted 22 May, 2023;
originally announced May 2023.
-
Langevin Autoencoders for Learning Deep Latent Variable Models
Authors:
Shohei Taniguchi,
Yusuke Iwasawa,
Wataru Kumagai,
Yutaka Matsuo
Abstract:
Markov chain Monte Carlo (MCMC), such as Langevin dynamics, is valid for approximating intractable distributions. However, its usage is limited in the context of deep latent variable models owing to costly datapoint-wise sampling iterations and slow convergence. This paper proposes the amortized Langevin dynamics (ALD), wherein datapoint-wise MCMC iterations are entirely replaced with updates of a…
▽ More
Markov chain Monte Carlo (MCMC), such as Langevin dynamics, is valid for approximating intractable distributions. However, its usage is limited in the context of deep latent variable models owing to costly datapoint-wise sampling iterations and slow convergence. This paper proposes the amortized Langevin dynamics (ALD), wherein datapoint-wise MCMC iterations are entirely replaced with updates of an encoder that maps observations into latent variables. This amortization enables efficient posterior sampling without datapoint-wise iterations. Despite its efficiency, we prove that ALD is valid as an MCMC algorithm, whose Markov chain has the target posterior as a stationary distribution under mild assumptions. Based on the ALD, we also present a new deep latent variable model named the Langevin autoencoder (LAE). Interestingly, the LAE can be implemented by slightly modifying the traditional autoencoder. Using multiple synthetic datasets, we first validate that ALD can properly obtain samples from target posteriors. We also evaluate the LAE on the image generation task, and show that our LAE can outperform existing methods based on variational inference, such as the variational autoencoder, and other MCMC-based methods in terms of the test likelihood.
△ Less
Submitted 11 October, 2022; v1 submitted 15 September, 2022;
originally announced September 2022.
-
Equivariant and Invariant Reynolds Networks
Authors:
Akiyoshi Sannai,
Makoto Kawano,
Wataru Kumagai
Abstract:
Invariant and equivariant networks are useful in learning data with symmetry, including images, sets, point clouds, and graphs. In this paper, we consider invariant and equivariant networks for symmetries of finite groups. Invariant and equivariant networks have been constructed by various researchers using Reynolds operators. However, Reynolds operators are computationally expensive when the orde…
▽ More
Invariant and equivariant networks are useful in learning data with symmetry, including images, sets, point clouds, and graphs. In this paper, we consider invariant and equivariant networks for symmetries of finite groups. Invariant and equivariant networks have been constructed by various researchers using Reynolds operators. However, Reynolds operators are computationally expensive when the order of the group is large because they use the sum over the whole group, which poses an implementation difficulty. To overcome this difficulty, we consider representing the Reynolds operator as a sum over a subset instead of a sum over the whole group. We call such a subset a Reynolds design, and an operator defined by a sum over a Reynolds design a reductive Reynolds operator. For example, in the case of a graph with $n$ nodes, the computational complexity of the reductive Reynolds operator is reduced to $O(n^2)$, while the computational complexity of the Reynolds operator is $O(n!)$. We construct learning models based on the reductive Reynolds operator called equivariant and invariant Reynolds networks (ReyNets) and prove that they have universal approximation property. Reynolds designs for equivariant ReyNets are derived from combinatorial observations with Young diagrams, while Reynolds designs for invariant ReyNets are derived from invariants called Reynolds dimensions defined on the set of invariant polynomials. Numerical experiments show that the performance of our models is comparable to state-of-the-art methods.
△ Less
Submitted 15 October, 2021;
originally announced October 2021.
-
Group Equivariant Conditional Neural Processes
Authors:
Makoto Kawano,
Wataru Kumagai,
Akiyoshi Sannai,
Yusuke Iwasawa,
Yutaka Matsuo
Abstract:
We present the group equivariant conditional neural process (EquivCNP), a meta-learning method with permutation invariance in a data set as in conventional conditional neural processes (CNPs), and it also has transformation equivariance in data space. Incorporating group equivariance, such as rotation and scaling equivariance, provides a way to consider the symmetry of real-world data. We give a d…
▽ More
We present the group equivariant conditional neural process (EquivCNP), a meta-learning method with permutation invariance in a data set as in conventional conditional neural processes (CNPs), and it also has transformation equivariance in data space. Incorporating group equivariance, such as rotation and scaling equivariance, provides a way to consider the symmetry of real-world data. We give a decomposition theorem for permutation-invariant and group-equivariant maps, which leads us to construct EquivCNPs with an infinite-dimensional latent space to handle group symmetries. In this paper, we build architecture using Lie group convolutional layers for practical implementation. We show that EquivCNP with translation equivariance achieves comparable performance to conventional CNPs in a 1D regression task. Moreover, we demonstrate that incorporating an appropriate Lie group equivariance, EquivCNP is capable of zero-shot generalization for an image-completion task by selecting an appropriate Lie group equivariance.
△ Less
Submitted 17 February, 2021;
originally announced February 2021.
-
Universal Approximation Theorem for Equivariant Maps by Group CNNs
Authors:
Wataru Kumagai,
Akiyoshi Sannai
Abstract:
Group symmetry is inherent in a wide variety of data distributions. Data processing that preserves symmetry is described as an equivariant map and often effective in achieving high performance. Convolutional neural networks (CNNs) have been known as models with equivariance and shown to approximate equivariant maps for some specific groups. However, universal approximation theorems for CNNs have b…
▽ More
Group symmetry is inherent in a wide variety of data distributions. Data processing that preserves symmetry is described as an equivariant map and often effective in achieving high performance. Convolutional neural networks (CNNs) have been known as models with equivariance and shown to approximate equivariant maps for some specific groups. However, universal approximation theorems for CNNs have been separately derived with individual techniques according to each group and setting. This paper provides a unified method to obtain universal approximation theorems for equivariant maps by CNNs in various settings. As its significant advantage, we can handle non-linear equivariant maps between infinite-dimensional spaces for non-compact groups.
△ Less
Submitted 27 December, 2020;
originally announced December 2020.
-
Variable Selection for Nonparametric Learning with Power Series Kernels
Authors:
Kota Matsui,
Wataru Kumagai,
Kenta Kanamori,
Mitsuaki Nishikimi,
Takafumi Kanamori
Abstract:
In this paper, we propose a variable selection method for general nonparametric kernel-based estimation. The proposed method consists of two-stage estimation: (1) construct a consistent estimator of the target function, (2) approximate the estimator using a few variables by l1-type penalized estimation. We see that the proposed method can be applied to various kernel nonparametric estimation such…
▽ More
In this paper, we propose a variable selection method for general nonparametric kernel-based estimation. The proposed method consists of two-stage estimation: (1) construct a consistent estimator of the target function, (2) approximate the estimator using a few variables by l1-type penalized estimation. We see that the proposed method can be applied to various kernel nonparametric estimation such as kernel ridge regression, kernel-based density and density-ratio estimation. We prove that the proposed method has the property of the variable selection consistency when the power series kernel is used. This result is regarded as an extension of the variable selection consistency for the non-negative garrote to the kernel-based estimators. Several experiments including simulation studies and real data applications show the effectiveness of the proposed method.
△ Less
Submitted 4 December, 2018; v1 submitted 1 June, 2018;
originally announced June 2018.
-
Regret Analysis for Continuous Dueling Bandit
Authors:
Wataru Kumagai
Abstract:
The dueling bandit is a learning framework wherein the feedback information in the learning process is restricted to a noisy comparison between a pair of actions. In this research, we address a dueling bandit problem based on a cost function over a continuous space. We propose a stochastic mirror descent algorithm and show that the algorithm achieves an $O(\sqrt{T\log T})$-regret bound under stron…
▽ More
The dueling bandit is a learning framework wherein the feedback information in the learning process is restricted to a noisy comparison between a pair of actions. In this research, we address a dueling bandit problem based on a cost function over a continuous space. We propose a stochastic mirror descent algorithm and show that the algorithm achieves an $O(\sqrt{T\log T})$-regret bound under strong convexity and smoothness assumptions for the cost function. Subsequently, we clarify the equivalence between regret minimization in dueling bandit and convex optimization for the cost function. Moreover, when considering a lower bound in convex optimization, our algorithm is shown to achieve the optimal convergence rate in convex optimization and the optimal regret in dueling bandit except for a logarithmic factor.
△ Less
Submitted 12 December, 2017; v1 submitted 21 November, 2017;
originally announced November 2017.
-
Learning Bound for Parameter Transfer Learning
Authors:
Wataru Kumagai
Abstract:
We consider a transfer-learning problem by using the parameter transfer approach, where a suitable parameter of feature map** is learned through one task and applied to another objective task. Then, we introduce the notion of the local stability and parameter transfer learnability of parametric feature map**,and thereby derive a learning bound for parameter transfer algorithms. As an applicati…
▽ More
We consider a transfer-learning problem by using the parameter transfer approach, where a suitable parameter of feature map** is learned through one task and applied to another objective task. Then, we introduce the notion of the local stability and parameter transfer learnability of parametric feature map**,and thereby derive a learning bound for parameter transfer algorithms. As an application of parameter transfer learning, we discuss the performance of sparse coding in self-taught learning. Although self-taught learning algorithms with plentiful unlabeled data often show excellent empirical performance, their theoretical analysis has not been studied. In this paper, we also provide the first theoretical learning bound for self-taught learning.
△ Less
Submitted 17 January, 2017; v1 submitted 27 October, 2016;
originally announced October 2016.
-
Asymptotic Compatibility between LOCC Conversion and Recovery
Authors:
Kosuke Ito,
Wataru Kumagai,
Masahito Hayashi
Abstract:
Recently, entanglement concentration was explicitly shown to be irreversible. However, it is still not clear what kind of states can be reversibly converted in the asymptotic setting by LOCC when neither the initial nor the target state is maximally entangled. We derive the necessary and sufficient condition for the reversibility of LOCC conversions between two bipartite pure entangled states in t…
▽ More
Recently, entanglement concentration was explicitly shown to be irreversible. However, it is still not clear what kind of states can be reversibly converted in the asymptotic setting by LOCC when neither the initial nor the target state is maximally entangled. We derive the necessary and sufficient condition for the reversibility of LOCC conversions between two bipartite pure entangled states in the asymptotic setting. In addition, we show that conversion can be achieved perfectly with only local unitary operation under such condition except for special cases. Interestingly, our result implies that an error-free reversible conversion is asymptotically possible even between states whose copies can never be locally unitarily equivalent with any finite numbers of copies, although such a conversion is impossible in the finite setting. In fact, we show such an example. Moreover, we establish how to overcome the irreversibility of LOCC conversion in two ways. As for the first method, we evaluate how many copies of the initial state is to be lost to overcome the irreversibility of LOCC conversion. The second method is to add a supplementary state appropriately, which also works for LU conversion unlike the first method. Especially, for the qubit system, any non-maximally pure entangled state can be a universal resource for the asymptotic reversibility when copies of the state is sufficiently many. More interestingly, our analysis implies that far-from-maximally entangled states can be better than nearly maximally entangled states as this type of resource. This fact brings new insight to the resource theory of state conversion.
△ Less
Submitted 21 August, 2015; v1 submitted 12 April, 2015;
originally announced April 2015.
-
Parallel Distributed Block Coordinate Descent Methods based on Pairwise Comparison Oracle
Authors:
Kota Matsui,
Wataru Kumagai,
Takafumi Kanamori
Abstract:
This paper provides a block coordinate descent algorithm to solve unconstrained optimization problems. In our algorithm, computation of function values or gradients is not required. Instead, pairwise comparison of function values is used. Our algorithm consists of two steps; one is the direction estimate step and the other is the search step. Both steps require only pairwise comparison of function…
▽ More
This paper provides a block coordinate descent algorithm to solve unconstrained optimization problems. In our algorithm, computation of function values or gradients is not required. Instead, pairwise comparison of function values is used. Our algorithm consists of two steps; one is the direction estimate step and the other is the search step. Both steps require only pairwise comparison of function values, which tells us only the order of function values over two points. In the direction estimate step, a Newton type search direction is estimated. A computation method like block coordinate descent methods is used with the pairwise comparison. In the search step, a numerical solution is updated along the estimated direction. The computation in the direction estimate step can be easily parallelized, and thus, the algorithm works efficiently to find the minimizer of the objective function. Also, we show an upper bound of the convergence rate. In numerical experiments, we show that our method efficiently finds the optimal solution compared to some existing methods based on the pairwise comparison.
△ Less
Submitted 13 September, 2014;
originally announced September 2014.
-
Random Number Conversion and LOCC Conversion via Restricted Storage
Authors:
Wataru Kumagai,
Masahito Hayashi
Abstract:
We consider random number conversion (RNC) through random number storage with restricted size. We clarify the relation between the performance of RNC and the size of storage in the framework of first- and second- order asymptotics, and derive their rate regions. Then, we show that the results for RNC with restricted storage recover those for conventional RNC without storage in the limit of storage…
▽ More
We consider random number conversion (RNC) through random number storage with restricted size. We clarify the relation between the performance of RNC and the size of storage in the framework of first- and second- order asymptotics, and derive their rate regions. Then, we show that the results for RNC with restricted storage recover those for conventional RNC without storage in the limit of storage size. To treat RNC via restricted storage, we introduce a new kind of probability distributions named generalized Rayleigh-normal distributions. Using the generalized Rayleigh-normal distributions, we can describe the second-order asymptotic behaviour of RNC via restricted storage in a unified manner. As an application to quantum information theory, we analyze LOCC conversion via entanglement storage with restricted size. Moreover, we derive the optimal LOCC compression rate under a constraint of conversion accuracy.
△ Less
Submitted 21 November, 2017; v1 submitted 15 January, 2014;
originally announced January 2014.
-
Second-Order Asymptotics of Conversions of Distributions and Entangled States Based on Rayleigh-Normal Probability Distributions
Authors:
Wataru Kumagai,
Masahito Hayashi
Abstract:
We discuss the asymptotic behavior of conversions between two independent and identical distributions up to the second-order conversion rate when the conversion is produced by a deterministic function from the input probability space to the output probability space. To derive the second-order conversion rate, we introduce new probability distributions named Rayleigh-normal distributions. The famil…
▽ More
We discuss the asymptotic behavior of conversions between two independent and identical distributions up to the second-order conversion rate when the conversion is produced by a deterministic function from the input probability space to the output probability space. To derive the second-order conversion rate, we introduce new probability distributions named Rayleigh-normal distributions. The family of Rayleigh-normal distributions includes a Rayleigh distribution and coincides with the standard normal distribution in the limit case. Using this family of probability distributions, we represent the asymptotic second-order rates for the distribution conversion. As an application, we also consider the asymptotic behavior of conversions between the multiple copies of two pure entangled states in quantum systems when only local operations and classical communications (LOCC) are allowed. This problem contains entanglement concentration, entanglement dilution and a kind of cloning problem with LOCC restriction as special cases.
△ Less
Submitted 21 November, 2017; v1 submitted 18 June, 2013;
originally announced June 2013.
-
Trade-off between Performance and Reversibility of Entanglement Concentration for Pure Entangled State
Authors:
Wataru Kumagai,
Masahito Hayashi
Abstract:
In quantum information theory, it is widely believed that entanglement concentration for bipartite pure states is asymptotically reversible. In order to examine this, we give a precise formulation of the problem, and show a trade-off relation between performance and reversibility, which implies the irreversibility of entanglement concentration. Then, we regard entanglement concentration as entangl…
▽ More
In quantum information theory, it is widely believed that entanglement concentration for bipartite pure states is asymptotically reversible. In order to examine this, we give a precise formulation of the problem, and show a trade-off relation between performance and reversibility, which implies the irreversibility of entanglement concentration. Then, we regard entanglement concentration as entangled state compression in an entanglement storage with lower dimension. Because of the irreversibility of entanglement concentration, an initial state can not be completely recovered after the compression process and a loss inevitably arises in the process. We numerically calculate this loss and also derive for it a highly accurate analytical approximation.
△ Less
Submitted 12 September, 2013; v1 submitted 27 May, 2013;
originally announced May 2013.
-
Second Order Asymptotics for Random Number Generation
Authors:
Wataru Kumagai,
Masahito Hayashi
Abstract:
We treat a random number generation from an i.i.d. probability distribution of $P$ to that of $Q$. When $Q$ or $P$ is a uniform distribution, the problems have been well-known as the uniform random number generation and the resolvability problem respectively, and analyzed not only in the context of the first order asymptotic theory but also that in the second asymptotic theory. On the other hand,…
▽ More
We treat a random number generation from an i.i.d. probability distribution of $P$ to that of $Q$. When $Q$ or $P$ is a uniform distribution, the problems have been well-known as the uniform random number generation and the resolvability problem respectively, and analyzed not only in the context of the first order asymptotic theory but also that in the second asymptotic theory. On the other hand, when both $P$ and $Q$ are not a uniform distribution, the second order asymptotics has not been treated. In this paper, we focus on the second order asymptotics of a random number generation for arbitrary probability distributions $P$ and $Q$ on a finite set. In particular, we derive the optimal second order generation rate under an arbitrary permissible confidence coefficient.
△ Less
Submitted 4 March, 2013;
originally announced March 2013.
-
Irreversibility of Entanglement Concentration for Pure State
Authors:
Wataru Kumagai,
Masahito Hayashi
Abstract:
For a pure state $ψ$ on a composite system $\mathcal{H}_A\otimes\mathcal{H}_B$, both the entanglement cost $E_C(ψ)$ and the distillable entanglement $E_D(ψ)$ coincide with the von Neumann entropy $H(\mathrm{Tr}_{B}ψ)$. Therefore, the entanglement concentration from the multiple state $ψ^{\otimes n}$ of a pure state $ψ$ to the multiple state $Φ^{\otimes L_n}$ of the EPR state $Φ$ seems to be able t…
▽ More
For a pure state $ψ$ on a composite system $\mathcal{H}_A\otimes\mathcal{H}_B$, both the entanglement cost $E_C(ψ)$ and the distillable entanglement $E_D(ψ)$ coincide with the von Neumann entropy $H(\mathrm{Tr}_{B}ψ)$. Therefore, the entanglement concentration from the multiple state $ψ^{\otimes n}$ of a pure state $ψ$ to the multiple state $Φ^{\otimes L_n}$ of the EPR state $Φ$ seems to be able to be reversibly performed with an asymptotically infinitesimal error when the rate ${L_n}/{n}$ goes to $H(\mathrm{Tr}_{B}ψ)$. In this paper, we show that it is impossible to reversibly perform the entanglement concentration for a multiple pure state even in asymptotic situation. In addition, in the case when we recover the multiple state $ψ^{\otimes M_n}$ after the concentration for $ψ^{\otimes n}$, we evaluate the asymptotic behavior of the loss number $n-M_n$ of $ψ$. This evaluation is thought to be closely related to the entanglement compression in distant parties.
△ Less
Submitted 19 May, 2012;
originally announced May 2012.
-
Quantum hypothesis testing for quantum Gaussian states: Quantum analogues of chi-square, t and F tests
Authors:
Wataru Kumagai,
Masahito Hayashi
Abstract:
We treat quantum counterparts of testing problems whose optimal tests are given by chi-square, t and F tests. These quantum counterparts are formulated as quantum hypothesis testing problems concerning quantum Gaussian states families, and contain disturbance parameters, which have group symmetry. Quantum Hunt-Stein Theorem removes a part of these disturbance parameters, but other types of difficu…
▽ More
We treat quantum counterparts of testing problems whose optimal tests are given by chi-square, t and F tests. These quantum counterparts are formulated as quantum hypothesis testing problems concerning quantum Gaussian states families, and contain disturbance parameters, which have group symmetry. Quantum Hunt-Stein Theorem removes a part of these disturbance parameters, but other types of difficulty still remain. In order to remove them, combining quantum Hunt-Stein theorem and other reduction methods, we establish a general reduction theorem that reduces a complicated quantum hypothesis testing problem to a fundamental quantum hypothesis testing problem. Using these methods, we derive quantum counterparts of chi-square, t and F tests as optimal tests in the respective settings.
△ Less
Submitted 28 October, 2011;
originally announced October 2011.