Search | arXiv e-print repository

A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees

Authors: Toshinori Kitamura, Tadashi Kozuno, Masahiro Kato, Yuki Ichihara, Soichiro Nishimori, Akiyoshi Sannai, Sho Sonoda, Wataru Kumagai, Yutaka Matsuo

Abstract: We study a primal-dual (PD) reinforcement learning (RL) algorithm for online constrained Markov decision processes (CMDPs). Despite its widespread practical use, the existing theoretical literature on PD-RL algorithms for this problem only provides sublinear regret guarantees and fails to ensure convergence to optimal policies. In this paper, we introduce a novel policy gradient PD algorithm with… ▽ More We study a primal-dual (PD) reinforcement learning (RL) algorithm for online constrained Markov decision processes (CMDPs). Despite its widespread practical use, the existing theoretical literature on PD-RL algorithms for this problem only provides sublinear regret guarantees and fails to ensure convergence to optimal policies. In this paper, we introduce a novel policy gradient PD algorithm with uniform probably approximate correctness (Uniform-PAC) guarantees, simultaneously ensuring convergence to optimal policies, sublinear regret, and polynomial sample complexity for any target accuracy. Notably, this represents the first Uniform-PAC algorithm for the online CMDP problem. In addition to the theoretical guarantees, we empirically demonstrate in a simple CMDP that our algorithm converges to optimal policies, while baseline algorithms exhibit oscillatory performance and constraint violation. △ Less

Submitted 1 July, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

arXiv:2311.09706 [pdf, other]

Towards Autonomous Hypothesis Verification via Language Models with Minimal Guidance

Authors: Shiro Takagi, Ryutaro Yamauchi, Wataru Kumagai

Abstract: Research automation efforts usually employ AI as a tool to automate specific tasks within the research process. To create an AI that truly conduct research themselves, it must independently generate hypotheses, design verification plans, and execute verification. Therefore, we investigated if an AI itself could autonomously generate and verify hypothesis for a toy machine learning research problem… ▽ More Research automation efforts usually employ AI as a tool to automate specific tasks within the research process. To create an AI that truly conduct research themselves, it must independently generate hypotheses, design verification plans, and execute verification. Therefore, we investigated if an AI itself could autonomously generate and verify hypothesis for a toy machine learning research problem. We prompted GPT-4 to generate hypotheses and Python code for hypothesis verification with limited methodological guidance. Our findings suggest that, in some instances, GPT-4 can autonomously generate and validate hypotheses without detailed guidance. While this is a promising result, we also found that none of the verifications were flawless, and there remain significant challenges in achieving autonomous, human-level research using only generic instructions. These findings underscore the need for continued exploration to develop a general and autonomous AI researcher. △ Less

Submitted 16 November, 2023; originally announced November 2023.

arXiv:2309.13078 [pdf, other]

LPML: LLM-Prompting Markup Language for Mathematical Reasoning

Authors: Ryutaro Yamauchi, Sho Sonoda, Akiyoshi Sannai, Wataru Kumagai

Abstract: In utilizing large language models (LLMs) for mathematical reasoning, addressing the errors in the reasoning and calculation present in the generated text by LLMs is a crucial challenge. In this paper, we propose a novel framework that integrates the Chain-of-Thought (CoT) method with an external tool (Python REPL). We discovered that by prompting LLMs to generate structured text in XML-like marku… ▽ More In utilizing large language models (LLMs) for mathematical reasoning, addressing the errors in the reasoning and calculation present in the generated text by LLMs is a crucial challenge. In this paper, we propose a novel framework that integrates the Chain-of-Thought (CoT) method with an external tool (Python REPL). We discovered that by prompting LLMs to generate structured text in XML-like markup language, we could seamlessly integrate CoT and the external tool and control the undesired behaviors of LLMs. With our approach, LLMs can utilize Python computation to rectify errors within CoT. We applied our method to ChatGPT (GPT-3.5) to solve challenging mathematical problems and demonstrated that combining CoT and Python REPL through the markup language enhances the reasoning capability of LLMs. Our approach enables LLMs to write the markup language and perform advanced mathematical reasoning using only zero-shot prompting. △ Less

Submitted 11 October, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

arXiv:2305.13185 [pdf, other]

Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice

Authors: Toshinori Kitamura, Tadashi Kozuno, Yunhao Tang, Nino Vieillard, Michal Valko, Wenhao Yang, **cheng Mei, Pierre Ménard, Mohammad Gheshlaghi Azar, Rémi Munos, Olivier Pietquin, Matthieu Geist, Csaba Szepesvári, Wataru Kumagai, Yutaka Matsuo

Abstract: Mirror descent value iteration (MDVI), an abstraction of Kullback-Leibler (KL) and entropy-regularized reinforcement learning (RL), has served as the basis for recent high-performing practical RL algorithms. However, despite the use of function approximation in practice, the theoretical understanding of MDVI has been limited to tabular Markov decision processes (MDPs). We study MDVI with linear fu… ▽ More Mirror descent value iteration (MDVI), an abstraction of Kullback-Leibler (KL) and entropy-regularized reinforcement learning (RL), has served as the basis for recent high-performing practical RL algorithms. However, despite the use of function approximation in practice, the theoretical understanding of MDVI has been limited to tabular Markov decision processes (MDPs). We study MDVI with linear function approximation through its sample complexity required to identify an $\varepsilon$-optimal policy with probability $1-δ$ under the settings of an infinite-horizon linear MDP, generative model, and G-optimal design. We demonstrate that least-squares regression weighted by the variance of an estimated optimal value function of the next state is crucial to achieving minimax optimality. Based on this observation, we present Variance-Weighted Least-Squares MDVI (VWLS-MDVI), the first theoretical algorithm that achieves nearly minimax optimal sample complexity for infinite-horizon linear MDPs. Furthermore, we propose a practical VWLS algorithm for value-based deep RL, Deep Variance Weighting (DVW). Our experiments demonstrate that DVW improves the performance of popular value-based deep RL algorithms on a set of MinAtar benchmarks. △ Less

Submitted 22 May, 2023; originally announced May 2023.

Comments: ICML 2023 accepted

arXiv:2209.07036 [pdf, other]

Langevin Autoencoders for Learning Deep Latent Variable Models

Authors: Shohei Taniguchi, Yusuke Iwasawa, Wataru Kumagai, Yutaka Matsuo

Abstract: Markov chain Monte Carlo (MCMC), such as Langevin dynamics, is valid for approximating intractable distributions. However, its usage is limited in the context of deep latent variable models owing to costly datapoint-wise sampling iterations and slow convergence. This paper proposes the amortized Langevin dynamics (ALD), wherein datapoint-wise MCMC iterations are entirely replaced with updates of a… ▽ More Markov chain Monte Carlo (MCMC), such as Langevin dynamics, is valid for approximating intractable distributions. However, its usage is limited in the context of deep latent variable models owing to costly datapoint-wise sampling iterations and slow convergence. This paper proposes the amortized Langevin dynamics (ALD), wherein datapoint-wise MCMC iterations are entirely replaced with updates of an encoder that maps observations into latent variables. This amortization enables efficient posterior sampling without datapoint-wise iterations. Despite its efficiency, we prove that ALD is valid as an MCMC algorithm, whose Markov chain has the target posterior as a stationary distribution under mild assumptions. Based on the ALD, we also present a new deep latent variable model named the Langevin autoencoder (LAE). Interestingly, the LAE can be implemented by slightly modifying the traditional autoencoder. Using multiple synthetic datasets, we first validate that ALD can properly obtain samples from target posteriors. We also evaluate the LAE on the image generation task, and show that our LAE can outperform existing methods based on variational inference, such as the variational autoencoder, and other MCMC-based methods in terms of the test likelihood. △ Less

Submitted 11 October, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

Comments: accepted at Neural Information Processing Systems (NeurIPS 2022)

arXiv:2110.08092 [pdf, other]

Equivariant and Invariant Reynolds Networks

Authors: Akiyoshi Sannai, Makoto Kawano, Wataru Kumagai

Abstract: Invariant and equivariant networks are useful in learning data with symmetry, including images, sets, point clouds, and graphs. In this paper, we consider invariant and equivariant networks for symmetries of finite groups. Invariant and equivariant networks have been constructed by various researchers using Reynolds operators. However, Reynolds operators are computationally expensive when the orde… ▽ More Invariant and equivariant networks are useful in learning data with symmetry, including images, sets, point clouds, and graphs. In this paper, we consider invariant and equivariant networks for symmetries of finite groups. Invariant and equivariant networks have been constructed by various researchers using Reynolds operators. However, Reynolds operators are computationally expensive when the order of the group is large because they use the sum over the whole group, which poses an implementation difficulty. To overcome this difficulty, we consider representing the Reynolds operator as a sum over a subset instead of a sum over the whole group. We call such a subset a Reynolds design, and an operator defined by a sum over a Reynolds design a reductive Reynolds operator. For example, in the case of a graph with $n$ nodes, the computational complexity of the reductive Reynolds operator is reduced to $O(n^2)$, while the computational complexity of the Reynolds operator is $O(n!)$. We construct learning models based on the reductive Reynolds operator called equivariant and invariant Reynolds networks (ReyNets) and prove that they have universal approximation property. Reynolds designs for equivariant ReyNets are derived from combinatorial observations with Young diagrams, while Reynolds designs for invariant ReyNets are derived from invariants called Reynolds dimensions defined on the set of invariant polynomials. Numerical experiments show that the performance of our models is comparable to state-of-the-art methods. △ Less

Submitted 15 October, 2021; originally announced October 2021.

Comments: 15 pages, 4 figures

arXiv:2102.08759 [pdf, other]

Group Equivariant Conditional Neural Processes

Authors: Makoto Kawano, Wataru Kumagai, Akiyoshi Sannai, Yusuke Iwasawa, Yutaka Matsuo

Abstract: We present the group equivariant conditional neural process (EquivCNP), a meta-learning method with permutation invariance in a data set as in conventional conditional neural processes (CNPs), and it also has transformation equivariance in data space. Incorporating group equivariance, such as rotation and scaling equivariance, provides a way to consider the symmetry of real-world data. We give a d… ▽ More We present the group equivariant conditional neural process (EquivCNP), a meta-learning method with permutation invariance in a data set as in conventional conditional neural processes (CNPs), and it also has transformation equivariance in data space. Incorporating group equivariance, such as rotation and scaling equivariance, provides a way to consider the symmetry of real-world data. We give a decomposition theorem for permutation-invariant and group-equivariant maps, which leads us to construct EquivCNPs with an infinite-dimensional latent space to handle group symmetries. In this paper, we build architecture using Lie group convolutional layers for practical implementation. We show that EquivCNP with translation equivariance achieves comparable performance to conventional CNPs in a 1D regression task. Moreover, we demonstrate that incorporating an appropriate Lie group equivariance, EquivCNP is capable of zero-shot generalization for an image-completion task by selecting an appropriate Lie group equivariance. △ Less

Submitted 17 February, 2021; originally announced February 2021.

arXiv:2012.13882 [pdf, ps, other]

Universal Approximation Theorem for Equivariant Maps by Group CNNs

Authors: Wataru Kumagai, Akiyoshi Sannai

Abstract: Group symmetry is inherent in a wide variety of data distributions. Data processing that preserves symmetry is described as an equivariant map and often effective in achieving high performance. Convolutional neural networks (CNNs) have been known as models with equivariance and shown to approximate equivariant maps for some specific groups. However, universal approximation theorems for CNNs have b… ▽ More Group symmetry is inherent in a wide variety of data distributions. Data processing that preserves symmetry is described as an equivariant map and often effective in achieving high performance. Convolutional neural networks (CNNs) have been known as models with equivariance and shown to approximate equivariant maps for some specific groups. However, universal approximation theorems for CNNs have been separately derived with individual techniques according to each group and setting. This paper provides a unified method to obtain universal approximation theorems for equivariant maps by CNNs in various settings. As its significant advantage, we can handle non-linear equivariant maps between infinite-dimensional spaces for non-compact groups. △ Less

Submitted 27 December, 2020; originally announced December 2020.

arXiv:1806.00569 [pdf, other]

Variable Selection for Nonparametric Learning with Power Series Kernels

Authors: Kota Matsui, Wataru Kumagai, Kenta Kanamori, Mitsuaki Nishikimi, Takafumi Kanamori

Abstract: In this paper, we propose a variable selection method for general nonparametric kernel-based estimation. The proposed method consists of two-stage estimation: (1) construct a consistent estimator of the target function, (2) approximate the estimator using a few variables by l1-type penalized estimation. We see that the proposed method can be applied to various kernel nonparametric estimation such… ▽ More In this paper, we propose a variable selection method for general nonparametric kernel-based estimation. The proposed method consists of two-stage estimation: (1) construct a consistent estimator of the target function, (2) approximate the estimator using a few variables by l1-type penalized estimation. We see that the proposed method can be applied to various kernel nonparametric estimation such as kernel ridge regression, kernel-based density and density-ratio estimation. We prove that the proposed method has the property of the variable selection consistency when the power series kernel is used. This result is regarded as an extension of the variable selection consistency for the non-negative garrote to the kernel-based estimators. Several experiments including simulation studies and real data applications show the effectiveness of the proposed method. △ Less

Submitted 4 December, 2018; v1 submitted 1 June, 2018; originally announced June 2018.

Comments: 24 pages, 3 tables, 2 figures

arXiv:1711.07693 [pdf, other]

Regret Analysis for Continuous Dueling Bandit

Authors: Wataru Kumagai

Abstract: The dueling bandit is a learning framework wherein the feedback information in the learning process is restricted to a noisy comparison between a pair of actions. In this research, we address a dueling bandit problem based on a cost function over a continuous space. We propose a stochastic mirror descent algorithm and show that the algorithm achieves an $O(\sqrt{T\log T})$-regret bound under stron… ▽ More The dueling bandit is a learning framework wherein the feedback information in the learning process is restricted to a noisy comparison between a pair of actions. In this research, we address a dueling bandit problem based on a cost function over a continuous space. We propose a stochastic mirror descent algorithm and show that the algorithm achieves an $O(\sqrt{T\log T})$-regret bound under strong convexity and smoothness assumptions for the cost function. Subsequently, we clarify the equivalence between regret minimization in dueling bandit and convex optimization for the cost function. Moreover, when considering a lower bound in convex optimization, our algorithm is shown to achieve the optimal convergence rate in convex optimization and the optimal regret in dueling bandit except for a logarithmic factor. △ Less

Submitted 12 December, 2017; v1 submitted 21 November, 2017; originally announced November 2017.

Comments: 14 pages. This paper was accepted at NIPS 2017 as a spotlight presentation

arXiv:1610.08696 [pdf, ps, other]

Learning Bound for Parameter Transfer Learning

Authors: Wataru Kumagai

Abstract: We consider a transfer-learning problem by using the parameter transfer approach, where a suitable parameter of feature map** is learned through one task and applied to another objective task. Then, we introduce the notion of the local stability and parameter transfer learnability of parametric feature map**,and thereby derive a learning bound for parameter transfer algorithms. As an applicati… ▽ More We consider a transfer-learning problem by using the parameter transfer approach, where a suitable parameter of feature map** is learned through one task and applied to another objective task. Then, we introduce the notion of the local stability and parameter transfer learnability of parametric feature map**,and thereby derive a learning bound for parameter transfer algorithms. As an application of parameter transfer learning, we discuss the performance of sparse coding in self-taught learning. Although self-taught learning algorithms with plentiful unlabeled data often show excellent empirical performance, their theoretical analysis has not been studied. In this paper, we also provide the first theoretical learning bound for self-taught learning. △ Less

Submitted 17 January, 2017; v1 submitted 27 October, 2016; originally announced October 2016.

Comments: This paper was accepted at NIPS 2016 as a poster presentation

arXiv:1504.02967 [pdf, ps, other]

doi 10.1103/PhysRevA.92.052308

Asymptotic Compatibility between LOCC Conversion and Recovery

Authors: Kosuke Ito, Wataru Kumagai, Masahito Hayashi

Abstract: Recently, entanglement concentration was explicitly shown to be irreversible. However, it is still not clear what kind of states can be reversibly converted in the asymptotic setting by LOCC when neither the initial nor the target state is maximally entangled. We derive the necessary and sufficient condition for the reversibility of LOCC conversions between two bipartite pure entangled states in t… ▽ More Recently, entanglement concentration was explicitly shown to be irreversible. However, it is still not clear what kind of states can be reversibly converted in the asymptotic setting by LOCC when neither the initial nor the target state is maximally entangled. We derive the necessary and sufficient condition for the reversibility of LOCC conversions between two bipartite pure entangled states in the asymptotic setting. In addition, we show that conversion can be achieved perfectly with only local unitary operation under such condition except for special cases. Interestingly, our result implies that an error-free reversible conversion is asymptotically possible even between states whose copies can never be locally unitarily equivalent with any finite numbers of copies, although such a conversion is impossible in the finite setting. In fact, we show such an example. Moreover, we establish how to overcome the irreversibility of LOCC conversion in two ways. As for the first method, we evaluate how many copies of the initial state is to be lost to overcome the irreversibility of LOCC conversion. The second method is to add a supplementary state appropriately, which also works for LU conversion unlike the first method. Especially, for the qubit system, any non-maximally pure entangled state can be a universal resource for the asymptotic reversibility when copies of the state is sufficiently many. More interestingly, our analysis implies that far-from-maximally entangled states can be better than nearly maximally entangled states as this type of resource. This fact brings new insight to the resource theory of state conversion. △ Less

Submitted 21 August, 2015; v1 submitted 12 April, 2015; originally announced April 2015.

Comments: 16 pages, 6 figures

Journal ref: Phys. Rev. A 92, 052308 (2015)

arXiv:1409.3912 [pdf, other]

Parallel Distributed Block Coordinate Descent Methods based on Pairwise Comparison Oracle

Authors: Kota Matsui, Wataru Kumagai, Takafumi Kanamori

Abstract: This paper provides a block coordinate descent algorithm to solve unconstrained optimization problems. In our algorithm, computation of function values or gradients is not required. Instead, pairwise comparison of function values is used. Our algorithm consists of two steps; one is the direction estimate step and the other is the search step. Both steps require only pairwise comparison of function… ▽ More This paper provides a block coordinate descent algorithm to solve unconstrained optimization problems. In our algorithm, computation of function values or gradients is not required. Instead, pairwise comparison of function values is used. Our algorithm consists of two steps; one is the direction estimate step and the other is the search step. Both steps require only pairwise comparison of function values, which tells us only the order of function values over two points. In the direction estimate step, a Newton type search direction is estimated. A computation method like block coordinate descent methods is used with the pairwise comparison. In the search step, a numerical solution is updated along the estimated direction. The computation in the direction estimate step can be easily parallelized, and thus, the algorithm works efficiently to find the minimizer of the objective function. Also, we show an upper bound of the convergence rate. In numerical experiments, we show that our method efficiently finds the optimal solution compared to some existing methods based on the pairwise comparison. △ Less

Submitted 13 September, 2014; originally announced September 2014.

arXiv:1401.3781 [pdf, ps, other]

Random Number Conversion and LOCC Conversion via Restricted Storage

Authors: Wataru Kumagai, Masahito Hayashi

Abstract: We consider random number conversion (RNC) through random number storage with restricted size. We clarify the relation between the performance of RNC and the size of storage in the framework of first- and second- order asymptotics, and derive their rate regions. Then, we show that the results for RNC with restricted storage recover those for conventional RNC without storage in the limit of storage… ▽ More We consider random number conversion (RNC) through random number storage with restricted size. We clarify the relation between the performance of RNC and the size of storage in the framework of first- and second- order asymptotics, and derive their rate regions. Then, we show that the results for RNC with restricted storage recover those for conventional RNC without storage in the limit of storage size. To treat RNC via restricted storage, we introduce a new kind of probability distributions named generalized Rayleigh-normal distributions. Using the generalized Rayleigh-normal distributions, we can describe the second-order asymptotic behaviour of RNC via restricted storage in a unified manner. As an application to quantum information theory, we analyze LOCC conversion via entanglement storage with restricted size. Moreover, we derive the optimal LOCC compression rate under a constraint of conversion accuracy. △ Less

Submitted 21 November, 2017; v1 submitted 15 January, 2014; originally announced January 2014.

Comments: 53 pages

arXiv:1306.4166 [pdf, ps, other]

Second-Order Asymptotics of Conversions of Distributions and Entangled States Based on Rayleigh-Normal Probability Distributions

Authors: Wataru Kumagai, Masahito Hayashi

Abstract: We discuss the asymptotic behavior of conversions between two independent and identical distributions up to the second-order conversion rate when the conversion is produced by a deterministic function from the input probability space to the output probability space. To derive the second-order conversion rate, we introduce new probability distributions named Rayleigh-normal distributions. The famil… ▽ More We discuss the asymptotic behavior of conversions between two independent and identical distributions up to the second-order conversion rate when the conversion is produced by a deterministic function from the input probability space to the output probability space. To derive the second-order conversion rate, we introduce new probability distributions named Rayleigh-normal distributions. The family of Rayleigh-normal distributions includes a Rayleigh distribution and coincides with the standard normal distribution in the limit case. Using this family of probability distributions, we represent the asymptotic second-order rates for the distribution conversion. As an application, we also consider the asymptotic behavior of conversions between the multiple copies of two pure entangled states in quantum systems when only local operations and classical communications (LOCC) are allowed. This problem contains entanglement concentration, entanglement dilution and a kind of cloning problem with LOCC restriction as special cases. △ Less

Submitted 21 November, 2017; v1 submitted 18 June, 2013; originally announced June 2013.

Comments: 49 pages

arXiv:1305.6250 [pdf, ps, other]

Trade-off between Performance and Reversibility of Entanglement Concentration for Pure Entangled State

Authors: Wataru Kumagai, Masahito Hayashi

Abstract: In quantum information theory, it is widely believed that entanglement concentration for bipartite pure states is asymptotically reversible. In order to examine this, we give a precise formulation of the problem, and show a trade-off relation between performance and reversibility, which implies the irreversibility of entanglement concentration. Then, we regard entanglement concentration as entangl… ▽ More In quantum information theory, it is widely believed that entanglement concentration for bipartite pure states is asymptotically reversible. In order to examine this, we give a precise formulation of the problem, and show a trade-off relation between performance and reversibility, which implies the irreversibility of entanglement concentration. Then, we regard entanglement concentration as entangled state compression in an entanglement storage with lower dimension. Because of the irreversibility of entanglement concentration, an initial state can not be completely recovered after the compression process and a loss inevitably arises in the process. We numerically calculate this loss and also derive for it a highly accurate analytical approximation. △ Less

Submitted 12 September, 2013; v1 submitted 27 May, 2013; originally announced May 2013.

Comments: 10 pages, 5 figures. Harrow & Lo's paper and Hayden & Winter's paper were added in references and a relation between those papers and the paper was clarified. The title was changed

Journal ref: Phys. Rev. Lett. 111, 130407 (2013)

arXiv:1303.0669 [pdf, ps, other]

Second Order Asymptotics for Random Number Generation

Authors: Wataru Kumagai, Masahito Hayashi

Abstract: We treat a random number generation from an i.i.d. probability distribution of $P$ to that of $Q$. When $Q$ or $P$ is a uniform distribution, the problems have been well-known as the uniform random number generation and the resolvability problem respectively, and analyzed not only in the context of the first order asymptotic theory but also that in the second asymptotic theory. On the other hand,… ▽ More We treat a random number generation from an i.i.d. probability distribution of $P$ to that of $Q$. When $Q$ or $P$ is a uniform distribution, the problems have been well-known as the uniform random number generation and the resolvability problem respectively, and analyzed not only in the context of the first order asymptotic theory but also that in the second asymptotic theory. On the other hand, when both $P$ and $Q$ are not a uniform distribution, the second order asymptotics has not been treated. In this paper, we focus on the second order asymptotics of a random number generation for arbitrary probability distributions $P$ and $Q$ on a finite set. In particular, we derive the optimal second order generation rate under an arbitrary permissible confidence coefficient. △ Less

Submitted 4 March, 2013; originally announced March 2013.

Comments: 6 pages, 3 figures

MSC Class: 94A15

arXiv:1205.4370 [pdf, ps, other]

Irreversibility of Entanglement Concentration for Pure State

Authors: Wataru Kumagai, Masahito Hayashi

Abstract: For a pure state $ψ$ on a composite system $\mathcal{H}_A\otimes\mathcal{H}_B$, both the entanglement cost $E_C(ψ)$ and the distillable entanglement $E_D(ψ)$ coincide with the von Neumann entropy $H(\mathrm{Tr}_{B}ψ)$. Therefore, the entanglement concentration from the multiple state $ψ^{\otimes n}$ of a pure state $ψ$ to the multiple state $Φ^{\otimes L_n}$ of the EPR state $Φ$ seems to be able t… ▽ More For a pure state $ψ$ on a composite system $\mathcal{H}_A\otimes\mathcal{H}_B$, both the entanglement cost $E_C(ψ)$ and the distillable entanglement $E_D(ψ)$ coincide with the von Neumann entropy $H(\mathrm{Tr}_{B}ψ)$. Therefore, the entanglement concentration from the multiple state $ψ^{\otimes n}$ of a pure state $ψ$ to the multiple state $Φ^{\otimes L_n}$ of the EPR state $Φ$ seems to be able to be reversibly performed with an asymptotically infinitesimal error when the rate ${L_n}/{n}$ goes to $H(\mathrm{Tr}_{B}ψ)$. In this paper, we show that it is impossible to reversibly perform the entanglement concentration for a multiple pure state even in asymptotic situation. In addition, in the case when we recover the multiple state $ψ^{\otimes M_n}$ after the concentration for $ψ^{\otimes n}$, we evaluate the asymptotic behavior of the loss number $n-M_n$ of $ψ$. This evaluation is thought to be closely related to the entanglement compression in distant parties. △ Less

Submitted 19 May, 2012; originally announced May 2012.

Comments: 6 pages, 1 figure

arXiv:1110.6255 [pdf, ps, other]

doi 10.1007/s00220-013-1678-1

Quantum hypothesis testing for quantum Gaussian states: Quantum analogues of chi-square, t and F tests

Authors: Wataru Kumagai, Masahito Hayashi

Abstract: We treat quantum counterparts of testing problems whose optimal tests are given by chi-square, t and F tests. These quantum counterparts are formulated as quantum hypothesis testing problems concerning quantum Gaussian states families, and contain disturbance parameters, which have group symmetry. Quantum Hunt-Stein Theorem removes a part of these disturbance parameters, but other types of difficu… ▽ More We treat quantum counterparts of testing problems whose optimal tests are given by chi-square, t and F tests. These quantum counterparts are formulated as quantum hypothesis testing problems concerning quantum Gaussian states families, and contain disturbance parameters, which have group symmetry. Quantum Hunt-Stein Theorem removes a part of these disturbance parameters, but other types of difficulty still remain. In order to remove them, combining quantum Hunt-Stein theorem and other reduction methods, we establish a general reduction theorem that reduces a complicated quantum hypothesis testing problem to a fundamental quantum hypothesis testing problem. Using these methods, we derive quantum counterparts of chi-square, t and F tests as optimal tests in the respective settings. △ Less

Submitted 28 October, 2011; originally announced October 2011.

Comments: 34 pages, 3 figures

Journal ref: Communications in Mathematical Physics, 318(2), 535-574, 2013

Showing 1–19 of 19 results for author: Kumagai, W