Skip to main content

Showing 1–11 of 11 results for author: Futami, F

.
  1. arXiv:2406.06227  [pdf, other

    cs.LG stat.ML

    PAC-Bayes Analysis for Recalibration in Classification

    Authors: Masahiro Fujisawa, Futoshi Futami

    Abstract: Nonparametric estimation with binning is widely employed in the calibration error evaluation and the recalibration of machine learning models. Recently, theoretical analyses of the bias induced by this estimation approach have been actively pursued; however, the understanding of the generalization of the calibration error to unknown data remains limited. In addition, although many recalibration al… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 27 pages, 3 figures

  2. arXiv:2405.15709  [pdf, other

    cs.LG math.ST stat.ML

    Information-theoretic Generalization Analysis for Expected Calibration Error

    Authors: Futoshi Futami, Masahiro Fujisawa

    Abstract: While the expected calibration error (ECE), which employs binning, is widely adopted to evaluate the calibration performance of machine learning models, theoretical understanding of its estimation bias is limited. In this paper, we present the first comprehensive analysis of the estimation bias in the two common binning strategies, uniform mass and uniform width binning. Our analysis establishes u… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 34 pages, 3 figures

  3. arXiv:2311.01046  [pdf, ps, other

    cs.LG stat.ML

    Time-Independent Information-Theoretic Generalization Bounds for SGLD

    Authors: Futoshi Futami, Masahiro Fujisawa

    Abstract: We provide novel information-theoretic generalization bounds for stochastic gradient Langevin dynamics (SGLD) under the assumptions of smoothness and dissipativity, which are widely used in sampling and non-convex optimization studies. Our bounds are time-independent and decay to zero as the sample size increases, regardless of the number of iterations and whether the step size is fixed. Unlike pr… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted by the Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS2023), 29 pages

  4. arXiv:2307.12456  [pdf, other

    stat.ML cs.LG

    Information-theoretic Analysis of Test Data Sensitivity in Uncertainty

    Authors: Futoshi Futami, Tomoharu Iwata

    Abstract: Bayesian inference is often utilized for uncertainty quantification tasks. A recent analysis by Xu and Raginsky 2022 rigorously decomposed the predictive uncertainty in Bayesian inference into two uncertainties, called aleatoric and epistemic uncertainties, which represent the inherent randomness in the data-generating process and the variability due to insufficient data, respectively. They analyz… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

  5. arXiv:2206.01606  [pdf, ps, other

    stat.ML cs.LG

    Excess risk analysis for epistemic uncertainty with application to variational inference

    Authors: Futoshi Futami, Tomoharu Iwata, Naonori Ueda, Issei Sato, Masashi Sugiyama

    Abstract: Bayesian deep learning plays an important role especially for its ability evaluating epistemic uncertainty (EU). Due to computational complexity issues, approximation methods such as variational inference (VI) have been used in practice to obtain posterior distributions and their generalization abilities have been analyzed extensively, for example, by PAC-Bayesian theory; however, little analysis… ▽ More

    Submitted 11 October, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

  6. arXiv:2106.05010  [pdf, ps, other

    stat.ML cs.LG

    Loss function based second-order Jensen inequality and its application to particle variational inference

    Authors: Futoshi Futami, Tomoharu Iwata, Naonori Ueda, Issei Sato, Masashi Sugiyama

    Abstract: Bayesian model averaging, obtained as the expectation of a likelihood function by a posterior distribution, has been widely used for prediction, evaluation of uncertainty, and model selection. Various approaches have been developed to efficiently capture the information in the posterior distribution; one such approach is the optimization of a set of models simultaneously with interaction to ensure… ▽ More

    Submitted 9 June, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

  7. Experimental Demonstration of 4,294,967,296-QAM Based Y-00 Quantum Stream Cipher Carrying 160-Gb/s 16-QAM Signals

    Authors: Xi Chen, Ken Tanizawa, Peter Winzer, Po Dong, Junho Cho, Fumio Futami, Kentaro Kato, Argishti Melikyan, Kw Kim

    Abstract: We demonstrate a 4,294,967,296-ary quadrature amplitude modulation (QAM) based Y-00 quantum stream cipher system carrying 160-Gb/s 16-QAM signal transmitted over 320-km SSMF. The ultra-dense QAM cipher template is realized by an integrated two-segment silicon photonics I/Q modulator.

    Submitted 23 September, 2020; originally announced September 2020.

  8. arXiv:2003.04691  [pdf, other

    stat.ML cs.LG

    Time-varying Gaussian Process Bandit Optimization with Non-constant Evaluation Time

    Authors: Hideaki Imamura, Nontawat Charoenphakdee, Futoshi Futami, Issei Sato, Junya Honda, Masashi Sugiyama

    Abstract: The Gaussian process bandit is a problem in which we want to find a maximizer of a black-box function with the minimum number of function evaluations. If the black-box function varies with time, then time-varying Bayesian optimization is a promising framework. However, a drawback with current methods is in the assumption that the evaluation time for every observation is constant, which can be unre… ▽ More

    Submitted 10 March, 2020; v1 submitted 10 March, 2020; originally announced March 2020.

  9. arXiv:1805.07912  [pdf, ps, other

    stat.ML cs.LG

    Bayesian posterior approximation via greedy particle optimization

    Authors: Futoshi Futami, Zhenghang Cui, Issei Sato, Masashi Sugiyama

    Abstract: In Bayesian inference, the posterior distributions are difficult to obtain analytically for complex models such as neural networks. Variational inference usually uses a parametric distribution for approximation, from which we can easily draw samples. Recently discrete approximation by particles has attracted attention because of its high expression ability. An example is Stein variational gradient… ▽ More

    Submitted 31 January, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

  10. arXiv:1710.06595  [pdf, other

    stat.ML

    Variational Inference based on Robust Divergences

    Authors: Futoshi Futami, Issei Sato, Masashi Sugiyama

    Abstract: Robustness to outliers is a central issue in real-world machine learning applications. While replacing a model to a heavy-tailed one (e.g., from Gaussian to Student-t) is a standard approach for robustification, it can only be applied to simple models. In this paper, based on Zellner's optimization and variational formulation of Bayesian inference, we propose an outlier-robust pseudo-Bayesian vari… ▽ More

    Submitted 28 February, 2018; v1 submitted 18 October, 2017; originally announced October 2017.

  11. arXiv:1705.09046  [pdf, ps, other

    stat.ML

    Expectation Propagation for t-Exponential Family Using Q-Algebra

    Authors: Futoshi Futami, Issei Sato, Masashi Sugiyama

    Abstract: Exponential family distributions are highly useful in machine learning since their calculation can be performed efficiently through natural parameters. The exponential family has recently been extended to the t-exponential family, which contains Student-t distributions as family members and thus allows us to handle noisy data well. However, since the t-exponential family is denied by the deformed… ▽ More

    Submitted 28 May, 2017; v1 submitted 25 May, 2017; originally announced May 2017.