Skip to main content

Showing 1–12 of 12 results for author: Hayakawa, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.12219  [pdf, other

    cs.LG math.NA stat.ML

    A Quadrature Approach for General-Purpose Batch Bayesian Optimization via Probabilistic Lifting

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Saad Hamid, Harald Oberhauser, Michael A. Osborne

    Abstract: Parallelisation in Bayesian optimisation is a common strategy but faces several challenges: the need for flexibility in acquisition functions and kernel choices, flexibility dealing with discrete and continuous variables simultaneously, model misspecification, and lastly fast massive parallelisation. To address these challenges, we introduce a versatile and modular framework for batch Bayesian opt… ▽ More

    Submitted 19 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: This work is the journal extension of the workshop paper (arXiv:2301.11832) and AISTATS paper (arXiv:2306.05843). 48 pages, 11 figures

    MSC Class: 62C10; 62F15

  2. arXiv:2310.14768  [pdf, other

    cs.LG cs.AI

    Policy Gradient with Kernel Quadrature

    Authors: Satoshi Hayakawa, Tetsuro Morimura

    Abstract: Reward evaluation of episodes becomes a bottleneck in a broad range of reinforcement learning tasks. Our aim in this paper is to select a small but representative subset of a large batch of episodes, only on which we actually compute rewards for more efficient policy gradient iterations. We build a Gaussian process modeling of discounted returns or rewards to derive a positive definite kernel on t… ▽ More

    Submitted 5 December, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 18 pages, 2 figures

  3. arXiv:2306.05843  [pdf, other

    cs.LG cs.AI math.NA stat.CO stat.ML

    Adaptive Batch Sizes for Active Learning A Probabilistic Numerics Approach

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Xingchen Wan, Vu Nguyen, Harald Oberhauser, Michael A. Osborne

    Abstract: Active learning parallelization is widely used, but typically relies on fixing the batch size throughout experimentation. This fixed approach is inefficient because of a dynamic trade-off between cost and speed -- larger batches are more costly, smaller batches lead to slower wall-clock run-times -- and the trade-off may change over the run (larger batches are often preferable earlier). To address… ▽ More

    Submitted 21 February, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted at AISTATS 2024. 33 pages, 6 figures

    MSC Class: 62C10; 62F15

  4. arXiv:2301.11936  [pdf, other

    quant-ph cs.LG stat.ML

    Quantum Ridgelet Transform: Winning Lottery Ticket of Neural Networks with Quantum Computation

    Authors: Hayata Yamasaki, Sathyawageeswar Subramanian, Satoshi Hayakawa, Sho Sonoda

    Abstract: A significant challenge in the field of quantum machine learning (QML) is to establish applications of quantum computation to accelerate common tasks in machine learning such as those for neural networks. Ridgelet transform has been a fundamental mathematical tool in the theoretical studies of neural networks, but the practical applicability of ridgelet transform to conducting learning tasks was l… ▽ More

    Submitted 11 September, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: 27 pages, 4 figures

    Journal ref: Proceedings of the 40th International Conference on Machine Learning (ICML2023) https://proceedings.mlr.press/v202/yamasaki23a.html

  5. arXiv:2301.11832  [pdf, other

    cs.LG math.NA stat.CO stat.ML

    SOBER: Highly Parallel Bayesian Optimization and Bayesian Quadrature over Discrete and Mixed Spaces

    Authors: Masaki Adachi, Satoshi Hayakawa, Saad Hamid, Martin Jørgensen, Harald Oberhauser, Micheal A. Osborne

    Abstract: Batch Bayesian optimisation and Bayesian quadrature have been shown to be sample-efficient methods of performing optimisation and quadrature where expensive-to-evaluate objective functions can be queried in parallel. However, current methods do not scale to large batch sizes -- a frequent desideratum in practice (e.g. drug discovery or simulation-based inference). We present a novel algorithm, SOB… ▽ More

    Submitted 5 July, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: 34 pages, 12 figures

    MSC Class: 62C10; 62F15

  6. arXiv:2301.09517  [pdf, other

    math.NA cs.LG stat.ML

    Sampling-based Nyström Approximation and Kernel Quadrature

    Authors: Satoshi Hayakawa, Harald Oberhauser, Terry Lyons

    Abstract: We analyze the Nyström approximation of a positive definite kernel associated with a probability measure. We first prove an improved error bound for the conventional Nyström approximation with i.i.d. sampling and singular-value decomposition in the continuous regime; the proof techniques are borrowed from statistical learning theory. We further introduce a refined selection of subspaces in Nyström… ▽ More

    Submitted 22 May, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: 22 pages, ICML 2023 camera-ready version. Typos fixed

  7. arXiv:2206.04734  [pdf, other

    cs.LG math.NA stat.CO stat.ML

    Fast Bayesian Inference with Batch Bayesian Quadrature via Kernel Recombination

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Harald Oberhauser, Michael A. Osborne

    Abstract: Calculation of Bayesian posteriors and model evidences typically requires numerical integration. Bayesian quadrature (BQ), a surrogate-model-based approach to numerical integration, is capable of superb sample efficiency, but its lack of parallelisation has hindered its practical applications. In this work, we propose a parallelised (batch) BQ method, employing techniques from kernel quadrature, t… ▽ More

    Submitted 27 January, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 38 pages, 6 figures

    MSC Class: 62C10; 62F15

    Journal ref: NeurIPS 35, 16533--16547 (2022)

  8. arXiv:2203.10585  [pdf, other

    cs.RO

    A Dual-Arm Robot that Manipulates Heavy Plates Cooperatively with a Vacuum Lifter

    Authors: Shogo Hayakawa, Weiwei Wan, Keisuke Koyama, Kensuke Harada

    Abstract: A vacuum lifter is widely used to hold and pick up large, heavy, and flat objects. Conventionally, when using a vacuum lifter, a human worker watches the state of a running vacuum lifter and adjusts the object's pose to maintain balance. In this work, we propose using a dual-arm robot to replace the human workers and develop planning and control methods for a dual-arm robot to raise a heavy plate… ▽ More

    Submitted 20 March, 2022; originally announced March 2022.

  9. arXiv:2107.09597  [pdf, other

    math.NA cs.LG stat.ML

    Positively Weighted Kernel Quadrature via Subsampling

    Authors: Satoshi Hayakawa, Harald Oberhauser, Terry Lyons

    Abstract: We study kernel quadrature rules with convex weights. Our approach combines the spectral properties of the kernel with recombination results about point measures. This results in effective algorithms that construct convex quadrature rules using only access to i.i.d. samples from the underlying measure and evaluation of the kernel and that result in a small worst-case error. In addition to our theo… ▽ More

    Submitted 11 October, 2022; v1 submitted 20 July, 2021; originally announced July 2021.

    Comments: 29 pages, NeurIPS 2022 camera-ready version

  10. arXiv:2101.09526  [pdf, other

    cs.RO

    A Dual-arm Robot that Autonomously Lifts Up and Tumbles Heavy Plates Using Crane Pulley Blocks

    Authors: Shogo Hayakawa, Weiwei Wan, Keisuke Koyama, Kensuke Harada

    Abstract: This paper develops a planner that plans the action sequences and motion for a dual-arm robot to lift up and flip heavy plates using crane pulley blocks. The problem is motivated by the low payload of modern collaborative robots. Instead of directly manipulating heavy plates that collaborative robots cannot afford, the paper develops a planner for collaborative robots to operate crane pulley block… ▽ More

    Submitted 23 January, 2021; originally announced January 2021.

  11. arXiv:1905.09195  [pdf, other

    stat.ML cs.LG math.ST

    On the minimax optimality and superiority of deep neural network learning over sparse parameter spaces

    Authors: Satoshi Hayakawa, Taiji Suzuki

    Abstract: Deep learning has been applied to various tasks in the field of machine learning and has shown superiority to other common procedures such as kernel methods. To provide a better theoretical understanding of the reasons for its success, we discuss the performance of deep learning and other methods on a nonparametric regression problem with a Gaussian noise. Whereas existing theoretical studies of d… ▽ More

    Submitted 20 September, 2019; v1 submitted 22 May, 2019; originally announced May 2019.

    Comments: 33 pages

    MSC Class: 62G08

    Journal ref: Neural Networks, 2020

  12. arXiv:1902.00651  [pdf, ps, other

    cs.SD eess.AS

    FurcaNet: An end-to-end deep gated convolutional, long short-term memory, deep neural networks for single channel speech separation

    Authors: Ziqiang Shi, Huibin Lin, Liu Liu, Rujie Liu, Shoji Hayakawa, Shouji Harada, Jiqing Han

    Abstract: Deep gated convolutional networks have been proved to be very effective in single channel speech separation. However current state-of-the-art framework often considers training the gated convolutional networks in time-frequency (TF) domain. Such an approach will result in limited perceptual score, such as signal-to-distortion ratio (SDR) upper bound of separated utterances and also fail to exploit… ▽ More

    Submitted 17 March, 2019; v1 submitted 2 February, 2019; originally announced February 2019.

    Comments: arXiv admin note: text overlap with arXiv:1902.00631