Skip to main content

Showing 1–19 of 19 results for author: Shirakawa, S

.
  1. arXiv:2407.07388  [pdf, other

    cs.NE

    Tail Bounds on the Runtime of Categorical Compact Genetic Algorithm

    Authors: Ryoki Hamano, Kento Uchida, Shinichi Shirakawa, Daiki Morinaga, Youhei Akimoto

    Abstract: The majority of theoretical analyses of evolutionary algorithms in the discrete domain focus on binary optimization algorithms, even though black-box optimization on the categorical domain has a lot of practical applications. In this paper, we consider a probabilistic model-based algorithm using the family of categorical distributions as its underlying distribution and set the sample size as two.… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2406.16506  [pdf, other

    cs.NE

    Natural Gradient Interpretation of Rank-One Update in CMA-ES

    Authors: Ryoki Hamano, Shinichi Shirakawa, Masahiro Nomura

    Abstract: The covariance matrix adaptation evolution strategy (CMA-ES) is a stochastic search algorithm using a multivariate normal distribution for continuous black-box optimization. In addition to strong empirical results, part of the CMA-ES can be described by a stochastic natural gradient method and can be derived from information geometric optimization (IGO) framework. However, there are some component… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: This paper has been accepted for presentation at PPSN2024

  3. arXiv:2405.11471  [pdf, other

    cs.NE

    CMA-ES with Adaptive Reevaluation for Multiplicative Noise

    Authors: Kento Uchida, Kenta Nishihara, Shinichi Shirakawa

    Abstract: The covariance matrix adaptation evolution strategy (CMA-ES) is a powerful optimization method for continuous black-box optimization problems. Several noise-handling methods have been proposed to bring out the optimization performance of the CMA-ES on noisy objective functions. The adaptations of the population size and the learning rate are two major approaches that perform well under additive Ga… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: This paper has been accepted as a full paper at GECCO2024

  4. arXiv:2405.10534  [pdf, other

    cs.NE

    CMA-ES for Safe Optimization

    Authors: Kento Uchida, Ryoki Hamano, Masahiro Nomura, Shota Saito, Shinichi Shirakawa

    Abstract: In several real-world applications in medical and control engineering, there are unsafe solutions whose evaluations involve inherent risk. This optimization setting is known as safe optimization and formulated as a specialized type of constrained optimization problem with constraints for safety functions. Safe optimization requires performing efficient optimization without evaluating unsafe soluti… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: This paper has been accepted as a full paper at GECCO2024

  5. CatCMA : Stochastic Optimization for Mixed-Category Problems

    Authors: Ryoki Hamano, Shota Saito, Masahiro Nomura, Kento Uchida, Shinichi Shirakawa

    Abstract: Black-box optimization problems often require simultaneously optimizing different types of variables, such as continuous, integer, and categorical variables. Unlike integer variables, categorical variables do not necessarily have a meaningful order, and the discretization approach of continuous variables does not work well. Although several Bayesian optimization methods can deal with mixed-categor… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: This paper has been accepted for presentation at GECCO2024

  6. arXiv:2305.00849  [pdf, other

    cs.NE

    (1+1)-CMA-ES with Margin for Discrete and Mixed-Integer Problems

    Authors: Yohei Watanabe, Kento Uchida, Ryoki Hamano, Shota Saito, Masahiro Nomura, Shinichi Shirakawa

    Abstract: The covariance matrix adaptation evolution strategy (CMA-ES) is an efficient continuous black-box optimization method. The CMA-ES possesses many attractive features, including invariance properties and a well-tuned default hyperparameter setting. Moreover, several components to specialize the CMA-ES have been proposed, such as noise handling and constraint handling. To utilize these advantages in… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

  7. arXiv:2303.18031  [pdf, other

    cs.CV cs.AI cs.LG

    Simple Domain Generalization Methods are Strong Baselines for Open Domain Generalization

    Authors: Masashi Noguchi, Shinichi Shirakawa

    Abstract: In real-world applications, a machine learning model is required to handle an open-set recognition (OSR), where unknown classes appear during the inference, in addition to a domain shift, where the distribution of data differs between the training and inference phases. Domain generalization (DG) aims to handle the domain shift situation where the target domain of the inference phase is inaccessibl… ▽ More

    Submitted 31 March, 2023; originally announced March 2023.

    Comments: 18 pages, 5 figures

  8. arXiv:2212.09260  [pdf, other

    cs.NE math.OC

    Marginal Probability-Based Integer Handling for CMA-ES Tackling Single-and Multi-Objective Mixed-Integer Black-Box Optimization

    Authors: Ryoki Hamano, Shota Saito, Masahiro Nomura, Shinichi Shirakawa

    Abstract: This study targets the mixed-integer black-box optimization (MI-BBO) problem where continuous and integer variables should be optimized simultaneously. The CMA-ES, our focus in this study, is a population-based stochastic search method that samples solution candidates from a multivariate Gaussian distribution (MGD), which shows excellent performance in continuous BBO. The parameters of MGD, mean a… ▽ More

    Submitted 11 January, 2024; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: Camera-ready version for ACM Transactions on Evolutionary Learning and Optimization (TELO). This paper is an extended version of the work presented in arXiv:2205.13482

  9. arXiv:2208.13968  [pdf, other

    cs.LG cs.DC

    Neural Architecture Search for Improving Latency-Accuracy Trade-off in Split Computing

    Authors: Shoma Shimizu, Takayuki Nishio, Shota Saito, Yoichi Hirose, Chen Yen-Hsiu, Shinichi Shirakawa

    Abstract: This paper proposes a neural architecture search (NAS) method for split computing. Split computing is an emerging machine-learning inference technique that addresses the privacy and latency challenges of deploying deep learning in IoT systems. In split computing, neural network models are separated and cooperatively processed using edge servers and IoT devices via networks. Thus, the architecture… ▽ More

    Submitted 29 August, 2022; originally announced August 2022.

  10. arXiv:2207.10334  [pdf, other

    cs.NE cs.LG stat.ML

    Efficient Search of Multiple Neural Architectures with Different Complexities via Importance Sampling

    Authors: Yuhei Noda, Shota Saito, Shinichi Shirakawa

    Abstract: Neural architecture search (NAS) aims to automate architecture design processes and improve the performance of deep neural networks. Platform-aware NAS methods consider both performance and complexity and can find well-performing architectures with low computational resources. Although ordinary NAS methods result in tremendous computational costs owing to the repetition of model training, one-shot… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

    Comments: Accepted as a conference paper at the 31st International Conference on Artificial Neural Networks (ICANN 2022). The final authenticated publication will be available in the Springer Lecture Notes in Computer Science (LNCS)

  11. CMA-ES with Margin: Lower-Bounding Marginal Probability for Mixed-Integer Black-Box Optimization

    Authors: Ryoki Hamano, Shota Saito, Masahiro Nomura, Shinichi Shirakawa

    Abstract: This study targets the mixed-integer black-box optimization (MI-BBO) problem where continuous and integer variables should be optimized simultaneously. The CMA-ES, our focus in this study, is a population-based stochastic search method that samples solution candidates from a multivariate Gaussian distribution (MGD), which shows excellent performance in continuous BBO. The parameters of MGD, mean a… ▽ More

    Submitted 12 January, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: Nominated for the best paper of GECCO'22 ENUM Track. We have corrected the error of Algorithm 1 in the Appendix. In addition, an extended version is published at arXiv:2212.09260 that describes support for the multi-objective MI-BBO

  12. A Two-phase Framework with a Bézier Simplex-based Interpolation Method for Computationally Expensive Multi-objective Optimization

    Authors: Ryoji Tanabe, Youhei Akimoto, Ken Kobayashi, Hiroshi Umeki, Shinichi Shirakawa, Naoki Hamada

    Abstract: This paper proposes a two-phase framework with a Bézier simplex-based interpolation method (TPB) for computationally expensive multi-objective optimization. The first phase in TPB aims to approximate a few Pareto optimal solutions by optimizing a sequence of single-objective scalar problems. The first phase in TPB can fully exploit a state-of-the-art single-objective derivative-free optimizer. The… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: This is an accepted version of a paper published in the proceedings of GECCO 2022

  13. arXiv:2110.10165  [pdf, other

    cs.LG cs.AI

    NAS-HPO-Bench-II: A Benchmark Dataset on Joint Optimization of Convolutional Neural Network Architecture and Training Hyperparameters

    Authors: Yoichi Hirose, Nozomu Yoshinari, Shinichi Shirakawa

    Abstract: The benchmark datasets for neural architecture search (NAS) have been developed to alleviate the computationally expensive evaluation process and ensure a fair comparison. Recent NAS benchmarks only focus on architecture optimization, although the training hyperparameters affect the obtained model performances. Building the benchmark dataset for joint optimization of architecture and training hype… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: 16 pages, 6 figures. Accepted at ACML2021 (long oral). API is available at https://github.com/yoichii/nashpobench2api

  14. Controlling Model Complexity in Probabilistic Model-Based Dynamic Optimization of Neural Network Structures

    Authors: Shota Saito, Shinichi Shirakawa

    Abstract: A method of simultaneously optimizing both the structure of neural networks and the connection weights in a single training loop can reduce the enormous computational cost of neural architecture search. We focus on the probabilistic model-based dynamic neural network structure optimization that considers the probability distribution of structure parameters and simultaneously optimizes both the dis… ▽ More

    Submitted 15 July, 2019; originally announced July 2019.

    Comments: Accepted as a conference paper at the 28th International Conference on Artificial Neural Networks (ICANN 2019). The final authenticated publication will be available in the Springer Lecture Notes in Computer Science (LNCS). 13 pages

  15. arXiv:1905.08537  [pdf, other

    cs.LG cs.NE stat.ML

    Adaptive Stochastic Natural Gradient Method for One-Shot Neural Architecture Search

    Authors: Youhei Akimoto, Shinichi Shirakawa, Nozomu Yoshinari, Kento Uchida, Shota Saito, Kouhei Nishida

    Abstract: High sensitivity of neural architecture search (NAS) methods against their input such as step-size (i.e., learning rate) and search space prevents practitioners from applying them out-of-the-box to their own problems, albeit its purpose is to automate a part of tuning process. Aiming at a fast, robust, and widely-applicable NAS, we develop a generic optimization framework for NAS. We turn a couple… ▽ More

    Submitted 21 May, 2019; originally announced May 2019.

    Comments: Accepted to ICML 2019. Code is available at https://github.com/shirakawas/ASNG-NAS

  16. arXiv:1809.06517  [pdf, other

    cs.LG math.OC stat.ML

    Parameterless Stochastic Natural Gradient Method for Discrete Optimization and its Application to Hyper-Parameter Optimization for Neural Network

    Authors: Kouhei Nishida, Hernan Aguirre, Shota Saito, Shinichi Shirakawa, Youhei Akimoto

    Abstract: Black box discrete optimization (BBDO) appears in wide range of engineering tasks. Evolutionary or other BBDO approaches have been applied, aiming at automating necessary tuning of system parameters, such as hyper parameter tuning of machine learning based systems when being installed for a specific task. However, automation is often jeopardized by the need of strategy parameter tuning for BBDO al… ▽ More

    Submitted 17 September, 2018; originally announced September 2018.

  17. arXiv:1805.12388  [pdf, ps, other

    cs.NE

    Sample Reuse via Importance Sampling in Information Geometric Optimization

    Authors: Shinichi Shirakawa, Youhei Akimoto, Kazuki Ouchi, Kouzou Ohara

    Abstract: In this paper we propose a technique to reduce the number of function evaluations, which is often the bottleneck of the black-box optimization, in the information geometric optimization (IGO) that is a generic framework of the probability model-based black-box optimization algorithms and generalizes several well-known evolutionary algorithms, such as the population-based incremental learning (PBIL… ▽ More

    Submitted 31 May, 2018; originally announced May 2018.

    Comments: 24 pages

  18. arXiv:1801.07650  [pdf, other

    cs.NE cs.LG stat.ML

    Dynamic Optimization of Neural Network Structures Using Probabilistic Modeling

    Authors: Shinichi Shirakawa, Yasushi Iwata, Youhei Akimoto

    Abstract: Deep neural networks (DNNs) are powerful machine learning models and have succeeded in various artificial intelligence tasks. Although various architectures and modules for the DNNs have been proposed, selecting and designing the appropriate network structure for a target problem is a challenging task. In this paper, we propose a method to simultaneously optimize the network structure and weight p… ▽ More

    Submitted 23 January, 2018; originally announced January 2018.

    Comments: To appear in the Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18), 9 pages

  19. arXiv:1704.00764  [pdf, other

    cs.NE

    A Genetic Programming Approach to Designing Convolutional Neural Network Architectures

    Authors: Masanori Suganuma, Shinichi Shirakawa, Tomoharu Nagao

    Abstract: The convolutional neural network (CNN), which is one of the deep learning models, has seen much success in a variety of computer vision tasks. However, designing CNN architectures still requires expert knowledge and a lot of trial and error. In this paper, we attempt to automatically construct CNN architectures for an image classification task based on Cartesian genetic programming (CGP). In our m… ▽ More

    Submitted 11 August, 2017; v1 submitted 3 April, 2017; originally announced April 2017.

    Comments: This is the revised version of the GECCO 2017 paper. The code of our method is available at https://github.com/sg-nm/cgp-cnn