Skip to main content

Showing 1–27 of 27 results for author: Karasuyama, M

.
  1. arXiv:2402.06932  [pdf, other

    cs.LG

    Learning Attributed Graphlets: Predictive Graph Mining by Graphlets with Trainable Attribute

    Authors: Tajima Shinji, Ren Sugihara, Ryota Kitahara, Masayuki Karasuyama

    Abstract: The graph classification problem has been widely studied; however, achieving an interpretable model with high predictive performance remains a challenging issue. This paper proposes an interpretable classification algorithm for attributed graph data, called LAGRA (Learning Attributed GRAphlets). LAGRA learns importance weights for small attributed subgraphs, called attributed graphlets (AGs), whil… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

  2. arXiv:2311.13460  [pdf, other

    cs.LG stat.ML

    Multi-Objective Bayesian Optimization with Active Preference Learning

    Authors: Ryota Ozaki, Kazuki Ishikawa, Youhei Kanzaki, Shinya Suzuki, Shion Takeno, Ichiro Takeuchi, Masayuki Karasuyama

    Abstract: There are a lot of real-world black-box optimization problems that need to optimize multiple criteria simultaneously. However, in a multi-objective optimization (MOO) problem, identifying the whole Pareto front requires the prohibitive search cost, while in many practical scenarios, the decision maker (DM) only needs a specific solution among the set of the Pareto optimal solutions. We propose a B… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  3. arXiv:2311.03760  [pdf, other

    cs.LG stat.ML

    Posterior Sampling-Based Bayesian Optimization with Tighter Bayesian Regret Bounds

    Authors: Shion Takeno, Yu Inatsu, Masayuki Karasuyama, Ichiro Takeuchi

    Abstract: Among various acquisition functions (AFs) in Bayesian optimization (BO), Gaussian process upper confidence bound (GP-UCB) and Thompson sampling (TS) are well-known options with established theoretical properties regarding Bayesian cumulative regret (BCR). Recently, it has been shown that a randomized variant of GP-UCB achieves a tighter BCR bound compared with GP-UCB, which we call the tighter BCR… ▽ More

    Submitted 4 June, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: 28 pages, 3 figures, 2 tables, Accepted to ICML2024

  4. arXiv:2302.01513  [pdf, other

    cs.LG

    Towards Practical Preferential Bayesian Optimization with Skew Gaussian Processes

    Authors: Shion Takeno, Masahiro Nomura, Masayuki Karasuyama

    Abstract: We study preferential Bayesian optimization (BO) where reliable feedback is limited to pairwise comparison called duels. An important challenge in preferential BO, which uses the preferential Gaussian process (GP) model to represent flexible preference structure, is that the posterior distribution is a computationally intractable skew GP. The most widely used approach for preferential BO is Gaussi… ▽ More

    Submitted 11 June, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: 25 pages, 9 figures, Accepted to ICML2023

  5. arXiv:2302.01511  [pdf, other

    cs.LG

    Randomized Gaussian Process Upper Confidence Bound with Tighter Bayesian Regret Bounds

    Authors: Shion Takeno, Yu Inatsu, Masayuki Karasuyama

    Abstract: Gaussian process upper confidence bound (GP-UCB) is a theoretically promising approach for black-box optimization; however, the confidence parameter $β$ is considerably large in the theorem and chosen heuristically in practice. Then, randomized GP-UCB (RGP-UCB) uses a randomized confidence parameter, which follows the Gamma distribution, to mitigate the impact of manually specifying $β$. This stud… ▽ More

    Submitted 11 June, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: 33 pages, 3 figures, Accepted to ICML2023

  6. arXiv:2201.13112  [pdf, other

    stat.ML cs.LG

    Bayesian Optimization for Distributionally Robust Chance-constrained Problem

    Authors: Yu Inatsu, Shion Takeno, Masayuki Karasuyama, Ichiro Takeuchi

    Abstract: In black-box function optimization, we need to consider not only controllable design variables but also uncontrollable stochastic environment variables. In such cases, it is necessary to solve the optimization problem by taking into account the uncertainty of the environmental variables. Chance-constrained (CC) problem, the problem of maximizing the expected value under a certain level of constrai… ▽ More

    Submitted 2 February, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: 18 pages, 2 figures

  7. arXiv:2111.08330  [pdf, ps, other

    stat.ML cs.LG math.OC

    Bayesian Optimization for Cascade-type Multi-stage Processes

    Authors: Shunya Kusakawa, Shion Takeno, Yu Inatsu, Kentaro Kutsukake, Shogo Iwazaki, Takashi Nakano, Toru Ujihara, Masayuki Karasuyama, Ichiro Takeuchi

    Abstract: Complex processes in science and engineering are often formulated as multistage decision-making problems. In this paper, we consider a type of multistage decision-making process called a cascade process. A cascade process is a multistage process in which the output of one stage is used as an input for the subsequent stage. When the cost of each stage is expensive, it is difficult to search for the… ▽ More

    Submitted 7 March, 2023; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: 70pages, 7 figures

    Journal ref: Neural Computation (2022) 34 (12): 2408-2431

  8. arXiv:2102.09788  [pdf, other

    cs.LG

    Sequential- and Parallel- Constrained Max-value Entropy Search via Information Lower Bound

    Authors: Shion Takeno, Tomoyuki Tamura, Kazuki Shitara, Masayuki Karasuyama

    Abstract: Max-value entropy search (MES) is one of the state-of-the-art approaches in Bayesian optimization (BO). In this paper, we propose a novel variant of MES for constrained problems, called Constrained MES via Information lower BOund (CMES-IBO), that is based on a Monte Carlo (MC) estimator of a lower bound of a mutual information (MI). Unlike existing studies, our MI is defined so that uncertainty wi… ▽ More

    Submitted 2 February, 2022; v1 submitted 19 February, 2021; originally announced February 2021.

    Comments: 39pages, 8 figures

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:20960-20986, 2022

  9. arXiv:2003.13428  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.comp-ph stat.ML

    Cost-effective search for lower-error region in material parameter space using multifidelity Gaussian process modeling

    Authors: Shion Takeno, Yuhki Tsukada, Hitoshi Fukuoka, Toshiyuki Koyama, Motoki Shiga, Masayuki Karasuyama

    Abstract: Information regarding precipitate shapes is critical for estimating material parameters. Hence, we considered estimating a region of material parameter space in which a computational model produces precipitates having shapes similar to those observed in the experimental images. This region, called the lower-error region (LER), reflects intrinsic information of the material contained in the precipi… ▽ More

    Submitted 15 March, 2020; originally announced March 2020.

    Comments: 23 pages, 6 figures

    Journal ref: Phys. Rev. Materials 4, 083802 (2020)

  10. arXiv:2003.09036  [pdf, other

    physics.comp-ph cond-mat.mtrl-sci

    Computational Design of Stable and Highly Ion-conductive Materials using Multi-objective Bayesian Optimization: Case Studies on Diffusion of Oxygen and Lithium

    Authors: Masayuki Karasuyama, Hiroki Kasugai, Tomoyuki Tamura, Kazuki Shitara

    Abstract: Ion-conducting solid electrolytes are widely used for a variety of purposes. Therefore, designing highly ion-conductive materials is in strongly demand. Because of advancement in computers and enhancement of computational codes, theoretical simulations have become effective tools for investigating the performance of ion-conductive materials. However, an exhaustive search conducted by theoretical c… ▽ More

    Submitted 19 March, 2020; originally announced March 2020.

  11. Distance Metric Learning for Graph Structured Data

    Authors: Tomoki Yoshida, Ichiro Takeuchi, Masayuki Karasuyama

    Abstract: Graphs are versatile tools for representing structured data. As a result, a variety of machine learning methods have been studied for graph data analysis. Although many such learning methods depend on the measurement of differences between input graphs, defining an appropriate distance metric for graphs remains a controversial issue. Hence, we propose a supervised distance metric learning method f… ▽ More

    Submitted 17 June, 2021; v1 submitted 3 February, 2020; originally announced February 2020.

    Comments: 38 pages, 11 figures. This is a pre-print of an article published in Machine Learning Journal. The final authenticated version is available online at: https://doi.org/10.1007/s10994-021-06009-3

  12. arXiv:1912.04596  [pdf, other

    cond-mat.mtrl-sci physics.comp-ph

    Active-learning-based efficient prediction of ab-initio atomic energy: a case study on a Fe random grain boundary model with millions of atoms

    Authors: Tomoyuki Tamura, Masayuki Karasuyama

    Abstract: We have developed a method that can analyze large random grain boundary (GB) models with the accuracy of density functional theory (DFT) calculations using active learning. It is assumed that the atomic energy is represented by the linear regression of the atomic structural descriptor. The atomic energy is obtained through DFT calculations using a small cell extracted from a huge GB model, called… ▽ More

    Submitted 17 April, 2020; v1 submitted 10 December, 2019; originally announced December 2019.

    Journal ref: Phys. Rev. Materials 4, 113602 (2020)

  13. arXiv:1909.06064  [pdf, other

    stat.ML cs.LG

    Active learning for level set estimation under cost-dependent input uncertainty

    Authors: Yu Inatsu, Masayuki Karasuyama, Keiichi Inoue, Ichiro Takeuchi

    Abstract: As part of a quality control process in manufacturing it is often necessary to test whether all parts of a product satisfy a required property, with as few inspections as possible. When multiple inspection apparatuses with different costs and precision exist, it is desirable that testing can be carried out cost-effectively by properly controlling the trade-off between the costs and the precision.… ▽ More

    Submitted 13 September, 2019; originally announced September 2019.

  14. arXiv:1906.00127  [pdf, other

    cs.LG stat.ML

    Multi-objective Bayesian Optimization using Pareto-frontier Entropy

    Authors: Shinya Suzuki, Shion Takeno, Tomoyuki Tamura, Kazuki Shitara, Masayuki Karasuyama

    Abstract: This paper studies an entropy-based multi-objective Bayesian optimization (MBO). The entropy search is successful approach to Bayesian optimization. However, for MBO, existing entropy-based methods ignore trade-off among objectives or introduce unreliable approximations. We propose a novel entropy-based MBO called Pareto-frontier entropy search (PFES) by considering the entropy of Pareto-frontier,… ▽ More

    Submitted 10 February, 2020; v1 submitted 31 May, 2019; originally announced June 2019.

  15. arXiv:1905.01788  [pdf, other

    stat.ML cs.LG

    Statistically Discriminative Sub-trajectory Mining

    Authors: Vo Nguyen Le Duy, Takuto Sakuma, Taiju Ishiyama, Hiroki Toda, Kazuya Nishi, Masayuki Karasuyama, Yuta Okubo, Masayuki Sunaga, Yasuo Tabei, Ichiro Takeuchi

    Abstract: We study the problem of discriminative sub-trajectory mining. Given two groups of trajectories, the goal of this problem is to extract moving patterns in the form of sub-trajectories which are more similar to sub-trajectories of one group and less similar to those of the other. We propose a new method called Statistically Discriminative Sub-trajectory Mining (SDSM) for this problem. An advantage o… ▽ More

    Submitted 5 May, 2019; originally announced May 2019.

  16. arXiv:1901.08275  [pdf, other

    stat.ML cs.LG

    Multi-fidelity Bayesian Optimization with Max-value Entropy Search and its parallelization

    Authors: Shion Takeno, Hitoshi Fukuoka, Yuhki Tsukada, Toshiyuki Koyama, Motoki Shiga, Ichiro Takeuchi, Masayuki Karasuyama

    Abstract: In a standard setting of Bayesian optimization (BO), the objective function evaluation is assumed to be highly expensive. Multi-fidelity Bayesian optimization (MFBO) accelerates BO by incorporating lower fidelity observations available with a lower sampling cost. In this paper, we focus on the information-based approach, which is a popular and empirically successful approach in BO. For MFBO, howev… ▽ More

    Submitted 12 February, 2020; v1 submitted 24 January, 2019; originally announced January 2019.

    Comments: 31 pages, 5 figures

  17. arXiv:1802.03923  [pdf, other

    stat.ML

    Safe Triplet Screening for Distance Metric Learning

    Authors: Tomoki Yoshida, Ichiro Takeuchi, Masayuki Karasuyama

    Abstract: We study safe screening for metric learning. Distance metric learning can optimize a metric over a set of triplets, each one of which is defined by a pair of same class instances and an instance in a different class. However, the number of possible triplets is quite huge even for a small dataset. Our safe triplet screening identifies triplets which can be safely removed from the optimization probl… ▽ More

    Submitted 5 October, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

    Comments: 36 pages, 12 figures

  18. Exploring a potential energy surface by machine learning for characterizing atomic transport

    Authors: Kenta Kanamori, Kazuaki Toyoura, Junya Honda, Kazuki Hattori, Atsuto Seko, Masayuki Karasuyama, Kazuki Shitara, Motoki Shiga, Akihide Kuwabara, Ichiro Takeuchi

    Abstract: We propose a machine-learning method for evaluating the potential barrier governing atomic transport based on the preferential selection of dominant points for the atomic transport. The proposed method generates numerous random samples of the entire potential energy surface (PES) from a probabilistic Gaussian process model of the PES, which enables defining the likelihood of the dominant points. T… ▽ More

    Submitted 18 January, 2018; v1 submitted 10 October, 2017; originally announced October 2017.

    Journal ref: Phys. Rev. B 97, 125124 (2018)

  19. Knowledge-Transfer based Cost-effective Search for Interface Structures: A Case Study on fcc-Al [110] Tilt Grain Boundary

    Authors: Tomohiro Yonezu, Tomoyuki Tamura, Ichiro Takeuchi, Masayuki Karasuyama

    Abstract: Determining the atomic configuration of an interface is one of the most important issues in materials science research. Although theoretical simulations are effective tools, an exhaustive search is computationally prohibitive due to the high degrees of freedom of the interface structure. In the interface structure search, multiple energy surfaces created by a variety of orientation angles need to… ▽ More

    Submitted 10 October, 2018; v1 submitted 10 August, 2017; originally announced August 2017.

    Journal ref: Phys. Rev. Materials 2, 113802 (2018)

  20. arXiv:1602.04548  [pdf, other

    stat.ML

    Safe Pattern Pruning: An Efficient Approach for Predictive Pattern Mining

    Authors: Kazuya Nakagawa, Shinya Suzumura, Masayuki Karasuyama, Koji Tsuda, Ichiro Takeuchi

    Abstract: In this paper we study predictive pattern mining problems where the goal is to construct a predictive model based on a subset of predictive patterns in the database. Our main contribution is to introduce a novel method called safe pattern pruning (SPP) for a class of predictive pattern mining problems. The SPP method allows us to efficiently find a superset of all the predictive patterns in the da… ▽ More

    Submitted 14 February, 2016; originally announced February 2016.

  21. arXiv:1602.02485  [pdf, other

    stat.ML

    Simultaneous Safe Screening of Features and Samples in Doubly Sparse Modeling

    Authors: Atsushi Shibagaki, Masayuki Karasuyama, Kohei Hatano, Ichiro Takeuchi

    Abstract: The problem of learning a sparse model is conceptually interpreted as the process of identifying active features/samples and then optimizing the model over them. Recently introduced safe screening allows us to identify a part of non-active features/samples. So far, safe screening has been individually studied either for feature screening or for sample screening. In this paper, we introduce a new a… ▽ More

    Submitted 8 February, 2016; originally announced February 2016.

  22. A machine learning-based selective sampling procedure for identifying the low energy region in a potential energy surface: a case study on proton conduction in oxides

    Authors: Kazuaki Toyoura, Daisuke Hirano, Atsuto Seko, Motoki Shiga, Akihide Kuwabara, Masayuki Karasuyama, Kazuki Shitara, Ichiro Takeuchi

    Abstract: In this paper, we propose a selective sampling procedure to preferentially evaluate a potential energy surface (PES) in a part of the configuration space governing a physical property of interest. The proposed sampling procedure is based on a machine learning method called the Gaussian process (GP), which is used to construct a statistical model of the PES for identifying the region of interest in… ▽ More

    Submitted 3 December, 2015; v1 submitted 2 December, 2015; originally announced December 2015.

    Journal ref: Phys. Rev. B 93, 054112 (2016)

  23. arXiv:1507.03229  [pdf, ps, other

    stat.ML cs.LG

    Homotopy Continuation Approaches for Robust SV Classification and Regression

    Authors: Shinya Suzumura, Kohei Ogawa, Masashi Sugiyama, Masayuki Karasuyama, Ichiro Takeuchi

    Abstract: In support vector machine (SVM) applications with unreliable data that contains a portion of outliers, non-robustness of SVMs often causes considerable performance deterioration. Although many approaches for improving the robustness of SVMs have been studied, two major challenges remain in robust SVM learning. First, robust learning algorithms are essentially formulated as non-convex optimization… ▽ More

    Submitted 12 July, 2015; originally announced July 2015.

  24. arXiv:1506.08002  [pdf, ps, other

    stat.ML

    Safe Feature Pruning for Sparse High-Order Interaction Models

    Authors: Kazuya Nakagawa, Shinya Suzumura, Masayuki Karasuyama, Koji Tsuda, Ichiro Takeuchi

    Abstract: Taking into account high-order interactions among covariates is valuable in many practical regression problems. This is, however, computationally challenging task because the number of high-order interaction features to be considered would be extremely large unless the number of covariates is sufficiently small. In this paper, we propose a novel efficient algorithm for LASSO-based sparse learning… ▽ More

    Submitted 26 June, 2015; originally announced June 2015.

  25. arXiv:1502.02344  [pdf, ps, other

    stat.ML

    Regularization Path of Cross-Validation Error Lower Bounds

    Authors: Atsushi Shibagaki, Yoshiki Suzuki, Masayuki Karasuyama, Ichiro Takeuchi

    Abstract: Careful tuning of a regularization parameter is indispensable in many machine learning tasks because it has a significant impact on generalization performances. Nevertheless, current practice of regularization parameter tuning is more of an art than a science, e.g., it is hard to tell how many grid-points would be needed in cross-validation (CV) for obtaining a solution with sufficiently small CV… ▽ More

    Submitted 22 June, 2015; v1 submitted 8 February, 2015; originally announced February 2015.

  26. arXiv:1105.0471  [pdf, ps, other

    cs.LG

    Suboptimal Solution Path Algorithm for Support Vector Machine

    Authors: Masayuki Karasuyama, Ichiro Takeuchi

    Abstract: We consider a suboptimal solution path algorithm for the Support Vector Machine. The solution path algorithm is an effective tool for solving a sequence of a parametrized optimization problems in machine learning. The path of the solutions provided by this algorithm are very accurate and they satisfy the optimality conditions more strictly than other SVM optimization algorithms. In many machine le… ▽ More

    Submitted 2 May, 2011; originally announced May 2011.

    Comments: A shorter version of this paper is submitted to ICML 2011

  27. arXiv:1009.4791  [pdf, ps, other

    cs.LG

    Multi-parametric Solution-path Algorithm for Instance-weighted Support Vector Machines

    Authors: Masayuki Karasuyama, Naoyuki Harada, Masashi Sugiyama, Ichiro Takeuchi

    Abstract: An instance-weighted variant of the support vector machine (SVM) has attracted considerable attention recently since they are useful in various machine learning tasks such as non-stationary data analysis, heteroscedastic data modeling, transfer learning, learning to rank, and transduction. An important challenge in these scenarios is to overcome the computational bottleneck---instance weights ofte… ▽ More

    Submitted 1 November, 2010; v1 submitted 24 September, 2010; originally announced September 2010.

    Comments: Submitted to Journal of Machine Learning Research