Skip to main content

Showing 1–37 of 37 results for author: Tomioka, R

.
  1. arXiv:2312.03687  [pdf, other

    cond-mat.mtrl-sci cs.AI

    MatterGen: a generative model for inorganic materials design

    Authors: Claudio Zeni, Robert Pinsler, Daniel Zügner, Andrew Fowler, Matthew Horton, Xiang Fu, Sasha Shysheya, Jonathan Crabbé, Lixin Sun, Jake Smith, Bichlien Nguyen, Hannes Schulz, Sarah Lewis, Chin-Wei Huang, Ziheng Lu, Yichi Zhou, Han Yang, Hongxia Hao, Jielan Li, Ryota Tomioka, Tian Xie

    Abstract: The design of functional materials with desired properties is essential in driving technological advances in areas like energy storage, catalysis, and carbon capture. Generative models provide a new paradigm for materials design by directly generating entirely novel materials given desired property constraints. Despite recent progress, current generative models have low success rate in proposing s… ▽ More

    Submitted 29 January, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: 13 pages main text, 35 pages supplementary information

  2. arXiv:2309.07200  [pdf, other

    cs.LG cs.AI cs.IT

    Latent Representation and Simulation of Markov Processes via Time-Lagged Information Bottleneck

    Authors: Marco Federici, Patrick Forré, Ryota Tomioka, Bastiaan S. Veeling

    Abstract: Markov processes are widely used mathematical models for describing dynamic systems in various fields. However, accurately simulating large-scale systems at long time scales is computationally expensive due to the short time steps required for accurate integration. In this paper, we introduce an inference process that maps complex systems into a simplified representational space and models large j… ▽ More

    Submitted 26 January, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: 10 pages, 15 figures, Accepted ICLR 2024

  3. arXiv:2302.01170  [pdf, other

    stat.ML cond-mat.stat-mech cs.LG physics.chem-ph

    Timewarp: Transferable Acceleration of Molecular Dynamics by Learning Time-Coarsened Dynamics

    Authors: Leon Klein, Andrew Y. K. Foong, Tor Erlend Fjelde, Bruno Mlodozeniec, Marc Brockschmidt, Sebastian Nowozin, Frank Noé, Ryota Tomioka

    Abstract: Molecular dynamics (MD) simulation is a widely used technique to simulate molecular systems, most commonly at the all-atom resolution where equations of motion are integrated with timesteps on the order of femtoseconds ($1\textrm{fs}=10^{-15}\textrm{s}$). MD is often used to compute equilibrium properties, which requires sampling from an equilibrium distribution such as the Boltzmann distribution.… ▽ More

    Submitted 1 December, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

  4. arXiv:2111.05426  [pdf, other

    cs.LG cs.DC

    DistIR: An Intermediate Representation and Simulator for Efficient Neural Network Distribution

    Authors: Keshav Santhanam, Siddharth Krishna, Ryota Tomioka, Tim Harris, Matei Zaharia

    Abstract: The rapidly growing size of deep neural network (DNN) models and datasets has given rise to a variety of distribution strategies such as data, tensor-model, pipeline parallelism, and hybrid combinations thereof. Each of these strategies offers its own trade-offs and exhibits optimal performance across different models and hardware topologies. Selecting the best set of strategies for a given setup… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

  5. arXiv:2106.03783  [pdf, other

    cs.LG cs.IT

    An Information-theoretic Approach to Distribution Shifts

    Authors: Marco Federici, Ryota Tomioka, Patrick Forré

    Abstract: Safely deploying machine learning models to the real world is often a challenging process. Models trained with data obtained from a specific geographic location tend to fail when queried with data obtained elsewhere, agents trained in a simulation can struggle to adapt when deployed in the real world or novel environments, and neural networks that are fit to a subset of the population might carry… ▽ More

    Submitted 1 November, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

  6. arXiv:2101.07012  [pdf, other

    cs.LG stat.ML

    Regularized Policies are Reward Robust

    Authors: Hisham Husain, Kamil Ciosek, Ryota Tomioka

    Abstract: Entropic regularization of policies in Reinforcement Learning (RL) is a commonly used heuristic to ensure that the learned policy explores the state-space sufficiently before overfitting to a local optimal policy. The primary motivation for using entropy is for exploration and disambiguating optimal policies; however, the theoretical effects are not entirely understood. In this work, we study the… ▽ More

    Submitted 18 January, 2021; originally announced January 2021.

  7. arXiv:2006.08403  [pdf, other

    cs.LG stat.ML

    On the Loss Landscape of Adversarial Training: Identifying Challenges and How to Overcome Them

    Authors: Chen Liu, Mathieu Salzmann, Tao Lin, Ryota Tomioka, Sabine Süsstrunk

    Abstract: We analyze the influence of adversarial training on the loss landscape of machine learning models. To this end, we first provide analytical studies of the properties of adversarial loss functions under different adversarial budgets. We then demonstrate that the adversarial loss landscape is less favorable to optimization, due to increased curvature and more scattered gradients. Our conclusions are… ▽ More

    Submitted 2 November, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

  8. arXiv:1903.06603  [pdf, other

    cs.LG stat.ML

    On Certifying Non-uniform Bound against Adversarial Attacks

    Authors: Chen Liu, Ryota Tomioka, Volkan Cevher

    Abstract: This work studies the robustness certification problem of neural network models, which aims to find certified adversary-free regions as large as possible around data points. In contrast to the existing approaches that seek regions bounded uniformly along all input features, we consider non-uniform bounds and use it to study the decision boundary of neural network models. We formulate our target as… ▽ More

    Submitted 12 July, 2019; v1 submitted 15 March, 2019; originally announced March 2019.

  9. arXiv:1901.06033  [pdf, other

    stat.ML cs.LG

    Continuous Hierarchical Representations with Poincaré Variational Auto-Encoders

    Authors: Emile Mathieu, Charline Le Lan, Chris J. Maddison, Ryota Tomioka, Yee Whye Teh

    Abstract: The variational auto-encoder (VAE) is a popular method for learning a generative model and embeddings of the data. Many real datasets are hierarchically structured. However, traditional VAEs map data in a Euclidean latent space which cannot efficiently embed tree-like structures. Hyperbolic spaces with negative curvature can. We therefore endow VAEs with a Poincaré ball model of hyperbolic geometr… ▽ More

    Submitted 25 November, 2019; v1 submitted 17 January, 2019; originally announced January 2019.

    Comments: Advances in Neural Information Processing Systems

  10. arXiv:1805.11711  [pdf, other

    cs.LG cs.AI stat.ML

    Depth and nonlinearity induce implicit exploration for RL

    Authors: Justas Dauparas, Ryota Tomioka, Katja Hofmann

    Abstract: The question of how to explore, i.e., take actions with uncertain outcomes to learn about possible future rewards, is a key question in reinforcement learning (RL). Here, we show a surprising result: We show that Q-learning with nonlinear Q-function and no explicit exploration (i.e., a purely greedy policy) can learn several standard benchmark tasks, including mountain car, equally well as, or bet… ▽ More

    Submitted 29 May, 2018; originally announced May 2018.

  11. arXiv:1705.09786  [pdf, other

    cs.LG cs.AI cs.DC stat.ML

    AMPNet: Asynchronous Model-Parallel Training for Dynamic Neural Networks

    Authors: Alexander L. Gaunt, Matthew A. Johnson, Maik Riechert, Daniel Tarlow, Ryota Tomioka, Dimitrios Vytiniotis, Sam Webster

    Abstract: New types of machine learning hardware in development and entering the market hold the promise of revolutionizing deep learning in a manner as profound as GPUs. However, existing software frameworks and training algorithms for deep learning have yet to evolve to fully leverage the capability of the new wave of silicon. We already see the limitations of existing algorithms for models that exploit s… ▽ More

    Submitted 22 June, 2017; v1 submitted 27 May, 2017; originally announced May 2017.

    Comments: 17 pages, 13 figures

  12. arXiv:1705.08841  [pdf, other

    cs.LG stat.ML

    Multi-Level Variational Autoencoder: Learning Disentangled Representations from Grouped Observations

    Authors: Diane Bouchacourt, Ryota Tomioka, Sebastian Nowozin

    Abstract: We would like to learn a representation of the data which decomposes an observation into factors of variation which we can independently control. Specifically, we want to use minimal supervision to learn a latent representation that reflects the semantics behind a specific grou** of the data, where within a group the samples share a common factor of variation. For example, consider a collection… ▽ More

    Submitted 24 May, 2017; originally announced May 2017.

  13. arXiv:1705.03071  [pdf, other

    cs.LG

    Geometry of Optimization and Implicit Regularization in Deep Learning

    Authors: Behnam Neyshabur, Ryota Tomioka, Ruslan Salakhutdinov, Nathan Srebro

    Abstract: We argue that the optimization plays a crucial role in generalization of deep learning models through implicit regularization. We do this by demonstrating that generalization ability is not controlled by network size but rather by some other implicit control. We then demonstrate how changing the empirical optimization procedure can improve generalization, even if actual optimization quality is not… ▽ More

    Submitted 8 May, 2017; originally announced May 2017.

    Comments: This survey chapter was done as a part of Intel Collaborative Research institute for Computational Intelligence (ICRI-CI) "Why & When Deep Learning works -- looking inside Deep Learning" compendium with the generous support of ICRI-CI. arXiv admin note: substantial text overlap with arXiv:1506.02617

  14. arXiv:1702.03334  [pdf, other

    stat.ML cs.LG

    Batch Policy Gradient Methods for Improving Neural Conversation Models

    Authors: Kirthevasan Kandasamy, Yoram Bachrach, Ryota Tomioka, Daniel Tarlow, David Carter

    Abstract: We study reinforcement learning of chatbots with recurrent neural network architectures when the rewards are noisy and expensive to obtain. For instance, a chatbot used in automated customer service support can be scored by quality assurance agents, but this process can be expensive, time consuming and noisy. Previous reinforcement learning work for natural language processing uses on-policy updat… ▽ More

    Submitted 10 February, 2017; originally announced February 2017.

    Comments: International Conference on Learning Representations (ICLR) 2017

  15. arXiv:1611.02266  [pdf, other

    stat.ML cs.AI cs.CL cs.LG

    Gaussian Attention Model and Its Application to Knowledge Base Embedding and Question Answering

    Authors: Liwen Zhang, John Winn, Ryota Tomioka

    Abstract: We propose the Gaussian attention model for content-based neural memory access. With the proposed attention model, a neural network has the additional degree of freedom to control the focus of its attention from a laser sharp attention to a broad attention. It is applicable whenever we can assume that the distance in the latent space reflects some notion of semantics. We use the proposed attention… ▽ More

    Submitted 30 November, 2016; v1 submitted 7 November, 2016; originally announced November 2016.

    Comments: 16 pages, 4 figures

  16. arXiv:1610.02132  [pdf, other

    cs.LG cs.DS

    QSGD: Communication-Efficient SGD via Gradient Quantization and Encoding

    Authors: Dan Alistarh, Demjan Grubic, Jerry Li, Ryota Tomioka, Milan Vojnovic

    Abstract: Parallel implementations of stochastic gradient descent (SGD) have received significant research attention, thanks to excellent scalability properties of this algorithm, and to its efficiency in the context of training deep neural networks. A fundamental barrier for parallelizing large-scale SGD is the fact that the cost of communicating the gradient updates between nodes can be very large. Conseq… ▽ More

    Submitted 6 December, 2017; v1 submitted 6 October, 2016; originally announced October 2016.

  17. arXiv:1606.00709  [pdf, other

    stat.ML cs.LG stat.ME

    f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization

    Authors: Sebastian Nowozin, Botond Cseke, Ryota Tomioka

    Abstract: Generative neural samplers are probabilistic models that implement sampling using feedforward neural networks: they take a random input vector and produce a sample from a probability distribution defined by the network weights. These models are expressive and allow efficient computation of samples and derivatives, but cannot be used for computing likelihoods or for marginalization. The generative-… ▽ More

    Submitted 2 June, 2016; originally announced June 2016.

    Comments: 17 pages

  18. arXiv:1511.06747  [pdf, other

    cs.LG

    Data-Dependent Path Normalization in Neural Networks

    Authors: Behnam Neyshabur, Ryota Tomioka, Ruslan Salakhutdinov, Nathan Srebro

    Abstract: We propose a unified framework for neural net normalization, regularization and optimization, which includes Path-SGD and Batch-Normalization and interpolates between them across two different dimensions. Through this framework we investigate issue of invariance of the optimization, data dependence and the connection with natural gradients.

    Submitted 19 January, 2016; v1 submitted 20 November, 2015; originally announced November 2015.

    Comments: 17 pages, 3 figures

  19. arXiv:1509.01770  [pdf, ps, other

    cs.LG stat.ML

    Theoretical and Experimental Analyses of Tensor-Based Regression and Classification

    Authors: Kishan Wimalawarne, Ryota Tomioka, Masashi Sugiyama

    Abstract: We theoretically and experimentally investigate tensor-based regression and classification. Our focus is regularization with various tensor norms, including the overlapped trace norm, the latent trace norm, and the scaled latent trace norm. We first give dual optimization methods using the alternating direction method of multipliers, which is computationally efficient when the number of training s… ▽ More

    Submitted 6 September, 2015; originally announced September 2015.

  20. arXiv:1503.05479  [pdf, other

    cs.LG cs.AI stat.ML

    Interpolating Convex and Non-Convex Tensor Decompositions via the Subspace Norm

    Authors: Qinqing Zheng, Ryota Tomioka

    Abstract: We consider the problem of recovering a low-rank tensor from its noisy observation. Previous work has shown a recovery guarantee with signal to noise ratio $O(n^{\lceil K/2 \rceil /2})$ for recovering a $K$th order rank one tensor of size $n\times \cdots \times n$ by recursive unfolding. In this paper, we first improve this bound to $O(n^{K/4})$ by a much simpler approach, but with a more careful… ▽ More

    Submitted 26 October, 2015; v1 submitted 18 March, 2015; originally announced March 2015.

  21. arXiv:1503.01521  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    Jointly Learning Multiple Measures of Similarities from Triplet Comparisons

    Authors: Liwen Zhang, Subhransu Maji, Ryota Tomioka

    Abstract: Similarity between objects is multi-faceted and it can be easier for human annotators to measure it when the focus is on a specific aspect. We consider the problem of map** objects into view-specific embeddings where the distance between them is consistent with the similarity comparisons of the form "from the t-th view, object A is more similar to B than to C". Our framework jointly learns view-… ▽ More

    Submitted 6 October, 2015; v1 submitted 4 March, 2015; originally announced March 2015.

  22. arXiv:1503.00036  [pdf, ps, other

    cs.LG cs.AI cs.NE stat.ML

    Norm-Based Capacity Control in Neural Networks

    Authors: Behnam Neyshabur, Ryota Tomioka, Nathan Srebro

    Abstract: We investigate the capacity, convexity and characterization of a general family of norm-constrained feed-forward networks.

    Submitted 14 April, 2015; v1 submitted 27 February, 2015; originally announced March 2015.

    Comments: 29 pages

  23. arXiv:1412.6614  [pdf, ps, other

    cs.LG cs.AI cs.CV stat.ML

    In Search of the Real Inductive Bias: On the Role of Implicit Regularization in Deep Learning

    Authors: Behnam Neyshabur, Ryota Tomioka, Nathan Srebro

    Abstract: We present experiments demonstrating that some other form of capacity control, different from network size, plays a central role in learning multilayer feed-forward networks. We argue, partially through analogy to matrix factorization, that this is an inductive bias that can help shed light on deep learning.

    Submitted 16 April, 2015; v1 submitted 20 December, 2014; originally announced December 2014.

    Comments: 9 pages, 2 figures

  24. arXiv:1407.1870  [pdf, ps, other

    math.ST stat.ML

    Spectral norm of random tensors

    Authors: Ryota Tomioka, Taiji Suzuki

    Abstract: We show that the spectral norm of a random $n_1\times n_2\times \cdots \times n_K$ tensor (or higher-order array) scales as $O\left(\sqrt{(\sum_{k=1}^{K}n_k)\log(K)}\right)$ under some sub-Gaussian assumption on the entries. The proof is based on a covering number argument. Since the spectral norm is dual to the tensor nuclear norm (the tightest convex relaxation of the set of rank one tensors), t… ▽ More

    Submitted 7 July, 2014; originally announced July 2014.

    Comments: 5 pages

  25. arXiv:1303.6370  [pdf, ps, other

    stat.ML cs.LG math.NA

    Convex Tensor Decomposition via Structured Schatten Norm Regularization

    Authors: Ryota Tomioka, Taiji Suzuki

    Abstract: We discuss structured Schatten norms for tensor decomposition that includes two recently proposed norms ("overlapped" and "latent") for convex-optimization-based tensor decomposition, and connect tensor decomposition with wider literature on structured sparsity. Based on the properties of the structured Schatten norms, we mathematically analyze the performance of "latent" approach for tensor decom… ▽ More

    Submitted 25 March, 2013; originally announced March 2013.

    Comments: 12 pages, 3 figures

  26. arXiv:1211.4116  [pdf, other

    cs.LG math.AG math.CO math.NA stat.ML

    The Algebraic Combinatorial Approach for Low-Rank Matrix Completion

    Authors: Franz J. Király, Louis Theran, Ryota Tomioka

    Abstract: We present a novel algebraic combinatorial view on low-rank matrix completion based on studying relations between a few entries with tools from algebraic geometry and matroid theory. The intrinsic locality of the approach allows for the treatment of single entries in a closed theoretical and practical framework. More specifically, apart from introducing an algebraic combinatorial theory of low-ran… ▽ More

    Submitted 19 August, 2014; v1 submitted 17 November, 2012; originally announced November 2012.

    Comments: 37 pages, with an appendix by Takeaki Uno

  27. arXiv:1206.6470  [pdf

    cs.LG cs.DM math.NA stat.ML

    A Combinatorial Algebraic Approach for the Identifiability of Low-Rank Matrix Completion

    Authors: Franz Kiraly, Ryota Tomioka

    Abstract: In this paper, we review the problem of matrix completion and expose its intimate relations with algebraic geometry, combinatorics and graph theory. We present the first necessary and sufficient combinatorial conditions for matrices of arbitrary rank to be identifiable from a set of matrix entries, yielding theoretical constraints and new algorithms for the problem of matrix completion. We conclud… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)

  28. arXiv:1110.2899  [pdf, ps, other

    stat.ML cs.LG cs.SI physics.soc-ph

    Discovering Emerging Topics in Social Streams via Link Anomaly Detection

    Authors: Toshimitsu Takahashi, Ryota Tomioka, Kenji Yamanishi

    Abstract: Detection of emerging topics are now receiving renewed interest motivated by the rapid growth of social networks. Conventional term-frequency-based approaches may not be appropriate in this context, because the information exchanged are not only texts but also images, URLs, and videos. We focus on the social aspects of theses networks. That is, the links between users that are generated dynamicall… ▽ More

    Submitted 13 October, 2011; originally announced October 2011.

    Comments: 10 pages, 6 figures

  29. arXiv:1103.5201  [pdf, ps, other

    stat.ML

    Sharp Convergence Rate and Support Consistency of Multiple Kernel Learning with Sparse and Dense Regularization

    Authors: Taiji Suzuki, Ryota Tomioka, Masashi Sugiyama

    Abstract: We theoretically investigate the convergence rate and support consistency (i.e., correctly identifying the subset of non-zero coefficients in the large sample limit) of multiple kernel learning (MKL). We focus on MKL with block-l1 regularization (inducing sparse kernel combination), block-l2 regularization (inducing uniform kernel combination), and elastic-net regularization (including both block-… ▽ More

    Submitted 28 July, 2011; v1 submitted 27 March, 2011; originally announced March 2011.

    Comments: 26 pages, 1 figure

  30. arXiv:1103.0431  [pdf, ps, other

    stat.ML

    Fast Convergence Rate of Multiple Kernel Learning with Elastic-net Regularization

    Authors: Taiji Suzuki, Ryota Tomioka, Masashi Sugiyama

    Abstract: We investigate the learning rate of multiple kernel leaning (MKL) with elastic-net regularization, which consists of an $\ell_1$-regularizer for inducing the sparsity and an $\ell_2$-regularizer for controlling the smoothness. We focus on a sparse setting where the total number of kernels is large but the number of non-zero components of the ground truth is relatively small, and prove that elastic… ▽ More

    Submitted 13 July, 2011; v1 submitted 2 March, 2011; originally announced March 2011.

    Comments: 21 pages, 0 figure

  31. arXiv:1011.3090  [pdf, ps, other

    stat.ML cs.LG

    Regularization Strategies and Empirical Bayesian Learning for MKL

    Authors: Ryota Tomioka, Taiji Suzuki

    Abstract: Multiple kernel learning (MKL), structured sparsity, and multi-task learning have recently received considerable attention. In this paper, we show how different MKL algorithms can be understood as applications of either regularization on the kernel weights or block-norm-based regularization, which is more common in structured sparsity and multi-task learning. We show that these two regularization… ▽ More

    Submitted 2 March, 2011; v1 submitted 12 November, 2010; originally announced November 2010.

    Comments: 19pages, 6 figures

  32. arXiv:1010.0789  [pdf, ps, other

    stat.ML math.NA

    Estimation of low-rank tensors via convex optimization

    Authors: Ryota Tomioka, Kohei Hayashi, Hisashi Kashima

    Abstract: In this paper, we propose three approaches for the estimation of the Tucker decomposition of multi-way arrays (tensors) from partial observations. All approaches are formulated as convex minimization problems. Therefore, the minimum is guaranteed to be unique. The proposed approaches can automatically estimate the number of factors (rank) through the optimization. Thus, there is no need to specify… ▽ More

    Submitted 2 March, 2011; v1 submitted 5 October, 2010; originally announced October 2010.

    Comments: 19 pages, 7 figures

  33. arXiv:1001.2615  [pdf, ps, other

    stat.ML stat.AP stat.ME

    Sparsity-accuracy trade-off in MKL

    Authors: Ryota Tomioka, Taiji Suzuki

    Abstract: We empirically investigate the best trade-off between sparse and uniformly-weighted multiple kernel learning (MKL) using the elastic-net regularization on real and simulated datasets. We find that the best trade-off parameter depends not only on the sparsity of the true kernel-weight spectrum but also on the linear dependence among kernels and the number of samples.

    Submitted 15 January, 2010; originally announced January 2010.

    Comments: 8pages, 2 figures

  34. arXiv:0912.2412  [pdf, other

    stat.ME stat.AP stat.ML

    Modeling sparse connectivity between underlying brain sources for EEG/MEG

    Authors: Stefan Haufe, Ryota Tomioka, Guido Nolte, Klaus-Robert Mueller, Motoaki Kawanabe

    Abstract: We propose a novel technique to assess functional brain connectivity in EEG/MEG signals. Our method, called Sparsely-Connected Sources Analysis (SCSA), can overcome the problem of volume conduction by modeling neural data innovatively with the following ingredients: (a) the EEG is assumed to be a linear mixture of correlated sources following a multivariate autoregressive (MVAR) model, (b) the d… ▽ More

    Submitted 12 December, 2009; originally announced December 2009.

    Comments: 9 pages, 6 figures

    Journal ref: IEEE Trans. Biomed. Eng. 57(8) (2010) 1954 - 1963;

  35. arXiv:0911.4046  [pdf, ps, other

    stat.ML cs.LG stat.ME

    Super-Linear Convergence of Dual Augmented-Lagrangian Algorithm for Sparsity Regularized Estimation

    Authors: Ryota Tomioka, Taiji Suzuki, Masashi Sugiyama

    Abstract: We analyze the convergence behaviour of a recently proposed algorithm for regularized estimation called Dual Augmented Lagrangian (DAL). Our analysis is based on a new interpretation of DAL as a proximal minimization algorithm. We theoretically show under some conditions that DAL converges super-linearly in a non-asymptotic and global sense. Due to a special modelling of sparse estimation problems… ▽ More

    Submitted 2 January, 2011; v1 submitted 20 November, 2009; originally announced November 2009.

    Comments: 51 pages, 9 figures

    Journal ref: Journal of Machine Learning Research, 12(May):1537-1586, 2011

  36. arXiv:0909.5026  [pdf, ps, other

    stat.ML stat.CO

    SpicyMKL

    Authors: Taiji Suzuki, Ryota Tomioka

    Abstract: We propose a new optimization algorithm for Multiple Kernel Learning (MKL) called SpicyMKL, which is applicable to general convex loss functions and general types of regularization. The proposed SpicyMKL iteratively solves smooth minimization problems. Thus, there is no need of solving SVM, LP, or QP internally. SpicyMKL can be viewed as a proximal minimization method and converges super-linearly.… ▽ More

    Submitted 8 May, 2011; v1 submitted 28 September, 2009; originally announced September 2009.

    Comments: 30 pages, 6 figures

  37. Dual Augmented Lagrangian Method for Efficient Sparse Reconstruction

    Authors: Ryota Tomioka, Masashi Sugiyama

    Abstract: We propose an efficient algorithm for sparse signal reconstruction problems. The proposed algorithm is an augmented Lagrangian method based on the dual sparse reconstruction problem. It is efficient when the number of unknown variables is much larger than the number of observations because of the dual formulation. Moreover, the primal variable is explicitly updated and the sparsity in the soluti… ▽ More

    Submitted 3 April, 2009; originally announced April 2009.

    Comments: 10 pages, 3 figures

    Journal ref: IEEE Signal Processing Letters, volume 16, issue 12, pages 1067 - 1070, 2009