Skip to main content

Showing 1–22 of 22 results for author: Klami, A

.
  1. arXiv:2406.00502  [pdf, other

    math.OC cs.LG

    Non-geodesically-convex optimization in the Wasserstein space

    Authors: Hoang Phuc Hau Luu, Hanlin Yu, Bernardo Williams, Petrus Mikkola, Marcelo Hartmann, Kai Puolamäki, Arto Klami

    Abstract: We study a class of optimization problems in the Wasserstein space (the space of probability measures) where the objective function is \emph{nonconvex} along generalized geodesics. When the regularization term is the negative entropy, the optimization problem becomes a sampling problem where it minimizes the Kullback-Leibler divergence between a probability measure (optimization variable) and a ta… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  2. arXiv:2311.02766  [pdf, other

    cs.LG stat.ME stat.ML

    Riemannian Laplace Approximation with the Fisher Metric

    Authors: Hanlin Yu, Marcelo Hartmann, Bernardo Williams, Mark Girolami, Arto Klami

    Abstract: Laplace's method approximates a target density with a Gaussian distribution at its mode. It is computationally efficient and asymptotically exact for Bayesian inference due to the Bernstein-von Mises theorem, but for complex targets and finite-data posteriors it is often too crude an approximation. A recent generalization of the Laplace Approximation transforms the Gaussian approximation according… ▽ More

    Submitted 7 May, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

    Comments: AISTATS 2024, with additional fixes and improvements

  3. arXiv:2308.08305  [pdf, other

    stat.ML cs.LG

    Warped geometric information on the optimisation of Euclidean functions

    Authors: Marcelo Hartmann, Bernardo Williams, Hanlin Yu, Mark Girolami, Alessandro Barp, Arto Klami

    Abstract: We consider the fundamental task of optimising a real-valued function defined in a potentially high-dimensional Euclidean space, such as the loss function in many machine-learning tasks or the logarithm of the probability distribution in statistical inference. We use Riemannian geometry notions to redefine the optimisation problem of a function on the Euclidean space to a Riemannian manifold with… ▽ More

    Submitted 18 March, 2024; v1 submitted 16 August, 2023; originally announced August 2023.

  4. arXiv:2303.05101  [pdf, other

    cs.LG stat.CO

    Scalable Stochastic Gradient Riemannian Langevin Dynamics in Non-Diagonal Metrics

    Authors: Hanlin Yu, Marcelo Hartmann, Bernardo Williams, Arto Klami

    Abstract: Stochastic-gradient sampling methods are often used to perform Bayesian inference on neural networks. It has been observed that the methods in which notions of differential geometry are included tend to have better performances, with the Riemannian metric improving posterior exploration by accounting for the local curvature. However, the existing methods often resort to simple diagonal metrics to… ▽ More

    Submitted 31 March, 2024; v1 submitted 9 March, 2023; originally announced March 2023.

    Comments: Adjust the template and minor fixes

  5. arXiv:2210.10487  [pdf, other

    cs.LG stat.ML

    Estimating the Contamination Factor's Distribution in Unsupervised Anomaly Detection

    Authors: Lorenzo Perini, Paul Buerkner, Arto Klami

    Abstract: Anomaly detection methods identify examples that do not follow the expected behaviour, typically in an unsupervised fashion, by assigning real-valued anomaly scores to the examples based on various heuristics. These scores need to be transformed into actual predictions by thresholding, so that the proportion of examples marked as anomalies equals the expected proportion of anomalies, called contam… ▽ More

    Submitted 17 October, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

  6. arXiv:2210.01006  [pdf, other

    astro-ph.EP astro-ph.IM cs.LG

    Neural network for determining an asteroid mineral composition from reflectance spectra

    Authors: David Korda, Antti Penttilä, Arto Klami, Tomáš Kohout

    Abstract: Chemical and mineral compositions of asteroids reflect the formation and history of our Solar System. This knowledge is also important for planetary defence and in-space resource utilisation. We aim to develop a fast and robust neural-network-based method for deriving the mineral modal and chemical compositions of silicate materials from their visible and near-infrared spectra. The method should b… ▽ More

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: main text: 12 pages, 12 figures, 10 tables; appendix: 8 pages, 20 figures, 6 tables

    Journal ref: A&A 669, A101 (2023)

  7. arXiv:2202.00755  [pdf, other

    stat.ME cs.AI cs.LG

    Lagrangian Manifold Monte Carlo on Monge Patches

    Authors: Marcelo Hartmann, Mark Girolami, Arto Klami

    Abstract: The efficiency of Markov Chain Monte Carlo (MCMC) depends on how the underlying geometry of the problem is taken into account. For distributions with strongly varying curvature, Riemannian metrics help in efficient exploration of the target distribution. Unfortunately, they have significant computational overhead due to e.g. repeated inversion of the metric tensor, and current geometric MCMC metho… ▽ More

    Submitted 1 February, 2022; originally announced February 2022.

  8. arXiv:2112.03230  [pdf, other

    cs.LG stat.ML

    Traversing Time with Multi-Resolution Gaussian Process State-Space Models

    Authors: Krista Longi, Jakob Lindinger, Olaf Duennbier, Melih Kandemir, Arto Klami, Barbara Rakitsch

    Abstract: Gaussian Process state-space models capture complex temporal dependencies in a principled manner by placing a Gaussian Process prior on the transition function. These models have a natural interpretation as discretized stochastic differential equations, but inference for long sequences with fast and slow transitions is difficult. Fast transitions need tight discretizations whereas slow transitions… ▽ More

    Submitted 23 February, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: Added links to code and dataset. Added author contributions

  9. arXiv:2112.01380  [pdf, other

    stat.ME

    Prior knowledge elicitation: The past, present, and future

    Authors: Petrus Mikkola, Osvaldo A. Martin, Suyog Chandramouli, Marcelo Hartmann, Oriol Abril Pla, Owen Thomas, Henri Pesonen, Jukka Corander, Aki Vehtari, Samuel Kaski, Paul-Christian Bürkner, Arto Klami

    Abstract: Specification of the prior distribution for a Bayesian model is a central part of the Bayesian workflow for data analysis, but it is often difficult even for statistical experts. In principle, prior elicitation transforms domain knowledge of various kinds into well-defined prior distributions, and offers a solution to the prior specification problem. In practice, however, we are still fairly far f… ▽ More

    Submitted 9 May, 2023; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: 69 pages, 1 figure

  10. arXiv:2006.15568  [pdf, other

    cs.LG stat.ML

    Reliable Categorical Variational Inference with Mixture of Discrete Normalizing Flows

    Authors: Tomasz Kuśmierczyk, Arto Klami

    Abstract: Variational approximations are increasingly based on gradient-based optimization of expectations estimated by sampling. Handling discrete latent variables is then challenging because the sampling process is not differentiable. Continuous relaxations, such as the Gumbel-Softmax for categorical distribution, enable gradient-based optimization, but do not define a valid probability mass for discrete… ▽ More

    Submitted 8 February, 2021; v1 submitted 28 June, 2020; originally announced June 2020.

  11. Multi-scale Cloud Detection in Remote Sensing Images using a Dual Convolutional Neural Network

    Authors: Markku Luotamo, Sari Metsämäki, Arto Klami

    Abstract: Semantic segmentation by convolutional neural networks (CNN) has advanced the state of the art in pixel-level classification of remote sensing images. However, processing large images typically requires analyzing the image in small patches, and hence features that have large spatial extent still cause challenges in tasks such as cloud masking. To support a wider scale of spatial features while sim… ▽ More

    Submitted 1 June, 2020; originally announced June 2020.

    ACM Class: I.2.6; I.4.6

  12. arXiv:2002.09868  [pdf, other

    stat.ME

    Flexible Prior Elicitation via the Prior Predictive Distribution

    Authors: Marcelo Hartmann, Georgi Agiashvili, Paul Bürkner, Arto Klami

    Abstract: The prior distribution for the unknown model parameters plays a crucial role in the process of statistical inference based on Bayesian methods. However, specifying suitable priors is often difficult even when detailed prior knowledge is available in principle. The challenge is to express quantitative information in the form of a probability distribution. Prior elicitation addresses this question b… ▽ More

    Submitted 16 March, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

    Comments: 24 pages, 3 figures, conference submission

  13. arXiv:1910.12263  [pdf, other

    stat.ML cs.LG

    Prior Specification for Bayesian Matrix Factorization via Prior Predictive Matching

    Authors: Eliezer de Souza da Silva, Tomasz Kuśmierczyk, Marcelo Hartmann, Arto Klami

    Abstract: The behavior of many Bayesian models used in machine learning critically depends on the choice of prior distributions, controlled by some hyperparameters that are typically selected by Bayesian optimization or cross-validation. This requires repeated, costly, posterior inference. We provide an alternative for selecting good priors without carrying out posterior inference, building on the prior pre… ▽ More

    Submitted 30 September, 2022; v1 submitted 27 October, 2019; originally announced October 2019.

    Journal ref: Journal of Machine Learning Research 24 (2023) 1-51

  14. arXiv:1909.04919  [pdf, other

    stat.ML cs.LG

    Correcting Predictions for Approximate Bayesian Inference

    Authors: Tomasz Kuśmierczyk, Joseph Sakaya, Arto Klami

    Abstract: Bayesian models quantify uncertainty and facilitate optimal decision-making in downstream applications. For most models, however, practitioners are forced to use approximate inference techniques that lead to sub-optimal decisions due to incorrect posterior predictive distributions. We present a novel approach that corrects for inaccuracies in posterior inference by altering the decision-making pro… ▽ More

    Submitted 11 September, 2019; originally announced September 2019.

  15. arXiv:1902.00792  [pdf, other

    stat.ML cs.LG

    Variational Bayesian Decision-making for Continuous Utilities

    Authors: Tomasz Kuśmierczyk, Joseph Sakaya, Arto Klami

    Abstract: Bayesian decision theory outlines a rigorous framework for making optimal decisions based on maximizing expected utility over a model posterior. However, practitioners often do not have access to the full posterior and resort to approximate inference strategies. In such cases, taking the eventual decision-making task into account while performing the inference allows for calibrating the posterior… ▽ More

    Submitted 27 October, 2019; v1 submitted 2 February, 2019; originally announced February 2019.

    Comments: Appearing at Neural Information Processing Systems 32 (NeurIPS 2019)

  16. arXiv:1704.05786  [pdf, other

    stat.ML

    Importance Sampled Stochastic Optimization for Variational Inference

    Authors: Joseph Sakaya, Arto Klami

    Abstract: Variational inference approximates the posterior distribution of a probabilistic model with a parameterized density by maximizing a lower bound for the model evidence. Modern solutions fit a flexible approximation with stochastic gradient descent, using Monte Carlo approximation for the gradients. This enables variational inference for arbitrary differentiable probabilistic models, and consequentl… ▽ More

    Submitted 12 July, 2017; v1 submitted 19 April, 2017; originally announced April 2017.

    Comments: 10 pages, 10 figures; published in Uncertainty in Artificial Intelligence, 2017

  17. arXiv:1411.5799  [pdf, other

    stat.ML

    Group Factor Analysis

    Authors: Arto Klami, Seppo Virtanen, Eemeli Leppäaho, Samuel Kaski

    Abstract: Factor analysis provides linear factors that describe relationships between individual variables of a data set. We extend this classical formulation into linear factors that describe relationships between groups of variables, where each group represents either a set of related variables or a data set. The model also naturally extends canonical correlation analysis to more than two sets, in a way t… ▽ More

    Submitted 2 December, 2014; v1 submitted 21 November, 2014; originally announced November 2014.

  18. arXiv:1410.0471  [pdf, other

    cs.IR cs.AI

    PinView: Implicit Feedback in Content-Based Image Retrieval

    Authors: Zakria Hussain, Arto Klami, Jussi Kujala, Alex P. Leung, Kitsuchart Pasupa, Peter Auer, Samuel Kaski, Jorma Laaksonen, John Shawe-Taylor

    Abstract: This paper describes PinView, a content-based image retrieval system that exploits implicit relevance feedback collected during a search session. PinView contains several novel methods to infer the intent of the user. From relevance feedback, such as eye movements or pointer clicks, and visual features of images, PinView learns a similarity metric between images which depends on the current intere… ▽ More

    Submitted 2 October, 2014; originally announced October 2014.

    Comments: 12 pages

  19. arXiv:1312.5921  [pdf, other

    stat.ML cs.LG

    Group-sparse Embeddings in Collective Matrix Factorization

    Authors: Arto Klami, Guillaume Bouchard, Abhishek Tripathi

    Abstract: CMF is a technique for simultaneously learning low-rank representations based on a collection of matrices with shared entities. A typical example is the joint modeling of user-item, item-property, and user-feature matrices in a recommender system. The key idea in CMF is that the embeddings are shared across the matrices, which enables transferring information between them. The existing solutions,… ▽ More

    Submitted 18 February, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

    Comments: 9+2 pages, submitted for International Conference on Learning Representations 2014. This version fixes minor typographic mistakes, has one new paragraph on computational efficiency, and describes the algorithm in more detail in the Supplementary material

  20. arXiv:1210.4920  [pdf

    cs.LG cs.IR stat.ML

    Factorized Multi-Modal Topic Model

    Authors: Seppo Virtanen, Yangqing Jia, Arto Klami, Trevor Darrell

    Abstract: Multi-modal data collections, such as corpora of paired images and text snippets, require analysis methods beyond single-view component and topic models. For continuous observations the current dominant approach is based on extensions of canonical correlation analysis, factorizing the variation into components shared by the different modalities and those private to each of them. For count data, mu… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-843-851

  21. arXiv:1203.3489  [pdf

    cs.LG stat.ML

    Bayesian exponential family projections for coupled data sources

    Authors: Arto Klami, Seppo Virtanen, Samuel Kaski

    Abstract: Exponential family extensions of principal component analysis (EPCA) have received a considerable amount of attention in recent years, demonstrating the growing need for basic modeling tools that do not assume the squared loss or Gaussian distribution. We extend the EPCA model toolbox by presenting the first exponential family multi-view learning methods of the partial least squares and canonical… ▽ More

    Submitted 15 March, 2012; originally announced March 2012.

    Comments: Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

    Report number: UAI-P-2010-PG-286-293

  22. arXiv:1110.3204  [pdf, other

    stat.ML

    Bayesian Group Factor Analysis

    Authors: Seppo Virtanen, Arto Klami, Suleiman A. Khan, Samuel Kaski

    Abstract: We introduce a factor analysis model that summarizes the dependencies between observed variable groups, instead of dependencies between individual variables as standard factor analysis does. A group may correspond to one view of the same set of objects, one of many data sets tied by co-occurrence, or a set of alternative variables collected from statistics tables to measure one property of interes… ▽ More

    Submitted 14 October, 2011; originally announced October 2011.

    Comments: 9 pages, 5 figures

    Journal ref: Proceedings of the 15th AISTATS, JMLR W&CP 22: 1269-1277, 2012