Showing 1–50 of 64 results for author: Solin, A

Searching in archive cs.
  1. arXiv:2406.02696  [pdf, other]

    cs.LG

    iQRL -- Implicitly Quantized Representations for Sample-efficient Reinforcement Learning

    Authors: Aidan Scannell, Kalle Kujanpää, Yi Zhao, Mohammadreza Nakhaei, Arno Solin, Joni Pajarinen

    Abstract: Learning representations for reinforcement learning (RL) has shown much promise for continuous control. We propose an efficient representation learning method using only a self-supervised latent-state consistency loss. Our approach employs an encoder and a dynamics model to map observations to latent states and predict future latent states, respectively. We achieve high performance and prevent rep…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 9 pages, 11 figures

  2. arXiv:2406.00561  [pdf, other]

    cs.LG

    Learning to Approximate Particle Smoothing Trajectories via Diffusion Generative Models

    Authors: Ella Tamir, Arno Solin

    Abstract: Learning dynamical systems from sparse observations is critical in numerous fields, including biology, finance, and physics. Even if tackling such problems is standard in general information fusion, it remains challenging for contemporary machine learning models, such as diffusion models. We introduce a method that integrates conditional particle filtering with ancestral sampling and diffusion mod…

    Submitted 1 June, 2024; originally announced June 2024.

  3. arXiv:2405.17889  [pdf, other]

    cs.LG

    Improving Discrete Diffusion Models via Structured Preferential Generation

    Authors: Severi Rissanen, Markus Heinonen, Arno Solin

    Abstract: In the domains of image and audio, diffusion models have shown impressive performance. However, their application to discrete data types, such as language, has often been suboptimal compared to autoregressive generative models. This paper tackles the challenge of improving discrete diffusion models by introducing a structured forward process that leverages the inherent information hierarchy in dis…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 10 pages, 7 figures

  4. arXiv:2405.17656  [pdf, other]

    cs.LG q-bio.QM

    Alignment is Key for Applying Diffusion Models to Retrosynthesis

    Authors: Najwa Laabid, Severi Rissanen, Markus Heinonen, Arno Solin, Vikas Garg

    Abstract: Retrosynthesis, the task of identifying precursors for a given molecule, can be naturally framed as a conditional graph generation task. Diffusion models are a particularly promising modelling approach, enabling post-hoc conditioning and trading off quality for speed during generation. We show mathematically that permutation equivariant denoisers severely limit the expressiveness of graph diffusio…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 28 pages, 9 figures

  5. arXiv:2404.07696  [pdf, other]

    cs.LG cs.CV

    Flatness Improves Backbone Generalisation in Few-shot Classification

    Authors: Rui Li, Martin Trapp, Marcus Klasson, Arno Solin

    Abstract: Deployment of deep neural networks in real-world settings typically requires adaptation to new tasks with few examples. Few-shot classification (FSC) provides a solution to this problem by leveraging pre-trained backbones for fast adaptation to new classes. Surprisingly, most efforts have only focused on developing architectures for easing the adaptation to the target domain without considering th…

    Submitted 11 April, 2024; originally announced April 2024.

  6. arXiv:2403.13327  [pdf, other]

    cs.CV

    Gaussian Splatting on the Move: Blur and Rolling Shutter Compensation for Natural Camera Motion

    Authors: Otto Seiskari, Jerry Ylilammi, Valtteri Kaatrasalo, Pekka Rantalankila, Matias Turkulainen, Juho Kannala, Arno Solin

    Abstract: High-quality scene reconstruction and novel view synthesis based on Gaussian Splatting (3DGS) typically require steady, high-quality photographs, often impractical to capture with handheld cameras. We present a method that adapts to camera motion and allows high-quality scene reconstruction with handheld video data suffering from motion blur and rolling shutter distortion. Our approach is based on…

    Submitted 24 May, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: Source code available at https://github.com/SpectacularAI/3dgs-deblur

  7. arXiv:2403.10929  [pdf, other]

    stat.ML cs.LG

    Function-space Parameterization of Neural Networks for Sequential Learning

    Authors: Aidan Scannell, Riccardo Mereu, Paul Chang, Ella Tamir, Joni Pajarinen, Arno Solin

    Abstract: Sequential learning paradigms pose challenges for gradient-based deep learning due to difficulties incorporating new data and retaining prior knowledge. While Gaussian processes elegantly tackle these problems, they struggle with scalability and handling rich inputs, such as images. To address these issues, we introduce a technique that converts neural networks from weight space to function space,…

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 29 pages, 8 figures, Published in The Twelfth International Conference on Learning Representations

  8. arXiv:2310.00724  [pdf, other]

    cs.LG cs.AI

    Subtractive Mixture Models via Squaring: Representation and Learning

    Authors: Lorenzo Loconte, Aleksanteri M. Sladek, Stefan Mengel, Martin Trapp, Arno Solin, Nicolas Gillis, Antonio Vergari

    Abstract: Mixture models are traditionally represented and learned by adding several distributions as components. Allowing mixtures to subtract probability mass or density can drastically reduce the number of components needed to model complex distributions. However, learning such subtractive mixtures while ensuring they still encode a non-negative function is challenging. We investigate how to learn and pe…

    Submitted 26 April, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

  9. arXiv:2309.15478  [pdf, other]

    cs.CV cs.LG

    The Robust Semantic Segmentation UNCV2023 Challenge Results

    Authors: Xuanlong Yu, Yi Zuo, Zitao Wang, Xiaowen Zhang, Jiaxuan Zhao, Yuting Yang, Licheng Jiao, Rui Peng, Xinyi Wang, Junpei Zhang, Kexin Zhang, Fang Liu, Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Hanlin Tian, Kenta Matsui, Tianhao Wang, Fahmy Adan, Zhitong Gao, Xuming He, Quentin Bouniot, Hossein Moghaddam, Shyam Nandan Rai, Fabio Cermelli, et al. (12 additional authors not shown)

    Abstract: This paper outlines the winning solutions employed in addressing the MUAD uncertainty quantification challenge held at ICCV 2023. The challenge was centered around semantic segmentation in urban environments, with a particular focus on natural adversarial scenarios. The report presents the results of 19 submitted entries, with numerous techniques drawing inspiration from cutting-edge uncertainty q…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 11 pages, 4 figures, accepted at ICCV 2023 UNCV workshop

  10. arXiv:2309.02195  [pdf, ps, other]

    stat.ML cs.LG

    Sparse Function-space Representation of Neural Networks

    Authors: Aidan Scannell, Riccardo Mereu, Paul Chang, Ella Tamir, Joni Pajarinen, Arno Solin

    Abstract: Deep neural networks (NNs) are known to lack uncertainty estimates and struggle to incorporate new data. We present a method that mitigates these issues by converting NNs from weight space to function space, via a dual parameterization. Importantly, the dual parameterization enables us to formulate a sparse representation that captures information from the entire data set. This offers a compact an…

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted to ICML 2023 Workshop on Duality for Modern Machine Learning, Honolulu, Hawaii, USA. 4 pages, 2 figures, 1 table

  11. arXiv:2306.04201  [pdf, other]

    cs.LG stat.ML

    Improving Hyperparameter Learning under Approximate Inference in Gaussian Process Models

    Authors: Rui Li, ST John, Arno Solin

    Abstract: Approximate inference in Gaussian process (GP) models with non-conjugate likelihoods gets entangled with the learning of the model hyperparameters. We improve hyperparameter learning in GP models and focus on the interplay between variational inference (VI) and the learning target. While VI's lower bound to the marginal likelihood is a suitable objective for inferring the approximate posterior, we…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: International Conference on Machine Learning (ICML) 2023

  12. arXiv:2306.03953  [pdf, other]

    cs.RO eess.SP eess.SY

    Rao-Blackwellized Particle Smoothing for Simultaneous Localization and Mapping

    Authors: Manon Kok, Arno Solin, Thomas B. Schön

    Abstract: Simultaneous localization and mapping (SLAM) is the task of building a map representation of an unknown environment while at the same time using it for positioning. A probabilistic interpretation of the SLAM task allows for incorporating prior knowledge and for operation under uncertainty. Contrary to the common practice of computing point estimates of the system states, we capture the full poster…

    Submitted 5 June, 2024; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: 23 pages, 7 figures

    Journal ref: Data-Centric Engineering. 2024;5:e15

  13. arXiv:2306.03566  [pdf, other]

    cs.LG stat.ML

    Memory-Based Dual Gaussian Processes for Sequential Learning

    Authors: Paul E. Chang, Prakhar Verma, S. T. John, Arno Solin, Mohammad Emtiyaz Khan

    Abstract: Sequential learning with Gaussian processes (GPs) is challenging when access to past data is limited, for example, in continual and active learning. In such cases, errors can accumulate over time due to inaccuracies in the posterior, hyperparameters, and inducing points, making accurate learning challenging. Here, we present a method to keep all such errors in check using the recently proposed dua…

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: International Conference on Machine Learning (ICML) 2023

  14. arXiv:2306.02066  [pdf, other]

    cs.LG stat.ML

    Variational Gaussian Process Diffusion Processes

    Authors: Prakhar Verma, Vincent Adam, Arno Solin

    Abstract: Diffusion processes are a class of stochastic differential equations (SDEs) providing a rich family of expressive models that arise naturally in dynamic modelling tasks. Probabilistic inference and learning under generative models with latent processes endowed with a non-linear diffusion process prior are intractable problems. We build upon work within variational inference, approximating the post…

    Submitted 27 February, 2024; v1 submitted 3 June, 2023; originally announced June 2023.

    Comments: International Conference on Artificial Intelligence and Statistics (AISTATS) 2024

  15. arXiv:2304.04307  [pdf, other]

    stat.ML cs.LG

    PriorCVAE: scalable MCMC parameter inference with Bayesian deep generative modelling

    Authors: Elizaveta Semenova, Prakhar Verma, Max Cairney-Leeming, Arno Solin, Samir Bhatt, Seth Flaxman

    Abstract: Recent advances have shown that GP priors, or their finite realisations, can be encoded using deep generative models such as variational autoencoders (VAEs). These learned generators can serve as drop-in replacements for the original priors during MCMC inference. While this approach enables efficient inference, it loses information about the hyperparameters of the original models, and consequently…

    Submitted 10 November, 2023; v1 submitted 9 April, 2023; originally announced April 2023.

  16. arXiv:2302.06359  [pdf, other]

    cs.LG

    Fixing Overconfidence in Dynamic Neural Networks

    Authors: Lassi Meronen, Martin Trapp, Andrea Pilzer, Le Yang, Arno Solin

    Abstract: Dynamic neural networks are a recent technique that promises a remedy for the increasing size of modern deep learning models by dynamically adapting their computational cost to the difficulty of the inputs. In this way, the model can adjust to a limited computational budget. However, the poor quality of uncertainty estimates in deep learning models makes it difficult to distinguish between hard an…

    Submitted 8 December, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: In IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024

  17. arXiv:2301.13636  [pdf, other]

    cs.LG

    Transport with Support: Data-Conditional Diffusion Bridges

    Authors: Ella Tamir, Martin Trapp, Arno Solin

    Abstract: The dynamic Schrödinger bridge problem provides an appealing setting for solving constrained time-series data generation tasks posed as optimal transport problems. It consists of learning non-linear diffusion processes using efficient iterative solvers. Recent works have demonstrated state-of-the-art results (eg. in modelling single-cell embryo RNA sequences or sampling from complex posteriors) bu…

    Submitted 24 November, 2023; v1 submitted 31 January, 2023; originally announced January 2023.

    Comments: 27 pages, 11 figures

  18. arXiv:2212.13381  [pdf, other]

    cs.LG cs.CV

    MixupE: Understanding and Improving Mixup from Directional Derivative Perspective

    Authors: Yingtian Zou, Vikas Verma, Sarthak Mittal, Wai Hoh Tang, Hieu Pham, Juho Kannala, Yoshua Bengio, Arno Solin, Kenji Kawaguchi

    Abstract: Mixup is a popular data augmentation technique for training deep neural networks where additional samples are generated by linearly interpolating pairs of inputs and their labels. This technique is known to improve the generalization performance in many learning paradigms and applications. In this work, we first analyze Mixup and show that it implicitly regularizes infinitely many directional deri…

    Submitted 15 October, 2023; v1 submitted 27 December, 2022; originally announced December 2022.

    Comments: 16 pages, Best Student Paper Award at UAI 2023

  19. arXiv:2211.06260  [pdf, other]

    cs.LG stat.ML

    Towards Improved Learning in Gaussian Processes: The Best of Two Worlds

    Authors: Rui Li, ST John, Arno Solin

    Abstract: Gaussian process training decomposes into inference of the (approximate) posterior and learning of the hyperparameters. For non-Gaussian (non-conjugate) likelihoods, two common choices for approximate inference are Expectation Propagation (EP) and Variational Inference (VI), which have complementary strengths and weaknesses. While VI's lower bound to the marginal likelihood is a suitable objective…

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: In the 2022 NeurIPS Workshop on Gaussian Processes, Spatiotemporal Modeling, and Decision-making Systems

  20. arXiv:2211.01053  [pdf, other]

    cs.LG stat.ML

    Fantasizing with Dual GPs in Bayesian Optimization and Active Learning

    Authors: Paul E. Chang, Prakhar Verma, ST John, Victor Picheny, Henry Moss, Arno Solin

    Abstract: Gaussian processes (GPs) are the main surrogate functions used for sequential modelling such as Bayesian Optimization and Active Learning. Their drawbacks are poor scaling with data and the need to run an optimization loop when using a non-Gaussian likelihood. In this paper, we focus on `fantasizing' batch acquisition functions that need the ability to condition on new fantasized data computationa…

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: In the 2022 NeurIPS Workshop on Gaussian Processes, Spatiotemporal Modeling, and Decision-making Systems

  21. arXiv:2211.00392  [pdf, other]

    cs.CV

    Expansion of Visual Hints for Improved Generalization in Stereo Matching

    Authors: Andrea Pilzer, Yuxin Hou, Niki Loppi, Arno Solin, Juho Kannala

    Abstract: We introduce visual hints expansion for guiding stereo matching to improve generalization. Our work is motivated by the robustness of Visual Inertial Odometry (VIO) in computer vision and robotics, where a sparse and unevenly distributed set of feature points characterizes a scene. To improve stereo matching, we propose to elevate 2D hints to 3D points. These sparse and unevenly distributed 3D vis…

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 2023 IEEE Winter Conference on Applications of Computer Vision (WACV)

  22. arXiv:2208.07591  [pdf, other]

    cs.CV cs.LG

    Uncertainty-guided Source-free Domain Adaptation

    Authors: Subhankar Roy, Martin Trapp, Andrea Pilzer, Juho Kannala, Nicu Sebe, Elisa Ricci, Arno Solin

    Abstract: Source-free domain adaptation (SFDA) aims to adapt a classifier to an unlabelled target data set by only using a pre-trained source model. However, the absence of the source data and the domain shift makes the predictions on the target data unreliable. We propose quantifying the uncertainty in the source model predictions and utilizing it to guide the target adaptation. For this, we construct a pr…

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: ECCV 2022

  23. arXiv:2206.13397  [pdf, other]

    cs.CV cs.LG stat.ML

    Generative Modelling With Inverse Heat Dissipation

    Authors: Severi Rissanen, Markus Heinonen, Arno Solin

    Abstract: While diffusion models have shown great success in image generation, their noise-inverting generative process does not explicitly consider the structure of images, such as their inherent multi-scale nature. Inspired by diffusion models and the empirical success of coarse-to-fine modelling, we propose a new diffusion-like model that generates images through stochastically reversing the heat equatio…

    Submitted 12 April, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

  24. arXiv:2206.08890  [pdf, other]

    cs.LG cs.CV

    Disentangling Model Multiplicity in Deep Learning

    Authors: Ari Heljakka, Martin Trapp, Juho Kannala, Arno Solin

    Abstract: Model multiplicity is a well-known but poorly understood phenomenon that undermines the generalisation guarantees of machine learning models. It appears when two models with similar training-time performance differ in their predictions and real-world performance characteristics. This observed 'predictive' multiplicity (PM) also implies elusive differences in the internals of the models, their 'rep…

    Submitted 31 January, 2023; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: 13 pages, 6 figures

  25. arXiv:2205.13821  [pdf, other]

    cs.RO cs.CV

    A Look at Improving Robustness in Visual-inertial SLAM by Moment Matching

    Authors: Arno Solin, Rui Li, Andrea Pilzer

    Abstract: The fusion of camera sensor and inertial data is a leading method for ego-motion tracking in autonomous and smart devices. State estimation techniques that rely on non-linear filtering are a strong paradigm for solving the associated information fusion task. The de facto inference method in this space is the celebrated extended Kalman filter (EKF), which relies on first-order linearizations of bot…

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: 8 pages, to appear in Proceedings of FUSION 2022

  26. arXiv:2111.08524  [pdf, other]

    cs.LG stat.ML

    Non-separable Spatio-temporal Graph Kernels via SPDEs

    Authors: Alexander Nikitin, ST John, Arno Solin, Samuel Kaski

    Abstract: Gaussian processes (GPs) provide a principled and direct approach for inference and learning on graphs. However, the lack of justified graph kernels for spatio-temporal modelling has held back their use in graph problems. We leverage an explicit link between stochastic partial differential equations (SPDEs) and GPs on graphs, introduce a framework for deriving graph kernels via SPDEs, and derive n…

    Submitted 22 March, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

  27. arXiv:2111.03412  [pdf, other]

    cs.LG stat.ML

    Dual Parameterization of Sparse Variational Gaussian Processes

    Authors: Vincent Adam, Paul E. Chang, Mohammad Emtiyaz Khan, Arno Solin

    Abstract: Sparse variational Gaussian process (SVGP) methods are a common choice for non-conjugate Gaussian process inference because of their computational benefits. In this paper, we improve their computational efficiency by using a dual parameterization where each data example is assigned dual parameters, similarly to site parameters used in expectation propagation. Our dual parameterization speeds-up in…

    Submitted 19 January, 2022; v1 submitted 5 November, 2021; originally announced November 2021.

    Comments: Advances in Neural Information Processing Systems (NeurIPS 2021)

  28. arXiv:2111.01732  [pdf, other]

    cs.LG stat.ML

    Spatio-Temporal Variational Gaussian Processes

    Authors: Oliver Hamelijnck, William J. Wilkinson, Niki A. Loppi, Arno Solin, Theodoros Damoulas

    Abstract: We introduce a scalable approach to Gaussian process inference that combines spatio-temporal filtering with natural gradient variational inference, resulting in a non-conjugate GP method for multivariate data that scales linearly with respect to time. Our natural gradient approach enables application of parallel filtering and smoothing, further reducing the temporal span complexity to be logarithm…

    Submitted 2 November, 2021; originally announced November 2021.

  29. arXiv:2111.01721  [pdf, ps, other]

    stat.ML cs.LG

    Bayes-Newton Methods for Approximate Bayesian Inference with PSD Guarantees

    Authors: William J. Wilkinson, Simo Särkkä, Arno Solin

    Abstract: We formulate natural gradient variational inference (VI), expectation propagation (EP), and posterior linearisation (PL) as extensions of Newton's method for optimising the parameters of a Bayesian posterior distribution. This viewpoint explicitly casts inference algorithms under the framework of numerical optimisation. We show that common approximations to Newton's method from the optimisation li…

    Submitted 6 December, 2022; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: Code for methods and experiments: https://github.com/AaltoML/BayesNewton

  30. arXiv:2110.15739  [pdf, other]

    cs.LG stat.ML

    Scalable Inference in SDEs by Direct Matching of the Fokker-Planck-Kolmogorov Equation

    Authors: Arno Solin, Ella Tamir, Prakhar Verma

    Abstract: Simulation-based techniques such as variants of stochastic Runge-Kutta are the de facto approach for inference with stochastic differential equations (SDEs) in machine learning. These methods are general-purpose and used with parametric and non-parametric models, and neural SDEs. Stochastic Runge-Kutta relies on the use of sampling schemes that can be inefficient in high dimensions. We address thi…

    Submitted 29 October, 2021; originally announced October 2021.

    Comments: To appear in Advances in Neural Information Processing Systems (NeurIPS 2021)

  31. arXiv:2110.13572  [pdf, other]

    cs.LG stat.ML

    Periodic Activation Functions Induce Stationarity

    Authors: Lassi Meronen, Martin Trapp, Arno Solin

    Abstract: Neural network models are known to reinforce hidden data biases, making them unreliable and difficult to interpret. We seek to build models that `know what they do not know' by introducing inductive biases in the function space. We show that periodic activation functions in Bayesian neural networks establish a connection between the prior on the network weights and translation-invariant, stationar…

    Submitted 20 December, 2021; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: Appeared in Advances in Neural Information Processing Systems (NeurIPS 2021)

  32. HybVIO: Pushing the Limits of Real-time Visual-inertial Odometry

    Authors: Otto Seiskari, Pekka Rantalankila, Juho Kannala, Jerry Ylilammi, Esa Rahtu, Arno Solin

    Abstract: We present HybVIO, a novel hybrid approach for combining filtering-based visual-inertial odometry (VIO) with optimization-based SLAM. The core of our method is highly robust, independent VIO with improved IMU bias modeling, outlier rejection, stationarity detection, and feature track selection, which is adjustable to run on embedded hardware. Long-term consistency is achieved with a loosely-couple…

    Submitted 25 November, 2021; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: 2022 IEEE Winter Conference on Applications of Computer Vision (WACV)

  33. arXiv:2106.10210  [pdf, other]

    cs.LG stat.ML

    Combining Pseudo-Point and State Space Approximations for Sum-Separable Gaussian Processes

    Authors: Will Tebbutt, Arno Solin, Richard E. Turner

    Abstract: Gaussian processes (GPs) are important probabilistic tools for inference and learning in spatio-temporal modelling problems such as those in climate science and epidemiology. However, existing GP approximations do not simultaneously support large numbers of off-the-grid spatial data-points and long time-series which is a hallmark of many applications. Pseudo-point approximations, one of the gold…

    Submitted 18 June, 2021; originally announced June 2021.

  34. arXiv:2103.10710  [pdf, other]

    stat.ML cs.LG

    Sparse Algorithms for Markovian Gaussian Processes

    Authors: William J. Wilkinson, Arno Solin, Vincent Adam

    Abstract: Approximate Bayesian inference methods that scale to very large datasets are crucial in leveraging probabilistic models for real-world time series. Sparse Markovian Gaussian processes combine the use of inducing variables with efficient Kalman filter-like recursions, resulting in algorithms whose computational and memory requirements scale linearly in the number of inducing points, whilst also ena…

    Submitted 9 June, 2021; v1 submitted 19 March, 2021; originally announced March 2021.

    Comments: Appearing in the 24th International Conference on Artificial Intelligence and Statistics (AISTATS) 2021

  35. arXiv:2101.01619  [pdf, other]

    cs.CV

    Novel View Synthesis via Depth-guided Skip Connections

    Authors: Yuxin Hou, Arno Solin, Juho Kannala

    Abstract: We introduce a principled approach for synthesizing new views of a scene given a single source image. Previous methods for novel view synthesis can be divided into image-based rendering methods (e.g. flow prediction) or pixel generation methods. Flow predictions enable the target view to re-use pixels directly, but can easily lead to distorted results. Directly regressing pixels can produce struct…

    Submitted 5 January, 2021; originally announced January 2021.

  36. arXiv:2011.03085  [pdf, other]

    cs.RO cs.AI

    RealAnt: An Open-Source Low-Cost Quadruped for Education and Research in Real-World Reinforcement Learning

    Authors: Rinu Boney, Jussi Sainio, Mikko Kaivola, Arno Solin, Juho Kannala

    Abstract: Current robot platforms available for research are either very expensive or unable to handle the abuse of exploratory controls in reinforcement learning. We develop RealAnt, a minimal low-cost physical version of the popular `Ant' benchmark used in reinforcement learning. RealAnt costs only $\sim$350 EUR (\$410) in materials and can be assembled in less than an hour. We validate the platform with…

    Submitted 4 June, 2022; v1 submitted 5 November, 2020; originally announced November 2020.

  37. arXiv:2010.09494  [pdf, other]

    cs.LG

    Stationary Activations for Uncertainty Calibration in Deep Learning

    Authors: Lassi Meronen, Christabella Irwanto, Arno Solin

    Abstract: We introduce a new family of non-linear neural network activation functions that mimic the properties induced by the widely-used Matérn family of kernels in Gaussian process (GP) models. This class spans a range of locally stationary models of various degrees of mean-square differentiability. We show an explicit link to the corresponding GP models in the case that the network consists of one infin…

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: To appear in Advances in Neural Information Processing Systems (NeurIPS 2020)

  38. arXiv:2010.09105  [pdf, other]

    cs.CV

    Movement-induced Priors for Deep Stereo

    Authors: Yuxin Hou, Muhammad Kamran Janjua, Juho Kannala, Arno Solin

    Abstract: We propose a method for fusing stereo disparity estimation with movement-induced prior information. Instead of independent inference frame-by-frame, we formulate the problem as a non-parametric learning task in terms of a temporal Gaussian process prior with a movement-driven kernel for inter-frame reasoning. We present a hierarchy of three Gaussian process kernels depending on the availability of…

    Submitted 18 October, 2020; originally announced October 2020.

  39. arXiv:2007.05994  [pdf, other]

    stat.ML cs.LG

    State Space Expectation Propagation: Efficient Inference Schemes for Temporal Gaussian Processes

    Authors: William J. Wilkinson, Paul E. Chang, Michael Riis Andersen, Arno Solin

    Abstract: We formulate approximate Bayesian inference in non-conjugate temporal and spatio-temporal Gaussian process models as a simple parameter update rule applied during Kalman smoothing. This viewpoint encompasses most inference schemes, including expectation propagation (EP), the classical (Extended, Unscented, etc.) Kalman smoothers, and variational inference. We provide a unifying perspective on thes…

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: Accepted to International Conference on Machine Learning (ICML) 2020

  40. arXiv:2007.04731  [pdf, other]

    cs.LG stat.ML

    Fast Variational Learning in State-Space Gaussian Process Models

    Authors: Paul E. Chang, William J. Wilkinson, Mohammad Emtiyaz Khan, Arno Solin

    Abstract: Gaussian process (GP) regression with 1D inputs can often be performed in linear time via a stochastic differential equation formulation. However, for non-Gaussian likelihoods, this requires application of approximate inference methods which can make the implementation difficult, e.g., expectation propagation can be numerically unstable and variational inference can be computationally inefficient.…

    Submitted 17 July, 2020; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: To appear in MLSP 2020

  41. arXiv:2006.13856  [pdf, other]

    cs.CV cs.LG

    Movement Tracking by Optical Flow Assisted Inertial Navigation

    Authors: Lassi Meronen, William J. Wilkinson, Arno Solin

    Abstract: Robust and accurate six degree-of-freedom tracking on portable devices remains a challenging problem, especially on small hand-held devices such as smartphones. For improved robustness and accuracy, complementary movement information from an IMU and a camera is often fused. Conventional visual-inertial methods fuse information from IMUs with a sparse cloud of feature points tracked by the device c…

    Submitted 24 June, 2020; originally announced June 2020.

  42. arXiv:2006.12063  [pdf, other]

    cs.LG stat.ML

    Deep Residual Mixture Models

    Authors: Perttu Hämäläinen, Martin Trapp, Tuure Saloheimo, Arno Solin

    Abstract: We propose Deep Residual Mixture Models (DRMMs), a novel deep generative model architecture. Compared to other deep models, DRMMs allow more flexible conditional sampling: The model can be trained once with all variables, and then used for sampling with arbitrary combinations of conditioning variables, Gaussian priors, and (in)equality constraints. This provides new opportunities for interactive a…

    Submitted 21 July, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: Code and examples can be found at https://github.com/PerttuHamalainen/DRMM

  43. arXiv:1912.10321  [pdf, other]

    cs.LG cs.CV stat.ML

    Deep Automodulators

    Authors: Ari Heljakka, Yuxin Hou, Juho Kannala, Arno Solin

    Abstract: We introduce a new category of generative autoencoders called automodulators. These networks can faithfully reproduce individual real-world input images like regular autoencoders, but also generate a fused sample from an arbitrary combination of several such images, allowing instantaneous 'style-mixing' and other new applications. An automodulator decouples the data flow of decoder operations from…

    Submitted 29 October, 2020; v1 submitted 21 December, 2019; originally announced December 2019.

    Comments: To appear in Advances in Neural Information Processing Systems (NeurIPS 2020)

  44. arXiv:1912.03249  [pdf, other]

    stat.ML cs.CV cs.LG

    Gaussian Process Priors for View-Aware Inference

    Authors: Yuxin Hou, Ari Heljakka, Arno Solin

    Abstract: While frame-independent predictions with deep neural networks have become the prominent solutions to many computer vision tasks, the potential benefits of utilizing correlations between frames have received less attention. Even though probabilistic machine learning provides the ability to encode correlation as prior knowledge for inference, there is a tangible gap between the theory and practice o…

    Submitted 3 March, 2021; v1 submitted 6 December, 2019; originally announced December 2019.

    Comments: Appearing in AAAI 2021

  45. arXiv:1911.06287  [pdf, other]

    stat.ML cs.LG

    Scalable Exact Inference in Multi-Output Gaussian Processes

    Authors: Wessel P. Bruinsma, Eric Perim, Will Tebbutt, J. Scott Hosking, Arno Solin, Richard E. Turner

    Abstract: Multi-output Gaussian processes (MOGPs) leverage the flexibility and interpretability of GPs while capturing structure across outputs, which is desirable, for example, in spatio-temporal modelling. The key problem with MOGPs is their computational scaling $O(n^3 p^3)$, which is cubic in the number of both inputs $n$ (e.g., time points or locations) and outputs $p$. For this reason, a popular class…

    Submitted 17 July, 2020; v1 submitted 14 November, 2019; originally announced November 2019.

    Comments: 31 pages, 12 figures, 5 tables, includes appendix; to appear in ICML 2020

  46. arXiv:1906.00360  [pdf, other]

    cs.CV

    Iterative Path Reconstruction for Large-Scale Inertial Navigation on Smartphones

    Authors: Santiago Cortés Reina, Yuxin Hou, Juho Kannala, Arno Solin

    Abstract: Modern smartphones have all the sensing capabilities required for accurate and robust navigation and tracking. In specific environments some data streams may be absent, less reliable, or flat out wrong. In particular, the GNSS signal can become flawed or silent inside buildings or in streets with tall buildings. In this application paper, we aim to advance the current state-of-the-art in motion es…

    Submitted 2 June, 2019; originally announced June 2019.

    Comments: To appear in Proceedings FUSION 2019

  47. arXiv:1904.06397  [pdf, other]

    cs.CV

    Multi-View Stereo by Temporal Nonparametric Fusion

    Authors: Yuxin Hou, Juho Kannala, Arno Solin

    Abstract: We propose a novel idea for depth estimation from multi-view image-pose pairs, where the model has capability to leverage information from previous latent-space encodings of the scene. This model uses pairs of images and poses, which are passed through an encoder--decoder model for disparity estimation. The novelty lies in soft-constraining the bottleneck layer by a nonparametric Gaussian process…

    Submitted 16 August, 2019; v1 submitted 12 April, 2019; originally announced April 2019.

    Comments: ICCV 2019

  48. arXiv:1904.06145  [pdf, other]

    cs.LG cs.CV stat.ML

    Towards Photographic Image Manipulation with Balanced Growing of Generative Autoencoders

    Authors: Ari Heljakka, Arno Solin, Juho Kannala

    Abstract: We present a generative autoencoder that provides fast encoding, faithful reconstructions (e.g., retaining the identity of a face), sharp generated/reconstructed samples in high resolutions, and a well-structured latent space that supports semantic manipulation of the inputs. There are no current autoencoder or GAN models that satisfactorily achieve all of these. We build on the progressively growin…

    Submitted 20 February, 2020; v1 submitted 12 April, 2019; originally announced April 2019.

    Comments: WACV 2020

  49. arXiv:1904.05207  [pdf, other]

    stat.ML cs.LG

    Know Your Boundaries: Constraining Gaussian Processes by Variational Harmonic Features

    Authors: Arno Solin, Manon Kok

    Abstract: Gaussian processes (GPs) provide a powerful framework for extrapolation, interpolation, and noise removal in regression and classification. This paper considers constraining GPs to arbitrarily-shaped domains with boundary conditions. We solve a Fourier-like generalised harmonic feature representation of the GP prior in the domain of interest, which both constrains the GP and attains a low-rank rep…

    Submitted 10 April, 2019; originally announced April 2019.

    Comments: Appearing in Proceedings of AISTATS 2019

  50. arXiv:1903.03825  [pdf]

    stat.ML cs.AI cs.LG

    Interpolation Consistency Training for Semi-Supervised Learning

    Authors: Vikas Verma, Kenji Kawaguchi, Alex Lamb, Juho Kannala, Arno Solin, Yoshua Bengio, David Lopez-Paz

    Abstract: We introduce Interpolation Consistency Training (ICT), a simple and computation efficient algorithm for training Deep Neural Networks in the semi-supervised learning paradigm. ICT encourages the prediction at an interpolation of unlabeled points to be consistent with the interpolation of the predictions at those points. In classification problems, ICT moves the decision boundary to low-density reg…

    Submitted 19 October, 2022; v1 submitted 9 March, 2019; originally announced March 2019.

    Comments: This is the latest version, which is published in the Journal, "Neural Networks", in 2022. All the previous results are unchanged. Keyword: Deep Learning, Semi-supervised Learning, Mixup

    Journal ref: Neural Networks, volume 145, pages 90-106 (2022)