Showing 1–2 of 2 results for author: Ramos-López, D
-
Bayesian Models of Data Streams with Hierarchical Power Priors
Authors:
Andres Masegosa, Thomas D. Nielsen, Helge Langseth, Dario Ramos-Lopez, Antonio Salmeron, Anders L. Madsen
Abstract:
Making inferences from data streams is a pervasive problem in many modern data analysis applications. However, it requires addressing the problem of continuous model updating and adapting to changes or drifts in the underlying data-generating distribution. In this paper, we approach these problems from a Bayesian perspective covering general conjugate exponential models. Our proposal makes use of non-conjugate hierarchical priors to explicitly model temporal changes of the model parameters. We also derive a novel variational inference scheme that overcomes the difficulties introduced by the non-conjugate priors while maintaining the computational efficiency of variational methods over conjugate models. The approach is validated on three real data sets over three latent variable models.
Submitted 7 July, 2017; originally announced July 2017.
-
AMIDST: a Java Toolbox for Scalable Probabilistic Machine Learning
Authors:
Andrés R. Masegosa, Ana M. Martínez, Darío Ramos-López, Rafael Cabañas, Antonio Salmerón, Thomas D. Nielsen, Helge Langseth, Anders L. Madsen
Abstract:
The AMIDST Toolbox is a software package for scalable probabilistic machine learning with a special focus on (massive) streaming data. The toolbox supports a flexible modeling language based on probabilistic graphical models with latent variables and temporal dependencies. The specified models can be learnt from large data sets using parallel or distributed implementations of Bayesian learning algorithms for either streaming or batch data. These algorithms are based on a flexible variational message passing scheme, which supports discrete and continuous variables from a wide range of probability distributions. AMIDST also leverages existing functionality and algorithms by interfacing to software tools such as Flink, Spark, MOA, Weka, R and HUGIN. AMIDST is an open source toolbox written in Java and available at http://www.amidsttoolbox.com under the Apache Software License version 2.0.
Submitted 4 April, 2017; originally announced April 2017.