-
arXiv:2112.11022
Synthetic Data and Simulators for Recommendation Systems: Current State and Future Directions
Abstract: Synthetic data and simulators have the potential to markedly improve the performance and robustness of recommendation systems. These approaches have already had a beneficial impact in other machine-learning driven fields. We identify and discuss a key trade-off between data fidelity and privacy in the past work on synthetic data and simulators for recommendation systems. For the important use case…
Submitted 21 December, 2021; originally announced December 2021.
Comments: 7 pages, included in SimuRec 2021: Workshop on Simulation Methods for Recommender Systems at ACM RecSys 2021, October 2nd, 2021, Amsterdam, NL and online
-
Unsupervised Distribution Learning for Lunar Surface Anomaly Detection
Abstract: In this work we show that modern data-driven machine learning techniques can be successfully applied on lunar surface remote sensing data to learn, in an unsupervised way, sufficiently good representations of the data distribution to enable lunar technosignature and anomaly detection. In particular we train an unsupervised distribution learning neural network model to find the Apollo 15 landing mo…
Submitted 14 January, 2020; originally announced January 2020.
Comments: Second Workshop on Machine Learning and the Physical Sciences, NeurIPS 2019. Five pages, three figures
ACM Class: I.2.1; I.4.9; I.2.10; J.2
-
The Relevance of Bayesian Layer Positioning to Model Uncertainty in Deep Bayesian Active Learning
Abstract: One of the main challenges of deep learning tools is their inability to capture model uncertainty. While Bayesian deep learning can be used to tackle the problem, Bayesian neural networks often require more time and computational power to train than deterministic networks. Our work explores whether fully Bayesian networks are needed to successfully capture model uncertainty. We vary the number and…
Submitted 29 November, 2018; originally announced November 2018.
Journal ref: Third workshop on Bayesian Deep Learning (NeurIPS 2018)
-
Large-Scale Visual Active Learning with Deep Probabilistic Ensembles
Abstract: Annotating the right data for training deep neural networks is an important challenge. Active learning using uncertainty estimates from Bayesian Neural Networks (BNNs) could provide an effective solution to this. Despite being theoretically principled, BNNs require approximations to be applied to large-scale problems, where both performance and uncertainty estimation are crucial. In this paper, we…
Submitted 20 February, 2019; v1 submitted 8 November, 2018; originally announced November 2018.
Comments: arXiv admin note: text overlap with arXiv:1811.02640
-
arXiv:1811.02640
Deep Probabilistic Ensembles: Approximate Variational Inference through KL Regularization
Abstract: In this paper, we introduce Deep Probabilistic Ensembles (DPEs), a scalable technique that uses a regularized ensemble to approximate a deep Bayesian Neural Network (BNN). We do so by incorporating a KL divergence penalty term into the training objective of an ensemble, derived from the evidence lower bound used in variational inference. We evaluate the uncertainty estimates obtained from our mode…
Submitted 30 November, 2018; v1 submitted 6 November, 2018; originally announced November 2018.
Comments: Workshop on Bayesian Deep Learning (NeurIPS 2018)
-
How Much Did it Rain? Predicting Real Rainfall Totals Based on Radar Data
Abstract: We applied a variety of parametric and non-parametric machine learning models to predict the probability distribution of rainfall based on 1M training examples over a single year across several U.S. states. Our top performing model based on a squared loss objective was a cross-validated parametric k-nearest-neighbor predictor that took about six days to compute, and was competitive in a world-wide…
Submitted 6 August, 2016; originally announced August 2016.