
Showing 1–4 of 4 results for author: Hotti, A

Searching in archive cs.

  1. arXiv:2406.07083  [pdf, other]

    cs.LG stat.ML

    Efficient Mixture Learning in Black-Box Variational Inference

    Authors: Alexandra Hotti, Oskar Kviman, Ricky Molén, Víctor Elvira, Jens Lagergren

    Abstract: Mixture variational distributions in black box variational inference (BBVI) have demonstrated impressive results in challenging density estimation tasks. However, currently scaling the number of mixture components can lead to a linear increase in the number of learnable parameters and a quadratic increase in inference time due to the evaluation of the evidence lower bound (ELBO). Our two key contr…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: In Proceedings of the 41st International Conference on Machine Learning (ICML), Vienna, Austria

  2. arXiv:2403.00563  [pdf, other]

    cs.LG stat.ML

    Indirectly Parameterized Concrete Autoencoders

    Authors: Alfred Nilsson, Klas Wijk, Sai bharath chandra Gutha, Erik Englesson, Alexandra Hotti, Carlo Saccardi, Oskar Kviman, Jens Lagergren, Ricardo Vinuesa, Hossein Azizpour

    Abstract: Feature selection is a crucial task in settings where data is high-dimensional or acquiring the full set of features is costly. Recent developments in neural network-based embedded feature selection show promising results across a wide range of applications. Concrete Autoencoders (CAEs), considered state-of-the-art in embedded feature selection, may struggle to achieve stable joint optimization, h…

    Submitted 1 March, 2024; originally announced March 2024.

  3. arXiv:2209.15514  [pdf, other]

    cs.LG stat.ML

    Cooperation in the Latent Space: The Benefits of Adding Mixture Components in Variational Autoencoders

    Authors: Oskar Kviman, Ricky Molén, Alexandra Hotti, Semih Kurt, Víctor Elvira, Jens Lagergren

    Abstract: In this paper, we show how the mixture components cooperate when they jointly adapt to maximize the ELBO. We build upon recent advances in the multiple and adaptive importance sampling literature. We then model the mixture components using separate encoder networks and show empirically that the ELBO is monotonically non-decreasing as a function of the number of mixture components. These results ho…

    Submitted 14 July, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: Updated to the accepted ICML23 version: there is a new title (previously Learning with MISELBO: The Mixture Cookbook), more experiments, and clarifying text

  4. arXiv:2111.02168  [pdf, other]

    cs.LG cs.CL cs.CV cs.HC cs.IR

    The Klarna Product Page Dataset: Web Element Nomination with Graph Neural Networks and Large Language Models

    Authors: Alexandra Hotti, Riccardo Sven Risuleo, Stefan Magureanu, Aref Moradi, Jens Lagergren

    Abstract: Web automation holds the potential to revolutionize how users interact with the digital world, offering unparalleled assistance and simplifying tasks via sophisticated computational methods. Central to this evolution is the web element nomination task, which entails identifying unique elements on webpages. Unfortunately, the development of algorithmic designs for web automation is hampered by the…

    Submitted 23 February, 2024; v1 submitted 3 November, 2021; originally announced November 2021.

    Comments: 12 pages, 8 figures, 3 tables, under review

    MSC Class: 68T07