Skip to main content

Showing 1–11 of 11 results for author: Mujika, A

.
  1. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  2. arXiv:2306.01070  [pdf, other

    cs.LG

    Hierarchical Attention Encoder Decoder

    Authors: Asier Mujika

    Abstract: Recent advances in large language models have shown that autoregressive modeling can generate complex and novel sequences that have many real-world applications. However, these models must generate outputs autoregressively, which becomes time-consuming when dealing with long sequences. Hierarchical autoregressive approaches that compress data have been proposed as a solution, but these methods sti… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  3. arXiv:2305.15947  [pdf, other

    cs.LG cs.NE

    Online learning of long-range dependencies

    Authors: Nicolas Zucchet, Robert Meier, Simon Schug, Asier Mujika, João Sacramento

    Abstract: Online learning holds the promise of enabling efficient long-term credit assignment in recurrent neural networks. However, current algorithms fall short of offline backpropagation by either not being scalable or failing to learn long-range dependencies. Here we present a high-performance online learning algorithm that merely doubles the memory and computational requirements of a single inference p… ▽ More

    Submitted 6 November, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted at NeurIPS 2023

  4. arXiv:2202.08266  [pdf, other

    cs.LG cs.AI cs.RO

    Open-Ended Reinforcement Learning with Neural Reward Functions

    Authors: Robert Meier, Asier Mujika

    Abstract: Inspired by the great success of unsupervised learning in Computer Vision and Natural Language Processing, the Reinforcement Learning community has recently started to focus more on unsupervised discovery of skills. Most current approaches, like DIAYN or DADS, optimize some form of mutual information objective. We propose a different approach that uses reward functions encoded by neural networks.… ▽ More

    Submitted 12 October, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

  5. arXiv:1910.05268  [pdf, other

    cs.NE cs.LG

    Improving Gradient Estimation in Evolutionary Strategies With Past Descent Directions

    Authors: Florian Meier, Asier Mujika, Marcelo Matheus Gauy, Angelika Steger

    Abstract: Evolutionary Strategies (ES) are known to be an effective black-box optimization technique for deep neural networks when the true gradients cannot be computed, such as in Reinforcement Learning. We continue a recent line of research that uses surrogate gradients to improve the gradient estimation of ES. We propose a novel method to optimally incorporate surrogate gradient information. Our approach… ▽ More

    Submitted 11 October, 2019; originally announced October 2019.

  6. arXiv:1910.05245  [pdf, other

    cs.LG cs.NE stat.ML

    Decoupling Hierarchical Recurrent Neural Networks With Locally Computable Losses

    Authors: Asier Mujika, Felix Weissenberger, Angelika Steger

    Abstract: Learning long-term dependencies is a key long-standing challenge of recurrent neural networks (RNNs). Hierarchical recurrent neural networks (HRNNs) have been considered a promising approach as long-term dependencies are resolved through shortcuts up and down the hierarchy. Yet, the memory requirements of Truncated Backpropagation Through Time (TBPTT) still prevent training them on very long seque… ▽ More

    Submitted 11 October, 2019; originally announced October 2019.

  7. arXiv:1902.03993  [pdf, other

    cs.LG stat.ML

    Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning

    Authors: Frederik Benzing, Marcelo Matheus Gauy, Asier Mujika, Anders Martinsson, Angelika Steger

    Abstract: One of the central goals of Recurrent Neural Networks (RNNs) is to learn long-term dependencies in sequential data. Nevertheless, the most popular training method, Truncated Backpropagation through Time (TBPTT), categorically forbids learning dependencies beyond the truncation horizon. In contrast, the online training algorithm Real Time Recurrent Learning (RTRL) provides untruncated gradients, wi… ▽ More

    Submitted 17 May, 2019; v1 submitted 11 February, 2019; originally announced February 2019.

    Comments: ICML 2019 camera ready version; new version includes additional plots in the appendix

  8. arXiv:1808.05566  [pdf, ps, other

    cs.NE cs.DS

    The linear hidden subset problem for the (1+1) EA with scheduled and adaptive mutation rates

    Authors: Hafsteinn Einarsson, Marcelo Matheus Gauy, Johannes Lengler, Florian Meier, Asier Mujika, Angelika Steger, Felix Weissenberger

    Abstract: We study unbiased $(1+1)$ evolutionary algorithms on linear functions with an unknown number $n$ of bits with non-zero weight. Static algorithms achieve an optimal runtime of $O(n (\ln n)^{2+ε})$, however, it remained unclear whether more dynamic parameter policies could yield better runtime guarantees. We consider two setups: one where the mutation rate follows a fixed schedule, and one where it… ▽ More

    Submitted 16 August, 2018; originally announced August 2018.

  9. arXiv:1805.10842  [pdf, other

    cs.LG stat.ML

    Approximating Real-Time Recurrent Learning with Random Kronecker Factors

    Authors: Asier Mujika, Florian Meier, Angelika Steger

    Abstract: Despite all the impressive advances of recurrent neural networks, sequential data is still in need of better modelling. Truncated backpropagation through time (TBPTT), the learning algorithm most widely used in practice, suffers from the truncation bias, which drastically limits its ability to learn long-term dependencies.The Real-Time Recurrent Learning algorithm (RTRL) addresses this issue, but… ▽ More

    Submitted 5 December, 2018; v1 submitted 28 May, 2018; originally announced May 2018.

  10. arXiv:1705.08639  [pdf, ps, other

    cs.NE

    Fast-Slow Recurrent Neural Networks

    Authors: Asier Mujika, Florian Meier, Angelika Steger

    Abstract: Processing sequential data of variable length is a major challenge in a wide range of applications, such as speech recognition, language modeling, generative image modeling and machine translation. Here, we address this challenge by proposing a novel recurrent neural network (RNN) architecture, the Fast-Slow RNN (FS-RNN). The FS-RNN incorporates the strengths of both multiscale RNNs and deep trans… ▽ More

    Submitted 9 June, 2017; v1 submitted 24 May, 2017; originally announced May 2017.

    Comments: Corrected minor typos in Figure 1 and Zoneout citation

  11. arXiv:1611.01457  [pdf, other

    cs.LG

    Multi-task learning with deep model based reinforcement learning

    Authors: Asier Mujika

    Abstract: In recent years, model-free methods that use deep learning have achieved great success in many different reinforcement learning environments. Most successful approaches focus on solving a single task, while multi-task reinforcement learning remains an open problem. In this paper, we present a model based approach to deep reinforcement learning which we use to solve different tasks simultaneously.… ▽ More

    Submitted 23 May, 2017; v1 submitted 4 November, 2016; originally announced November 2016.