Skip to main content

Showing 1–16 of 16 results for author: Bošnjak, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  2. arXiv:2401.09865  [pdf, other

    cs.CV cs.AI cs.LG

    Improving fine-grained understanding in image-text pre-training

    Authors: Ioana Bica, Anastasija Ilić, Matthias Bauer, Goker Erdogan, Matko Bošnjak, Christos Kaplanis, Alexey A. Gritsenko, Matthias Minderer, Charles Blundell, Razvan Pascanu, Jovana Mitrović

    Abstract: We introduce SPARse Fine-grained Contrastive Alignment (SPARC), a simple method for pretraining more fine-grained multimodal representations from image-text pairs. Given that multiple image patches often correspond to single words, we propose to learn a grou** of image patches for every token in the caption. To achieve this, we use a sparse similarity metric between image patches and language to… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 26 pages

  3. arXiv:2301.05158  [pdf, other

    cs.CV cs.AI cs.LG

    SemPPL: Predicting pseudo-labels for better contrastive representations

    Authors: Matko Bošnjak, Pierre H. Richemond, Nenad Tomasev, Florian Strub, Jacob C. Walker, Felix Hill, Lars Holger Buesing, Razvan Pascanu, Charles Blundell, Jovana Mitrovic

    Abstract: Learning from large amounts of unsupervised data and a small amount of supervision is an important open problem in computer vision. We propose a new semi-supervised learning method, Semantic Positives via Pseudo-Labels (SemPPL), that combines labelled and unlabelled data to learn informative representations. Our method extends self-supervised contrastive learning -- where representations are shape… ▽ More

    Submitted 10 January, 2024; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: Published as a conference paper at ICLR 2023. For checkpoints and source code see https://github.com/google-deepmind/semppl

  4. arXiv:2209.11142  [pdf, other

    cs.LG cs.AI stat.ML

    A Generalist Neural Algorithmic Learner

    Authors: Borja Ibarz, Vitaly Kurin, George Papamakarios, Kyriacos Nikiforou, Mehdi Bennani, Róbert Csordás, Andrew Dudzik, Matko Bošnjak, Alex Vitvitskyi, Yulia Rubanova, Andreea Deac, Beatrice Bevilacqua, Yaroslav Ganin, Charles Blundell, Petar Veličković

    Abstract: The cornerstone of neural algorithmic reasoning is the ability to solve algorithmic tasks, especially in a way that generalises out of distribution. While recent years have seen a surge in methodological improvements in this area, they mostly focused on building specialist models. Specialist models are capable of learning to neurally execute either only one algorithm or a collection of algorithms… ▽ More

    Submitted 3 December, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: To appear at LoG 2022 (Spotlight talk). 23 pages, 11 figures

  5. arXiv:2107.08881  [pdf, other

    cs.LG cs.AI stat.ML

    Reasoning-Modulated Representations

    Authors: Petar Veličković, Matko Bošnjak, Thomas Kipf, Alexander Lerchner, Raia Hadsell, Razvan Pascanu, Charles Blundell

    Abstract: Neural networks leverage robust internal representations in order to generalise. Learning them is difficult, and often requires a large training set that covers the data distribution densely. We study a common setting where our task is not purely opaque. Indeed, very often we may have access to information about the underlying system (e.g. that observations must obey certain laws of physics) that… ▽ More

    Submitted 3 December, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: To appear at LoG 2022. 17 pages, 5 figures

  6. arXiv:2101.05125  [pdf, other

    cs.AI

    Formalising Concepts as Grounded Abstractions

    Authors: Stephen Clark, Alexander Lerchner, Tamara von Glehn, Olivier Tieleman, Richard Tanburn, Misha Dashevskiy, Matko Bosnjak

    Abstract: The notion of concept has been studied for centuries, by philosophers, linguists, cognitive scientists, and researchers in artificial intelligence (Margolis & Laurence, 1999). There is a large literature on formal, mathematical models of concepts, including a whole sub-field of AI -- Formal Concept Analysis -- devoted to this topic (Ganter & Obiedkov, 2016). Recently, researchers in machine learni… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

  7. arXiv:2004.04460  [pdf, other

    cs.CL cs.CY cs.SI

    PANDORA Talks: Personality and Demographics on Reddit

    Authors: Matej Gjurković, Mladen Karan, Iva Vukojević, Mihaela Bošnjak, Jan Šnajder

    Abstract: Personality and demographics are important variables in social sciences, while in NLP they can aid in interpretability and removal of societal biases. However, datasets with both personality and demographic labels are scarce. To address this, we present PANDORA, the first large-scale dataset of Reddit comments labeled with three personality models (including the well-established Big 5 model) and d… ▽ More

    Submitted 8 June, 2021; v1 submitted 9 April, 2020; originally announced April 2020.

    Comments: Proceedings of the Ninth International Workshop on Natural Language Processing for Social Media, NAACL 2021, https://www.aclweb.org/anthology/2021.socialnlp-1.12

  8. arXiv:1912.10824  [pdf, other

    cs.LG cs.CL cs.LO

    Differentiable Reasoning on Large Knowledge Bases and Natural Language

    Authors: Pasquale Minervini, Matko Bošnjak, Tim Rocktäschel, Sebastian Riedel, Edward Grefenstette

    Abstract: Reasoning with knowledge expressed in natural language and Knowledge Bases (KBs) is a major challenge for Artificial Intelligence, with applications in machine reading, dialogue, and question answering. General neural architectures that jointly learn representations and transformations of text are very data-inefficient, and it is hard to analyse their reasoning process. These issues are addressed… ▽ More

    Submitted 17 December, 2019; originally announced December 2019.

    Comments: Accepted at the 34th AAAI Conference on Artificial Intelligence (AAAI-20)

  9. arXiv:1906.04985  [pdf, other

    cs.LG cs.AI cs.SC stat.ML

    Neural Variational Inference For Estimating Uncertainty in Knowledge Graph Embeddings

    Authors: Alexander I. Cowen-Rivers, Pasquale Minervini, Tim Rocktaschel, Matko Bosnjak, Sebastian Riedel, Jun Wang

    Abstract: Recent advances in Neural Variational Inference allowed for a renaissance in latent variable models in a variety of domains involving high-dimensional data. While traditional variational methods derive an analytical approximation for the intractable distribution over the latent variables, here we construct an inference network conditioned on the symbolic representation of entities and relation typ… ▽ More

    Submitted 18 August, 2019; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: Accepted at IJCAI 19 Neural-Symbolic Learning and Reasoning Workshop

  10. arXiv:1905.09275  [pdf, other

    cs.LG cs.AI

    COBRA: Data-Efficient Model-Based RL through Unsupervised Object Discovery and Curiosity-Driven Exploration

    Authors: Nicholas Watters, Loic Matthey, Matko Bosnjak, Christopher P. Burgess, Alexander Lerchner

    Abstract: Data efficiency and robustness to task-irrelevant perturbations are long-standing challenges for deep reinforcement learning algorithms. Here we introduce a modular approach to addressing these challenges in a continuous control environment, without using hand-crafted or supervised information. Our Curious Object-Based seaRch Agent (COBRA) uses task-free intrinsically motivated exploration and uns… ▽ More

    Submitted 14 August, 2019; v1 submitted 22 May, 2019; originally announced May 2019.

  11. arXiv:1807.08204  [pdf, other

    cs.AI cs.CL

    Towards Neural Theorem Proving at Scale

    Authors: Pasquale Minervini, Matko Bosnjak, Tim Rocktäschel, Sebastian Riedel

    Abstract: Neural models combining representation learning and reasoning in an end-to-end trainable manner are receiving increasing interest. However, their use is severely limited by their computational complexity, which renders them unusable on real world datasets. We focus on the Neural Theorem Prover (NTP) model proposed by Rockt{ä}schel and Riedel (2017), a continuous relaxation of the Prolog backward c… ▽ More

    Submitted 21 July, 2018; originally announced July 2018.

    Comments: Federated Artificial Intelligence Meeting (FAIM) Workshop on Neural Abstract Machines & Program Induction v2

  12. arXiv:1806.08727  [pdf, ps, other

    cs.CL cs.LG stat.ML

    Jack the Reader - A Machine Reading Framework

    Authors: Dirk Weissenborn, Pasquale Minervini, Tim Dettmers, Isabelle Augenstein, Johannes Welbl, Tim Rocktäschel, Matko Bošnjak, Jeff Mitchell, Thomas Demeester, Pontus Stenetorp, Sebastian Riedel

    Abstract: Many Machine Reading and Natural Language Understanding tasks require reading supporting text in order to answer questions. For example, in Question Answering, the supporting text can be newswire or Wikipedia articles; in Natural Language Inference, premises can be seen as the supporting text and hypotheses as questions. Providing a set of useful primitives operating in a single framework of relat… ▽ More

    Submitted 19 June, 2018; originally announced June 2018.

    Comments: Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL 2018), System Demonstrations

  13. arXiv:1707.03389  [pdf, other

    stat.ML cs.LG

    SCAN: Learning Hierarchical Compositional Visual Concepts

    Authors: Irina Higgins, Nicolas Sonnerat, Loic Matthey, Arka Pal, Christopher P Burgess, Matko Bosnjak, Murray Shanahan, Matthew Botvinick, Demis Hassabis, Alexander Lerchner

    Abstract: The seemingly infinite diversity of the natural world arises from a relatively small set of coherent rules, such as the laws of physics or chemistry. We conjecture that these rules give rise to regularities that can be discovered through primarily unsupervised experiences and represented as abstract concepts. If such representations are compositional and hierarchical, they can be recombined into a… ▽ More

    Submitted 6 June, 2018; v1 submitted 11 July, 2017; originally announced July 2017.

  14. arXiv:1609.08359  [pdf, other

    cs.CL

    emoji2vec: Learning Emoji Representations from their Description

    Authors: Ben Eisner, Tim Rocktäschel, Isabelle Augenstein, Matko Bošnjak, Sebastian Riedel

    Abstract: Many current natural language processing applications for social media rely on representation learning and utilize pre-trained word embeddings. There currently exist several publicly-available, pre-trained sets of word embeddings, but they contain few or no emoji representations even as emoji usage in social media has increased. In this paper we release emoji2vec, pre-trained embeddings for all Un… ▽ More

    Submitted 20 November, 2016; v1 submitted 27 September, 2016; originally announced September 2016.

    Comments: 7 pages, 4 figures, 1 table, In Proceedings of the 4th International Workshop on Natural Language Processing for Social Media at EMNLP 2016 (SocialNLP at EMNLP 2016)

    MSC Class: 68T50 ACM Class: I.2.7

  15. arXiv:1605.06640  [pdf, other

    cs.NE cs.AI cs.LG

    Programming with a Differentiable Forth Interpreter

    Authors: Matko Bošnjak, Tim Rocktäschel, Jason Naradowsky, Sebastian Riedel

    Abstract: Given that in practice training data is scarce for all but a small set of problems, a core question is how to incorporate prior knowledge into a model. In this paper, we consider the case of prior procedural knowledge for neural networks, such as knowing how a program should traverse a sequence, but not what local actions should be performed at each step. To this end, we present an end-to-end diff… ▽ More

    Submitted 23 July, 2017; v1 submitted 21 May, 2016; originally announced May 2016.

    Comments: 34th International Conference on Machine Learning (ICML 2017)

  16. Synthetic sequence generator for recommender systems - memory biased random walk on sequence multilayer network

    Authors: Nino Antulov-Fantulin, Matko Bosnjak, Vinko Zlatic, Miha Grcar, Tomislav Smuc

    Abstract: Personalized recommender systems rely on each user's personal usage data in the system, in order to assist in decision making. However, privacy policies protecting users' rights prevent these highly personal data from being publicly available to a wider researcher audience. In this work, we propose a memory biased random walk model on multilayer sequence network, as a generator of synthetic sequen… ▽ More

    Submitted 15 July, 2014; v1 submitted 30 January, 2012; originally announced January 2012.

    Comments: The new updated version of the paper

    ACM Class: K.4.1; H.2.8; G.3