Skip to main content

Showing 1–19 of 19 results for author: Fitzgerald, N

.
  1. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  2. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  3. WASM-MUTATE: Fast and Effective Binary Diversification for WebAssembly

    Authors: Javier Cabrera-Arteaga, Nicholas Fitzgerald, Martin Monperrus, Benoit Baudry

    Abstract: WebAssembly is the fourth officially endorsed Web language. It is recognized because of its efficiency and design, focused on security. Yet, its swiftly expanding ecosystem lacks robust software diversification systems. We introduce WASM-MUTATE, a diversification engine specifically designed for WebAssembly. Our engine meets several essential criteria: 1) To quickly generate functionally identical… ▽ More

    Submitted 17 January, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Report number: volume 139

    Journal ref: Computers & Security, 2024

  4. arXiv:2306.10231  [pdf, other

    cs.CL cs.AI cs.LG

    GLIMMER: generalized late-interaction memory reranker

    Authors: Michiel de Jong, Yury Zemlyanskiy, Nicholas FitzGerald, Sumit Sanghai, William W. Cohen, Joshua Ainslie

    Abstract: Memory-augmentation is a powerful approach for efficiently incorporating external information into language models, but leads to reduced performance relative to retrieving text. Recent work introduced LUMEN, a memory-retrieval hybrid that partially pre-computes memory and updates memory representations on the fly with a smaller live encoder. We propose GLIMMER, which improves on this approach th… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

  5. A Checklist to Publish Collections as Data in GLAM Institutions

    Authors: Gustavo Candela, Nele Gabriëls, Sally Chambers, Thuy-An Pham, Sarah Ames, Neil Fitzgerald, Katrine Hofmann, Victor Harbo, Abigail Potter, Meghan Ferriter, Eileen Manchester, Alba Irollo, Ellen Van Keer, Mahendra Mahey, Olga Holownia, Milena Dobreva

    Abstract: Large-scale digitization in Galleries, Libraries, Archives and Museums (GLAM) created the conditions for providing access to collections as data. It opened new opportunities to explore, use and reuse digital collections. Strong proponents of collections as data are the Innovation Labs which provided numerous examples of publishing datasets under open licenses in order to reuse digital content in n… ▽ More

    Submitted 13 November, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: This is an original manuscript of an article published by Emerald Publishing Limited in Global Knowledge, Memory and Communication on 9 November 2023, available online: https://doi.org/10.1108/GKMC-06-2023-0195

  6. arXiv:2301.10448  [pdf, other

    cs.CL cs.AI cs.LG

    Pre-computed memory or on-the-fly encoding? A hybrid approach to retrieval augmentation makes the most of your compute

    Authors: Michiel de Jong, Yury Zemlyanskiy, Nicholas FitzGerald, Joshua Ainslie, Sumit Sanghai, Fei Sha, William Cohen

    Abstract: Retrieval-augmented language models such as Fusion-in-Decoder are powerful, setting the state of the art on a variety of knowledge-intensive tasks. However, they are also expensive, due to the need to encode a large number of retrieved passages. Some work avoids this cost by pre-encoding a text corpus into a memory and retrieving dense representations directly. However, pre-encoding memory incurs… ▽ More

    Submitted 2 June, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: ICML 2023

  7. arXiv:2212.08153  [pdf, other

    cs.CL cs.AI cs.LG

    FiDO: Fusion-in-Decoder optimized for stronger performance and faster inference

    Authors: Michiel de Jong, Yury Zemlyanskiy, Joshua Ainslie, Nicholas FitzGerald, Sumit Sanghai, Fei Sha, William Cohen

    Abstract: Fusion-in-Decoder (FiD) is a powerful retrieval-augmented language model that sets the state-of-the-art on many knowledge-intensive NLP tasks. However, the architecture used for FiD was chosen by making minimal modifications to a standard T5 model, which our analysis shows to be highly suboptimal for a retrieval-augmented model. In particular, FiD allocates the bulk of FLOPs to the encoder, while… ▽ More

    Submitted 2 June, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: ACL Findings 2023

  8. arXiv:2110.06176  [pdf, other

    cs.CL cs.AI cs.LG

    Mention Memory: incorporating textual knowledge into Transformers through entity mention attention

    Authors: Michiel de Jong, Yury Zemlyanskiy, Nicholas FitzGerald, Fei Sha, William Cohen

    Abstract: Natural language understanding tasks such as open-domain question answering often require retrieving and assimilating factual information from multiple sources. We propose to address this problem by integrating a semi-parametric representation of a large text corpus into a Transformer model as a source of factual knowledge. Specifically, our method represents knowledge with `mention memory', a tab… ▽ More

    Submitted 19 April, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

  9. arXiv:2106.07352  [pdf, other

    cs.IR cs.CL cs.LG cs.SI

    MOLEMAN: Mention-Only Linking of Entities with a Mention Annotation Network

    Authors: Nicholas FitzGerald, Jan A. Botha, Daniel Gillick, Daniel M. Bikel, Tom Kwiatkowski, Andrew McCallum

    Abstract: We present an instance-based nearest neighbor approach to entity linking. In contrast to most prior entity retrieval systems which represent each entity with a single vector, we build a contextualized mention-encoder that learns to place similar mentions of the same entity closer in vector space than mentions of different entities. This approach allows all mentions of an entity to serve as "class… ▽ More

    Submitted 22 July, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted to ACL 2021, edit to add missing Turkish results in Tables 2 and 7

  10. arXiv:2008.00920  [pdf, other

    cs.AI cs.CL

    On The Plurality of Graphs

    Authors: Nicole Fitzgerald, Jacopo Tagliabue

    Abstract: We conduct a series of experiments designed to empirically demonstrate the effects of varying the structural features of a multi-agent emergent communication game framework. Specifically, we model the interactions (edges) between individual agents (nodes)as the structure of a graph generated according to a series of known random graph generating algorithms. Confirming the hypothesis proposed in [1… ▽ More

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: Manuscript accepted at NETREASON @ ECAI2020

  11. arXiv:2005.14253  [pdf, ps, other

    cs.CL cs.LG

    Empirical Evaluation of Pretraining Strategies for Supervised Entity Linking

    Authors: Thibault Févry, Nicholas FitzGerald, Livio Baldini Soares, Tom Kwiatkowski

    Abstract: In this work, we present an entity linking model which combines a Transformer architecture with large scale pretraining from Wikipedia links. Our model achieves the state-of-the-art on two commonly used entity linking datasets: 96.7% on CoNLL and 94.9% on TAC-KBP. We present detailed analyses to understand what design choices are important for entity linking, including choices of negative entity c… ▽ More

    Submitted 28 May, 2020; originally announced May 2020.

    Comments: 11 pages, 8 figures, appearing at AKBC 2020

  12. arXiv:2004.07202  [pdf, other

    cs.CL cs.LG

    Entities as Experts: Sparse Memory Access with Entity Supervision

    Authors: Thibault Févry, Livio Baldini Soares, Nicholas FitzGerald, Eunsol Choi, Tom Kwiatkowski

    Abstract: We focus on the problem of capturing declarative knowledge about entities in the learned parameters of a language model. We introduce a new model - Entities as Experts (EAE) - that can access distinct memories of the entities mentioned in a piece of text. Unlike previous efforts to integrate entity knowledge into sequence models, EAE's entity representations are learned directly from text. We show… ▽ More

    Submitted 6 October, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

  13. arXiv:2001.03765  [pdf, other

    cs.CL

    Learning Cross-Context Entity Representations from Text

    Authors: Jeffrey Ling, Nicholas FitzGerald, Zifei Shan, Livio Baldini Soares, Thibault Févry, David Weiss, Tom Kwiatkowski

    Abstract: Language modeling tasks, in which words, or word-pieces, are predicted on the basis of a local context, have been very effective for learning word embeddings and context dependent representations of phrases. Motivated by the observation that efforts to code world knowledge into machine readable knowledge bases or human readable encyclopedias tend to be entity-centric, we investigate the use of a f… ▽ More

    Submitted 11 January, 2020; originally announced January 2020.

  14. arXiv:1911.04362  [pdf, other

    cs.LG cs.CL stat.ML

    To Populate is To Regulate

    Authors: Nicole Fitzgerald

    Abstract: We examine the effects of instantiating Lewis signaling games within a population of speaker and listener agents with the aim of producing a set of general and robust representations of unstructured pixel data. Preliminary experiments suggest that the set of representations associated with languages generated within a population outperform those generated between a single speaker-listener pair on… ▽ More

    Submitted 6 November, 2019; originally announced November 2019.

    Comments: EmeCom Neurips 2019

  15. arXiv:1906.03158  [pdf, other

    cs.CL cs.AI

    Matching the Blanks: Distributional Similarity for Relation Learning

    Authors: Livio Baldini Soares, Nicholas FitzGerald, Jeffrey Ling, Tom Kwiatkowski

    Abstract: General purpose relation extractors, which can model arbitrary relations, are a core aspiration in information extraction. Efforts have been made to build general purpose extractors that represent relations with their surface forms, or which jointly embed surface forms with relations from an existing knowledge graph. However, both of these approaches are limited in their ability to generalize. In… ▽ More

    Submitted 7 June, 2019; originally announced June 2019.

    Comments: To appear at ACL 2019

  16. arXiv:1805.05377  [pdf, other

    cs.CL cs.AI

    Large-Scale QA-SRL Parsing

    Authors: Nicholas FitzGerald, Julian Michael, Luheng He, Luke Zettlemoyer

    Abstract: We present a new large-scale corpus of Question-Answer driven Semantic Role Labeling (QA-SRL) annotations, and the first high-quality QA-SRL parser. Our corpus, QA-SRL Bank 2.0, consists of over 250,000 question-answer pairs for over 64,000 sentences across 3 domains and was gathered with a new crowd-sourcing scheme that we show has high precision and good recall at modest cost. We also present ne… ▽ More

    Submitted 14 May, 2018; originally announced May 2018.

    Comments: 10 pages, 3 figures, 8 tables. Accepted to ACL 2018

  17. arXiv:1805.03716  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Long Short-Term Memory as a Dynamically Computed Element-wise Weighted Sum

    Authors: Omer Levy, Kenton Lee, Nicholas FitzGerald, Luke Zettlemoyer

    Abstract: LSTMs were introduced to combat vanishing gradients in simple RNNs by augmenting them with gated additive recurrent connections. We present an alternative view to explain the success of LSTMs: the gates themselves are versatile recurrent models that provide more representational power than previously appreciated. We do this by decoupling the LSTM's gates from the embedded simple RNN, producing a n… ▽ More

    Submitted 9 May, 2018; originally announced May 2018.

    Comments: ACL 2018

  18. arXiv:1206.6423  [pdf

    cs.CL cs.LG cs.RO

    A Joint Model of Language and Perception for Grounded Attribute Learning

    Authors: Cynthia Matuszek, Nicholas FitzGerald, Luke Zettlemoyer, Liefeng Bo, Dieter Fox

    Abstract: As robots become more ubiquitous and capable, it becomes ever more important to enable untrained users to easily interact with them. Recently, this has led to study of the language grounding problem, where the goal is to extract representations of the meanings of natural language tied to perception and actuation in the physical world. In this paper, we present an approach for joint learning of lan… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)

  19. arXiv:1206.5344  [pdf, other

    math.OC

    Gain Scheduling Control of Gas Turbine Engines: Absolute Stability by Finding a Common Lyapunov Matrix

    Authors: Mehrdad Pakmehr, Nathan Fitzgerald, Eric Feron, Jeff Shamma, Alireza Behbahani

    Abstract: This manuscript aims to develop and describe gain scheduling control concept for a gas turbine engine which drives a variable pitch propeller. An architecture for gain-scheduling control is developed that controls the turboshaft engine for large thrust commands in stable fashion with good performance. Fuel ow and propeller pitch angle are the two control inputs of the system. New stability proof h… ▽ More

    Submitted 22 June, 2012; originally announced June 2012.

    Comments: 15 pages, 21 figures