Skip to main content

Showing 1–8 of 8 results for author: Lorenzo, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  2. arXiv:2212.13535  [pdf, other

    cs.CV cs.AI

    From Single-Visit to Multi-Visit Image-Based Models: Single-Visit Models are Enough to Predict Obstructive Hydronephrosis

    Authors: Stanley Bryan Z. Hua, Mandy Rickard, John Weaver, Alice Xiang, Daniel Alvarez, Kyla N. Velear, Kunj Sheth, Gregory E. Tasian, Armando J. Lorenzo, Anna Goldenberg, Lauren Erdman

    Abstract: Previous work has shown the potential of deep learning to predict renal obstruction using kidney ultrasound images. However, these image-based classifiers have been trained with the goal of single-visit inference in mind. We compare methods from video action recognition (i.e. convolutional pooling, LSTM, TSM) to adapt single-visit convolutional models to handle multiple visit inference. We demonst… ▽ More

    Submitted 27 December, 2022; originally announced December 2022.

    Comments: Paper accepted to SIPAIM 2022 (in Valparaiso, Chile)

  3. arXiv:2210.03453  [pdf, other

    cs.CV

    Key Information Extraction in Purchase Documents using Deep Learning and Rule-based Corrections

    Authors: Roberto Arroyo, Javier Yebes, Elena Martínez, Héctor Corrales, Javier Lorenzo

    Abstract: Deep Learning (DL) is dominating the fields of Natural Language Processing (NLP) and Computer Vision (CV) in the recent times. However, DL commonly relies on the availability of large data annotations, so other alternative or complementary pattern-based techniques can help to improve results. In this paper, we build upon Key Information Extraction (KIE) in purchase documents using both DL and rule… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: Conference on Computational Linguistics (COLING 2022). PAN-DL Workshop

  4. arXiv:2112.13922  [pdf

    cs.LG

    Predicting Breakdown Risk Based on Historical Maintenance Data for Air Force Ground Vehicles

    Authors: Jeff Jang, Dilan Nana, Jack Hochschild, Jordi Vila Hernandez de Lorenzo

    Abstract: Unscheduled maintenance has contributed to longer downtime for vehicles and increased costs for Logistic Readiness Squadrons (LRSs) in the Air Force. When vehicles are in need of repair outside of their scheduled time, depending on their priority level, the entire squadron's slated repair schedule is transformed negatively. The repercussions of unscheduled maintenance are specifically seen in the… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

    Comments: 15 pages, 8 figures

  5. arXiv:2105.08647  [pdf, other

    cs.CV cs.AI

    IntFormer: Predicting pedestrian intention with the aid of the Transformer architecture

    Authors: J. Lorenzo, I. Parra, M. A. Sotelo

    Abstract: Understanding pedestrian crossing behavior is an essential goal in intelligent vehicle development, leading to an improvement in their security and traffic flow. In this paper, we developed a method called IntFormer. It is based on transformer architecture and a novel convolutional video classification model called RubiksNet. Following the evaluation procedure in a recent benchmark, we show that o… ▽ More

    Submitted 18 May, 2021; originally announced May 2021.

    Comments: 5 pages, 2 figures

  6. arXiv:2008.11647  [pdf, other

    cs.CV cs.LG

    RNN-based Pedestrian Crossing Prediction using Activity and Pose-related Features

    Authors: Javier Lorenzo, Ignacio Parra, Florian Wirth, Christoph Stiller, David Fernandez Llorca, Miguel Angel Sotelo

    Abstract: Pedestrian crossing prediction is a crucial task for autonomous driving. Numerous studies show that an early estimation of the pedestrian's intention can decrease or even avoid a high percentage of accidents. In this paper, different variations of a deep learning system are proposed to attempt to solve this problem. The proposed models are composed of two parts: a CNN-based feature extractor and a… ▽ More

    Submitted 26 August, 2020; originally announced August 2020.

    Comments: 6 pages, 5 figures. This work has been accepted for publication at IEEE Intelligent Vehicle Symposium 2020

  7. arXiv:1809.10937  [pdf, other

    cs.DC cs.PF

    New Thread Migration Strategies for NUMA Systems

    Authors: O. G. Lorenzo, M. L. Becoña, T. F. Pena, J. C. Cabaleiro, J. A. Lorenzo, F. F. Rivera

    Abstract: Multicore systems present on-board memory hierarchies and communication networks that influence performance when executing shared memory parallel codes. Characterising this influence is complex, and understanding the effect of particular hardware configurations on different codes is of paramount importance. In previous works, monitoring information extracted from hardware counters at runtime has b… ▽ More

    Submitted 28 September, 2018; originally announced September 2018.

    Comments: Unpublished work

  8. Norm-1 Regularized Consensus-based ADMM for Imaging with a Compressive Antenna

    Authors: Juan Heredia Juesas, Ali Molaei, Luis Tirado, William Blackwell, Jose A Martinez Lorenzo

    Abstract: This paper presents a novel norm-one-regularized, consensus-based imaging algorithm, based on the Alternating Direction Method of Multipliers (ADMM). This algorithm is capable of imaging composite dielectric and metallic targets by using limited amount of data. The distributed capabilities of the ADMM accelerates the convergence of the imaging. Recently, a Compressive Reflector Antenna (CRA) has b… ▽ More

    Submitted 16 March, 2016; originally announced March 2016.

    Comments: 4 pages, 4 figures