Showing 1–3 of 3 results for author: Cervantes, C M

  1. arXiv:2002.02012

    cs.AI cs.LG

    From Route Instructions to Landmark Graphs

    Authors: Christopher M Cervantes

    Abstract: Landmarks are central to how people navigate, but most navigation technologies do not incorporate them into their representations. We propose the landmark graph generation task (creating landmark-based spatial representations from natural language) and introduce a fully end-to-end neural approach to generate these graphs. We evaluate our models on the SAIL route instruction dataset, as well as on…

    Submitted 5 February, 2020; originally announced February 2020.

  2. arXiv:1611.06641

    cs.CV

    Phrase Localization and Visual Relationship Detection with Comprehensive Image-Language Cues

    Authors: Bryan A. Plummer, Arun Mallya, Christopher M. Cervantes, Julia Hockenmaier, Svetlana Lazebnik

    Abstract: This paper presents a framework for localization or grounding of phrases in images using a large collection of linguistic and visual cues. We model the appearance, size, and position of entity bounding boxes, adjectives that contain attribute information, and spatial relationships between pairs of entities connected by verbs or prepositions. Special attention is given to relationships between peop…

    Submitted 8 August, 2017; v1 submitted 20 November, 2016; originally announced November 2016.

    Comments: Accepted to IEEE ICCV 2017

  3. arXiv:1505.04870

    cs.CV cs.CL

    Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models

    Authors: Bryan A. Plummer, Liwei Wang, Chris M. Cervantes, Juan C. Caicedo, Julia Hockenmaier, Svetlana Lazebnik

    Abstract: The Flickr30k dataset has become a standard benchmark for sentence-based image description. This paper presents Flickr30k Entities, which augments the 158k captions from Flickr30k with 244k coreference chains, linking mentions of the same entities across different captions for the same image, and associating them with 276k manually annotated bounding boxes. Such annotations are essential for conti…

    Submitted 19 September, 2016; v1 submitted 19 May, 2015; originally announced May 2015.