Skip to main content

Showing 1–7 of 7 results for author: Philbin, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2006.09917  [pdf, other

    cs.CV cs.LG

    FISHING Net: Future Inference of Semantic Heatmaps In Grids

    Authors: Noureldin Hendy, Cooper Sloan, Feng Tian, Pengfei Duan, Nick Charchut, Yuesong Xie, Chuang Wang, James Philbin

    Abstract: For autonomous robots to navigate a complex environment, it is crucial to understand the surrounding scene both geometrically and semantically. Modern autonomous robots employ multiple sets of sensors, including lidars, radars, and cameras. Managing the different reference frames and characteristics of the sensors, and merging their observations into a single representation complicates perception.… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

  2. arXiv:1906.08945  [pdf, other

    cs.CV cs.LG cs.RO

    Rules of the Road: Predicting Driving Behavior with a Convolutional Model of Semantic Interactions

    Authors: Joey Hong, Benjamin Sapp, James Philbin

    Abstract: We focus on the problem of predicting future states of entities in complex, real-world driving scenarios. Previous research has used low-level signals to predict short time horizons, and has not addressed how to leverage key assets relied upon heavily by industry self-driving systems: (1) large 3D perception efforts which provide highly accurate 3D states of agents with rich attributes, and (2) de… ▽ More

    Submitted 21 June, 2019; originally announced June 2019.

    Comments: Accepted at CVPR 2019

    Journal ref: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 8454-8462

  3. PlaNet - Photo Geolocation with Convolutional Neural Networks

    Authors: Tobias Weyand, Ilya Kostrikov, James Philbin

    Abstract: Is it possible to build a system to determine the location where a photo was taken using just its pixels? In general, the problem seems exceptionally difficult: it is trivial to construct situations where no location can be inferred. Yet images often contain informative cues such as landmarks, weather patterns, vegetation, road markings, and architectural details, which in combination may allow on… ▽ More

    Submitted 17 February, 2016; originally announced February 2016.

  4. arXiv:1511.06789  [pdf, other

    cs.CV

    The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition

    Authors: Jonathan Krause, Benjamin Sapp, Andrew Howard, Howard Zhou, Alexander Toshev, Tom Duerig, James Philbin, Li Fei-Fei

    Abstract: Current approaches for fine-grained recognition do the following: First, recruit experts to annotate a dataset of images, optionally also collecting more structured data in the form of part annotations and bounding boxes. Second, train a model utilizing this data. Toward the goal of solving fine-grained recognition, we introduce an alternative approach, leveraging free, noisy data from the web and… ▽ More

    Submitted 18 October, 2016; v1 submitted 20 November, 2015; originally announced November 2015.

    Comments: ECCV 2016, data is released

  5. arXiv:1506.06825  [pdf, other

    cs.CV

    DeepStereo: Learning to Predict New Views from the World's Imagery

    Authors: John Flynn, Ivan Neulander, James Philbin, Noah Snavely

    Abstract: Deep networks have recently enjoyed enormous success when applied to recognition and classification problems in computer vision, but their use in graphics problems has been limited. In this work, we present a novel deep architecture that performs new view synthesis directly from pixels, trained from a large number of posed image sets. In contrast to traditional approaches which consist of multiple… ▽ More

    Submitted 22 June, 2015; originally announced June 2015.

    Comments: Video showing additional results available at http://youtu.be/cizgVZ8rjKA

  6. FaceNet: A Unified Embedding for Face Recognition and Clustering

    Authors: Florian Schroff, Dmitry Kalenichenko, James Philbin

    Abstract: Despite significant recent advances in the field of face recognition, implementing face verification and recognition efficiently at scale presents serious challenges to current approaches. In this paper we present a system, called FaceNet, that directly learns a map** from face images to a compact Euclidean space where distances directly correspond to a measure of face similarity. Once this spac… ▽ More

    Submitted 17 June, 2015; v1 submitted 12 March, 2015; originally announced March 2015.

    Comments: Also published, in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition 2015

  7. arXiv:1404.4661  [pdf, ps, other

    cs.CV

    Learning Fine-grained Image Similarity with Deep Ranking

    Authors: Jiang Wang, Yang song, Thomas Leung, Chuck Rosenberg, **bin Wang, James Philbin, Bo Chen, Ying Wu

    Abstract: Learning fine-grained image similarity is a challenging task. It needs to capture between-class and within-class image differences. This paper proposes a deep ranking model that employs deep learning techniques to learn similarity metric directly from images.It has higher learning capability than models based on hand-crafted features. A novel multiscale network structure has been developed to desc… ▽ More

    Submitted 17 April, 2014; originally announced April 2014.

    Comments: CVPR 2014