Skip to main content

Showing 1–19 of 19 results for author: Brostow, G J

.
  1. arXiv:2306.01596  [pdf, other

    cs.CV

    Two-View Geometry Scoring Without Correspondences

    Authors: Axel Barroso-Laguna, Eric Brachmann, Victor Adrian Prisacariu, Gabriel J. Brostow, Daniyar Turmukhambetov

    Abstract: Camera pose estimation for two-view geometry traditionally relies on RANSAC. Normally, a multitude of image correspondences leads to a pool of proposed hypotheses, which are then scored to find a winning model. The inlier count is generally regarded as a reliable indicator of "consensus". We examine this scoring heuristic, and find that it favors disappointing models under certain circumstances. A… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023

  2. arXiv:2305.07014  [pdf, other

    cs.CV

    Virtual Occlusions Through Implicit Depth

    Authors: Jamie Watson, Mohamed Sayed, Zawar Qureshi, Gabriel J. Brostow, Sara Vicente, Oisin Mac Aodha, Michael Firman

    Abstract: For augmented reality (AR), it is important that virtual assets appear to `sit among' real world objects. The virtual element should variously occlude and be occluded by real matter, based on a plausible depth ordering. This occlusion should be consistent over time as the viewer's camera moves. Unfortunately, small mistakes in the estimated scene depth can ruin the downstream occlusion mask, and t… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: Accepted to CVPR 2023

  3. Automatic Joint Parameter Estimation from Magnetic Motion Capture Data

    Authors: James F. O'Brien, Robert E. Bodenheimer, Gabriel J. Brostow, Jessica K. Hodgins

    Abstract: This paper describes a technique for using magnetic motion capture data to determine the joint parameters of an articulated hierarchy. This technique makes it possible to determine limb lengths, joint locations, and sensor placement for a human subject without external measurements. Instead, the joint parameters are inferred with high accuracy from the motion data acquired during the capture sessi… ▽ More

    Submitted 18 March, 2023; originally announced March 2023.

    Comments: 8 pages, 8 figures, 4 tables

    ACM Class: I.3.5

    Journal ref: In Proceedings of Graphics Interface 2000, pages 53-60, May 2000

  4. arXiv:2008.10634  [pdf, other

    cs.CV

    DiverseNet: When One Right Answer is not Enough

    Authors: Michael Firman, Neill D. F. Campbell, Lourdes Agapito, Gabriel J. Brostow

    Abstract: Many structured prediction tasks in machine vision have a collection of acceptable answers, instead of one definitive ground truth answer. Segmentation of images, for example, is subject to human labeling bias. Similarly, there are multiple possible pixel values that could plausibly complete occluded image regions. State-of-the art supervised learning methods are typically optimized to make a sing… ▽ More

    Submitted 24 August, 2020; originally announced August 2020.

    Comments: Presented at CVPR 2018

  5. arXiv:2008.06959  [pdf, other

    cs.CV

    Image Stylization for Robust Features

    Authors: Iaroslav Melekhov, Gabriel J. Brostow, Juho Kannala, Daniyar Turmukhambetov

    Abstract: Local features that are robust to both viewpoint and appearance changes are crucial for many computer vision tasks. In this work we investigate if photorealistic image stylization improves robustness of local features to not only day-night, but also weather and season variations. We show that image stylization in addition to color augmentation is a powerful method of learning robust features. We e… ▽ More

    Submitted 16 August, 2020; originally announced August 2020.

    Comments: v1.1

  6. arXiv:2008.05785  [pdf, other

    cs.CV cs.LG

    Predicting Visual Overlap of Images Through Interpretable Non-Metric Box Embeddings

    Authors: Anita Rau, Guillermo Garcia-Hernando, Danail Stoyanov, Gabriel J. Brostow, Daniyar Turmukhambetov

    Abstract: To what extent are two images picturing the same 3D surfaces? Even when this is a known scene, the answer typically requires an expensive search across scale space, with matching and geometric verification of large sets of local features. This expense is further multiplied when a query image is evaluated against a gallery, e.g. in visual relocalization. While we don't obviate the need for geometri… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: ECCV 2020

  7. arXiv:2008.01484  [pdf, other

    cs.CV

    Learning Stereo from Single Images

    Authors: Jamie Watson, Oisin Mac Aodha, Daniyar Turmukhambetov, Gabriel J. Brostow, Michael Firman

    Abstract: Supervised deep networks are among the best methods for finding correspondences in stereo image pairs. Like all supervised approaches, these networks require ground truth data during training. However, collecting large quantities of accurate dense correspondence data is very challenging. We propose that it is unnecessary to have such a high reliance on ground truth depths or even corresponding ste… ▽ More

    Submitted 20 August, 2020; v1 submitted 4 August, 2020; originally announced August 2020.

    Comments: Accepted as an oral presentation at ECCV 2020

  8. arXiv:2004.06376  [pdf, other

    cs.CV

    Footprints and Free Space from a Single Color Image

    Authors: Jamie Watson, Michael Firman, Aron Monszpart, Gabriel J. Brostow

    Abstract: Understanding the shape of a scene from a single color image is a formidable computer vision task. However, most methods aim to predict the geometry of surfaces that are visible to the camera, which is of limited use when planning paths for robots or augmented reality agents. Such agents can only move when grounded on a traversable surface, which we define as the set of classes which humans can al… ▽ More

    Submitted 14 April, 2020; originally announced April 2020.

    Comments: Accepted to CVPR 2020 as an oral presentation

  9. arXiv:1909.09051  [pdf, other

    cs.CV

    Self-Supervised Monocular Depth Hints

    Authors: Jamie Watson, Michael Firman, Gabriel J. Brostow, Daniyar Turmukhambetov

    Abstract: Monocular depth estimators can be trained with various forms of self-supervision from binocular-stereo data to circumvent the need for high-quality laser scans or other ground-truth data. The disadvantage, however, is that the photometric reprojection losses used with self-supervised learning typically have multiple local minima. These plausible-looking alternatives to ground truth can restrict wh… ▽ More

    Submitted 19 September, 2019; originally announced September 2019.

    Comments: Accepted to ICCV 2019

  10. arXiv:1711.07476  [pdf, other

    cs.LG cs.CV stat.ML

    Virtual Adversarial Ladder Networks For Semi-supervised Learning

    Authors: Saki Shinoda, Daniel E. Worrall, Gabriel J. Brostow

    Abstract: Semi-supervised learning (SSL) partially circumvents the high cost of labeling data by augmenting a small labeled dataset with a large and relatively cheap unlabeled dataset drawn from the same distribution. This paper offers a novel interpretation of two deep learning-based SSL approaches, ladder networks and virtual adversarial training (VAT), as applying distributional smoothing to their respec… ▽ More

    Submitted 12 December, 2017; v1 submitted 20 November, 2017; originally announced November 2017.

    Comments: Camera-ready version for NIPS 2017 workshop Learning with Limited Labeled Data

  11. arXiv:1710.07307  [pdf, other

    cs.CV

    Interpretable Transformations with Encoder-Decoder Networks

    Authors: Daniel E. Worrall, Stephan J. Garbin, Daniyar Turmukhambetov, Gabriel J. Brostow

    Abstract: Deep feature spaces have the capacity to encode complex transformations of their input data. However, understanding the relative feature-space relationship between two transformed encoded images is difficult. For instance, what is the relative feature space relationship between two rotated images? What is decoded when we interpolate in feature space? Ideally, we want to disentangle confounding fac… ▽ More

    Submitted 19 October, 2017; originally announced October 2017.

    Comments: Accepted at ICCV 2017

  12. Responsive Action-based Video Synthesis

    Authors: Corneliu Ilisescu, Halil Aytac Kanaci, Matteo Romagnoli, Neill D. F. Campbell, Gabriel J. Brostow

    Abstract: We propose technology to enable a new medium of expression, where video elements can be looped, merged, and triggered, interactively. Like audio, video is easy to sample from the real world but hard to segment into clean reusable elements. Reusing a video clip means non-linear editing and compositing with novel footage. The new context dictates how carefully a clip must be prepared, so our end-to-… ▽ More

    Submitted 20 May, 2017; originally announced May 2017.

    Comments: 10 pages, 12 figures, 1 table, accepted and published in Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems

    ACM Class: H.5.2

  13. arXiv:1612.04642  [pdf, other

    cs.CV cs.LG stat.ML

    Harmonic Networks: Deep Translation and Rotation Equivariance

    Authors: Daniel E. Worrall, Stephan J. Garbin, Daniyar Turmukhambetov, Gabriel J. Brostow

    Abstract: Translating or rotating an input image should not affect the results of many computer vision tasks. Convolutional neural networks (CNNs) are already translation equivariant: input image translations produce proportionate feature map translations. This is not the case for rotations. Global rotation equivariance is typically sought through data augmentation, but patch-wise equivariance is more diffi… ▽ More

    Submitted 11 April, 2017; v1 submitted 14 December, 2016; originally announced December 2016.

    Comments: Submitted to CVPR 2017

  14. arXiv:1611.03906  [pdf, other

    cs.HC

    Help, It Looks Confusing: GUI Task Automation Through Demonstration and Follow-up Questions

    Authors: Thanapong Intharah, Daniyar Turmukhambetov, Gabriel J. Brostow

    Abstract: Non-programming users should be able to create their own customized scripts to perform computer-based tasks for them, just by demonstrating to the machine how it's done. To that end, we develop a system prototype which learns-by-demonstration called HILC (Help, It Looks Confusing). Users train HILC to synthesize a task script by demonstrating the task, which produces the needed screenshots and the… ▽ More

    Submitted 13 January, 2017; v1 submitted 11 November, 2016; originally announced November 2016.

    Comments: Camera Ready version. Accepted to be presented at the ACM IUI 2017

  15. arXiv:1609.08080  [pdf, other

    cs.CV

    Swipe Mosaics from Video

    Authors: Malcolm Reynolds, Tom S. F. Haines, Gabriel J. Brostow

    Abstract: A panoramic image mosaic is an attractive visualization for viewing many overlap** photos, but its images must be both captured and processed correctly to produce an acceptable composite. We propose Swipe Mosaics, an interactive visualization that places the individual video frames on a 2D planar map that represents the layout of the physical scene. Compared to traditional panoramic mosaics, our… ▽ More

    Submitted 26 September, 2016; originally announced September 2016.

  16. arXiv:1609.03677  [pdf, other

    cs.CV cs.LG stat.ML

    Unsupervised Monocular Depth Estimation with Left-Right Consistency

    Authors: Clément Godard, Oisin Mac Aodha, Gabriel J. Brostow

    Abstract: Learning based methods have shown very promising results for the task of depth estimation in single images. However, most existing approaches treat depth prediction as a supervised regression problem and as a result, require vast quantities of corresponding ground truth depth data for training. Just recording quality depth data in a range of environments is a challenging problem. In this paper, we… ▽ More

    Submitted 12 April, 2017; v1 submitted 13 September, 2016; originally announced September 2016.

    Comments: CVPR 2017 oral

  17. arXiv:1504.08219  [pdf, other

    cs.CV cs.LG stat.ML

    Hierarchical Subquery Evaluation for Active Learning on a Graph

    Authors: Oisin Mac Aodha, Neill D. F. Campbell, Jan Kautz, Gabriel J. Brostow

    Abstract: To train good supervised and semi-supervised object classifiers, it is critical that we not waste the time of the human experts who are providing the training labels. Existing active learning strategies can have uneven performance, being efficient on some datasets but wasteful on others, or inconsistent just between runs on the same dataset. We propose perplexity based graph construction and a new… ▽ More

    Submitted 30 April, 2015; originally announced April 2015.

    Comments: CVPR 2014

  18. arXiv:1504.07575  [pdf, other

    cs.CV cs.LG stat.ML

    Becoming the Expert - Interactive Multi-Class Machine Teaching

    Authors: Edward Johns, Oisin Mac Aodha, Gabriel J. Brostow

    Abstract: Compared to machines, humans are extremely good at classifying images into categories, especially when they possess prior knowledge of the categories at hand. If this prior information is not available, supervision in the form of teaching images is required. To learn categories more quickly, people should see important and representative images first, followed by less important images later - or n… ▽ More

    Submitted 28 April, 2015; originally announced April 2015.

    Comments: CVPR 2015

  19. arXiv:1502.04983  [pdf, other

    cs.CV

    Context Tricks for Cheap Semantic Segmentation

    Authors: Thanapong Intharah, Gabriel J. Brostow

    Abstract: Accurate semantic labeling of image pixels is difficult because intra-class variability is often greater than inter-class variability. In turn, fast semantic segmentation is hard because accurate models are usually too complicated to also run quickly at test-time. Our experience with building and running semantic segmentation systems has also shown a reasonably obvious bottleneck on model complexi… ▽ More

    Submitted 17 February, 2015; originally announced February 2015.

    Comments: Supplementary material can be found at http://www0.cs.ucl.ac.uk/staff/T.Intharah/research.html