Skip to main content

Showing 1–44 of 44 results for author: Mac Aodha, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.08960  [pdf, other

    cs.CV

    AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings

    Authors: Jamie Watson, Filippo Aleotti, Mohamed Sayed, Zawar Qureshi, Oisin Mac Aodha, Gabriel Brostow, Michael Firman, Sara Vicente

    Abstract: Extracting planes from a 3D scene is useful for downstream tasks in robotics and augmented reality. In this paper we tackle the problem of estimating the planar surfaces in a scene from posed images. Our first finding is that a surprisingly competitive baseline results from combining popular clustering algorithms with recent improvements in 3D geometry estimation. However, such purely geometric me… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  2. arXiv:2406.04898  [pdf, other

    cs.CV

    Labeled Data Selection for Category Discovery

    Authors: Bingchen Zhao, Nico Lang, Serge Belongie, Oisin Mac Aodha

    Abstract: Category discovery methods aim to find novel categories in unlabeled visual data. At training time, a set of labeled and unlabeled images are provided, where the labels correspond to the categories present in the images. The labeled data provides guidance during training by indicating what types of visual properties and features are relevant for performing discovery in the unlabeled data. As a res… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2406.04254  [pdf, other

    cs.CV cs.AI

    GeoGen: Geometry-Aware Generative Modeling via Signed Distance Functions

    Authors: Salvatore Esposito, Qingshan Xu, Kacper Kania, Charlie Hewitt, Octave Mariotti, Lohit Petikam, Julien Valentin, Arno Onken, Oisin Mac Aodha

    Abstract: We introduce a new generative approach for synthesizing 3D geometry and images from single-view collections. Most existing approaches predict volumetric density to render multi-view consistent images. By employing volumetric rendering using neural radiance fields, they inherit a key limitation: the generated geometry is noisy and unconstrained, limiting the quality and utility of the output meshes… ▽ More

    Submitted 14 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  4. arXiv:2406.02535  [pdf, other

    cs.CV

    Enhancing 2D Representation Learning with a 3D Prior

    Authors: Mehmet Aygün, Prithviraj Dhar, Zhicheng Yan, Oisin Mac Aodha, Rakesh Ranjan

    Abstract: Learning robust and effective representations of visual data is a fundamental task in computer vision. Traditionally, this is achieved by training models with labeled data which can be expensive to obtain. Self-supervised learning attempts to circumvent the requirement for labeled data by learning representations from raw unlabeled visual data alone. However, unlike humans who obtain rich 3D infor… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  5. arXiv:2403.14526  [pdf, other

    cs.RO cs.AI cs.CV

    Click to Grasp: Zero-Shot Precise Manipulation via Visual Diffusion Descriptors

    Authors: Nikolaos Tsagkas, Jack Rome, Subramanian Ramamoorthy, Oisin Mac Aodha, Chris Xiaoxuan Lu

    Abstract: Precise manipulation that is generalizable across scenes and objects remains a persistent challenge in robotics. Current approaches for this task heavily depend on having a significant number of training instances to handle objects with pronounced visual and/or geometric part ambiguities. Our work explores the grounding of fine-grained part descriptors for precise manipulation in a zero-shot setti… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 8 pages, 4 figures

  6. arXiv:2312.13216  [pdf, other

    cs.CV

    Improving Semantic Correspondence with Viewpoint-Guided Spherical Maps

    Authors: Octave Mariotti, Oisin Mac Aodha, Hakan Bilen

    Abstract: Recent progress in self-supervised representation learning has resulted in models that are capable of extracting image features that are not only effective at encoding image level, but also pixel-level, semantics. These features have been shown to be effective for dense visual semantic correspondence estimation, even outperforming fully-supervised methods. Nevertheless, current self-supervised app… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  7. arXiv:2311.02061  [pdf, other

    cs.LG

    Active Learning-Based Species Range Estimation

    Authors: Christian Lange, Elijah Cole, Grant Van Horn, Oisin Mac Aodha

    Abstract: We propose a new active learning approach for efficiently estimating the geographic range of a species from a limited number of on the ground observations. We model the range of an unmapped species of interest as the weighted combination of estimated ranges obtained from a set of different species. We show that it is possible to generate this candidate set of ranges by using models that have been… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023

  8. arXiv:2308.12688  [pdf, other

    cs.SD eess.AS

    Whombat: An open-source annotation tool for machine learning development in bioacoustics

    Authors: Santiago Martinez Balvanera, Oisin Mac Aodha, Matthew J. Weldy, Holly Pringle, Ella Browning, Kate E. Jones

    Abstract: 1. Automated analysis of bioacoustic recordings using machine learning (ML) methods has the potential to greatly scale biodiversity monitoring efforts. The use of ML for high-stakes applications, such as conservation research, demands a data-centric approach with a focus on utilizing carefully annotated and curated evaluation and training data that is relevant and representative. Creating annotate… ▽ More

    Submitted 7 November, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 17 pages, 2 figures, 2 tables, to be submitted to Methods in Ecology and Evolution

    ACM Class: H.5.5; H.5.2; J.3; I.2.m

  9. arXiv:2306.02564  [pdf, other

    cs.LG cs.CV

    Spatial Implicit Neural Representations for Global-Scale Species Map**

    Authors: Elijah Cole, Grant Van Horn, Christian Lange, Alexander Shepard, Patrick Leary, Pietro Perona, Scott Loarie, Oisin Mac Aodha

    Abstract: Estimating the geographical range of a species from sparse observations is a challenging and important geospatial prediction problem. Given a set of locations where a species has been observed, the goal is to build a model to predict whether the species is present or absent at any location. This problem has a long history in ecology, but traditional methods struggle to take advantage of emerging l… ▽ More

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: ICML 2023

  10. arXiv:2305.12427  [pdf, other

    cs.CV

    VL-Fields: Towards Language-Grounded Neural Implicit Spatial Representations

    Authors: Nikolaos Tsagkas, Oisin Mac Aodha, Chris Xiaoxuan Lu

    Abstract: We present Visual-Language Fields (VL-Fields), a neural implicit spatial representation that enables open-vocabulary semantic queries. Our model encodes and fuses the geometry of a scene with vision-language trained latent features by distilling information from a language-driven segmentation model. VL-Fields is trained without requiring any prior knowledge of the scene object classes, which makes… ▽ More

    Submitted 25 May, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: Project page: https://tsagkas.github.io/vl-fields/

  11. arXiv:2305.07014  [pdf, other

    cs.CV

    Virtual Occlusions Through Implicit Depth

    Authors: Jamie Watson, Mohamed Sayed, Zawar Qureshi, Gabriel J. Brostow, Sara Vicente, Oisin Mac Aodha, Michael Firman

    Abstract: For augmented reality (AR), it is important that virtual assets appear to `sit among' real world objects. The virtual element should variously occlude and be occluded by real matter, based on a plausible depth ordering. This occlusion should be consistent over time as the viewer's camera moves. Unfortunately, small mistakes in the estimated scene depth can ruin the downstream occlusion mask, and t… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: Accepted to CVPR 2023

  12. arXiv:2304.14310  [pdf, other

    cs.CV

    Incremental Generalized Category Discovery

    Authors: Bingchen Zhao, Oisin Mac Aodha

    Abstract: We explore the problem of Incremental Generalized Category Discovery (IGCD). This is a challenging category incremental learning setting where the goal is to develop models that can correctly categorize images from previously seen categories, in addition to discovering novel ones. Learning is performed over a series of time steps where the model obtains new labeled and unlabeled data, and discards… ▽ More

    Submitted 7 December, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: This paper is accepted at ICCV 2023

  13. arXiv:2304.01008  [pdf, other

    cs.LG cs.AI cs.CL

    Self-Supervised Multimodal Learning: A Survey

    Authors: Yongshuo Zong, Oisin Mac Aodha, Timothy Hospedales

    Abstract: Multimodal learning, which aims to understand and analyze information from multiple modalities, has achieved substantial progress in the supervised regime in recent years. However, the heavy dependence on data paired with expensive human annotations impedes scaling up models. Meanwhile, given the availability of large-scale unannotated data in the wild, self-supervised learning has become an attra… ▽ More

    Submitted 4 August, 2023; v1 submitted 31 March, 2023; originally announced April 2023.

  14. arXiv:2303.13514  [pdf, other

    cs.CV

    SAOR: Single-View Articulated Object Reconstruction

    Authors: Mehmet Aygün, Oisin Mac Aodha

    Abstract: We introduce SAOR, a novel approach for estimating the 3D shape, texture, and viewpoint of an articulated object from a single image captured in the wild. Unlike prior approaches that rely on pre-defined category-specific 3D templates or tailored 3D skeletons, SAOR learns to articulate shapes from single-view image collections with a skeleton-free part-based model without requiring any 3D object s… ▽ More

    Submitted 8 April, 2024; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2024, website: https://mehmetaygun.github.io/saor

  15. arXiv:2301.07088  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Vision Learners Meet Web Image-Text Pairs

    Authors: Bingchen Zhao, Quan Cui, Hao Wu, Osamu Yoshie, Cheng Yang, Oisin Mac Aodha

    Abstract: Most recent self-supervised learning methods are pre-trained on the well-curated ImageNet-1K dataset. In this work, given the excellent scalability of web data, we consider self-supervised pre-training on noisy web sourced image-text paired data. First, we conduct a benchmark study of representative self-supervised pre-training methods on large-scale web data in a like-for-like setting. We compare… ▽ More

    Submitted 5 April, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

    Comments: Project page: https://bzhao.me/MUG/

  16. arXiv:2212.00436  [pdf, other

    cs.CV

    ViewNeRF: Unsupervised Viewpoint Estimation Using Category-Level Neural Radiance Fields

    Authors: Octave Mariotti, Oisin Mac Aodha, Hakan Bilen

    Abstract: We introduce ViewNeRF, a Neural Radiance Field-based viewpoint estimation method that learns to predict category-level viewpoints directly from images during training. While NeRF is usually trained with ground-truth camera poses, multiple extensions have been proposed to reduce the need for this expensive supervision. Nonetheless, most of these methods still struggle in complex settings with large… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Journal ref: Proceedings of the 33rd British Machine Vision Conference, BMVC 2022

  17. arXiv:2212.00435  [pdf, other

    cs.CV

    ViewNet: Unsupervised Viewpoint Estimation from Conditional Generation

    Authors: Octave Mariotti, Oisin Mac Aodha, Hakan Bilen

    Abstract: Understanding the 3D world without supervision is currently a major challenge in computer vision as the annotations required to supervise deep networks for tasks in this domain are expensive to obtain on a large scale. In this paper, we address the problem of unsupervised viewpoint estimation. We formulate this as a self-supervised learning task, where image reconstruction provides the supervision… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 10418-10428

  18. arXiv:2210.04933  [pdf, other

    cs.CV

    An Action Is Worth Multiple Words: Handling Ambiguity in Action Recognition

    Authors: Kiyoon Kim, Davide Moltisanti, Oisin Mac Aodha, Laura Sevilla-Lara

    Abstract: Precisely naming the action depicted in a video can be a challenging and oftentimes ambiguous task. In contrast to object instances represented as nouns (e.g. dog, cat, chair, etc.), in the case of actions, human annotators typically lack a consensus as to what constitutes a specific action (e.g. jogging versus running). In practice, a given video can contain multiple valid positive annotations fo… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: BMVC 2022

  19. arXiv:2210.03794  [pdf, other

    cs.CV

    SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models

    Authors: Omiros Pantazis, Gabriel Brostow, Kate Jones, Oisin Mac Aodha

    Abstract: Vision-language models such as CLIP are pretrained on large volumes of internet sourced image and text pairs, and have been shown to sometimes exhibit impressive zero- and low-shot image classification performance. However, due to their size, fine-tuning these models on new datasets can be prohibitively expensive, both in terms of the supervision and compute required. To combat this, a series of l… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: BMVC 2022

  20. arXiv:2207.10664  [pdf, other

    cs.CV cs.LG

    Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset

    Authors: Grant Van Horn, Rui Qian, Kimberly Wilber, Hartwig Adam, Oisin Mac Aodha, Serge Belongie

    Abstract: We present a new benchmark dataset, Sapsucker Woods 60 (SSW60), for advancing research on audiovisual fine-grained categorization. While our community has made great strides in fine-grained visual categorization on images, the counterparts in audio and video fine-grained categorization are relatively unexplored. To encourage advancements in this space, we have carefully constructed the SSW60 datas… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

    Comments: ECCV 2022 Camera Ready

  21. arXiv:2207.10225  [pdf, other

    cs.CV cs.LG

    On Label Granularity and Object Localization

    Authors: Elijah Cole, Kimberly Wilber, Grant Van Horn, Xuan Yang, Marco Fornoni, Pietro Perona, Serge Belongie, Andrew Howard, Oisin Mac Aodha

    Abstract: Weakly supervised object localization (WSOL) aims to learn representations that encode object location using only image-level category labels. However, many objects can be labeled at different levels of granularity. Is it an animal, a bird, or a great horned owl? Which image-level labels should we use? In this paper we study the role of label granularity in WSOL. To facilitate this investigation w… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: ECCV 2022

  22. arXiv:2207.10157  [pdf, other

    cs.CV cs.HC

    Visual Knowledge Tracing

    Authors: Neehar Kondapaneni, Pietro Perona, Oisin Mac Aodha

    Abstract: Each year, thousands of people learn new visual categorization tasks -- radiologists learn to recognize tumors, birdwatchers learn to distinguish similar species, and crowd workers learn how to annotate valuable data for applications like autonomous driving. As humans learn, their brain updates the visual features it extracts and attend to, which ultimately informs their final classification decis… ▽ More

    Submitted 21 July, 2022; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: 14 pages, 4 figures, 14 supplemental pages, 11 supplemental figures, accepted to European Conference on Computer Vision (ECCV) 2022

  23. arXiv:2207.05054  [pdf, other

    cs.CV cs.LG

    Demystifying Unsupervised Semantic Correspondence Estimation

    Authors: Mehmet Aygün, Oisin Mac Aodha

    Abstract: We explore semantic correspondence estimation through the lens of unsupervised learning. We thoroughly evaluate several recently proposed unsupervised methods across multiple challenging datasets using a standardized evaluation protocol where we vary factors such as the backbone architecture, the pre-training strategy, and the pre-training and finetuning datasets. To better understand the failure… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: ECCV22, project page https://mehmetaygun.github.io/demistfy.html

  24. arXiv:2201.10394  [pdf, other

    cs.CV

    Capturing Temporal Information in a Single Frame: Channel Sampling Strategies for Action Recognition

    Authors: Kiyoon Kim, Shreyank N Gowda, Oisin Mac Aodha, Laura Sevilla-Lara

    Abstract: We address the problem of capturing temporal information for video classification in 2D networks, without increasing their computational cost. Existing approaches focus on modifying the architecture of 2D networks (e.g. by including filters in the temporal dimension to turn them into 3D networks, or using optical flow, etc.), which increases computation cost. Instead, we propose a novel sampling s… ▽ More

    Submitted 10 October, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: BMVC 2022

  25. arXiv:2111.06119  [pdf, other

    cs.CV cs.LG

    Fine-Grained Image Analysis with Deep Learning: A Survey

    Authors: Xiu-Shen Wei, Yi-Zhe Song, Oisin Mac Aodha, Jianxin Wu, Yuxin Peng, **hui Tang, Jian Yang, Serge Belongie

    Abstract: Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer vision and pattern recognition, and underpins a diverse set of real-world applications. The task of FGIA targets analyzing visual objects from subordinate categories, e.g., species of birds or models of cars. The small inter-class and large intra-class variation inherent to fine-grained image analysis makes it… ▽ More

    Submitted 19 November, 2021; v1 submitted 11 November, 2021; originally announced November 2021.

    Comments: Accepted by IEEE TPAMI

  26. arXiv:2108.06435  [pdf, other

    cs.CV cs.LG

    Focus on the Positives: Self-Supervised Learning for Biodiversity Monitoring

    Authors: Omiros Pantazis, Gabriel Brostow, Kate Jones, Oisin Mac Aodha

    Abstract: We address the problem of learning self-supervised representations from unlabeled image collections. Unlike existing approaches that attempt to learn useful features by maximizing similarity between augmented versions of each input image or by speculatively picking negative samples, we instead also make use of the natural variation that occurs in image collections that are captured using static mo… ▽ More

    Submitted 13 August, 2021; originally announced August 2021.

    Comments: ICCV 2021

  27. arXiv:2106.09708  [pdf, other

    cs.CV cs.LG

    Multi-Label Learning from Single Positive Labels

    Authors: Elijah Cole, Oisin Mac Aodha, Titouan Lorieul, Pietro Perona, Dan Morris, Nebojsa Jojic

    Abstract: Predicting all applicable labels for a given image is known as multi-label classification. Compared to the standard multi-class case (where each image has only one label), it is considerably more challenging to annotate training data for multi-label classification. When the number of potential labels is large, human annotators find it difficult to mention all applicable labels for each training im… ▽ More

    Submitted 22 October, 2021; v1 submitted 17 June, 2021; originally announced June 2021.

    Comments: CVPR 2021. Supplementary material included

  28. arXiv:2105.05837  [pdf, other

    cs.CV cs.LG

    When Does Contrastive Visual Representation Learning Work?

    Authors: Elijah Cole, Xuan Yang, Kimberly Wilber, Oisin Mac Aodha, Serge Belongie

    Abstract: Recent self-supervised representation learning techniques have largely closed the gap between supervised and unsupervised learning on ImageNet classification. While the particulars of pretraining on ImageNet are now relatively well understood, the field still lacks widely accepted best practices for replicating this success on other datasets. As a first step in this direction, we study contrastive… ▽ More

    Submitted 4 April, 2022; v1 submitted 12 May, 2021; originally announced May 2021.

    Comments: CVPR 2022

  29. arXiv:2104.14540  [pdf, other

    cs.CV

    The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth

    Authors: Jamie Watson, Oisin Mac Aodha, Victor Prisacariu, Gabriel Brostow, Michael Firman

    Abstract: Self-supervised monocular depth estimation networks are trained to predict scene depth using nearby frames as a supervision signal during training. However, for many applications, sequence information in the form of video frames is also available at test time. The vast majority of monocular networks do not make use of this extra signal, thus ignoring valuable information that could be used to impr… ▽ More

    Submitted 14 July, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

    Comments: CVPR 2021

  30. arXiv:2103.16483  [pdf, other

    cs.CV

    Benchmarking Representation Learning for Natural World Image Collections

    Authors: Grant Van Horn, Elijah Cole, Sara Beery, Kimberly Wilber, Serge Belongie, Oisin Mac Aodha

    Abstract: Recent progress in self-supervised learning has resulted in models that are capable of extracting rich representations from image collections without requiring any explicit label supervision. However, to date the vast majority of these approaches have restricted themselves to training on standard benchmark datasets such as ImageNet. We argue that fine-grained visual categorization problems, such a… ▽ More

    Submitted 8 June, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: CVPR 2021

  31. arXiv:2008.01484  [pdf, other

    cs.CV

    Learning Stereo from Single Images

    Authors: Jamie Watson, Oisin Mac Aodha, Daniyar Turmukhambetov, Gabriel J. Brostow, Michael Firman

    Abstract: Supervised deep networks are among the best methods for finding correspondences in stereo image pairs. Like all supervised approaches, these networks require ground truth data during training. However, collecting large quantities of accurate dense correspondence data is very challenging. We propose that it is unnecessary to have such a high reliance on ground truth depths or even corresponding ste… ▽ More

    Submitted 20 August, 2020; v1 submitted 4 August, 2020; originally announced August 2020.

    Comments: Accepted as an oral presentation at ECCV 2020

  32. arXiv:2002.01708  [pdf, other

    cs.CV

    Geocoding of trees from street addresses and street-level images

    Authors: Daniel Laumer, Nico Lang, Natalie van Doorn, Oisin Mac Aodha, Pietro Perona, Jan Dirk Wegner

    Abstract: We introduce an approach for updating older tree inventories with geographic coordinates using street-level panorama images and a global optimization framework for tree instance matching. Geolocations of trees in inventories until the early 2000s where recorded using street addresses whereas newer inventories use GPS. Our method retrofits older inventories with geographic coordinates to allow conn… ▽ More

    Submitted 5 February, 2020; originally announced February 2020.

    Comments: Accepted for publication in ISPRS Journal of Photogrammetry and Remote Sensing

  33. arXiv:1906.05272  [pdf, other

    cs.CV cs.LG

    Presence-Only Geographical Priors for Fine-Grained Image Classification

    Authors: Oisin Mac Aodha, Elijah Cole, Pietro Perona

    Abstract: Appearance information alone is often not sufficient to accurately differentiate between fine-grained visual categories. Human experts make use of additional cues such as where, and when, a given image was taken in order to inform their final decision. This contextual information is readily available in many online image collections but has been underutilized by existing image classifiers that foc… ▽ More

    Submitted 28 October, 2019; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: ICCV 2019

  34. arXiv:1904.05986  [pdf, other

    cs.CV

    The iWildCam 2018 Challenge Dataset

    Authors: Sara Beery, Grant van Horn, Oisin Mac Aodha, Pietro Perona

    Abstract: Camera traps are a valuable tool for studying biodiversity, but research using this data is limited by the speed of human annotation. With the vast amounts of data now available it is imperative that we develop automatic solutions for annotating camera trap data in order to allow this research to scale. A promising approach is based on deep networks trained on human-annotated images. We provide a… ▽ More

    Submitted 24 April, 2019; v1 submitted 11 April, 2019; originally announced April 2019.

    Comments: Challenge hosted at the fifth Fine-Grained Visual Categorization Workshop (FGVC5) at CVPR 2018

  35. arXiv:1806.01260  [pdf, other

    cs.CV stat.ML

    Digging Into Self-Supervised Monocular Depth Estimation

    Authors: Clément Godard, Oisin Mac Aodha, Michael Firman, Gabriel Brostow

    Abstract: Per-pixel ground-truth depth data is challenging to acquire at scale. To overcome this limitation, self-supervised learning has emerged as a promising alternative for training models to perform monocular depth estimation. In this paper, we propose a set of improvements, which together result in both quantitatively and qualitatively improved depth maps compared to competing self-supervised methods.… ▽ More

    Submitted 17 August, 2019; v1 submitted 4 June, 2018; originally announced June 2018.

    Comments: ICCV 19

  36. arXiv:1805.08322  [pdf, other

    cs.AI cs.LG

    Teaching Multiple Concepts to a Forgetful Learner

    Authors: Anette Hunziker, Yuxin Chen, Oisin Mac Aodha, Manuel Gomez Rodriguez, Andreas Krause, Pietro Perona, Yisong Yue, Adish Singla

    Abstract: How can we help a forgetful learner learn multiple concepts within a limited time frame? While there have been extensive studies in designing optimal schedules for teaching a single concept given a learner's memory model, existing approaches for teaching multiple concepts are typically based on heuristic scheduling techniques without theoretical guarantees. In this paper, we look at the problem fr… ▽ More

    Submitted 25 October, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: NeurIPS 2019

  37. arXiv:1805.06880  [pdf, other

    cs.CV

    It's all Relative: Monocular 3D Human Pose Estimation from Weakly Supervised Data

    Authors: Matteo Ruggero Ronchi, Oisin Mac Aodha, Robert Eng, Pietro Perona

    Abstract: We address the problem of 3D human pose estimation from 2D input images using only weakly supervised training data. Despite showing considerable success for 2D pose estimation, the application of supervised machine learning to 3D pose estimation in real world images is currently hampered by the lack of varied training images with corresponding 3D poses. Most existing 3D pose estimation algorithms… ▽ More

    Submitted 27 July, 2018; v1 submitted 17 May, 2018; originally announced May 2018.

    Comments: BMVC 2018. Project page available at http://www.vision.caltech.edu/~mronchi/projects/RelativePose

  38. arXiv:1802.06924  [pdf, other

    cs.CV cs.LG stat.ML

    Teaching Categories to Human Learners with Visual Explanations

    Authors: Oisin Mac Aodha, Shihan Su, Yuxin Chen, Pietro Perona, Yisong Yue

    Abstract: We study the problem of computer-assisted teaching with explanations. Conventional approaches for machine teaching typically only provide feedback at the instance level e.g., the category or label of the instance. However, it is intuitive that clear explanations from a knowledgeable teacher can significantly improve a student's ability to learn a new concept. To address these existing limitations,… ▽ More

    Submitted 19 February, 2018; originally announced February 2018.

  39. arXiv:1802.05190  [pdf, other

    cs.LG

    Understanding the Role of Adaptivity in Machine Teaching: The Case of Version Space Learners

    Authors: Yuxin Chen, Adish Singla, Oisin Mac Aodha, Pietro Perona, Yisong Yue

    Abstract: In real-world applications of education, an effective teacher adaptively chooses the next example to teach based on the learner's current state. However, most existing work in algorithmic machine teaching focuses on the batch setting, where adaptivity plays no role. In this paper, we study the case of teaching consistent, version space learners in an interactive setting. At any time step, the teac… ▽ More

    Submitted 8 December, 2018; v1 submitted 14 February, 2018; originally announced February 2018.

    Comments: NeurIPS 2018 (extended version)

  40. arXiv:1710.01691  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Context Embedding Networks

    Authors: Kun Ho Kim, Oisin Mac Aodha, Pietro Perona

    Abstract: Low dimensional embeddings that capture the main variations of interest in collections of data are important for many applications. One way to construct these embeddings is to acquire estimates of similarity from the crowd. However, similarity is a multi-dimensional concept that varies from individual to individual. Existing models for learning embeddings from the crowd typically make simplifying… ▽ More

    Submitted 29 March, 2018; v1 submitted 22 September, 2017; originally announced October 2017.

    Comments: CVPR 2018 spotlight

  41. arXiv:1707.06642  [pdf, other

    cs.CV

    The iNaturalist Species Classification and Detection Dataset

    Authors: Grant Van Horn, Oisin Mac Aodha, Yang Song, Yin Cui, Chen Sun, Alex Shepard, Hartwig Adam, Pietro Perona, Serge Belongie

    Abstract: Existing image classification datasets used in computer vision tend to have a uniform distribution of images across object categories. In contrast, the natural world is heavily imbalanced, as some species are more abundant and easier to photograph than others. To encourage further progress in challenging real world conditions we present the iNaturalist species classification and detection dataset,… ▽ More

    Submitted 10 April, 2018; v1 submitted 20 July, 2017; originally announced July 2017.

    Comments: CVPR 2018

  42. arXiv:1609.03677  [pdf, other

    cs.CV cs.LG stat.ML

    Unsupervised Monocular Depth Estimation with Left-Right Consistency

    Authors: Clément Godard, Oisin Mac Aodha, Gabriel J. Brostow

    Abstract: Learning based methods have shown very promising results for the task of depth estimation in single images. However, most existing approaches treat depth prediction as a supervised regression problem and as a result, require vast quantities of corresponding ground truth depth data for training. Just recording quality depth data in a range of environments is a challenging problem. In this paper, we… ▽ More

    Submitted 12 April, 2017; v1 submitted 13 September, 2016; originally announced September 2016.

    Comments: CVPR 2017 oral

  43. arXiv:1504.08219  [pdf, other

    cs.CV cs.LG stat.ML

    Hierarchical Subquery Evaluation for Active Learning on a Graph

    Authors: Oisin Mac Aodha, Neill D. F. Campbell, Jan Kautz, Gabriel J. Brostow

    Abstract: To train good supervised and semi-supervised object classifiers, it is critical that we not waste the time of the human experts who are providing the training labels. Existing active learning strategies can have uneven performance, being efficient on some datasets but wasteful on others, or inconsistent just between runs on the same dataset. We propose perplexity based graph construction and a new… ▽ More

    Submitted 30 April, 2015; originally announced April 2015.

    Comments: CVPR 2014

  44. arXiv:1504.07575  [pdf, other

    cs.CV cs.LG stat.ML

    Becoming the Expert - Interactive Multi-Class Machine Teaching

    Authors: Edward Johns, Oisin Mac Aodha, Gabriel J. Brostow

    Abstract: Compared to machines, humans are extremely good at classifying images into categories, especially when they possess prior knowledge of the categories at hand. If this prior information is not available, supervision in the form of teaching images is required. To learn categories more quickly, people should see important and representative images first, followed by less important images later - or n… ▽ More

    Submitted 28 April, 2015; originally announced April 2015.

    Comments: CVPR 2015