Showing 1–10 of 10 results for author: Wilber, K

  1. arXiv:2311.05770

    cs.CV

    PolyMaX: General Dense Prediction with Mask Transformer

    Authors: Xuan Yang, Liangzhe Yuan, Kimberly Wilber, Astuti Sharma, Xiuye Gu, Siyuan Qiao, Stephanie Debats, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Liang-Chieh Chen

    Abstract: Dense prediction tasks, such as semantic segmentation, depth estimation, and surface normal prediction, can be easily formulated as per-pixel classification (discrete outputs) or regression (continuous outputs). This per-pixel prediction paradigm has remained popular due to the prevalence of fully convolutional networks. However, on the recent frontier of segmentation task, the community has been…

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: WACV 2024

  2. arXiv:2309.12172

    cs.CV

    SANPO: A Scene Understanding, Accessibility, Navigation, Pathfinding, Obstacle Avoidance Dataset

    Authors: Sagar M. Waghmare, Kimberly Wilber, Dave Hawkey, Xuan Yang, Matthew Wilson, Stephanie Debats, Cattalyya Nuengsigkapian, Astuti Sharma, Lars Pandikow, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko

    Abstract: We introduce SANPO, a large-scale egocentric video dataset focused on dense prediction in outdoor environments. It contains stereo video sessions collected across diverse outdoor environments, as well as rendered synthetic video sessions. (Synthetic data was provided by Parallel Domain.) All sessions have (dense) depth and odometry labels. All synthetic sessions and a subset of real sessions have…

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 10 pages plus additional references. 13 figures

  3. arXiv:2207.10664

    cs.CV cs.LG

    Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset

    Authors: Grant Van Horn, Rui Qian, Kimberly Wilber, Hartwig Adam, Oisin Mac Aodha, Serge Belongie

    Abstract: We present a new benchmark dataset, Sapsucker Woods 60 (SSW60), for advancing research on audiovisual fine-grained categorization. While our community has made great strides in fine-grained visual categorization on images, the counterparts in audio and video fine-grained categorization are relatively unexplored. To encourage advancements in this space, we have carefully constructed the SSW60 datas…

    Submitted 21 July, 2022; originally announced July 2022.

    Comments: ECCV 2022 Camera Ready

  4. arXiv:2207.10225

    cs.CV cs.LG

    On Label Granularity and Object Localization

    Authors: Elijah Cole, Kimberly Wilber, Grant Van Horn, Xuan Yang, Marco Fornoni, Pietro Perona, Serge Belongie, Andrew Howard, Oisin Mac Aodha

    Abstract: Weakly supervised object localization (WSOL) aims to learn representations that encode object location using only image-level category labels. However, many objects can be labeled at different levels of granularity. Is it an animal, a bird, or a great horned owl? Which image-level labels should we use? In this paper we study the role of label granularity in WSOL. To facilitate this investigation w…

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: ECCV 2022

  5. arXiv:2106.10258

    cs.CV cs.AI cs.LG

    Bridging the Gap Between Object Detection and User Intent via Query-Modulation

    Authors: Marco Fornoni, Chaochao Yan, Liangchen Luo, Kimberly Wilber, Alex Stark, Yin Cui, Boqing Gong, Andrew Howard

    Abstract: When interacting with objects through cameras, or pictures, users often have a specific intent. For example, they may want to perform a visual search. With most object detection models relying on image pixels as their sole input, undesired results are not uncommon. Most typically: lack of a high-confidence detection on the object of interest, or detection with a wrong class label. The issue is esp…

    Submitted 3 August, 2022; v1 submitted 18 June, 2021; originally announced June 2021.

  6. arXiv:2105.05837

    cs.CV cs.LG

    When Does Contrastive Visual Representation Learning Work?

    Authors: Elijah Cole, Xuan Yang, Kimberly Wilber, Oisin Mac Aodha, Serge Belongie

    Abstract: Recent self-supervised representation learning techniques have largely closed the gap between supervised and unsupervised learning on ImageNet classification. While the particulars of pretraining on ImageNet are now relatively well understood, the field still lacks widely accepted best practices for replicating this success on other datasets. As a first step in this direction, we study contrastive…

    Submitted 4 April, 2022; v1 submitted 12 May, 2021; originally announced May 2021.

    Comments: CVPR 2022

  7. arXiv:2103.16483

    cs.CV

    Benchmarking Representation Learning for Natural World Image Collections

    Authors: Grant Van Horn, Elijah Cole, Sara Beery, Kimberly Wilber, Serge Belongie, Oisin Mac Aodha

    Abstract: Recent progress in self-supervised learning has resulted in models that are capable of extracting rich representations from image collections without requiring any explicit label supervision. However, to date the vast majority of these approaches have restricted themselves to training on standard benchmark datasets such as ImageNet. We argue that fine-grained visual categorization problems, such a…

    Submitted 8 June, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: CVPR 2021

  8. arXiv:2102.03349

    cs.LG

    On the Reproducibility of Neural Network Predictions

    Authors: Srinadh Bhojanapalli, Kimberly Wilber, Andreas Veit, Ankit Singh Rawat, Seungyeon Kim, Aditya Menon, Sanjiv Kumar

    Abstract: Standard training techniques for neural networks involve multiple sources of randomness, e.g., initialization, mini-batch ordering and in some cases data augmentation. Given that neural networks are heavily over-parameterized in practice, such randomness can cause churn -- for the same input, disagreements between predictions of the two models independently trained by the same algorithm, con…

    Submitted 5 February, 2021; originally announced February 2021.

    Comments: 19 pages, 7 figures

  9. arXiv:2011.08824

    cs.LG

    Improving Calibration in Deep Metric Learning With Cross-Example Softmax

    Authors: Andreas Veit, Kimberly Wilber

    Abstract: Modern image retrieval systems increasingly rely on the use of deep neural networks to learn embedding spaces in which distance encodes the relevance between a given query and image. In this setting, existing approaches tend to emphasize one of two properties. Triplet-based methods capture top-$k$ relevancy, where all top-$k$ scoring documents are assumed to be relevant to a given query. Pairwise c…

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: 9 pages

  10. arXiv:1811.10648

    cs.CV

    Understanding Image Quality and Trust in Peer-to-Peer Marketplaces

    Authors: Xiao Ma, Lina Mezghani, Kimberly Wilber, Hui Hong, Robinson Piramuthu, Mor Naaman, Serge Belongie

    Abstract: As any savvy online shopper knows, second-hand peer-to-peer marketplaces are filled with images of mixed quality. How does image quality impact marketplace outcomes, and can quality be automatically predicted? In this work, we conducted a large-scale study on the quality of user-generated images in peer-to-peer marketplaces. By gathering a dataset of common second-hand products (~75,000 images) an…

    Submitted 26 November, 2018; originally announced November 2018.

    Comments: WACV 2019

    ACM Class: I.4; H.5; I.2.10