Skip to main content

Showing 1–11 of 11 results for author: Collins, M D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2106.09748  [pdf, other

    cs.CV

    DeepLab2: A TensorFlow Library for Deep Labeling

    Authors: Mark Weber, Huiyu Wang, Siyuan Qiao, Jun Xie, Maxwell D. Collins, Yukun Zhu, Liangzhe Yuan, Dahun Kim, Qihang Yu, Daniel Cremers, Laura Leal-Taixe, Alan L. Yuille, Florian Schroff, Hartwig Adam, Liang-Chieh Chen

    Abstract: DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a state-of-the-art and easy-to-use TensorFlow codebase for general dense pixel prediction problems in computer vision. DeepLab2 includes all our recently developed DeepLab model variants with pretrained checkpoints as well as model training and evaluation code, allowing the community to reproduce and further improve upon the sta… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 4-page technical report. The first three authors contributed equally to this work

  2. arXiv:2005.10266  [pdf, other

    cs.CV

    Naive-Student: Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation

    Authors: Liang-Chieh Chen, Raphael Gontijo Lopes, Bowen Cheng, Maxwell D. Collins, Ekin D. Cubuk, Barret Zoph, Hartwig Adam, Jonathon Shlens

    Abstract: Supervised learning in large discriminative models is a mainstay for modern computer vision. Such an approach necessitates investing in large-scale human-annotated datasets for achieving state-of-the-art results. In turn, the efficacy of supervised learning may be limited by the size of the human annotated dataset. This limitation is particularly notable for image segmentation tasks, where the exp… ▽ More

    Submitted 19 July, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: Accepted to ECCV 2020

  3. arXiv:1911.10194  [pdf, other

    cs.CV

    Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation

    Authors: Bowen Cheng, Maxwell D. Collins, Yukun Zhu, Ting Liu, Thomas S. Huang, Hartwig Adam, Liang-Chieh Chen

    Abstract: In this work, we introduce Panoptic-DeepLab, a simple, strong, and fast system for panoptic segmentation, aiming to establish a solid baseline for bottom-up methods that can achieve comparable performance of two-stage methods while yielding fast inference speed. In particular, Panoptic-DeepLab adopts the dual-ASPP and dual-decoder structures specific to semantic, and instance segmentation, respect… ▽ More

    Submitted 11 March, 2020; v1 submitted 22 November, 2019; originally announced November 2019.

    Comments: CVPR 2020

  4. arXiv:1910.06962  [pdf, other

    cs.CV cs.LG eess.IV

    SegSort: Segmentation by Discriminative Sorting of Segments

    Authors: Jyh-**g Hwang, Stella X. Yu, Jianbo Shi, Maxwell D. Collins, Tien-Ju Yang, Xiao Zhang, Liang-Chieh Chen

    Abstract: Almost all existing deep learning approaches for semantic segmentation tackle this task as a pixel-wise classification problem. Yet humans understand a scene not in terms of pixels, but by decomposing it into perceptual groups and structures that are the basic building blocks of recognition. This motivates us to propose an end-to-end pixel-wise metric learning approach that mimics this process. In… ▽ More

    Submitted 30 October, 2019; v1 submitted 15 October, 2019; originally announced October 2019.

    Comments: In ICCV 2019. Webpage & Code: https://jyh**ghwang.github.io/projects/segsort.html

  5. arXiv:1910.04751  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    Panoptic-DeepLab

    Authors: Bowen Cheng, Maxwell D. Collins, Yukun Zhu, Ting Liu, Thomas S. Huang, Hartwig Adam, Liang-Chieh Chen

    Abstract: We present Panoptic-DeepLab, a bottom-up and single-shot approach for panoptic segmentation. Our Panoptic-DeepLab is conceptually simple and delivers state-of-the-art results. In particular, we adopt the dual-ASPP and dual-decoder structures specific to semantic, and instance segmentation, respectively. The semantic segmentation branch is the same as the typical design of any semantic segmentation… ▽ More

    Submitted 23 October, 2019; v1 submitted 10 October, 2019; originally announced October 2019.

    Comments: This work is presented at ICCV 2019 Joint COCO and Mapillary Recognition Challenge Workshop

  6. arXiv:1902.05093  [pdf, other

    cs.CV

    DeeperLab: Single-Shot Image Parser

    Authors: Tien-Ju Yang, Maxwell D. Collins, Yukun Zhu, Jyh-**g Hwang, Ting Liu, Xiao Zhang, Vivienne Sze, George Papandreou, Liang-Chieh Chen

    Abstract: We present a single-shot, bottom-up approach for whole image parsing. Whole image parsing, also known as Panoptic Segmentation, generalizes the tasks of semantic segmentation for 'stuff' classes and instance segmentation for 'thing' classes, assigning both semantic and instance labels to every pixel in an image. Recent approaches to whole image parsing typically employ separate standalone modules… ▽ More

    Submitted 12 March, 2019; v1 submitted 13 February, 2019; originally announced February 2019.

    Comments: 20 pages. The code of the proposed Parsing Covering metric is available at http://deeperlab.mit.edu

  7. arXiv:1809.04184  [pdf, other

    cs.CV cs.LG stat.ML

    Searching for Efficient Multi-Scale Architectures for Dense Image Prediction

    Authors: Liang-Chieh Chen, Maxwell D. Collins, Yukun Zhu, George Papandreou, Barret Zoph, Florian Schroff, Hartwig Adam, Jonathon Shlens

    Abstract: The design of neural network architectures is an important component for achieving state-of-the-art performance with machine learning systems across a broad array of tasks. Much work has endeavored to design and build architectures automatically through clever construction of a search space paired with simple learning algorithms. Recent progress has demonstrated that such meta-learning methods may… ▽ More

    Submitted 11 September, 2018; originally announced September 2018.

    Comments: Accepted by NIPS 2018

  8. arXiv:1612.02297  [pdf, other

    cs.CV cs.LG

    Spatially Adaptive Computation Time for Residual Networks

    Authors: Michael Figurnov, Maxwell D. Collins, Yukun Zhu, Li Zhang, Jonathan Huang, Dmitry Vetrov, Ruslan Salakhutdinov

    Abstract: This paper proposes a deep learning architecture based on Residual Network that dynamically adjusts the number of executed layers for the regions of the image. This architecture is end-to-end trainable, deterministic and problem-agnostic. It is therefore applicable without any modifications to a wide range of computer vision problems such as image classification, object detection and image segment… ▽ More

    Submitted 2 July, 2017; v1 submitted 7 December, 2016; originally announced December 2016.

    Comments: CVPR 2017

  9. arXiv:1511.04056  [pdf, other

    cs.LG cs.CV

    Efficient non-greedy optimization of decision trees

    Authors: Mohammad Norouzi, Maxwell D. Collins, Matthew Johnson, David J. Fleet, Pushmeet Kohli

    Abstract: Decision trees and randomized forests are widely used in computer vision and machine learning. Standard algorithms for decision tree induction optimize the split functions one node at a time according to some splitting criteria. This greedy procedure often leads to suboptimal trees. In this paper, we present an algorithm for optimizing the split functions at all levels of the tree jointly with the… ▽ More

    Submitted 12 November, 2015; originally announced November 2015.

    Comments: in NIPS 2015

  10. arXiv:1506.06155  [pdf, other

    cs.LG cs.CV

    CO2 Forest: Improved Random Forest by Continuous Optimization of Oblique Splits

    Authors: Mohammad Norouzi, Maxwell D. Collins, David J. Fleet, Pushmeet Kohli

    Abstract: We propose a novel algorithm for optimizing multivariate linear threshold functions as split functions of decision trees to create improved Random Forest classifiers. Standard tree induction methods resort to sampling and exhaustive search to find good univariate split functions. In contrast, our method computes a linear combination of the features at each node, and optimizes the parameters of the… ▽ More

    Submitted 24 June, 2015; v1 submitted 19 June, 2015; originally announced June 2015.

  11. arXiv:1412.1442  [pdf, other

    cs.CV

    Memory Bounded Deep Convolutional Networks

    Authors: Maxwell D. Collins, Pushmeet Kohli

    Abstract: In this work, we investigate the use of sparsity-inducing regularizers during training of Convolution Neural Networks (CNNs). These regularizers encourage that fewer connections in the convolution and fully connected layers take non-zero values and in effect result in sparse connectivity between hidden units in the deep network. This in turn reduces the memory and runtime cost involved in deployin… ▽ More

    Submitted 3 December, 2014; originally announced December 2014.