Skip to main content

Showing 301–321 of 321 results for author: Yuille, A

.
  1. arXiv:1505.00276  [pdf, other

    cs.CV

    Joint Object and Part Segmentation using Deep Learned Potentials

    Authors: Peng Wang, Xiaohui Shen, Zhe Lin, Scott Cohen, Brian Price, Alan Yuille

    Abstract: Segmenting semantic objects from images and parsing them into their respective semantic parts are fundamental steps towards detailed object understanding in computer vision. In this paper, we propose a joint solution that tackles semantic object and part segmentation simultaneously, in which higher object-level context is provided to guide part segmentation, and more detailed part-level localizati… ▽ More

    Submitted 1 May, 2015; originally announced May 2015.

  2. arXiv:1504.06692  [pdf, other

    cs.CV cs.CL cs.LG

    Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images

    Authors: Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, Alan Yuille

    Abstract: In this paper, we address the task of learning novel visual concepts, and their interactions with other concepts, from a few images with sentence descriptions. Using linguistic context and visual features, our method is able to efficiently hypothesize the semantic meaning of new words and add them to its word dictionary so that they can be used to describe images which contain these novel concepts… ▽ More

    Submitted 1 October, 2015; v1 submitted 25 April, 2015; originally announced April 2015.

    Comments: ICCV 2015 camera ready version. We add much more novel visual concepts in the NVC dataset and have released it, see http://www.stat.ucla.edu/~junhua.mao/projects/child_learning.html

    ACM Class: I.2.6; I.2.7; I.2.10

  3. arXiv:1502.02734  [pdf, other

    cs.CV

    Weakly- and Semi-Supervised Learning of a DCNN for Semantic Image Segmentation

    Authors: George Papandreou, Liang-Chieh Chen, Kevin Murphy, Alan L. Yuille

    Abstract: Deep convolutional neural networks (DCNNs) trained on a large number of images with strong pixel-level annotations have recently significantly pushed the state-of-art in semantic image segmentation. We study the more challenging problem of learning DCNNs for semantic image segmentation from either (1) weakly annotated training data such as bounding boxes or image-level labels or (2) a combination… ▽ More

    Submitted 5 October, 2015; v1 submitted 9 February, 2015; originally announced February 2015.

    Comments: Accepted to ICCV 2015

  4. arXiv:1412.7062  [pdf, other

    cs.CV cs.LG cs.NE

    Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs

    Authors: Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, Alan L. Yuille

    Abstract: Deep Convolutional Neural Networks (DCNNs) have recently shown state of the art performance in high level vision tasks, such as image classification and object detection. This work brings together methods from DCNNs and probabilistic graphical models for addressing the task of pixel-level classification (also called "semantic image segmentation"). We show that responses at the final layer of DCNNs… ▽ More

    Submitted 7 June, 2016; v1 submitted 22 December, 2014; originally announced December 2014.

    Comments: 14 pages. Updated related work

  5. arXiv:1412.6632  [pdf, other

    cs.CV cs.CL cs.LG

    Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)

    Authors: Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, Alan Yuille

    Abstract: In this paper, we present a multimodal Recurrent Neural Network (m-RNN) model for generating novel image captions. It directly models the probability distribution of generating a word given previous words and an image. Image captions are generated by sampling from this distribution. The model consists of two sub-networks: a deep recurrent neural network for sentences and a deep convolutional netwo… ▽ More

    Submitted 11 June, 2015; v1 submitted 20 December, 2014; originally announced December 2014.

    Comments: Add a simple strategy to boost the performance of image captioning task significantly. More details are shown in Section 8 of the paper. The code and related data are available at https://github.com/mjhucla/mRNN-CR ;. arXiv admin note: substantial text overlap with arXiv:1410.1090

    ACM Class: I.2.6; I.2.7; I.2.10

    Journal ref: ICLR 2015

  6. arXiv:1412.6124  [pdf, other

    cs.CV

    Semantic Part Segmentation using Compositional Model combining Shape and Appearance

    Authors: Jianyu Wang, Alan Yuille

    Abstract: In this paper, we study the problem of semantic part segmentation for animals. This is more challenging than standard object detection, object segmentation and pose estimation tasks because semantic parts of animals often have similar appearance and highly varying shapes. To tackle these challenges, we build a mixture of compositional models to represent the object boundary and the boundaries of s… ▽ More

    Submitted 18 December, 2014; originally announced December 2014.

  7. arXiv:1412.4102  [pdf, other

    cs.CV

    Representing Data by a Mixture of Activated Simplices

    Authors: Chunyu Wang, John Flynn, Yizhou Wang, Alan L. Yuille

    Abstract: We present a new model which represents data as a mixture of simplices. Simplices are geometric structures that generalize triangles. We give a simple geometric understanding that allows us to learn a simplicial structure efficiently. Our method requires that the data are unit normalized (and thus lie on the unit sphere). We show that under this restriction, building a model with simplices amounts… ▽ More

    Submitted 12 December, 2014; originally announced December 2014.

  8. arXiv:1412.1526  [pdf, other

    cs.CV

    Parsing Occluded People by Flexible Compositions

    Authors: Xianjie Chen, Alan Yuille

    Abstract: This paper presents an approach to parsing humans when there is significant occlusion. We model humans using a graphical model which has a tree structure building on recent work [32, 6] and exploit the connectivity prior that, even in presence of occlusion, the visible nodes form a connected subtree of the graphical model. We call each connected subtree a flexible composition of object parts. This… ▽ More

    Submitted 24 November, 2015; v1 submitted 3 December, 2014; originally announced December 2014.

    Comments: CVPR 15 Camera Ready

  9. arXiv:1410.1090  [pdf, other

    cs.CV cs.CL cs.LG

    Explain Images with Multimodal Recurrent Neural Networks

    Authors: Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Alan L. Yuille

    Abstract: In this paper, we present a multimodal Recurrent Neural Network (m-RNN) model for generating novel sentence descriptions to explain the content of images. It directly models the probability distribution of generating a word given previous words and the image. Image descriptions are generated by sampling from this distribution. The model consists of two sub-networks: a deep recurrent neural network… ▽ More

    Submitted 4 October, 2014; originally announced October 2014.

    ACM Class: I.2.6; I.2.7; I.2.10

  10. arXiv:1407.3399  [pdf, other

    cs.CV

    Articulated Pose Estimation by a Graphical Model with Image Dependent Pairwise Relations

    Authors: Xianjie Chen, Alan Yuille

    Abstract: We present a method for estimating articulated human pose from a single static image based on a graphical model with novel pairwise relations that make adaptive use of local image measurements. More precisely, we specify a graphical model for human pose which exploits the fact the local image measurements can be used both to detect parts (or joints) and also to predict the spatial relationships be… ▽ More

    Submitted 4 November, 2014; v1 submitted 12 July, 2014; originally announced July 2014.

    Comments: NIPS 2014 Camera Ready

  11. arXiv:1407.2538  [pdf, other

    cs.LG

    Learning Deep Structured Models

    Authors: Liang-Chieh Chen, Alexander G. Schwing, Alan L. Yuille, Raquel Urtasun

    Abstract: Many problems in real-world applications involve predicting several random variables which are statistically related. Markov random fields (MRFs) are a great mathematical tool to encode such relationships. The goal of this paper is to combine MRFs with deep learning algorithms to estimate complex representations while taking into account the dependencies between the output random variables. Toward… ▽ More

    Submitted 27 April, 2015; v1 submitted 9 July, 2014; originally announced July 2014.

    Comments: 11 pages including reference

  12. arXiv:1406.3906  [pdf, other

    cs.CV

    Human-Machine CRFs for Identifying Bottlenecks in Holistic Scene Understanding

    Authors: Roozbeh Mottaghi, Sanja Fidler, Alan Yuille, Raquel Urtasun, Devi Parikh

    Abstract: Recent trends in image understanding have pushed for holistic scene understanding models that jointly reason about various tasks such as object detection, scene recognition, shape analysis, contextual reasoning, and local appearance based classifiers. In this work, we are interested in understanding the roles of these different tasks in improved scene understanding, in particular semantic segmenta… ▽ More

    Submitted 16 June, 2014; originally announced June 2014.

  13. arXiv:1406.2807  [pdf, other

    cs.CV

    The Secrets of Salient Object Segmentation

    Authors: Yin Li, Xiaodi Hou, Christof Koch, James M. Rehg, Alan L. Yuille

    Abstract: In this paper we provide an extensive evaluation of fixation prediction and salient object segmentation algorithms as well as statistics of major datasets. Our analysis identifies serious design flaws of existing salient object benchmarks, called the dataset design bias, by over emphasizing the stereotypical concepts of saliency. The dataset design bias does not only create the discomforting disco… ▽ More

    Submitted 12 June, 2014; v1 submitted 11 June, 2014; originally announced June 2014.

    Comments: 15 pages, 8 figures. Conference version was accepted by CVPR 2014

    Report number: CBMM Memmo #14

  14. arXiv:1406.2375  [pdf, other

    cs.CV

    Parsing Semantic Parts of Cars Using Graphical Models and Segment Appearance Consistency

    Authors: Wenhao Lu, Xiaochen Lian, Alan Yuille

    Abstract: This paper addresses the problem of semantic part parsing (segmentation) of cars, i.e.assigning every pixel within the car to one of the parts (e.g.body, window, lights, license plates and wheels). We formulate this as a landmark identification problem, where a set of landmarks specifies the boundaries of the parts. A novel mixture of graphical models is proposed, which dynamically couples the lan… ▽ More

    Submitted 11 June, 2014; v1 submitted 9 June, 2014; originally announced June 2014.

    Comments: 12 pages, CBMM memo

  15. arXiv:1406.2282  [pdf, other

    cs.CV

    Robust Estimation of 3D Human Poses from a Single Image

    Authors: Chunyu Wang, Yizhou Wang, Zhouchen Lin, Alan L. Yuille, Wen Gao

    Abstract: Human pose estimation is a key step to action recognition. We propose a method of estimating 3D human poses from a single image, which works in conjunction with an existing 2D pose/joint detector. 3D pose estimation is challenging because multiple 3D poses may correspond to the same 2D pose after projection due to the lack of depth information. Moreover, current 2D pose estimators are usually inac… ▽ More

    Submitted 9 June, 2014; originally announced June 2014.

  16. arXiv:1406.2031  [pdf, other

    cs.CV

    Detect What You Can: Detecting and Representing Objects using Holistic Models and Body Parts

    Authors: Xianjie Chen, Roozbeh Mottaghi, Xiaobai Liu, Sanja Fidler, Raquel Urtasun, Alan Yuille

    Abstract: Detecting objects becomes difficult when we need to deal with large shape deformation, occlusion and low resolution. We propose a novel approach to i) handle large deformations and partial occlusions in animals (as examples of highly deformable objects), ii) describe them in terms of body parts, and iii) detect them when their body parts are hard to detect (e.g., animals depicted at low resolution… ▽ More

    Submitted 8 June, 2014; originally announced June 2014.

    Comments: CBMM memo

  17. arXiv:1302.5985  [pdf, ps, other

    cs.CV

    A Meta-Theory of Boundary Detection Benchmarks

    Authors: Xiaodi Hou, Alan Yuille, Christof Koch

    Abstract: Human labeled datasets, along with their corresponding evaluation algorithms, play an important role in boundary detection. We here present a psychophysical experiment that addresses the reliability of such benchmarks. To find better remedies to evaluate the performance of any boundary detection algorithm, we propose a computational framework to remove inappropriate human labels and estimate the i… ▽ More

    Submitted 24 February, 2013; originally announced February 2013.

    Comments: NIPS 2012 Workshop on Human Computation for Science and Computational Sustainability

  18. arXiv:1301.3560  [pdf, other

    cs.CV

    Complexity of Representation and Inference in Compositional Models with Part Sharing

    Authors: Alan L. Yuille, Roozbeh Mottaghi

    Abstract: This paper describes serial and parallel compositional models of multiple objects with part sharing. Objects are built by part-subpart compositions and expressed in terms of a hierarchical dictionary of object parts. These parts are represented on lattices of decreasing sizes which yield an executive summary description. We describe inference and learning algorithms for these models. We analyze th… ▽ More

    Submitted 15 January, 2013; originally announced January 2013.

    Comments: ICLR 2013

  19. arXiv:1207.1417  [pdf

    cs.LG stat.ML

    The DLR Hierarchy of Approximate Inference

    Authors: Michal Rosen-Zvi, Michael I. Jordan, Alan Yuille

    Abstract: We propose a hierarchy for approximate inference based on the Dobrushin, Lanford, Ruelle (DLR) equations. This hierarchy includes existing algorithms, such as belief propagation, and also motivates novel algorithms such as factorized neighbors (FN) algorithms and variants of mean field (MF) algorithms. In particular, we show that extrema of the Bethe free energy correspond to approximate solutions… ▽ More

    Submitted 4 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)

    Report number: UAI-P-2005-PG-493-500

  20. Probabilistic Motion Estimation Based on Temporal Coherence

    Authors: Pierre-Yves Burgi, Alan L. Yuille, Norberto M. Grzywacz

    Abstract: We develop a theory for the temporal integration of visual motion motivated by psychophysical experiments. The theory proposes that input data are temporally grouped and used to predict and estimate the motion flows in the image sequence. This temporal grou** can be considered a generalization of the data association techniques used by engineers to study motion sequences. Our temporal-grou** t… ▽ More

    Submitted 5 January, 2012; originally announced January 2012.

    Comments: 40 pages, 7 figures

    Journal ref: Neural Computation, 2000, vol. 12, no. 8, p. 1839-1867

  21. arXiv:1107.4637  [pdf, ps, other

    cs.CV cs.IT stat.ML

    Efficient variational inference in large-scale Bayesian compressed sensing

    Authors: George Papandreou, Alan Yuille

    Abstract: We study linear models under heavy-tailed priors from a probabilistic viewpoint. Instead of computing a single sparse most probable (MAP) solution as in standard deterministic approaches, the focus in the Bayesian compressed sensing framework shifts towards capturing the full posterior distribution on the latent variables, which allows quantifying the estimation uncertainty and learning model para… ▽ More

    Submitted 4 September, 2011; v1 submitted 22 July, 2011; originally announced July 2011.

    Comments: 8 pages, 3 figures, appears in Proc. IEEE Workshop on Information Theory in Computer Vision and Pattern Recognition (in conjunction with ICCV-11), Barcelona, Spain, Nov. 2011

    Journal ref: Proc. IEEE Workshop on Information Theory in Computer Vision and Pattern Recognition (in conjunction with ICCV-11), pp. 1332-1339, Barcelona, Spain, Nov. 2011