Showing 51–58 of 58 results for author: Schwing, A G

  1. arXiv:1711.07068  [pdf, other]

    cs.CV

    Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space

    Authors: Liwei Wang, Alexander G. Schwing, Svetlana Lazebnik

    Abstract: This paper explores image caption generation using conditional variational auto-encoders (CVAEs). Standard CVAEs with a fixed Gaussian prior yield descriptions with too little variability. Instead, we propose two models that explicitly structure the latent space around $K$ components corresponding to different types of image content, and combine components to create priors for images that contain…

    Submitted 19 November, 2017; originally announced November 2017.

  2. arXiv:1711.04323  [pdf, other]

    cs.CV cs.AI cs.LG

    High-Order Attention Models for Visual Question Answering

    Authors: Idan Schwartz, Alexander G. Schwing, Tamir Hazan

    Abstract: The quest for algorithms that enable cognitive abilities is an important part of machine learning. A common trait in many recently investigated cognitive-like tasks is that they take into account different data modalities, such as visual and textual input. In this paper we propose a novel and generally applicable form of attention mechanism that learns high-order correlations between various data…

    Submitted 12 November, 2017; originally announced November 2017.

    Comments: 9 pages, 8 figures, NIPS 2017

  3. arXiv:1611.01606  [pdf, other]

    cs.LG stat.ML

    Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening

    Authors: Frank S. He, Yang Liu, Alexander G. Schwing, Jian Peng

    Abstract: We propose a novel training algorithm for reinforcement learning which combines the strength of deep Q-learning with a constrained optimization approach to tighten optimality and encourage faster reward propagation. Our novel technique makes deep reinforcement learning more practical by drastically reducing the training time. We evaluate the performance of our approach on the 49 games of the chall…

    Submitted 5 November, 2016; originally announced November 2016.

  4. arXiv:1607.07539  [pdf, other]

    cs.CV

    Semantic Image Inpainting with Deep Generative Models

    Authors: Raymond A. Yeh, Chen Chen, Teck Yian Lim, Alexander G. Schwing, Mark Hasegawa-Johnson, Minh N. Do

    Abstract: Semantic image inpainting is a challenging task where large missing regions have to be filled based on the available visual data. Existing methods which extract information from only a single image generally produce unsatisfactory results due to the lack of high level context. In this paper, we propose a novel method for semantic image inpainting, which generates the missing content by conditionin…

    Submitted 13 July, 2017; v1 submitted 26 July, 2016; originally announced July 2016.

  5. arXiv:1511.06411  [pdf, other]

    cs.LG

    Training Deep Neural Networks via Direct Loss Minimization

    Authors: Yang Song, Alexander G. Schwing, Richard S. Zemel, Raquel Urtasun

    Abstract: Supervised training of deep neural nets typically relies on minimizing cross-entropy. However, in many domains, we are interested in performing well on metrics specific to the application. In this paper we propose a direct loss minimization approach to train deep neural networks, which provably minimizes the application-specific loss function. This is often non-trivial, since these functions are n…

    Submitted 1 June, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: ICML2016

  6. arXiv:1505.03159  [pdf, other]

    cs.CV

    Monocular Object Instance Segmentation and Depth Ordering with CNNs

    Authors: Ziyu Zhang, Alexander G. Schwing, Sanja Fidler, Raquel Urtasun

    Abstract: In this paper we tackle the problem of instance-level segmentation and depth ordering from a single monocular image. Towards this goal, we take advantage of convolutional neural nets and train them to directly predict instance-level segmentations where the instance ID encodes the depth ordering within image patches. To provide a coherent single explanation of an image we develop a Markov random fi…

    Submitted 17 December, 2015; v1 submitted 12 May, 2015; originally announced May 2015.

    Comments: International Conference on Computer Vision (ICCV), 2015

  7. arXiv:1503.02351  [pdf, other]

    cs.CV cs.LG

    Fully Connected Deep Structured Networks

    Authors: Alexander G. Schwing, Raquel Urtasun

    Abstract: Convolutional neural networks with many layers have recently been shown to achieve excellent results on many high-level tasks such as image classification, object detection and more recently also semantic segmentation. Particularly for semantic segmentation, a two-stage procedure is often employed. Hereby, convolutional networks are trained to provide good local pixel-wise features for the second…

    Submitted 8 March, 2015; originally announced March 2015.

  8. arXiv:1407.2538  [pdf, other]

    cs.LG

    Learning Deep Structured Models

    Authors: Liang-Chieh Chen, Alexander G. Schwing, Alan L. Yuille, Raquel Urtasun

    Abstract: Many problems in real-world applications involve predicting several random variables which are statistically related. Markov random fields (MRFs) are a great mathematical tool to encode such relationships. The goal of this paper is to combine MRFs with deep learning algorithms to estimate complex representations while taking into account the dependencies between the output random variables. Toward…

    Submitted 27 April, 2015; v1 submitted 9 July, 2014; originally announced July 2014.

    Comments: 11 pages including references