Showing 51–100 of 113 results for author: Kohli, P

  1. arXiv:1811.09300  [pdf, other]

    cs.NE cs.CR cs.LG

    Strength in Numbers: Trading-off Robustness and Computation via Adversarially-Trained Ensembles

    Authors: Edward Grefenstette, Robert Stanforth, Brendan O'Donoghue, Jonathan Uesato, Grzegorz Swirszcz, Pushmeet Kohli

    Abstract: While deep learning has led to remarkable results on a number of challenging problems, researchers have discovered a vulnerability of neural networks in adversarial settings, where small but carefully chosen perturbations to the input can make the models produce extremely inaccurate outputs. This makes these models particularly unsuitable for safety-critical application domains (e.g. self-driving… ▽ More

    Submitted 22 November, 2018; originally announced November 2018.

    Comments: 12 pages

  2. arXiv:1810.12715  [pdf, other]

    cs.LG cs.CR stat.ML

    On the Effectiveness of Interval Bound Propagation for Training Verifiably Robust Models

    Authors: Sven Gowal, Krishnamurthy Dvijotham, Robert Stanforth, Rudy Bunel, Chongli Qin, Jonathan Uesato, Relja Arandjelovic, Timothy Mann, Pushmeet Kohli

    Abstract: Recent work has shown that it is possible to train deep neural networks that are provably robust to norm-bounded adversarial perturbations. Most of these methods are based on minimizing an upper bound on the worst-case loss over all possible adversarial perturbations. While these techniques show promise, they often result in difficult optimization procedures that remain hard to scale to larger net… ▽ More

    Submitted 29 August, 2019; v1 submitted 30 October, 2018; originally announced October 2018.

    Comments: [v2] Best paper at NeurIPS SECML 2018 Workshop [v4] Accepted at ICCV 2019 under the title "Scalable Verified Training for Provably Robust Image Classification"

  3. arXiv:1810.02338  [pdf, other]

    cs.AI cs.CL cs.CV cs.LG

    Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding

    Authors: Kexin Yi, Jiajun Wu, Chuang Gan, Antonio Torralba, Pushmeet Kohli, Joshua B. Tenenbaum

    Abstract: We marry two powerful ideas: deep representation learning for visual recognition and language understanding, and symbolic program execution for reasoning. Our neural-symbolic visual question answering (NS-VQA) system first recovers a structural scene representation from the image and a program trace from the question. It then executes the program on the scene representation to obtain an answer. In… ▽ More

    Submitted 14 January, 2019; v1 submitted 4 October, 2018; originally announced October 2018.

    Comments: NeurIPS 2018 (spotlight). The first two authors contributed equally to this work. Project page: http://nsvqa.csail.mit.edu

  4. arXiv:1806.01946  [pdf, other]

    cs.AI cs.LG

    Learning to Understand Goal Specifications by Modelling Reward

    Authors: Dzmitry Bahdanau, Felix Hill, Jan Leike, Edward Hughes, Arian Hosseini, Pushmeet Kohli, Edward Grefenstette

    Abstract: Recent work has shown that deep reinforcement-learning agents can learn to follow language-like instructions from infrequent environment rewards. However, this places on environment designers the onus of designing language-conditional reward functions which may not be easily or tractably implemented as the complexity of the environment and the language scales. To overcome this limitation, we prese… ▽ More

    Submitted 23 December, 2019; v1 submitted 5 June, 2018; originally announced June 2018.

    Comments: 19 pages, 9 figures

  5. arXiv:1806.01261  [pdf, other]

    cs.LG cs.AI stat.ML

    Relational inductive biases, deep learning, and graph networks

    Authors: Peter W. Battaglia, Jessica B. Hamrick, Victor Bapst, Alvaro Sanchez-Gonzalez, Vinicius Zambaldi, Mateusz Malinowski, Andrea Tacchetti, David Raposo, Adam Santoro, Ryan Faulkner, Caglar Gulcehre, Francis Song, Andrew Ballard, Justin Gilmer, George Dahl, Ashish Vaswani, Kelsey Allen, Charles Nash, Victoria Langston, Chris Dyer, Nicolas Heess, Daan Wierstra, Pushmeet Kohli, Matt Botvinick, Oriol Vinyals, et al. (2 additional authors not shown)

    Abstract: Artificial intelligence (AI) has undergone a renaissance recently, making major progress in key domains such as vision, language, control, and decision-making. This has been due, in part, to cheap data and cheap compute resources, which have fit the natural strengths of deep learning. However, many defining characteristics of human intelligence, which developed under much different pressures, rema… ▽ More

    Submitted 17 October, 2018; v1 submitted 4 June, 2018; originally announced June 2018.

  6. Enabling Pedestrian Safety using Computer Vision Techniques: A Case Study of the 2018 Uber Inc. Self-driving Car Crash

    Authors: Puneet Kohli, Anjali Chadha

    Abstract: Human lives are important. The decision to allow self-driving vehicles to operate on our roads carries great weight. This has been a hot topic of debate between policy-makers, technologists and public safety institutions. The recent Uber Inc. self-driving car crash, resulting in the death of a pedestrian, has strengthened the argument that autonomous vehicle technology is still not ready for deployme… ▽ More

    Submitted 30 May, 2018; originally announced May 2018.

    Comments: 10 pages, 8 figures, 3 tables

    Journal ref: Arai K., Bhatia R. (eds) Advances in Information and Communication. FICC 2019. Lecture Notes in Networks and Systems, vol 69. Springer, Cham

  7. arXiv:1805.11199  [pdf, other]

    cs.AI cs.LG

    Value Propagation Networks

    Authors: Nantas Nardelli, Gabriel Synnaeve, Zeming Lin, Pushmeet Kohli, Philip H. S. Torr, Nicolas Usunier

    Abstract: We present Value Propagation (VProp), a set of parameter-efficient differentiable planning modules built on Value Iteration which can successfully be trained using reinforcement learning to solve unseen tasks, has the capability to generalize to larger map sizes, and can learn to navigate in dynamic environments. We show that the modules enable learning to plan when the environment also includes s… ▽ More

    Submitted 25 March, 2019; v1 submitted 28 May, 2018; originally announced May 2018.

    Comments: Updated to match ICLR 2019 OpenReview's version

  8. arXiv:1805.10265  [pdf, other]

    cs.LG stat.ML

    Training verified learners with learned verifiers

    Authors: Krishnamurthy Dvijotham, Sven Gowal, Robert Stanforth, Relja Arandjelovic, Brendan O'Donoghue, Jonathan Uesato, Pushmeet Kohli

    Abstract: This paper proposes a new algorithmic framework, predictor-verifier training, to train neural networks that are verifiable, i.e., networks that provably satisfy some desired input-output properties. The key idea is to simultaneously train two networks: a predictor network that performs the task at hand, e.g., predicting labels given inputs, and a verifier network that computes a bound on how well t… ▽ More

    Submitted 29 May, 2018; v1 submitted 25 May, 2018; originally announced May 2018.

  9. arXiv:1805.09028  [pdf, other]

    cs.CV

    Efficient Relaxations for Dense CRFs with Sparse Higher Order Potentials

    Authors: Thomas Joy, Alban Desmaison, Thalaiyasingam Ajanthan, Rudy Bunel, Mathieu Salzmann, Pushmeet Kohli, Philip H. S. Torr, M. Pawan Kumar

    Abstract: Dense conditional random fields (CRFs) have become a popular framework for modelling several problems in computer vision such as stereo correspondence and multi-class semantic segmentation. By modelling long-range interactions, dense CRFs provide a labelling that captures finer detail than their sparse counterparts. Currently, the state-of-the-art algorithm performs mean-field inference using a fi… ▽ More

    Submitted 26 October, 2018; v1 submitted 23 May, 2018; originally announced May 2018.

  10. arXiv:1805.04276  [pdf, other]

    cs.LG stat.ML

    Leveraging Grammar and Reinforcement Learning for Neural Program Synthesis

    Authors: Rudy Bunel, Matthew Hausknecht, Jacob Devlin, Rishabh Singh, Pushmeet Kohli

    Abstract: Program synthesis is the task of automatically generating a program consistent with a specification. Recent years have seen proposal of a number of neural approaches for program synthesis, many of which adopt a sequence generation paradigm similar to neural machine translation, in which sequence-to-sequence models are trained to maximize the likelihood of known reference programs. While achieving… ▽ More

    Submitted 22 May, 2018; v1 submitted 11 May, 2018; originally announced May 2018.

    Comments: ICLR 2018

  11. arXiv:1804.02477  [pdf, ps, other]

    cs.LG cs.AI cs.PL stat.ML

    Programmatically Interpretable Reinforcement Learning

    Authors: Abhinav Verma, Vijayaraghavan Murali, Rishabh Singh, Pushmeet Kohli, Swarat Chaudhuri

    Abstract: We present a reinforcement learning framework, called Programmatically Interpretable Reinforcement Learning (PIRL), that is designed to generate interpretable and verifiable agent policies. Unlike the popular Deep Reinforcement Learning (DRL) paradigm, which represents policies by neural networks, PIRL represents policies using a high-level, domain-specific programming language. Such programmatic… ▽ More

    Submitted 10 April, 2019; v1 submitted 6 April, 2018; originally announced April 2018.

    Comments: Published at The 35th International Conference on Machine Learning (ICML 2018)

    Journal ref: PMLR 80:5045-5054

  12. arXiv:1803.06567  [pdf, other]

    cs.LG stat.ML

    A Dual Approach to Scalable Verification of Deep Networks

    Authors: Krishnamurthy Dvijotham, Robert Stanforth, Sven Gowal, Timothy Mann, Pushmeet Kohli

    Abstract: This paper addresses the problem of formally verifying desirable properties of neural networks, i.e., obtaining provable guarantees that neural networks satisfy specifications relating their inputs and outputs (robustness to bounded norm adversarial perturbations, for example). Most previous work on this topic was limited in its applicability by the size of the network, network architecture and th… ▽ More

    Submitted 3 August, 2018; v1 submitted 17 March, 2018; originally announced March 2018.

  13. arXiv:1802.08535  [pdf, other]

    cs.NE cs.AI

    Can Neural Networks Understand Logical Entailment?

    Authors: Richard Evans, David Saxton, David Amos, Pushmeet Kohli, Edward Grefenstette

    Abstract: We introduce a new dataset of logical entailments for the purpose of measuring models' ability to capture and exploit the structure of logical expressions against an entailment prediction task. We use this task to compare a series of architectures which are ubiquitous in the sequence-processing literature, in addition to a new model class---PossibleWorldNets---which computes entailment as a "convo… ▽ More

    Submitted 23 February, 2018; originally announced February 2018.

    Comments: Published at ICLR 2018 (main conference)

  14. arXiv:1802.05666  [pdf, other]

    cs.LG cs.CR stat.ML

    Adversarial Risk and the Dangers of Evaluating Against Weak Attacks

    Authors: Jonathan Uesato, Brendan O'Donoghue, Aaron van den Oord, Pushmeet Kohli

    Abstract: This paper investigates recently proposed approaches for defending against adversarial examples and evaluating adversarial robustness. We motivate 'adversarial risk' as an objective for achieving models robust to worst-case inputs. We then frame commonly used attacks and evaluation metrics as defining a tractable surrogate objective to the true adversarial risk. This suggests that models may optim… ▽ More

    Submitted 12 June, 2018; v1 submitted 15 February, 2018; originally announced February 2018.

  15. arXiv:1711.06970  [pdf]

    cs.CY cs.LG

    How much is my car worth? A methodology for predicting used cars prices using Random Forest

    Authors: Nabarun Pal, Priya Arora, Dhanasekar Sundararaman, Puneet Kohli, Sai Sumanth Palakurthy

    Abstract: Cars are being sold more than ever. Developing countries adopt the lease culture instead of buying a new car due to affordability. Therefore, the rise of used cars sales is exponentially increasing. Car sellers sometimes take advantage of this scenario by listing unrealistic prices owing to the demand. Therefore, arises a need for a model that can assign a price for a vehicle by evaluating its fea… ▽ More

    Submitted 19 November, 2017; originally announced November 2017.

    Comments: FICC Camera Ready

  16. arXiv:1711.00455  [pdf, ps, other]

    cs.AI cs.LG

    A Unified View of Piecewise Linear Neural Network Verification

    Authors: Rudy Bunel, Ilker Turkaslan, Philip H. S. Torr, Pushmeet Kohli, M. Pawan Kumar

    Abstract: The success of Deep Learning and its potential use in many safety-critical applications has motivated research on formal verification of Neural Network (NN) models. Despite the reputation of learned NN models to behave as black boxes and the theoretical hardness of proving their properties, researchers have been successful in verifying some classes of models by exploiting their piecewise linear st… ▽ More

    Submitted 22 May, 2018; v1 submitted 1 November, 2017; originally announced November 2017.

    Comments: Updated version of "Piecewise Linear Neural Network verification: A comparative study"

  17. arXiv:1710.11054  [pdf, other]

    cs.AI cs.PL cs.SE

    Semantic Code Repair using Neuro-Symbolic Transformation Networks

    Authors: Jacob Devlin, Jonathan Uesato, Rishabh Singh, Pushmeet Kohli

    Abstract: We study the problem of semantic code repair, which can be broadly defined as automatically fixing non-syntactic bugs in source code. The majority of past work in semantic code repair assumed access to unit tests against which candidate repairs could be validated. In contrast, the goal here is to develop a strong statistical model to accurately predict both bug locations and exact fixes without ac… ▽ More

    Submitted 30 October, 2017; originally announced October 2017.

  18. arXiv:1710.04157  [pdf, other]

    cs.AI

    Neural Program Meta-Induction

    Authors: Jacob Devlin, Rudy Bunel, Rishabh Singh, Matthew Hausknecht, Pushmeet Kohli

    Abstract: Most recently proposed methods for Neural Program Induction work under the assumption of having a large set of input/output (I/O) examples for learning any underlying input-output mapping. This paper aims to address the problem of data and computation efficiency of program induction by leveraging information from related tasks. Specifically, we propose two approaches for cross-task knowledge trans… ▽ More

    Submitted 11 October, 2017; originally announced October 2017.

    Comments: 8 Pages + 1 page appendix

  19. arXiv:1706.05064  [pdf, other]

    cs.AI cs.LG

    Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

    Authors: Junhyuk Oh, Satinder Singh, Honglak Lee, Pushmeet Kohli

    Abstract: As a step towards developing zero-shot task generalization capabilities in reinforcement learning (RL), we introduce a new RL problem where the agent should learn to execute sequences of instructions after learning useful skills that solve subtasks. In this problem, we consider two types of generalizations: to previously unseen instructions and to longer sequences of instructions. For generalizati… ▽ More

    Submitted 6 November, 2017; v1 submitted 15 June, 2017; originally announced June 2017.

    Comments: ICML 2017

  20. arXiv:1706.01445  [pdf, other]

    stat.ML cs.LG math.OC

    Batched Large-scale Bayesian Optimization in High-dimensional Spaces

    Authors: Zi Wang, Clement Gehring, Pushmeet Kohli, Stefanie Jegelka

    Abstract: Bayesian optimization (BO) has become an effective approach for black-box function optimization problems when function evaluations are expensive and the optimum can be achieved within a relatively small number of queries. However, many cases, such as the ones with high-dimensional inputs, may require a much larger number of observations for optimization. Despite an abundance of observations thanks… ▽ More

    Submitted 15 May, 2018; v1 submitted 5 June, 2017; originally announced June 2017.

    Comments: Proceedings of the 21st International Conference on Artificial Intelligence and Statistics (AISTATS) 2018, Lanzarote, Spain

  21. arXiv:1706.00400  [pdf, other]

    stat.ML cs.AI cs.LG

    Learning Disentangled Representations with Semi-Supervised Deep Generative Models

    Authors: N. Siddharth, Brooks Paige, Jan-Willem van de Meent, Alban Desmaison, Noah D. Goodman, Pushmeet Kohli, Frank Wood, Philip H. S. Torr

    Abstract: Variational autoencoders (VAEs) learn representations of data by jointly training a probabilistic encoder and decoder network. Typically these models encode all features of the data into a single variable. Here we are interested in learning disentangled representations that encode distinct aspects of the data into separate variables. We propose to learn such representations using model architectur… ▽ More

    Submitted 13 November, 2017; v1 submitted 1 June, 2017; originally announced June 2017.

    Comments: Accepted for publication at NIPS 2017

  22. arXiv:1704.04327  [pdf, other]

    cs.AI cs.LG

    Deep API Programmer: Learning to Program with APIs

    Authors: Surya Bhupatiraju, Rishabh Singh, Abdel-rahman Mohamed, Pushmeet Kohli

    Abstract: We present DAPIP, a Programming-By-Example system that learns to program with APIs to perform data transformation tasks. We design a domain-specific language (DSL) that allows for arbitrary concatenations of API outputs and constant strings. The DSL consists of three families of APIs: regular expression-based APIs, lookup APIs, and transformation APIs. We then present a novel neural synthesis algori… ▽ More

    Submitted 13 April, 2017; originally announced April 2017.

    Comments: 8 pages + 4 pages of supplementary material. Submitted to IJCAI 2017

  23. arXiv:1703.07469  [pdf, other]

    cs.AI

    RobustFill: Neural Program Learning under Noisy I/O

    Authors: Jacob Devlin, Jonathan Uesato, Surya Bhupatiraju, Rishabh Singh, Abdel-rahman Mohamed, Pushmeet Kohli

    Abstract: The problem of automatically generating a computer program from some specification has been studied since the early days of AI. Recently, two competing approaches for automatic program learning have received significant attention: (1) neural program synthesis, where a neural network is conditioned on input/output (I/O) examples and learns to generate a program, and (2) neural program induction, wh… ▽ More

    Submitted 21 March, 2017; originally announced March 2017.

    Comments: 8 pages + 9 pages of supplementary material

  24. arXiv:1703.01973  [pdf, other]

    stat.ML cs.LG math.OC

    Batched High-dimensional Bayesian Optimization via Structural Kernel Learning

    Authors: Zi Wang, Chengtao Li, Stefanie Jegelka, Pushmeet Kohli

    Abstract: Optimization of high-dimensional black-box functions is an extremely challenging problem. While Bayesian optimization has emerged as a popular approach for optimizing black-box functions, its applicability has been limited to low-dimensional problems due to its computational and statistical challenges arising from high-dimensional settings. In this paper, we propose to tackle these challenges by (… ▽ More

    Submitted 6 January, 2018; v1 submitted 6 March, 2017; originally announced March 2017.

    Comments: Proceedings of the 34th International Conference on Machine Learning, Sydney, Australia, PMLR 70, 2017

  25. arXiv:1702.08887  [pdf, other]

    cs.AI cs.LG cs.MA

    Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning

    Authors: Jakob Foerster, Nantas Nardelli, Gregory Farquhar, Triantafyllos Afouras, Philip H. S. Torr, Pushmeet Kohli, Shimon Whiteson

    Abstract: Many real-world problems, such as network packet routing and urban traffic control, are naturally modeled as multi-agent reinforcement learning (RL) problems. However, existing multi-agent RL methods typically scale poorly in the problem size. Therefore, a key challenge is to translate the success of deep learning on single-agent RL to the multi-agent setting. A major stumbling block is that indep… ▽ More

    Submitted 21 May, 2018; v1 submitted 28 February, 2017; originally announced February 2017.

    Comments: Camera-ready version, International Conference of Machine Learning 2017; updated to fix print-breaking image

  26. arXiv:1612.01234  [pdf, other]

    cs.CV

    Multi-way Particle Swarm Fusion

    Authors: Chen Liu, Hang Yan, Pushmeet Kohli, Yasutaka Furukawa

    Abstract: This paper proposes a novel MAP inference framework for Markov Random Field (MRF) in parallel computing environments. The inference framework, dubbed Swarm Fusion, is a natural generalization of the Fusion Move method. Every thread (in a case of multi-threading environments) maintains and updates a solution. At each iteration, a thread can generate arbitrary number of solution proposals and take a… ▽ More

    Submitted 4 December, 2016; originally announced December 2016.

  27. arXiv:1612.01225  [pdf, other]

    cs.CV

    Deep Multi-Modal Image Correspondence Learning

    Authors: Chen Liu, Jiajun Wu, Pushmeet Kohli, Yasutaka Furukawa

    Abstract: Inference of correspondences between images from different modalities is an extremely important perceptual ability that enables humans to understand and recognize cross-modal concepts. In this paper, we consider an instance of this problem that involves matching photographs of building interiors with their corresponding floorplan. This is a particularly challenging problem because a floorplan, as… ▽ More

    Submitted 4 December, 2016; originally announced December 2016.

  28. arXiv:1612.01094  [pdf, other]

    cs.LG

    Learning to superoptimize programs - Workshop Version

    Authors: Rudy Bunel, Alban Desmaison, M. Pawan Kumar, Philip H. S. Torr, Pushmeet Kohli

    Abstract: Superoptimization requires the estimation of the best program for a given computational task. In order to deal with large programs, superoptimization techniques perform a stochastic search. This involves proposing a modification of the current program, which is accepted or rejected based on the improvement achieved. The state of the art method uses uniform proposal distributions, which fails to ex… ▽ More

    Submitted 4 December, 2016; originally announced December 2016.

    Comments: Workshop version for the NIPS NAMPI Workshop. Extended version at arXiv:1611.01787

  29. arXiv:1612.00817  [pdf, other]

    cs.LG cs.AI cs.NE

    Summary - TerpreT: A Probabilistic Programming Language for Program Induction

    Authors: Alexander L. Gaunt, Marc Brockschmidt, Rishabh Singh, Nate Kushman, Pushmeet Kohli, Jonathan Taylor, Daniel Tarlow

    Abstract: We study machine learning formulations of inductive program synthesis; that is, given input-output examples, synthesize source code that maps inputs to corresponding outputs. Our key contribution is TerpreT, a domain-specific language for expressing program synthesis problems. A TerpreT model is composed of a specification of a program representation and an interpreter that describes how programs… ▽ More

    Submitted 2 December, 2016; originally announced December 2016.

    Comments: 7 pages, 2 figures, 4 tables in 1st Workshop on Neural Abstract Machines & Program Induction (NAMPI), @NIPS 2016

  30. arXiv:1611.07492  [pdf, other]

    stat.ML cs.CV cs.LG

    Inducing Interpretable Representations with Variational Autoencoders

    Authors: N. Siddharth, Brooks Paige, Alban Desmaison, Jan-Willem Van de Meent, Frank Wood, Noah D. Goodman, Pushmeet Kohli, Philip H. S. Torr

    Abstract: We develop a framework for incorporating structured graphical models in the encoders of variational autoencoders (VAEs) that allows us to induce interpretable representations through approximate variational inference. This allows us to both perform reasoning (e.g. classification) under the structural constraints of a given graphical model, and use deep generative models to deal with messy,… ▽ More

    Submitted 22 November, 2016; originally announced November 2016.

    Comments: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems

  31. arXiv:1611.04088  [pdf, other]

    cs.LG

    Batched Gaussian Process Bandit Optimization via Determinantal Point Processes

    Authors: Tarun Kathuria, Amit Deshpande, Pushmeet Kohli

    Abstract: Gaussian Process bandit optimization has emerged as a powerful tool for optimizing noisy black box functions. One example in machine learning is hyper-parameter optimization where each evaluation of the target function requires training a model which may involve days or even weeks of computation. Most methods for this so-called "Bayesian optimization" only allow sequential exploration of the param… ▽ More

    Submitted 13 November, 2016; originally announced November 2016.

    Comments: To appear at NIPS 2016

  32. arXiv:1611.02261  [pdf, other]

    cs.CV cs.LG cs.NE

    Memory-augmented Attention Modelling for Videos

    Authors: Rasool Fakoor, Abdel-rahman Mohamed, Margaret Mitchell, Sing Bing Kang, Pushmeet Kohli

    Abstract: We present a method to improve video description generation by modeling higher-order interactions between video frames and described concepts. By storing past visual attention in the video associated to previously generated words, the system is able to decide what to look at and describe in light of what it has already looked at and described. This enables not only more effective local attention,… ▽ More

    Submitted 24 April, 2017; v1 submitted 7 November, 2016; originally announced November 2016.

    Comments: Revised version, minor changes, add the link for the source codes

  33. arXiv:1611.01855  [pdf, other]

    cs.AI cs.PL

    Neuro-Symbolic Program Synthesis

    Authors: Emilio Parisotto, Abdel-rahman Mohamed, Rishabh Singh, Lihong Li, Dengyong Zhou, Pushmeet Kohli

    Abstract: Recent years have seen the proposal of a number of neural architectures for the problem of Program Induction. Given a set of input-output examples, these architectures are able to learn mappings that generalize to new test inputs. While achieving impressive results, these approaches have a number of important limitations: (a) they are computationally expensive and hard to train, (b) a model has to… ▽ More

    Submitted 6 November, 2016; originally announced November 2016.

  34. arXiv:1611.01787  [pdf, other]

    cs.LG

    Learning to superoptimize programs

    Authors: Rudy Bunel, Alban Desmaison, M. Pawan Kumar, Philip H. S. Torr, Pushmeet Kohli

    Abstract: Code super-optimization is the task of transforming any given program to a more efficient version while preserving its input-output behaviour. In some sense, it is similar to the paraphrase problem from natural language processing where the intention is to change the syntax of an utterance without changing its semantics. Code-optimization has been the subject of years of research that has resulted… ▽ More

    Submitted 28 June, 2017; v1 submitted 6 November, 2016; originally announced November 2016.

    Comments: Accepted to ICLR 2017

  35. arXiv:1611.01423  [pdf, other]

    cs.LG cs.AI

    Learning Continuous Semantic Representations of Symbolic Expressions

    Authors: Miltiadis Allamanis, Pankajan Chanthirasegaran, Pushmeet Kohli, Charles Sutton

    Abstract: Combining abstract, symbolic reasoning with continuous neural reasoning is a grand challenge of representation learning. As a step in this direction, we propose a new architecture, called neural equivalence networks, for the problem of learning continuous semantic representations of algebraic and logical expressions. These networks are trained to represent semantic equivalence, even of expressions… ▽ More

    Submitted 10 June, 2017; v1 submitted 4 November, 2016; originally announced November 2016.

    Comments: Accepted to ICML 2017

  36. arXiv:1610.03777  [pdf, other]

    cs.CV

    Deep disentangled representations for volumetric reconstruction

    Authors: Edward Grant, Pushmeet Kohli, Marcel van Gerven

    Abstract: We introduce a convolutional neural network for inferring a compact disentangled graphical description of objects from 2D images that can be used for volumetric reconstruction. The network comprises an encoder and a twin-tailed decoder. The encoder generates a disentangled graphics code. The first decoder generates a volume, and the second decoder reconstructs the input image using a novel trainin… ▽ More

    Submitted 12 October, 2016; originally announced October 2016.

  37. arXiv:1608.06192  [pdf, other]

    cs.CV

    Efficient Continuous Relaxations for Dense CRF

    Authors: Alban Desmaison, Rudy Bunel, Pushmeet Kohli, Philip H. S. Torr, M. Pawan Kumar

    Abstract: Dense conditional random fields (CRF) with Gaussian pairwise potentials have emerged as a popular framework for several computer vision applications such as stereo correspondence and semantic segmentation. By modeling long-range interactions, dense CRFs provide a more detailed labelling compared to their sparse counterparts. Variational inference in these dense models is performed using a filterin… ▽ More

    Submitted 22 August, 2016; originally announced August 2016.

  38. arXiv:1608.04428  [pdf, other]

    cs.LG cs.AI cs.NE

    TerpreT: A Probabilistic Programming Language for Program Induction

    Authors: Alexander L. Gaunt, Marc Brockschmidt, Rishabh Singh, Nate Kushman, Pushmeet Kohli, Jonathan Taylor, Daniel Tarlow

    Abstract: We study machine learning formulations of inductive program synthesis; given input-output examples, we try to synthesize source code that maps inputs to corresponding outputs. Our aims are to develop new machine learning approaches based on neural networks and graphical models, and to understand the capabilities of machine learning techniques relative to traditional alternatives, such as those bas… ▽ More

    Submitted 15 August, 2016; originally announced August 2016.

    Comments: 50 pages, 20 figures, 4 tables

  39. arXiv:1605.07969  [pdf, other]

    cs.AI cs.LG

    Adaptive Neural Compilation

    Authors: Rudy Bunel, Alban Desmaison, Pushmeet Kohli, Philip H. S. Torr, M. Pawan Kumar

    Abstract: This paper proposes an adaptive neural-compilation framework to address the problem of efficient program learning. Traditional code optimisation strategies used in compilers are based on applying pre-specified set of transformations that make the code faster to execute without changing its semantics. In contrast, our work involves adapting programs to make them more efficient while considering cor… ▽ More

    Submitted 26 May, 2016; v1 submitted 25 May, 2016; originally announced May 2016.

    Comments: Submitted to NIPS 2016, code and supplementary materials will be available on author's page

  40. arXiv:1604.03968  [pdf, other]

    cs.CL cs.AI cs.CV

    Visual Storytelling

    Authors: Ting-Hao Huang, Francis Ferraro, Nasrin Mostafazadeh, Ishan Misra, Aishwarya Agrawal, Jacob Devlin, Ross Girshick, Xiaodong He, Pushmeet Kohli, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh, Lucy Vanderwende, Michel Galley, Margaret Mitchell

    Abstract: We introduce the first dataset for sequential vision-to-language, and explore how this data may be used for the task of visual storytelling. The first release of this dataset, SIND v.1, includes 81,743 unique photos in 20,211 sequences, aligned to both descriptive (caption) and story language. We establish several strong baselines for the storytelling task, and motivate an automatic metric to benc… ▽ More

    Submitted 13 April, 2016; originally announced April 2016.

    Comments: to appear in NAACL 2016

  41. arXiv:1604.01696  [pdf, other]

    cs.CL cs.AI

    A Corpus and Evaluation Framework for Deeper Understanding of Commonsense Stories

    Authors: Nasrin Mostafazadeh, Nathanael Chambers, Xiaodong He, Devi Parikh, Dhruv Batra, Lucy Vanderwende, Pushmeet Kohli, James Allen

    Abstract: Representation and learning of commonsense knowledge is one of the foundational problems in the quest to enable deep language understanding. This issue is particularly challenging for understanding causal and correlational relationships between events. While this topic has received a lot of interest in the NLP community, research has been hindered by the lack of a proper evaluation framework. This…

    Submitted 6 April, 2016; originally announced April 2016.

    Comments: In Proceedings of the 2016 North American Chapter of the ACL (NAACL HLT), 2016

  42. arXiv:1603.05772  [pdf, other]

    cs.CV

    Learning to Navigate the Energy Landscape

    Authors: Julien Valentin, Angela Dai, Matthias Nießner, Pushmeet Kohli, Philip Torr, Shahram Izadi, Cem Keskin

    Abstract: In this paper, we present a novel and efficient architecture for addressing computer vision problems that use 'Analysis by Synthesis'. Analysis by synthesis involves the minimization of the reconstruction error which is typically a non-convex function of the latent target variables. State-of-the-art methods adopt a hybrid scheme where discriminatively trained predictors like Random Forests or Conv…

    Submitted 18 March, 2016; originally announced March 2016.

  43. arXiv:1603.04922  [pdf, other]

    cs.CV

    DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

    Authors: Yinda Zhang, Mingru Bai, Pushmeet Kohli, Shahram Izadi, Jianxiong Xiao

    Abstract: While deep neural networks have led to human-level performance on computer vision tasks, they have yet to demonstrate similar gains for holistic scene understanding. In particular, 3D context has been shown to be an extremely important cue for scene understanding - yet very little research has been done on integrating context information with deep models. This paper presents an approach to embed 3…

    Submitted 16 August, 2017; v1 submitted 15 March, 2016; originally announced March 2016.

    Comments: Accepted by ICCV2017

  44. arXiv:1511.04056  [pdf, other]

    cs.LG cs.CV

    Efficient non-greedy optimization of decision trees

    Authors: Mohammad Norouzi, Maxwell D. Collins, Matthew Johnson, David J. Fleet, Pushmeet Kohli

    Abstract: Decision trees and randomized forests are widely used in computer vision and machine learning. Standard algorithms for decision tree induction optimize the split functions one node at a time according to some splitting criteria. This greedy procedure often leads to suboptimal trees. In this paper, we present an algorithm for optimizing the split functions at all levels of the tree jointly with the…

    Submitted 12 November, 2015; originally announced November 2015.

    Comments: in NIPS 2015

  45. arXiv:1510.06335  [pdf, other]

    cs.AI cs.LG

    Time-Sensitive Bayesian Information Aggregation for Crowdsourcing Systems

    Authors: Matteo Venanzi, John Guiver, Pushmeet Kohli, Nick Jennings

    Abstract: Crowdsourcing systems commonly face the problem of aggregating multiple judgments provided by potentially unreliable workers. In addition, several aspects of the design of efficient crowdsourcing processes, such as defining workers' bonuses, fair prices and time limits of the tasks, involve knowledge of the likely duration of the task at hand. Bringing this together, in this work we introduce a ne…

    Submitted 18 April, 2016; v1 submitted 21 October, 2015; originally announced October 2015.

  46. arXiv:1508.02823  [pdf, other]

    cs.HC cs.CY cs.LG

    Learning to Hire Teams

    Authors: Adish Singla, Eric Horvitz, Pushmeet Kohli, Andreas Krause

    Abstract: Crowdsourcing and human computation have been employed in increasingly sophisticated projects that require the solution of a heterogeneous set of tasks. We explore the challenge of building or hiring an effective team, for performing tasks required for such projects on an ongoing basis, from an available pool of applicants or workers who have bid for the tasks. The recruiter needs to learn workers'…

    Submitted 12 August, 2015; originally announced August 2015.

    Comments: Short version of this paper will appear in HCOMP'15

  47. arXiv:1506.06155  [pdf, other]

    cs.LG cs.CV

    CO2 Forest: Improved Random Forest by Continuous Optimization of Oblique Splits

    Authors: Mohammad Norouzi, Maxwell D. Collins, David J. Fleet, Pushmeet Kohli

    Abstract: We propose a novel algorithm for optimizing multivariate linear threshold functions as split functions of decision trees to create improved Random Forest classifiers. Standard tree induction methods resort to sampling and exhaustive search to find good univariate split functions. In contrast, our method computes a linear combination of the features at each node, and optimizes the parameters of the…

    Submitted 24 June, 2015; v1 submitted 19 June, 2015; originally announced June 2015.

  48. arXiv:1504.08362  [pdf, other]

    cs.CV

    PerforatedCNNs: Acceleration through Elimination of Redundant Convolutions

    Authors: Michael Figurnov, Aijan Ibraimova, Dmitry Vetrov, Pushmeet Kohli

    Abstract: We propose a novel approach to reduce the computational cost of evaluation of convolutional neural networks, a factor that has hindered their deployment in low-power devices such as mobile phones. Inspired by the loop perforation technique from source code optimization, we speed up the bottleneck convolutional layers by skipping their evaluation in some of the spatial positions. We propose and ana…

    Submitted 15 October, 2016; v1 submitted 30 April, 2015; originally announced April 2015.

    Comments: NIPS 2016

  49. arXiv:1504.06423  [pdf, other]

    cs.AI

    Information Gathering in Networks via Active Exploration

    Authors: Adish Singla, Eric Horvitz, Pushmeet Kohli, Ryen White, Andreas Krause

    Abstract: How should we gather information in a network, where each node's visibility is limited to its local neighborhood? This problem arises in numerous real-world applications, such as surveying and task routing in social networks, team formation in collaborative networks and experimental design with dependency constraints. Often the informativeness of a set of nodes can be quantified via a submodular u…

    Submitted 6 May, 2015; v1 submitted 24 April, 2015; originally announced April 2015.

    Comments: Longer version of IJCAI'15 paper

  50. arXiv:1503.03167  [pdf, other]

    cs.CV cs.GR cs.LG cs.NE

    Deep Convolutional Inverse Graphics Network

    Authors: Tejas D. Kulkarni, Will Whitney, Pushmeet Kohli, Joshua B. Tenenbaum

    Abstract: This paper presents the Deep Convolution Inverse Graphics Network (DC-IGN), a model that learns an interpretable representation of images. This representation is disentangled with respect to transformations such as out-of-plane rotations and lighting variations. The DC-IGN model is composed of multiple layers of convolution and de-convolution operators and is trained using the Stochastic Gradient…

    Submitted 21 June, 2015; v1 submitted 11 March, 2015; originally announced March 2015.

    Comments: First two authors contributed equally