Skip to main content

Showing 51–63 of 63 results for author: Bagnell, J A

.
  1. arXiv:1308.3506  [pdf, other

    cs.GT cs.LG stat.ML

    Computational Rationalization: The Inverse Equilibrium Problem

    Authors: Kevin Waugh, Brian D. Ziebart, J. Andrew Bagnell

    Abstract: Modeling the purposeful behavior of imperfect agents from a small number of observations is a challenging task. When restricted to the single-agent decision-theoretic setting, inverse optimal control techniques assume that observed behavior is an approximately optimal solution to an unknown decision problem. These techniques learn a utility function that explains the example behavior and can then… ▽ More

    Submitted 15 August, 2013; originally announced August 2013.

    Comments: In submission to JMLR, conference version: arXiv:1103.5254

  2. arXiv:1305.2532  [pdf, other

    cs.LG stat.ML

    Learning Policies for Contextual Submodular Prediction

    Authors: Stephane Ross, Jiaji Zhou, Yisong Yue, Debadeepta Dey, J. Andrew Bagnell

    Abstract: Many prediction domains, such as ad placement, recommendation, trajectory prediction, and document summarization, require predicting a set or list of options. Such lists are often evaluated using submodular reward functions that measure both quality and diversity. We propose a simple, efficient, and provably near-optimal approach to optimizing such prediction problems based on no-regret learning.… ▽ More

    Submitted 11 May, 2013; originally announced May 2013.

    Comments: 13 pages. To appear in proceedings of the International Conference on Machine Learning (ICML), 2013

  3. arXiv:1301.0556  [pdf

    cs.LG cs.IR stat.ML

    Learning with Scope, with Application to Information Extraction and Classification

    Authors: David Blei, J Andrew Bagnell, Andrew McCallum

    Abstract: In probabilistic approaches to classification and information extraction, one typically builds a statistical model of words under the assumption that future data will exhibit the same regularities as the training data. In many data sets, however, there are scope-limited features whose predictive power is only applicable to a certain subset of the data. For example, in information extraction from… ▽ More

    Submitted 12 December, 2012; originally announced January 2013.

    Comments: Appears in Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI2002)

    Report number: UAI-P-2002-PG-53-60

  4. arXiv:1211.1690  [pdf, other

    cs.RO cs.CV cs.LG eess.SY

    Learning Monocular Reactive UAV Control in Cluttered Natural Environments

    Authors: Stephane Ross, Narek Melik-Barkhudarov, Kumar Shaurya Shankar, Andreas Wendel, Debadeepta Dey, J. Andrew Bagnell, Martial Hebert

    Abstract: Autonomous navigation for large Unmanned Aerial Vehicles (UAVs) is fairly straight-forward, as expensive sensors and monitoring devices can be employed. In contrast, obstacle avoidance remains a challenging task for Micro Aerial Vehicles (MAVs) which operate at low altitude in cluttered environments. Unlike large vehicles, MAVs can only carry very light sensors, such as cameras, making autonomous… ▽ More

    Submitted 7 November, 2012; originally announced November 2012.

    Comments: 8 pages, 10 figures

  5. arXiv:1208.6067  [pdf, other

    cs.RO

    Efficient Touch Based Localization through Submodularity

    Authors: Shervin Javdani, Matthew Klingensmith, J. Andrew Bagnell, Nancy S. Pollard, Siddhartha S. Srinivasa

    Abstract: Many robotic systems deal with uncertainty by performing a sequence of information gathering actions. In this work, we focus on the problem of efficiently constructing such a sequence by drawing an explicit connection to submodularity. Ideally, we would like a method that finds the optimal sequence, taking the minimum amount of time while providing sufficient information. Finding this sequence, ho… ▽ More

    Submitted 23 April, 2013; v1 submitted 29 August, 2012; originally announced August 2012.

  6. arXiv:1206.5281  [pdf

    cs.LG stat.ML

    Learning Selectively Conditioned Forest Structures with Applications to DBNs and Classification

    Authors: Brian D. Ziebart, Anind K. Dey, J Andrew Bagnell

    Abstract: Dealing with uncertainty in Bayesian Network structures using maximum a posteriori (MAP) estimation or Bayesian Model Averaging (BMA) is often intractable due to the superexponential number of possible directed, acyclic graphs. When the prior is decomposable, two classes of graphs where efficient learning can take place are tree structures, and fixed-orderings with limited in-degree. We show how M… ▽ More

    Submitted 20 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence (UAI2007)

    Report number: UAI-P-2007-PG-458-465

  7. arXiv:1205.2656  [pdf

    cs.LG cs.IT stat.ML

    Convex Coding

    Authors: David M. Bradley, J Andrew Bagnell

    Abstract: Inspired by recent work on convex formulations of clustering (Lashkari & Golland, 2008; Nowozin & Bakir, 2008) we investigate a new formulation of the Sparse Coding Problem (Olshausen & Field, 1997). In sparse coding we attempt to simultaneously represent a sequence of data-vectors sparsely (i.e. sparse approximation (Tropp et al., 2006)) in terms of a 'code' defined by a set of basis elements, wh… ▽ More

    Submitted 9 May, 2012; originally announced May 2012.

    Comments: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

    Report number: UAI-P-2009-PG-83-90

  8. arXiv:1203.1007  [pdf, other

    cs.LG cs.AI eess.SY stat.ML

    Agnostic System Identification for Model-Based Reinforcement Learning

    Authors: Stephane Ross, J. Andrew Bagnell

    Abstract: A fundamental problem in control is to learn a model of a system from observations that is useful for controller synthesis. To provide good performance guarantees, existing methods must assume that the real system is in the class of models considered during learning. We present an iterative method with strong guarantees even in the agnostic case where the system is not in the class. In particular,… ▽ More

    Submitted 3 July, 2012; v1 submitted 5 March, 2012; originally announced March 2012.

    Comments: 8 pages, published in ICML 2012

  9. arXiv:1202.2112  [pdf, other

    cs.AI cs.LG cs.RO

    Predicting Contextual Sequences via Submodular Function Maximization

    Authors: Debadeepta Dey, Tian Yu Liu, Martial Hebert, J. Andrew Bagnell

    Abstract: Sequence optimization, where the items in a list are ordered to maximize some reward has many applications such as web advertisement placement, search, and control libraries in robotics. Previous work in sequence optimization produces a static ordering that does not take any features of the item or context of the problem into account. In this work, we propose a general approach to order the items… ▽ More

    Submitted 9 February, 2012; originally announced February 2012.

    Comments: 8 pages

    Report number: CMU-RI-TR-12-05

  10. arXiv:1108.3154  [pdf, ps, other

    cs.LG stat.ML

    Stability Conditions for Online Learnability

    Authors: Stephane Ross, J. Andrew Bagnell

    Abstract: Stability is a general notion that quantifies the sensitivity of a learning algorithm's output to small change in the training dataset (e.g. deletion or replacement of a single training sample). Such conditions have recently been shown to be more powerful to characterize learnability in the general learning setting under i.i.d. samples where uniform convergence is not necessary for learnability, b… ▽ More

    Submitted 17 August, 2011; v1 submitted 16 August, 2011; originally announced August 2011.

    Comments: 16 pages. Earlier version of this work submitted (but rejected) to COLT 2011

  11. arXiv:1105.2054  [pdf, other

    cs.LG stat.ML

    Generalized Boosting Algorithms for Convex Optimization

    Authors: Alexander Grubb, J. Andrew Bagnell

    Abstract: Boosting is a popular way to derive powerful learners from simpler hypothesis classes. Following previous work (Mason et al., 1999; Friedman, 2000) on general boosting frameworks, we analyze gradient-based descent algorithms for boosting with respect to any convex objective and introduce a new measure of weak learner performance into this setting which generalizes existing work. We present the wea… ▽ More

    Submitted 14 February, 2012; v1 submitted 10 May, 2011; originally announced May 2011.

    Comments: Extended version of paper presented at the International Conference on Machine Learning, 2011. 9 pages + appendix with proofs

  12. arXiv:1103.5254  [pdf, other

    cs.GT

    Computational Rationalization: The Inverse Equilibrium Problem

    Authors: Kevin Waugh, Brian D. Ziebart, J. Andrew Bagnell

    Abstract: Modeling the purposeful behavior of imperfect agents from a small number of observations is a challenging task. When restricted to the single-agent decision-theoretic setting, inverse optimal control techniques assume that observed behavior is an approximately optimal solution to an unknown decision problem. These techniques learn a utility function that explains the example behavior and can then… ▽ More

    Submitted 6 May, 2011; v1 submitted 27 March, 2011; originally announced March 2011.

    Comments: 8 pages, 4 page appendix, ICML 2011

  13. arXiv:1011.0686  [pdf, other

    cs.LG cs.AI stat.ML

    A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning

    Authors: Stephane Ross, Geoffrey J. Gordon, J. Andrew Bagnell

    Abstract: Sequential prediction problems such as imitation learning, where future observations depend on previous predictions (actions), violate the common i.i.d. assumptions made in statistical learning. This leads to poor performance in theory and often in practice. Some recent approaches provide stronger guarantees in this setting, but remain somewhat unsatisfactory as they train either non-stationary or… ▽ More

    Submitted 16 March, 2011; v1 submitted 2 November, 2010; originally announced November 2010.

    Comments: Appearing in the 14th International Conference on Artificial Intelligence and Statistics (AISTATS 2011)