Skip to main content

Showing 1–50 of 56 results for author: Guestrin, C

.
  1. arXiv:2406.07496  [pdf, other

    cs.CL cs.AI cs.LG

    TextGrad: Automatic "Differentiation" via Text

    Authors: Mert Yuksekgonul, Federico Bianchi, Joseph Boen, Sheng Liu, Zhi Huang, Carlos Guestrin, James Zou

    Abstract: AI is undergoing a paradigm shift, with breakthroughs achieved by systems orchestrating multiple large language models (LLMs) and other complex components. As a result, develo** principled and automated optimization methods for compound AI systems is one of the most important new challenges. Neural networks faced a similar challenge in its early days until backpropagation and automatic different… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 41 pages, 6 figures

  2. arXiv:2404.07815  [pdf, other

    cs.LG cs.AI stat.ML

    Post-Hoc Reversal: Are We Selecting Models Prematurely?

    Authors: Rishabh Ranjan, Saurabh Garg, Mrigank Raman, Carlos Guestrin, Zachary Chase Lipton

    Abstract: Trained models are often composed with post-hoc transforms such as temperature scaling (TS), ensembling and stochastic weight averaging (SWA) to improve performance, robustness, uncertainty estimation, etc. However, such transforms are typically applied only after the base models have already been finalized by standard means. In this paper, we challenge this practice with an extensive empirical st… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 9 pages + references + appendix, 7 figures

  3. arXiv:2403.04871  [pdf, other

    cs.IR cs.DB

    ACORN: Performant and Predicate-Agnostic Search Over Vector Embeddings and Structured Data

    Authors: Liana Patel, Peter Kraft, Carlos Guestrin, Matei Zaharia

    Abstract: Applications increasingly leverage mixed-modality data, and must jointly search over vector data, such as embedded images, text and video, as well as structured data, such as attributes and keywords. Proposed methods for this hybrid search setting either suffer from poor performance or support a severely restricted set of search predicates (e.g., only small sets of equality predicates), making the… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  4. arXiv:2311.12233  [pdf, other

    cs.CL

    Unifying Corroborative and Contributive Attributions in Large Language Models

    Authors: Theodora Worledge, Judy Hanwen Shen, Nicole Meister, Caleb Winston, Carlos Guestrin

    Abstract: As businesses, products, and services spring up around large language models, the trustworthiness of these models hinges on the verifiability of their outputs. However, methods for explaining language model outputs largely fall across two distinct fields of study which both use the term "attribution" to refer to entirely separate techniques: citation generation and training data attribution. In ma… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: NeurIPS ATTRIB Workshop 2023

  5. arXiv:2310.13807  [pdf, other

    cs.LG

    Learning to (Learn at Test Time)

    Authors: Yu Sun, Xinhao Li, Karan Dalal, Chloe Hsu, Sanmi Koyejo, Carlos Guestrin, Xiaolong Wang, Tatsunori Hashimoto, Xinlei Chen

    Abstract: We reformulate the problem of supervised learning as learning to learn with two nested loops (i.e. learning problems). The inner loop learns on each individual instance with self-supervision before final prediction. The outer loop learns the self-supervised task used by the inner loop, such that its final prediction improves. Our inner loop turns out to be equivalent to linear attention when the i… ▽ More

    Submitted 7 January, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: Fixed a few small typos

  6. arXiv:2305.18262  [pdf, other

    cs.LG cs.AI

    Beyond Confidence: Reliable Models Should Also Consider Atypicality

    Authors: Mert Yuksekgonul, Linjun Zhang, James Zou, Carlos Guestrin

    Abstract: While most machine learning models can provide confidence in their predictions, confidence is insufficient to understand a prediction's reliability. For instance, the model may have a low confidence prediction if the input is not well-represented in the training dataset or if the input is inherently ambiguous. In this work, we investigate the relationship between how atypical(rare) a sample or a c… ▽ More

    Submitted 30 October, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: Published at NeurIPS 2023

  7. arXiv:2305.14387  [pdf, other

    cs.LG cs.AI cs.CL

    AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback

    Authors: Yann Dubois, Xuechen Li, Rohan Taori, Tianyi Zhang, Ishaan Gulrajani, Jimmy Ba, Carlos Guestrin, Percy Liang, Tatsunori B. Hashimoto

    Abstract: Large language models (LLMs) such as ChatGPT have seen widespread adoption due to their strong instruction-following abilities. Develo** these LLMs involves a complex yet poorly understood workflow requiring training with human feedback. Replicating and understanding this instruction-following requires tackling three major challenges: the high cost of data collection, the lack of trustworthy eva… ▽ More

    Submitted 7 January, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Spotlight at NeurIPS 2023

  8. arXiv:2302.05733  [pdf, other

    cs.CR cs.LG

    Exploiting Programmatic Behavior of LLMs: Dual-Use Through Standard Security Attacks

    Authors: Daniel Kang, Xuechen Li, Ion Stoica, Carlos Guestrin, Matei Zaharia, Tatsunori Hashimoto

    Abstract: Recent advances in instruction-following large language models (LLMs) have led to dramatic improvements in a range of NLP tasks. Unfortunately, we find that the same improved capabilities amplify the dual-use risks for malicious purposes of these models. Dual-use is difficult to prevent as instruction-following capabilities now enable standard attacks from computer security. The capabilities of th… ▽ More

    Submitted 11 February, 2023; originally announced February 2023.

  9. arXiv:2102.10472  [pdf, other

    cs.LG cs.CV

    Learning Neural Network Subspaces

    Authors: Mitchell Wortsman, Maxwell Horton, Carlos Guestrin, Ali Farhadi, Mohammad Rastegari

    Abstract: Recent observations have advanced our understanding of the neural network optimization landscape, revealing the existence of (1) paths of high accuracy containing diverse solutions and (2) wider minima offering improved performance. Previous methods observing diverse paths require multiple training runs. In contrast we aim to leverage both property (1) and (2) with a single method and in a single… ▽ More

    Submitted 12 September, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

  10. arXiv:2007.05105  [pdf, other

    cs.LG stat.ML

    AdaScale SGD: A User-Friendly Algorithm for Distributed Training

    Authors: Tyler B. Johnson, Pulkit Agrawal, Haijie Gu, Carlos Guestrin

    Abstract: When using large-batch training to speed up stochastic gradient descent, learning rates must adapt to new batch sizes in order to maximize speed-ups and preserve model quality. Re-tuning learning rates is resource intensive, while fixed scaling rules often degrade model quality. We propose AdaScale SGD, an algorithm that reliably adapts learning rates to large-batch training. By continually adapti… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

    Comments: ICML 2020

  11. arXiv:2006.10705  [pdf, other

    cs.LG cs.CV stat.ML

    Set Distribution Networks: a Generative Model for Sets of Images

    Authors: Shuangfei Zhai, Walter Talbott, Miguel Angel Bautista, Carlos Guestrin, Josh M. Susskind

    Abstract: Images with shared characteristics naturally form sets. For example, in a face verification benchmark, images of the same identity form sets. For generative models, the standard way of dealing with sets is to represent each as a one hot vector, and learn a conditional generative model $p(\mathbf{x}|\mathbf{y})$. This representation assumes that the number of sets is limited and known, such that th… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

  12. arXiv:2006.07630  [pdf, other

    cs.CV stat.ML

    Equivariant Neural Rendering

    Authors: Emilien Dupont, Miguel Angel Bautista, Alex Colburn, Aditya Sankar, Carlos Guestrin, Josh Susskind, Qi Shan

    Abstract: We propose a framework for learning neural scene representations directly from images, without 3D supervision. Our key insight is that 3D structure can be imposed by ensuring that the learned representation transforms like a real 3D scene. Specifically, we introduce a loss which enforces equivariance of the scene representation with respect to 3D transformations. Our formulation allows us to infer… ▽ More

    Submitted 21 December, 2020; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: Add link to code

  13. arXiv:2005.04118  [pdf, other

    cs.CL cs.LG

    Beyond Accuracy: Behavioral Testing of NLP models with CheckList

    Authors: Marco Tulio Ribeiro, Tongshuang Wu, Carlos Guestrin, Sameer Singh

    Abstract: Although measuring held-out accuracy has been the primary approach to evaluate generalization, it often overestimates the performance of NLP models, while alternative approaches for evaluating models either focus on individual tasks or on specific behaviors. Inspired by principles of behavioral testing in software engineering, we introduce CheckList, a task-agnostic methodology for testing NLP mod… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

    Journal ref: Association for Computational Linguistics (ACL), 2020

  14. arXiv:1910.13101  [pdf, other

    cs.LG stat.ML

    Adversarial Fisher Vectors for Unsupervised Representation Learning

    Authors: Shuangfei Zhai, Walter Talbott, Carlos Guestrin, Joshua M. Susskind

    Abstract: We examine Generative Adversarial Networks (GANs) through the lens of deep Energy Based Models (EBMs), with the goal of exploiting the density model that follows from this formulation. In contrast to a traditional view where the discriminator learns a constant function when reaching convergence, here we show that it can provide useful information for downstream tasks, e.g., feature extraction for… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: Accepted as spotlight presentation to NeurIPS 2019

  15. arXiv:1905.05895  [pdf, other

    cs.LG cs.CV stat.ML

    Addressing the Loss-Metric Mismatch with Adaptive Loss Alignment

    Authors: Chen Huang, Shuangfei Zhai, Walter Talbott, Miguel Angel Bautista, Shih-Yu Sun, Carlos Guestrin, Josh Susskind

    Abstract: In most machine learning training paradigms a fixed, often handcrafted, loss function is assumed to be a good proxy for an underlying evaluation metric. In this work we assess this assumption by meta-learning an adaptive loss function to directly optimize the evaluation metric. We propose a sample efficient reinforcement learning approach for adapting the loss dynamically during training. We empir… ▽ More

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: Accepted to ICML 2019

  16. arXiv:1809.07841  [pdf, other

    cs.SI cs.CY physics.soc-ph

    Over-Optimization of Academic Publishing Metrics: Observing Goodhart's Law in Action

    Authors: Michael Fire, Carlos Guestrin

    Abstract: The academic publishing world is changing significantly, with ever-growing numbers of publications each year and shifting publishing patterns. However, the metrics used to measure academic success, such as the number of publications, citation number, and impact factor, have not changed for decades. Moreover, recent studies indicate that these metrics have become targets and follow Goodhart's Law,… ▽ More

    Submitted 20 September, 2018; originally announced September 2018.

  17. arXiv:1807.08046  [pdf, other

    stat.ML cs.LG

    A Fast, Principled Working Set Algorithm for Exploiting Piecewise Linear Structure in Convex Problems

    Authors: Tyler B. Johnson, Carlos Guestrin

    Abstract: By reducing optimization to a sequence of smaller subproblems, working set algorithms achieve fast convergence times for many machine learning problems. Despite such performance, working set implementations often resort to heuristics to determine subproblem size, makeup, and stop** criteria. We propose BlitzWS, a working set algorithm with useful theoretical guarantees. Our theory relates subpro… ▽ More

    Submitted 20 July, 2018; originally announced July 2018.

  18. arXiv:1807.04188  [pdf, other

    cs.LG cs.DC stat.ML

    A Hardware-Software Blueprint for Flexible Deep Learning Specialization

    Authors: Thierry Moreau, Tianqi Chen, Luis Vega, Jared Roesch, Eddie Yan, Lianmin Zheng, Josh Fromm, Ziheng Jiang, Luis Ceze, Carlos Guestrin, Arvind Krishnamurthy

    Abstract: Specialized Deep Learning (DL) acceleration stacks, designed for a specific set of frameworks, model architectures, operators, and data types, offer the allure of high performance while sacrificing flexibility. Changes in algorithms, models, operators, or numerical systems threaten the viability of specialized hardware accelerators. We propose VTA, a programmable deep learning architecture templat… ▽ More

    Submitted 22 April, 2019; v1 submitted 11 July, 2018; originally announced July 2018.

    Comments: 6 pages plus references, 8 figures

  19. arXiv:1805.08166  [pdf, other

    cs.LG stat.ML

    Learning to Optimize Tensor Programs

    Authors: Tianqi Chen, Lianmin Zheng, Eddie Yan, Ziheng Jiang, Thierry Moreau, Luis Ceze, Carlos Guestrin, Arvind Krishnamurthy

    Abstract: We introduce a learning-based framework to optimize tensor programs for deep learning workloads. Efficient implementations of tensor operators, such as matrix multiplication and high dimensional convolution, are key enablers of effective deep learning systems. However, existing systems rely on manually optimized libraries such as cuDNN where only a narrow range of server class GPUs are well-suppor… ▽ More

    Submitted 8 January, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: NeurIPS 2018

  20. arXiv:1805.00184  [pdf, other

    stat.ML cs.AI cs.LG

    Compact Factorization of Matrices Using Generalized Round-Rank

    Authors: Pouya Pezeshkpour, Carlos Guestrin, Sameer Singh

    Abstract: Matrix factorization is a well-studied task in machine learning for compactly representing large, noisy data. In our approach, instead of using the traditional concept of matrix rank, we define a new notion of link-rank based on a non-linear link function used within factorization. In particular, by applying the round function on a factorization to obtain ordinal-valued matrices, we introduce gene… ▽ More

    Submitted 1 May, 2018; originally announced May 2018.

  21. arXiv:1802.04799  [pdf, other

    cs.LG cs.AI cs.PL

    TVM: An Automated End-to-End Optimizing Compiler for Deep Learning

    Authors: Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Yan, Meghan Cowan, Haichen Shen, Leyuan Wang, Yuwei Hu, Luis Ceze, Carlos Guestrin, Arvind Krishnamurthy

    Abstract: There is an increasing need to bring machine learning to a wide diversity of hardware devices. Current frameworks rely on vendor-specific operator libraries and optimize for a narrow range of server-class GPUs. Deploying workloads to new platforms -- such as mobile phones, embedded devices, and accelerators (e.g., FPGAs, ASICs) -- requires significant manual effort. We propose TVM, a compiler that… ▽ More

    Submitted 5 October, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

    Comments: Significantly improved version, add automated optimization

  22. arXiv:1706.06690  [pdf, other

    cs.SI cs.CY physics.data-an physics.soc-ph

    The Rise and Fall of Network Stars: Analyzing 2.5 million graphs to reveal how high-degree vertices emerge over time

    Authors: Michael Fire, Carlos Guestrin

    Abstract: Trends change rapidly in today's world, prompting this key question: What is the mechanism behind the emergence of new trends? By representing real-world dynamic systems as complex networks, the emergence of new trends can be symbolized by vertices that "shine." That is, at a specific time interval in a network's life, certain vertices become increasingly connected to other vertices. This process… ▽ More

    Submitted 13 October, 2018; v1 submitted 20 June, 2017; originally announced June 2017.

    Comments: Keywords: Complex Systems; Dynamic Systems; Network Science; Big Data; Complex Network Evolution; Network Datasets;Social Networks

    MSC Class: 91D30; 05C82; 90B10

  23. arXiv:1611.07579  [pdf, other

    stat.ML cs.AI cs.LG

    Programs as Black-Box Explanations

    Authors: Sameer Singh, Marco Tulio Ribeiro, Carlos Guestrin

    Abstract: Recent work in model-agnostic explanations of black-box machine learning has demonstrated that interpretability of complex models does not have to come at the cost of accuracy or model flexibility. However, it is not clear what kind of explanations, such as linear models, decision trees, and rule lists, are the appropriate family to consider, and different tasks and models may benefit from differe… ▽ More

    Submitted 22 November, 2016; originally announced November 2016.

    Comments: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems

  24. arXiv:1611.05817  [pdf, other

    stat.ML cs.AI cs.LG

    Nothing Else Matters: Model-Agnostic Explanations By Identifying Prediction Invariance

    Authors: Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin

    Abstract: At the core of interpretable machine learning is the question of whether humans are able to make accurate predictions about a model's behavior. Assumed in this question are three properties of the interpretable output: coverage, precision, and effort. Coverage refers to how often humans think they can predict the model's behavior, precision to how accurate humans are in those predictions, and effo… ▽ More

    Submitted 17 November, 2016; originally announced November 2016.

    Comments: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems

  25. arXiv:1606.05386  [pdf, other

    stat.ML cs.LG

    Model-Agnostic Interpretability of Machine Learning

    Authors: Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin

    Abstract: Understanding why machine learning models behave the way they do empowers both system designers and end-users in many ways: in model selection, feature engineering, in order to trust and act upon the predictions, and in more intuitive user interfaces. Thus, interpretability has become a vital concern in machine learning, and work in the area of interpretable models has found renewed interest. In s… ▽ More

    Submitted 16 June, 2016; originally announced June 2016.

    Comments: presented at 2016 ICML Workshop on Human Interpretability in Machine Learning (WHI 2016), New York, NY

  26. arXiv:1606.00399  [pdf, other

    cs.LG math.CO stat.ML

    Scaling Submodular Maximization via Pruned Submodularity Graphs

    Authors: Tianyi Zhou, Hua Ouyang, Yi Chang, Jeff Bilmes, Carlos Guestrin

    Abstract: We propose a new random pruning method (called "submodular sparsification (SS)") to reduce the cost of submodular maximization. The pruning is applied via a "submodularity graph" over the $n$ ground elements, where each directed edge is associated with a pairwise dependency defined by the submodular function. In each step, SS prunes a $1-1/\sqrt{c}$ (for $c>1$) fraction of the nodes using weights… ▽ More

    Submitted 1 June, 2016; originally announced June 2016.

  27. arXiv:1604.06174  [pdf, other

    cs.LG

    Training Deep Nets with Sublinear Memory Cost

    Authors: Tianqi Chen, Bing Xu, Chiyuan Zhang, Carlos Guestrin

    Abstract: We propose a systematic approach to reduce the memory consumption of deep neural network training. Specifically, we design an algorithm that costs O(sqrt(n)) memory to train a n layer network, with only the computational cost of an extra forward pass per mini-batch. As many of the state-of-the-art models hit the upper bound of the GPU memory, our algorithm allows deeper and more complex models to… ▽ More

    Submitted 22 April, 2016; v1 submitted 21 April, 2016; originally announced April 2016.

  28. arXiv:1603.07445  [pdf, other

    cs.SI physics.soc-ph

    Time Is of the Essence: Analyzing the Effect of Vertex-Joining Time on Complex Network Evolution

    Authors: Michael Fire, Carlos Guestrin

    Abstract: Complex networks have non-trivial characteristics and appear in many real-world systems. Such networks are vitally important, but their full underlying dynamics are not completely understood. Utilizing new data sources, however, can unveil the evolution process of these networks. This study uses the recently published Reddit dataset, containing over 1.65 billion comments, to construct the larges… ▽ More

    Submitted 25 August, 2016; v1 submitted 24 March, 2016; originally announced March 2016.

  29. XGBoost: A Scalable Tree Boosting System

    Authors: Tianqi Chen, Carlos Guestrin

    Abstract: Tree boosting is a highly effective and widely used machine learning method. In this paper, we describe a scalable end-to-end tree boosting system called XGBoost, which is used widely by data scientists to achieve state-of-the-art results on many machine learning challenges. We propose a novel sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning. More… ▽ More

    Submitted 10 June, 2016; v1 submitted 8 March, 2016; originally announced March 2016.

    Comments: KDD'16 changed all figures to type1

  30. arXiv:1602.04938  [pdf, other

    cs.LG cs.AI stat.ML

    "Why Should I Trust You?": Explaining the Predictions of Any Classifier

    Authors: Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin

    Abstract: Despite widespread adoption, machine learning models remain mostly black boxes. Understanding the reasons behind predictions is, however, quite important in assessing trust, which is fundamental if one plans to take action based on a prediction, or when choosing whether to deploy a new model. Such understanding also provides insights into the model, which can be used to transform an untrustworthy… ▽ More

    Submitted 9 August, 2016; v1 submitted 16 February, 2016; originally announced February 2016.

  31. arXiv:1408.2041  [pdf

    cs.LG cs.DC

    GraphLab: A New Framework For Parallel Machine Learning

    Authors: Yucheng Low, Joseph E. Gonzalez, Aapo Kyrola, Danny Bickson, Carlos E. Guestrin, Joseph Hellerstein

    Abstract: Designing and implementing efficient, provably correct parallel machine learning (ML) algorithms is challenging. Existing high-level parallel abstractions like MapReduce are insufficiently expressive while low-level tools like MPI and Pthreads leave ML experts repeatedly solving the same design challenges. By targeting common patterns in ML, we developed GraphLab, which improves upon abstractions… ▽ More

    Submitted 9 August, 2014; originally announced August 2014.

    Comments: Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

    Report number: UAI-P-2010-PG-340-349

  32. arXiv:1406.5752  [pdf, other

    stat.ML cs.LG

    Divide-and-Conquer Learning by Anchoring a Conical Hull

    Authors: Tianyi Zhou, Jeff Bilmes, Carlos Guestrin

    Abstract: We reduce a broad class of machine learning problems, usually addressed by EM or sampling, to the problem of finding the $k$ extremal rays spanning the conical hull of a data point set. These $k$ "anchors" lead to a global solution and a more interpretable model that can even outperform EM and sampling on generalization error. To find the $k$ anchors, we propose a novel divide-and-conquer learning… ▽ More

    Submitted 22 June, 2014; originally announced June 2014.

    Comments: 26 pages, long version, in updating

  33. arXiv:1403.0701  [pdf, other

    cs.DB

    GraphChi-DB: Simple Design for a Scalable Graph Database System -- on Just a PC

    Authors: Aapo Kyrola, Carlos Guestrin

    Abstract: We propose a new data structure, Parallel Adjacency Lists (PAL), for efficiently managing graphs with billions of edges on disk. The PAL structure is based on the graph storage model of GraphChi (Kyrola et. al., OSDI 2012), but we extend it to enable online database features such as queries and fast insertions. In addition, we extend the model with edge and vertex attributes. Compared to previous… ▽ More

    Submitted 4 March, 2014; originally announced March 2014.

  34. arXiv:1402.4102  [pdf, other

    stat.ME cs.LG stat.ML

    Stochastic Gradient Hamiltonian Monte Carlo

    Authors: Tianqi Chen, Emily B. Fox, Carlos Guestrin

    Abstract: Hamiltonian Monte Carlo (HMC) sampling methods provide a mechanism for defining distant proposals with high acceptance probabilities in a Metropolis-Hastings framework, enabling more efficient exploration of the state space than standard random-walk proposals. The popularity of such methods has grown significantly in recent years. However, a limitation of HMC methods is the required gradient compu… ▽ More

    Submitted 12 May, 2014; v1 submitted 17 February, 2014; originally announced February 2014.

    Comments: ICML 2014 version

  35. Riffled Independence for Efficient Inference with Partial Rankings

    Authors: Jonathan Huang, Ashish Kapoor, Carlos Guestrin

    Abstract: Distributions over rankings are used to model data in a multitude of real world settings such as preference analysis and political elections. Modeling such distributions presents several computational challenges, however, due to the factorial size of the set of rankings over an item set. Some of these challenges are quite familiar to the artificial intelligence community, such as how to compactly… ▽ More

    Submitted 22 January, 2014; originally announced January 2014.

    Comments: arXiv admin note: text overlap with arXiv:1202.3734

    Journal ref: Journal Of Artificial Intelligence Research, Volume 44, pages 491-532, 2012

  36. Optimal Value of Information in Graphical Models

    Authors: Andreas Krause, Carlos Guestrin

    Abstract: Many real-world decision making tasks require us to choose among several expensive observations. In a sensor network, for example, it is important to select the subset of sensors that is expected to provide the strongest reduction in uncertainty. In medical decision making tasks, one needs to select which tests to administer before deciding on the most effective treatment. It has been general prac… ▽ More

    Submitted 15 January, 2014; originally announced January 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 35, pages 557-591, 2009

  37. arXiv:1401.3462  [pdf

    cs.RO cs.AI

    Efficient Informative Sensing using Multiple Robots

    Authors: Amarjeet Singh, Andreas Krause, Carlos Guestrin, William J. Kaiser

    Abstract: The need for efficient monitoring of spatio-temporal dynamics in large environmental applications, such as the water quality monitoring in rivers and lakes, motivates the use of robotic sensors in order to achieve sufficient spatial coverage. Typically, these robots have bounded resources, such as limited battery or limited amounts of time to obtain measurements. Thus, careful coordination of thei… ▽ More

    Submitted 15 January, 2014; originally announced January 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 34, pages 707-755, 2009

  38. arXiv:1301.2273  [pdf

    cs.AI eess.SY

    Robust Combination of Local Controllers

    Authors: Carlos E. Guestrin, Dirk Ormoneit

    Abstract: Planning problems are hard, motion planning, for example, isPSPACE-hard. Such problems are even more difficult in the presence of uncertainty. Although, Markov Decision Processes (MDPs) provide a formal framework for such problems, finding solutions to high dimensional continuous MDPs is usually difficult, especially when the actions and time measurements are continuous. Fortunately, problem-spec… ▽ More

    Submitted 10 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI2001)

    Report number: UAI-P-2001-PG-178-185

  39. arXiv:1301.0571  [pdf

    cs.AI

    Distributed Planning in Hierarchical Factored MDPs

    Authors: Carlos E. Guestrin, Geoffrey Gordon

    Abstract: We present a principled and efficient planning algorithm for collaborative multiagent dynamical systems. All computation, during both the planning and the execution phases, is distributed among the agents; each agent only needs to model and plan for a small part of the system. Each of these local subsystems is small, but once they are combined they can represent an exponentially larger problem.… ▽ More

    Submitted 12 December, 2012; originally announced January 2013.

    Comments: Appears in Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI2002)

    Report number: UAI-P-2002-PG-197-206

  40. arXiv:1207.4174  [pdf

    cs.AI cs.DC

    Robust Probabilistic Inference in Distributed Systems

    Authors: Mark Paskin, Carlos E. Guestrin

    Abstract: Probabilistic inference problems arise naturally in distributed systems such as sensor networks and teams of mobile robots. Inference algorithms that use message passing are a natural fit for distributed systems, but they must be robust to the failure situations that arise in real-world settings, such as unreliable communication and node failures. Unfortunately, the popular sum-product algorithm c… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-436-445

  41. arXiv:1207.4150  [pdf

    cs.AI

    Solving Factored MDPs with Continuous and Discrete Variables

    Authors: Carlos E. Guestrin, Milos Hauskrecht, Branislav Kveton

    Abstract: Although many real-world stochastic planning problems are more naturally formulated by hybrid models with both discrete and continuous variables, current state-of-the-art methods cannot adequately address these problems. We present the first framework that can exploit problem structure for modeling and solving hybrid problems efficiently. We formulate these problems as hybrid Markov decision proce… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-235-242

  42. arXiv:1207.1394  [pdf

    cs.AI

    Near-optimal Nonmyopic Value of Information in Graphical Models

    Authors: Andreas Krause, Carlos E. Guestrin

    Abstract: A fundamental issue in real-world systems, such as sensor networks, is the selection of observations which most effectively reduce uncertainty. More specifically, we address the long standing problem of nonmyopically selecting the most informative subset of variables in a graphical model. We present the first efficient randomized algorithm providing a constant factor (1-1/e-epsilon) approximation… ▽ More

    Submitted 4 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)

    Report number: UAI-P-2005-PG-324-331

  43. arXiv:1206.6454  [pdf

    cs.LG stat.ML

    Hierarchical Exploration for Accelerating Contextual Bandits

    Authors: Yisong Yue, Sue Ann Hong, Carlos Guestrin

    Abstract: Contextual bandit learning is an increasingly popular approach to optimizing recommender systems via user feedback, but can be slow to converge in practice due to the need for exploring a large feature space. In this paper, we propose a coarse-to-fine hierarchical approach for encoding prior knowledge that drastically reduces the amount of exploration required. Intuitively, user preferences can be… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)

  44. arXiv:1205.2645  [pdf

    cs.AI cs.DC

    Distributed Parallel Inference on Large Factor Graphs

    Authors: Joseph E. Gonzalez, Yucheng Low, Carlos E. Guestrin, David O'Hallaron

    Abstract: As computer clusters become more common and the size of the problems encountered in the field of AI grows, there is an increasing demand for efficient parallel inference algorithms. We consider the problem of parallel inference on large factor graphs in the distributed memory setting of computer clusters. We develop a new efficient parallel inference algorithm, DBRSplash, which incorporates over-s… ▽ More

    Submitted 9 May, 2012; originally announced May 2012.

    Comments: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

    Report number: UAI-P-2009-PG-203-212

  45. arXiv:1204.6078  [pdf, other

    cs.DB cs.LG

    Distributed GraphLab: A Framework for Machine Learning in the Cloud

    Authors: Yucheng Low, Joseph Gonzalez, Aapo Kyrola, Danny Bickson, Carlos Guestrin, Joseph M. Hellerstein

    Abstract: While high-level data parallel frameworks, like MapReduce, simplify the design and implementation of large-scale data processing systems, they do not naturally or efficiently support many important data mining and machine learning algorithms and can lead to inefficient learning systems. To help fill this critical void, we introduced the GraphLab abstraction which naturally expresses asynchronous,… ▽ More

    Submitted 26 April, 2012; originally announced April 2012.

    Comments: VLDB2012

    Journal ref: Proceedings of the VLDB Endowment (PVLDB), Vol. 5, No. 8, pp. 716-727 (2012)

  46. arXiv:1204.2523  [pdf, other

    stat.ML cs.CL cs.IR cs.LG

    Concept Modeling with Superwords

    Authors: Khalid El-Arini, Emily B. Fox, Carlos Guestrin

    Abstract: In information retrieval, a fundamental goal is to transform a document into concepts that are representative of its content. The term "representative" is in itself challenging to define, and various tasks require different granularities of concepts. In this paper, we aim to model concepts that are sparse over the vocabulary, and that flexibly adapt their content based on other relevant semantic i… ▽ More

    Submitted 11 April, 2012; originally announced April 2012.

  47. arXiv:1202.3734  [pdf

    cs.LG cs.AI stat.ML

    Efficient Probabilistic Inference with Partial Ranking Queries

    Authors: Jonathan Huang, Ashish Kapoor, Carlos E. Guestrin

    Abstract: Distributions over rankings are used to model data in various settings such as preference analysis and political elections. The factorial size of the space of rankings, however, typically forces one to make structural assumptions, such as smoothness, sparsity, or probabilistic independence about these underlying distributions. We approach the modeling problem from the computational principle that… ▽ More

    Submitted 14 February, 2012; originally announced February 2012.

    Report number: UAI-P-2011-PG-355-362

  48. Solving Factored MDPs with Hybrid State and Action Variables

    Authors: C. Guestrin, M. Hauskrecht, B. Kveton

    Abstract: Efficient representations and solutions for large decision problems with continuous and discrete variables are among the most important challenges faced by the designers of automated decision support systems. In this paper, we describe a novel hybrid factored Markov decision process (MDP) model that allows for a compact representation of these problems, and a new hybrid approximate linear programm… ▽ More

    Submitted 30 September, 2011; originally announced October 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 27, pages 153-201, 2006

  49. arXiv:1107.0922  [pdf, other

    cs.LG

    GraphLab: A Distributed Framework for Machine Learning in the Cloud

    Authors: Yucheng Low, Joseph Gonzalez, Aapo Kyrola, Danny Bickson, Carlos Guestrin

    Abstract: Machine Learning (ML) techniques are indispensable in a wide range of fields. Unfortunately, the exponential increase of dataset sizes are rapidly extending the runtime of sequential algorithms and threatening to slow future progress in ML. With the promise of affordable large-scale parallel computing, Cloud systems offer a viable platform to resolve the computational challenges in ML. However, de… ▽ More

    Submitted 5 July, 2011; originally announced July 2011.

    Comments: CMU Tech Report, GraphLab project webpage: http://graphlab.org

  50. Efficient Solution Algorithms for Factored MDPs

    Authors: C. Guestrin, D. Koller, R. Parr, S. Venkataraman

    Abstract: This paper addresses the problem of planning under uncertainty in large Markov Decision Processes (MDPs). Factored MDPs represent a complex state space using state variables and the transition model using a dynamic Bayesian network. This representation often allows an exponential reduction in the representation size of structured MDPs, but the complexity of exact solution algorithms for such MDPs… ▽ More

    Submitted 9 June, 2011; originally announced June 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 19, pages 399-468, 2003