
Showing 1–50 of 54 results for author: Bilmes, J

  1. arXiv:2403.08199  [pdf, other]

    cs.LG cs.AI

    Deep Submodular Peripteral Networks

    Authors: Gantavya Bhatt, Arnav Das, Jeff Bilmes

    Abstract: Submodular functions, crucial for various applications, often lack practical learning methods for their acquisition. Seemingly unrelated, learning a scaling from oracles offering graded pairwise preferences (GPC) is underexplored, despite a rich history in psychometrics. In this paper, we introduce deep submodular peripteral networks (DSPNs), a novel parametric family of submodular functions, and…

    Submitted 15 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Preprint

  2. arXiv:2403.04099  [pdf, other]

    cs.LG

    Many-Objective Multi-Solution Transport

    Authors: Ziyue Li, Tian Li, Virginia Smith, Jeff Bilmes, Tianyi Zhou

    Abstract: Optimizing the performance of many objectives (instantiated by tasks or clients) jointly with a few Pareto stationary solutions (models) is critical in machine learning. However, previous multi-objective optimization methods often focus on a small number of objectives and cannot scale to many objectives that outnumber the solutions, leading to either subpar performance or ignored objectives. We intr…

    Submitted 6 March, 2024; originally announced March 2024.

  3. arXiv:2401.06692  [pdf, other]

    cs.CL cs.AI cs.LG

    An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models

    Authors: Gantavya Bhatt, Yifang Chen, Arnav M. Das, Jifan Zhang, Sang T. Truong, Stephen Mussmann, Yinglun Zhu, Jeffrey Bilmes, Simon S. Du, Kevin Jamieson, Jordan T. Ash, Robert D. Nowak

    Abstract: Supervised finetuning (SFT) on instruction datasets has played a crucial role in achieving the remarkable zero-shot generalization capabilities observed in modern large language models (LLMs). However, the annotation efforts required to produce high quality responses for instructions are becoming prohibitively expensive, especially as the number of tasks spanned by instruction datasets continues t…

    Submitted 6 May, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

  4. arXiv:2311.14948  [pdf, other]

    cs.LG cs.AI cs.CV

    Effective Backdoor Mitigation Depends on the Pre-training Objective

    Authors: Sahil Verma, Gantavya Bhatt, Avi Schwarzschild, Soumye Singhal, Arnav Mohanty Das, Chirag Shah, John P Dickerson, Jeff Bilmes

    Abstract: Despite the advanced capabilities of contemporary machine learning (ML) models, they remain vulnerable to adversarial and backdoor attacks. This vulnerability is particularly concerning in real-world deployments, where compromised models may exhibit unpredictable behavior in critical scenarios. Such risks are heightened by the prevalent practice of collecting massive, internet-sourced datasets for…

    Submitted 5 December, 2023; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: Accepted for oral presentation at BUGS workshop @ NeurIPS 2023 (https://neurips2023-bugs.github.io/)

  5. arXiv:2306.09910  [pdf, other]

    cs.LG cs.AI cs.CV

    LabelBench: A Comprehensive Framework for Benchmarking Adaptive Label-Efficient Learning

    Authors: Jifan Zhang, Yifang Chen, Gregory Canal, Stephen Mussmann, Arnav M. Das, Gantavya Bhatt, Yinglun Zhu, Jeffrey Bilmes, Simon Shaolei Du, Kevin Jamieson, Robert D Nowak

    Abstract: Labeled data are critical to modern machine learning applications, but obtaining labels can be expensive. To mitigate this cost, machine learning methods, such as transfer learning, semi-supervised learning and active learning, aim to be label-efficient: achieving high predictive performance from relatively few labeled examples. While obtaining the best label-efficiency in practice often requires…

    Submitted 1 March, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

  6. arXiv:2305.06408  [pdf, other]

    cs.LG

    Accelerating Batch Active Learning Using Continual Learning Techniques

    Authors: Arnav Das, Gantavya Bhatt, Megh Bhalerao, Vianne Gao, Rui Yang, Jeff Bilmes

    Abstract: A major problem with Active Learning (AL) is high training costs since models are typically retrained from scratch after every query round. We start by demonstrating that standard AL on neural networks with warm starting fails, both to accelerate training and to avoid catastrophic forgetting when using fine-tuning over AL query rounds. We then develop a new class of techniques, circumventing this…

    Submitted 12 December, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

    Comments: Appeared in TMLR 2023

  7. arXiv:2207.03091  [pdf, other]

    cs.LG

    Online SuBmodular + SuPermodular (BP) Maximization with Bandit Feedback

    Authors: Adhyyan Narang, Omid Sadeghi, Lillian J Ratliff, Maryam Fazel, Jeff Bilmes

    Abstract: In the context of online interactive machine learning with combinatorial objectives, we extend purely submodular prior work to more general non-submodular objectives. This includes: (1) those that are additively decomposable into a sum of two terms (a monotone submodular and monotone supermodular term, known as a BP decomposition); and (2) those that are only weakly submodular. In both cases, this…

    Submitted 12 May, 2024; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: 37 pages, 4 figures

  8. High Resolution Point Clouds from mmWave Radar

    Authors: Akarsh Prabhakara, Tao Jin, Arnav Das, Gantavya Bhatt, Lilly Kumari, Elahe Soltanaghaei, Jeff Bilmes, Swarun Kumar, Anthony Rowe

    Abstract: This paper explores a machine learning approach for generating high resolution point clouds from a single-chip mmWave radar. Unlike lidar and vision-based systems, mmWave radar can operate in harsh environments and see through occlusions like smoke, fog, and dust. Unfortunately, current mmWave processing techniques offer poor spatial resolution compared to lidar point clouds. This paper presents R…

    Submitted 16 July, 2023; v1 submitted 18 June, 2022; originally announced June 2022.

    Journal ref: 2023 IEEE International Conference on Robotics and Automation (ICRA), London, United Kingdom, 2023, pp. 4135-4142

  9. arXiv:2202.00132  [pdf, other]

    cs.LG cs.AI

    Submodularity In Machine Learning and Artificial Intelligence

    Authors: Jeff Bilmes

    Abstract: In this manuscript, we offer a gentle review of submodularity and supermodularity and their properties. We offer a plethora of submodular definitions; a full description of a number of example submodular functions and their generalizations; example discrete constraints; a discussion of basic algorithms for maximization, minimization, and other operations; a brief overview of continuous submodular…

    Submitted 4 October, 2022; v1 submitted 31 January, 2022; originally announced February 2022.

  10. arXiv:2108.03154  [pdf, other]

    cs.IT math.CO

    Independence Properties of Generalized Submodular Information Measures

    Authors: Himanshu Asnani, Jeff Bilmes, Rishabh Iyer

    Abstract: Recently a class of generalized information measures was defined on sets of items parametrized by submodular functions. In this paper, we propose and study various notions of independence between sets with respect to such information measures, and connections thereof. Since entropy can also be used to parametrize such measures, we derive interesting independence properties for the entropy of sets…

    Submitted 6 August, 2021; originally announced August 2021.

    Comments: This paper was accepted at ISIT 2021. arXiv admin note: text overlap with arXiv:2006.15412

  11. arXiv:2105.07107  [pdf, other]

    cs.LG cs.AI

    An Effective Baseline for Robustness to Distributional Shift

    Authors: Sunil Thulasidasan, Sushil Thapa, Sayera Dhaubhadel, Gopinath Chennupati, Tanmoy Bhattacharya, Jeff Bilmes

    Abstract: Refraining from confidently predicting when faced with categories of inputs different from those seen during training is an important requirement for the safe deployment of deep learning systems. While simple to state, this has been a particularly challenging problem in deep learning, where models often end up making overconfident predictions in such situations. In this work we present a simple, b…

    Submitted 14 May, 2021; originally announced May 2021.

  12. arXiv:2105.00043  [pdf, other]

    cs.LG cs.CV

    Submodular Mutual Information for Targeted Data Subset Selection

    Authors: Suraj Kothawade, Vishal Kaushal, Ganesh Ramakrishnan, Jeff Bilmes, Rishabh Iyer

    Abstract: With the rapid growth of data, it is becoming increasingly difficult to train or improve deep learning models with the right subset of data. We show that this problem can be effectively solved at an additional labeling cost by targeted data subset selection (TSS) where a subset of unlabeled data points similar to an auxiliary set are added to the training data. We do so by using a rich class of Sub…

    Submitted 30 April, 2021; originally announced May 2021.

    Comments: Accepted to ICLR 2021 S2D-OLAD Workshop; https://s2d-olad.github.io/. arXiv admin note: substantial text overlap with arXiv:2103.00128

  13. arXiv:2103.00128  [pdf, other]

    cs.CV

    PRISM: A Rich Class of Parameterized Submodular Information Measures for Guided Subset Selection

    Authors: Suraj Kothawade, Vishal Kaushal, Ganesh Ramakrishnan, Jeff Bilmes, Rishabh Iyer

    Abstract: With ever-increasing dataset sizes, subset selection techniques are becoming increasingly important for a plethora of tasks. It is often necessary to guide the subset selection to achieve certain desiderata, which include focusing or targeting certain data points, while avoiding others. Examples of such problems include: i) targeted learning, where the goal is to find subsets with rare classes or…

    Submitted 8 March, 2022; v1 submitted 26 February, 2021; originally announced March 2021.

    Comments: To Appear In 36th AAAI Conference on Artificial Intelligence, AAAI 2022

  14. arXiv:2010.05631  [pdf, other]

    cs.LG cs.CV

    A Unified Framework for Generic, Query-Focused, Privacy Preserving and Update Summarization using Submodular Information Measures

    Authors: Vishal Kaushal, Suraj Kothawade, Ganesh Ramakrishnan, Jeff Bilmes, Himanshu Asnani, Rishabh Iyer

    Abstract: We study submodular information measures as a rich framework for generic, query-focused, privacy sensitive, and update summarization tasks. While past work generally treats these problems differently (e.g., different models are often used for generic and query-focused summarization), the submodular information measures allow us to study each of these problems via a unified approach. We first…

    Submitted 12 October, 2020; originally announced October 2020.

    Comments: 35 pages, 14 figures, 5 tables

  15. arXiv:2006.16784  [pdf, ps, other]

    cs.DM cs.IT cs.LG math.CO math.OC

    Concave Aspects of Submodular Functions

    Authors: Rishabh Iyer, Jeff Bilmes

    Abstract: Submodular Functions are a special class of set functions, which generalize several information-theoretic quantities such as entropy and mutual information [1]. Submodular functions have subgradients and subdifferentials [2] and admit polynomial-time algorithms for minimization, both of which are fundamental characteristics of convex functions. Submodular functions also show signs similar to conca…

    Submitted 27 June, 2020; originally announced June 2020.

    Comments: Also appearing in International Symposium of Information Theory. arXiv admin note: substantial text overlap with arXiv:1506.07329

  16. arXiv:2006.15412  [pdf, ps, other]

    cs.LG cs.DS cs.IT math.OC stat.ML

    Submodular Combinatorial Information Measures with Applications in Machine Learning

    Authors: Rishabh Iyer, Ninad Khargonkar, Jeff Bilmes, Himanshu Asnani

    Abstract: Information-theoretic quantities like entropy and mutual information have found numerous uses in machine learning. It is well known that there is a strong connection between these entropic quantities and submodularity since entropy over a set of random variables is submodular. In this paper, we study combinatorial information measures that generalize independence, (conditional) entropy, (condition…

    Submitted 2 March, 2021; v1 submitted 27 June, 2020; originally announced June 2020.

    Comments: To Appear in the 32nd International Conference on Algorithmic Learning Theory, ALT 2021

  17. arXiv:1906.03543  [pdf, other]

    cs.LG stat.ML

    apricot: Submodular selection for data summarization in Python

    Authors: Jacob Schreiber, Jeffrey Bilmes, William Stafford Noble

    Abstract: We present apricot, an open source Python package for selecting representative subsets from large data sets using submodular optimization. The package implements an efficient greedy selection algorithm that offers strong theoretical guarantees on the quality of the selected set. Two submodular set functions are implemented in apricot: facility location, which is broadly applicable but requires mem…

    Submitted 8 June, 2019; originally announced June 2019.

  18. arXiv:1906.01827  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    Coresets for Data-efficient Training of Machine Learning Models

    Authors: Baharan Mirzasoleiman, Jeff Bilmes, Jure Leskovec

    Abstract: Incremental gradient (IG) methods, such as stochastic gradient descent and its variants, are commonly used for large scale optimization in machine learning. Despite the sustained effort to make IG methods more data-efficient, it remains an open question how to select a training data subset that can theoretically and practically perform on par with the full dataset. Here we develop CRAIG, a method t…

    Submitted 16 November, 2020; v1 submitted 5 June, 2019; originally announced June 2019.

    Journal ref: International Conference on Machine Learning 2020

  19. arXiv:1905.11001  [pdf, other]

    stat.ML cs.LG

    On Mixup Training: Improved Calibration and Predictive Uncertainty for Deep Neural Networks

    Authors: Sunil Thulasidasan, Gopinath Chennupati, Jeff Bilmes, Tanmoy Bhattacharya, Sarah Michalak

    Abstract: Mixup (Zhang et al., 2017) is a recently proposed method for training deep neural networks where additional samples are generated during training by convexly combining random pairs of images and their associated labels. While simple to implement, it has been shown to be a surprisingly effective method of data augmentation for image classification: DNNs trained with mixup show noticeable gains in…

    Submitted 6 January, 2020; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: NeurIPS 2019
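
The convex combination of random example pairs described in the abstract above can be sketched as follows. This is an illustrative sketch, not the authors' code: the helper names `mixup_pair` and `mixup_batch`, the flat-list representation of images, and the default `alpha=0.4` for the Beta distribution are all assumptions.

```python
import random


def mixup_pair(x1, y1, x2, y2, lam):
    """Convexly combine two examples (flat feature lists) and their one-hot labels."""
    x = [lam * a + (1.0 - lam) * b for a, b in zip(x1, x2)]
    y = [lam * a + (1.0 - lam) * b for a, b in zip(y1, y2)]
    return x, y


def mixup_batch(xs, ys, alpha=0.4, rng=None):
    """Mix every example in a batch with a randomly chosen partner.

    One mixing weight lam ~ Beta(alpha, alpha) is drawn per batch; alpha=0.4
    is an assumed default, not a value taken from the paper.
    """
    rng = rng or random.Random()
    lam = rng.betavariate(alpha, alpha)
    partners = list(range(len(xs)))
    rng.shuffle(partners)  # random pairing; an example may be paired with itself
    mixed = [mixup_pair(xs[i], ys[i], xs[j], ys[j], lam)
             for i, j in enumerate(partners)]
    return [m[0] for m in mixed], [m[1] for m in mixed], lam


# With lam = 0.25, the result is 25% of the first example and 75% of the second.
x_mix, y_mix = mixup_pair([0.0, 2.0], [1.0, 0.0], [4.0, 2.0], [0.0, 1.0], 0.25)
```

Because the labels are mixed with the same weight as the inputs, each mixed label remains a valid probability vector.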

  20. arXiv:1905.10964  [pdf, other]

    stat.ML cs.LG

    Combating Label Noise in Deep Learning Using Abstention

    Authors: Sunil Thulasidasan, Tanmoy Bhattacharya, Jeff Bilmes, Gopinath Chennupati, Jamal Mohd-Yusof

    Abstract: We introduce a novel method to combat label noise when training deep neural networks for classification. We propose a loss function that permits abstention during training thereby allowing the DNN to abstain on confusing samples while continuing to learn and improve classification performance on the non-abstained samples. We show how such a deep abstaining classifier (DAC) can be used for robust l…

    Submitted 1 August, 2019; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: ICML 2019. Added source code link

  21. arXiv:1902.10176  [pdf, ps, other]

    cs.LG cs.AI cs.DS stat.ML

    A Memoization Framework for Scaling Submodular Optimization to Large Scale Problems

    Authors: Rishabh Iyer, Jeff Bilmes

    Abstract: We are motivated by large scale submodular optimization problems, where standard algorithms that treat the submodular functions in the value oracle model do not scale. In this paper, we present a model called the precomputational complexity model, along with a unifying memoization based framework, which looks at the specific form of the given submodular function. A key ingredient in…

    Submitted 26 February, 2019; originally announced February 2019.

    Comments: To Appear in Proc. AISTATS 2019

  22. arXiv:1902.10172  [pdf, other]

    cs.LG cs.DM

    Near Optimal Algorithms for Hard Submodular Programs with Discounted Cooperative Costs

    Authors: Rishabh Iyer, Jeff Bilmes

    Abstract: In this paper, we investigate a class of submodular problems which in general are very hard. These include minimizing a submodular cost function under combinatorial constraints, which include cuts, matchings, paths, etc., optimizing a submodular function under submodular cover and submodular knapsack constraints, and minimizing a ratio of submodular functions. All these problems appear in several…

    Submitted 26 February, 2019; originally announced February 2019.

    Comments: To Appear in Proc. AISTATS 2019

  23. arXiv:1801.07413  [pdf, other]

    cs.DM

    Greed is Still Good: Maximizing Monotone Submodular+Supermodular Functions

    Authors: Wenruo Bai, Jeffrey A. Bilmes

    Abstract: We analyze the performance of the greedy algorithm, and also a discrete semi-gradient based algorithm, for maximizing the sum of a suBmodular and suPermodular (BP) function (both of which are non-negative monotone non-decreasing) under two types of constraints, either a cardinality constraint or $p\geq 1$ matroid independence constraints. These problems occur naturally in several real-world applic…

    Submitted 23 January, 2018; originally announced January 2018.

  24. arXiv:1701.08939  [pdf, other]

    cs.LG

    Deep Submodular Functions

    Authors: Jeffrey Bilmes, Wenruo Bai

    Abstract: We start with an overview of a class of submodular functions called SCMMs (sums of concave composed with non-negative modular functions plus a final arbitrary modular). We then define a new class of submodular functions we call deep submodular functions or DSFs. We show that DSFs are a flexible parametric family of submodular functions that share many of the properties and advantages of deep…

    Submitted 31 January, 2017; originally announced January 2017.

  25. arXiv:1612.04899  [pdf, other]

    stat.ML cs.LG

    Semi-Supervised Phone Classification using Deep Neural Networks and Stochastic Graph-Based Entropic Regularization

    Authors: Sunil Thulasidasan, Jeffrey Bilmes

    Abstract: We describe a graph-based semi-supervised learning framework in the context of deep neural networks that uses a graph-based entropic regularizer to favor smooth solutions over a graph induced by the data. The main contribution of this work is a computationally efficient, stochastic graph-regularization technique that uses mini-batches that are consistent with the graph structure, but also provides…

    Submitted 30 May, 2018; v1 submitted 14 December, 2016; originally announced December 2016.

    Comments: InterSpeech Workshop on Machine Learning in Speech and Language Processing, 2016. Based on and extends work in arXiv:1612.04898

    Report number: LA-UR-16-24599

  26. arXiv:1612.04898  [pdf, other]

    stat.ML cs.DC cs.LG

    Efficient Distributed Semi-Supervised Learning using Stochastic Regularization over Affinity Graphs

    Authors: Sunil Thulasidasan, Jeffrey Bilmes, Garrett Kenyon

    Abstract: We describe a computationally efficient, stochastic graph-regularization technique that can be utilized for the semi-supervised training of deep neural networks in a parallel or distributed setting. We utilize a technique, first described in [13] for the construction of mini-batches for stochastic gradient descent (SGD) based on synthesized partitions of an affinity graph that are consistent with…

    Submitted 30 May, 2018; v1 submitted 14 December, 2016; originally announced December 2016.

    Comments: NIPS 2016 Workshop on Machine Learning Systems

    Report number: LA-UR-16-28681

  27. arXiv:1606.00399  [pdf, other]

    cs.LG math.CO stat.ML

    Scaling Submodular Maximization via Pruned Submodularity Graphs

    Authors: Tianyi Zhou, Hua Ouyang, Yi Chang, Jeff Bilmes, Carlos Guestrin

    Abstract: We propose a new random pruning method (called "submodular sparsification (SS)") to reduce the cost of submodular maximization. The pruning is applied via a "submodularity graph" over the $n$ ground elements, where each directed edge is associated with a pairwise dependency defined by the submodular function. In each step, SS prunes a $1-1/\sqrt{c}$ (for $c>1$) fraction of the nodes using weights…

    Submitted 1 June, 2016; originally announced June 2016.

  28. arXiv:1606.00389  [pdf, other]

    stat.ML cs.LG math.CO

    Stream Clipper: Scalable Submodular Maximization on Stream

    Authors: Tianyi Zhou, Jeff Bilmes

    Abstract: We propose a streaming submodular maximization algorithm "stream clipper" that performs as well as the offline greedy algorithm on document/video summarization in practice. It adds elements from a stream either to a solution set $S$ or to an extra buffer $B$ based on two adaptive thresholds, and improves $S$ by a final greedy step that starts from $S$ adding elements from $B$. During this process,…

    Submitted 12 February, 2018; v1 submitted 1 June, 2016; originally announced June 2016.

    Comments: 17 pages, 12 figures, submitted to conference

  29. arXiv:1602.01024  [pdf, ps, other]

    cs.LG

    On Deep Multi-View Representation Learning: Objectives and Optimization

    Authors: Weiran Wang, Raman Arora, Karen Livescu, Jeff Bilmes

    Abstract: We consider learning representations (features) in the setting in which we have access to multiple unlabeled views of the data for learning while only one view is available for downstream tasks. Previous work on this problem has proposed several techniques based on deep neural networks, typically involving either autoencoder-like networks with a reconstruction objective or paired feedforward netwo…

    Submitted 2 February, 2016; originally announced February 2016.

  30. arXiv:1511.02163  [pdf, other]

    cs.DS cs.AI cs.DM

    Submodular Hamming Metrics

    Authors: Jennifer Gillenwater, Rishabh Iyer, Bethany Lusch, Rahul Kidambi, Jeff Bilmes

    Abstract: We show that there is a largely unexplored class of functions (positive polymatroids) that can define proper discrete metrics over pairs of binary vectors and that are fairly tractable to optimize over. By exploiting submodularity, we are able to give hardness results and approximation algorithms for optimizing over such metrics. Additionally, we demonstrate empirically the effectiveness of these…

    Submitted 6 November, 2015; originally announced November 2015.

    Comments: 15 pages, 1 figure, a short version of this will appear in the NIPS 2015 conference

  31. arXiv:1510.08865  [pdf, other]

    cs.DS cs.DM cs.LG

    Mixed Robust/Average Submodular Partitioning: Fast Algorithms, Guarantees, and Applications to Parallel Machine Learning and Multi-Label Image Segmentation

    Authors: Kai Wei, Rishabh Iyer, Shengjie Wang, Wenruo Bai, Jeff Bilmes

    Abstract: We study two mixed robust/average-case submodular partitioning problems that we collectively call Submodular Partitioning. These problems generalize both purely robust instances of the problem (namely max-min submodular fair allocation (SFA) and min-max submodular load balancing (SLB)) and also generalize average-case instances (that is, the submodular welfare problem (SWP) and submodular multiway p…

    Submitted 16 August, 2016; v1 submitted 29 October, 2015; originally announced October 2015.

  32. arXiv:1506.07329  [pdf, other]

    cs.DM cs.DS

    Polyhedral aspects of Submodularity, Convexity and Concavity

    Authors: Rishabh Iyer, Jeff Bilmes

    Abstract: Seminal work by Edmonds and Lovasz shows the strong connection between submodularity and convexity. Submodular functions have tight modular lower bounds, and subdifferentials in a manner akin to convex functions. They also admit poly-time algorithms for minimization and satisfy the Fenchel duality theorem and the Discrete Separation Theorem, both of which are fundamental characteristics of convex…

    Submitted 8 September, 2015; v1 submitted 24 June, 2015; originally announced June 2015.

    Comments: 38 pages, 10 figures

  33. arXiv:1408.2062  [pdf]

    cs.LG stat.ML

    The Lovasz-Bregman Divergence and connections to rank aggregation, clustering, and web ranking

    Authors: Rishabh Iyer, Jeff A. Bilmes

    Abstract: We extend the recently introduced theory of Lovasz-Bregman (LB) divergences (Iyer & Bilmes 2012) in several ways. We show that they represent a distortion between a "score" and an "ordering", thus providing a new view of rank aggregation and order based clustering with interesting connections to web ranking. We show how the LB divergences have a number of properties akin to many permutation based…

    Submitted 9 August, 2014; originally announced August 2014.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-321-330

  34. arXiv:1408.2051  [pdf]

    cs.LG stat.ML

    Algorithms for Approximate Minimization of the Difference Between Submodular Functions, with Applications

    Authors: Rishabh Iyer, Jeff A. Bilmes

    Abstract: We extend the work of Narasimhan and Bilmes [30] for minimizing set functions representable as a difference between submodular functions. Similar to [30], our new algorithms are guaranteed to monotonically reduce the objective function at every step. We empirically and theoretically show that the per-iteration cost of our algorithms is much less than [30], and our algorithms can be used to efficient…

    Submitted 9 August, 2014; originally announced August 2014.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-407-417

  35. arXiv:1406.5752  [pdf, other]

    stat.ML cs.LG

    Divide-and-Conquer Learning by Anchoring a Conical Hull

    Authors: Tianyi Zhou, Jeff Bilmes, Carlos Guestrin

    Abstract: We reduce a broad class of machine learning problems, usually addressed by EM or sampling, to the problem of finding the $k$ extremal rays spanning the conical hull of a data point set. These $k$ "anchors" lead to a global solution and a more interpretable model that can even outperform EM and sampling on generalization error. To find the $k$ anchors, we propose a novel divide-and-conquer learning…

    Submitted 22 June, 2014; originally announced June 2014.

    Comments: 26 pages, long version, in updating

  36. arXiv:1402.0240  [pdf, other]

    cs.DS cs.CV cs.DM math.OC

    Graph Cuts with Interacting Edge Costs - Examples, Approximations, and Algorithms

    Authors: Stefanie Jegelka, Jeff Bilmes

    Abstract: We study an extension of the classical graph cut problem, wherein we replace the modular (sum of edge weights) cost function by a submodular set function defined over graph edges. Special cases of this problem have appeared in different applications in signal processing, machine learning, and computer vision. In this paper, we connect these applications via the generic formulation of "cooperative…

    Submitted 26 March, 2016; v1 submitted 2 February, 2014; originally announced February 2014.

    Comments: 46 pages

  37. arXiv:1311.2110  [pdf, other]

    cs.DS cs.DM cs.LG

    Curvature and Optimal Algorithms for Learning and Minimizing Submodular Functions

    Authors: Rishabh Iyer, Stefanie Jegelka, Jeff Bilmes

    Abstract: We investigate three related and important problems connected to machine learning: approximating a submodular function everywhere, learning a submodular function (in a PAC-like setting [53]), and constrained minimization of submodular functions. We show that the complexity of all three problems depends on the 'curvature' of the submodular function, and provide lower and upper bounds that refine an…

    Submitted 8 November, 2013; originally announced November 2013.

    Comments: 21 pages. A shorter version appeared in Advances of NIPS-2013

  38. arXiv:1311.2106  [pdf, other]

    cs.DS cs.AI cs.DM

    Submodular Optimization with Submodular Cover and Submodular Knapsack Constraints

    Authors: Rishabh Iyer, Jeff Bilmes

    Abstract: We investigate two new optimization problems -- minimizing a submodular function subject to a submodular lower bound constraint (submodular cover) and maximizing a submodular function subject to a submodular upper bound constraint (submodular knapsack). We are motivated by a number of real-world applications in machine learning including sensor placement and data subset selection, which require ma…

    Submitted 8 November, 2013; originally announced November 2013.

    Comments: 23 pages. A short version of this appeared in Advances of NIPS-2013

  39. arXiv:1308.5275  [pdf, other]

    cs.LG cs.IR stat.ML

    The Lovasz-Bregman Divergence and connections to rank aggregation, clustering, and web ranking

    Authors: Rishabh Iyer, Jeff Bilmes

    Abstract: We extend the recently introduced theory of Lovasz-Bregman (LB) divergences (Iyer & Bilmes, 2012) in several ways. We show that they represent a distortion between a 'score' and an 'ordering', thus providing a new view of rank aggregation and order based clustering with interesting connections to web ranking. We show how the LB divergences have a number of properties akin to many permutation based…

    Submitted 23 August, 2013; originally announced August 2013.

    Comments: 18 pages. A shorter version appeared in Proc. Uncertainty in Artificial Intelligence (UAI)-2013, Bellevue, WA

    Journal ref: UAI-2013

  40. arXiv:1308.1006  [pdf, other]

    cs.DS cs.DM cs.LG

    Fast Semidifferential-based Submodular Function Optimization

    Authors: Rishabh Iyer, Stefanie Jegelka, Jeff Bilmes

    Abstract: We present a practical and powerful new framework for both unconstrained and constrained submodular function optimization based on discrete semidifferentials (sub- and super-differentials). The resulting algorithms, which repeatedly compute and then efficiently optimize submodular semigradients, offer new and generalize many old methods for submodular optimization. Our approach, moreover, takes st…

    Submitted 5 August, 2013; originally announced August 2013.

    Comments: This work appeared in Proc. International Conference of Machine Learning (ICML, 2013)

  41. arXiv:1301.3837  [pdf]

    cs.LG cs.AI stat.ML

    Dynamic Bayesian Multinets

    Authors: Jeff A. Bilmes

    Abstract: In this work, dynamic Bayesian multinets are introduced where a Markov chain state at time t determines conditional independence patterns between random variables lying within a local time window surrounding t. It is shown how information-theoretic criterion functions can be used to induce sparse, discriminative, and class-conditional network structures that yield an optimal approximation to the…

    Submitted 16 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

    Report number: UAI-P-2000-PG-38-45

  42. arXiv:1212.2448  [pdf]

    cs.AI

    On Triangulating Dynamic Graphical Models

    Authors: Jeff A. Bilmes, Chris Bartels

    Abstract: This paper introduces new methodology to triangulate dynamic Bayesian networks (DBNs) and dynamic graphical models (DGMs). While most methods to triangulate such networks use some form of constrained elimination scheme based on properties of the underlying directed graph, we find it useful to view triangulation and elimination using properties only of the resulting undirected g…

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-47-56

  43. arXiv:1210.4904  [pdf]

    cs.CE q-bio.QM

    Spectrum Identification using a Dynamic Bayesian Network Model of Tandem Mass Spectra

    Authors: Ajit P. Singh, John Halloran, Jeff A. Bilmes, Katrin Kirchoff, William S. Noble

    Abstract: Shotgun proteomics is a high-throughput technology used to identify unknown proteins in a complex mixture. At the heart of this process is a prediction task, the spectrum identification problem, in which each fragmentation spectrum produced by a shotgun proteomics experiment must be mapped to the peptide (protein subsequence) which generated the spectrum. We propose a new algorithm for spectrum id…

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-775-785

  44. arXiv:1210.4871  [pdf]

    cs.LG cs.CL cs.IR stat.ML

    Learning Mixtures of Submodular Shells with Application to Document Summarization

    Authors: Hui Lin, Jeff A. Bilmes

    Abstract: We introduce a method to learn a mixture of submodular "shells" in a large-margin setting. A submodular shell is an abstract submodular function that can be instantiated with a ground set and a set of parameters to produce a submodular function. A mixture of such shells can then also be so instantiated to produce a more complex submodular function. What our algorithm learns are the mixture weights…

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-479-490

  45. arXiv:1207.4151  [pdf]

    cs.LG cs.DS stat.ML

    PAC-learning bounded tree-width Graphical Models

    Authors: Mukund Narasimhan, Jeff A. Bilmes

    Abstract: We show that the class of strongly connected graphical models with treewidth at most k can be properly efficiently PAC-learnt with respect to the Kullback-Leibler Divergence. Previous approaches to this problem, such as those of Chow ([1]), and Hoffgen ([7]) have shown that this class is PAC-learnable by reducing it to a combinatorial optimization problem. However, for k > 1, this problem is NP-com…

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-410-417

  46. arXiv:1207.1404  [pdf]

    cs.LG cs.DS stat.ML

    A submodular-supermodular procedure with applications to discriminative structure learning

    Authors: Mukund Narasimhan, Jeff A. Bilmes

    Abstract: In this paper, we present an algorithm for minimizing the difference between two submodular functions using a variational framework which is based on (an extension of) the concave-convex procedure [17]. Because several commonly used metrics in machine learning, like mutual information and conditional mutual information, are submodular, the problem of minimizing the difference of two submodular pro…

    Submitted 4 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)

    Report number: UAI-P-2005-PG-404-412

  47. arXiv:1207.0560  [pdf, ps, other]

    cs.DS cs.LG

    Algorithms for Approximate Minimization of the Difference Between Submodular Functions, with Applications

    Authors: Rishabh Iyer, Jeff Bilmes

    Abstract: We extend the work of Narasimhan and Bilmes [30] for minimizing set functions representable as a difference between submodular functions. Similar to [30], our new algorithms are guaranteed to monotonically reduce the objective function at every step. We empirically and theoretically show that the per-iteration cost of our algorithms is much less than [30], and our algorithms can be used to efficie…

    Submitted 24 August, 2013; v1 submitted 2 July, 2012; originally announced July 2012.

    Comments: 17 pages, 8 figures. A shorter version of this appeared in Proc. Uncertainty in Artificial Intelligence (UAI), Catalina Islands, 2012

    Journal ref: UAI-2012

  48. arXiv:1206.6869  [pdf]

    cs.AI

    Recognizing Activities and Spatial Context Using Wearable Sensors

    Authors: Amarnag Subramanya, Alvin Raj, Jeff A. Bilmes, Dieter Fox

    Abstract: We introduce a new dynamic model with the capability of recognizing both activities that an individual is performing as well as where that individual is located. Our model is novel in that it utilizes a dynamic graphical model to jointly estimate both activity and spatial context over time based on the simultaneous use of asynchronous observations consisting of GPS measurements, and measurements fr…

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)

    Report number: UAI-P-2006-PG-494-502

  49. arXiv:1206.6825  [pdf]

    cs.AI cs.DS

    Non-Minimal Triangulations for Mixed Stochastic/Deterministic Graphical Models

    Authors: Chris Bartels, Jeff A. Bilmes

    Abstract: We observe that certain large-clique graph triangulations can be useful to reduce both computational and space requirements when making queries on mixed stochastic/deterministic graphical models. We demonstrate that many of these large-clique triangulations are non-minimal and are thus unattainable via the variable elimination algorithm. We introduce ancestral pairs as the basis for novel triangul…

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)

    Report number: UAI-P-2006-PG-15-22

  50. arXiv:1206.5265  [pdf]

    cs.LG cs.AI stat.ML

    Consensus ranking under the exponential model

    Authors: Marina Meila, Kapil Phadnis, Arthur Patterson, Jeff A. Bilmes

    Abstract: We analyze the generalized Mallows model, a popular exponential model over rankings. Estimating the central (or consensus) ranking from data is NP-hard. We obtain the following new results: (1) We show that search methods can estimate both the central ranking pi0 and the model parameters theta exactly. The search is n! in the worst case, but is tractable when the true distribution is concentrated…

    Submitted 20 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence (UAI2007)

    Report number: UAI-P-2007-PG-285-294