Skip to main content

Showing 1–50 of 89 results for author: Balcan, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.15911  [pdf, other

    cs.LG

    Learning accurate and interpretable decision trees

    Authors: Maria-Florina Balcan, Dravyansh Sharma

    Abstract: Decision trees are a popular tool in machine learning and yield easy-to-understand models. Several techniques have been proposed in the literature for learning a decision tree classifier, with different techniques working well for data from different domains. In this work, we develop approaches to design decision tree learning algorithms given repeated access to data from the same domain. We propo… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 26 pages, UAI 2024

  2. arXiv:2402.08576  [pdf, other

    cs.GT cs.LG

    Regret Minimization in Stackelberg Games with Side Information

    Authors: Keegan Harris, Zhiwei Steven Wu, Maria-Florina Balcan

    Abstract: Algorithms for playing in Stackelberg games have been deployed in real-world domains including airport security, anti-poaching efforts, and cyber-crime prevention. However, these algorithms often fail to take into consideration the additional information available to each player (e.g. traffic patterns, weather conditions, network congestion), a salient feature of reality which may significantly af… ▽ More

    Submitted 23 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  3. arXiv:2402.00645  [pdf, other

    stat.ML cs.LG

    Spectrally Transformed Kernel Regression

    Authors: Runtian Zhai, Rattana Pukdee, Roger **, Maria-Florina Balcan, Pradeep Ravikumar

    Abstract: Unlabeled data is a key component of modern machine learning. In general, the role of unlabeled data is to impose a form of smoothness, usually from the similarity information encoded in a base kernel, such as the $ε$-neighbor kernel or the adjacency matrix of a graph. This work revisits the classical idea of spectrally transformed kernel regression (STKR), and provides a new class of general and… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: ICLR 2024 spotlight. 36 pages

  4. arXiv:2401.13773  [pdf, other

    math.OC cs.DM cs.DS

    New Sequence-Independent Lifting Techniques for Cutting Planes and When They Induce Facets

    Authors: Siddharth Prasad, Ellen Vitercik, Maria-Florina Balcan, Tuomas Sandholm

    Abstract: Sequence-independent lifting is a procedure for strengthening valid inequalities of an integer program. We generalize the sequence-independent lifting method of Gu, Nemhauser, and Savelsbergh (GNS lifting) for cover inequalities and correct an error in their proposed generalization. We obtain a new sequence-independent lifting technique -- piecewise-constant (PC) lifting -- with a number of intere… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  5. arXiv:2310.02246  [pdf, other

    cs.LG cs.AI math.NA stat.ML

    Learning to Relax: Setting Solver Parameters Across a Sequence of Linear System Instances

    Authors: Mikhail Khodak, Edmond Chow, Maria-Florina Balcan, Ameet Talwalkar

    Abstract: Solving a linear system $Ax=b$ is a fundamental scientific computing primitive for which numerous solvers and preconditioners have been developed. These come with parameters whose optimal values depend on the system being solved and are often impossible or too expensive to identify; thus in practice sub-optimal heuristics are used. We consider the common setting in which many related linear system… ▽ More

    Submitted 2 May, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 Spotlight

  6. arXiv:2307.02295  [pdf, other

    cs.LG cs.AI stat.ML

    Meta-Learning Adversarial Bandit Algorithms

    Authors: Mikhail Khodak, Ilya Osadchiy, Keegan Harris, Maria-Florina Balcan, Kfir Y. Levy, Ron Meir, Zhiwei Steven Wu

    Abstract: We study online meta-learning with bandit feedback, with the goal of improving performance across multiple tasks if they are similar according to some natural similarity measure. As the first to target the adversarial online-within-online partial-information setting, we design meta-algorithms that combine outer learners to simultaneously tune the initialization and other hyperparameters of an inne… ▽ More

    Submitted 1 November, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: Merger of arXiv:2205.14128 and arXiv:2205.15921, with some additional improvements; to appear in NeurIPS 2023

  7. arXiv:2304.03370  [pdf, other

    cs.LG cs.CR

    Reliable learning in challenging environments

    Authors: Maria-Florina Balcan, Steve Hanneke, Rattana Pukdee, Dravyansh Sharma

    Abstract: The problem of designing learners that provide guarantees that their predictions are provably correct is of increasing importance in machine learning. However, learning theoretic guarantees have only been considered in very specific settings. In this work, we consider the design and analysis of reliable learners in challenging test-time environments as encountered in modern machine learning proble… ▽ More

    Submitted 29 October, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Journal ref: NeurIPS 2023

  8. arXiv:2303.14496  [pdf, other

    cs.LG cs.AI stat.ML

    Learning with Explanation Constraints

    Authors: Rattana Pukdee, Dylan Sam, J. Zico Kolter, Maria-Florina Balcan, Pradeep Ravikumar

    Abstract: As larger deep learning models are hard to interpret, there has been a recent focus on generating explanations of these black-box models. In contrast, we may have apriori explanations of how models should behave. In this paper, we formalize this notion as learning from explanation constraints and provide a learning theoretic framework to analyze how such explanations can improve the learning of ou… ▽ More

    Submitted 22 December, 2023; v1 submitted 25 March, 2023; originally announced March 2023.

    Comments: NeurIPS 2023

  9. arXiv:2302.14234  [pdf, other

    cs.GT econ.TH

    Bicriteria Multidimensional Mechanism Design with Side Information

    Authors: Maria-Florina Balcan, Siddharth Prasad, Tuomas Sandholm

    Abstract: We develop a versatile new methodology for multidimensional mechanism design that incorporates side information about agent types with the bicriteria goal of generating high social welfare and high revenue simultaneously. Side information can come from a variety of sources -- examples include advice from a domain expert, predictions from a machine-learning model trained on historical agent data, o… ▽ More

    Submitted 5 June, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  10. arXiv:2302.11700  [pdf, ps, other

    cs.GT cs.LG

    Learning Revenue Maximizing Menus of Lotteries and Two-Part Tariffs

    Authors: Maria-Florina Balcan, Hedyeh Beyhaghi

    Abstract: We advance a recently flourishing line of work at the intersection of learning theory and computational economics by studying the learnability of two classes of mechanisms prominent in economics, namely menus of lotteries and two-part tariffs. The former is a family of randomized mechanisms designed for selling multiple items, known to achieve revenue beyond deterministic mechanisms, while the lat… ▽ More

    Submitted 30 June, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

  11. arXiv:2210.12606  [pdf, other

    cs.LG cs.GT

    Nash Equilibria and Pitfalls of Adversarial Training in Adversarial Robustness Games

    Authors: Maria-Florina Balcan, Rattana Pukdee, Pradeep Ravikumar, Hongyang Zhang

    Abstract: Adversarial training is a standard technique for training adversarially robust models. In this paper, we study adversarial training as an alternating best-response strategy in a 2-player zero-sum game. We prove that even in a simple scenario of a linear classifier and a statistical model that abstracts robust vs. non-robust features, the alternating best response strategy of such game may not conv… ▽ More

    Submitted 27 February, 2023; v1 submitted 22 October, 2022; originally announced October 2022.

    Comments: AISTATS 2023

  12. arXiv:2210.03594  [pdf, other

    cs.LG stat.ML

    Label Propagation with Weak Supervision

    Authors: Rattana Pukdee, Dylan Sam, Maria-Florina Balcan, Pradeep Ravikumar

    Abstract: Semi-supervised learning and weakly supervised learning are important paradigms that aim to reduce the growing demand for labeled data in current machine learning applications. In this paper, we introduce a novel analysis of the classical label propagation algorithm (LPA) (Zhu & Ghahramani, 2002) that moreover takes advantage of useful prior information, specifically probabilistic hypothesized lab… ▽ More

    Submitted 9 April, 2023; v1 submitted 7 October, 2022; originally announced October 2022.

    Comments: ICLR 2023, 26 pages, 2 figures

  13. arXiv:2207.10199  [pdf, other

    cs.LG stat.ML

    Provably tuning the ElasticNet across instances

    Authors: Maria-Florina Balcan, Mikhail Khodak, Dravyansh Sharma, Ameet Talwalkar

    Abstract: An important unresolved challenge in the theory of regularization is to set the regularization coefficients of popular techniques like the ElasticNet with general provable guarantees. We consider the problem of tuning the regularization parameters of Ridge regression, LASSO, and the ElasticNet across multiple problem instances, a setting that encompasses both cross-validation and multi-task hyperp… ▽ More

    Submitted 15 January, 2024; v1 submitted 20 July, 2022; originally announced July 2022.

  14. arXiv:2205.14128  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Meta-Learning Adversarial Bandits

    Authors: Maria-Florina Balcan, Keegan Harris, Mikhail Khodak, Zhiwei Steven Wu

    Abstract: We study online learning with bandit feedback across multiple tasks, with the goal of improving average performance across tasks if they are similar according to some natural task-similarity measure. As the first to target the adversarial setting, we design a unified meta-algorithm that yields setting-specific guarantees for two important cases: multi-armed bandits (MAB) and bandit linear optimiza… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: 19 pages

  15. arXiv:2204.07312  [pdf, other

    math.OC cs.DS cs.LG

    Structural Analysis of Branch-and-Cut and the Learnability of Gomory Mixed Integer Cuts

    Authors: Maria-Florina Balcan, Siddharth Prasad, Tuomas Sandholm, Ellen Vitercik

    Abstract: The incorporation of cutting planes within the branch-and-bound algorithm, known as branch-and-cut, forms the backbone of modern integer programming solvers. These solvers are the foremost method for solving discrete optimization problems and thus have a vast array of applications in machine learning, operations research, and many other fields. Choosing cutting planes effectively is a major resear… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

  16. arXiv:2204.03569  [pdf, other

    cs.DS cs.LG

    Output-sensitive ERM-based techniques for data-driven algorithm design

    Authors: Maria-Florina Balcan, Christopher Seiler, Dravyansh Sharma

    Abstract: Data-driven algorithm design is a promising, learning-based approach for beyond worst-case analysis of algorithms with tunable parameters. An important open problem is the design of computationally efficient data-driven algorithms for combinatorial algorithm families with multiple parameters. As one fixes the problem instance and varies the parameters, the "dual" loss function typically has a piec… ▽ More

    Submitted 29 September, 2023; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: 39 pages, 3 figures

  17. arXiv:2203.04160  [pdf, other

    cs.LG cs.AI cs.CR cs.DS

    Robustly-reliable learners under poisoning attacks

    Authors: Maria-Florina Balcan, Avrim Blum, Steve Hanneke, Dravyansh Sharma

    Abstract: Data poisoning attacks, in which an adversary corrupts a training set with the goal of inducing specific desired mistakes, have raised substantial concern: even just the possibility of such an attack can make a user no longer trust the results of a learning system. In this work, we show how to achieve strong robustness guarantees in the face of such attacks across multiple axes. We provide robus… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

  18. arXiv:2202.09312  [pdf, other

    cs.LG cs.AI cs.DS stat.ML

    Learning Predictions for Algorithms with Predictions

    Authors: Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar, Sergei Vassilvitskii

    Abstract: A burgeoning paradigm in algorithm design is the field of algorithms with predictions, in which algorithms can take advantage of a possibly-imperfect prediction of some aspect of the problem. While much work has focused on using predictions to improve competitive ratios, running times, or other performance measures, less effort has been devoted to the question of how to obtain the predictions them… ▽ More

    Submitted 17 October, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

    Comments: NeurIPS 2022 camera-ready

  19. arXiv:2111.11207  [pdf, other

    cs.LG cs.AI cs.DS math.OC

    Improved Sample Complexity Bounds for Branch-and-Cut

    Authors: Maria-Florina Balcan, Siddharth Prasad, Tuomas Sandholm, Ellen Vitercik

    Abstract: Branch-and-cut is the most widely used algorithm for solving integer programs, employed by commercial solvers like CPLEX and Gurobi. Branch-and-cut has a wide variety of tunable parameters that have a huge impact on the size of the search tree that it builds, but are challenging to tune by hand. An increasingly popular approach is to use machine learning to tune these parameters: using a training… ▽ More

    Submitted 11 May, 2022; v1 submitted 17 November, 2021; originally announced November 2021.

  20. arXiv:2108.08770  [pdf, other

    cs.LG

    Learning-to-learn non-convex piecewise-Lipschitz functions

    Authors: Maria-Florina Balcan, Mikhail Khodak, Dravyansh Sharma, Ameet Talwalkar

    Abstract: We analyze the meta-learning of the initialization and step-size of learning algorithms for piecewise-Lipschitz functions, a non-convex setting with applications to both machine learning and algorithms. Starting from recent regret bounds for the exponential forecaster on losses with dispersed discontinuities, we generalize them to be initialization-dependent and then use this result to propose a p… ▽ More

    Submitted 19 August, 2021; originally announced August 2021.

  21. arXiv:2106.04502  [pdf, other

    cs.LG cs.AI cs.DC stat.ML

    Federated Hyperparameter Tuning: Challenges, Baselines, and Connections to Weight-Sharing

    Authors: Mikhail Khodak, Renbo Tu, Tian Li, Liam Li, Maria-Florina Balcan, Virginia Smith, Ameet Talwalkar

    Abstract: Tuning hyperparameters is a crucial but arduous part of the machine learning pipeline. Hyperparameter optimization is even more challenging in federated learning, where models are learned over a distributed network of heterogeneous devices; here, the need to keep data on device and perform local training makes it difficult to efficiently train and evaluate configurations. In this work, we investig… ▽ More

    Submitted 4 November, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021

  22. arXiv:2106.04033  [pdf, other

    cs.AI cs.LG

    Sample Complexity of Tree Search Configuration: Cutting Planes and Beyond

    Authors: Maria-Florina Balcan, Siddharth Prasad, Tuomas Sandholm, Ellen Vitercik

    Abstract: Cutting-plane methods have enabled remarkable successes in integer programming over the last few decades. State-of-the-art solvers integrate a myriad of cutting-plane techniques to speed up the underlying tree-search algorithm used to find optimal solutions. In this paper we prove the first guarantees for learning high-performing cut-selection policies tailored to the instance distribution at hand… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

  23. arXiv:2103.10547  [pdf, other

    cs.LG cs.AI

    Data driven semi-supervised learning

    Authors: Maria-Florina Balcan, Dravyansh Sharma

    Abstract: We consider a novel data driven approach for designing learning algorithms that can effectively learn with only a small number of labeled examples. This is crucial for modern machine learning applications where labels are scarce or expensive to obtain. We focus on graph-based techniques, where the unlabeled examples are connected in a graph under the implicit assumption that similar nodes likely h… ▽ More

    Submitted 29 September, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

    Comments: 33 pages, 11 figures

  24. arXiv:2012.13315  [pdf, other

    cs.AI

    Generalization in portfolio-based algorithm selection

    Authors: Maria-Florina Balcan, Tuomas Sandholm, Ellen Vitercik

    Abstract: Portfolio-based algorithm selection has seen tremendous practical success over the past two decades. This algorithm configuration procedure works by first selecting a portfolio of diverse algorithm parameter settings, and then, on a given problem instance, using an algorithm selector to choose a parameter setting from the portfolio with strong predicted performance. Oftentimes, both the portfolio… ▽ More

    Submitted 24 December, 2020; originally announced December 2020.

    Comments: AAAI 2021

  25. arXiv:2012.10602  [pdf, other

    cs.LG cs.CR stat.ML

    Scalable and Provably Accurate Algorithms for Differentially Private Distributed Decision Tree Learning

    Authors: Kaiwen Wang, Travis Dick, Maria-Florina Balcan

    Abstract: This paper introduces the first provably accurate algorithms for differentially private, top-down decision tree learning in the distributed setting (Balcan et al., 2012). We propose DP-TopDown, a general privacy preserving decision tree learning algorithm, and present two distributed implementations. Our first method NoisyCounts naturally extends the single machine algorithm by using the Laplace m… ▽ More

    Submitted 22 February, 2021; v1 submitted 19 December, 2020; originally announced December 2020.

    Comments: In AAAI Workshop on Privacy-Preserving Artificial Intelligence, 2020

  26. arXiv:2011.07177  [pdf, other

    cs.DS cs.AI cs.LG

    Data-driven Algorithm Design

    Authors: Maria-Florina Balcan

    Abstract: Data driven algorithm design is an important aspect of modern data science and algorithm design. Rather than using off the shelf algorithms that only have worst case performance guarantees, practitioners often optimize over large families of parametrized algorithms and tune the parameters of these algorithms using a training set of problem instances from their domain to determine a configuration w… ▽ More

    Submitted 13 November, 2020; originally announced November 2020.

    Comments: Chapter 29 of the book Beyond the Worst-Case Analysis of Algorithms, edited by Tim Roughgarden and published by Cambridge University Press (2020)

  27. arXiv:2010.06154  [pdf, other

    cs.LG cs.CR stat.ML

    An Analysis of Robustness of Non-Lipschitz Networks

    Authors: Maria-Florina Balcan, Avrim Blum, Dravyansh Sharma, Hongyang Zhang

    Abstract: Despite significant advances, deep networks remain highly susceptible to adversarial attack. One fundamental challenge is that small input perturbations can often produce large movements in the network's final-layer feature space. In this paper, we define an attack model that abstracts this challenge, to help understand its intrinsic properties. In our model, the adversary may move data an arbitra… ▽ More

    Submitted 18 April, 2023; v1 submitted 12 October, 2020; originally announced October 2020.

    Comments: To appear in Journal of Machine Learning Research (JMLR)

  28. arXiv:2010.05080  [pdf, other

    cs.LG cs.DS stat.ML

    Noise in Classification

    Authors: Maria-Florina Balcan, Nika Haghtalab

    Abstract: This chapter considers the computational and statistical aspects of learning linear thresholds in presence of noise. When there is no noise, several algorithms exist that efficiently learn near-optimal linear thresholds using a small amount of data. However, even a small amount of adversarial noise makes this problem notoriously hard in the worst-case. We discuss approaches for dealing with these… ▽ More

    Submitted 13 November, 2020; v1 submitted 10 October, 2020; originally announced October 2020.

    Comments: Chapter 16 of the book Beyond the Worst-Case Analysis of Algorithms

  29. arXiv:2006.11827  [pdf, other

    cs.AI cs.DS cs.LG

    Refined bounds for algorithm configuration: The knife-edge of dual class approximability

    Authors: Maria-Florina Balcan, Tuomas Sandholm, Ellen Vitercik

    Abstract: Automating algorithm configuration is growing increasingly necessary as algorithms come with more and more tunable parameters. It is common to tune parameters using machine learning, optimizing performance metrics such as runtime and solution quality. The training set consists of problem instances from the specific domain at hand. We investigate a fundamental question about these techniques: how l… ▽ More

    Submitted 24 December, 2020; v1 submitted 21 June, 2020; originally announced June 2020.

  30. arXiv:2004.07802  [pdf, other

    cs.LG cs.CV cs.NE math.OC stat.ML

    Geometry-Aware Gradient Algorithms for Neural Architecture Search

    Authors: Liam Li, Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar

    Abstract: Recent state-of-the-art methods for neural architecture search (NAS) exploit gradient-based optimization by relaxing the problem into continuous optimization over architectures and shared-weights, a noisy process that remains poorly understood. We argue for the study of single-level empirical risk minimization to understand NAS with weight-sharing, reducing the design of NAS methods to devising op… ▽ More

    Submitted 18 March, 2021; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: ICLR 2021 Camera-Ready

  31. arXiv:1908.02894  [pdf, other

    cs.LG stat.ML

    How much data is sufficient to learn high-performing algorithms? Generalization guarantees for data-driven algorithm design

    Authors: Maria-Florina Balcan, Dan DeBlasio, Travis Dick, Carl Kingsford, Tuomas Sandholm, Ellen Vitercik

    Abstract: Algorithms often have tunable parameters that impact performance metrics such as runtime and solution quality. For many algorithms used in practice, no parameter settings admit meaningful worst-case bounds, so the parameters are made available for the user to tune. Alternatively, parameters may be tuned implicitly within the proof of a worst-case approximation ratio or runtime bound. Worst-case in… ▽ More

    Submitted 25 April, 2021; v1 submitted 7 August, 2019; originally announced August 2019.

  32. arXiv:1907.09137  [pdf, other

    cs.LG stat.ML

    Learning piecewise Lipschitz functions in changing environments

    Authors: Maria-Florina Balcan, Travis Dick, Dravyansh Sharma

    Abstract: Optimization in the presence of sharp (non-Lipschitz), unpredictable (w.r.t. time and amount) changes is a challenging and largely unexplored problem of great significance. We consider the class of piecewise Lipschitz functions, which is the most general online setting considered in the literature for the problem, and arises naturally in various combinatorial algorithm selection problems where uti… ▽ More

    Submitted 6 August, 2020; v1 submitted 22 July, 2019; originally announced July 2019.

  33. arXiv:1907.00533  [pdf, other

    cs.LG cs.DS

    Learning to Link

    Authors: Maria-Florina Balcan, Travis Dick, Manuel Lang

    Abstract: Clustering is an important part of many modern data analysis pipelines, including network analysis and data retrieval. There are many different clustering algorithms developed by various communities, and it is often not clear which algorithm will give the best performance on a specific clustering task. Similarly, we often have multiple ways to measure distances between data points, and the best cl… ▽ More

    Submitted 2 October, 2019; v1 submitted 1 July, 2019; originally announced July 2019.

  34. arXiv:1906.02717  [pdf, other

    cs.LG cs.AI stat.ML

    Adaptive Gradient-Based Meta-Learning Methods

    Authors: Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar

    Abstract: We build a theoretical framework for designing and understanding practical meta-learning methods that integrates sophisticated formalizations of task-similarity with the extensive literature on online convex optimization and sequential prediction algorithms. Our approach enables the task-similarity to be learned adaptively, provides sharper transfer-risk bounds in the setting of statistical learni… ▽ More

    Submitted 6 December, 2019; v1 submitted 6 June, 2019; originally announced June 2019.

    Comments: NeurIPS 2019

  35. arXiv:1905.10819  [pdf, other

    cs.LG cs.AI stat.ML

    Learning to Optimize Computational Resources: Frugal Training with Generalization Guarantees

    Authors: Maria-Florina Balcan, Tuomas Sandholm, Ellen Vitercik

    Abstract: Algorithms typically come with tunable parameters that have a considerable impact on the computational resources they consume. Too often, practitioners must hand-tune the parameters, a tedious and error-prone task. A recent line of research provides algorithms that return nearly-optimal parameters from within a finite set. These algorithms can be used when the parameter space is infinite by provid… ▽ More

    Submitted 20 November, 2020; v1 submitted 26 May, 2019; originally announced May 2019.

  36. arXiv:1904.09014  [pdf, other

    cs.LG stat.ML

    Semi-bandit Optimization in the Dispersed Setting

    Authors: Maria-Florina Balcan, Travis Dick, Wesley Pegden

    Abstract: The goal of data-driven algorithm design is to obtain high-performing algorithms for specific application domains using machine learning and data. Across many fields in AI, science, and engineering, practitioners will often fix a family of parameterized algorithms and then optimize those parameters to obtain good performance on example instances from the application domain. In the online setting,… ▽ More

    Submitted 21 December, 2020; v1 submitted 18 April, 2019; originally announced April 2019.

  37. arXiv:1902.10644  [pdf, other

    cs.LG cs.AI stat.ML

    Provable Guarantees for Gradient-Based Meta-Learning

    Authors: Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar

    Abstract: We study the problem of meta-learning through the lens of online convex optimization, develo** a meta-algorithm bridging the gap between popular gradient-based meta-learning and classical regularization-based multi-task transfer methods. Our method is the first to simultaneously satisfy good sample efficiency guarantees in the convex setting, with generalization bounds that improve with task-sim… ▽ More

    Submitted 16 May, 2019; v1 submitted 27 February, 2019; originally announced February 2019.

    Comments: ICML 2019

  38. arXiv:1902.09413  [pdf, other

    cs.GT

    Estimating Approximate Incentive Compatibility

    Authors: Maria-Florina Balcan, Tuomas Sandholm, Ellen Vitercik

    Abstract: In practice, most mechanisms for selling, buying, matching, voting, and so on are not incentive compatible. We present techniques for estimating how far a mechanism is from incentive compatible. Given samples from the agents' type distribution, we show how to estimate the extent to which an agent can improve his utility by misreporting his type. We do so by first measuring the maximum utility an a… ▽ More

    Submitted 11 December, 2023; v1 submitted 25 February, 2019; originally announced February 2019.

  39. arXiv:1810.08171  [pdf, other

    cs.DS cs.LG stat.ML

    Testing Matrix Rank, Optimally

    Authors: Maria-Florina Balcan, Yi Li, David P. Woodruff, Hongyang Zhang

    Abstract: We show that for the problem of testing if a matrix $A \in F^{n \times n}$ has rank at most $d$, or requires changing an $ε$-fraction of entries to have rank at most $d$, there is a non-adaptive query algorithm making $\widetilde{O}(d^2/ε)$ queries. Our algorithm works for any field $F$. This improves upon the previous $O(d^2/ε^2)$ bound (SODA'03), and bypasses an $Ω(d^2/ε^2)$ lower bound of (KDD'… ▽ More

    Submitted 18 October, 2018; originally announced October 2018.

    Comments: 51 pages. To appear in SODA 2019

  40. arXiv:1809.08700  [pdf, other

    cs.LG cs.GT stat.ML

    Envy-Free Classification

    Authors: Maria-Florina Balcan, Travis Dick, Ritesh Noothigattu, Ariel D. Procaccia

    Abstract: In classic fair division problems such as cake cutting and rent division, envy-freeness requires that each individual (weakly) prefer his allocation to anyone else's. On a conceptual level, we argue that envy-freeness also provides a compelling notion of fairness for classification tasks. Our technical focus is the generalizability of envy-free classification, i.e., understanding whether a classif… ▽ More

    Submitted 24 September, 2020; v1 submitted 23 September, 2018; originally announced September 2018.

    Journal ref: Advances in Neural Information Processing Systems, 2019, pp. 1240-1250

  41. arXiv:1809.06987  [pdf, other

    cs.DS cs.AI cs.LG

    Data-Driven Clustering via Parameterized Lloyd's Families

    Authors: Maria-Florina Balcan, Travis Dick, Colin White

    Abstract: Algorithms for clustering points in metric spaces is a long-studied area of research. Clustering has seen a multitude of work both theoretically, in understanding the approximation guarantees possible for many objective functions such as k-median and k-means clustering, and experimentally, in finding the fastest algorithms and seeding procedures for Lloyd's algorithm. The performance of a given cl… ▽ More

    Submitted 24 May, 2019; v1 submitted 18 September, 2018; originally announced September 2018.

  42. arXiv:1803.10150  [pdf, other

    cs.AI cs.DS

    Learning to Branch

    Authors: Maria-Florina Balcan, Travis Dick, Tuomas Sandholm, Ellen Vitercik

    Abstract: Tree search algorithms, such as branch-and-bound, are the most widely used tools for solving combinatorial and nonconvex problems. For example, they are the foremost method for solving (mixed) integer programs and constraint satisfaction problems. Tree search algorithms recursively partition the search space to find an optimal solution. In order to keep the tree size small, it is crucial to carefu… ▽ More

    Submitted 16 May, 2018; v1 submitted 27 March, 2018; originally announced March 2018.

  43. arXiv:1711.03091  [pdf, other

    cs.LG

    Dispersion for Data-Driven Algorithm Design, Online Learning, and Private Optimization

    Authors: Maria-Florina Balcan, Travis Dick, Ellen Vitercik

    Abstract: Data-driven algorithm design, that is, choosing the best algorithm for a specific application, is a crucial problem in modern data science. Practitioners often optimize over a parameterized algorithm family, tuning parameters based on problems from their domain. These procedures have historically come with no guarantees, though a recent line of work studies algorithm selection from a theoretical p… ▽ More

    Submitted 22 October, 2018; v1 submitted 8 November, 2017; originally announced November 2017.

  44. arXiv:1706.10271  [pdf, other

    cs.LG

    Lifelong Learning in Costly Feature Spaces

    Authors: Maria-Florina Balcan, Avrim Blum, Vaishnavh Nagarajan

    Abstract: An important long-term goal in machine learning systems is to build learning agents that, like humans, can learn many tasks over their lifetime, and moreover use information from these tasks to improve their ability to do so efficiently. In this work, our goal is to provide new theoretical insights into the potential of this paradigm. In particular, we propose a lifelong learning framework that ad… ▽ More

    Submitted 30 June, 2017; originally announced June 2017.

  45. arXiv:1705.07157  [pdf, other

    cs.DS cs.LG

    Clustering under Local Stability: Bridging the Gap between Worst-Case and Beyond Worst-Case Analysis

    Authors: Maria-Florina Balcan, Colin White

    Abstract: Recently, there has been substantial interest in clustering research that takes a beyond worst-case approach to the analysis of algorithms. The typical idea is to design a clustering algorithm that outputs a near-optimal solution, provided the data satisfy a natural stability notion. For example, Bilu and Linial (2010) and Awasthi et al. (2012) presented algorithms that output near-optimal solutio… ▽ More

    Submitted 19 May, 2017; originally announced May 2017.

    Comments: arXiv admin note: substantial text overlap with arXiv:1505.03924

  46. arXiv:1705.00243  [pdf, other

    cs.LG cs.GT

    Generalization Guarantees for Multi-item Profit Maximization: Pricing, Auctions, and Randomized Mechanisms

    Authors: Maria-Florina Balcan, Tuomas Sandholm, Ellen Vitercik

    Abstract: We study multi-item profit maximization when there is an underlying distribution over buyers' values. In practice, a full description of the distribution is typically unavailable, so we study the setting where the mechanism designer only has samples from the distribution. If the designer uses the samples to optimize over a complex mechanism class -- such as the set of all multi-item, multi-buyer m… ▽ More

    Submitted 6 May, 2023; v1 submitted 29 April, 2017; originally announced May 2017.

  47. arXiv:1704.08683  [pdf, other

    cs.DS cs.LG stat.ML

    Matrix Completion and Related Problems via Strong Duality

    Authors: Maria-Florina Balcan, Yingyu Liang, David P. Woodruff, Hongyang Zhang

    Abstract: This work studies the strong duality of non-convex matrix factorization problems: we show that under certain dual conditions, these problems and its dual have the same optimum. This has been well understood for convex optimization, but little was known for non-convex problems. We propose a novel analytical framework and show that under certain dual conditions, the optimal solution of the matrix fa… ▽ More

    Submitted 25 April, 2018; v1 submitted 27 April, 2017; originally announced April 2017.

    Comments: 37 pages, 4 figures

  48. arXiv:1703.07758  [pdf, other

    stat.ML cs.AI cs.LG

    Sample and Computationally Efficient Learning Algorithms under S-Concave Distributions

    Authors: Maria-Florina Balcan, Hongyang Zhang

    Abstract: We provide new results for noise-tolerant and sample-efficient learning algorithms under $s$-concave distributions. The new class of $s$-concave distributions is a broad and natural generalization of log-concavity, and includes many important additional distributions, e.g., the Pareto distribution and $t$-distribution. This class has been studied in the context of efficient sampling, integration,… ▽ More

    Submitted 27 January, 2018; v1 submitted 22 March, 2017; originally announced March 2017.

    Comments: Appear in NIPS 2017

  49. arXiv:1703.00830  [pdf, ps, other

    cs.DS cs.DC cs.LG

    Robust Communication-Optimal Distributed Clustering Algorithms

    Authors: Pranjal Awasthi, Ainesh Bakshi, Maria-Florina Balcan, Colin White, David Woodruff

    Abstract: In this work, we study the $k$-median and $k$-means clustering problems when the data is distributed across many servers and can contain outliers. While there has been a lot of work on these problems for worst-case instances, we focus on gaining a finer understanding through the lens of beyond worst-case analysis. Our main motivation is the following: for many applications such as clustering prote… ▽ More

    Submitted 6 March, 2019; v1 submitted 2 March, 2017; originally announced March 2017.

  50. arXiv:1612.02712  [pdf, other

    cs.SI cs.DS cs.LG stat.ML

    Scalable Influence Maximization for Multiple Products in Continuous-Time Diffusion Networks

    Authors: Nan Du, Yingyu Liang, Maria-Florina Balcan, Manuel Gomez-Rodriguez, Hongyuan Zha, Le Song

    Abstract: A typical viral marketing model identifies influential users in a social network to maximize a single product adoption assuming unlimited user attention, campaign budgets, and time. In reality, multiple products need campaigns, users have limited attention, convincing users incurs costs, and advertisers have limited budgets and expect the adoptions to be maximized soon. Facing these user, monetary… ▽ More

    Submitted 29 January, 2017; v1 submitted 8 December, 2016; originally announced December 2016.

    Comments: 45 pages, to appear in Journal of Machine Learning Research. arXiv admin note: substantial text overlap with arXiv:1312.2164, arXiv:1311.3669