Showing 1–27 of 27 results for author: Vitercik, E

  1. arXiv:2406.05959  [pdf, other]

    cs.LG cs.DS

    MAGNOLIA: Matching Algorithms via GNNs for Online Value-to-go Approximation

    Authors: Alexandre Hayderi, Amin Saberi, Ellen Vitercik, Anders Wikum

    Abstract: Online Bayesian bipartite matching is a central problem in digital marketplaces and exchanges, including advertising, crowdsourcing, ridesharing, and kidney exchange. We introduce a graph neural network (GNN) approach that emulates the problem's combinatorially-complex optimal online algorithm, which selects actions (e.g., which nodes to match) by computing each action's value-to-go (VTG) -- the e…

    Submitted 18 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024; Hardness result from introduction made precise

  2. arXiv:2403.01361  [pdf, other]

    cs.LG cs.GT econ.GN q-fin.GN

    Bandit Profit-maximization for Targeted Marketing

    Authors: Joon Suk Huh, Ellen Vitercik, Kirthevasan Kandasamy

    Abstract: We study a sequential profit-maximization problem, optimizing for both price and ancillary variables like marketing expenditures. Specifically, we aim to maximize profit over an arbitrary sequence of multiple demand curves, each dependent on a distinct ancillary variable, but sharing the same price. A prototypical example is targeted marketing, where a firm (seller) wishes to sell a product over m…

    Submitted 2 March, 2024; originally announced March 2024.

  3. arXiv:2402.14332  [pdf, other]

    cs.LG stat.ML

    From Large to Small Datasets: Size Generalization for Clustering Algorithm Selection

    Authors: Vaggos Chatziafratis, Ishani Karmarkar, Ellen Vitercik

    Abstract: In clustering algorithm selection, we are given a massive dataset and must efficiently select which clustering algorithm to use. We study this problem in a semi-supervised setting, with an unknown ground-truth clustering that we can only access through expensive oracle queries. Ideally, the clustering algorithm's output will be structurally close to the ground truth. We approach this problem by in…

    Submitted 25 February, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  4. arXiv:2401.13773  [pdf, other]

    math.OC cs.DM cs.DS

    New Sequence-Independent Lifting Techniques for Cutting Planes and When They Induce Facets

    Authors: Siddharth Prasad, Ellen Vitercik, Maria-Florina Balcan, Tuomas Sandholm

    Abstract: Sequence-independent lifting is a procedure for strengthening valid inequalities of an integer program. We generalize the sequence-independent lifting method of Gu, Nemhauser, and Savelsbergh (GNS lifting) for cover inequalities and correct an error in their proposed generalization. We obtain a new sequence-independent lifting technique -- piecewise-constant (PC) lifting -- with a number of intere…

    Submitted 24 January, 2024; originally announced January 2024.

  5. arXiv:2310.14113  [pdf, other]

    cs.DS

    Sorting from Crowdsourced Comparisons using Expert Verifications

    Authors: Ellen Vitercik, Manolis Zampetakis, David Zhang

    Abstract: We introduce a novel noisy sorting model motivated by the Just Noticeable Difference (JND) model from experimental psychology. The goal of our model is to capture the low quality of the data that are collected from crowdsourcing environments. Compared to other celebrated models of noisy sorting, our model does not rely on precise data-generation assumptions and captures crowdsourced tasks' varying…

    Submitted 21 October, 2023; originally announced October 2023.

  6. arXiv:2310.09974  [pdf, other]

    cs.GT

    Algorithmic Contract Design for Crowdsourced Ranking

    Authors: Kiriaki Frangias, Andrew Lin, Ellen Vitercik, Manolis Zampetakis

    Abstract: Ranking is fundamental to many areas, such as search engine optimization, human feedback for language models, as well as peer grading. Crowdsourcing, which is often used for these tasks, requires proper incentivization to ensure accurate inputs. In this work, we draw on the field of contract theory from Economics to propose a novel mechanism that enables a principal to accurately ran…

    Submitted 24 January, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

  7. arXiv:2305.14537  [pdf, other]

    cs.CY cs.AI cs.LG

    Disincentivizing Polarization in Social Networks

    Authors: Christian Borgs, Jennifer Chayes, Christian Ikeokwu, Ellen Vitercik

    Abstract: On social networks, algorithmic personalization drives users into filter bubbles where they rarely see content that deviates from their interests. We present a model for content curation and personalization that avoids filter bubbles, along with algorithmic guarantees and nearly matching lower bounds. In our model, the platform interacts with $n$ users over $T$ timesteps, choosing content for each…

    Submitted 23 May, 2023; originally announced May 2023.

  8. arXiv:2302.09700  [pdf, other]

    cs.GT cs.LG

    Leveraging Reviews: Learning to Price with Buyer and Seller Uncertainty

    Authors: Wenshuo Guo, Nika Haghtalab, Kirthevasan Kandasamy, Ellen Vitercik

    Abstract: In online marketplaces, customers have access to hundreds of reviews for a single product. Buyers often use reviews from other customers that share their type -- such as height for clothing, skin type for skincare products, and location for outdoor furniture -- to estimate their values, which they may not know a priori. Customers with few relevant reviews may hesitate to make a purchase except at…

    Submitted 11 September, 2023; v1 submitted 19 February, 2023; originally announced February 2023.

  9. arXiv:2204.07312  [pdf, other]

    math.OC cs.DS cs.LG

    Structural Analysis of Branch-and-Cut and the Learnability of Gomory Mixed Integer Cuts

    Authors: Maria-Florina Balcan, Siddharth Prasad, Tuomas Sandholm, Ellen Vitercik

    Abstract: The incorporation of cutting planes within the branch-and-bound algorithm, known as branch-and-cut, forms the backbone of modern integer programming solvers. These solvers are the foremost method for solving discrete optimization problems and thus have a vast array of applications in machine learning, operations research, and many other fields. Choosing cutting planes effectively is a major resear…

    Submitted 14 April, 2022; originally announced April 2022.

  10. arXiv:2202.10606  [pdf, ps, other]

    cs.LG cs.GT

    No-Regret Learning in Partially-Informed Auctions

    Authors: Wenshuo Guo, Michael I. Jordan, Ellen Vitercik

    Abstract: Auctions with partially-revealed information about items are broadly employed in real-world applications, but the underlying mechanisms have limited theoretical support. In this work, we study a machine learning formulation of these types of mechanisms, presenting algorithms that are no-regret from the buyer's perspective. Specifically, a buyer who wishes to maximize his utility interacts repeated…

    Submitted 4 July, 2022; v1 submitted 21 February, 2022; originally announced February 2022.

    Comments: 30 pages

  11. arXiv:2111.11207  [pdf, other]

    cs.LG cs.AI cs.DS math.OC

    Improved Sample Complexity Bounds for Branch-and-Cut

    Authors: Maria-Florina Balcan, Siddharth Prasad, Tuomas Sandholm, Ellen Vitercik

    Abstract: Branch-and-cut is the most widely used algorithm for solving integer programs, employed by commercial solvers like CPLEX and Gurobi. Branch-and-cut has a wide variety of tunable parameters that have a huge impact on the size of the search tree that it builds, but are challenging to tune by hand. An increasingly popular approach is to use machine learning to tune these parameters: using a training…

    Submitted 11 May, 2022; v1 submitted 17 November, 2021; originally announced November 2021.

  12. arXiv:2106.04033  [pdf, other]

    cs.AI cs.LG

    Sample Complexity of Tree Search Configuration: Cutting Planes and Beyond

    Authors: Maria-Florina Balcan, Siddharth Prasad, Tuomas Sandholm, Ellen Vitercik

    Abstract: Cutting-plane methods have enabled remarkable successes in integer programming over the last few decades. State-of-the-art solvers integrate a myriad of cutting-plane techniques to speed up the underlying tree-search algorithm used to find optimal solutions. In this paper we prove the first guarantees for learning high-performing cut-selection policies tailored to the instance distribution at hand…

    Submitted 7 June, 2021; originally announced June 2021.

  13. arXiv:2012.13315  [pdf, other]

    cs.AI

    Generalization in portfolio-based algorithm selection

    Authors: Maria-Florina Balcan, Tuomas Sandholm, Ellen Vitercik

    Abstract: Portfolio-based algorithm selection has seen tremendous practical success over the past two decades. This algorithm configuration procedure works by first selecting a portfolio of diverse algorithm parameter settings, and then, on a given problem instance, using an algorithm selector to choose a parameter setting from the portfolio with strong predicted performance. Oftentimes, both the portfolio…

    Submitted 24 December, 2020; originally announced December 2020.

    Comments: AAAI 2021

  14. arXiv:2007.01181  [pdf, other]

    cs.LG cs.CR stat.ML

    Private Optimization Without Constraint Violations

    Authors: Andrés Muñoz Medina, Umar Syed, Sergei Vassilvitskii, Ellen Vitercik

    Abstract: We study the problem of differentially private optimization with linear constraints when the right-hand-side of the constraints depends on private data. This type of problem appears in many applications, especially resource allocation. Previous research provided solutions that retained privacy but sometimes violated the constraints. In many settings, however, the constraints cannot be violated und…

    Submitted 3 November, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

  15. arXiv:2006.11827  [pdf, other]

    cs.AI cs.DS cs.LG

    Refined bounds for algorithm configuration: The knife-edge of dual class approximability

    Authors: Maria-Florina Balcan, Tuomas Sandholm, Ellen Vitercik

    Abstract: Automating algorithm configuration is growing increasingly necessary as algorithms come with more and more tunable parameters. It is common to tune parameters using machine learning, optimizing performance metrics such as runtime and solution quality. The training set consists of problem instances from the specific domain at hand. We investigate a fundamental question about these techniques: how l…

    Submitted 24 December, 2020; v1 submitted 21 June, 2020; originally announced June 2020.

  16. arXiv:1908.02894  [pdf, other]

    cs.LG stat.ML

    How much data is sufficient to learn high-performing algorithms? Generalization guarantees for data-driven algorithm design

    Authors: Maria-Florina Balcan, Dan DeBlasio, Travis Dick, Carl Kingsford, Tuomas Sandholm, Ellen Vitercik

    Abstract: Algorithms often have tunable parameters that impact performance metrics such as runtime and solution quality. For many algorithms used in practice, no parameter settings admit meaningful worst-case bounds, so the parameters are made available for the user to tune. Alternatively, parameters may be tuned implicitly within the proof of a worst-case approximation ratio or runtime bound. Worst-case in…

    Submitted 25 April, 2021; v1 submitted 7 August, 2019; originally announced August 2019.

  17. arXiv:1905.10819  [pdf, other]

    cs.LG cs.AI stat.ML

    Learning to Optimize Computational Resources: Frugal Training with Generalization Guarantees

    Authors: Maria-Florina Balcan, Tuomas Sandholm, Ellen Vitercik

    Abstract: Algorithms typically come with tunable parameters that have a considerable impact on the computational resources they consume. Too often, practitioners must hand-tune the parameters, a tedious and error-prone task. A recent line of research provides algorithms that return nearly-optimal parameters from within a finite set. These algorithms can be used when the parameter space is infinite by provid…

    Submitted 20 November, 2020; v1 submitted 26 May, 2019; originally announced May 2019.

  18. arXiv:1904.11875  [pdf, other]

    cs.LG stat.ML

    Learning to Prune: Speeding up Repeated Computations

    Authors: Daniel Alabi, Adam Tauman Kalai, Katrina Ligett, Cameron Musco, Christos Tzamos, Ellen Vitercik

    Abstract: It is common to encounter situations where one must solve a sequence of similar computational problems. Running a standard algorithm with worst-case runtime guarantees on each instance will fail to take advantage of valuable structure shared across the problem instances. For example, when a commuter drives from work to home, there are typically only a handful of routes that will ever be the shorte…

    Submitted 26 April, 2019; originally announced April 2019.

  19. arXiv:1902.09413  [pdf, other]

    cs.GT

    Estimating Approximate Incentive Compatibility

    Authors: Maria-Florina Balcan, Tuomas Sandholm, Ellen Vitercik

    Abstract: In practice, most mechanisms for selling, buying, matching, voting, and so on are not incentive compatible. We present techniques for estimating how far a mechanism is from incentive compatible. Given samples from the agents' type distribution, we show how to estimate the extent to which an agent can improve his utility by misreporting his type. We do so by first measuring the maximum utility an a…

    Submitted 11 December, 2023; v1 submitted 25 February, 2019; originally announced February 2019.

  20. arXiv:1803.10150  [pdf, other]

    cs.AI cs.DS

    Learning to Branch

    Authors: Maria-Florina Balcan, Travis Dick, Tuomas Sandholm, Ellen Vitercik

    Abstract: Tree search algorithms, such as branch-and-bound, are the most widely used tools for solving combinatorial and nonconvex problems. For example, they are the foremost method for solving (mixed) integer programs and constraint satisfaction problems. Tree search algorithms recursively partition the search space to find an optimal solution. In order to keep the tree size small, it is crucial to carefu…

    Submitted 16 May, 2018; v1 submitted 27 March, 2018; originally announced March 2018.

  21. arXiv:1711.03091  [pdf, other]

    cs.LG

    Dispersion for Data-Driven Algorithm Design, Online Learning, and Private Optimization

    Authors: Maria-Florina Balcan, Travis Dick, Ellen Vitercik

    Abstract: Data-driven algorithm design, that is, choosing the best algorithm for a specific application, is a crucial problem in modern data science. Practitioners often optimize over a parameterized algorithm family, tuning parameters based on problems from their domain. These procedures have historically come with no guarantees, though a recent line of work studies algorithm selection from a theoretical p…

    Submitted 22 October, 2018; v1 submitted 8 November, 2017; originally announced November 2017.

  22. arXiv:1707.04233  [pdf, other]

    cs.IT cs.DS

    Synchronization Strings: Channel Simulations and Interactive Coding for Insertions and Deletions

    Authors: Bernhard Haeupler, Amirbehshad Shahrasbi, Ellen Vitercik

    Abstract: We present many new results related to reliable (interactive) communication over insertion-deletion channels. Synchronization errors, such as insertions and deletions, strictly generalize the usual symbol corruption errors and are much harder to protect against. We show how to hide the complications of synchronization errors in many applications by introducing very general channel simulations wh…

    Submitted 20 March, 2018; v1 submitted 13 July, 2017; originally announced July 2017.

  23. arXiv:1705.00243  [pdf, other]

    cs.LG cs.GT

    Generalization Guarantees for Multi-item Profit Maximization: Pricing, Auctions, and Randomized Mechanisms

    Authors: Maria-Florina Balcan, Tuomas Sandholm, Ellen Vitercik

    Abstract: We study multi-item profit maximization when there is an underlying distribution over buyers' values. In practice, a full description of the distribution is typically unavailable, so we study the setting where the mechanism designer only has samples from the distribution. If the designer uses the samples to optimize over a complex mechanism class -- such as the set of all multi-item, multi-buyer m…

    Submitted 6 May, 2023; v1 submitted 29 April, 2017; originally announced May 2017.

  24. arXiv:1611.04535  [pdf, other]

    cs.DS cs.AI cs.LG

    Learning-Theoretic Foundations of Algorithm Configuration for Combinatorial Partitioning Problems

    Authors: Maria-Florina Balcan, Vaishnavh Nagarajan, Ellen Vitercik, Colin White

    Abstract: Max-cut, clustering, and many other partitioning problems that are of significant importance to machine learning and other scientific fields are NP-hard, a reality that has motivated researchers to develop a wealth of approximation algorithms and heuristics. Although the best algorithm to use typically depends on the specific application domain, a worst-case analysis is often used to compare algor…

    Submitted 16 October, 2018; v1 submitted 14 November, 2016; originally announced November 2016.

  25. arXiv:1606.04145  [pdf, other]

    cs.LG cs.GT

    Sample Complexity of Automated Mechanism Design

    Authors: Maria-Florina Balcan, Tuomas Sandholm, Ellen Vitercik

    Abstract: The design of revenue-maximizing combinatorial auctions, i.e. multi-item auctions over bundles of goods, is one of the most fundamental problems in computational economics, unsolved even for two bidders and two items for sale. In the traditional economic models, it is assumed that the bidders' valuations are drawn from an underlying distribution and that the auction designer has perfect knowledge…

    Submitted 13 June, 2016; originally announced June 2016.

  26. arXiv:1605.09227  [pdf, other]

    cs.LG cs.DS

    Learning Combinatorial Functions from Pairwise Comparisons

    Authors: Maria-Florina Balcan, Ellen Vitercik, Colin White

    Abstract: A large body of work in machine learning has focused on the problem of learning a close approximation to an underlying combinatorial function, given a small set of labeled examples. However, for real-valued functions, cardinal labels might not be accessible, or it may be difficult for an expert to consistently assign real-valued labels over the entire set of examples. For instance, it is notorious…

    Submitted 30 May, 2016; originally announced May 2016.

    Comments: 1 figure

  27. arXiv:1503.06381  [pdf, ps, other]

    cs.DS cs.IT

    Balancing Communication for Multi-party Interactive Coding

    Authors: Allison Lewko, Ellen Vitercik

    Abstract: We consider interactive coding in a setting where $n$ parties wish to compute a joint function of their inputs via an interactive protocol over imperfect channels. We assume that adversarial errors can comprise an $\mathcal{O}(\frac{1}{n})$ fraction of the total communication, occurring anywhere on the communication network. Our goal is to maintain a constant multiplicative overhead in the total co…

    Submitted 21 March, 2015; originally announced March 2015.