Showing 1–50 of 53 results for author: Leyton-Brown, K

  1. arXiv:2406.10937  [pdf, ps, other]

    cs.AI cs.CL cs.LG

    Understanding Understanding: A Pragmatic Framework Motivated by Large Language Models

    Authors: Kevin Leyton-Brown, Yoav Shoham

    Abstract: Motivated by the rapid ascent of Large Language Models (LLMs) and debates about the extent to which they possess human-level qualities, we propose a framework for testing whether any agent (be it a machine or a human) understands a subject matter. In Turing-test fashion, the framework is based solely on the agent's performance, and specifically on how well it answers questions. Elements of the fra…

    Submitted 19 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2405.18246  [pdf, other]

    cs.AI

    Utilitarian Algorithm Configuration for Infinite Parameter Spaces

    Authors: Devon Graham, Kevin Leyton-Brown

    Abstract: Utilitarian algorithm configuration is a general-purpose technique for automatically searching the parameter space of a given algorithm to optimize its performance, as measured by a given utility function, on a given set of inputs. Recently introduced utilitarian configuration procedures offer optimality guarantees about the returned parameterization while provably adapting to the hardness of the…

    Submitted 28 May, 2024; originally announced May 2024.

  3. arXiv:2405.06563  [pdf, other]

    cs.CL

    What Can Natural Language Processing Do for Peer Review?

    Authors: Ilia Kuznetsov, Osama Mohammed Afzal, Koen Dercksen, Nils Dycke, Alexander Goldberg, Tom Hope, Dirk Hovy, Jonathan K. Kummerfeld, Anne Lauscher, Kevin Leyton-Brown, Sheng Lu, Mausam, Margot Mieskes, Aurélie Névéol, Danish Pruthi, Lizhen Qu, Roy Schwartz, Noah A. Smith, Thamar Solorio, Jingyan Wang, Xiaodan Zhu, Anna Rogers, Nihar B. Shah, Iryna Gurevych

    Abstract: The number of scientific articles produced every year is growing rapidly. Providing quality control over them is crucial for scientists and, ultimately, for the public good. In modern science, this process is largely delegated to peer review -- a distributed procedure in which each submission is evaluated by several independent experts in the field. Peer review is widely used, yet it is hard, time…

    Submitted 10 May, 2024; originally announced May 2024.

  4. arXiv:2402.19420  [pdf, other]

    cs.GT cs.AI cs.MA

    Understanding Iterative Combinatorial Auction Designs via Multi-Agent Reinforcement Learning

    Authors: Greg d'Eon, Neil Newman, Kevin Leyton-Brown

    Abstract: Iterative combinatorial auctions are widely used in high stakes settings such as spectrum auctions. Such auctions can be hard to understand analytically, making it difficult for bidders to determine how to behave and for designers to optimize auction rules to ensure desirable outcomes such as high revenue or welfare. In this paper, we investigate whether multi-agent reinforcement learning (MARL) a…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 18 pages (body) + 10 pages (acknowledgements, references, appendices)

  5. arXiv:2402.09552  [pdf, other]

    cs.CL econ.GN

    STEER: Assessing the Economic Rationality of Large Language Models

    Authors: Narun Raman, Taylor Lundy, Samuel Amouyal, Yoav Levine, Kevin Leyton-Brown, Moshe Tennenholtz

    Abstract: There is increasing interest in using LLMs as decision-making "agents." Doing so includes many degrees of freedom: which model should be used; how should it be prompted; should it be asked to introspect, conduct chain-of-thought reasoning, etc? Settling these questions -- and more broadly, determining whether an LLM agent is reliable enough to be trusted -- requires a methodology for assessing suc…

    Submitted 28 May, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  6. arXiv:2312.10205  [pdf, other]

    cs.GT

    Pay to (Not) Play: Monetizing Impatience in Mobile Games

    Authors: Taylor Lundy, Narun Raman, Hu Fu, Kevin Leyton-Brown

    Abstract: Mobile gaming is a rapidly growing and incredibly profitable sector; having grown seven-fold over the past 10 years, it now grosses over $100 billion annually. This growth was due in large part to a shift in monetization strategies: rather than charging players an upfront cost ("pay-to-play"), games often request optional microtransactions throughout gameplay ("free-to-play"). We focus on a common…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 18 pages

  7. arXiv:2310.20401  [pdf, other]

    cs.AI

    Utilitarian Algorithm Configuration

    Authors: Devon R. Graham, Kevin Leyton-Brown, Tim Roughgarden

    Abstract: We present the first nontrivial procedure for configuring heuristic algorithms to maximize the utility provided to their end users while also offering theoretical guarantees about performance. Existing procedures seek configurations that minimize expected runtime. However, very recent theoretical work argues that expected runtime minimization fails to capture algorithm designers' preferences. Here…

    Submitted 31 October, 2023; originally announced October 2023.

  8. arXiv:2307.06908  [pdf, other]

    cs.CL cs.AI

    Generating Benchmarks for Factuality Evaluation of Language Models

    Authors: Dor Muhlgay, Ori Ram, Inbal Magar, Yoav Levine, Nir Ratner, Yonatan Belinkov, Omri Abend, Kevin Leyton-Brown, Amnon Shashua, Yoav Shoham

    Abstract: Before deploying a language model (LM) within a given domain, it is important to measure its tendency to generate factually incorrect information in that domain. Existing methods for factuality evaluation of LLM generation focus on facts sampled from the LM itself, and thus do not control the set of evaluated facts and might under-represent domain specific or rare facts. We propose FACTOR: Factual…

    Submitted 4 February, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

  9. arXiv:2306.04778  [pdf, other]

    cs.LG cs.GT

    How to Evaluate Behavioral Models

    Authors: Greg d'Eon, Sophie Greenwood, Kevin Leyton-Brown, James R. Wright

    Abstract: Researchers building behavioral models, such as behavioral game theorists, use experimental data to evaluate predictive models of human behavior. However, there is little agreement about which loss function should be used in evaluations, with error rate, negative log-likelihood, cross-entropy, Brier score, and squared L2 error all being common choices. We attempt to offer a principled answer to th…

    Submitted 22 February, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 15 pages (7 pages body + references and appendix). To appear at AAAI 2024

  10. arXiv:2302.00083  [pdf, other]

    cs.CL cs.IR

    In-Context Retrieval-Augmented Language Models

    Authors: Ori Ram, Yoav Levine, Itay Dalmedigos, Dor Muhlgay, Amnon Shashua, Kevin Leyton-Brown, Yoav Shoham

    Abstract: Retrieval-Augmented Language Modeling (RALM) methods, which condition a language model (LM) on relevant documents from a grounding corpus during generation, were shown to significantly improve language modeling performance. In addition, they can mitigate the problem of factually inaccurate text generation and provide natural source attribution mechanism. Existing RALM approaches focus on modifying…

    Submitted 1 August, 2023; v1 submitted 31 January, 2023; originally announced February 2023.

    Comments: Accepted for publication in Transactions of the Association for Computational Linguistics (TACL). pre-MIT Press publication version

  11. arXiv:2212.10947  [pdf, other]

    cs.CL

    Parallel Context Windows for Large Language Models

    Authors: Nir Ratner, Yoav Levine, Yonatan Belinkov, Ori Ram, Inbal Magar, Omri Abend, Ehud Karpas, Amnon Shashua, Kevin Leyton-Brown, Yoav Shoham

    Abstract: When applied to processing long text, Large Language Models (LLMs) are limited by their context window. Existing efforts to address this limitation involve training specialized architectures, and cannot be easily applied to off-the-shelf LLMs. We present Parallel Context Windows (PCW), a method that alleviates the context window restriction for any off-the-shelf LLM without further training. The k…

    Submitted 1 August, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

    Comments: The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023)

  12. arXiv:2211.12581  [pdf, other]

    cs.AI cs.LG

    UNSAT Solver Synthesis via Monte Carlo Forest Search

    Authors: Chris Cameron, Jason Hartford, Taylor Lundy, Tuan Truong, Alan Milligan, Rex Chen, Kevin Leyton-Brown

    Abstract: We introduce Monte Carlo Forest Search (MCFS), a class of reinforcement learning (RL) algorithms for learning policies in tree MDPs, for which policy execution involves traversing an exponential-sized tree. Examples of such problems include proving unsatisfiability of a SAT formula; counting the number of solutions of a satisfiable SAT formula; and finding the optimal solution to a mixed-integer…

    Submitted 25 May, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

  13. arXiv:2211.06318  [pdf]

    cs.CY cs.AI cs.LG

    Artificial Intelligence and Life in 2030: The One Hundred Year Study on Artificial Intelligence

    Authors: Peter Stone, Rodney Brooks, Erik Brynjolfsson, Ryan Calo, Oren Etzioni, Greg Hager, Julia Hirschberg, Shivaram Kalyanakrishnan, Ece Kamar, Sarit Kraus, Kevin Leyton-Brown, David Parkes, William Press, AnnaLee Saxenian, Julie Shah, Milind Tambe, Astro Teller

    Abstract: In September 2016, Stanford's "One Hundred Year Study on Artificial Intelligence" project (AI100) issued the first report of its planned long-term periodic assessment of artificial intelligence (AI) and its impact on society. It was written by a panel of 17 study authors, each of whom is deeply rooted in AI research, chaired by Peter Stone of the University of Texas at Austin. The report, entitled…

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: 52 pages, https://ai100.stanford.edu/2016-report

  14. arXiv:2209.01242  [pdf, other]

    cs.AI cs.GT

    Better Peer Grading through Bayesian Inference

    Authors: Hedayat Zarkoob, Greg d'Eon, Lena Podina, Kevin Leyton-Brown

    Abstract: Peer grading systems aggregate noisy reports from multiple students to approximate a true grade as closely as possible. Most current systems either take the mean or median of reported grades; others aim to estimate students' grading accuracy under a probabilistic model. This paper extends the state of the art in the latter approach in three key ways: (1) recognizing that students can behave strate…

    Submitted 2 December, 2022; v1 submitted 2 September, 2022; originally announced September 2022.

  15. arXiv:2205.13028  [pdf, other]

    cs.AI cs.GT cs.LG cs.PF

    Formalizing Preferences Over Runtime Distributions

    Authors: Devon R. Graham, Kevin Leyton-Brown, Tim Roughgarden

    Abstract: When trying to solve a computational problem, we are often faced with a choice between algorithms that are guaranteed to return the right answer but differ in their runtime distributions (e.g., SAT solvers, sorting algorithms). This paper aims to lay theoretical foundations for such choices by formalizing preferences over runtime distributions. It might seem that we should simply prefer the algori…

    Submitted 2 June, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

  16. arXiv:2205.00445  [pdf, other]

    cs.CL cs.AI

    MRKL Systems: A modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning

    Authors: Ehud Karpas, Omri Abend, Yonatan Belinkov, Barak Lenz, Opher Lieber, Nir Ratner, Yoav Shoham, Hofit Bata, Yoav Levine, Kevin Leyton-Brown, Dor Muhlgay, Noam Rozen, Erez Schwartz, Gal Shachaf, Shai Shalev-Shwartz, Amnon Shashua, Moshe Tenenholtz

    Abstract: Huge language models (LMs) have ushered in a new era for AI, serving as a gateway to natural-language-based knowledge tasks. Although an essential element of modern AI, LMs are also inherently limited in a number of ways. We discuss these limitations and how they can be avoided by adopting a systems approach. Conceptualizing the challenge as one that involves knowledge and reasoning in addition to…

    Submitted 1 May, 2022; originally announced May 2022.

  17. arXiv:2204.10019  [pdf, other]

    cs.CL cs.AI

    Standing on the Shoulders of Giant Frozen Language Models

    Authors: Yoav Levine, Itay Dalmedigos, Ori Ram, Yoel Zeldes, Daniel Jannai, Dor Muhlgay, Yoni Osin, Opher Lieber, Barak Lenz, Shai Shalev-Shwartz, Amnon Shashua, Kevin Leyton-Brown, Yoav Shoham

    Abstract: Huge pretrained language models (LMs) have demonstrated surprisingly good zero-shot capabilities on a wide variety of tasks. This gives rise to the appealing vision of a single, versatile model with a wide range of functionalities across disparate applications. However, current leading techniques for leveraging a "frozen" LM -- i.e., leaving its weights untouched -- still often underperform fine-t…

    Submitted 21 April, 2022; originally announced April 2022.

  18. arXiv:2202.12273  [pdf, other]

    cs.AI cs.IR

    Matching Papers and Reviewers at Large Conferences

    Authors: Kevin Leyton-Brown, Mausam, Yatin Nandwani, Hedayat Zarkoob, Chris Cameron, Neil Newman, Dinesh Raghu

    Abstract: Peer-reviewed conferences, the main publication venues in CS, rely critically on matching highly qualified reviewers for each paper. Because of the growing scale of these conferences, the tight timelines on which they operate, and a recent surge in explicitly dishonest behavior, there is now no alternative to performing this matching in an automated way. This paper studies a novel reviewer-paper m…

    Submitted 5 August, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

  19. arXiv:2107.00758  [pdf, other]

    cs.LG stat.ML

    The Spotlight: A General Method for Discovering Systematic Errors in Deep Learning Models

    Authors: Greg d'Eon, Jason d'Eon, James R. Wright, Kevin Leyton-Brown

    Abstract: Supervised learning models often make systematic errors on rare subsets of the data. When these subsets correspond to explicit labels in the data (e.g., gender, race) such poor performance can be identified straightforwardly. This paper introduces a method for discovering systematic errors that do not correspond to such explicitly labelled subgroups. The key idea is that similar inputs tend to hav…

    Submitted 15 October, 2021; v1 submitted 1 July, 2021; originally announced July 2021.

  20. arXiv:2106.10349  [pdf, other]

    cs.LG cs.AI math.OC

    The Perils of Learning Before Optimizing

    Authors: Chris Cameron, Jason Hartford, Taylor Lundy, Kevin Leyton-Brown

    Abstract: Formulating real-world optimization problems often begins with making predictions from historical data (e.g., an optimizer that aims to recommend fast routes relies upon travel-time predictions). Typically, learning the prediction model used to generate the optimization problem and solving that problem are performed in two separate stages. Recent work has showed how such prediction models can be l…

    Submitted 16 December, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

  21. arXiv:2101.10078  [pdf, other]

    cs.CY

    Mechanical TA 2: A System for Peer Grading with TA Support

    Authors: Hedayat Zarkoob, Farzad Abdolhosseini, Kevin Leyton-Brown

    Abstract: Mechanical TA 2 (MTA2) is an open source web-based peer grading application that leverages trusted TA graders to incentivize high-quality peer review. A previous, prototype implementation of MTA proved the value of the concept, but was neither suitable for use at scale nor easily extensible; MTA2 is a complete reimplementation of the system that overcomes these hurdles. MTA2 serves two, interconne…

    Submitted 18 January, 2021; originally announced January 2021.

  22. arXiv:2012.00689  [pdf, ps, other]

    cs.GT cs.DS

    Dynamic Weighted Matching with Heterogeneous Arrival and Departure Rates

    Authors: Natalie Collina, Nicole Immorlica, Kevin Leyton-Brown, Brendan Lucier, Neil Newman

    Abstract: We study a dynamic non-bipartite matching problem. There is a fixed set of agent types, and agents of a given type arrive and depart according to type-specific Poisson processes. Agent departures are not announced in advance. The value of a match is determined by the types of the matched agents. We present an online algorithm that is (1/8)-competitive with respect to the value of the optimal-in-hi…

    Submitted 10 January, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

  23. arXiv:2011.01285  [pdf, other]

    cs.LG cs.CL

    Exemplar Guided Active Learning

    Authors: Jason Hartford, Kevin Leyton-Brown, Hadas Raviv, Dan Padnos, Shahar Lev, Barak Lenz

    Abstract: We consider the problem of wisely using a limited budget to label a small subset of a large unlabeled dataset. We are motivated by the NLP problem of word sense disambiguation. For any word, we have a set of candidate labels from a knowledge base, but the label set is not necessarily representative of what occurs in the data: there may exist labels in the knowledge base that very rarely occur in t…

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: Published at NeurIPS 2020

  24. arXiv:2010.01825  [pdf, other]

    cs.LG cs.CL stat.ML

    PMI-Masking: Principled masking of correlated spans

    Authors: Yoav Levine, Barak Lenz, Opher Lieber, Omri Abend, Kevin Leyton-Brown, Moshe Tennenholtz, Yoav Shoham

    Abstract: Masking tokens uniformly at random constitutes a common flaw in the pretraining of Masked Language Models (MLMs) such as BERT. We show that such uniform masking allows an MLM to minimize its training objective by latching onto shallow local signals, leading to pretraining inefficiency and suboptimal downstream performance. To address this flaw, we propose PMI-Masking, a principled masking strategy…

    Submitted 5 October, 2020; originally announced October 2020.

  25. arXiv:2006.11386  [pdf, other]

    stat.ME cs.LG econ.EM stat.ML

    Valid Causal Inference with (Some) Invalid Instruments

    Authors: Jason Hartford, Victor Veitch, Dhanya Sridhar, Kevin Leyton-Brown

    Abstract: Instrumental variable methods provide a powerful approach to estimating causal effects in the presence of unobserved confounding. But a key challenge when applying them is the reliance on untestable "exclusion" assumptions that rule out any relationship between the instrument variable and the response that is not mediated by the treatment. In this paper, we show how to perform consistent IV estima…

    Submitted 19 June, 2020; originally announced June 2020.

  26. arXiv:2006.04497  [pdf, other]

    cs.GT cs.AI cs.CY cs.LG

    Learning under Invariable Bayesian Safety

    Authors: Gal Bahar, Omer Ben-Porat, Kevin Leyton-Brown, Moshe Tennenholtz

    Abstract: A recent body of work addresses safety constraints in explore-and-exploit systems. Such constraints arise where, for example, exploration is carried out by individuals whose welfare should be balanced with overall welfare. In this paper, we adopt a model inspired by recent work on a bandit-like setting for recommendations. We contribute to this line of literature by introducing a safety constraint…

    Submitted 8 June, 2020; originally announced June 2020.

  27. arXiv:2003.09761  [pdf, other]

    cs.CY cs.LG physics.soc-ph stat.ML

    Smarter Parking: Using AI to Identify Parking Inefficiencies in Vancouver

    Authors: Devon Graham, Satish Kumar Sarraf, Taylor Lundy, Ali MohammadMehr, Sara Uppal, Tae Yoon Lee, Hedayat Zarkoob, Scott Duke Kominers, Kevin Leyton-Brown

    Abstract: On-street parking is convenient, but has many disadvantages: on-street spots come at the expense of other road uses such as traffic lanes, transit lanes, bike lanes, or parklets; drivers looking for parking contribute substantially to traffic congestion and hence to greenhouse gas emissions; safety is reduced both due to the fact that drivers looking for spots are more distracted than other road u…

    Submitted 21 March, 2020; originally announced March 2020.

    Comments: All the authors contributed equally. This paper is an outcome of https://www.cs.ubc.ca/~kevinlb/teaching/cs532l%20-%202018-19/index.html. To be submitted to a journal in transportation or urban planning

  28. arXiv:1906.05884  [pdf, other]

    cs.GT

    Report-Sensitive Spot-Checking in Peer-Grading Systems

    Authors: Hedayat Zarkoob, Hu Fu, Kevin Leyton-Brown

    Abstract: Peer grading systems make large courses more scalable, provide students with faster and more detailed feedback, and help students to learn by thinking critically about the work of others. A key obstacle to the broader adoption of peer grading systems is motivating students to provide accurate grades. The literature has explored many different approaches to incentivizing accurate grading (which we…

    Submitted 8 March, 2021; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: This work is published at AAMAS 2020 and supersedes an AAMAS 2019 extended abstract with the same title

  29. arXiv:1905.07043  [pdf, ps, other]

    cs.GT cs.IR cs.LG

    Fiduciary Bandits

    Authors: Gal Bahar, Omer Ben-Porat, Kevin Leyton-Brown, Moshe Tennenholtz

    Abstract: Recommendation systems often face exploration-exploitation tradeoffs: the system can only learn about the desirability of new options by recommending them to some user. Such systems can thus be modeled as multi-armed bandit settings; however, users are self-interested and cannot be made to follow recommendations. We ask whether exploration can nevertheless be performed in a way that scrupulously r…

    Submitted 28 June, 2020; v1 submitted 16 May, 2019; originally announced May 2019.

    Comments: Published in The Thirty-seventh International Conference on Machine Learning (ICML 2020)

  30. arXiv:1902.05454  [pdf, other]

    cs.AI cs.LG

    Procrastinating with Confidence: Near-Optimal, Anytime, Adaptive Algorithm Configuration

    Authors: Robert Kleinberg, Kevin Leyton-Brown, Brendan Lucier, Devon Graham

    Abstract: Algorithm configuration methods optimize the performance of a parameterized heuristic algorithm on a given distribution of problem instances. Recent work introduced an algorithm configuration procedure ("Structured Procrastination") that provably achieves near optimal performance with high probability and with nearly minimal runtime in the worst case. It also offers an $\textit{anytime}$ property:…

    Submitted 8 November, 2019; v1 submitted 14 February, 2019; originally announced February 2019.

  31. arXiv:1812.11571  [pdf, ps, other]

    cs.GT

    A Formal Separation Between Strategic and Nonstrategic Behavior

    Authors: James R. Wright, Kevin Leyton-Brown

    Abstract: It is common to make a distinction between "strategic" behavior and other forms of intentional but "nonstrategic" behavior: typically, that strategic agents model other agents while nonstrategic agents do not. However, a crisp boundary between these concepts has proven elusive. This problem is pervasive throughout the game theoretic literature on bounded rationality and particularly critical in pa…

    Submitted 26 May, 2022; v1 submitted 30 December, 2018; originally announced December 2018.

  32. arXiv:1803.02879  [pdf, other]

    stat.ML cs.LG

    Deep Models of Interactions Across Sets

    Authors: Jason Hartford, Devon R Graham, Kevin Leyton-Brown, Siamak Ravanbakhsh

    Abstract: We use deep learning to model interactions across two or more sets of objects, such as user-movie ratings, protein-drug bindings, or ternary user-item-tag interactions. The canonical representation of such interactions is a matrix (or a higher-dimensional tensor) with an exchangeability property: the encoding's meaning is not changed by permuting rows or columns. We argue that models should hence…

    Submitted 8 June, 2018; v1 submitted 7 March, 2018; originally announced March 2018.

  33. arXiv:1706.04324  [pdf, other]

    cs.GT

    Assessing Economic Outcomes in Simulated Reverse Clock Auctions for Radio Spectrum

    Authors: Neil Newman, Kevin Leyton-Brown, Paul Milgrom, Ilya Segal

    Abstract: We investigate the economic outcomes that result under simulated bidder behavior in a model of the FCC's reverse auction for radio spectrum. In our simulations, limiting our notion of efficiency to the reverse auction in isolation, the reverse clock auction achieves very efficient solutions, the FCC's scoring rule greatly reduces the total payments to TV broadcasters at the cost of some efficiency…

    Submitted 14 June, 2017; originally announced June 2017.

  34. arXiv:1706.03304  [pdf, other]

    cs.AI

    Deep Optimization for Spectrum Repacking

    Authors: Neil Newman, Alexandre Fréchette, Kevin Leyton-Brown

    Abstract: Over 13 months in 2016-17 the FCC conducted an "incentive auction" to repurpose radio spectrum from broadcast television to wireless internet. In the end, the auction yielded $19.8 billion, $10.05 billion of which was paid to 175 broadcasters for voluntarily relinquishing their licenses across 14 UHF channels. Stations that continued broadcasting were assigned potentially new channels to fit as de…

    Submitted 10 June, 2017; originally announced June 2017.

  35. arXiv:1703.10342  [pdf, other]

    cs.AI stat.ML

    Efficient Benchmarking of Algorithm Configuration Procedures via Model-Based Surrogates

    Authors: Katharina Eggensperger, Marius Lindauer, Holger H. Hoos, Frank Hutter, Kevin Leyton-Brown

    Abstract: The optimization of algorithm (hyper-)parameters is crucial for achieving peak performance across a wide range of domains, ranging from deep neural networks to solvers for hard combinatorial problems. The resulting algorithm configuration (AC) problem has attracted much attention from the machine learning community. However, the proper evaluation of new AC procedures is hindered by two key hurdles…

    Submitted 30 March, 2017; originally announced March 2017.

  36. arXiv:1612.09596  [pdf, other]

    stat.AP cs.LG stat.ML

    Counterfactual Prediction with Deep Instrumental Variables Networks

    Authors: Jason Hartford, Greg Lewis, Kevin Leyton-Brown, Matt Taddy

    Abstract: We are in the middle of a remarkable rise in the use and capability of artificial intelligence. Much of this growth has been fueled by the success of deep learning architectures: models that map from observables to outputs via multiple layers of latent representations. These deep learning algorithms are effective tools for unstructured prediction, and they can be combined in AI systems to solve co…

    Submitted 30 December, 2016; originally announced December 2016.

  37. arXiv:1609.08923  [pdf, other]

    cs.GT

    Models of Level-0 Behavior for Predicting Human Behavior in Games

    Authors: James R. Wright, Kevin Leyton-Brown

    Abstract: Behavioral game theory seeks to describe the way actual people (as compared to idealized, "rational" agents) act in strategic situations. Our own recent work has identified iterative models (such as quantal cognitive hierarchy) as the state of the art for predicting human play in unrepeated, simultaneous-move games (Wright & Leyton-Brown 2012, 2016). Iterative models predict that agents reason ite…

    Submitted 28 September, 2016; originally announced September 2016.

  38. arXiv:1606.07042  [pdf, ps, other]

    cs.GT

    Incentivizing Evaluation via Limited Access to Ground Truth: Peer-Prediction Makes Things Worse

    Authors: Alice Gao, James R. Wright, Kevin Leyton-Brown

    Abstract: In many settings, an effective way of evaluating objects of interest is to collect evaluations from dispersed individuals and to aggregate these evaluations together. Some examples are categorizing online content and evaluating student assignments via peer grading. For this data science problem, one challenge is to motivate participants to conduct such evaluations carefully and to report them hone…

    Submitted 22 June, 2016; originally announced June 2016.

  39. arXiv:1506.02465  [pdf, other]

    cs.AI cs.LG

    ASlib: A Benchmark Library for Algorithm Selection

    Authors: Bernd Bischl, Pascal Kerschke, Lars Kotthoff, Marius Lindauer, Yuri Malitsky, Alexandre Frechette, Holger Hoos, Frank Hutter, Kevin Leyton-Brown, Kevin Tierney, Joaquin Vanschoren

    Abstract: The task of algorithm selection involves choosing an algorithm from a set of algorithms on a per-instance basis in order to exploit the varying performance of algorithms over a set of instances. The algorithm selection problem is attracting increasing attention from researchers and practitioners in AI. Years of fruitful applications in a number of domains have resulted in a large amount of data, b…

    Submitted 6 April, 2016; v1 submitted 8 June, 2015; originally announced June 2015.

    Comments: Accepted to be published in Artificial Intelligence Journal

  40. arXiv:1505.01221  [pdf, other]

    cs.AI cs.LG

    The Configurable SAT Solver Challenge (CSSC)

    Authors: Frank Hutter, Marius Lindauer, Adrian Balint, Sam Bayless, Holger Hoos, Kevin Leyton-Brown

    Abstract: It is well known that different solution strategies work well for different types of instances of hard combinatorial problems. As a consequence, most solvers for the propositional satisfiability problem (SAT) expose parameters that allow them to be customized to a particular family of instances. In the international SAT competition series, these parameters are ignored: solvers are run using a sing…

    Submitted 2 August, 2016; v1 submitted 5 May, 2015; originally announced May 2015.

  41. arXiv:1408.0703  [pdf, other]

    cs.GT cs.AI

    Computational Analysis of Perfect-Information Position Auctions

    Authors: David R. M Thompson, Kevin Leyton-Brown

    Abstract: After experimentation with other designs, the major search engines converged on the weighted, generalized second-price auction (wGSP) for selling keyword advertisements. Notably, this convergence occurred before position auctions were well understood (or, indeed, widely studied) theoretically. While much progress has been made since, theoretical analysis is still not able to settle the question of…

    Submitted 4 August, 2014; originally announced August 2014.

  42. arXiv:1401.8074  [pdf, ps, other]

    cs.GT cs.LG

    Empirically Evaluating Multiagent Learning Algorithms

    Authors: Erik Zawadzki, Asher Lipson, Kevin Leyton-Brown

    Abstract: There exist many algorithms for learning how to play repeated bimatrix games. Most of these algorithms are justified in terms of some sort of theoretical guarantee. On the other hand, little is known about the empirical performance of these algorithms. Most such claims in the literature are based on small experiments, which has hampered understanding as well as the development of new multiagent le…

    Submitted 31 January, 2014; originally announced January 2014.

  43. ParamILS: An Automatic Algorithm Configuration Framework

    Authors: Frank Hutter, Thomas Stuetzle, Kevin Leyton-Brown, Holger H. Hoos

    Abstract: The identification of performance-optimizing parameter settings is an important part of the development and application of algorithms. We describe an automatic framework for this algorithm configuration problem. More formally, we provide methods for optimizing a target algorithm's performance on a given class of problem instances by varying a set of ordinal and/or categorical parameters. We review…

    Submitted 15 January, 2014; originally announced January 2014.

    Journal ref: Journal of Artificial Intelligence Research, Volume 36, pages 267-306, 2009

  44. arXiv:1310.1947  [pdf, other]

    cs.AI cs.LG stat.ML

    Bayesian Optimization With Censored Response Data

    Authors: Frank Hutter, Holger Hoos, Kevin Leyton-Brown

    Abstract: Bayesian optimization (BO) aims to minimize a given blackbox function using a model that is updated whenever new evidence about the function becomes available. Here, we address the problem of BO under partially right-censored response data, where in some evaluations we only obtain a lower bound on the function value. The ability to handle such response data allows us to adaptively censor costly fu…

    Submitted 7 October, 2013; originally announced October 2013.

    Comments: Extended version of NIPS 2011 workshop paper

    ACM Class: G.3; G.1.6

  45. Predicting Human Behavior in Unrepeated, Simultaneous-Move Games

    Authors: James R. Wright, Kevin Leyton-Brown

    Abstract: It is common to assume that agents will adopt Nash equilibrium strategies; however, experimental studies have demonstrated that Nash equilibrium is often a poor description of human players' behavior in unrepeated normal-form games. In this paper, we analyze five widely studied models (Quantal Response Equilibrium, Level-k, Cognitive Hierarchy, QLk, and Noisy Introspection) that aim to describe ac…

    Submitted 29 August, 2017; v1 submitted 4 June, 2013; originally announced June 2013.

    Journal ref: Games and Economic Behavior, 106 (2017), pages 16–37

  46. arXiv:1211.0906  [pdf, other]

    cs.AI cs.LG cs.PF stat.ML

    Algorithm Runtime Prediction: Methods & Evaluation

    Authors: Frank Hutter, Lin Xu, Holger H. Hoos, Kevin Leyton-Brown

    Abstract: Perhaps surprisingly, it is possible to predict how long an algorithm will take to run on a previously unseen input, using machine learning techniques to build a model of the algorithm's runtime as a function of problem-specific instance features. Such models have important applications to algorithm analysis, portfolio-based algorithm selection, and the automatic configuration of parameterized alg…

    Submitted 26 October, 2013; v1 submitted 5 November, 2012; originally announced November 2012.

    Comments: 51 pages, 13 figures, 8 tables. Added references, feature cost, and experiments with subsets of features; reworded Sections 1&2

    MSC Class: 68T20 ACM Class: I.2.8; I.2.6

  47. arXiv:1208.3719  [pdf, other]

    cs.LG

    Auto-WEKA: Combined Selection and Hyperparameter Optimization of Classification Algorithms

    Authors: Chris Thornton, Frank Hutter, Holger H. Hoos, Kevin Leyton-Brown

    Abstract: Many different machine learning algorithms exist; taking into account each algorithm's hyperparameters, there is a staggeringly large number of possible alternatives overall. We consider the problem of simultaneously selecting a learning algorithm and setting its hyperparameters, going beyond previous work that addresses these issues in isolation. We show that this problem can be addressed by a fu…

    Submitted 6 March, 2013; v1 submitted 17 August, 2012; originally announced August 2012.

    Comments: 9 pages, 3 figures

    Report number: Technical Report TR-2012-05 ACM Class: I.2.6; D.2.10; I.2.2

  48. arXiv:1207.4128  [pdf]

    cs.GT

    Computing Nash Equilibria of Action-Graph Games

    Authors: Navin Bhat, Kevin Leyton-Brown

    Abstract: Action-graph games (AGGs) are a fully expressive game representation which can compactly express both strict and context-specific independence between players' utility functions. Actions are represented as nodes in a graph G, and the payoff to an agent who chose the action s depends only on the numbers of other agents who chose actions connected to s. We present algorithms for computing both symme…

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-35-42

  49. arXiv:1205.2638  [pdf]

    cs.GT cs.AI

    Temporal Action-Graph Games: A New Representation for Dynamic Games

    Authors: Albert Xin Jiang, Kevin Leyton-Brown, Avi Pfeffer

    Abstract: In this paper we introduce temporal action graph games (TAGGs), a novel graphical representation of imperfect-information extensive form games. We show that when a game involves anonymity or context-specific utility independencies, its encoding as a TAGG can be much more compact than its direct encoding as a multiagent influence diagram (MAID). We also show that TAGGs can be understood as indirect…

    Submitted 9 May, 2012; originally announced May 2012.

    Comments: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

    Report number: UAI-P-2009-PG-268-276

  50. SATzilla: Portfolio-based Algorithm Selection for SAT

    Authors: Lin Xu, Frank Hutter, Holger H. Hoos, Kevin Leyton-Brown

    Abstract: It has been widely observed that there is no single "dominant" SAT solver; instead, different solvers perform best on different instances. Rather than following the traditional approach of choosing the best solver for a given class of instances, we advocate making this decision online on a per-instance basis. Building on previous work, we describe SATzilla, an automated approach for constructing p…

    Submitted 31 October, 2011; originally announced November 2011.

    Journal ref: Journal of Artificial Intelligence Research, Volume 32, pages 565-606, 2008