Skip to main content

Showing 1–22 of 22 results for author: Beygelzimer, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.03262  [pdf, other

    cs.LG cs.DL

    Has the Machine Learning Review Process Become More Arbitrary as the Field Has Grown? The NeurIPS 2021 Consistency Experiment

    Authors: Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan

    Abstract: We present the NeurIPS 2021 consistency experiment, a larger-scale variant of the 2014 NeurIPS experiment in which 10% of conference submissions were reviewed by two independent committees to quantify the randomness in the review process. We observe that the two committees disagree on their accept/reject recommendations for 23% of the papers and that, consistent with the results from 2014, approxi… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  2. arXiv:2211.12966  [pdf, other

    cs.LG cs.DB cs.DL

    How do Authors' Perceptions of their Papers Compare with Co-authors' Perceptions and Peer-review Decisions?

    Authors: Charvi Rastogi, Ivan Stelmakh, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, Zhenyu Xue, Hal Daumé III, Emma Pierson, Nihar B. Shah

    Abstract: How do author perceptions match up to the outcomes of the peer-review process and perceptions of others? In a top-tier computer science conference (NeurIPS 2021) with more than 23,000 submitting authors and 9,000 submitted papers, we survey the authors on three questions: (i) their predicted probability of acceptance for each of their papers, (ii) their perceived ranking of their own papers based… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

  3. arXiv:2003.12206  [pdf, other

    cs.LG stat.ML

    Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program)

    Authors: Joelle Pineau, Philippe Vincent-Lamarre, Koustuv Sinha, Vincent Larivière, Alina Beygelzimer, Florence d'Alché-Buc, Emily Fox, Hugo Larochelle

    Abstract: One of the challenges in machine learning research is to ensure that presented and published results are sound and reliable. Reproducibility, that is obtaining similar results as presented in a paper or talk, using the same code and data (when available), is a necessary step to verify the reliability of research findings. Reproducibility is also an important step to promote open and accessible res… ▽ More

    Submitted 30 December, 2020; v1 submitted 26 March, 2020; originally announced March 2020.

    Comments: To appear at JMLR, 16 pages + Appendix

  4. arXiv:1902.02244  [pdf, other

    cs.LG stat.ML

    Bandit Multiclass Linear Classification: Efficient Algorithms for the Separable Case

    Authors: Alina Beygelzimer, Dávid Pál, Balázs Szörényi, Devanathan Thiruvenkatachari, Chen-Yu Wei, Chicheng Zhang

    Abstract: We study the problem of efficient online multiclass linear classification with bandit feedback, where all examples belong to one of $K$ classes and lie in the $d$-dimensional Euclidean space. Previous works have left open the challenge of designing efficient algorithms with finite mistake bounds when the data is linearly separable by a margin $γ$. In this work, we take a first step towards this pr… ▽ More

    Submitted 18 June, 2019; v1 submitted 6 February, 2019; originally announced February 2019.

    Comments: 41 pages, 8 figures

  5. arXiv:1807.06473  [pdf, other

    cs.LG stat.ML

    Contextual Memory Trees

    Authors: Wen Sun, Alina Beygelzimer, Hal Daumé III, John Langford, Paul Mineiro

    Abstract: We design and study a Contextual Memory Tree (CMT), a learning memory controller that inserts new memories into an experience store of unbounded size. It is designed to efficiently query for memories from that store, supporting logarithmic time insertion and retrieval operations. Hence CMT can be integrated into existing statistical learning algorithms as an augmented memory unit without substanti… ▽ More

    Submitted 2 June, 2019; v1 submitted 17 July, 2018; originally announced July 2018.

    Comments: ICM 2019

  6. arXiv:1803.02453  [pdf, other

    cs.LG

    A Reductions Approach to Fair Classification

    Authors: Alekh Agarwal, Alina Beygelzimer, Miroslav Dudík, John Langford, Hanna Wallach

    Abstract: We present a systematic approach for achieving fairness in a binary classification setting. While we focus on two well-known quantitative definitions of fairness, our approach encompasses many other previously studied definitions as special cases. The key idea is to reduce fair classification to a sequence of cost-sensitive classification problems, whose solutions yield a randomized classifier wit… ▽ More

    Submitted 16 July, 2018; v1 submitted 6 March, 2018; originally announced March 2018.

  7. arXiv:1702.07958  [pdf, other

    cs.LG stat.ML

    Efficient Online Bandit Multiclass Learning with $\tilde{O}(\sqrt{T})$ Regret

    Authors: Alina Beygelzimer, Francesco Orabona, Chicheng Zhang

    Abstract: We present an efficient second-order algorithm with $\tilde{O}(\frac{1}η\sqrt{T})$ regret for the bandit online multiclass problem. The regret bound holds simultaneously with respect to a family of loss functions parameterized by $η$, for a range of $η$ restricted by the norm of the competitor. The family of loss functions ranges from hinge loss ($η=0$) to squared hinge loss ($η=1$). This provides… ▽ More

    Submitted 17 January, 2018; v1 submitted 25 February, 2017; originally announced February 2017.

    Comments: 22 pages, 2 figures; ICML 2017; this version includes additional discussions of Newtron, and a variant of SOBA that directly uses an online exp-concave optimization oracle

  8. arXiv:1602.07265  [pdf, ps, other

    cs.LG stat.ML

    Search Improves Label for Active Learning

    Authors: Alina Beygelzimer, Daniel Hsu, John Langford, Chicheng Zhang

    Abstract: We investigate active learning with access to two distinct oracles: Label (which is standard) and Search (which is not). The Search oracle models the situation where a human searches a database to seed or counterexample an existing solution. Search is stronger than Label while being natural to implement in many situations. We show that an algorithm using both oracles can provide exponentially larg… ▽ More

    Submitted 24 October, 2016; v1 submitted 23 February, 2016; originally announced February 2016.

    Comments: 32 pages; NIPS 2016

  9. arXiv:1506.04820  [pdf, ps, other

    cs.LG

    Online Gradient Boosting

    Authors: Alina Beygelzimer, Elad Hazan, Satyen Kale, Haipeng Luo

    Abstract: We extend the theory of boosting for regression problems to the online learning setting. Generalizing from the batch setting for boosting, the notion of a weak learning algorithm is modeled as an online learning algorithm with linear loss functions that competes with a base class of regression functions, while a strong learning algorithm is an online learning algorithm with convex loss functions t… ▽ More

    Submitted 30 October, 2015; v1 submitted 15 June, 2015; originally announced June 2015.

  10. arXiv:1502.02704  [pdf, other

    cs.LG

    Learning Reductions that Really Work

    Authors: Alina Beygelzimer, Hal Daumé III, John Langford, Paul Mineiro

    Abstract: We provide a summary of the mathematical and computational techniques that have enabled learning reductions to effectively address a wide class of problems, and show that this approach to solving machine learning problems can be broadly useful.

    Submitted 9 February, 2015; originally announced February 2015.

  11. arXiv:1502.02651  [pdf, ps, other

    cs.LG

    Optimal and Adaptive Algorithms for Online Boosting

    Authors: Alina Beygelzimer, Satyen Kale, Haipeng Luo

    Abstract: We study online boosting, the task of converting any weak online learner into a strong online learner. Based on a novel and natural definition of weak online learnability, we develop two online boosting algorithms. The first algorithm is an online version of boost-by-majority. By proving a matching lower bound, we show that this algorithm is essentially optimal in terms of the number of weak learn… ▽ More

    Submitted 9 February, 2015; originally announced February 2015.

  12. arXiv:1410.0440  [pdf, other

    cs.LG stat.ML

    Scalable Nonlinear Learning with Adaptive Polynomial Expansions

    Authors: Alekh Agarwal, Alina Beygelzimer, Daniel Hsu, John Langford, Matus Telgarsky

    Abstract: Can we effectively learn a nonlinear representation in time comparable to linear learning? We describe a new algorithm that explicitly and adaptively expands higher-order interaction features over base linear representations. The algorithm is designed for extreme computational efficiency, and an extensive experimental study shows that its computation/prediction tradeoff ability compares very favor… ▽ More

    Submitted 1 October, 2014; originally announced October 2014.

    Comments: To appear in NIPS 2014

  13. arXiv:1408.2031  [pdf

    cs.LG stat.ML

    Conditional Probability Tree Estimation Analysis and Algorithms

    Authors: Alina Beygelzimer, John Langford, Yuri Lifshits, Gregory Sorkin, Alexander L. Strehl

    Abstract: We consider the problem of estimating the conditional probability of a label in time O(log n), where n is the number of possible labels. We analyze a natural reduction of this problem to a set of binary regression problems organized in a tree structure, proving a regret bound that scales with the depth of the tree. Motivated by this analysis, we propose the first online algorithm which provably co… ▽ More

    Submitted 9 August, 2014; originally announced August 2014.

    Comments: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

    Report number: UAI-P-2009-PG-51-58

  14. arXiv:1207.1418  [pdf

    cs.AI

    Efficient Test Selection in Active Diagnosis via Entropy Approximation

    Authors: Alice X. Zheng, Irina Rish, Alina Beygelzimer

    Abstract: We consider the problem of diagnosing faults in a system represented by a Bayesian network, where diagnosis corresponds to recovering the most likely state of unobserved nodes given the outcomes of tests (observed nodes). Finding an optimal subset of tests in this setting is intractable in general. We show that it is difficult even to compute the next most-informative test using greedy test select… ▽ More

    Submitted 4 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)

    Report number: UAI-P-2005-PG-675-682

  15. arXiv:1201.6655  [pdf, other

    cs.AI q-fin.GN

    Learning Performance of Prediction Markets with Kelly Bettors

    Authors: Alina Beygelzimer, John Langford, David Pennock

    Abstract: In evaluating prediction markets (and other crowd-prediction mechanisms), investigators have repeatedly observed a so-called "wisdom of crowds" effect, which roughly says that the average of participants performs much better than the average participant. The market price---an average or at least aggregate of traders' beliefs---offers a better estimate than most any individual trader's opinion. In… ▽ More

    Submitted 31 January, 2012; originally announced January 2012.

  16. arXiv:1006.2588  [pdf, ps, other

    cs.LG

    Agnostic Active Learning Without Constraints

    Authors: Alina Beygelzimer, Daniel Hsu, John Langford, Tong Zhang

    Abstract: We present and analyze an agnostic active learning algorithm that works without kee** a version space. This is unlike all previous approaches where a restricted set of candidate hypotheses is maintained throughout learning, and only hypotheses from this set are ever returned. By avoiding this version space approach, our algorithm sheds the computational burden and brittleness associated with mai… ▽ More

    Submitted 13 June, 2010; originally announced June 2010.

  17. arXiv:1002.4058  [pdf, ps, other

    cs.LG

    Contextual Bandit Algorithms with Supervised Learning Guarantees

    Authors: Alina Beygelzimer, John Langford, Lihong Li, Lev Reyzin, Robert E. Schapire

    Abstract: We address the problem of learning in an online, bandit setting where the learner must repeatedly select among $K$ actions, but only receives partial feedback based on its choices. We establish two new facts: First, using a new algorithm called Exp4.P, we show that it is possible to compete with the best in a set of $N$ experts with probability $1-δ$ while incurring regret at most… ▽ More

    Submitted 27 October, 2011; v1 submitted 22 February, 2010; originally announced February 2010.

    Comments: 10 pages

    ACM Class: I.2.6

  18. arXiv:0903.4217  [pdf, ps, other

    cs.LG cs.AI

    Conditional Probability Tree Estimation Analysis and Algorithms

    Authors: Alina Beygelzimer, John Langford, Yuri Lifshits, Gregory Sorkin, Alex Strehl

    Abstract: We consider the problem of estimating the conditional probability of a label in time $O(\log n)$, where $n$ is the number of possible labels. We analyze a natural reduction of this problem to a set of binary regression problems organized in a tree structure, proving a regret bound that scales with the depth of the tree. Motivated by this analysis, we propose the first online algorithm which prov… ▽ More

    Submitted 3 June, 2009; v1 submitted 24 March, 2009; originally announced March 2009.

  19. arXiv:0902.3176  [pdf, ps, other

    cs.AI cs.LG

    Error-Correcting Tournaments

    Authors: Alina Beygelzimer, John Langford, Pradeep Ravikumar

    Abstract: We present a family of pairwise tournaments reducing $k$-class classification to binary classification. These reductions are provably robust against a constant fraction of binary errors. The results improve on the PECOC construction \cite{SECOC} with an exponential improvement in computation, from $O(k)$ to $O(\log_2 k)$, and the removal of a square root in the regret dependence, matching the be… ▽ More

    Submitted 3 February, 2010; v1 submitted 18 February, 2009; originally announced February 2009.

    Comments: Minor wording improvements

  20. arXiv:0812.4952  [pdf, ps, other

    cs.LG

    Importance Weighted Active Learning

    Authors: Alina Beygelzimer, Sanjoy Dasgupta, John Langford

    Abstract: We present a practical and statistically consistent scheme for actively learning binary classifiers under general loss functions. Our algorithm uses importance weighting to correct sampling bias, and by controlling the variance, we are able to give rigorous label complexity bounds for the learning process. Experiments on passively labeled data show that this approach reduces the label complexity… ▽ More

    Submitted 20 May, 2009; v1 submitted 29 December, 2008; originally announced December 2008.

  21. arXiv:0812.4044  [pdf, ps, other

    cs.LG cs.AI

    The Offset Tree for Learning with Partial Labels

    Authors: Alina Beygelzimer, John Langford

    Abstract: We present an algorithm, called the Offset Tree, for learning to make decisions in situations where the payoff of only one choice is observed, rather than all choices. The algorithm reduces this setting to binary classification, allowing one to reuse of any existing, fully supervised binary classification algorithm in this partial information setting. We show that the Offset Tree is an optimal red… ▽ More

    Submitted 3 April, 2016; v1 submitted 21 December, 2008; originally announced December 2008.

  22. arXiv:cs/9911007  [pdf, ps, other

    cs.CC cs.CR

    One-Way Functions in Worst-Case Cryptography: Algebraic and Security Properties

    Authors: A. Beygelzimer, L. A. Hemaspaandra, C. M. Homan, J. Rothe

    Abstract: We survey recent developments in the study of (worst-case) one-way functions having strong algebraic and security properties. According to [RS93], this line of research was initiated in 1984 by Rivest and Sherman who designed two-party secret-key agreement protocols that use strongly noninvertible, total, associative one-way functions as their key building blocks. If commutativity is added as an… ▽ More

    Submitted 15 November, 1999; originally announced November 1999.

    Comments: 17 pages

    Report number: University of Rochester Technical Report UR-CS TR 722 ACM Class: F.1.3; E.3