Search | arXiv e-print repository

Simple online learning with consistent oracle

Authors: Alexander Kozachinskiy, Tomasz Steifer

Abstract: We consider online learning in the model where a learning algorithm can access the class only via the \emph{consistent oracle} -- an oracle, that, at any moment, can give a function from the class that agrees with all examples seen so far. This model was recently considered by Assos et al.~(COLT'23). It is motivated by the fact that standard methods of online learning rely on computing the Littles… ▽ More We consider online learning in the model where a learning algorithm can access the class only via the \emph{consistent oracle} -- an oracle, that, at any moment, can give a function from the class that agrees with all examples seen so far. This model was recently considered by Assos et al.~(COLT'23). It is motivated by the fact that standard methods of online learning rely on computing the Littlestone dimension of subclasses, a computationally intractable problem. Assos et al.~gave an online learning algorithm in this model that makes at most $C^d$ mistakes on classes of Littlestone dimension $d$, for some absolute unspecified constant $C > 0$. We give a novel algorithm that makes at most $O(256^d)$ mistakes. Our proof is significantly simpler and uses only very basic properties of the Littlestone dimension. We also show that there exists no algorithm in this model that makes less than $3^d$ mistakes. △ Less

Submitted 6 February, 2024; v1 submitted 15 August, 2023; originally announced August 2023.

Comments: Changes to previous version: added 3^d lower bound

arXiv:2302.04731 [pdf, ps, other]

Find a witness or shatter: the landscape of computable PAC learning

Authors: Valentino Delle Rose, Alexander Kozachinskiy, Cristobal Rojas, Tomasz Steifer

Abstract: This paper contributes to the study of CPAC learnability -- a computable version of PAC learning -- by solving three open questions from recent papers. Firstly, we prove that every improperly CPAC learnable class is contained in a class which is properly CPAC learnable with polynomial sample complexity. This confirms a conjecture by Agarwal et al (COLT 2021). Secondly, we show that there exists a… ▽ More This paper contributes to the study of CPAC learnability -- a computable version of PAC learning -- by solving three open questions from recent papers. Firstly, we prove that every improperly CPAC learnable class is contained in a class which is properly CPAC learnable with polynomial sample complexity. This confirms a conjecture by Agarwal et al (COLT 2021). Secondly, we show that there exists a decidable class of hypothesis which is properly CPAC learnable, but only with uncomputably fast growing sample complexity. This solves a question from Sterkenburg (COLT 2022). Finally, we construct a decidable class of finite Littlestone dimension which is not improperly CPAC learnable, strengthening a recent result of Sterkenburg (2022) and answering a question posed by Hasrati and Ben-David (ALT 2023). Together with previous work, our results provide a complete landscape for the learnability problem in the CPAC setting. △ Less

Submitted 23 February, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

Comments: 12 pages, 1 figure (corrected version)

arXiv:2211.02144 [pdf, ps, other]

No Agreement Without Loss: Learning and Social Choice in Peer Review

Authors: Pablo Barceló, Mauricio Duarte, Cristóbal Rojas, Tomasz Steifer

Abstract: In peer review systems, reviewers are often asked to evaluate various features of submissions, such as technical quality or novelty. A score is given to each of the predefined features and based on these the reviewer has to provide an overall quantitative recommendation. It may be assumed that each reviewer has her own map** from the set of features to a recommendation, and that different review… ▽ More In peer review systems, reviewers are often asked to evaluate various features of submissions, such as technical quality or novelty. A score is given to each of the predefined features and based on these the reviewer has to provide an overall quantitative recommendation. It may be assumed that each reviewer has her own map** from the set of features to a recommendation, and that different reviewers have different map**s in mind. This introduces an element of arbitrariness known as commensuration bias. In this paper we discuss a framework, introduced by Noothigattu, Shah and Procaccia, and then applied by the organizers of the AAAI 2022 conference. Noothigattu, Shah and Procaccia proposed to aggregate reviewer's map** by minimizing certain loss functions, and studied axiomatic properties of this approach, in the sense of social choice theory. We challenge several of the results and assumptions used in their work and report a number of negative results. On the one hand, we study a trade-off between some of the axioms proposed and the ability of the method to properly capture agreements of the majority of reviewers. On the other hand, we show that drop** a certain unrealistic assumption has dramatic effects, including causing the method to be discontinuous. △ Less

Submitted 3 August, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

Comments: accepted for ECAI 2023

MSC Class: 91B14

arXiv:2005.03627 [pdf, ps, other]

doi 10.1017/bsl.2022.18

Universal Coding and Prediction on Ergodic Martin-Löf Random Points

Authors: Łukasz Dębowski, Tomasz Steifer

Abstract: Suppose that we have a method which estimates the conditional probabilities of some unknown stochastic source and we use it to guess which of the outcomes will happen. We want to make a correct guess as often as it is possible. What estimators are good for this? In this work, we consider estimators given by a familiar notion of universal coding for stationary ergodic measures, while working in the… ▽ More Suppose that we have a method which estimates the conditional probabilities of some unknown stochastic source and we use it to guess which of the outcomes will happen. We want to make a correct guess as often as it is possible. What estimators are good for this? In this work, we consider estimators given by a familiar notion of universal coding for stationary ergodic measures, while working in the framework of algorithmic randomness, i.e, we are particularly interested in prediction of Martin-Löf random points. We outline the general theory and exhibit some counterexamples. Completing a result of Ryabko from 2009 we also show that universal probability measure in the sense of universal coding induces a universal predictor in the prequential sense. Surprisingly, this implication holds true provided the universal measure does not ascribe too low conditional probabilities to individual symbols. As an example, we show that the Prediction by Partial Matching (PPM) measure satisfies this requirement with a large reserve. △ Less

Submitted 5 February, 2021; v1 submitted 7 May, 2020; originally announced May 2020.

Comments: 24 pages. The manuscript significantly improved and extended with respect to the previous version

MSC Class: 94A29; 62M20; 03D32

Journal ref: The Bulletin of Symbolic Logic, vol. 28(2), pp. 387-412, 2022

Showing 1–4 of 4 results for author: Steifer, T