Skip to main content

Showing 1–47 of 47 results for author: Gentile, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.02797  [pdf, other

    cs.LG cs.CR

    Auditing Privacy Mechanisms via Label Inference Attacks

    Authors: Róbert István Busa-Fekete, Travis Dick, Claudio Gentile, Andrés Muñoz Medina, Adam Smith, Marika Swanberg

    Abstract: We propose reconstruction advantage measures to audit label privatization mechanisms. A reconstruction advantage measure quantifies the increase in an attacker's ability to infer the true label of an unlabeled example when provided with a private version of the labels in a dataset (e.g., aggregate of labels from different users or noisy labels output by randomized response), compared to an attacke… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:2401.01329  [pdf, other

    eess.SP cs.NI

    Self-Supervised Millimeter Wave Indoor Localization using Tiny Neural Networks

    Authors: Anish Shastri, Steve Blandino, Camillo Gentile, Chieh** Lai, Paolo Casari

    Abstract: The quasi-optical propagation of millimeter-wave signals enables high-accuracy localization algorithms that employ geometric approaches or machine learning models. However, most algorithms require information on the indoor environment, may entail the collection of large training datasets, or bear an infeasible computational burden for commercial off-the-shelf (COTS) devices. In this work, we propo… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: 13 pages, 11 figures

  3. arXiv:2306.15371  [pdf, ps, other

    cs.CR math.OC

    A New Mathematical Optimization-Based Method for the m-invariance Problem

    Authors: Adrian Tobar, Jordi Castro, Claudio Gentile

    Abstract: The issue of ensuring privacy for users who share their personal information has been a growing priority in a business and scientific environment where the use of different types of data and the laws that protect it have increased in tandem. Different technologies have been widely developed for static publications, i.e., where the information is published only once, such as k-anonymity and ε-diffe… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  4. arXiv:2306.04828  [pdf, other

    cs.LG

    Fast and Effective GNN Training with Linearized Random Spanning Trees

    Authors: Francesco Bonchi, Claudio Gentile, Francesco Paolo Nerini, André Panisson, Fabio Vitale

    Abstract: We present a new effective and scalable framework for training GNNs in node classification tasks, based on the effective resistance, a powerful tool solidly rooted in graph theory. Our approach progressively refines the GNN weights on an extensive sequence of random spanning trees, suitably transformed into path graphs that retain essential topological and node information of the original graph. T… ▽ More

    Submitted 14 February, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

  5. arXiv:2306.02869  [pdf, other

    cs.LG cs.AI stat.ML

    Data-Driven Online Model Selection With Regret Guarantees

    Authors: Aldo Pacchiano, Christoph Dann, Claudio Gentile

    Abstract: We consider model selection for sequential decision making in stochastic environments with bandit feedback, where a meta-learner has at its disposal a pool of base learners, and decides on the fly which action to take based on the policies recommended by each base learner. Model selection is performed by regret balancing but, unlike the recent literature on this subject, we do not assume any prior… ▽ More

    Submitted 23 January, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

  6. arXiv:2305.17544  [pdf, ps, other

    cs.LG

    Faster Margin Maximization Rates for Generic and Adversarially Robust Optimization Methods

    Authors: Guanghui Wang, Zihao Hu, Claudio Gentile, Vidya Muthukumar, Jacob Abernethy

    Abstract: First-order optimization methods tend to inherently favor certain solutions over others when minimizing an underdetermined training objective that has multiple global optima. This phenomenon, known as implicit bias, plays a critical role in understanding the generalization capabilities of optimization algorithms. Recent research has revealed that in separable binary classification tasks gradient-d… ▽ More

    Submitted 7 April, 2024; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: Undated version: New results for implicit bias in adversarial training

  7. arXiv:2302.05765  [pdf, other

    cs.LG

    Adversarial Online Collaborative Filtering

    Authors: Stephen Pasteris, Fabio Vitale, Mark Herbster, Claudio Gentile, Andre' Panisson

    Abstract: We investigate the problem of online collaborative filtering under no-repetition constraints, whereby users need to be served content in an online fashion and a given user cannot be recommended the same content item more than once. We start by designing and analyzing an algorithm that works under biclustering assumptions on the user-item preference matrix, and show that this algorithm exhibits an… ▽ More

    Submitted 29 December, 2023; v1 submitted 11 February, 2023; originally announced February 2023.

  8. arXiv:2302.03784  [pdf, ps, other

    cs.LG stat.ML

    Leveraging User-Triggered Supervision in Contextual Bandits

    Authors: Alekh Agarwal, Claudio Gentile, Teodor V. Marinov

    Abstract: We study contextual bandit (CB) problems, where the user can sometimes respond with the best action in a given context. Such an interaction arises, for example, in text prediction or autocompletion settings, where a poor suggestion is simply ignored and the user enters the desired text instead. Crucially, this extra feedback is user-triggered on only a subset of the contexts. We develop a new fram… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

  9. arXiv:2302.03115  [pdf, other

    cs.LG stat.ML

    Easy Learning from Label Proportions

    Authors: Robert Istvan Busa-Fekete, Hee** Choi, Travis Dick, Claudio Gentile, Andres Munoz medina

    Abstract: We consider the problem of Learning from Label Proportions (LLP), a weakly supervised classification setup where instances are grouped into "bags", and only the frequency of class labels at each bag is available. Albeit, the objective of the learner is to achieve low task loss at an individual instance level. Here we propose Easyllp: a flexible and simple-to-implement debiasing approach based on a… ▽ More

    Submitted 13 February, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  10. arXiv:2211.16309  [pdf, other

    cs.RO cs.LG stat.AP

    A Contextual Bandit Approach for Learning to Plan in Environments with Probabilistic Goal Configurations

    Authors: Sohan Rudra, Saksham Goel, Anirban Santara, Claudio Gentile, Laurent Perron, Fei Xia, Vikas Sindhwani, Carolina Parada, Gaurav Aggarwal

    Abstract: Object-goal navigation (Object-nav) entails searching, recognizing and navigating to a target object. Object-nav has been extensively studied by the Embodied-AI community, but most solutions are often restricted to considering static objects (e.g., television, fridge, etc.). We propose a modular framework for object-nav that is able to efficiently search indoor environments for not just static obj… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: Shorter version accepted at NeurIPS 2022 Workshop on Robot Learning: Trustworthy Robotics

  11. arXiv:2206.14912  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Best of Both Worlds Model Selection

    Authors: Aldo Pacchiano, Christoph Dann, Claudio Gentile

    Abstract: We study the problem of model selection in bandit scenarios in the presence of nested policy classes, with the goal of obtaining simultaneous adversarial and stochastic ("best of both worlds") high-probability regret guarantees. Our approach requires that each base learner comes with a candidate regret bound that may or may not hold, while our meta algorithm plays each base learner according to a… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: 10 pages in main, 43 pages appendix

  12. arXiv:2202.05448  [pdf, ps, other

    cs.LG stat.ML

    Fast Rates in Pool-Based Batch Active Learning

    Authors: Claudio Gentile, Zhilei Wang, Tong Zhang

    Abstract: We consider a batch active learning scenario where the learner adaptively issues batches of points to a labeling oracle. Sampling labels in batches is highly desirable in practice due to the smaller number of interactive rounds with the labeling oracle (often human beings). However, batch active learning typically pays the price of a reduced adaptivity, leading to suboptimal results. In this paper… ▽ More

    Submitted 13 June, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: This is an extended version of arXiv:2202.05448v1, which has title "Achieving Minimax Rates in Pool-Based Batch Active Learning" and was accepted by ICML 2022 https://icml.cc/virtual/2022/poster/16505

  13. arXiv:2112.02866  [pdf, ps, other

    cs.LG

    Nonstochastic Bandits with Composite Anonymous Feedback

    Authors: Nicolò Cesa-Bianchi, Tommaso Cesari, Roberto Colomboni, Claudio Gentile, Yishay Mansour

    Abstract: We investigate a nonstochastic bandit setting in which the loss of an action is not immediately charged to the player, but rather spread over the subsequent rounds in an adversarial way. The instantaneous loss observed by the player at the end of each round is then a sum of many loss components of previously played actions. This setting encompasses as a special case the easier task of bandits with… ▽ More

    Submitted 24 September, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

  14. arXiv:2109.06131  [pdf, other

    cs.IT eess.SP

    A Framework for Develo** Algorithms for Estimating Propagation Parameters from Measurements

    Authors: Akbar Sayeed, Peter Vouras, Camillo Gentile, Alec Weiss, Jeanne Quimby, Zihang Cheng, Bassel Modad, Yuning Zhang, Chethan An**appa, Fatih Erden, Ozgur Ozdemir, Robert Muller, Diego Dupleich, Han Niu, 6David Michelson, 6Aidan Hughes

    Abstract: A framework is proposed for develo** and evaluating algorithms for extracting multipath propagation components (MPCs) from measurements collected by sounders at millimeter-wave (mmW) frequencies. To focus on algorithmic performance, an idealized model is proposed for the spatial frequency response of the propagation environment measured by a sounder. The input to the sounder model is a pre-deter… ▽ More

    Submitted 13 September, 2021; originally announced September 2021.

    Journal ref: IEEE Globecom 2020

  15. arXiv:2107.14263  [pdf, other

    cs.LG cs.AI

    Batch Active Learning at Scale

    Authors: Gui Citovsky, Giulia DeSalvo, Claudio Gentile, Lazaros Karydas, Anand Rajagopalan, Afshin Rostamizadeh, Sanjiv Kumar

    Abstract: The ability to train complex and highly effective models often requires an abundance of training data, which can easily become a bottleneck in cost, time, and computational resources. Batch active learning, which adaptively issues batched queries to a labeling oracle, is a common approach for addressing this problem. The practical benefits of batch sampling come with the downside of less adaptivit… ▽ More

    Submitted 29 July, 2021; originally announced July 2021.

  16. arXiv:2107.05745  [pdf, ps, other

    cs.LG stat.ML

    Adapting to Misspecification in Contextual Bandits

    Authors: Dylan J. Foster, Claudio Gentile, Mehryar Mohri, Julian Zimmert

    Abstract: A major research direction in contextual bandits is to develop algorithms that are computationally efficient, yet support flexible, general-purpose function approximation. Algorithms based on modeling rewards have shown strong empirical performance, but typically require a well-specified model, and can fail when this assumption does not hold. Can we design algorithms that are efficient and flexibl… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

    Comments: Appeared at NeurIPS 2020

  17. arXiv:2106.03546  [pdf, other

    cs.LG cs.AI

    On Learning to Rank Long Sequences with Contextual Bandits

    Authors: Anirban Santara, Claudio Gentile, Gaurav Aggarwal, Shuai Li

    Abstract: Motivated by problems of learning to rank long item sequences, we introduce a variant of the cascading bandit model that considers flexible length sequences with varying rewards and losses. We formulate two generative models for this problem within the generalized linear setting, and design and analyze upper confidence algorithms for it. Our analysis delivers tight regret bounds which, when specia… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Report number: PMLR 151:767-797

    Journal ref: Proceedings of The 25th International Conference on Artificial Intelligence and Statistics, PMLR 151:767-797, 2022

  18. arXiv:2106.03243  [pdf, ps, other

    cs.LG

    Neural Active Learning with Performance Guarantees

    Authors: Pranjal Awasthi, Christoph Dann, Claudio Gentile, Ayush Sekhari, Zhilei Wang

    Abstract: We investigate the problem of active learning in the streaming setting in non-parametric regimes, where the labels are stochastically generated from a class of functions on which we make no assumptions whatsoever. We rely on recently proposed Neural Tangent Kernel (NTK) approximation tools to construct a suitable neural embedding that determines the feature space the algorithm operates on and the… ▽ More

    Submitted 6 June, 2021; originally announced June 2021.

    Comments: 30 pages

  19. arXiv:2012.13045  [pdf, ps, other

    cs.LG cs.AI stat.ML stat.OT

    Regret Bound Balancing and Elimination for Model Selection in Bandits and RL

    Authors: Aldo Pacchiano, Christoph Dann, Claudio Gentile, Peter Bartlett

    Abstract: We propose a simple model selection approach for algorithms in stochastic bandit and reinforcement learning problems. As opposed to prior work that (implicitly) assumes knowledge of the optimal regret, we only require that each base algorithm comes with a candidate regret bound that may or may not hold during all rounds. In each round, our approach plays a base algorithm to keep the candidate regr… ▽ More

    Submitted 23 December, 2020; originally announced December 2020.

    Comments: 57 pages

  20. arXiv:2012.03522  [pdf, ps, other

    stat.ML cs.LG

    Online Model Selection: a Rested Bandit Formulation

    Authors: Leonardo Cella, Claudio Gentile, Massimiliano Pontil

    Abstract: Motivated by a natural problem in online model selection with bandit information, we introduce and analyze a best arm identification problem in the rested bandit setting, wherein arm expected losses decrease with the number of times the arm has been played. The shape of the expected loss functions is similar across arms, and is assumed to be available up to unknown parameters that have to be learn… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

  21. Quasi-Deterministic Channel Model for mmWaves: Mathematical Formalization and Validation

    Authors: Mattia Lecci, Michele Polese, Chieh** Lai, Jian Wang, Camillo Gentile, Nada Golmie, Michele Zorzi

    Abstract: 5G and beyond networks will use, for the first time ever, the millimeter wave (mmWave) spectrum for mobile communications. Accurate performance evaluation is fundamental to the design of reliable mmWave networks, with accuracy rooted in the fidelity of the channel models. At mmWaves, the model must account for the spatial characteristics of propagation since networks will employ highly directional… ▽ More

    Submitted 9 February, 2021; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: 6 pages, 5 figures, 1 table, presented at IEEE GLOBECOM 2020. Please cite it as: M. Lecci, M. Polese, C. Lai, J. Wang, C. Gentile, N. Golmie, M. Zorzi, "Quasi-Deterministic Channel Model for mmWaves: Mathematical Formalization and Validation," IEEE Global Communications Conference (GLOBECOM), Dec. 2020, Taipei, Taiwan

  22. Simplified Ray Tracing for the Millimeter Wave Channel: A Performance Evaluation

    Authors: Mattia Lecci, Paolo Testolina, Marco Giordani, Michele Polese, Tanguy Ropitault, Camillo Gentile, Neeraj Varshney, Anuraag Bodi, Michele Zorzi

    Abstract: Millimeter-wave (mmWave) communication is one of the cornerstone innovations of fifth-generation (5G) wireless networks, thanks to the massive bandwidth available in these frequency bands. To correctly assess the performance of such systems, however, it is essential to have reliable channel models, based on a deep understanding of the propagation characteristics of the mmWave signal. In this respe… ▽ More

    Submitted 21 February, 2020; originally announced February 2020.

    Comments: 6 pages, 6 figures, 1 table. This paper has been accepted for presentation at ITA 2020. (c) 2020 IEEE. Please cite it as: M. Lecci, P. Testolina, M. Giordani, M. Polese, T. Ropitault, C. Gentile, N. Varshney, A. Bodi, M. Zorzi, "Simplified Ray Tracing for the Millimeter Wave Channel: A Performance Evaluation," Information Theory and Applications Workshop (ITA), San Diego, US, 2020

  23. arXiv:2002.07348  [pdf, other

    cs.LG stat.ML

    Adaptive Region-Based Active Learning

    Authors: Corinna Cortes, Giulia DeSalvo, Claudio Gentile, Mehryar Mohri, Ningshan Zhang

    Abstract: We present a new active learning algorithm that adaptively partitions the input space into a finite number of regions, and subsequently seeks a distinct predictor for each region, both phases actively requesting labels. We prove theoretical guarantees for both the generalization error and the label complexity of our algorithm, and analyze the number of regions defined by the algorithm under some m… ▽ More

    Submitted 17 February, 2020; originally announced February 2020.

  24. arXiv:1906.09458  [pdf, other

    cs.LG stat.ML

    Flattening a Hierarchical Clustering through Active Learning

    Authors: Fabio Vitale, Anand Rajagopalan, Claudio Gentile

    Abstract: We investigate active learning by pairwise similarity over the leaves of trees originating from hierarchical clustering procedures. In the realizable setting, we provide a full characterization of the number of queries needed to achieve perfect reconstruction of the tree cut. In the non-realizable setting, we rely on known important-sampling procedures to obtain regret and query complexity bounds.… ▽ More

    Submitted 12 October, 2019; v1 submitted 22 June, 2019; originally announced June 2019.

  25. arXiv:1806.01182  [pdf, other

    cs.LG stat.ML

    Online Reciprocal Recommendation with Theoretical Performance Guarantees

    Authors: Fabio Vitale, Nikos Parotsidis, Claudio Gentile

    Abstract: A reciprocal recommendation problem is one where the goal of learning is not just to predict a user's preference towards a passive item (e.g., a book), but to recommend the targeted user on one side another user from the other side such that a mutual interest between the two exists. The problem thus is sharply different from the more traditional items-to-users recommendation, since a good match re… ▽ More

    Submitted 4 June, 2018; originally announced June 2018.

  26. arXiv:1706.06474  [pdf, other

    cs.LG

    On Pairwise Clustering with Side Information

    Authors: Stephen Pasteris, Fabio Vitale, Claudio Gentile, Mark Herbster

    Abstract: Pairwise clustering, in general, partitions a set of items via a known similarity function. In our treatment, clustering is modeled as a transductive prediction problem. Thus rather than beginning with a known similarity function, the function instead is hidden and the learner only receives a random sample consisting of a subset of the pairwise similarities. An additional set of pairwise side-info… ▽ More

    Submitted 18 June, 2017; originally announced June 2017.

  27. arXiv:1705.10257  [pdf, ps, other

    cs.LG stat.ML

    Boltzmann Exploration Done Right

    Authors: Nicolò Cesa-Bianchi, Claudio Gentile, Gábor Lugosi, Gergely Neu

    Abstract: Boltzmann exploration is a classic strategy for sequential decision-making under uncertainty, and is one of the most standard tools in Reinforcement Learning (RL). Despite its widespread use, there is virtually no theoretical understanding about the limitations or the actual benefits of this exploration scheme. Does it drive exploration in a meaningful way? Is it prone to misidentifying the optima… ▽ More

    Submitted 7 November, 2017; v1 submitted 29 May, 2017; originally announced May 2017.

  28. arXiv:1703.03478  [pdf, other

    cs.LG

    Online Learning with Abstention

    Authors: Corinna Cortes, Giulia DeSalvo, Claudio Gentile, Mehryar Mohri, Scott Yang

    Abstract: We present an extensive study of the key problem of online learning where algorithms are allowed to abstain from making predictions. In the adversarial setting, we show how existing online algorithms and guarantees can be adapted to this problem. In the stochastic setting, we first point out a bias problem that limits the straightforward extension of algorithms such as UCB-N to time-varying feedba… ▽ More

    Submitted 14 November, 2019; v1 submitted 9 March, 2017; originally announced March 2017.

  29. arXiv:1702.08211  [pdf, ps, other

    stat.ML cs.LG math.ST

    Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning

    Authors: Nicolò Cesa-Bianchi, Pierre Gaillard, Claudio Gentile, Sébastien Gerchinovitz

    Abstract: We investigate contextual online learning with nonparametric (Lipschitz) comparison classes under different assumptions on losses and feedback information. For full information feedback and Lipschitz losses, we design the first explicit algorithm achieving the minimax regret rate (up to log factors). In a partial feedback model motivated by second-price auctions, we obtain algorithms for Lipschitz… ▽ More

    Submitted 30 June, 2017; v1 submitted 27 February, 2017; originally announced February 2017.

    Comments: This document is the full version of an extended abstract accepted for presentation at COLT 2017

  30. arXiv:1608.03544  [pdf, other

    cs.LG cs.AI cs.IR stat.ML

    On Context-Dependent Clustering of Bandits

    Authors: Claudio Gentile, Shuai Li, Purushottam Kar, Alexandros Karatzoglou, Evans Etrue, Giovanni Zappella

    Abstract: We investigate a novel cluster-of-bandit algorithm CAB for collaborative recommendation tasks that implements the underlying feedback sharing mechanism by estimating the neighborhood of users in a context-dependent manner. CAB makes sharp departures from the state of the art by incorporating collaborative effects into inference as well as learning processes in a manner that seamlessly interleaving… ▽ More

    Submitted 27 February, 2017; v1 submitted 6 August, 2016; originally announced August 2016.

  31. arXiv:1606.00182  [pdf, other

    cs.LG cs.SI

    On the Troll-Trust Model for Edge Sign Prediction in Social Networks

    Authors: Géraud Le Falher, Nicolò Cesa-Bianchi, Claudio Gentile, Fabio Vitale

    Abstract: In the problem of edge sign prediction, we are given a directed graph (representing a social network), and our task is to predict the binary labels of the edges (i.e., the positive or negative nature of the social relationships). Many successful heuristics for this problem are based on the troll-trust features, estimating at each node the fraction of outgoing and incoming positive/negative edges.… ▽ More

    Submitted 28 February, 2017; v1 submitted 1 June, 2016; originally announced June 2016.

    Comments: v5: accepted to AISTATS 2017

  32. arXiv:1605.00596  [pdf, other

    stat.ML cs.AI cs.IR cs.LG

    Graph Clustering Bandits for Recommendation

    Authors: Shuai Li, Claudio Gentile, Alexandros Karatzoglou

    Abstract: We investigate an efficient context-dependent clustering technique for recommender systems based on exploration-exploitation strategies through multi-armed bandits over multiple users. Our algorithm dynamically groups users based on their observed behavioral similarity during a sequence of logged activities. In doing so, the algorithm reacts to the currently served user by sha** clusters around… ▽ More

    Submitted 2 May, 2016; originally announced May 2016.

  33. arXiv:1602.04741  [pdf, other

    cs.LG

    Delay and Cooperation in Nonstochastic Bandits

    Authors: Nicolo' Cesa-Bianchi, Claudio Gentile, Yishay Mansour, Alberto Minora

    Abstract: We study networks of communicating learning agents that cooperate to solve a common nonstochastic bandit problem. Agents use an underlying communication network to get messages about actions selected by other agents, and drop messages that took more than $d$ hops to arrive, where $d$ is a delay parameter. We introduce \textsc{Exp3-Coop}, a cooperative version of the {\sc Exp3} algorithm and prove… ▽ More

    Submitted 1 June, 2016; v1 submitted 15 February, 2016; originally announced February 2016.

    Comments: 30 pages

  34. arXiv:1502.03473  [pdf, other

    cs.LG cs.AI stat.ML

    Collaborative Filtering Bandits

    Authors: Shuai Li, Alexandros Karatzoglou, Claudio Gentile

    Abstract: Classical collaborative filtering, and content-based filtering methods try to learn a static recommendation model given training data. These approaches are far from ideal in highly dynamic recommendation domains such as news recommendation and computational advertisement, where the set of items and users is very fluid. In this work, we investigate an adaptive clustering technique for content recom… ▽ More

    Submitted 31 May, 2016; v1 submitted 11 February, 2015; originally announced February 2015.

    Comments: The 39th SIGIR (SIGIR 2016)

  35. arXiv:1409.8428  [pdf, other

    cs.LG stat.ML

    Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback

    Authors: Noga Alon, Nicolò Cesa-Bianchi, Claudio Gentile, Shie Mannor, Yishay Mansour, Ohad Shamir

    Abstract: We present and study a partial-information model of online learning, where a decision maker repeatedly chooses from a finite set of actions, and observes some subset of the associated losses. This naturally models several situations where the losses of different actions are related, and knowing the loss of one action provides information on the loss of other actions. Moreover, it generalizes and i… ▽ More

    Submitted 30 September, 2014; originally announced September 2014.

    Comments: Preliminary versions of parts of this paper appeared in [1,20], and also as arXiv papers arXiv:1106.2436 and arXiv:1307.4564

  36. arXiv:1401.8257  [pdf, other

    cs.LG stat.ML

    Online Clustering of Bandits

    Authors: Claudio Gentile, Shuai Li, Giovanni Zappella

    Abstract: We introduce a novel algorithmic approach to content recommendation based on adaptive clustering of exploration-exploitation ("bandit") strategies. We provide a sharp regret analysis of this algorithm in a standard stochastic noise setting, demonstrate its scalability properties, and prove its effectiveness on a number of artificial and real-world datasets. Our experiments show a significant incre… ▽ More

    Submitted 6 June, 2014; v1 submitted 31 January, 2014; originally announced January 2014.

    Comments: In E. Xing and T. Jebara (Eds.), Proceedings of 31st International Conference on Machine Learning, Journal of Machine Learning Research Workshop and Conference Proceedings, Vol.32 (JMLR W&CP-32), Bei**g, China, Jun. 21-26, 2014 (ICML 2014), Submitted by Shuai Li (https://sites.google.com/site/shuailidotsli)

  37. arXiv:1307.4564  [pdf, ps, other

    cs.LG stat.ML

    From Bandits to Experts: A Tale of Domination and Independence

    Authors: Noga Alon, Nicolò Cesa-Bianchi, Claudio Gentile, Yishay Mansour

    Abstract: We consider the partial observability model for multi-armed bandits, introduced by Mannor and Shamir. Our main result is a characterization of regret in the directed observability model in terms of the dominating and independence numbers of the observability graph. We also show that in the undirected case, the learner can achieve optimal regret without even accessing the observability graph before… ▽ More

    Submitted 17 July, 2013; originally announced July 2013.

  38. arXiv:1306.0811  [pdf, other

    cs.LG cs.SI stat.ML

    A Gang of Bandits

    Authors: Nicolò Cesa-Bianchi, Claudio Gentile, Giovanni Zappella

    Abstract: Multi-armed bandit problems are receiving a great deal of attention because they adequately formalize the exploration-exploitation trade-offs arising in several industrially relevant applications, such as online advertisement and, more generally, recommendation systems. In many cases, however, these applications have a strong social component, whose integration in the bandit algorithm could lead t… ▽ More

    Submitted 4 November, 2013; v1 submitted 4 June, 2013; originally announced June 2013.

    Comments: NIPS 2013

  39. arXiv:1302.7263  [pdf, other

    cs.LG

    Online Similarity Prediction of Networked Data from Known and Unknown Graphs

    Authors: Claudio Gentile, Mark Herbster, Stephen Pasteris

    Abstract: We consider online similarity prediction problems over networked data. We begin by relating this task to the more standard class prediction problem, showing that, given an arbitrary algorithm for class prediction, we can construct an algorithm for similarity prediction with "nearly" the same mistake bound, and vice versa. After noticing that this general construction is computationally infeasible,… ▽ More

    Submitted 15 March, 2013; v1 submitted 28 February, 2013; originally announced February 2013.

  40. arXiv:1301.5160  [pdf, other

    cs.LG

    See the Tree Through the Lines: The Shazoo Algorithm -- Full Version --

    Authors: Fabio Vitale, Nicolo Cesa-Bianchi, Claudio Gentile, Giovanni Zappella

    Abstract: Predicting the nodes of a given graph is a fascinating theoretical problem with applications in several domains. Since graph sparsification via spanning trees retains enough information while making the task much easier, trees are an important special case of this problem. Although it is known how to predict the nodes of an unweighted tree in a nearly optimal way, in the weighted case a fully sati… ▽ More

    Submitted 28 February, 2013; v1 submitted 22 January, 2013; originally announced January 2013.

  41. arXiv:1301.5112  [pdf, ps, other

    cs.LG stat.ML

    Active Learning on Trees and Graphs

    Authors: Nicolo Cesa-Bianchi, Claudio Gentile, Fabio Vitale, Giovanni Zappella

    Abstract: We investigate the problem of active learning on a given tree whose nodes are assigned binary labels in an adversarial way. Inspired by recent results by Guillory and Bilmes, we characterize (up to constant factors) the optimal placement of queries so to minimize the mistakes made on the non-queried nodes. Our query selection algorithm is extremely efficient, and the optimal number of mistakes on… ▽ More

    Submitted 22 January, 2013; originally announced January 2013.

  42. arXiv:1301.4769  [pdf, other

    cs.LG cs.DS stat.ML

    A Correlation Clustering Approach to Link Classification in Signed Networks -- Full Version --

    Authors: Nicolo Cesa-Bianchi, Claudio Gentile, Fabio Vitale, Giovanni Zappella

    Abstract: Motivated by social balance theory, we develop a theory of link classification in signed networks using the correlation clustering index as measure of label regularity. We derive learning bounds in terms of correlation clustering within three fundamental transductive learning settings: online, batch and active. Our main algorithmic contribution is in the active setting, where we introduce a new fa… ▽ More

    Submitted 28 February, 2013; v1 submitted 21 January, 2013; originally announced January 2013.

  43. arXiv:1301.4767  [pdf, other

    cs.LG cs.SI stat.ML

    A Linear Time Active Learning Algorithm for Link Classification -- Full Version --

    Authors: Nicolo Cesa-Bianchi, Claudio Gentile, Fabio Vitale, Giovanni Zappella

    Abstract: We present very efficient active learning algorithms for link classification in signed networks. Our algorithms are motivated by a stochastic model in which edge labels are obtained through perturbations of a initial sign assignment consistent with a two-clustering of the nodes. We provide a theoretical analysis within this model, showing that we can achieve an optimal (to whithin a constant facto… ▽ More

    Submitted 28 February, 2013; v1 submitted 21 January, 2013; originally announced January 2013.

  44. arXiv:1212.5637  [pdf, other

    cs.LG stat.ML

    Random Spanning Trees and the Prediction of Weighted Graphs

    Authors: Nicolo' Cesa-Bianchi, Claudio Gentile, Fabio Vitale, Giovanni Zappella

    Abstract: We investigate the problem of sequentially predicting the binary labels on the nodes of an arbitrary weighted graph. We show that, under a suitable parametrization of the problem, the optimal number of prediction mistakes can be characterized (up to logarithmic factors) by the cutsize of a random spanning tree of the graph. The cutsize is induced by the unknown adversarial labeling of the graph no… ▽ More

    Submitted 21 December, 2012; originally announced December 2012.

    Comments: Appeared in ICML 2010

  45. arXiv:1207.0166  [pdf, other

    cs.LG

    On Multilabel Classification and Ranking with Partial Feedback

    Authors: Claudio Gentile, Francesco Orabona

    Abstract: We present a novel multilabel/ranking algorithm working in partial information settings. The algorithm is based on 2nd-order descent methods, and relies on upper-confidence bounds to trade-off exploration and exploitation. We analyze this algorithm in a partial adversarial setting, where covariates can be adversarial, but multilabel probabilities are ruled by (generalized) linear models. We show O… ▽ More

    Submitted 16 January, 2013; v1 submitted 30 June, 2012; originally announced July 2012.

  46. arXiv:1109.2296  [pdf, other

    cs.LG

    Bandits with an Edge

    Authors: Dotan Di Castro, Claudio Gentile, Shie Mannor

    Abstract: We consider a bandit problem over a graph where the rewards are not directly observed. Instead, the decision maker can compare two nodes and receive (stochastic) information pertaining to the difference in their value. The graph structure describes the set of possible comparisons. Consequently, comparing between two nodes that are relatively far requires estimating the difference between every pai… ▽ More

    Submitted 11 September, 2011; originally announced September 2011.

  47. arXiv:1105.2550   

    cs.LG

    A Maximal Large Deviation Inequality for Sub-Gaussian Variables

    Authors: Dotan Di Castro, Claudio Gentile, Shie Mannor

    Abstract: In this short note we prove a maximal concentration lemma for sub-Gaussian random variables stating that for independent sub-Gaussian random variables we have \[P<(\max_{1\le i\le N}S_{i}>ε>) \le\exp<(-\frac{1}{N^2}\sum_{i=1}^{N}\frac{ε^{2}}{2σ_{i}^{2}}>), \] where $S_i$ is the sum of $i$ zero mean independent sub-Gaussian random variables and $σ_i$ is the variance of the $i$th random variable.

    Submitted 25 July, 2011; v1 submitted 12 May, 2011; originally announced May 2011.

    Comments: This paper has been withdrawn by the authors due to a crucial error in the last sentence of the proof of Theorem 1: "we can take the infimum of the r.h.s. over s, which yields (1)." This statement is only true if a single value of s yields the supremum of (ε_i s - ρ_i(s)) simultaneously for every i