Skip to main content

Showing 1–4 of 4 results for author: Shidani, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2403.05490  [pdf, other

    cs.LG cs.AI cs.CV cs.IT stat.ML

    Poly-View Contrastive Learning

    Authors: Amitis Shidani, Devon Hjelm, Jason Ramapuram, Russ Webb, Eeshan Gunesh Dhekane, Dan Busbridge

    Abstract: Contrastive learning typically matches pairs of related views among a number of unrelated negative views. Views can be generated (e.g. by augmentations) or be observed. We investigate matching when there are more than two related views which we call poly-view tasks, and derive new representation learning objectives using information maximization and sufficient statistics. We show that with unlimit… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted to ICLR 2024. 42 pages, 7 figures, 3 tables, loss pseudo-code included in appendix

  2. arXiv:2312.09674  [pdf, ps, other

    cs.LG cs.MA stat.ML

    Optimal Regret Bounds for Collaborative Learning in Bandits

    Authors: Amitis Shidani, Sattar Vakili

    Abstract: We consider regret minimization in a general collaborative multi-agent multi-armed bandit model, in which each agent faces a finite set of arms and may communicate with other agents through a central controller. The optimal arm for each agent in this model is the arm with the largest expected mixed reward, where the mixed reward of each arm is a weighted average of its rewards across all agents, m… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Algorithmic Learning Theory (ALT) 2024

  3. arXiv:2207.00109  [pdf, other

    stat.ML cs.IR cs.LG math.OC

    Ranking In Generalized Linear Bandits

    Authors: Amitis Shidani, George Deligiannidis, Arnaud Doucet

    Abstract: We study the ranking problem in generalized linear bandits. At each time, the learning agent selects an ordered list of items and observes stochastic outcomes. In recommendation systems, displaying an ordered list of the most attractive items is not always optimal as both position and item dependencies result in a complex reward function. A very naive example is the lack of diversity when all the… ▽ More

    Submitted 1 January, 2024; v1 submitted 30 June, 2022; originally announced July 2022.

    Journal ref: AAAI 2024 Workshop on Recommendation Ecosystems: Modeling, Optimization and Incentive Design

  4. arXiv:2203.00977  [pdf, ps, other

    stat.ML cs.IT cs.LG

    Chained Generalisation Bounds

    Authors: Eugenio Clerico, Amitis Shidani, George Deligiannidis, Arnaud Doucet

    Abstract: This work discusses how to derive upper bounds for the expected generalisation error of supervised learning algorithms by means of the chaining technique. By develo** a general theoretical framework, we establish a duality between generalisation bounds based on the regularity of the loss function, and their chained counterparts, which can be obtained by lifting the regularity assumption from the… ▽ More

    Submitted 30 June, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

    Journal ref: Proceedings of the 35th Conference on Learning Theory, PMLR 178:4212-4257, 2022