Skip to main content

Showing 1–23 of 23 results for author: Katariya, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00091  [pdf, other

    cs.IR cs.HC cs.LG

    Learning to Rank for Maps at Airbnb

    Authors: Malay Haldar, Hongwei Zhang, Kedar Bellare, Sherry Chen, Soumyadip Banerjee, Xiaotang Wang, Mustafa Abdool, Huiji Gao, Pavan Tapadia, Liwei He, Sanjeev Katariya

    Abstract: As a two-sided marketplace, Airbnb brings together hosts who own listings for rent with prospective guests from around the globe. Results from a guest's search for listings are displayed primarily through two interfaces: (1) as a list of rectangular cards that contain on them the listing image, price, rating, and other details, referred to as list-results (2) as oval pins on a map showing the list… ▽ More

    Submitted 25 June, 2024; originally announced July 2024.

  2. arXiv:2306.09136  [pdf, other

    cs.LG stat.ML

    Finite-Time Logarithmic Bayes Regret Upper Bounds

    Authors: Alexia Atsidakou, Branislav Kveton, Sumeet Katariya, Constantine Caramanis, Sujay Sanghavi

    Abstract: We derive the first finite-time logarithmic Bayes regret upper bounds for Bayesian bandits. In a multi-armed bandit, we obtain $O(c_Δ\log n)$ and $O(c_h \log^2 n)$ upper bounds for an upper confidence bound algorithm, where $c_h$ and $c_Δ$ are constants depending on the prior distribution and the gaps of bandit instances sampled from it, respectively. The latter bound asymptotically matches the lo… ▽ More

    Submitted 21 January, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

  3. arXiv:2305.18431  [pdf, other

    cs.IR cs.AI cs.LG

    Optimizing Airbnb Search Journey with Multi-task Learning

    Authors: Chun How Tan, Austin Chan, Malay Haldar, Jie Tang, Xin Liu, Mustafa Abdool, Huiji Gao, Liwei He, Sanjeev Katariya

    Abstract: At Airbnb, an online marketplace for stays and experiences, guests often spend weeks exploring and comparing multiple items before making a final reservation request. Each reservation request may then potentially be rejected or cancelled by the host prior to check-in. The long and exploratory nature of the search journey, as well as the need to balance both guest and host preferences, present uniq… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: Search Ranking, Recommender Systems, User Search Journey, Multi-task learning, Two-sided marketplace

  4. arXiv:2302.00284  [pdf, other

    cs.LG cs.AI

    Selective Uncertainty Propagation in Offline RL

    Authors: Sanath Kumar Krishnamurthy, Shrey Modi, Tanmay Gangwani, Sumeet Katariya, Branislav Kveton, Anshuka Rangi

    Abstract: We consider the finite-horizon offline reinforcement learning (RL) setting, and are motivated by the challenge of learning the policy at any step h in dynamic programming (DP) algorithms. To learn this, it is sufficient to evaluate the treatment effect of deviating from the behavioral policy at step h after having optimized the policy for all future steps. Since the policy at any step can affect n… ▽ More

    Submitted 12 February, 2024; v1 submitted 1 February, 2023; originally announced February 2023.

  5. arXiv:2212.04720  [pdf, other

    cs.LG cs.AI

    Multi-Task Off-Policy Learning from Bandit Feedback

    Authors: Joey Hong, Branislav Kveton, Sumeet Katariya, Manzil Zaheer, Mohammad Ghavamzadeh

    Abstract: Many practical applications, such as recommender systems and learning to rank, involve solving multiple similar tasks. One example is learning of recommendation policies for users with similar movie preferences, where the users may still rank the individual movies slightly differently. Such tasks can be organized in a hierarchy, where similar tasks are related through a shared structure. In this w… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

    Comments: 14 pages, 3 figures

  6. arXiv:2211.08572  [pdf, other

    cs.LG stat.ML

    Bayesian Fixed-Budget Best-Arm Identification

    Authors: Alexia Atsidakou, Sumeet Katariya, Sujay Sanghavi, Branislav Kveton

    Abstract: Fixed-budget best-arm identification (BAI) is a bandit problem where the agent maximizes the probability of identifying the optimal arm within a fixed budget of observations. In this work, we study this problem in the Bayesian setting. We propose a Bayesian elimination algorithm and derive an upper bound on its probability of misidentifying the optimal arm. The bound reflects the quality of the pr… ▽ More

    Submitted 15 June, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

  7. arXiv:2210.07774  [pdf, other

    cs.IR cs.AI cs.LG

    Learning To Rank Diversely At Airbnb

    Authors: Malay Haldar, Mustafa Abdool, Liwei He, Dillon Davis, Huiji Gao, Sanjeev Katariya

    Abstract: Airbnb is a two-sided marketplace, bringing together hosts who own listings for rent, with prospective guests from around the globe. Applying neural network-based learning to rank techniques has led to significant improvements in matching guests with hosts. These improvements in ranking were driven by a core strategy: order the listings by their estimated booking probabilities, then iterate on tec… ▽ More

    Submitted 8 August, 2023; v1 submitted 19 September, 2022; originally announced October 2022.

    Comments: Search ranking, Diversity, e-commerce

    MSC Class: 68T07 ACM Class: H.3.3

  8. arXiv:2205.15124  [pdf, other

    cs.LG stat.ML

    Mixed-Effect Thompson Sampling

    Authors: Imad Aouali, Branislav Kveton, Sumeet Katariya

    Abstract: A contextual bandit is a popular framework for online learning to act under uncertainty. In practice, the number of actions is huge and their expected rewards are correlated. In this work, we introduce a general framework for capturing such correlations through a mixed-effect model where actions are related through multiple shared effect parameters. To explore efficiently using this structure, we… ▽ More

    Submitted 5 March, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

  9. arXiv:2202.12888  [pdf, other

    cs.LG cs.AI stat.ML

    Meta-Learning for Simple Regret Minimization

    Authors: Mohammadjavad Azizi, Branislav Kveton, Mohammad Ghavamzadeh, Sumeet Katariya

    Abstract: We develop a meta-learning framework for simple regret minimization in bandits. In this framework, a learning agent interacts with a sequence of bandit tasks, which are sampled i.i.d.\ from an unknown prior distribution, and learns its meta-parameters to perform better on future tasks. We propose the first Bayesian and frequentist meta-learning algorithms for this setting. The Bayesian algorithm h… ▽ More

    Submitted 4 July, 2023; v1 submitted 25 February, 2022; originally announced February 2022.

  10. arXiv:2202.08335  [pdf, other

    cs.LG

    Task-Agnostic Graph Explanations

    Authors: Yaochen Xie, Sumeet Katariya, Xianfeng Tang, Edward Huang, Nikhil Rao, Karthik Subbian, Shuiwang Ji

    Abstract: Graph Neural Networks (GNNs) have emerged as powerful tools to encode graph-structured data. Due to their broad applications, there is an increasing need to develop tools to explain how GNNs make decisions given graph-structured data. Existing learning-based GNN explanation approaches are task-specific in training and hence suffer from crucial drawbacks. Specifically, they are incapable of produci… ▽ More

    Submitted 23 September, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: Accepted by NeurIPS 2022

  11. arXiv:2202.01454  [pdf, other

    cs.LG stat.ML

    Deep Hierarchy in Bandits

    Authors: Joey Hong, Branislav Kveton, Sumeet Katariya, Manzil Zaheer, Mohammad Ghavamzadeh

    Abstract: Mean rewards of actions are often correlated. The form of these correlations may be complex and unknown a priori, such as the preferences of a user for recommended products and their categories. To maximize statistical efficiency, it is important to leverage these correlations when learning. We formulate a bandit variant of this problem where the correlations of mean action rewards are represented… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

  12. arXiv:2111.04840  [pdf, other

    cs.LG

    Cold Brew: Distilling Graph Node Representations with Incomplete or Missing Neighborhoods

    Authors: Wenqing Zheng, Edward W Huang, Nikhil Rao, Sumeet Katariya, Zhangyang Wang, Karthik Subbian

    Abstract: Graph Neural Networks (GNNs) have achieved state-of-the-art performance in node classification, regression, and recommendation tasks. GNNs work well when rich and high-quality connections are available. However, their effectiveness is often jeopardized in many real-world graphs in which node degrees have power-law distributions. The extreme case of this situation, where a node may have no neighbor… ▽ More

    Submitted 13 March, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

    Comments: Published as a conference paper in ICLR 2022

  13. arXiv:2110.13522  [pdf, other

    cs.LG cs.CL cs.IR

    Probabilistic Entity Representation Model for Reasoning over Knowledge Graphs

    Authors: Nurendra Choudhary, Nikhil Rao, Sumeet Katariya, Karthik Subbian, Chandan K. Reddy

    Abstract: Logical reasoning over Knowledge Graphs (KGs) is a fundamental technique that can provide efficient querying mechanism over large and incomplete databases. Current approaches employ spatial geometries such as boxes to learn query representations that encompass the answer entities and model the logical operations of projection and intersection. However, their geometry is restrictive and leads to no… ▽ More

    Submitted 30 October, 2021; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted at Thirty-fifth Conference on Neural Information Processing Systems 2021 (NeurIPS '21)

  14. arXiv:2104.05294  [pdf, other

    cs.LG

    Pure Exploration with Structured Preference Feedback

    Authors: Shubham Gupta, Aadirupa Saha, Sumeet Katariya

    Abstract: We consider the problem of pure exploration with subset-wise preference feedback, which contains $N$ arms with features. The learner is allowed to query subsets of size $K$ and receives feedback in the form of a noisy winner. The goal of the learner is to identify the best arm efficiently using as few queries as possible. This setting is relevant in various online decision-making scenarios involvi… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

  15. arXiv:2012.13023  [pdf, other

    cs.LG cs.CL cs.IR

    Self-Supervised Hyperboloid Representations from Logical Queries over Knowledge Graphs

    Authors: Nurendra Choudhary, Nikhil Rao, Sumeet Katariya, Karthik Subbian, Chandan K. Reddy

    Abstract: Knowledge Graphs (KGs) are ubiquitous structures for information storagein several real-world applications such as web search, e-commerce, social networks, and biology. Querying KGs remains a foundational and challenging problem due to their size and complexity. Promising approaches to tackle this problem include embedding the KG units (e.g., entities and relations) in a Euclidean space such that… ▽ More

    Submitted 12 May, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

    Comments: Accepted at the Web Conference 2021 (WWW '21)

  16. arXiv:2009.09988  [pdf, other

    stat.ML cs.LG

    Robust Outlier Arm Identification

    Authors: Yinglun Zhu, Sumeet Katariya, Robert Nowak

    Abstract: We study the problem of Robust Outlier Arm Identification (ROAI), where the goal is to identify arms whose expected rewards deviate substantially from the majority, by adaptively sampling from their reward distributions. We compute the outlier threshold using the median and median absolute deviation of the expected rewards. This is a robust choice for the threshold compared to using the mean and s… ▽ More

    Submitted 21 September, 2020; originally announced September 2020.

    Comments: Full version of our ICML 2020 paper

  17. arXiv:1906.00547  [pdf, other

    stat.ML cs.LG

    MaxGap Bandit: Adaptive Algorithms for Approximate Ranking

    Authors: Sumeet Katariya, Ardhendu Tripathy, Robert Nowak

    Abstract: This paper studies the problem of adaptively sampling from K distributions (arms) in order to identify the largest gap between any two adjacent means. We call this the MaxGap-bandit problem. This problem arises naturally in approximate ranking, noisy sorting, outlier detection, and top-arm identification in bandits. The key novelty of the MaxGap-bandit problem is that it aims to adaptively determi… ▽ More

    Submitted 2 June, 2019; originally announced June 2019.

  18. arXiv:1806.00892  [pdf, other

    stat.ML cs.LG

    Conservative Exploration using Interleaving

    Authors: Sumeet Katariya, Branislav Kveton, Zheng Wen, Vamsi K. Potluru

    Abstract: In many practical problems, a learning agent may want to learn the best action in hindsight without ever taking a bad action, which is significantly worse than the default production action. In general, this is impossible because the agent has to explore unknown actions, some of which can be bad, to learn better actions. However, when the actions are combinatorial, this may be possible if the unkn… ▽ More

    Submitted 3 June, 2018; originally announced June 2018.

  19. arXiv:1802.07176  [pdf, other

    cs.LG stat.ML

    Adaptive Sampling for Coarse Ranking

    Authors: Sumeet Katariya, Lalit Jain, Nandana Sengupta, James Evans, Robert Nowak

    Abstract: We consider the problem of active coarse ranking, where the goal is to sort items according to their means into clusters of pre-specified sizes, by adaptively sampling from their reward distributions. This setting is useful in many social science applications involving human raters and the approximate rank of every item is desired. Approximate or coarse ranking can significantly reduce the number… ▽ More

    Submitted 20 February, 2018; originally announced February 2018.

    Comments: Accepted at AISTATS 2018

  20. arXiv:1703.06513  [pdf, other

    cs.LG stat.ML

    Bernoulli Rank-$1$ Bandits for Click Feedback

    Authors: Sumeet Katariya, Branislav Kveton, Csaba Szepesvári, Claire Vernade, Zheng Wen

    Abstract: The probability that a user will click a search result depends both on its relevance and its position on the results page. The position based model explains this behavior by ascribing to every item an attraction probability, and to every position an examination probability. To be clicked, a result must be both attractive and examined. The probabilities of an item-position pair being clicked thus f… ▽ More

    Submitted 19 March, 2017; originally announced March 2017.

  21. arXiv:1608.03023  [pdf, other

    cs.LG stat.ML

    Stochastic Rank-1 Bandits

    Authors: Sumeet Katariya, Branislav Kveton, Csaba Szepesvari, Claire Vernade, Zheng Wen

    Abstract: We propose stochastic rank-$1$ bandits, a class of online learning problems where at each step a learning agent chooses a pair of row and column arms, and receives the product of their values as a reward. The main challenge of the problem is that the individual values of the row and column are unobserved. We assume that these values are stochastic and drawn independently. We propose a computationa… ▽ More

    Submitted 8 March, 2017; v1 submitted 9 August, 2016; originally announced August 2016.

    Comments: Proceedings of the 20th International Conference on Artificial Intelligence and Statistics

  22. arXiv:1602.03146  [pdf, other

    cs.LG stat.ML

    DCM Bandits: Learning to Rank with Multiple Clicks

    Authors: Sumeet Katariya, Branislav Kveton, Csaba Szepesvári, Zheng Wen

    Abstract: A search engine recommends to the user a list of web pages. The user examines this list, from the first page to the last, and clicks on all attractive pages until the user is satisfied. This behavior of the user can be described by the dependent click model (DCM). We propose DCM bandits, an online learning variant of the DCM where the goal is to maximize the probability of recommending satisfactor… ▽ More

    Submitted 31 May, 2016; v1 submitted 9 February, 2016; originally announced February 2016.

    Comments: Proceedings of the 33rd International Conference on Machine Learning

  23. arXiv:1502.00133  [pdf, other

    stat.ML cs.LG

    Sparse Dueling Bandits

    Authors: Kevin Jamieson, Sumeet Katariya, Atul Deshpande, Robert Nowak

    Abstract: The dueling bandit problem is a variation of the classical multi-armed bandit in which the allowable actions are noisy comparisons between pairs of arms. This paper focuses on a new approach for finding the "best" arm according to the Borda criterion using noisy comparisons. We prove that in the absence of structural assumptions, the sample complexity of this problem is proportional to the sum of… ▽ More

    Submitted 31 January, 2015; originally announced February 2015.