Skip to main content

Showing 1–25 of 25 results for author: Sankararaman, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.00183  [pdf, other

    cs.LG cs.AI

    On the Equivalence of Graph Convolution and Mixup

    Authors: Xiaotian Han, Hanqing Zeng, Yu Chen, Shaoliang Nie, **gzhou Liu, Kanika Narang, Zahra Shakeri, Karthik Abinav Sankararaman, Song Jiang, Madian Khabsa, Qifan Wang, Xia Hu

    Abstract: This paper investigates the relationship between graph convolution and Mixup techniques. Graph convolution in a graph neural network involves aggregating features from neighboring samples to learn representative features for a specific node or sample. On the other hand, Mixup is a data augmentation technique that generates new examples by averaging features and one-hot labels from multiple samples… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

  2. arXiv:2309.16039  [pdf, other

    cs.CL

    Effective Long-Context Scaling of Foundation Models

    Authors: Wenhan Xiong, **gyu Liu, Igor Molybog, Hejia Zhang, Prajjwal Bhargava, Rui Hou, Louis Martin, Rashi Rungta, Karthik Abinav Sankararaman, Barlas Oguz, Madian Khabsa, Han Fang, Yashar Mehdad, Sharan Narang, Kshitiz Malik, Angela Fan, Shruti Bhosale, Sergey Edunov, Mike Lewis, Sinong Wang, Hao Ma

    Abstract: We present a series of long-context LLMs that support effective context windows of up to 32,768 tokens. Our model series are built through continual pretraining from Llama 2 with longer training sequences and on a dataset where long texts are upsampled. We perform extensive evaluation on language modeling, synthetic context probing tasks, and a wide range of research benchmarks. On research benchm… ▽ More

    Submitted 13 November, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

  3. arXiv:2306.07893  [pdf, other

    cs.GT

    Rethinking Incentives in Recommender Systems: Are Monotone Rewards Always Beneficial?

    Authors: Fan Yao, Chuanhao Li, Karthik Abinav Sankararaman, Yiming Liao, Yan Zhu, Qifan Wang, Hongning Wang, Haifeng Xu

    Abstract: The past decade has witnessed the flourishing of a new profession as media content creators, who rely on revenue streams from online content recommendation platforms. The reward mechanism employed by these platforms creates a competitive environment among creators which affect their production choices and, consequently, content distribution and system welfare. It is thus crucial to design the plat… ▽ More

    Submitted 9 July, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

  4. arXiv:2211.07484  [pdf, ps, other

    cs.LG stat.ML

    Contextual Bandits with Packing and Covering Constraints: A Modular Lagrangian Approach via Regression

    Authors: Aleksandrs Slivkins, Xingyu Zhou, Karthik Abinav Sankararaman, Dylan J. Foster

    Abstract: We consider contextual bandits with linear constraints (CBwLC), a variant of contextual bandits in which the algorithm consumes multiple resources subject to linear constraints on total consumption. This problem generalizes contextual bandits with knapsacks (CBwK), allowing for packing and covering constraints, as well as positive and negative resource consumption. We provide the first algorithm f… ▽ More

    Submitted 29 June, 2024; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: A preliminary version of this paper, authored by A. Slivkins, K.A. Sankararaman and D.J. Foster, has been published at COLT 2023. The present version features an important improvement, due to Xingyu Zhou. Specifically, the $\sqrt{T}$-regret result in Theorem 3.6(a) holds under a much weaker assumption, and is now positioned as the main guarantee

  5. arXiv:2211.02233  [pdf, ps, other

    cs.LG cs.AI

    Improved Adaptive Algorithm for Scalable Active Learning with Weak Labeler

    Authors: Yifang Chen, Karthik Sankararaman, Alessandro Lazaric, Matteo Pirotta, Dmytro Karamshuk, Qifan Wang, Karishma Mandyam, Sinong Wang, Han Fang

    Abstract: Active learning with strong and weak labelers considers a practical setting where we have access to both costly but accurate strong labelers and inaccurate but cheap predictions provided by weak labelers. We study this problem in the streaming setting, where decisions must be taken \textit{online}. We design a novel algorithmic template, Weak Labeler Active Cover (WL-AC), that is able to robustly… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

  6. arXiv:2206.00826  [pdf, other

    cs.CL cs.AI cs.LG

    BayesFormer: Transformer with Uncertainty Estimation

    Authors: Karthik Abinav Sankararaman, Sinong Wang, Han Fang

    Abstract: Transformer has become ubiquitous due to its dominant performance in various NLP and image processing tasks. However, it lacks understanding of how to generate mathematically grounded uncertainty estimates for transformer architectures. Models equipped with such uncertainty estimates can typically improve predictive performance, make networks robust, avoid over-fitting and used as acquisition func… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

  7. arXiv:2112.05247  [pdf, ps, other

    cs.DS

    Online minimum matching with uniform metric and random arrivals

    Authors: Sharmila Duppala, Karthik A. Sankararaman, Pan Xu

    Abstract: We consider Online Minimum Bipartite Matching under the uniform metric. We show that Randomized Greedy achieves a competitive ratio equal to $(1+1/n) (H_{n+1}-1)$, which matches the lower bound. Comparing with the fact that RG achieves an optimal ratio of $Θ(\ln n)$ for the same problem but under the adversarial order, we find that the weaker arrival assumption of random order doesn't offer any ex… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: Accepted to Operations Research Letters (ORL)

  8. arXiv:2108.04862  [pdf, other

    cs.AI cs.CY

    Matching Algorithms for Blood Donation

    Authors: Duncan C McElfresh, Christian Kroer, Sergey Pupyrev, Eric Sodomka, Karthik Sankararaman, Zack Chauvin, Neil Dexter, John P Dickerson

    Abstract: Global demand for donated blood far exceeds supply, and unmet need is greatest in low- and middle-income countries; experts suggest that large-scale coordination is necessary to alleviate demand. Using the Facebook Blood Donation tool, we conduct the first large-scale algorithmic matching of blood donors with donation opportunities. While measuring actual donation rates remains a challenge, we mea… ▽ More

    Submitted 13 August, 2021; v1 submitted 10 August, 2021; originally announced August 2021.

    Comments: An early version of this paper appeared at EC'20. (https://doi.org/10.1145/3391403.3399458)

    ACM Class: J.3; J.4

  9. arXiv:2103.10246  [pdf, other

    cs.GT cs.LG

    Stochastic Bandits for Multi-platform Budget Optimization in Online Advertising

    Authors: Vashist Avadhanula, Riccardo Colini-Baldeschi, Stefano Leonardi, Karthik Abinav Sankararaman, Okke Schrijvers

    Abstract: We study the problem of an online advertising system that wants to optimally spend an advertiser's given budget for a campaign across multiple platforms, without knowing the value for showing an ad to the users on those platforms. We model this challenging practical application as a Stochastic Bandits with Knapsacks problem over $T$ rounds of bidding with the set of arms given by the set of distin… ▽ More

    Submitted 25 March, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

  10. arXiv:2103.07501  [pdf, other

    cs.LG stat.ML

    Beyond $\log^2(T)$ Regret for Decentralized Bandits in Matching Markets

    Authors: Soumya Basu, Karthik Abinav Sankararaman, Abishek Sankararaman

    Abstract: We design decentralized algorithms for regret minimization in the two-sided matching market with one-sided bandit feedback that significantly improves upon the prior works (Liu et al. 2020a, 2020b, Sankararaman et al. 2020). First, for general markets, for any $\varepsilon > 0$, we design an algorithm that achieves a $O(\log^{1+\varepsilon}(T))$ regret to the agent-optimal stable matching, with un… ▽ More

    Submitted 12 March, 2021; originally announced March 2021.

  11. arXiv:2010.08142  [pdf, other

    cs.DS

    Improved Approximation Algorithms for Stochastic-Matching Problems

    Authors: Marek Adamczyk, Brian Brubach, Fabrizio Grandoni, Karthik A. Sankararaman, Aravind Srinivasan, Pan Xu

    Abstract: We consider the Stochastic Matching problem, which is motivated by applications in kidney exchange and online dating. In this problem, we are given an undirected graph. Each edge is assigned a known, independent probability of existence and a positive weight (or profit). We must probe an edge to discover whether or not it exists. Each node is assigned a positive integer called a timeout (or a pati… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1505.01439

  12. arXiv:2007.06869  [pdf, other

    cs.LG cs.AI cs.DS stat.ML

    Robust Identifiability in Linear Structural Equation Models of Causal Inference

    Authors: Karthik Abinav Sankararaman, Anand Louis, Navin Goyal

    Abstract: In this work, we consider the problem of robust parameter estimation from observational data in the context of linear structural equation models (LSEMs). LSEMs are a popular and well-studied class of models for inferring causality in the natural and social sciences. One of the main problems related to LSEMs is to recover the model parameters from the observational data. Under various conditions on… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

  13. arXiv:2006.15166  [pdf, other

    cs.LG cs.DS cs.GT stat.ML

    Dominate or Delete: Decentralized Competing Bandits in Serial Dictatorship

    Authors: Abishek Sankararaman, Soumya Basu, Karthik Abinav Sankararaman

    Abstract: Online learning in a two-sided matching market, with demand side agents continuously competing to be matched with supply side (arms), abstracts the complex interactions under partial information on matching platforms (e.g. UpWork, TaskRabbit). We study the decentralized serial dictatorship setting, a two-sided matching market where the demand side agents have unknown and heterogeneous valuation ov… ▽ More

    Submitted 12 March, 2021; v1 submitted 26 June, 2020; originally announced June 2020.

    Comments: AISTATS, 2021

  14. arXiv:2002.00253  [pdf, other

    cs.LG cs.DS stat.ML

    Bandits with Knapsacks beyond the Worst-Case

    Authors: Karthik Abinav Sankararaman, Aleksandrs Slivkins

    Abstract: Bandits with Knapsacks (BwK) is a general model for multi-armed bandits under supply/budget constraints. While worst-case regret bounds for BwK are well-understood, we present three results that go beyond the worst-case perspective. First, we provide upper and lower bounds which amount to a full characterization for logarithmic, instance-dependent regret rates. Second, we consider "simple regret"… ▽ More

    Submitted 28 December, 2021; v1 submitted 1 February, 2020; originally announced February 2020.

    Comments: The initial version, titled "Advances in Bandits with Knapsacks", was published on arxiv.longhoe.net in Jan'20. The present version improves both upper and lower bounds, deriving Theorem 3.2(ii) and Theorem 4.2. Moreover, it simplifies the algorithm and analysis in the main result, and fixes several issues in the lower bounds

  15. arXiv:1912.08388  [pdf, other

    cs.AI cs.CY

    Balancing the Tradeoff between Profit and Fairness in Rideshare Platforms During High-Demand Hours

    Authors: Vedant Nanda, Pan Xu, Karthik Abinav Sankararaman, John P. Dickerson, Aravind Srinivasan

    Abstract: Rideshare platforms, when assigning requests to drivers, tend to maximize profit for the system and/or minimize waiting time for riders. Such platforms can exacerbate biases that drivers may have over certain types of requests. We consider the case of peak hours when the demand for rides is more than the supply of drivers. Drivers are well aware of their advantage during the peak hours and can cho… ▽ More

    Submitted 6 September, 2020; v1 submitted 18 December, 2019; originally announced December 2019.

    Comments: 8 pages, 4 figures, Accepted at AAAI 2020 & AIES (Oral) 2020

  16. arXiv:1912.00225  [pdf, other

    cs.DS cs.GT cs.LG cs.MA

    Mix and Match: Markov Chains & Mixing Times for Matching in Rideshare

    Authors: Michael J. Curry, John P. Dickerson, Karthik Abinav Sankararaman, Aravind Srinivasan, Yuhao Wan, Pan Xu

    Abstract: Rideshare platforms such as Uber and Lyft dynamically dispatch drivers to match riders' requests. We model the dispatching process in rideshare as a Markov chain that takes into account the geographic mobility of both drivers and riders over time. Prior work explores dispatch policies in the limit of such Markov chains; we characterize when this limit assumption is valid, under a variety of natura… ▽ More

    Submitted 30 November, 2019; originally announced December 2019.

  17. arXiv:1905.06836  [pdf, other

    cs.LG stat.ML

    Stability of Linear Structural Equation Models of Causal Inference

    Authors: Karthik Abinav Sankararaman, Anand Louis, Navin Goyal

    Abstract: We consider the numerical stability of the parameter recovery problem in Linear Structural Equation Model ($\LSEM$) of causal inference. A long line of work starting from Wright (1920) has focused on understanding which sub-classes of $\LSEM$ allow for efficient parameter recovery. Despite decades of study, this question is not yet fully resolved. The goal of this paper is complementary to this li… ▽ More

    Submitted 17 August, 2020; v1 submitted 16 May, 2019; originally announced May 2019.

    Comments: To appear in UAI 2019

  18. arXiv:1904.06963  [pdf, other

    cs.LG stat.ML

    The Impact of Neural Network Overparameterization on Gradient Confusion and Stochastic Gradient Descent

    Authors: Karthik A. Sankararaman, Soham De, Zheng Xu, W. Ronny Huang, Tom Goldstein

    Abstract: This paper studies how neural network architecture affects the speed of training. We introduce a simple concept called gradient confusion to help formally analyze this. When gradient confusion is high, stochastic gradients produced by different data samples may be negatively correlated, slowing down convergence. But when gradient confusion is low, data samples interact harmoniously, and training p… ▽ More

    Submitted 6 July, 2020; v1 submitted 15 April, 2019; originally announced April 2019.

    Comments: ICML 2020 camera-ready version

  19. arXiv:1811.11881  [pdf, other

    cs.DS cs.LG stat.ML

    Adversarial Bandits with Knapsacks

    Authors: Nicole Immorlica, Karthik Abinav Sankararaman, Robert Schapire, Aleksandrs Slivkins

    Abstract: We consider Bandits with Knapsacks (henceforth, BwK), a general model for multi-armed bandits under supply/budget constraints. In particular, a bandit algorithm needs to solve a well-known knapsack problem: find an optimal packing of items into a limited-size knapsack. The BwK problem is a common generalization of numerous motivating examples, which range from dynamic pricing to repeated auctions… ▽ More

    Submitted 6 March, 2023; v1 submitted 28 November, 2018; originally announced November 2018.

    Comments: The extended abstract appeared in FOCS 2019. The definitive version was published in JACM '22. V8 is the latest version with all technical changes. Subsequent versions fixes minor LATEX presentation issues

  20. arXiv:1811.05100  [pdf, other

    cs.DS

    Balancing Relevance and Diversity in Online Bipartite Matching via Submodularity

    Authors: John P. Dickerson, Karthik Abinav Sankararaman, Aravind Srinivasan, Pan Xu

    Abstract: In bipartite matching problems, vertices on one side of a bipartite graph are paired with those on the other. In its online variant, one side of the graph is available offline, while the vertices on the other side arrive online. When a vertex arrives, an irrevocable and immediate decision should be made by the algorithm; either match it to an available vertex or drop it. Examples of such problems… ▽ More

    Submitted 12 November, 2018; originally announced November 2018.

    Comments: To appear in AAAI 2019

  21. arXiv:1804.08062  [pdf, ps, other

    cs.DS cs.AI cs.MA

    Attenuate Locally, Win Globally: An Attenuation-based Framework for Online Stochastic Matching with Timeouts

    Authors: Brian Brubach, Karthik Abinav Sankararaman, Aravind Srinivasan, Pan Xu

    Abstract: Online matching problems have garnered significant attention in recent years due to numerous applications in e-commerce, online advertisements, ride-sharing, etc. Many of them capture the uncertainty in the real world by including stochasticity in both the arrival process and the matching process. The Online Stochastic Matching with Timeouts problem introduced by Bansal, et al., (Algorithmica, 201… ▽ More

    Submitted 21 June, 2019; v1 submitted 21 April, 2018; originally announced April 2018.

    Comments: A short version appeared in AAMAS-2017. This version fixes some bugs in the camera-ready version of the paper

  22. arXiv:1711.08345  [pdf, other

    cs.AI cs.GT

    Allocation Problems in Ride-Sharing Platforms: Online Matching with Offline Reusable Resources

    Authors: John P Dickerson, Karthik A Sankararaman, Aravind Srinivasan, Pan Xu

    Abstract: Bipartite matching markets pair agents on one side of a market with agents, items, or contracts on the opposing side. Prior work addresses online bipartite matching markets, where agents arrive over time and are dynamically matched to a known set of disposable resources. In this paper, we propose a new model, Online Matching with (offline) Reusable Resources under Known Adversarial Distributions (… ▽ More

    Submitted 11 December, 2017; v1 submitted 22 November, 2017; originally announced November 2017.

    Comments: To appear in AAAI 2018

  23. arXiv:1711.02724  [pdf, ps, other

    cs.DS cs.DM math.CO

    Algorithms to Approximate Column-Sparse Packing Problems

    Authors: Brian Brubach, Karthik Abinav Sankararaman, Aravind Srinivasan, Pan Xu

    Abstract: Column-sparse packing problems arise in several contexts in both deterministic and stochastic discrete optimization. We present two unifying ideas, (non-uniform) attenuation and multiple-chance algorithms, to obtain improved approximation algorithms for some well-known families of such problems. As three main examples, we attain the integrality gap, up to lower-order terms, for known LP relaxation… ▽ More

    Submitted 5 August, 2019; v1 submitted 7 November, 2017; originally announced November 2017.

    Comments: Extended abstract appeared in SODA 2018. Full version in ACM Transactions of Algorithms

  24. arXiv:1705.08110  [pdf, other

    cs.LG

    Combinatorial Semi-Bandits with Knapsacks

    Authors: Karthik Abinav Sankararaman, Aleksandrs Slivkins

    Abstract: We unify two prominent lines of work on multi-armed bandits: bandits with knapsacks (BwK) and combinatorial semi-bandits. The former concerns limited "resources" consumed by the algorithm, e.g., limited supply in dynamic pricing. The latter allows a huge number of actions but assumes combinatorial structure and additional feedback to make the problem tractable. We define a common generalization, s… ▽ More

    Submitted 20 February, 2018; v1 submitted 23 May, 2017; originally announced May 2017.

  25. arXiv:1606.06395  [pdf, ps, other

    cs.DS cs.DM cs.GT math.CO math.PR

    Online Stochastic Matching: New Algorithms and Bounds

    Authors: Brian Brubach, Karthik Abinav Sankararaman, Aravind Srinivasan, Pan Xu

    Abstract: Online matching has received significant attention over the last 15 years due to its close connection to Internet advertising. As the seminal work of Karp, Vazirani, and Vazirani has an optimal (1 - 1/e) competitive ratio in the standard adversarial online model, much effort has gone into develo** useful online models that incorporate some stochasticity in the arrival process. One such popular m… ▽ More

    Submitted 22 July, 2019; v1 submitted 20 June, 2016; originally announced June 2016.

    Comments: Preliminary Version appeared in European Symposium on Algorithms (ESA) 2016