Skip to main content

Showing 1–4 of 4 results for author: Taywade, K

Searching in archive cs. Search in all archives.
.
  1. Personalizing Task-oriented Dialog Systems via Zero-shot Generalizable Reward Function

    Authors: A. B. Siddique, M. H. Maqbool, Kshitija Taywade, Hassan Foroosh

    Abstract: Task-oriented dialog systems enable users to accomplish tasks using natural language. State-of-the-art systems respond to users in the same way regardless of their personalities, although personalizing dialogues can lead to higher levels of adoption and better user experiences. Building personalized dialog systems is an important, yet challenging endeavor and only a handful of works took on the ch… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: 11 pages, 4 tables, 31st ACM International Conference on Information and Knowledge Management (CIKM'22)

  2. arXiv:2201.01182  [pdf, other

    cs.GT cs.AI cs.LG cs.MA econ.EM

    Modelling Cournot Games as Multi-agent Multi-armed Bandits

    Authors: Kshitija Taywade, Brent Harrison, Adib Bagh

    Abstract: We investigate the use of a multi-agent multi-armed bandit (MA-MAB) setting for modeling repeated Cournot oligopoly games, where the firms acting as agents choose from the set of arms representing production quantity (a discrete value). Agents interact with separate and independent bandit problems. In this formulation, each agent makes sequential choices among arms to maximize its own reward. Agen… ▽ More

    Submitted 1 January, 2022; originally announced January 2022.

    Comments: 12 pages. arXiv admin note: text overlap with arXiv:2201.00486

  3. arXiv:2201.00486  [pdf, other

    cs.LG cs.GT cs.MA econ.GN

    Using Non-Stationary Bandits for Learning in Repeated Cournot Games with Non-Stationary Demand

    Authors: Kshitija Taywade, Brent Harrison, Judy Goldsmith

    Abstract: Many past attempts at modeling repeated Cournot games assume that demand is stationary. This does not align with real-world scenarios in which market demands can evolve over a product's lifetime for a myriad of reasons. In this paper, we model repeated Cournot games with non-stationary demand such that firms/agents face separate instances of non-stationary multi-armed bandit problem. The set of ar… ▽ More

    Submitted 3 January, 2022; originally announced January 2022.

    Comments: 13 pages

  4. arXiv:2005.01117  [pdf, ps, other

    cs.LG cs.AI cs.MA

    Multi-agent Reinforcement Learning for Decentralized Stable Matching

    Authors: Kshitija Taywade, Judy Goldsmith, Brent Harrison

    Abstract: In the real world, people/entities usually find matches independently and autonomously, such as finding jobs, partners, roommates, etc. It is possible that this search for matches starts with no initial knowledge of the environment. We propose the use of a multi-agent reinforcement learning (MARL) paradigm for a spatially formulated decentralized two-sided matching market with independent and auto… ▽ More

    Submitted 3 December, 2021; v1 submitted 3 May, 2020; originally announced May 2020.

    Comments: 16 pages

    Journal ref: 7th International Conference on Algorithmic Decision Theory, 2021