Showing 1–4 of 4 results for author: Tulepbergenov, A

  1. arXiv:2310.06176

    cs.AI

    Factual and Personalized Recommendations using Language Models and Reinforcement Learning

    Authors: Jihwan Jeong, Yinlam Chow, Guy Tennenholtz, Chih-Wei Hsu, Azamat Tulepbergenov, Mohammad Ghavamzadeh, Craig Boutilier

    Abstract: Recommender systems (RSs) play a central role in connecting users to content, products, and services, matching candidate items to users based on their preferences. While traditional RSs rely on implicit user feedback signals, conversational RSs interact with users in natural language. In this work, we develop a comPelling, Precise, Personalized, Preference-relevant language model (P4LM) that recom…

    Submitted 9 October, 2023; originally announced October 2023.

  2. arXiv:2310.04475

    cs.CL cs.AI cs.LG

    Demystifying Embedding Spaces using Large Language Models

    Authors: Guy Tennenholtz, Yinlam Chow, Chih-Wei Hsu, Jihwan Jeong, Lior Shani, Azamat Tulepbergenov, Deepak Ramachandran, Martin Mladenov, Craig Boutilier

    Abstract: Embeddings have become a pivotal means to represent complex, multi-faceted information about entities, concepts, and relationships in a condensed and useful format. Nevertheless, they often preclude direct interpretation. While downstream tasks make use of these compressed representations, meaningful interpretation usually requires visualization using dimensionality reduction or specialized machin…

    Submitted 13 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024

  3. arXiv:2302.10850

    cs.LG cs.AI cs.CL

    Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management

    Authors: Dhawal Gupta, Yinlam Chow, Aza Tulepbergenov, Mohammad Ghavamzadeh, Craig Boutilier

    Abstract: Reinforcement learning (RL) has shown great promise for developing dialogue management (DM) agents that are non-myopic, conduct rich conversations, and maximize overall user satisfaction. Despite recent developments in RL and language models (LMs), using RL to power conversational chatbots remains challenging, in part because RL requires online exploration to learn effectively, whereas collecting…

    Submitted 29 October, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)

  4. arXiv:2206.00059

    cs.CL cs.AI

    A Mixture-of-Expert Approach to RL-based Dialogue Management

    Authors: Yinlam Chow, Aza Tulepbergenov, Ofir Nachum, MoonKyung Ryu, Mohammad Ghavamzadeh, Craig Boutilier

    Abstract: Despite recent advancements in language models (LMs), their application to dialogue management (DM) problems and ability to carry on rich conversations remain a challenge. We use reinforcement learning (RL) to develop a dialogue agent that avoids being short-sighted (outputting generic utterances) and maximizes overall user satisfaction. Most existing RL approaches to DM train the agent at the wor…

    Submitted 31 May, 2022; originally announced June 2022.