
Showing 1–2 of 2 results for author: Manzanares, C A

  1. arXiv:2304.04916  [pdf, other]

    cs.LG stat.ML

    A Data-Driven State Aggregation Approach for Dynamic Discrete Choice Models

    Authors: Sinong Geng, Houssam Nassif, Carlos A. Manzanares

    Abstract: We study dynamic discrete choice models, where a commonly studied problem involves estimating parameters of agent reward functions (also known as "structural" parameters), using agent behavioral data. Maximum likelihood estimation for such models requires dynamic programming, which is limited by the curse of dimensionality. In this work, we present a novel algorithm that provides a data-driven met…

    Submitted 31 May, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

    Journal ref: The Conference on Uncertainty in Artificial Intelligence (UAI'23), Pittsburgh, PA, pp. 647-657, 2023

  2. arXiv:2007.07443  [pdf, other]

    cs.LG math.OC stat.ML

    Deep PQR: Solving Inverse Reinforcement Learning using Anchor Actions

    Authors: Sinong Geng, Houssam Nassif, Carlos A. Manzanares, A. Max Reppen, Ronnie Sircar

    Abstract: We propose a reward function estimation framework for inverse reinforcement learning with deep energy-based policies. We name our method PQR, as it sequentially estimates the Policy, the $Q$-function, and the Reward function by deep learning. PQR does not assume that the reward solely depends on the state, instead it allows for a dependency on the choice of action. Moreover, PQR allows for stochas…

    Submitted 14 August, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

    Journal ref: In Proceedings of the 37th ICML, Vienna, Austria, PMLR 119, pp. 3431-3441, 2020