Skip to main content

Showing 1–1 of 1 results for author: Guyton, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2204.11910  [pdf, other

    cs.LG cs.CY

    Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection

    Authors: Peter Henderson, Ben Chugg, Brandon Anderson, Kristen Altenburger, Alex Turk, John Guyton, Jacob Goldin, Daniel E. Ho

    Abstract: We introduce a new setting, optimize-and-estimate structured bandits. Here, a policy must select a batch of arms, each characterized by its own context, that would allow it to both maximize reward and maintain an accurate (ideally unbiased) population estimate of the reward. This setting is inherent to many public and private sector applications and often requires handling delayed feedback, small… ▽ More

    Submitted 24 January, 2023; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: Accepted to the Thirty-Seventh AAAI Conference On Artificial Intelligence (AAAI), 2023