-
Approximation of Convex Envelope Using Reinforcement Learning
Abstract: Oberman gave a stochastic control formulation of the problem of estimating the convex envelope of a non-convex function. Based on this, we develop a reinforcement learning scheme to approximate the convex envelope, using a variant of Q-learning for controlled optimal stop**. It shows very promising results on a standard library of test problems.
Submitted 24 November, 2023; originally announced November 2023.