Showing 1–2 of 2 results for author: Vu, B L

Search v0.5.6 released 2020-02-24

arXiv:1910.08926 [pdf, other]

cs.LG cs.AI cs.CV stat.ML

Policy Learning for Malaria Control

Authors: Van Bach Nguyen, Belaid Mohamed Karim, Bao Long Vu, Jörg Schlötterer, Michael Granitzer

Abstract: Sequential decision making is a typical problem in reinforcement learning with plenty of algorithms to solve it. However, only a few of them can work effectively with a very small number of observations. In this report, we introduce the progress to learn the policy for Malaria Control as a Reinforcement Learning problem in the KDD Cup Challenge 2019 and propose diverse solutions to deal with the l… ▽ More Sequential decision making is a typical problem in reinforcement learning with plenty of algorithms to solve it. However, only a few of them can work effectively with a very small number of observations. In this report, we introduce the progress to learn the policy for Malaria Control as a Reinforcement Learning problem in the KDD Cup Challenge 2019 and propose diverse solutions to deal with the limited observations problem. We apply the Genetic Algorithm, Bayesian Optimization, Q-learning with sequence breaking to find the optimal policy for five years in a row with only 20 episodes/100 evaluations. We evaluate those algorithms and compare their performance with Random Search as a baseline. Among these algorithms, Q-Learning with sequence breaking has been submitted to the challenge and got ranked 7th in KDD Cup. △ Less

Submitted 20 October, 2019; originally announced October 2019.
arXiv:1707.05422 [pdf, other]

math.OC cs.LG

Don't relax: early stop** for convex regularization

Authors: Simon Matet, Lorenzo Rosasco, Silvia Villa, Bang Long Vu

Abstract: We consider the problem of designing efficient regularization algorithms when regularization is encoded by a (strongly) convex functional. Unlike classical penalization methods based on a relaxation approach, we propose an iterative method where regularization is achieved via early stop**. Our results show that the proposed procedure achieves the same recovery accuracy as penalization methods, w… ▽ More We consider the problem of designing efficient regularization algorithms when regularization is encoded by a (strongly) convex functional. Unlike classical penalization methods based on a relaxation approach, we propose an iterative method where regularization is achieved via early stop**. Our results show that the proposed procedure achieves the same recovery accuracy as penalization methods, while naturally integrating computational considerations. An empirical analysis on a number of problems provides promising results with respect to the state of the art. △ Less

Submitted 17 July, 2017; originally announced July 2017.

MSC Class: 47H05; 49M29; 49M27; 90C25

Search v0.5.6 released 2020-02-24