Offline Inverse Reinforcement Learning

Jarboui, Firas; Perchet, Vianney

Computer Science > Machine Learning

arXiv:2106.05068 (cs)

[Submitted on 9 Jun 2021]

Title:Offline Inverse Reinforcement Learning

Authors:Firas Jarboui, Vianney Perchet

View PDF

Abstract:The objective of offline RL is to learn optimal policies when a fixed exploratory demonstrations data-set is available and sampling additional observations is impossible (typically if this operation is either costly or rises ethical questions). In order to solve this problem, off the shelf approaches require a properly defined cost function (or its evaluation on the provided data-set), which are seldom available in practice. To circumvent this issue, a reasonable alternative is to query an expert for few optimal demonstrations in addition to the exploratory data-set. The objective is then to learn an optimal policy w.r.t. the expert's latent cost function. Current solutions either solve a behaviour cloning problem (which does not leverage the exploratory data) or a reinforced imitation learning problem (using a fixed cost function that discriminates available exploratory trajectories from expert ones). Inspired by the success of IRL techniques in achieving state of the art imitation performances in online settings, we exploit GAN based data augmentation procedures to construct the first offline IRL algorithm. The obtained policies outperformed the aforementioned solutions on multiple OpenAI gym environments.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2106.05068 [cs.LG]
	(or arXiv:2106.05068v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2106.05068

Submission history

From: Firas Jarboui [view email]
[v1] Wed, 9 Jun 2021 13:44:06 UTC (193 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-06

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Firas Jarboui
Vianney Perchet

export BibTeX citation

Computer Science > Machine Learning

Title:Offline Inverse Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Offline Inverse Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators