Tackling Long-Horizon Tasks with Model-based Offline Reinforcement Learning

Yonsei University

Abstract

MOBILE

LEQ (Ours)

CBOP


Lower-Expectile Q learning (LEQ)


Lower expectile



λ-returns




Utilizing transitions of the dataset




Experiments



Antmaze

(a) umaze

(b) medium

(c) large

(d) ultra


Locomotions

BibTeX

@article{park2024tackling,
  title={Tackling Long-Horizon Tasks with Model-based Offline Reinforcement Learning},
  author={Kwanyoung Park and Youngwoon Lee},
  journal={arXiv Preprint arxiv:2407.00699},
  year={2024}
}