Model-based Offline Reinforcement Learning with Lower Expectile Q-learning

Yonsei University

Abstract

Comparison panels: MOBILE, LEQ (Ours), CBOP


Lower Expectile Q-learning (LEQ)


Lower expectile
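As a rough illustration of the term, the sketch below shows standard expectile regression in NumPy, assuming that "lower" refers to an expectile level tau below 0.5. The function names and the default tau=0.1 are hypothetical choices for this example, not taken from the paper, and the paper's actual objective may differ in detail.

```python
import numpy as np


def expectile_loss(pred, target, tau=0.1):
    """Asymmetric squared error |tau - 1(u < 0)| * u^2, with u = target - pred.

    With tau < 0.5, overestimation (pred > target) is penalized more
    heavily than underestimation, so the minimizer sits below the mean.
    """
    u = np.asarray(target, dtype=float) - pred
    weight = np.where(u < 0, 1.0 - tau, tau)
    return float(np.mean(weight * u ** 2))


def expectile(samples, tau=0.1, iters=100):
    """Estimate the tau-expectile of samples by iteratively reweighted averaging."""
    samples = np.asarray(samples, dtype=float)
    m = samples.mean()
    for _ in range(iters):
        # Samples below the current estimate get weight (1 - tau), the rest tau.
        w = np.where(samples < m, 1.0 - tau, tau)
        m = np.sum(w * samples) / np.sum(w)
    return float(m)
```

With tau=0.5 the expectile reduces to the ordinary mean; lowering tau pulls the estimate toward the smaller samples, which is one way to obtain a conservative value target.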



λ-returns
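For readers unfamiliar with the term, the following is a minimal sketch of the standard backward recursion for λ-returns over a finite rollout of length H. It assumes a generic value estimate at each state; the function name, argument layout, and default gamma and lam are hypothetical, and the exact form used by LEQ may differ.

```python
import numpy as np


def lambda_returns(rewards, values, gamma=0.99, lam=0.95):
    """Compute lambda-returns for one rollout.

    rewards: r_0 .. r_{H-1}          (length H)
    values:  V(s_0) .. V(s_H)        (length H + 1, last entry bootstraps)
    Recursion: G_t = r_t + gamma * ((1 - lam) * V(s_{t+1}) + lam * G_{t+1}),
    with G_H = V(s_H).
    """
    rewards = np.asarray(rewards, dtype=float)
    values = np.asarray(values, dtype=float)
    horizon = len(rewards)
    returns = np.zeros(horizon)
    next_return = values[horizon]  # bootstrap from the final state
    for t in reversed(range(horizon)):
        next_return = rewards[t] + gamma * (
            (1.0 - lam) * values[t + 1] + lam * next_return
        )
        returns[t] = next_return
    return returns
```

Setting lam=0 recovers one-step TD targets, while lam=1 gives the full discounted rollout return bootstrapped from the last state; intermediate values blend the n-step returns.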




Utilizing transitions of the dataset




Experiments



AntMaze

Panels: (a) umaze, (b) medium, (c) large, (d) ultra


Locomotion


Visual Control

BibTeX

@inproceedings{park2025leq,
  title={Model-based Offline Reinforcement Learning with Lower Expectile Q-learning},
  author={Kwanyoung Park and Youngwoon Lee},
  booktitle={The Thirteenth International Conference on Learning Representations},
  year={2025},
}