Uniformly Distributed Data Effects in Offline RL: A Case Study in Gridworld Setting

Kuanysh Tokayev, Jurn Gyu Park

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In the emerging landscape of off-policy reinforce-ment learning (RL), challenges arise due to the significant costs and risks tied to data collection. To address these issues, there is an alternative path for transitioning from off-policy to offline RL, known for its fixed data collection practices. This stands in contrast to online algorithms, which are sensitive to changes in data during the learning phase. However, the inherent challenge of offline RL lies in its limited interaction with the environment, resulting in inadequate data coverage. Hence, we underscore the convenient application of offline RL, 1) starting from the collection of a static dataset, 2) followed by the training of offline RL models, and 3) culminating with testing in the same environment as off-policy RL methodologies. This involves the utilization of a uniform dataset gathered systematically via non-arbitrary action selection, covering all possible states of the environment. Utilizing the proposed approach, the Offline RL model employing a Multi-Layer Perceptron (MLP) achieves a testing accuracy that falls within 1 % of the results obtained by the off-policy RL agent. Moreover, we provide a practical guide with datasets, offering valuable tutorials on the application of Offline RL in Gridworld-based real-world applications. The guide can be found in this GitHub1repository.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE International Conference on Big Data and Smart Computing, BigComp 2024
EditorsHerwig Unger, Jinseok Chae, Young-Koo Lee, Christian Wagner, Chaokun Wang, Mehdi Bennis, Mahasak Ketcham, Young-Kyoon Suh, Hyuk-Yoon Kwon
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages159-166
Number of pages8
ISBN (Electronic)9798350370027
DOIs
Publication statusPublished - 2024
Event2024 IEEE International Conference on Big Data and Smart Computing, BigComp 2024 - Bangkok, Thailand
Duration: Feb 18 2024Feb 21 2024

Publication series

NameProceedings - 2024 IEEE International Conference on Big Data and Smart Computing, BigComp 2024

Conference

Conference2024 IEEE International Conference on Big Data and Smart Computing, BigComp 2024
Country/TerritoryThailand
CityBangkok
Period2/18/242/21/24

Keywords

  • Data Distribution
  • Deep Learning
  • DQN
  • Machine Learning
  • Offline RL
  • Tutorial

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computational Theory and Mathematics
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
  • Computer Science Applications
  • Information Systems

Fingerprint

Dive into the research topics of 'Uniformly Distributed Data Effects in Offline RL: A Case Study in Gridworld Setting'. Together they form a unique fingerprint.

Cite this