NathanGavenski/MountainCar-v0

Name: NathanGavenski/MountainCar-v0
Creator: NathanGavenski
Published: 2024-06-11 13:50:26
License: 暂无描述

Hugging Face2024-06-11 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/NathanGavenski/MountainCar-v0

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: mit tags: - Imitation Learning - Expert Trajectory pretty_name: MountainCar-v0 Expert Dataset size_categories: - 10M<n<100M --- # MountainCar-v0 - Imitation Learning Datasets This is a dataset created by [Imitation Learning Datasets](https://github.com/NathanGavenski/IL-Datasets) project. It was created by using Stable Baselines weights from a DQN policy from [HuggingFace](https://huggingface.co/sb3/dqn-MountainCar-v0). ## Description The dataset consists of 1,000 episodes with an average episodic reward of `-98.817`. Each entry consists of: ``` obs (list): observation with length 2. action (int): action (0 or 1). reward (float): reward point for that timestep. episode_returns (bool): if that state was the initial timestep for an episode. ``` ## Usage Feel free to download and use the `teacher.jsonl` dataset as you please. If you are interested in using our PyTorch Dataset implementation, feel free to check the [IL Datasets](https://github.com/NathanGavenski/IL-Datasets/blob/main/src/imitation_datasets/dataset/dataset.py) project. There, we implement a base Dataset that downloads this dataset and all other datasets directly from HuggingFace. The Baseline Dataset also allows for more control over train and test splits and how many episodes you want to use (in cases where the 1k episodes are not necessary). ## Citation ```{bibtex} @inproceedings{gavenski2024ildatasets, author = {Gavenski, Nathan and Luck, Michael and Rodrigues, Odinaldo}, title = {Imitation Learning Datasets: A Toolkit For Creating Datasets, Training Agents and Benchmarking}, year = {2024}, isbn = {9798400704864}, publisher = {International Foundation for Autonomous Agents and Multiagent Systems}, address = {Richland, SC}, abstract = {Imitation learning field requires expert data to train agents in a task. Most often, this learning approach suffers from the absence of available data, which results in techniques being tested on its dataset. Creating datasets is a cumbersome process requiring researchers to train expert agents from scratch, record their interactions and test each benchmark method with newly created data. Moreover, creating new datasets for each new technique results in a lack of consistency in the evaluation process since each dataset can drastically vary in state and action distribution. In response, this work aims to address these issues by creating Imitation Learning Datasets, a toolkit that allows for: (i) curated expert policies with multithreaded support for faster dataset creation; (ii) readily available datasets and techniques with precise measurements; and (iii) sharing implementations of common imitation learning techniques. Demonstration link: https://nathangavenski.github.io/#/il-datasets-video}, booktitle = {Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems}, pages = {2800–2802}, numpages = {3}, keywords = {benchmarking, dataset, imitation learning}, location = {<conf-loc>, <city>Auckland</city>, <country>New Zealand</country>, </conf-loc>}, series = {AAMAS '24} } ```

提供机构：

NathanGavenski

原始信息汇总

MountainCar-v0 - Imitation Learning Datasets

描述

该数据集包含1,000个回合，平均回合奖励为-98.817。每个条目包含以下内容：

obs (列表): 长度为2的观测值。
action (整数): 动作（0或1）。
reward (浮点数): 该时间步的奖励点。
episode_returns (布尔值): 该状态是否为回合的初始时间步。

使用

可以自由下载并使用teacher.jsonl数据集。如果对使用PyTorch数据集实现感兴趣，可以查看IL Datasets项目。该项目实现了一个基础数据集，可以直接从HuggingFace下载此数据集及其他数据集。基础数据集还允许对训练和测试分割进行更多控制，以及选择使用的回合数（在不需要1,000个回合的情况下）。

引用

{bibtex} @inproceedings{gavenski2024ildatasets, author = {Gavenski, Nathan and Luck, Michael and Rodrigues, Odinaldo}, title = {Imitation Learning Datasets: A Toolkit For Creating Datasets, Training Agents and Benchmarking}, year = {2024}, isbn = {9798400704864}, publisher = {International Foundation for Autonomous Agents and Multiagent Systems}, address = {Richland, SC}, abstract = {Imitation learning field requires expert data to train agents in a task. Most often, this learning approach suffers from the absence of available data, which results in techniques being tested on its dataset. Creating datasets is a cumbersome process requiring researchers to train expert agents from scratch, record their interactions and test each benchmark method with newly created data. Moreover, creating new datasets for each new technique results in a lack of consistency in the evaluation process since each dataset can drastically vary in state and action distribution. In response, this work aims to address these issues by creating Imitation Learning Datasets, a toolkit that allows for: (i) curated expert policies with multithreaded support for faster dataset creation; (ii) readily available datasets and techniques with precise measurements; and (iii) sharing implementations of common imitation learning techniques. Demonstration link: https://nathangavenski.github.io/#/il-datasets-video}, booktitle = {Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems}, pages = {2800–2802}, numpages = {3}, keywords = {benchmarking, dataset, imitation learning}, location = {<conf-loc>, <city>Auckland</city>, <country>New Zealand</country>, </conf-loc>}, series = {AAMAS 24} }

5,000+

优质数据集

54 个

任务类型

进入经典数据集