OpenDILabCommunity/LMDrive
收藏Hugging Face2023-12-25 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/OpenDILabCommunity/LMDrive
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: train
path: navigation_instruction_list.txt
sep: " "
default: true
license: apache-2.0
language:
- en
size_categories:
- n>1T
---
# LMDrive 64K Dataset Card
LMDrive Dataset consists of 64K instruction-sensor-control data clips collected in the CARLA simulator, where each clip includes one navigation instruction, several notice instructions, a sequence of multi-modal multi-view sensor data, and control signals. The duration of the clip spans from 2 to 20 seconds.
## Dataset details
- `data/`: dataset folder, the entire dataset contains about 2T of data.
- `data/Town01`: sub dataset folder, which only consists of the data folder for the Town01
- `data/Town02`: sub dataset folder, which only consists of the data folder for the Town02
- ...
- `dataset_index.txt`: the data list for pretraining the vision encoder
- `navigation_instruction_list.txt`: the data list for instruction finetuning
- `notice_instruction_list.json`: the data list for instruction finetuning (optional if the notice instruction data is not engaged in the training)
**Dataset date:**
LMDrive-1.0 Dataset was collected in September 2023.
**Paper or resources for more information:**
Github: https://github.com/opendilab/LMDrive/README.md
Paper: https://arxiv.org/abs/2312.07488
**License:**
Attribution-NonCommercial 4.0 International
**Where to send questions or comments about the model:**
https://github.com/opendilab/LMDrive/issues
## Intended use
**Primary intended uses:**
The primary use of LMDrive is research on large multimodal models for autonomous driving.
**Primary intended users:**
The primary intended users of the model are researchers and hobbyists in computer vision, large multimodal model, autonomous driving, and artificial intelligence.
提供机构:
OpenDILabCommunity
原始信息汇总
LMDrive 64K 数据集卡片
数据集概述
LMDrive 数据集包含 64K 个指令-传感器-控制数据片段,这些数据片段在 CARLA 模拟器中收集。每个片段包括一个导航指令、几个注意指令、一系列多模态多视角传感器数据和控制信号。片段的持续时间从 2 秒到 20 秒不等。
数据集详情
- 数据目录结构:
data/:数据集文件夹,整个数据集包含约 2T 的数据。data/Town01:子数据集文件夹,仅包含 Town01 的数据文件夹。data/Town02:子数据集文件夹,仅包含 Town02 的数据文件夹。- ...
- 数据列表文件:
dataset_index.txt:用于预训练视觉编码器的数据列表。navigation_instruction_list.txt:用于指令微调的数据列表。notice_instruction_list.json:用于指令微调的数据列表(可选,如果未使用注意指令数据进行训练)。
数据集日期
LMDrive-1.0 数据集于 2023 年 9 月收集。
许可证
Attribution-NonCommercial 4.0 International
预期用途
- 主要预期用途:
- LMDrive 主要用于研究大型多模态模型在自动驾驶领域的应用。
- 主要预期用户:
- 该模型的主要预期用户是计算机视觉、大型多模态模型、自动驾驶和人工智能领域的研究人员和爱好者。



