Nemotron-VLM-Dataset-v2
收藏魔搭社区2026-01-06 更新2025-11-03 收录
下载链接:
https://modelscope.cn/datasets/nv-community/Nemotron-VLM-Dataset-v2
下载链接
链接失效反馈官方服务:
资源简介:
# Nemotron-VLM-Dataset v2
## Versions
| Date | Commit | Changes |
|-------------|--------------|----------|
| **2025-11-05** | [head](https://huggingface.co/datasets/nvidia/Nemotron-VLM-Dataset-v2/tree/main) | Fix `nights_cot` dataset. Fix/filter broken <think> entries. Update fintabnet instructions. Update indexes. |
| **2025-10-28** | [214051e](https://huggingface.co/datasets/nvidia/Nemotron-VLM-Dataset-v2/tree/214051e30f0f5ef2d6cd7eb54027b38d229c8822) | Initial Release |
## Dataset Description
Following up on Llama Nemotron VLM Dataset V1 with 3 million samples, we are releasing the Nemotron VLM Dataset V2 with almost three times as many high-quality samples.
This time, our focus was on three main areas: Adding new data modalities like video, expanding our chain-of-thought reasoning data, and providing the community with a toolchain to generate OCR training data.
We discovered that to enhance performance further, our models needed to learn not only the correct answer but also the reasoning process behind it. Adding more targeted chain-of-thought datasets proved to be the key to breaking the plateau for various benchmarks.
With this release, we are broadening the dataset scope to allow for training more capable models. We added
- New Modalities and Domains: We have added a substantial amount of new data covering UI understanding, complex charts, diagrams. For the first time, we are also including video understanding tasks.
- Focus on Reasoning: We have been able to break benchmark plateaus by adding more chain-of-thought data, some of which we generated by auto labeling thinking traces for existing samples. We found that providing those traces helped especially for samples that the previous model struggled with.
- Improved OCR: We further improved on the highly-competitive OCR capabilities of our first VL model by adding an even larger variety of training samples including multilingual data for six languages. Unfortunately, we cannot redistribute a large part of those samples, but we are releasing the data generation pipeline that we used, so you can generate all that OCR data with ground truth yourself! Check it out [here](https://github.com/NVIDIA-NeMo/Curator/tree/experimental/experimental/nvpdftex).
In the table below, you can see all the subdatasets that we are publishing with their sizes, properties and link to a subdataset card with more details.
For each subdataset we are publishing the annotations/labels which we generated by using various strategies, see "Source & Processing" column.
The actual media data (images and videos) can only be redistributed for some of the datasets according to their licenses.
For the remaining ones, we provide instructions on how to obtain the data in each of the subdataset cards.
All of the data is prepared to be used with our multi-modal data loader [Megatron Energon](https://github.com/NVIDIA/Megatron-Energon). For more details, see [this section](#loading-the-data-with-megatron-energon) below.
Our [Nemotron Nano V2 VL](https://huggingface.co/papers/2511.03929) model was trained using this data.
This dataset is ready for commercial use.
---
## Dataset Owner
NVIDIA Corporation
---
## Dataset Creation Date
10/27/2025
---
## License/Terms of Use
**Governing Terms**: This collection of datasets is governed by the [Creative Commons Attribution 4.0 International License](https://creativecommons.org/licenses/by/4.0/deed.en) (CC-BY-4.0), except for the following datasets, which are governed by the [Creative Commons Attribution-ShareAlike 4.0 International License](https://creativecommons.org/licenses/by-sa/4.0/) (CC BY-SA 4.0): dewiki_v5_0828, enwiki_v5_0828, eswiki_v5_0828, frwiki_v5_0828, itwiki_v5_0828, jawiki_v5_0828, kowiki_v5_0828, nlwiki_v5_0828, ptwiki_v5_0828, and zhwiki_v5_0828.
---
## Intended Usage
The Llama Nemotron VLM Dataset is intended to be used by the community to continue to improve open models. The data may be freely used to train and evaluate.
---
## Dataset Composition
| Dataset Name | Samples | Size (GB) | Data & Task Type | Source & Processing | Media incl. | Governing Terms |
|------------|-----------:|-----------:|------------|------------|------------|------------|
| [wiki_de](./wiki_de/README.md) | 200,000 | 37.13 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 |
| [wiki_en](./wiki_en/README.md) | 200,000 | 33.38 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 |
| [wiki_es](./wiki_es/README.md) | 200,000 | 32.85 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 |
| [wiki_fr](./wiki_fr/README.md) | 200,000 | 31.14 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 |
| [wiki_it](./wiki_it/README.md) | 200,000 | 30.30 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 |
| [wiki_ja](./wiki_ja/README.md) | 200,000 | 38.39 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 |
| [wiki_ko](./wiki_ko/README.md) | 200,000 | 27.09 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 |
| [wiki_nl](./wiki_nl/README.md) | 200,000 | 29.52 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 |
| [wiki_pt](./wiki_pt/README.md) | 200,000 | 30.49 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 |
| [wiki_zh](./wiki_zh/README.md) | 200,000 | 30.14 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 |
| [oi_bbox_1](./oi_bbox_1/README.md) | 1,664,533 | 490.09 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | | CC BY 4.0 |
| [oi_bbox_2](./oi_bbox_2/README.md) | 1,664,533 | 488.08 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | | CC BY 4.0 |
| [oi_bbox_3](./oi_bbox_3/README.md) | 1,128,326 | 324.41 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | | CC BY 4.0 |
| [tabmwp_cot](./tabmwp_cot/README.md) | 20,305 | 0.28 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 |
| [sparsetables](./sparsetables/README.md) | 100,000 | 14.36 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">synthetic</span> | ☑ | CC BY 4.0 |
| [mulberry_cot_1](./mulberry_cot_1/README.md) | 191,332 | 16.27 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | | CC BY 4.0 |
| [llava_cot_100k](./llava_cot_100k/README.md) | 63,013 | 6.67 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | | CC BY 4.0 |
| [geomverse_cot](./geomverse_cot/README.md) | 9,298 | 0.90 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | | CC BY 4.0 |
| [mapqa_cot](./mapqa_cot/README.md) | 16,832 | 1.77 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | | CC BY 4.0 |
| [plotqa_cot](./plotqa_cot/README.md) | 16,256 | 0.50 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | ☑ | CC BY 4.0 |
| [visual7w_telling_cot](./visual7w_telling_cot/README.md) | 62,592 | 0.76 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | | CC BY 4.0 |
| [visual_web_instruct_cot](./visual_web_instruct_cot/README.md) | 48,929 | 4.37 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | | CC BY 4.0 |
| [docvqa_cot](./docvqa_cot/README.md) | 36,333 | 6.38 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 |
| [chartqa_cot](./chartqa_cot/README.md) | 45,710 | 0.88 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 |
| [infographicsvqa_cot](./infographicsvqa_cot/README.md) | 19,548 | 1.41 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 |
| [mulberry_cot_2](./mulberry_cot_2/README.md) | 103,763 | 11.84 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 |
| [unigeo_cot](./unigeo_cot/README.md) | 9,728 | 0.03 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | | CC BY 4.0 |
| [nights_cot](./nights_cot/README.md) | 12,906 | 37.01 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | ☑ | CC BY 4.0 |
| [mantis_instruct_cot](./mantis_instruct_cot/README.md) | 67,723 | 9.39 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | | CC BY 4.0 |
| [fintabnet_cot](./fintabnet_cot/README.md) | 8,356 | 3.17 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 |
| [hiertext](./hiertext/README.md) | 514 | 0.02 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 |
| [nextqa](./nextqa/README.md) | 34,132 | 16.80 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 |
| [clevrer](./clevrer/README.md) | 40,000 | 11.45 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 |
| [ego_exo_learn](./ego_exo_learn/README.md) | 36,373 | 92.64 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | ☑ | CC BY 4.0 |
| [kinetics_k710](./kinetics_k710/README.md) | 647,883 | 890.53 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 |
| [perception_test_1](./perception_test_1/README.md) | 7,392 | 23.58 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | ☑ | CC BY 4.0 |
| [activity_net_1](./activity_net_1/README.md) | 10,021 | 191.49 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 |
| [hacs](./hacs/README.md) | 31,223 | 829.25 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 |
| [hirest_1](./hirest_1/README.md) | 822 | 42.50 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 |
| [perception_test_2](./perception_test_2/README.md) | 2,135 | 24.36 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | ☑ | CC BY 4.0 |
| [activity_net_2](./activity_net_2/README.md) | 9,064 | 181.24 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 |
| [hirest_2](./hirest_2/README.md) | 525 | 27.54 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 |
| [youcook2_1](./youcook2_1/README.md) | 1,180 | 77.65 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 |
| [youcook2_2](./youcook2_2/README.md) | 2,270 | 77.65 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 |
| [breakfast_actions](./breakfast_actions/README.md) | 1,204 | 3.45 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | ☑ | CC BY 4.0 |
| [ccpdf_multipage_1](./ccpdf_multipage_1/README.md) | 7,262 | 22.02 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 |
| [ccpdf_multipage_2](./ccpdf_multipage_2/README.md) | 455 | 17.81 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 |
| [perception_test_cot](./perception_test_cot/README.md) | 4,977 | 21.90 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | ☑ | CC BY 4.0 |
| [ccpdf_nv_notables](./ccpdf_nv_notables/README.md) | 14,234 | 8.54 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#D8F1C6;white-space:nowrap">human-labels</span> | | CC BY 4.0 |
| [ccpdf_nv_qa](./ccpdf_nv_qa/README.md) | 1,668 | 0.54 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 |
| [ccpdf_nv_tables](./ccpdf_nv_tables/README.md) | 4,249 | 1.85 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#D8F1C6;white-space:nowrap">human-labels</span> | | CC BY 4.0 |
| **Total** (51) | 8,147,599 | 4,470.46 | | | | |
## Tag Legend
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span>: Contains text data
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span>: Contains image data
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span>: Contains video data
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span>: Contains question answering data
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span>: Contains chain of thought reasoning data
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span>: Origin of the data is another public dataset
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">synthetic</span>: The data was synthetically generated
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span>: Labels generated by Qwen
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span>: Labels generated by GLM
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#D8F1C6;white-space:nowrap">human-labels</span>: Labels generated by human
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span>: Labels generated/transformed by simple rules
---
## Dataset Quantification
- **Total Number of Datasets**: 51
- **Total Number of Samples**: 8,147,599
- **Total Size**: 4,301.82 GB
---
## Dataset Characterization
### **Data Collection Method**
Hybrid: Synthetic, Automated, Human
### **Labeling Method**
Hybrid: Synthetic, Automated, Human
---
## Dataset Format
Each given dataset includes either:
- Text annotations (.jsonl format), referencing images or videos from source datasets, or
- Text annotations (.jsonl format) together with images or videos (in tar'ed shards).
For details on the format, check [Data Format](data_format.md).
---
## Loading the Data with Megatron Energon
This data has been prepared to be used with [Megatron Energon](https://github.com/NVIDIA/Megatron-Energon).
You can just go ahead and try it out like this:
```sh
# Install energon if you haven't already
pip install megatron-energon[av_decode] dacite
# Download this dataset (OPTION 1, slower)
git lfs install
git clone git@hf.co:datasets/nvidia/Nemotron-VLM-Dataset-v2 Nemotron-VLM-Dataset-v2
# Download this dataset (OPTION 2, modern faster way)
pip install --upgrade huggingface_hub
hf download nvidia/Nemotron-VLM-Dataset-v2 --repo-type dataset --local-dir Nemotron-VLM-Dataset-v2
# Try out the example to print a few dataset samples
cd Nemotron-VLM-Dataset-v2
python example_loader.py
```
---
## Ethical Considerations:
NVIDIA believes **Trustworthy AI** is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications.
When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.
Please report model quality, risk, security vulnerabilities or **NVIDIA AI Concerns** [here](https://app.intigriti.com/programs/nvidia/nvidiavdp/detail).
提供机构:
maas
创建时间:
2025-10-29
搜集汇总
数据集介绍

背景与挑战
背景概述
Nemotron-VLM-Dataset-v2是NVIDIA发布的大规模视觉语言模型数据集,包含约814万样本,总大小4.3TB,涵盖图像、视频和文本多模态数据。该数据集重点扩展了链式思维推理数据和视频理解任务,并提供OCR数据生成工具链,旨在提升模型推理能力和多语言处理性能,适用于商业用途的训练和评估。
以上内容由遇见数据集搜集并总结生成



