
Nemotron-VLM-Dataset-v2

ModelScope community · Updated 2026-01-06 · Indexed 2025-11-03
Download link:
https://modelscope.cn/datasets/nv-community/Nemotron-VLM-Dataset-v2
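
The dataset card reproduced on this page links two Hugging Face revisions of the dataset (`main` and the initial-release commit `214051e30f0f5ef2d6cd7eb54027b38d229c8822`) and lists each subdataset under its own folder link such as `./wiki_de/README.md`. As a minimal sketch, assuming the `huggingface_hub` package is installed and that each subdataset's files live under a top-level folder of the same name (an assumption inferred from those links, not confirmed by the card), a pinned, partial download could look like this. The helper names `download_kwargs` and `download` and the default `local_dir` are hypothetical.

```python
# Sketch only: repo id and revisions are taken from the links in the dataset card;
# the helper names and the per-subdataset folder layout are assumptions.
REPO_ID = "nvidia/Nemotron-VLM-Dataset-v2"
INITIAL_RELEASE = "214051e30f0f5ef2d6cd7eb54027b38d229c8822"


def download_kwargs(subdatasets, revision="main", local_dir="nemotron_vlm_v2"):
    """Build keyword arguments for huggingface_hub.snapshot_download."""
    return {
        "repo_id": REPO_ID,
        "repo_type": "dataset",
        "revision": revision,
        # Restrict the download to the chosen subdataset folders (assumed layout).
        "allow_patterns": [f"{name}/*" for name in subdatasets],
        "local_dir": local_dir,
    }


def download(subdatasets, revision="main"):
    """Fetch only the selected subdatasets at a fixed revision (needs network access)."""
    from huggingface_hub import snapshot_download  # imported lazily on purpose

    return snapshot_download(**download_kwargs(subdatasets, revision))
```

For example, `download(["tabmwp_cot", "unigeo_cot"], revision=INITIAL_RELEASE)` would request two of the smaller subdatasets as they were at the initial release. Pinning a commit keeps later fixes, such as the 2025-11-05 update, from silently changing the training data.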
Resource description:
# Nemotron-VLM-Dataset v2

## Versions

| Date | Commit | Changes |
|-------------|--------------|----------|
| **2025-11-05** | [head](https://huggingface.co/datasets/nvidia/Nemotron-VLM-Dataset-v2/tree/main) | Fix `nights_cot` dataset. Fix/filter broken &lt;think&gt; entries. Update fintabnet instructions. Update indexes. |
| **2025-10-28** | [214051e](https://huggingface.co/datasets/nvidia/Nemotron-VLM-Dataset-v2/tree/214051e30f0f5ef2d6cd7eb54027b38d229c8822) | Initial Release |

## Dataset Description

Following up on Llama Nemotron VLM Dataset V1 with 3 million samples, we are releasing the Nemotron VLM Dataset V2 with almost three times as many high-quality samples. This time, our focus was on three main areas: adding new data modalities like video, expanding our chain-of-thought reasoning data, and providing the community with a toolchain to generate OCR training data.

We discovered that to enhance performance further, our models needed to learn not only the correct answer but also the reasoning process behind it. Adding more targeted chain-of-thought datasets proved to be the key to breaking the plateau for various benchmarks.

With this release, we are broadening the dataset scope to allow for training more capable models. We added:

- New Modalities and Domains: We have added a substantial amount of new data covering UI understanding, complex charts, and diagrams. For the first time, we are also including video understanding tasks.
- Focus on Reasoning: We have been able to break benchmark plateaus by adding more chain-of-thought data, some of which we generated by auto-labeling thinking traces for existing samples. We found that providing those traces helped especially for samples that the previous model struggled with.
- Improved OCR: We further improved on the highly competitive OCR capabilities of our first VL model by adding an even larger variety of training samples including multilingual data for six languages.
Unfortunately, we cannot redistribute a large part of those samples, but we are releasing the data generation pipeline that we used, so you can generate all that OCR data with ground truth yourself! Check it out [here](https://github.com/NVIDIA-NeMo/Curator/tree/experimental/experimental/nvpdftex).

In the table below, you can see all the subdatasets that we are publishing with their sizes, properties and link to a subdataset card with more details. For each subdataset we are publishing the annotations/labels which we generated by using various strategies, see "Source & Processing" column. The actual media data (images and videos) can only be redistributed for some of the datasets according to their licenses. For the remaining ones, we provide instructions on how to obtain the data in each of the subdataset cards.

All of the data is prepared to be used with our multi-modal data loader [Megatron Energon](https://github.com/NVIDIA/Megatron-Energon). For more details, see [this section](#loading-the-data-with-megatron-energon) below.

Our [Nemotron Nano V2 VL](https://huggingface.co/papers/2511.03929) model was trained using this data.

This dataset is ready for commercial use.

---

## Dataset Owner

NVIDIA Corporation

---

## Dataset Creation Date

10/27/2025

---

## License/Terms of Use

**Governing Terms**: This collection of datasets is governed by the [Creative Commons Attribution 4.0 International License](https://creativecommons.org/licenses/by/4.0/deed.en) (CC-BY-4.0), except for the following datasets, which are governed by the [Creative Commons Attribution-ShareAlike 4.0 International License](https://creativecommons.org/licenses/by-sa/4.0/) (CC BY-SA 4.0): dewiki_v5_0828, enwiki_v5_0828, eswiki_v5_0828, frwiki_v5_0828, itwiki_v5_0828, jawiki_v5_0828, kowiki_v5_0828, nlwiki_v5_0828, ptwiki_v5_0828, and zhwiki_v5_0828.

---

## Intended Usage

The Llama Nemotron VLM Dataset is intended to be used by the community to continue to improve open models.
The data may be freely used to train and evaluate. --- ## Dataset Composition | Dataset Name | Samples | Size (GB) | Data & Task Type | Source & Processing | Media incl. | Governing Terms | |------------|-----------:|-----------:|------------|------------|------------|------------| | [wiki_de](./wiki_de/README.md) | 200,000 | 37.13 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 | | [wiki_en](./wiki_en/README.md) | 200,000 | 33.38 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 | | [wiki_es](./wiki_es/README.md) | 200,000 | 32.85 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 
system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 | | [wiki_fr](./wiki_fr/README.md) | 200,000 | 31.14 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 | | [wiki_it](./wiki_it/README.md) | 200,000 | 30.30 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em 
.55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 | | [wiki_ja](./wiki_ja/README.md) | 200,000 | 38.39 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 | | [wiki_ko](./wiki_ko/README.md) | 200,000 | 27.09 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 | | [wiki_nl](./wiki_nl/README.md) | 200,000 | 29.52 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto 
Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 | | [wiki_pt](./wiki_pt/README.md) | 200,000 | 30.49 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 | | [wiki_zh](./wiki_zh/README.md) | 200,000 | 30.14 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe 
UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | ☑ | CC BY-SA 4.0 | | [oi_bbox_1](./oi_bbox_1/README.md) | 1,664,533 | 490.09 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | | CC BY 4.0 | | [oi_bbox_2](./oi_bbox_2/README.md) | 1,664,533 | 488.08 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 
system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | | CC BY 4.0 | | [oi_bbox_3](./oi_bbox_3/README.md) | 1,128,326 | 324.41 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> | | CC BY 4.0 | | [tabmwp_cot](./tabmwp_cot/README.md) | 20,305 | 0.28 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em 
.55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 | | [sparsetables](./sparsetables/README.md) | 100,000 | 14.36 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">synthetic</span> | ☑ | CC BY 4.0 | | [mulberry_cot_1](./mulberry_cot_1/README.md) | 191,332 | 16.27 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | 
<span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | | CC BY 4.0 | | [llava_cot_100k](./llava_cot_100k/README.md) | 63,013 | 6.67 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | | CC BY 4.0 | | [geomverse_cot](./geomverse_cot/README.md) | 9,298 | 0.90 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto 
Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | | CC BY 4.0 | | [mapqa_cot](./mapqa_cot/README.md) | 16,832 | 1.77 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto 
Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | | CC BY 4.0 | | [plotqa_cot](./plotqa_cot/README.md) | 16,256 | 0.50 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | ☑ | CC BY 4.0 | | [visual7w_telling_cot](./visual7w_telling_cot/README.md) | 62,592 | 0.76 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 
system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | | CC BY 4.0 | | [visual_web_instruct_cot](./visual_web_instruct_cot/README.md) | 48,929 | 4.37 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 
system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | | CC BY 4.0 | | [docvqa_cot](./docvqa_cot/README.md) | 36,333 | 6.38 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 | | [chartqa_cot](./chartqa_cot/README.md) | 45,710 | 0.88 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em 
.55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 | | [infographicsvqa_cot](./infographicsvqa_cot/README.md) | 19,548 | 1.41 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 | | [mulberry_cot_2](./mulberry_cot_2/README.md) | 
103,763 | 11.84 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 | | [unigeo_cot](./unigeo_cot/README.md) | 9,728 | 0.03 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span 
style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | | CC BY 4.0 | | [nights_cot](./nights_cot/README.md) | 12,906 | 37.01 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | ☑ | CC BY 4.0 | | [mantis_instruct_cot](./mantis_instruct_cot/README.md) | 67,723 | 9.39 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto 
Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | | CC BY 4.0 | | [fintabnet_cot](./fintabnet_cot/README.md) | 8,356 | 3.17 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto 
Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 | | [hiertext](./hiertext/README.md) | 514 | 0.02 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 | | [nextqa](./nextqa/README.md) | 34,132 | 16.80 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe 
UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 | | [clevrer](./clevrer/README.md) | 40,000 | 11.45 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto 
Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 | | [ego_exo_learn](./ego_exo_learn/README.md) | 36,373 | 92.64 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | ☑ | CC BY 4.0 | | [kinetics_k710](./kinetics_k710/README.md) | 647,883 | 890.53 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 
system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 | | [perception_test_1](./perception_test_1/README.md) | 7,392 | 23.58 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | ☑ | CC BY 4.0 | | [activity_net_1](./activity_net_1/README.md) | 10,021 | 191.49 | <span 
style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 | | [hacs](./hacs/README.md) | 31,223 | 829.25 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em 
.55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 | | [hirest_1](./hirest_1/README.md) | 822 | 42.50 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 | | [perception_test_2](./perception_test_2/README.md) | 2,135 | 24.36 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span 
style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | ☑ | CC BY 4.0 | | [activity_net_2](./activity_net_2/README.md) | 9,064 | 181.24 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em 
.55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 | | [hirest_2](./hirest_2/README.md) | 525 | 27.54 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 | | [youcook2_1](./youcook2_1/README.md) | 1,180 | 77.65 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span 
style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 | | [youcook2_2](./youcook2_2/README.md) | 2,270 | 77.65 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | | CC BY 4.0 | | 
[breakfast_actions](./breakfast_actions/README.md) | 1,204 | 3.45 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span> | ☑ | CC BY 4.0 | | [ccpdf_multipage_1](./ccpdf_multipage_1/README.md) | 7,262 | 22.02 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto 
Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 | | [ccpdf_multipage_2](./ccpdf_multipage_2/README.md) | 455 | 17.81 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 | | [perception_test_cot](./perception_test_cot/README.md) | 4,977 | 21.90 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 
system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span> | ☑ | CC BY 4.0 | | [ccpdf_nv_notables](./ccpdf_nv_notables/README.md) | 14,234 | 8.54 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe 
UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#D8F1C6;white-space:nowrap">human-labels</span> | | CC BY 4.0 | | [ccpdf_nv_qa](./ccpdf_nv_qa/README.md) | 1,668 | 0.54 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span> | | CC BY 4.0 | | [ccpdf_nv_tables](./ccpdf_nv_tables/README.md) | 4,249 | 1.85 | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span> | <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 
12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span> <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#D8F1C6;white-space:nowrap">human-labels</span> | | CC BY 4.0 |
| **Total** (51) | 8,147,599 | 4,470.46 | | | | |

## Tag Legend

* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">text</span>: Contains text data
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FCF1CF;white-space:nowrap">image</span>: Contains image data
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9F0F4;white-space:nowrap">video</span>: Contains video data
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">qa</span>: Contains question answering data
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#E7D0FF;white-space:nowrap">reasoning</span>: Contains chain of thought reasoning data
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto
Sans,sans-serif;color:#444;background:#CFEBD9;white-space:nowrap">public</span>: Origin of the data is another public dataset
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#C9D6FF;white-space:nowrap">synthetic</span>: The data was synthetically generated
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#EEDCFF;white-space:nowrap">qwen-labels</span>: Labels generated by Qwen
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFE0B8;white-space:nowrap">glm-labels</span>: Labels generated by GLM
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#D8F1C6;white-space:nowrap">human-labels</span>: Labels generated by humans
* <span style="display:inline-block;padding:.15em .55em;border-radius:.5em;font:600 12px/1.25 system-ui,-apple-system,Segoe UI,Roboto,Ubuntu,Cantarell,Noto Sans,sans-serif;color:#444;background:#FFD6D6;white-space:nowrap">rule-based</span>: Labels generated/transformed by simple rules

---

## Dataset Quantification

- **Total Number of Datasets**: 51
- **Total Number of Samples**: 8,147,599
- **Total Size**: 4,301.82 GB

---

## Dataset Characterization

### **Data Collection Method**
Hybrid: Synthetic, Automated, Human

### **Labeling Method**
Hybrid: Synthetic, Automated, Human

---

## Dataset Format

Each given dataset includes either:
- Text annotations (.jsonl format), referencing images or videos from source datasets, or
- Text annotations (.jsonl format) together with images or videos
  (in tar'ed shards).

For details on the format, check [Data Format](data_format.md).

---

## Loading the Data with Megatron Energon

This data has been prepared to be used with [Megatron Energon](https://github.com/NVIDIA/Megatron-Energon).

To try it out:

```sh
# Install energon if you haven't already
# (the extras spec is quoted so that shells such as zsh do not glob-expand the brackets)
pip install "megatron-energon[av_decode]" dacite

# Download this dataset (OPTION 1, slower)
git lfs install
git clone git@hf.co:datasets/nvidia/Nemotron-VLM-Dataset-v2 Nemotron-VLM-Dataset-v2

# Download this dataset (OPTION 2, modern faster way)
pip install --upgrade huggingface_hub
hf download nvidia/Nemotron-VLM-Dataset-v2 --repo-type dataset --local-dir Nemotron-VLM-Dataset-v2

# Try out the example to print a few dataset samples
cd Nemotron-VLM-Dataset-v2
python example_loader.py
```

---

## Ethical Considerations:

NVIDIA believes **Trustworthy AI** is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.

Please report model quality, risk, security vulnerabilities or **NVIDIA AI Concerns** [here](https://app.intigriti.com/programs/nvidia/nvidiavdp/detail).
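
The layout described under Dataset Format (`.jsonl` annotations, optionally alongside tar shards of media) can be inspected from the shell before setting up a full loader. The following is a minimal sketch, not part of the dataset's tooling: the helper names are hypothetical, and it assumes only that each annotation file holds one JSON object per line and that shards are plain tar archives. The actual field names and file naming are documented in `data_format.md` and are not assumed here.

```sh
# Hypothetical helpers for peeking at one sub-dataset directory.
# Assumption: annotations are JSON Lines files and media live in plain .tar shards.

# Count the records in a JSONL annotation file (one record per non-empty line).
count_jsonl_records() {
    grep -c . "$1"
}

# Print the first record of a JSONL file, e.g. to see which keys it carries.
first_jsonl_record() {
    head -n 1 "$1"
}

# List the first N member names (default 5) of a tar shard without extracting it.
list_shard_members() {
    tar -tf "$1" | head -n "${2:-5}"
}
```

After sourcing these functions, a call such as `count_jsonl_records path/to/annotations.jsonl` (placeholder path) gives a quick check against the sample counts in the table above, and `list_shard_members path/to/shard.tar` shows which media files a shard contains.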
Provider: maas
Created: 2025-10-29
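Background and Challenges
Background Overview
Nemotron-VLM-Dataset-v2 is a large-scale vision-language model dataset released by NVIDIA. It contains roughly 8.1 million samples with a total size of about 4.3 TB and covers multimodal image, video, and text data. The dataset focuses on expanding chain-of-thought reasoning data and video understanding tasks, and it provides a toolchain for generating OCR data. It aims to improve model reasoning and multilingual capabilities and is suitable for training and evaluation in commercial use.
The summary above was collected and generated by 遇见数据集.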