datalab-to/marker_benchmark_comparison_olmocr_llm
收藏Hugging Face2025-02-28 更新2025-04-08 收录
下载链接:
https://hf-mirror.com/datasets/datalab-to/marker_benchmark_comparison_olmocr_llm
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了一系列的字段,包括唯一标识符(uuid)、分类(classification)、语言(language)、图像(img)、标记相关描述(marker_md)、标记图像(marker_img)、标记启发式得分(marker_heuristic)、标记启发式详情(marker_heuristic_detail)、标记LLM得分(marker_llm)、标记LLM详情(marker_llm_detail)、OLMOCR相关描述(olmocr_md)、OLMOCR图像(olmocr_img)、OLMOCR启发式得分(olmocr_heuristic)、OLMOCR启发式详情(olmocr_heuristic_detail)和OLMOCR LLM得分(olmocr_llm)、OLMOCR LLM详情(olmocr_llm_detail)。数据集分为训练集,包含1109个示例,总大小约为597.62MB。未提供具体的数据集用途和背景。
The dataset includes fields such as uuid, classification, language, image (img), marker description (marker_md), marker image (marker_img), marker heuristic score (marker_heuristic), marker heuristic detail (marker_heuristic_detail), marker LLM score (marker_llm), marker LLM detail (marker_llm_detail), OLMOCR description (olmocr_md), OLMOCR image (olmocr_img), OLMOCR heuristic score (olmocr_heuristic), OLMOCR heuristic detail (olmocr_heuristic_detail), and OLMOCR LLM score (olmocr_llm), OLMOCR LLM detail (olmocr_llm_detail). The dataset is split into a training set with 1109 examples, totaling approximately 597.62MB in size. No specific purpose or background of the dataset is provided.
提供机构:
datalab-to



