harryrobert/latex-ocr-v3
收藏Hugging Face2026-04-03 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/harryrobert/latex-ocr-v3
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
configs:
- config_name: default
data_files:
- split: mlp_train
path: data/mlp_train-*
- split: full_train
path: data/full_train-*
- split: sft_train
path: data/sft_train-*
- split: dev
path: data/dev-*
- split: test
path: data/test-*
dataset_info:
features:
- name: index
dtype: int64
- name: image
struct:
- name: bytes
dtype: binary
- name: path
dtype: string
- name: label
dtype: string
splits:
- name: mlp_train
num_bytes: 3439045544
num_examples: 574490
- name: full_train
num_bytes: 1112874908
num_examples: 127108
- name: sft_train
num_bytes: 21279794
num_examples: 2416
- name: dev
num_bytes: 85008743
num_examples: 16950
- name: test
num_bytes: 83152757
num_examples: 16557
download_size: 4697443548
dataset_size: 4741361746
---
提供机构:
harryrobert



