uv-scripts/dataset-creation
收藏Hugging Face2025-07-23 更新2025-08-09 收录
下载链接:
https://hf-mirror.com/datasets/uv-scripts/dataset-creation
下载链接
链接失效反馈官方服务:
资源简介:
这是一个可以将本地PDF文件目录转换为Hugging Face数据集的脚本。它支持从文件夹结构自动标记,无需配置即可使用,并能直接上传数据集到Hugging Face Hub。适用于处理大量PDF文件,并可以根据需要进行文本提取、图像转换等操作。
This is a script that converts local PDF file directories into Hugging Face datasets. It supports automatic labeling from folder structures, requires no configuration to use, and can directly upload datasets to the Hugging Face Hub. It is suitable for processing a large number of PDF files and allows for operations such as text extraction and image conversion as needed.
提供机构:
uv-scripts



