
WildDoc

ModelScope community | Updated 2026-01-06 | Indexed 2025-05-24
Download link:
https://modelscope.cn/datasets/ByteDance/WildDoc
Resource description:
# Dataset Card

## [WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?](https://arxiv.org/abs/2505.11015)

## - Project Homepage

https://bytedance.github.io/WildDoc/

## - Direct usage

The dataset is designed to evaluate the document understanding capabilities of vision-language models (VLMs) in real-world conditions, with the aim of advancing document understanding in the wild.

### -- Hugging Face dataloader

```
from datasets import load_dataset

dataset = load_dataset("ByteDance/WildDoc")
```

## - Out-of-Scope usage

Academic use only. Commercial use is not supported.

## - Bias, Risks, and Limitations

Your access to and use of this dataset are at your own risk. We do not guarantee the accuracy of this dataset. The dataset is provided “as is,” and we make no warranty or representation to you with respect to it and we expressly disclaim, and hereby expressly waive, all warranties, express, implied, statutory or otherwise. This includes, without limitation, warranties of quality, performance, merchantability or fitness for a particular purpose, non-infringement, absence of latent or other defects, accuracy, or the presence or absence of errors, whether or not known or discoverable.

In no event will we be liable to you on any legal theory (including, without limitation, negligence) or otherwise for any direct, special, indirect, incidental, consequential, punitive, exemplary, or other losses, costs, expenses, or damages arising out of this public license or use of the licensed material. The disclaimer of warranties and limitation of liability provided above shall be interpreted in a manner that, to the extent possible, most closely approximates an absolute disclaimer and waiver of all liability.

## - Citation

```
@misc{wang2025wilddoc,
  title={WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?},
  author={An-Lan Wang and Jingqun Tang and Liao Lei and Hao Feng and Qi Liu and Xiang Fei and Jinghui Lu and Han Wang and Weiwei Liu and Hao Liu and Yuliang Liu and Xiang Bai and Can Huang},
  year={2025},
  eprint={2505.11015},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2505.11015},
}
```
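As a follow-up to the loader call in the Direct usage section, the sketch below shows one way to inspect what `load_dataset` returns without assuming anything about the dataset's layout. The helper names `load_wilddoc` and `split_sizes` are hypothetical and not part of the dataset card; the card does not document split names or columns, so the code discovers them at run time rather than hard-coding them.

```python
from typing import Any, Dict, Mapping, Sized


def load_wilddoc(repo_id: str = "ByteDance/WildDoc") -> Any:
    """Load the dataset via the Hugging Face `datasets` library.

    Hypothetical convenience wrapper. The import is deferred so that
    `split_sizes` below stays usable without `datasets` installed.
    """
    from datasets import load_dataset  # requires: pip install datasets

    # With no `split` argument, load_dataset returns a mapping of
    # split name -> dataset.
    return load_dataset(repo_id)


def split_sizes(dataset: Mapping[str, Sized]) -> Dict[str, int]:
    """Return the number of examples per split, whatever the splits are called."""
    return {name: len(split) for name, split in dataset.items()}


# Example usage (needs network access, so it is left commented out):
# ds = load_wilddoc()
# for name, count in split_sizes(ds).items():
#     print(f"{name}: {count} examples, columns: {ds[name].column_names}")
```

Printing the split sizes and column names first is a cheap way to confirm the download succeeded and to learn the field names before writing an evaluation loop.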

Provider:
maas
Created:
2025-05-17