fredk8/fred_core_document_corpus_v1
收藏Hugging Face2025-10-10 更新2025-11-30 收录
下载链接:
https://hf-mirror.com/datasets/fredk8/fred_core_document_corpus_v1
下载链接
链接失效反馈官方服务:
资源简介:
Fred测试语料库是一个异构的PDF文档集合,用于对Fred开源智能代理平台的文档摄入、向量化以及检索组件进行验证和基准测试。它包括来自欧洲中央银行、经济合作与发展组织以及arXiv.org人工智能类别的实际世界文档。该数据集专为功能测试和性能测试设计,用于开源实验、演示和可重现性测试,而非生产环境。
The Fred Test Corpus is a heterogeneous collection of PDF documents used for validating and benchmarking the document ingestion, vectorization, and retrieval (RAG) components of the Fred open-source agentic AI platform. It includes real-world documents from the European Central Bank, the Organisation for Economic Co-operation and Development, and the arXiv.org Artificial Intelligence category. This dataset is designed for functional and performance testing, for open-source experimentation, demos, and reproducibility testing, not for production use.
提供机构:
fredk8



