cabusar/gutenberg-txt-fr
Hugging Face | Updated: 2024-03-30 | Indexed: 2024-06-11
Download link:
https://hf-mirror.com/datasets/cabusar/gutenberg-txt-fr
Resource description:
---
license: unknown
task_categories:
- text-classification
- question-answering
- text2text-generation
language:
- fr
tags:
- rag
---
This dataset is not yet refined enough for anything beyond testing: it contains a mix of French and English text, along with special characters (French accented letters).
# Dataset Card for gutenberg-txt-fr
218 French books in plain-text (.txt) format, from Project Gutenberg.
### Dataset Sources
- **Project Gutenberg**: https://www.gutenberg.org
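Plain-text files distributed by Project Gutenberg usually wrap the book in an English-language header and license footer, which may account for part of the English text mentioned in the note at the top of this card. The following is a minimal sketch, not a verified cleaning step for this dataset: it assumes the files still contain the common `*** START OF ... PROJECT GUTENBERG EBOOK ...` and `*** END OF ...` marker lines, and the function name `strip_gutenberg_boilerplate` is hypothetical.

```python
import re

# Marker lines commonly found in Project Gutenberg plain-text files.
# Assumption: the files in this dataset keep these markers. Their exact
# wording varies between releases, so the patterns are deliberately loose.
START_RE = re.compile(
    r"^\*\*\*\s*START OF (?:THE|THIS) PROJECT GUTENBERG EBOOK.*$",
    re.IGNORECASE | re.MULTILINE,
)
END_RE = re.compile(
    r"^\*\*\*\s*END OF (?:THE|THIS) PROJECT GUTENBERG EBOOK.*$",
    re.IGNORECASE | re.MULTILINE,
)


def strip_gutenberg_boilerplate(text: str) -> str:
    """Return the text between the START and END markers.

    If a marker is missing, that side of the text is left untouched.
    """
    start = START_RE.search(text)
    begin = start.end() if start else 0
    end = END_RE.search(text, begin)
    finish = end.start() if end else len(text)
    return text[begin:finish].strip()
```

If the markers are absent, the function returns the input unchanged apart from trimming surrounding whitespace, so it is safe to run over every file.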
## Uses
Could be used for RAG; the data is not labeled.
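As an illustration of how the txt files might feed a RAG test, here is a minimal sketch that reads one book and splits it into overlapping character chunks for later embedding and retrieval. Everything in it is an assumption rather than part of the dataset: the helper names `read_book` and `chunk_text`, the UTF-8 encoding, and the default chunk size and overlap.

```python
from pathlib import Path


def read_book(path: str) -> str:
    """Read one book as text.

    Assumption: files are UTF-8 encoded. Undecodable bytes are replaced
    rather than raising, since the card warns about special characters.
    """
    return Path(path).read_text(encoding="utf-8", errors="replace")


def chunk_text(text: str, chunk_size: int = 1000, overlap: int = 200) -> list[str]:
    """Split text into fixed-size character chunks that overlap."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in the range [0, chunk_size)")

    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current chunk reaches the end of the text,
        # so no redundant tail chunks are produced.
        if start + chunk_size >= len(text):
            break
    return chunks
```

Character-based chunking is chosen here only because it needs no extra dependencies. A real pipeline would more likely split on sentences or tokens.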
## Why?
I needed a dataset of txt files to test RAG pipelines and couldn't find a suitable one, so I created this one and shared it. :)
Provider:
cabusar
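Summary of original information
Dataset overview
Basic information
- License: unknown
- Task categories:
  - Text classification
  - Question answering
  - Text-to-text generation
- Language: French
- Tags: RAG
Dataset details
- Name: gutenberg-txt-fr
- Content: 218 French books in plain-text file format
- Source: Project Gutenberg (https://www.gutenberg.org)
Use cases
- Suitable for RAG; the data is not labeled
Purpose
- Created to test RAG pipelines, because no suitable existing dataset was found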



