emdemor/ptbr-question-and-answer
收藏Hugging Face2024-08-14 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/emdemor/ptbr-question-and-answer
下载链接
链接失效反馈官方服务:
资源简介:
这个数据集是葡萄牙语中问题和答案的汇编,来源于[clips/mqa](https://huggingface.co/datasets/clips/mqa)。数据集经过了清理和标准化处理,保留了最相关的领域,并移除了有害和不适当的文本。其主要目的是帮助自然语言处理模型和葡萄牙语嵌入模型生成更精确和上下文相关的文本和相似性计算。数据集包含4140677行数据,列包括`id`、`bucket`、`domain`、`text`、`question`和`answer`。
This dataset is a compilation of questions and answers in Portuguese available from [clips/mqa](https://huggingface.co/datasets/clips/mqa). The data has been cleaned and normalized, retaining only the most relevant domains and removing harmful and inappropriate text. Its main goal is to assist natural language processing models and Portuguese embedding models in generating more precise and contextually relevant texts and similarity calculations. The dataset contains 4140677 rows, with columns including `id`, `bucket`, `domain`, `text`, `question`, and `answer`.
提供机构:
emdemor



