lcalvobartolome/ende_mind_topics
收藏Hugging Face2025-10-06 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/lcalvobartolome/ende_mind_topics
下载链接
链接失效反馈官方服务:
资源简介:
ENDE-MIND-Topics(英语-德语)是一个包含25,148个来自维基百科的文档片段的双语语料库,这些片段包含由PLTM模型训练得到的25个主题的主题模型信息。该数据集作为MIND管道的输入,用于多语言问答生成和差异检测。
ENDE-MIND-Topics (English–German) is a bilingual corpus of 25,148 Wikipedia-derived document chunks containing topic modeling information derived from training a PLTM model on this data with 25 topics. The dataset serves as input for the MIND pipeline, which performs multilingual question–answer generation and discrepancy detection.
提供机构:
lcalvobartolome



