BASF-AI/PubChemWikiESPC
收藏Hugging Face2024-12-05 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/BASF-AI/PubChemWikiESPC
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是[PubChem & Wikipedia Paragraphs Pair Classification](https://huggingface.co/datasets/BASF-AI/PubChemWikiParagraphsPC)数据集的多语言扩展。它包括英语和西班牙语的段落对(sent1和sent2),以及一个二进制标签列,用于指示段落是否描述相同的实体(1)或不同的实体(0)。
This dataset is a multilingual extension of the [PubChem & Wikipedia Paragraphs Pair Classification](https://huggingface.co/datasets/BASF-AI/PubChemWikiParagraphsPC) dataset. It includes pairs of paragraphs in English and Spanish (sent1 and sent2) with a binary labels column indicating whether the paragraphs describe the same entity (1) or different entities (0).
提供机构:
BASF-AI



