rntc/pubmed_articles_prefixed_20250227
Source: Hugging Face. Updated: 2025-02-27. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/rntc/pubmed_articles_prefixed_20250227
Description:
This is a text dataset of articles. Each record includes an article ID, the article text, a document type, a domain, a language, and a language confidence score. The dataset contains only a training split, with approximately 4.13 million articles and a total size of about 129 GB.
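Because the dataset is large, reading it lazily and filtering on the language fields is one plausible way to work with it. The following is a minimal sketch, not the provider's documented usage. This page does not list the column names, so `language` and `language_score` are hypothetical keys that should be checked against the actual schema; `stream_train_split` and `filter_by_language` are illustrative helper names.

```python
from typing import Any, Dict, Iterable, Iterator

DATASET_ID = "rntc/pubmed_articles_prefixed_20250227"


def stream_train_split():
    """Open the train split lazily instead of downloading everything first.

    Requires the Hugging Face `datasets` package and network access.
    """
    from datasets import load_dataset  # imported here so the filter below has no dependencies

    return load_dataset(DATASET_ID, split="train", streaming=True)


def filter_by_language(
    records: Iterable[Dict[str, Any]],
    language: str,
    min_score: float,
    language_key: str = "language",      # hypothetical column name
    score_key: str = "language_score",   # hypothetical column name
) -> Iterator[Dict[str, Any]]:
    """Yield records in the given language whose confidence score is at least min_score."""
    for record in records:
        score = record.get(score_key)
        if record.get(language_key) == language and score is not None and score >= min_score:
            yield record
```

A possible use would be `filter_by_language(stream_train_split(), "en", 0.9)`, which iterates over records one at a time; the language code and threshold here are illustrative values only.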
Provider:
rntc



