Mxode/IndustryCorpus-Subset-zh-en
收藏Hugging Face2024-09-09 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/Mxode/IndustryCorpus-Subset-zh-en
下载链接
链接失效反馈官方服务:
资源简介:
IC-Subset数据集是一个多领域文本生成数据集,涵盖化学、生物、金融、法律、艺术、代码、气候、医疗和音乐等多个领域。数据集支持英文和中文,数据量在1百万到1千万之间。
The IC-Subset dataset is a multi-domain text generation dataset covering various fields such as chemistry, biology, finance, legal, art, code, climate, medical, and music. The dataset supports both English and Chinese, with a data size ranging between 1 million and 10 million.
提供机构:
Mxode



