Japanese-Novels-23M
Source: ModelScope community. Updated: 2025-10-09. Indexed: 2025-07-12.
Download link:
https://modelscope.cn/datasets/OmniAICreator/Japanese-Novels-23M
Description:
# Japanese-Novels-23M
This dataset contains Japanese web novels that I collected personally.
**Machine-Learning Use Only**
Access is restricted to bona fide machine-learning–related purposes.
To request access, please provide a detailed explanation of the specific tasks or applications for which you intend to use the dataset.
- Total records: 23,212,809
- Total characters: 80,846,120,027
- Total tokens (Llama 4 tokenizer): 55,406,468,406 (55.4 B)
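
The totals above imply some rough per-record averages, which can help when estimating sequence lengths or storage needs. The following is a minimal sketch that derives them from the three published figures only. The function name `derived_stats` and the dictionary keys are illustrative and not part of the dataset. The card does not state what a record is (for example, a whole novel or a single chapter), so the averages should be read with that in mind.

```python
# Figures copied from the dataset card above.
TOTAL_RECORDS = 23_212_809
TOTAL_CHARACTERS = 80_846_120_027
TOTAL_TOKENS = 55_406_468_406  # Llama 4 tokenizer, per the card


def derived_stats(records: int, characters: int, tokens: int) -> dict[str, float]:
    """Return simple averages derived from the published totals.

    Hypothetical helper for illustration; not part of the dataset.
    """
    return {
        "chars_per_record": characters / records,
        "tokens_per_record": tokens / records,
        "chars_per_token": characters / tokens,
    }


stats = derived_stats(TOTAL_RECORDS, TOTAL_CHARACTERS, TOTAL_TOKENS)
for name, value in stats.items():
    print(f"{name}: {value:.2f}")
```

These are means over the whole corpus; the card gives no information about how record lengths are distributed.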
Provider:
maas
Created:
2025-07-07



