DSULT-Core/2ch.sc
收藏Hugging Face2025-09-11 更新2025-10-18 收录
下载链接:
https://hf-mirror.com/datasets/DSULT-Core/2ch.sc
下载链接
链接失效反馈官方服务:
资源简介:
2ch.sc语料库是一个大规模的日本匿名网络论坛用户生成文本数据集,覆盖了数十年间的日本互联网文化、对话和公众意见。数据集包含数十亿个跨数千个特定主题板块的帖子,为研究非正式语言、在线亚文化和社会趋势提供了无与伦比的资源。
The 2ch.sc Corpus is a large-scale dataset of user-generated text from 2ch.sc, an anonymous Japanese textboard. Spanning decades, it covers Japanese internet culture, dialogue, and public opinion. The dataset comprises billions of posts across thousands of topic-specific boards, offering an unparalleled resource for studying informal language, online subcultures, and social trends.
提供机构:
DSULT-Core



