Orphanage/Baidu_Tieba_SunXiaochuan
收藏Hugging Face2025-06-13 更新2025-11-01 收录
下载链接:
https://hf-mirror.com/datasets/Orphanage/Baidu_Tieba_SunXiaochuan
下载链接
链接失效反馈官方服务:
资源简介:
百度贴吧孙笑川吧的随机爬取内容,大约10万条数据,不包含视频和图片,适用于风格微调。数据集遵循ChatGLM4的格式,未经过彻底清洗,包括未经清洗的原始数据和清洗后未划分训练验证集的数据。
This dataset consists of roughly 100,000 samples randomly scraped from the Sun Xiaochuan bar on Baidu Tieba, without videos or images, suitable for style fine-tuning. The data follows the format used by ChatGLM4 and is not thoroughly cleaned, including both the raw uncleaned data and the cleaned data without a train/validation split.
提供机构:
Orphanage



