asadfgglie/TaiwanChat
收藏Hugging Face2025-01-19 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/asadfgglie/TaiwanChat
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含台湾聊天数据的重整理版本,数据集以消息和图像为特征。消息特征包括内容、发送者姓名和角色,图像特征则为图像序列。数据集分为训练集,包含485432个样本,总大小为642038889字节。此数据集是原始[yentinglin/TaiwanChat]数据集的修改版本,主要是将`<image>`标签替换为`<img>`以适应LLama-factory训练。
This is a reorganized version of a Taiwan chat dataset, featuring messages and images. The message feature includes content, senders name, and role, while the image feature is a sequence of images. The dataset is split into a training set with 485432 samples, totaling 642038889 bytes. This dataset is a modified version of the original [yentinglin/TaiwanChat] dataset, primarily replacing `<image>` tags with `<img>` to accommodate LLama-factory training.
提供机构:
asadfgglie



