tattrongvu/magvit_1m
收藏Hugging Face2024-12-30 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/tattrongvu/magvit_1m
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含英文字符串和非英文字符串两个文本特征,以及一个浮点数标签序列。数据集主要用于训练模型,其中的训练集大小为约19.6GB,包含约124万条示例。数据集的具体应用场景和目的在README中未明确说明。
The dataset includes two text features, English strings and non-English strings, as well as a sequence of floating-point number labels. The training set of the dataset is about 19.6GB in size, containing about 1.24 million examples. The specific application scenario and purpose of the dataset are not explicitly stated in the README.
提供机构:
tattrongvu



