Hariprasath28/zac_sample-dataset-tokenised
收藏Hugging Face2025-04-01 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Hariprasath28/zac_sample-dataset-tokenised
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了用于训练的文本数据,具体特征包括input_ids(整数序列),labels(整数序列),以及attention_mask(整数序列)。数据集分为训练集,大小为140341字节,共有20个示例。提供了默认配置,指定了训练数据文件的路径。
The dataset consists of text data for training, including features such as input_ids (integer sequence), labels (integer sequence), and attention_mask (integer sequence). The dataset is split into a training set, which is 140341 bytes in size and contains 20 examples. A default configuration is provided, specifying the path to the training data files.
提供机构:
Hariprasath28



