MIN12352/codellama_java_python-tokenized-128
收藏Hugging Face2024-09-08 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/MIN12352/codellama_java_python-tokenized-128
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含整数序列作为其特征,主要用于训练机器学习模型。数据集分为训练集,共有2,083,276个示例,总大小为1,074,970,416字节。提供的配置信息中,默认配置指定了训练集的数据文件路径。
The dataset contains integer sequences as its features, primarily used for training machine learning models. The dataset is split into a training set with a total of 2,083,276 examples, with a total size of 1,074,970,416 bytes. The default configuration specifies the data file path for the training set.
提供机构:
MIN12352



