AlgorithmicResearchGroup/edge_llm_training
Source: Hugging Face | Updated: 2024-08-21 | Indexed: 2024-12-14
Download link:
https://hf-mirror.com/datasets/AlgorithmicResearchGroup/edge_llm_training
Resource description:
The dataset has two configurations, alpaca_cleaned and c4_combined_dataset, each with a single train split. The alpaca_cleaned configuration has three string features (output, input, and instruction); its train split contains 51,760 examples and is 40,283,906 bytes (about 40 MB). The c4_combined_dataset configuration has one string feature (text); its train split contains 989,000 examples and is 2,149,163,594 bytes (about 2.15 GB).
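The configurations described above can be loaded by name. The following is a minimal sketch, not an official loader: it assumes the Hugging Face `datasets` library is installed and that the Hub repository ID matches the path in the download link. The names `REPO_ID`, `EXPECTED`, `avg_bytes_per_example`, `schema_matches`, and `load_train_split` are hypothetical helpers introduced here for illustration; the feature names and sizes are copied from the description.

```python
"""Sketch: load one configuration and compare it with the listed metadata."""

# Assumption: the Hub repo ID is the same as the path in the mirror URL.
REPO_ID = "AlgorithmicResearchGroup/edge_llm_training"

# Figures copied from the resource description (train split only).
EXPECTED = {
    "alpaca_cleaned": {
        "features": ("output", "input", "instruction"),
        "num_examples": 51_760,
        "num_bytes": 40_283_906,
    },
    "c4_combined_dataset": {
        "features": ("text",),
        "num_examples": 989_000,
        "num_bytes": 2_149_163_594,
    },
}


def avg_bytes_per_example(config_name):
    """Average listed size of one training example, in bytes."""
    info = EXPECTED[config_name]
    return info["num_bytes"] / info["num_examples"]


def schema_matches(config_name, column_names):
    """True if the given column names equal the listed features (order ignored)."""
    return set(column_names) == set(EXPECTED[config_name]["features"])


def load_train_split(config_name, streaming=False):
    """Load the train split of one configuration.

    With streaming=True, examples are fetched lazily instead of
    downloading the whole split first.
    """
    if config_name not in EXPECTED:
        raise ValueError(f"unknown configuration: {config_name!r}")
    # Imported lazily so the helpers above work without the library installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, config_name, split="train", streaming=streaming)
```

A typical use would be `ds = load_train_split("alpaca_cleaned")` followed by `schema_matches("alpaca_cleaned", ds.column_names)`. For c4_combined_dataset, which is listed at roughly 2.15 GB, passing `streaming=True` is likely the more practical choice on storage-constrained machines.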
Provided by:
AlgorithmicResearchGroup



