nhagar/CC-MAIN-2015-22_nyt_urls
Source: Hugging Face. Updated: 2025-04-02. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/nhagar/CC-MAIN-2015-22_nyt_urls
Description:
The dataset contains a single field named url, of string type. It has one split, train, with 825080 examples, and the dataset size is 88139084 bytes (about 88 MB). A default configuration is provided, with the training data stored under the path data/train-*.
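The description above can be turned into a short loading sketch. This is a minimal example, assuming the Hugging Face `datasets` library is installed and the repository is reachable. The helper names (`is_train_shard`, `mean_bytes_per_example`, `load_urls`) and the shard file name used in the usage note are hypothetical and chosen only for illustration; the repository ID, split name, path pattern, and counts come from the listing itself.

```python
from fnmatch import fnmatch

# Values taken from the dataset listing above.
REPO_ID = "nhagar/CC-MAIN-2015-22_nyt_urls"
TRAIN_PATTERN = "data/train-*"   # default config: training files live here
NUM_EXAMPLES = 825080            # rows in the train split
DATASET_BYTES = 88139084         # reported dataset size in bytes


def is_train_shard(path: str) -> bool:
    """Return True if a repository file path matches the train-split pattern."""
    return fnmatch(path, TRAIN_PATTERN)


def mean_bytes_per_example() -> float:
    """Rough average size of one record, derived from the reported totals."""
    return DATASET_BYTES / NUM_EXAMPLES


def load_urls(streaming: bool = True):
    """Load the train split (hypothetical helper).

    Imported lazily so the helpers above work without `datasets` installed.
    Assumption: to go through the mirror in the download link, setting the
    HF_ENDPOINT environment variable to https://hf-mirror.com before the
    import is the usual approach.
    """
    from datasets import load_dataset

    return load_dataset(REPO_ID, split="train", streaming=streaming)
```

With streaming enabled, `next(iter(load_urls()))["url"]` would return the first URL without downloading the whole split. The reported totals work out to roughly 107 bytes per example, which is plausible for a table of bare URLs.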
Provider:
nhagar