nhagar/fulg_urls
收藏Hugging Face2025-05-15 更新2025-08-30 收录
下载链接:
https://hf-mirror.com/datasets/nhagar/fulg_urls
下载链接
链接失效反馈官方服务:
资源简介:
fulg_urls数据集包含来源于[faur-ai/fulg](https://huggingface.co/datasets/faur-ai/fulg)数据集的URLs和顶级域名。该数据集通过下载源数据、提取URLs和顶级域名,并仅保留这些记录标识符而创建。这使得研究人员和实践者可以探索这些训练数据集的内容,而无需管理数TB的原始文本数据。
The fulg_urls dataset includes URLs and top-level domains sourced from the [faur-ai/fulg](https://huggingface.co/datasets/faur-ai/fulg) dataset. It is created by downloading the source data, extracting URLs and top-level domains, and retaining only the record identifiers. This allows researchers and practitioners to explore the contents of these training datasets without having to manage terabytes of raw text data.
提供机构:
nhagar



