humair025/urdu_fineweb-2
Source: Hugging Face. Updated: 2025-09-25. Indexed: 2025-10-25.
Download link:
https://hf-mirror.com/datasets/humair025/urdu_fineweb-2
Summary:
The dataset contains text content together with per-record metadata: ID, URL, date, file path, language and its identification score, and language script. It is split into training and test sets and can be used for NLP tasks such as text analysis and language identification.
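As a rough illustration of how the metadata described above might be used, the sketch below filters records by language and identification score. The field names (`id`, `url`, `date`, `file_path`, `language`, `language_score`, `language_script`, `text`), the language code `urd`, and the sample records are all assumptions made for illustration; they are not confirmed against the actual dataset schema.

```python
from typing import Any, Dict, Iterable, Iterator

Record = Dict[str, Any]

# Made-up sample records. Field names and values are assumptions modelled on
# the summary above, not taken from the real dataset.
SAMPLE_RECORDS = [
    {"id": "a1", "url": "https://example.com/1", "date": "2024-01-01",
     "file_path": "dump/0001", "language": "urd", "language_score": 0.98,
     "language_script": "Arab", "text": "sample text 1"},
    {"id": "a2", "url": "https://example.com/2", "date": "2024-01-02",
     "file_path": "dump/0002", "language": "urd", "language_score": 0.55,
     "language_script": "Arab", "text": "sample text 2"},
    {"id": "a3", "url": "https://example.com/3", "date": "2024-01-03",
     "file_path": "dump/0003", "language": "eng", "language_score": 0.99,
     "language_script": "Latn", "text": "sample text 3"},
]


def filter_by_language(records: Iterable[Record], language: str,
                       min_score: float) -> Iterator[Record]:
    """Yield records tagged with `language` whose score is at least `min_score`."""
    for record in records:
        same_language = record.get("language") == language
        confident = record.get("language_score", 0.0) >= min_score
        if same_language and confident:
            yield record


kept = list(filter_by_language(SAMPLE_RECORDS, "urd", 0.9))
kept_ids = [record["id"] for record in kept]
```

The same function should work on any iterable of dict-like records, for example a split loaded with the Hugging Face `datasets` library, provided the real column names match the ones assumed here.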
Provider:
humair025



