Ihssane123/fineweb2-ary_Arab-CC-MAIN-2017-39
Source: Hugging Face. Updated 2026-03-23; indexed 2026-03-29.
Download link:
https://hf-mirror.com/datasets/Ihssane123/fineweb2-ary_Arab-CC-MAIN-2017-39
Description:
---
language:
- ary_Arab
license: odc-by
source_datasets:
- HuggingFaceFW/fineweb-2
tags:
- fineweb2
- web-crawl
- text
---
# fineweb2-ary_Arab-CC-MAIN-2017-39
Extracted from [HuggingFaceFW/fineweb-2](https://huggingface.co/datasets/HuggingFaceFW/fineweb-2) using [snapsift](https://github.com/Ihssane123/snapsift).
**Language:** `ary_Arab`
**Source:** `HuggingFaceFW/fineweb-2`
## Snapshots included
- `CC-MAIN-2017-39`
## Usage
```python
import polars as pl

# Read every Parquet shard of the dataset directly from the Hugging Face Hub.
df = pl.read_parquet("hf://datasets/Ihssane123/fineweb2-ary_Arab-CC-MAIN-2017-39/data/*.parquet")
```
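If loading every shard into memory at once is more than you need, a lazy scan may be a lighter way to take a first look. The sketch below is only an illustration: the names `REPO`, `parquet_glob`, and `preview` are hypothetical helpers, not part of this dataset or of snapsift, and it assumes a Polars version whose `scan_parquet` accepts `hf://` paths and that the shards live under `data/` as in the snippet above.

```python
REPO = "Ihssane123/fineweb2-ary_Arab-CC-MAIN-2017-39"


def parquet_glob(repo_id: str) -> str:
    """Build the hf:// glob for a dataset's Parquet shards (assumes a data/ folder)."""
    return f"hf://datasets/{repo_id}/data/*.parquet"


def preview(repo_id: str = REPO, n: int = 5):
    """Return the first n rows as a DataFrame without materializing the full dataset."""
    # Imported here so parquet_glob stays usable without Polars installed.
    import polars as pl

    # scan_parquet builds a lazy query; rows are only read when collect() runs.
    return pl.scan_parquet(parquet_glob(repo_id)).head(n).collect()
```

Calling `preview()` needs network access to the Hub, and `preview(n=20)` returns more rows. Inspecting the result's `.schema` is a reasonable first step, since the column names are not listed on this card.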
Provider:
Ihssane123



