khaledsayed1/fineweb-bbc-news-embeddings-DuckDB
收藏Hugging Face2025-11-11 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/khaledsayed1/fineweb-bbc-news-embeddings-DuckDB
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含URL、文本内容和对应的嵌入向量。它有一个训练集,包含179829个样本,总文件大小为约923MB。提供的配置项为默认配置,指定了训练集文件的路径。
The dataset includes URL, text content, and corresponding embedding vectors. It has a training set with 179,829 samples, with a total file size of approximately 923MB. The provided configuration is the default one, specifying the path to the training set files.
提供机构:
khaledsayed1



