dk-crazydiv/huggingface-modelhub
Hugging Face | Updated 2021-06-20 | Indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/dk-crazydiv/huggingface-modelhub
Resource description:
## Summary
Metadata for all models uploaded to the [HuggingFace modelhub](https://huggingface.co/models).
The dataset was last updated on 15 June 2021 and contains information on 10,354 models (v1).
Only the `train` split is provided.
#### Update v1.0.2: added the `downloads_last_month` and `library` fields
The same dataset is available on [Kaggle](https://www.kaggle.com/crazydiv/huggingface-modelhub).
## Loading data
```python
from datasets import load_dataset
modelhub_dataset = load_dataset("dk-crazydiv/huggingface-modelhub")
```
### Useful commands:
```python
modelhub_dataset["train"]  # Access the train split (the only split available)
modelhub_dataset["train"][0]  # Access a dataset element by index
modelhub_dataset["train"].features  # Get the columns present in the dataset
```
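Indexing the `train` split by position returns one record as a plain Python dict, so ordinary Python is enough to summarize the data. The following is a minimal sketch of counting records per value of a field such as `library`. The helper name `count_by_field` and the second stand-in record are hypothetical and used only for illustration; the in-memory list is there so the snippet runs without downloading anything.

```python
from collections import Counter


def count_by_field(records, field):
    """Count how often each value of `field` occurs across an iterable of record dicts."""
    # .get() tolerates records where the field is absent; those are counted under None.
    return Counter(record.get(field) for record in records)


# Hand-written stand-in records (an assumption for illustration, not real dataset rows).
# The first uses values from the albert-base-v1 record; the second is invented.
sample_records = [
    {"modelId": "albert-base-v1", "library": "transformers", "pipeline_tag": "fill-mask"},
    {"modelId": "hypothetical/model", "library": "transformers", "pipeline_tag": None},
]

library_counts = count_by_field(sample_records, "library")
print(library_counts.most_common())
```

With the dataset loaded, the same helper should accept `modelhub_dataset["train"]` in place of `sample_records`, since iterating over the split yields one dict per record.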
### Sample record:
```json
{
  "downloads_last_month": 7474,
  "files": [
    ".gitattributes",
    "README.md",
    "config.json",
    "pytorch_model.bin",
    "spiece.model",
    "tf_model.h5",
    "tokenizer.json",
    "with-prefix-tf_model.h5"
  ],
  "lastModified": "2021-01-13T15:08:24.000Z",
  "library": "transformers",
  "modelId": "albert-base-v1",
  "pipeline_tag": "fill-mask",
  "publishedBy": "huggingface",
  "tags": [
    "pytorch",
    "tf",
    "albert",
    "masked-lm",
    "en",
    "dataset:bookcorpus",
    "dataset:wikipedia",
    "arxiv:1909.11942",
    "transformers",
    "exbert",
    "license:apache-2.0",
    "fill-mask"
  ],
  "modelCard": "Readme sample data..."
}
```
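Several entries in the `tags` list above follow a `prefix:value` pattern (`dataset:`, `arxiv:`, `license:`), while others are plain labels. The sketch below shows one way to separate the two kinds. It assumes this pattern holds for other records too, which the sample alone does not confirm, and the helper name `group_tags` is hypothetical.

```python
from collections import defaultdict


def group_tags(tags):
    """Split tags into plain labels and a prefix -> values mapping for 'prefix:value' tags."""
    plain = []
    prefixed = defaultdict(list)
    for tag in tags:
        # partition() splits on the first ':' only; sep is '' when no ':' is present.
        prefix, sep, value = tag.partition(":")
        if sep:
            prefixed[prefix].append(value)
        else:
            plain.append(tag)
    return plain, dict(prefixed)


# Tags copied from the albert-base-v1 sample record.
tags = [
    "pytorch", "tf", "albert", "masked-lm", "en",
    "dataset:bookcorpus", "dataset:wikipedia", "arxiv:1909.11942",
    "transformers", "exbert", "license:apache-2.0", "fill-mask",
]

plain, prefixed = group_tags(tags)
print(prefixed["dataset"])  # ['bookcorpus', 'wikipedia']
print(prefixed["license"])  # ['apache-2.0']
```

Splitting on the first colon only keeps any later colons inside the value, which seems the safer choice if a value ever contains one.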
## Bugs:
Please report bugs or suggest improvements to me on [Twitter](https://twitter.com/kartik_godawat).
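Provider:
dk-crazydiv
Summary of original information
Dataset overview
Basic information
- Dataset name: HuggingFace modelhub metadata
- Dataset description: Metadata for all models uploaded to the HuggingFace modelhub.
- Dataset size: Information on 10,354 models.
- Last updated: 15 June 2021.
- Dataset version: v1.0.2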
Dataset contents
- Dataset splits: Only the `train` split is provided.
- Added data: Version v1.0.2 added the `downloads_last_month` and `library` fields.
Dataset structure
- Example dataset element (JSON): `{ "downloads_last_month": 7474, "files": [...], "lastModified": "2021-01-13T15:08:24.000Z", "library": "transformers", "modelId": "albert-base-v1", "pipeline_tag": "fill-mask", "publishedBy": "huggingface", "tags": [...] }`
Dataset access
- Load command (Python): `from datasets import load_dataset`, then `modelhub_dataset = load_dataset("dk-crazydiv/huggingface-modelhub")`
- Access commands (Python):
  - `modelhub_dataset["train"]`: access the train split
  - `modelhub_dataset["train"][0]`: access a dataset element by index
  - `modelhub_dataset["train"].features`: get the columns present in the dataset



