librarian-bots/card_with_first_commit
收藏Hugging Face2023-06-27 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/librarian-bots/card_with_first_commit
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: modelId
dtype: string
- name: tags
sequence: string
- name: pipeline_tag
dtype: string
- name: config
struct:
- name: architectures
sequence: string
- name: model_type
dtype: string
- name: task_specific_params
struct:
- name: conversational
struct:
- name: max_length
dtype: float64
- name: summarization
struct:
- name: early_stopping
dtype: bool
- name: length_penalty
dtype: float64
- name: max_length
dtype: float64
- name: min_length
dtype: float64
- name: no_repeat_ngram_size
dtype: float64
- name: num_beams
dtype: float64
- name: prefix
dtype: string
- name: text-generation
struct:
- name: do_sample
dtype: bool
- name: max_length
dtype: float64
- name: translation_en_to_de
struct:
- name: early_stopping
dtype: bool
- name: max_length
dtype: float64
- name: num_beams
dtype: float64
- name: prefix
dtype: string
- name: translation_en_to_fr
struct:
- name: early_stopping
dtype: bool
- name: max_length
dtype: float64
- name: num_beams
dtype: float64
- name: prefix
dtype: string
- name: translation_en_to_ro
struct:
- name: early_stopping
dtype: bool
- name: max_length
dtype: float64
- name: num_beams
dtype: float64
- name: prefix
dtype: string
- name: downloads
dtype: int64
- name: first_commit
dtype: timestamp[ns, tz=UTC]
- name: card
dtype: string
splits:
- name: train
num_bytes: 20198907.41971414
num_examples: 30344
download_size: 25260494
dataset_size: 20198907.41971414
task_categories:
- text-classification
- feature-extraction
- fill-mask
language:
- en
tags:
- model cards
pretty_name: Model card READMEs with first commit information
size_categories:
- 10K<n<100K
---
# Dataset Card for "card_with_first_commit"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
librarian-bots
原始信息汇总
数据集概述
数据集特征
- modelId: 字符串类型
- tags: 字符串序列类型
- pipeline_tag: 字符串类型
- config: 结构体类型,包含以下子特征:
- architectures: 字符串序列类型
- model_type: 字符串类型
- task_specific_params: 结构体类型,包含多个任务特定的参数结构体,如:
- conversational: 结构体,包含 max_length(浮点数64位)
- summarization: 结构体,包含多个参数(布尔型、浮点数64位、字符串)
- text-generation: 结构体,包含 do_sample(布尔型)和 max_length(浮点数64位)
- translation_en_to_de: 结构体,包含多个参数(布尔型、浮点数64位、字符串)
- translation_en_to_fr: 结构体,包含多个参数(布尔型、浮点数64位、字符串)
- translation_en_to_ro: 结构体,包含多个参数(布尔型、浮点数64位、字符串)
- downloads: 整数64位类型
- first_commit: 时间戳类型,纳秒精度,时区为UTC
- card: 字符串类型
数据集分割
- train:
- 数据量: 20198907.41971414 字节
- 示例数量: 30344
数据集大小
- 下载大小: 25260494 字节
- 数据集大小: 20198907.41971414 字节
任务类别
- text-classification
- feature-extraction
- fill-mask
语言
- en
标签
- model cards
数据集名称
- pretty_name: Model card READMEs with first commit information
大小类别
- 10K<n<100K



