davanstrien/card_with_first_commit_embedded
收藏Hugging Face2023-05-23 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/davanstrien/card_with_first_commit_embedded
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: modelId
dtype: string
- name: tags
sequence: string
- name: pipeline_tag
dtype: string
- name: config
struct:
- name: architectures
sequence: string
- name: model_type
dtype: string
- name: task_specific_params
struct:
- name: conversational
struct:
- name: max_length
dtype: float64
- name: summarization
struct:
- name: early_stopping
dtype: bool
- name: length_penalty
dtype: float64
- name: max_length
dtype: float64
- name: min_length
dtype: float64
- name: no_repeat_ngram_size
dtype: float64
- name: num_beams
dtype: float64
- name: prefix
dtype: string
- name: text-generation
struct:
- name: do_sample
dtype: bool
- name: max_length
dtype: float64
- name: translation_en_to_de
struct:
- name: early_stopping
dtype: bool
- name: max_length
dtype: float64
- name: num_beams
dtype: float64
- name: prefix
dtype: string
- name: translation_en_to_fr
struct:
- name: early_stopping
dtype: bool
- name: max_length
dtype: float64
- name: num_beams
dtype: float64
- name: prefix
dtype: string
- name: translation_en_to_ro
struct:
- name: early_stopping
dtype: bool
- name: max_length
dtype: float64
- name: num_beams
dtype: float64
- name: prefix
dtype: string
- name: downloads
dtype: int64
- name: first_commit
dtype: timestamp[ns, tz=UTC]
- name: card
dtype: string
- name: embedding
sequence: float32
splits:
- name: train
num_bytes: 177783576
num_examples: 30344
download_size: 137071859
dataset_size: 177783576
---
# Dataset Card for "card_with_first_commit_embedded"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
davanstrien
原始信息汇总
数据集概述
数据集特征
- modelId: 字符串类型
- tags: 字符串序列类型
- pipeline_tag: 字符串类型
- config: 结构体类型,包含以下子特征:
- architectures: 字符串序列类型
- model_type: 字符串类型
- task_specific_params: 结构体类型,包含多个任务特定的子结构体,如:
- conversational: 结构体,包含:
- max_length: 浮点数类型
- summarization: 结构体,包含:
- early_stopping: 布尔类型
- length_penalty: 浮点数类型
- max_length: 浮点数类型
- min_length: 浮点数类型
- no_repeat_ngram_size: 浮点数类型
- num_beams: 浮点数类型
- prefix: 字符串类型
- text-generation: 结构体,包含:
- do_sample: 布尔类型
- max_length: 浮点数类型
- translation_en_to_de: 结构体,包含:
- early_stopping: 布尔类型
- max_length: 浮点数类型
- num_beams: 浮点数类型
- prefix: 字符串类型
- translation_en_to_fr: 结构体,包含:
- early_stopping: 布尔类型
- max_length: 浮点数类型
- num_beams: 浮点数类型
- prefix: 字符串类型
- translation_en_to_ro: 结构体,包含:
- early_stopping: 布尔类型
- max_length: 浮点数类型
- num_beams: 浮点数类型
- prefix: 字符串类型
- conversational: 结构体,包含:
- downloads: 整数类型
- first_commit: 时间戳类型,时区为UTC
- card: 字符串类型
- embedding: 浮点数序列类型
数据集分割
- train:
- 字节数: 177783576
- 示例数: 30344
数据集大小
- 下载大小: 137071859字节
- 数据集大小: 177783576字节



