MallardDuck/scryfall-dataset
收藏Hugging Face2024-02-07 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/MallardDuck/scryfall-dataset
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
dataset_info:
features:
- name: id
dtype: string
- name: oracle_id
dtype: string
- name: name
dtype: string
- name: released_at
dtype: string
- name: uri
dtype: string
- name: mana_cost
dtype: string
- name: cmc
dtype: float64
- name: type_line
dtype: string
- name: oracle_text
dtype: string
- name: power
dtype: string
- name: toughness
dtype: string
- name: loyalty
dtype: string
- name: colors
sequence: string
- name: color_identity
sequence: string
- name: keywords
sequence: string
- name: edhrec_rank
dtype: float64
splits:
- name: train
num_bytes: 13145409
num_examples: 30845
- name: train_clean
num_bytes: 11096387
num_examples: 25438
download_size: 12595090
dataset_size: 24241796
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: train_clean
path: data/train_clean-*
---
This dataset contains information related to card games, including card IDs, names, release dates, mana costs, types, descriptions, power, toughness, loyalty, colors, color identities, keywords, and EDHREC rankings. The dataset is divided into a training set and a cleaned training set, with 30845 and 25438 examples respectively.
提供机构:
MallardDuck
原始信息汇总
数据集概述
数据集信息
- 许可证: MIT
特征信息
数据集包含以下特征:
- id: 数据类型为字符串
- oracle_id: 数据类型为字符串
- name: 数据类型为字符串
- released_at: 数据类型为字符串
- uri: 数据类型为字符串
- mana_cost: 数据类型为字符串
- cmc: 数据类型为浮点数 (float64)
- type_line: 数据类型为字符串
- oracle_text: 数据类型为字符串
- power: 数据类型为字符串
- toughness: 数据类型为字符串
- loyalty: 数据类型为字符串
- colors: 数据类型为字符串序列
- color_identity: 数据类型为字符串序列
- keywords: 数据类型为字符串序列
- edhrec_rank: 数据类型为浮点数 (float64)
数据分割
数据集分为以下几个部分:
- train: 包含30845个样本,总字节数为13145409
- train_clean: 包含25438个样本,总字节数为11096387
数据集大小
- 下载大小: 12595090字节
- 数据集总大小: 24241796字节
配置信息
- 默认配置: default
- 数据文件:
- train: 路径为
data/train-* - train_clean: 路径为
data/train_clean-*
- train: 路径为
- 数据文件:



