averoo/librusec_meta

Name: averoo/librusec_meta
Creator: averoo
Published: 2024-06-06 11:15:30
License: 暂无描述

Hugging Face2024-06-06 更新2024-06-12 收录

下载链接：

https://hf-mirror.com/datasets/averoo/librusec_meta

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: genre sequence: string - name: book_title dtype: string - name: author.first_name dtype: string - name: author.last_name dtype: string - name: author.middle_name dtype: string - name: langs.src_lang dtype: string - name: langs.lang dtype: string - name: doc_info.nickname dtype: string - name: doc_info.email dtype: string - name: doc_info.date dtype: string - name: doc_info.doc_id dtype: string - name: doc_info.version dtype: string - name: doc_info.program_used dtype: string - name: title_info.annotation dtype: string - name: doc_info.src_url dtype: string - name: pub_info.book_name dtype: string - name: pub_info.publisher dtype: string - name: pub_info.city dtype: string - name: pub_info.year dtype: string - name: pub_info.isbn dtype: string - name: pub_info.sequence dtype: string - name: title_info.keywords dtype: string splits: - name: train num_bytes: 558057710 num_examples: 493378 download_size: 266662640 dataset_size: 558057710 configs: - config_name: default data_files: - split: train path: data/train-* --- # Dataset Card for "librusec_meta" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)

This dataset contains metadata about books, including genre, title, author information, languages, document information, publication information, and keywords. The dataset is split into a training set with 493378 examples, with a dataset size of 558057710 bytes and a download size of 266662640 bytes.

提供机构：

averoo

原始信息汇总

数据集概述

数据集名称

librusec_meta

数据特征

genre：字符串序列
book_title：字符串
author：
- first_name：字符串
- last_name：字符串
- middle_name：字符串
langs：
- src_lang：字符串
- lang：字符串
doc_info：
- nickname：字符串
- email：字符串
- date：字符串
- doc_id：字符串
- version：字符串
- program_used：字符串
- src_url：字符串
title_info：
- annotation：字符串
- keywords：字符串
pub_info：
- book_name：字符串
- publisher：字符串
- city：字符串
- year：字符串
- isbn：字符串
- sequence：字符串

数据集划分

train：
- num_bytes：558057710
- num_examples：493378

数据集大小

download_size：266662640
dataset_size：558057710

配置

config_name：default
data_files：
- split：train
- path：data/train-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集