Yamei/VISBank_Parsed
收藏Hugging Face2023-03-04 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Yamei/VISBank_Parsed
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: title
dtype: string
- name: paper_id
dtype: int64
- name: abstract
dtype: string
- name: authors
list:
- name: first
dtype: string
- name: middle
sequence: string
- name: last
dtype: string
- name: suffix
dtype: string
- name: year
dtype: float64
- name: arxiv_id
dtype: string
- name: acl_id
dtype: string
- name: pmc_id
dtype: string
- name: pubmed_id
dtype: string
- name: doi
dtype: string
- name: venue
dtype: string
- name: journal
dtype: string
- name: mag_id
dtype: string
- name: outbound_citations
sequence: string
- name: inbound_citations
sequence: string
- name: has_outbound_citations
dtype: bool
- name: has_inbound_citations
dtype: bool
- name: has_pdf_parse
dtype: bool
- name: s2_url
dtype: string
- name: has_pdf_body_text
dtype: float64
- name: has_pdf_parsed_abstract
dtype: float64
- name: has_pdf_parsed_body_text
dtype: float64
- name: has_pdf_parsed_bib_entries
dtype: float64
- name: has_pdf_parsed_ref_entries
dtype: float64
- name: entities
sequence:
sequence: string
splits:
- name: train
num_bytes: 260857853
num_examples: 125745
download_size: 135186490
dataset_size: 260857853
---
# Dataset Card for "VISBank_Parsed"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
Yamei
原始信息汇总
数据集概述
数据集特征
- title: 字符串类型
- paper_id: 整数类型
- abstract: 字符串类型
- authors: 列表类型,包含以下字段:
- first: 字符串类型
- middle: 序列类型,字符串
- last: 字符串类型
- suffix: 字符串类型
- year: 浮点数类型
- arxiv_id: 字符串类型
- acl_id: 字符串类型
- pmc_id: 字符串类型
- pubmed_id: 字符串类型
- doi: 字符串类型
- venue: 字符串类型
- journal: 字符串类型
- mag_id: 字符串类型
- outbound_citations: 序列类型,字符串
- inbound_citations: 序列类型,字符串
- has_outbound_citations: 布尔类型
- has_inbound_citations: 布尔类型
- has_pdf_parse: 布尔类型
- s2_url: 字符串类型
- has_pdf_body_text: 浮点数类型
- has_pdf_parsed_abstract: 浮点数类型
- has_pdf_parsed_body_text: 浮点数类型
- has_pdf_parsed_bib_entries: 浮点数类型
- has_pdf_parsed_ref_entries: 浮点数类型
- entities: 序列类型,包含序列类型字符串
数据集分割
- train:
- num_bytes: 260857853
- num_examples: 125745
数据集大小
- download_size: 135186490
- dataset_size: 260857853



