BioDEX/BioDEX-ICSR-Abstract
收藏Hugging Face2023-05-04 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/BioDEX/BioDEX-ICSR-Abstract
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: title
dtype: string
- name: abstract
dtype: string
- name: fulltext
dtype: string
- name: target
dtype: string
- name: pmid
dtype: string
- name: fulltext_license
dtype: string
- name: title_normalized
dtype: string
- name: issue
dtype: string
- name: pages
dtype: string
- name: journal
dtype: string
- name: authors
dtype: string
- name: pubdate
dtype: string
- name: doi
dtype: string
- name: affiliations
dtype: string
- name: medline_ta
dtype: string
- name: nlm_unique_id
dtype: string
- name: issn_linking
dtype: string
- name: country
dtype: string
- name: mesh_terms
dtype: string
- name: publication_types
dtype: string
- name: chemical_list
dtype: string
- name: keywords
dtype: string
- name: references
dtype: string
- name: delete
dtype: bool
- name: pmc
dtype: string
- name: other_id
dtype: string
- name: fulltext_processed
dtype: string
splits:
- name: test
num_bytes: 118045716
num_examples: 8053
- name: train
num_bytes: 333640345
num_examples: 32235
- name: validation
num_bytes: 82957309
num_examples: 8059
download_size: 285101366
dataset_size: 534643370
---
# Dataset Card for "BioDEX-ICSR-Abstract"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
BioDEX
原始信息汇总
数据集概述
数据集名称
BioDEX-ICSR-Abstract
数据集特征
数据集包含以下特征:
- title: 字符串类型
- abstract: 字符串类型
- fulltext: 字符串类型
- target: 字符串类型
- pmid: 字符串类型
- fulltext_license: 字符串类型
- title_normalized: 字符串类型
- issue: 字符串类型
- pages: 字符串类型
- journal: 字符串类型
- authors: 字符串类型
- pubdate: 字符串类型
- doi: 字符串类型
- affiliations: 字符串类型
- medline_ta: 字符串类型
- nlm_unique_id: 字符串类型
- issn_linking: 字符串类型
- country: 字符串类型
- mesh_terms: 字符串类型
- publication_types: 字符串类型
- chemical_list: 字符串类型
- keywords: 字符串类型
- references: 字符串类型
- delete: 布尔类型
- pmc: 字符串类型
- other_id: 字符串类型
- fulltext_processed: 字符串类型
数据集分割
- test: 8053个样本,118045716字节
- train: 32235个样本,333640345字节
- validation: 8059个样本,82957309字节
数据集大小
- 下载大小: 285101366字节
- 数据集总大小: 534643370字节



