shubhamagarwal92/rw_2308_filtered
收藏Hugging Face2023-09-21 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/shubhamagarwal92/rw_2308_filtered
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: aid
dtype: string
- name: mid
dtype: string
- name: abstract
dtype: string
- name: corpusid
dtype: int64
- name: text_except_rw
dtype: string
- name: title
dtype: string
- name: related_work
dtype: string
- name: original_related_work
dtype: string
- name: ref_abstract
struct:
- name: abstract
sequence: string
- name: cite_N
sequence: string
- name: corpursid
sequence: string
- name: ref_abstract_original
struct:
- name: abstract
sequence: string
- name: cite_N
sequence: string
- name: corpursid
sequence: string
- name: ref_abstract_full_text
struct:
- name: abstract
sequence: string
- name: all_para_text
sequence: string
- name: cite_N
sequence: string
- name: corpursid
sequence: string
- name: ref_abstract_full_text_original
struct:
- name: abstract
sequence: string
- name: all_para_text
sequence: string
- name: cite_N
sequence: string
- name: corpursid
sequence: string
- name: total_cites
dtype: int64
splits:
- name: test
num_bytes: 254996014
num_examples: 1000
download_size: 106899160
dataset_size: 254996014
---
# Dataset Card for "rw_2308_filtered"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
shubhamagarwal92
原始信息汇总
数据集概述
特征信息
- aid: 字符串类型
- mid: 字符串类型
- abstract: 字符串类型
- corpusid: 64位整数类型
- text_except_rw: 字符串类型
- title: 字符串类型
- related_work: 字符串类型
- original_related_work: 字符串类型
- ref_abstract: 结构类型,包含以下字段:
- abstract: 字符串序列
- cite_N: 字符串序列
- corpursid: 字符串序列
- ref_abstract_original: 结构类型,包含以下字段:
- abstract: 字符串序列
- cite_N: 字符串序列
- corpursid: 字符串序列
- ref_abstract_full_text: 结构类型,包含以下字段:
- abstract: 字符串序列
- all_para_text: 字符串序列
- cite_N: 字符串序列
- corpursid: 字符串序列
- ref_abstract_full_text_original: 结构类型,包含以下字段:
- abstract: 字符串序列
- all_para_text: 字符串序列
- cite_N: 字符串序列
- corpursid: 字符串序列
- total_cites: 64位整数类型
数据分割
- test: 包含1000个样本,占用254996014字节
数据集大小
- 下载大小: 106899160字节
- 数据集大小: 254996014字节



