JSALT2024-Astro-LLMs/astro_paper_corpus
Source: Hugging Face. Updated: 2024-07-11. Indexed: 2024-07-06.
Download link:
https://hf-mirror.com/datasets/JSALT2024-Astro-LLMs/astro_paper_corpus
Description:
This dataset contains metadata for scientific literature, including authors, titles, citation counts, keywords, and abstracts. Its fields are id, author, bibcode, title, citation_count, aff, citation, database, read_count, keyword, reference, doi, subfolder, filename, introduction, conclusions, year, month, arxiv_id, abstract, failed_ids, keyword_search, umap_x, umap_y, and clust_id. The dataset provides a training split containing 271,544 samples, with a total size of 4,128,576,193 bytes.
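As a rough illustration of working with the fields listed above, the following Python sketch checks a record against the listed field names and streams a few rows from the training split. It assumes the Hugging Face `datasets` package is installed and that the repository id matches the path in the download link; the helper names (`missing_fields`, `average_record_bytes`, `stream_records`) are hypothetical and not part of the dataset.

```python
from typing import Any, Dict, Iterator, List

# Repository id taken from the download link; assumed to be loadable as-is.
DATASET_ID = "JSALT2024-Astro-LLMs/astro_paper_corpus"

# Field names exactly as listed in the description.
EXPECTED_FIELDS = [
    "id", "author", "bibcode", "title", "citation_count", "aff",
    "citation", "database", "read_count", "keyword", "reference", "doi",
    "subfolder", "filename", "introduction", "conclusions", "year",
    "month", "arxiv_id", "abstract", "failed_ids", "keyword_search",
    "umap_x", "umap_y", "clust_id",
]

# Split statistics quoted in the description.
NUM_SAMPLES = 271_544
TOTAL_BYTES = 4_128_576_193


def missing_fields(record: Dict[str, Any]) -> List[str]:
    """Return the listed field names that are absent from a record."""
    return [name for name in EXPECTED_FIELDS if name not in record]


def average_record_bytes() -> float:
    """Mean size per sample implied by the quoted totals."""
    return TOTAL_BYTES / NUM_SAMPLES


def stream_records(limit: int = 3) -> Iterator[Dict[str, Any]]:
    """Yield the first `limit` training records without a full download.

    Requires network access and the third-party `datasets` package,
    imported lazily so the helpers above work without it.
    """
    from datasets import load_dataset

    ds = load_dataset(DATASET_ID, split="train", streaming=True)
    for i, record in enumerate(ds):
        if i >= limit:
            break
        yield record
```

Streaming is used here because the split is roughly 4 GB; iterating over `stream_records()` and passing each record to `missing_fields` is one way to confirm that the published schema matches the field list before committing to a full download.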
Provider:
JSALT2024-Astro-LLMs



