nils-herrmann/clean_scirepeval

Source: Hugging Face. Updated 2026-04-14. Indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/nils-herrmann/clean_scirepeval
Resource description:
---
license: mit
configs:
- config_name: biomimicry
  data_files:
  - split: evaluation
    path: biomimicry/evaluation-*
- config_name: cite_count
  data_files:
  - split: evaluation
    path: cite_count/evaluation-*
- config_name: drsm
  data_files:
  - split: evaluation
    path: drsm/evaluation-*
- config_name: fos
  data_files:
  - split: evaluation
    path: fos/evaluation-*
- config_name: high_influence_cite
  data_files:
  - split: evaluation
    path: high_influence_cite/evaluation-*
- config_name: mesh_descriptors
  data_files:
  - split: evaluation
    path: mesh_descriptors/evaluation-*
- config_name: nfcorpus
  data_files:
  - split: evaluation
    path: nfcorpus/evaluation-*
- config_name: paper_reviewer_matching
  data_files:
  - split: evaluation
    path: paper_reviewer_matching/evaluation-*
- config_name: peer_review_score_hIndex
  data_files:
  - split: evaluation
    path: peer_review_score_hIndex/evaluation-*
- config_name: pub_year
  data_files:
  - split: evaluation
    path: pub_year/evaluation-*
- config_name: relish
  data_files:
  - split: evaluation
    path: relish/evaluation-*
- config_name: same_author
  data_files:
  - split: evaluation
    path: same_author/evaluation-*
- config_name: scidocs_mag_mesh
  data_files:
  - split: evaluation
    path: scidocs_mag_mesh/evaluation-*
- config_name: scidocs_view_cite_read
  data_files:
  - split: evaluation
    path: scidocs_view_cite_read/evaluation-*
- config_name: search
  data_files:
  - split: evaluation
    path: search/evaluation-*
- config_name: trec_covid
  data_files:
  - split: evaluation
    path: trec_covid/evaluation-*
- config_name: tweet_mentions
  data_files:
  - split: evaluation
    path: tweet_mentions/evaluation-*
dataset_info:
- config_name: biomimicry
  features:
  - name: doc_id
    dtype: string
  - name: doi
    dtype: string
  - name: corpus_id
    dtype: uint64
  - name: title
    dtype: string
  - name: abstract
    dtype: string
  - name: label
    dtype: uint32
  - name: venue
    dtype: string
  splits:
  - name: evaluation
    num_bytes: 16627394
    num_examples: 10991
  download_size: 9173952
  dataset_size: 16627394
- config_name: cite_count
  features:
  - name: doc_id
    dtype: string
  - name: corpus_id
    dtype: uint64
  - name: title
    dtype: string
  - name: abstract
    dtype: string
  - name: venue
    dtype: string
  - name: n_citations
    dtype: int32
  - name: log_citations
    dtype: float32
  splits:
  - name: evaluation
    num_bytes: 45344590
    num_examples: 30058
  download_size: 25568353
  dataset_size: 45344590
- config_name: drsm
  features:
  - name: doc_id
    dtype: string
  - name: corpus_id
    dtype: uint64
  - name: title
    dtype: string
  - name: abstract
    dtype: string
  - name: label_type
    dtype: string
  - name: label
    dtype: string
  - name: class
    dtype: uint32
  splits:
  - name: evaluation
    num_bytes: 12724992
    num_examples: 8813
  download_size: 6869256
  dataset_size: 12724992
- config_name: fos
  features:
  - name: doc_id
    dtype: string
  - name: corpus_id
    dtype: uint64
  - name: title
    dtype: string
  - name: abstract
    dtype: string
  - name: labels
    list: int32
  - name: labels_text
    list: string
  splits:
  - name: evaluation
    num_bytes: 63479356
    num_examples: 68147
  download_size: 38792602
  dataset_size: 63479356
- config_name: high_influence_cite
  features:
  - name: query
    struct:
    - name: doc_id
      dtype: string
    - name: title
      dtype: string
    - name: abstract
      dtype: string
    - name: corpus_id
      dtype: uint64
  - name: candidates
    list:
    - name: doc_id
      dtype: string
    - name: title
      dtype: string
    - name: abstract
      dtype: string
    - name: corpus_id
      dtype: uint64
    - name: score
      dtype: uint32
  splits:
  - name: evaluation
    num_bytes: 85173159
    num_examples: 1199
  download_size: 85415125
  dataset_size: 85173159
- config_name: mesh_descriptors
  features:
  - name: doc_id
    dtype: string
  - name: mag_id
    dtype: uint64
  - name: corpus_id
    dtype: uint64
  - name: title
    dtype: string
  - name: abstract
    dtype: string
  - name: descriptor
    dtype: string
  - name: qualifier
    dtype: string
  splits:
  - name: evaluation
    num_bytes: 386932653
    num_examples: 258678
  download_size: 218974160
  dataset_size: 386932653
- config_name: nfcorpus
  features:
  - name: query
    dtype: string
  - name: doc_id
    dtype: string
  - name: candidates
    list:
    - name: doc_id
      dtype: string
    - name: title
      dtype: string
    - name: abstract
      dtype: string
    - name: score
      dtype: uint32
  splits:
  - name: evaluation
    num_bytes: 70875855
    num_examples: 323
  download_size: 71070084
  dataset_size: 70875855
- config_name: paper_reviewer_matching
  features:
  - name: doc_id
    dtype: string
  - name: title
    dtype: string
  - name: abstract
    dtype: string
  - name: corpus_id
    dtype: uint64
  splits:
  - name: evaluation
    num_bytes: 75908720
    num_examples: 73364
  download_size: 40357135
  dataset_size: 75908720
- config_name: peer_review_score_hIndex
  features:
  - name: doc_id
    dtype: string
  - name: corpus_id
    dtype: uint64
  - name: title
    dtype: string
  - name: abstract
    dtype: string
  - name: rating
    list: int32
  - name: confidence
    dtype: string
  - name: authors
    list: string
  - name: decision
    dtype: string
  - name: mean_rating
    dtype: float32
  - name: hIndex
    list: string
  splits:
  - name: evaluation
    num_bytes: 18225432
    num_examples: 12668
  download_size: 10489454
  dataset_size: 18225432
- config_name: pub_year
  features:
  - name: doc_id
    dtype: string
  - name: corpus_id
    dtype: uint64
  - name: title
    dtype: string
  - name: abstract
    dtype: string
  - name: year
    dtype: int32
  - name: venue
    dtype: string
  - name: norm_year
    dtype: float32
  - name: scaled_year
    dtype: float32
  - name: n_authors
    dtype: int32
  - name: norm_authors
    dtype: float32
  splits:
  - name: evaluation
    num_bytes: 45729112
    num_examples: 30000
  download_size: 26312820
  dataset_size: 45729112
- config_name: relish
  features:
  - name: query
    struct:
    - name: doc_id
      dtype: string
    - name: title
      dtype: string
    - name: abstract
      dtype: string
    - name: corpus_id
      dtype: int64
  - name: candidates
    list:
    - name: doc_id
      dtype: string
    - name: title
      dtype: string
    - name: abstract
      dtype: string
    - name: corpus_id
      dtype: int64
    - name: score
      dtype: uint32
  splits:
  - name: evaluation
    num_bytes: 334536092
    num_examples: 3190
  download_size: 334956568
  dataset_size: 334536092
- config_name: same_author
  features:
  - name: dataset
    dtype: string
  - name: query
    struct:
    - name: doc_id
      dtype: string
    - name: title
      dtype: string
    - name: abstract
      dtype: string
    - name: corpus_id
      dtype: uint64
  - name: candidates
    list:
    - name: doc_id
      dtype: string
    - name: title
      dtype: string
    - name: abstract
      dtype: string
    - name: corpus_id
      dtype: uint64
    - name: score
      dtype: uint32
  splits:
  - name: evaluation
    num_bytes: 125996831
    num_examples: 13585
  download_size: 126179456
  dataset_size: 125996831
- config_name: scidocs_mag_mesh
  features:
  - name: doc_id
    dtype: string
  - name: corpus_id
    dtype: uint64
  - name: title
    dtype: string
  - name: abstract
    dtype: string
  - name: authors
    list: string
  - name: cited_by
    list: string
  - name: references
    list: string
  - name: year
    dtype: int32
  splits:
  - name: evaluation
    num_bytes: 73136014
    num_examples: 48473
  download_size: 47314082
  dataset_size: 73136014
- config_name: scidocs_view_cite_read
  features:
  - name: doc_id
    dtype: string
  - name: corpus_id
    dtype: uint64
  - name: title
    dtype: string
  - name: abstract
    dtype: string
  - name: authors
    list: string
  - name: cited_by
    list: string
  - name: references
    list: string
  - name: year
    dtype: int32
  splits:
  - name: evaluation
    num_bytes: 239486780
    num_examples: 142009
  download_size: 161257321
  dataset_size: 239486780
- config_name: search
  features:
  - name: query
    dtype: string
  - name: doc_id
    dtype: string
  - name: candidates
    list:
    - name: doc_id
      dtype: string
    - name: title
      dtype: string
    - name: abstract
      dtype: string
    - name: corpus_id
      dtype: uint64
    - name: venue
      dtype: string
    - name: year
      dtype: float64
    - name: author_names
      list: string
    - name: n_citations
      dtype: int32
    - name: n_key_citations
      dtype: int32
    - name: score
      dtype: uint32
  splits:
  - name: evaluation
    num_bytes: 39076558
    num_examples: 2637
  download_size: 39009549
  dataset_size: 39076558
- config_name: trec_covid
  features:
  - name: query
    dtype: string
  - name: doc_id
    dtype: string
  - name: candidates
    list:
    - name: title
      dtype: string
    - name: abstract
      dtype: string
    - name: corpus_id
      dtype: string
    - name: doc_id
      dtype: string
    - name: date
      dtype: string
    - name: doi
      dtype: string
    - name: iteration
      dtype: string
    - name: score
      dtype: int32
  splits:
  - name: evaluation
    num_bytes: 97612064
    num_examples: 50
  download_size: 97619223
  dataset_size: 97612064
- config_name: tweet_mentions
  features:
  - name: doc_id
    dtype: string
  - name: corpus_id
    dtype: uint64
  - name: title
    dtype: string
  - name: abstract
    dtype: string
  - name: index
    dtype: int32
  - name: retweets
    dtype: float32
  - name: count
    dtype: int32
  - name: mentions
    dtype: float32
  splits:
  - name: evaluation
    num_bytes: 25849693
    num_examples: 25655
  download_size: 14646396
  dataset_size: 25849693
---
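
The metadata above lists 17 configurations, each with a single `evaluation` split. As a minimal sketch of how one of them might be loaded, the snippet below assumes the Hugging Face `datasets` library is installed and that the repository is reachable under the ID shown in the title. The helper names `load_config` and `iter_candidate_scores` are hypothetical and not part of the dataset. The second helper applies only to the configurations whose `query` is a struct (`high_influence_cite`, `relish`, `same_author`) and assumes `candidates` arrives as a list of dicts.

```python
from typing import Any, Dict, Iterator, Tuple

REPO_ID = "nils-herrmann/clean_scirepeval"
SPLIT = "evaluation"  # the only split listed for every configuration

# Configuration names copied from the `configs` section of the metadata.
CONFIGS = (
    "biomimicry", "cite_count", "drsm", "fos", "high_influence_cite",
    "mesh_descriptors", "nfcorpus", "paper_reviewer_matching",
    "peer_review_score_hIndex", "pub_year", "relish", "same_author",
    "scidocs_mag_mesh", "scidocs_view_cite_read", "search",
    "trec_covid", "tweet_mentions",
)


def load_config(name: str):
    """Load the evaluation split of one configuration (hypothetical helper)."""
    if name not in CONFIGS:
        raise ValueError(f"unknown configuration: {name!r}")
    # Imported lazily so the name check works without `datasets` installed.
    from datasets import load_dataset
    return load_dataset(REPO_ID, name, split=SPLIT)


def iter_candidate_scores(row: Dict[str, Any]) -> Iterator[Tuple[str, str, int]]:
    """Yield (query doc_id, candidate doc_id, score) for a struct-query row.

    Assumes `row["query"]` is a dict and `row["candidates"]` is a list of dicts.
    """
    query_id = row["query"]["doc_id"]
    for candidate in row["candidates"]:
        yield query_id, candidate["doc_id"], candidate["score"]
```

A call such as `load_config("fos")` would download that configuration on first use. The listed `download_size` values range from about 6.9 MB (`drsm`) to about 335 MB (`relish`), so the larger retrieval configurations may take a while to fetch.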
Provided by:
nils-herrmann