mlsa-iai-msu-lab/ru_sci_bench_mteb_test
Source: Hugging Face. Updated 2024-08-29; indexed 2024-12-14.
Download link:
https://hf-mirror.com/datasets/mlsa-iai-msu-lab/ru_sci_bench_mteb_test
Description:
The dataset provides multiple configurations, each in an English or a Russian version. Every configuration contains a paper ID (paper_id), the paper text (text), and a target column (label or value). Judging by their names, the configurations cover citation count (cited_count), corerisc (presumably membership in the core of the Russian Index of Science Citation, RISC, rather than "core risk"), GRNTI classification (grnti), OECD classification (oecd), publication type (pub_type), and publication year (yearpubl). Each configuration is split into training and test sets for use in machine learning tasks.
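The configuration layout described above could be explored with the Hugging Face `datasets` library. The following is a minimal sketch, not an official loader. The helper names (`group_configs_by_task`, `list_configs`, `load_config`) are hypothetical. The `_ru` / `_en` suffix convention for configuration names and the split name `test` are assumptions that should be checked against the dataset card.

```python
from collections import defaultdict

REPO_ID = "mlsa-iai-msu-lab/ru_sci_bench_mteb_test"


def group_configs_by_task(config_names):
    """Group names like 'grnti_ru' into {task: [languages]}.

    Assumes (unverified) that each config name ends in '_ru' or '_en'.
    Names that do not follow this pattern are kept whole and tagged 'unknown'.
    """
    grouped = defaultdict(list)
    for name in config_names:
        task, _, lang = name.rpartition("_")
        if task and lang in ("ru", "en"):
            grouped[task].append(lang)
        else:
            grouped[name].append("unknown")
    return dict(grouped)


def list_configs():
    """Fetch the configuration names from the Hub (needs network access)."""
    # Imported lazily so the pure helper above works without `datasets` installed.
    from datasets import get_dataset_config_names

    return get_dataset_config_names(REPO_ID)


def load_config(config_name, split="test"):
    """Load one split of one configuration; the split name is an assumption."""
    from datasets import load_dataset

    return load_dataset(REPO_ID, config_name, split=split)
```

A typical session would call `list_configs()`, pass the result to `group_configs_by_task` to see which tasks exist in which language, then call `load_config(name)` and inspect `column_names` to confirm whether the target column is `label` or `value`.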
Provider:
mlsa-iai-msu-lab



