
Researcher data for pre-training

Source: Science Data Bank. Updated 2022-02-15; indexed 2026-04-23.
Download link:
https://www.scidb.cn/en/detail?dataSetId=3fbfa46e6c9f4c7d8b59db68149bdc76
Description:
This dataset was extracted from the DBLP, ACM, and MAG digital libraries and serves as the input to the researcher pre-training model RPT [1][2]. It contains a researcher semantic document set and a researcher community graph.

--author_document_corpus.txt
The researcher semantic document set. Each line holds one researcher's semantic document set: the researcher ID followed by the document sequence, separated by tabs (\t). For example:

    a1 \t d1 \t d2 \t ... \t dm \n
    a2 \t d1 \t d2 \t ... \t dn \n
    ...

--author_community.pkl
The researcher community graph, where each key is a researcher ID and each value holds two vectors of equal length: the neighbor IDs and the corresponding relation type indices. For example:

    {a1: {"neighbors": [a2, a4, a9, ..., a6], "relations": [0, 0, 1, ..., 2]}, a2: ...}

[1] Qiao, Ziyue, et al. "RPT: Toward Transferable Model on Heterogeneous Researcher Data via Pre-Training." arXiv preprint arXiv:2110.07336 (2021).
[2] https://github.com/joe817/RPT
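The two file layouts described above can be read with a few lines of Python. This is a minimal sketch, not the RPT authors' loading code: the function names are hypothetical, and it assumes that each value in author_community.pkl is a mapping with the keys "neighbors" and "relations", and that the spaces shown around the tabs in the example are only for readability.

```python
import pickle
import tempfile
from pathlib import Path


def read_author_documents(path):
    """Map each researcher ID to its document list (one tab-separated line per researcher)."""
    corpus = {}
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            # Split on tabs and drop surrounding whitespace and empty fields.
            fields = [field.strip() for field in line.split("\t")]
            fields = [field for field in fields if field]
            if not fields:
                continue  # blank line
            corpus[fields[0]] = fields[1:]
    return corpus


def read_author_community(path):
    """Load the community graph and check that both vectors have equal length.

    Assumption: each value is a mapping with "neighbors" and "relations" keys.
    pickle.load can execute arbitrary code, so only open files you trust.
    """
    with open(path, "rb") as handle:
        community = pickle.load(handle)
    for author, entry in community.items():
        if len(entry["neighbors"]) != len(entry["relations"]):
            raise ValueError(f"length mismatch for researcher {author!r}")
    return community


# Toy files that mimic the described layout (not real dataset content).
with tempfile.TemporaryDirectory() as tmp:
    corpus_path = Path(tmp) / "author_document_corpus.txt"
    corpus_path.write_text("a1\td1\td2\na2\td3\n", encoding="utf-8")

    graph_path = Path(tmp) / "author_community.pkl"
    toy_graph = {"a1": {"neighbors": ["a2"], "relations": [0]}}
    graph_path.write_bytes(pickle.dumps(toy_graph))

    print(read_author_documents(corpus_path))
    print(read_author_community(graph_path))
```

The length check mirrors the statement that the two vectors are of equal length, so a mismatch is reported early instead of surfacing later as an indexing error.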
Provided by:
Ziyue Qiao; Computer Network Information Center; Yuanchun Zhou
Created:
2022-02-08