
PERSON Dataset V2

DataCite Commons. Updated: 2025-04-01. Indexed: 2024-08-17.
Download link:
https://figshare.com/articles/PERSON_Dataset_V2/6958514/1
Description:
PERSON Dataset V2: Dataset created for paper "Search Personalization Based on Social-Network-Based Interestedness Measures." Please cite the paper for any usage.

The dataset is produced by data cleaning of AMiner's citation network V2 dataset (https://aminer.org/citation). Anyone who wants to use PERSON V2 dataset must cite Aminer's dataset (as explained in its homepage: Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang, and Zhong Su. ArnetMiner: Extraction and Mining of Academic Social Networks. In Proceedings of the Fourteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD'2008). pp.990-998) as well as the aforementioned paper.

It includes two files:

1- authors_giant.txt: the information of authors and their co-authors. The format is as follows:
    author ID
    author name
    the list of coauthors delimited by "," (Each entry contains the ID of the coauthor followed by the number of times they co-authored a paper)
    ...

2- papers_giant.txt: the information of papers and references. The format is as follows:
    paper ID
    Is paper merged (See the first paper for details)
    original paper ID (in Aminer's dataset)
    blank
    blank
    blank
    blank
    title
    abstract
    time (only the year part is important)
    blank
    references to papers out of the PERSON dataset (indicated by Aminer's IDs)
    references to papers inside the PERSON dataset (indicated by PERSON's IDs)
    author IDs
    ...
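
The two file layouts described above could be read with something like the following Python sketch. It is not part of the dataset and rests on assumptions the description does not confirm: that each listed field sits on its own line, and that records follow one another with no separator line. The names `read_records`, `AUTHOR_FIELDS`, and `PAPER_FIELDS` are made up for illustration, and the sample lines are invented rather than taken from the files.

```python
from itertools import islice
from typing import Iterable, Iterator, List, Optional

# Field order as listed in the dataset description.
# None marks a slot the description calls "blank".
AUTHOR_FIELDS: List[Optional[str]] = ["author_id", "author_name", "coauthors"]
PAPER_FIELDS: List[Optional[str]] = [
    "paper_id", "is_merged", "original_paper_id",
    None, None, None, None,
    "title", "abstract", "time",
    None,
    "refs_outside", "refs_inside", "author_ids",
]


def read_records(lines: Iterable[str], fields: List[Optional[str]]) -> Iterator[dict]:
    """Group consecutive lines into records of len(fields) lines each.

    ASSUMPTION: one field per line, records back to back with no separator.
    """
    it = iter(lines)
    while True:
        chunk = list(islice(it, len(fields)))
        if len(chunk) < len(fields):
            return  # end of input; a trailing partial record is dropped
        yield {
            name: value.rstrip("\r\n")
            for name, value in zip(fields, chunk)
            if name is not None  # skip the "blank" slots
        }


# Invented sample record, for illustration only (not real dataset content).
sample = ["1\n", "Alice Example\n", "2 3,5 1\n"]
for record in read_records(sample, AUTHOR_FIELDS):
    # The description only states that coauthor entries are delimited by ",";
    # how the ID and the count are separated inside an entry is not stated,
    # so entries are left unsplit here.
    entries = record["coauthors"].split(",")
    print(record["author_id"], record["author_name"], entries)
```

With a real file, the same function could be fed an open file handle, for example `read_records(open("papers_giant.txt", encoding="utf-8"), PAPER_FIELDS)`; the encoding is likewise an assumption and should be checked against the actual files.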

Provider:
figshare
Created:
2018-08-11