Q&A dataset of Naver KiN Here
收藏Mendeley Data2024-03-27 更新2024-06-30 收录
下载链接:
https://zenodo.org/record/46018
下载链接
链接失效反馈官方服务:
资源简介:
We crawled the publicly accessible local questions and answers on Naver KiN Here from December 17, 2012 to December 31, 2013; a total of 508,334 questions and 567,156 answers were obtained. NKH questions are accessible on the web since web users can also answer the questions. However, the site only lists the questions that are less than one month old, and thus, we scraped the question listing pages in every other week. The collected URLs were then used to download the question pages, which contain all the answerers. For the given dataset, we extracted a set of associated items for analysis including user information (e.g., ID, the question closing rate and answer acceptance rate), title, content, posted time, categorized region, posted coordinate (i.e., latitude, longitude). Similarly, we extracted all the fields of each answer (e.g., answerer ID, posted time, answerer status information). For field extraction, we manually investigated the page format in HTML to write a parser code with regular expressions.
创建时间:
2023-06-28



