five

Q&A dataset of Naver KiN Here

收藏
Mendeley Data2024-03-27 更新2024-06-30 收录
下载链接:
https://zenodo.org/record/46018
下载链接
链接失效反馈
官方服务:
资源简介:
We crawled the publicly accessible local questions and answers on Naver KiN Here from December 17, 2012 to December 31, 2013; a total of 508,334 questions and 567,156 answers were obtained. NKH questions are accessible on the web since web users can also answer the questions. However, the site only lists the questions that are less than one month old, and thus, we scraped the question listing pages in every other week. The collected URLs were then used to download the question pages, which contain all the answerers. For the given dataset, we extracted a set of associated items for analysis including user information (e.g., ID, the question closing rate and answer acceptance rate), title, content, posted time, categorized region, posted coordinate (i.e., latitude, longitude). Similarly, we extracted all the fields of each answer (e.g., answerer ID, posted time, answerer status information). For field extraction, we manually investigated the page format in HTML to write a parser code with regular expressions.
创建时间:
2023-06-28
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作