five

General Zhihu Corpus

收藏
Figshare2019-05-17 更新2026-04-08 收录
下载链接:
https://figshare.com/articles/General_Zhihu_Corpus/8131781/1
下载链接
链接失效反馈
官方服务:
资源简介:
Chinese language corpus containing 3,434 questions and 231,939 answers posted to Zhihu.com.<br>Questions taken from 10 popular topics: “Culture” (文化), “Education” (教育), “Art” (艺术), “University” (大学), “The Internet” (互联网), “Psychology” (心理), “Technology” (科技), “Health” (健康), “Career Development” (职业发展), “Lifestyle” (生活方式)<br>Includes R scripts used to extract data.Data extracted in April 2019.<br>Files are questions (Q), answers (A) and question topics (T).The naming convention is the URL of the webpage:For questions:https://www.zhihu.com/question/[question number]For answers:https://www.zhihu.com/question/[question number]/answer/[answer number]<br>Answers are organised by author category: "male", "female", "undisclosed gender", "anonymous", "organisation" using information from the user's profile where publicly accessible.<br>Short Answers: ≤1,000 characters Medium Answers: 1,001-4,999 characters Long Answers: ≥5,000 characters<br>
创建时间:
2019-05-15
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作