five

Wiki[Alt]Med corpus

收藏
Figshare2025-09-02 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Wiki_Alt_Med_corpus/30024127
下载链接
链接失效反馈
官方服务:
资源简介:
The Wiki[Alt]Med corpus is designed to allow investigation of discourses circulating within Wikipedia's medicine and health-related content, with a particular focus on the ways in which Wikipedians discuss distinctions between 'scientific' and 'alternative' forms of medicine.The corpus currently contains:All 100 articles in Wikipedia’s ‘Top Importance Medical Articles’ category (642,135 tokens)333 articles categorised by the Wikipedia community as relevant to ‘Alternative Medicine’ (652,704 tokens)Ancillary ‘Talk page’ discussion forums associated with all of the above encyclopedia articles - e.g. Talk:Acupuncture (20.2 million tokens)Metadata facilitating combination of wide-angle analysis of corpus with close reading of text in its original context on the Wikipedia platformOnce downloaded, the coprus can be queried using Sketch Engine (https://www.sketchengine.eu/) or Lancsbox (https://lancsbox.lancs.ac.uk/).
创建时间:
2025-09-02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作