
Word Representations for Clinical Danish

Source: DataCite Commons. Updated 2020-08-25; indexed 2024-07-28.
Download link:
https://figshare.com/articles/Word_Representations_for_Clinical_Danish/12377858
Description:
Word embeddings and word clusters for Clinical Danish, derived from the heavily anonymised E4C resource (https://doi.org/10.1177/1460458216647760) and presented here as statistical aggregate data over those records. The vocabulary contains 382,737 words, and the vectors have 100 dimensions. Clusters were generated using Generalised Brown clustering with a=2500 and a minimum count of 3; coarser clusters can be generated rapidly from the included mergefile (see https://github.com/sean-chester/generalised-brown/blob/master/cluster_generator/cluster.py). A data statement is included.
Provider:
figshare
Created:
2020-05-27
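
The description notes that coarser clusters can be derived from the included mergefile. The general idea can be sketched as below: replay a recorded sequence of pairwise merges until the desired number of clusters remains. This is only an illustration under stated assumptions. The function name `coarsen`, the cluster labels, and the in-memory list of `(a, b)` merge pairs are hypothetical; the sketch does not reproduce the actual mergefile format or the logic of `cluster.py`, which should be consulted directly.

```python
def coarsen(leaves, merges, target):
    """Replay recorded merges until only `target` clusters remain.

    leaves: list of fine-grained cluster labels (hypothetical names).
    merges: ordered list of (a, b) label pairs, earliest merge first
            (an assumed in-memory form, not the real mergefile layout).
    target: desired number of coarse clusters.
    """
    if not 1 <= target <= len(leaves):
        raise ValueError("target must be between 1 and the number of leaves")

    # Union-find: every leaf starts as the root of its own cluster.
    parent = {leaf: leaf for leaf in leaves}

    def find(x):
        # Follow parent links to the root, shortening the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    remaining = len(leaves)
    for a, b in merges:
        if remaining <= target:
            break
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a
            remaining -= 1

    # Group the original leaves by the root they ended up under.
    groups = {}
    for leaf in leaves:
        groups.setdefault(find(leaf), []).append(leaf)
    return sorted(groups.values())


# Toy example with four fine-grained clusters and three recorded merges.
leaves = ["c0", "c1", "c2", "c3"]
merges = [("c0", "c1"), ("c2", "c3"), ("c0", "c2")]
two_clusters = coarsen(leaves, merges, 2)
```

Because the merges are replayed in order, any coarser granularity can be obtained from the same merge record without re-running the clustering, which is why roll-up from a merge record is fast.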