Word Representations for Clinical Danish
收藏DataCite Commons2020-08-25 更新2024-08-25 收录
下载链接:
https://figshare.com/articles/Word_Representations_for_Clinical_Danish/12377858/1
下载链接
链接失效反馈官方服务:
资源简介:
Word embeddings and word clusters for Clinical Danish, drawn from the heavily-anonymised E4C resource (https://doi.org/10.1177/1460458216647760) and presented here as statistical aggregate data over those records. Vocabulary of 382737 words. Vectors have 100 dimensions. Clusters generated using Generalised Brown clustering with a=2500 and a minimum count of 3; coarser clusters can be generated rapidly from the included mergefile (see https://github.com/sean-chester/generalised-brown/blob/master/cluster_generator/cluster.py)<br>Data statement included<br>
提供机构:
figshare
创建时间:
2020-05-27



