five

Medical Concept Embeddings for SNOMED-CT (Jan 2019 version)

收藏
NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://zenodo.org/record/3842142
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains the SNOMED-CT medical concept embeddings trained using the following text and graph embedding methods. Averaged Word Embedding (300) ELMo (1024) Universal Sentence Encoder (512) BERT (768) Deepwalk (128) Node2Vec (128) HARP (128) LINE (128) The tar file contains eight JSON files corresponding to the aforementioned embedding techniques. The number (in parenthesis) besides each embedding method represents the dimensionality of the embedding. Each JSON file contains a python dictionary of the form SNOMED concept ID (String): Embedding (List). If you find this resource useful in your research, please consider citing our paper: "Pattisapu, N., Patil, S., Palshikar, G. and Varma, V., Medical Concept Normalization by Encoding Target Knowledge, Proceedings of Machine Learning Research 116:246–259, 2020 Machine Learning for Health (ML4H) at NeurIPS 2019" Warning: The dataset size is large (~12 GB). Please ensure that you have sufficient network bandwidth and disk space before requesting a download.
创建时间:
2020-05-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作