EchoGraph-annotated ECHO-NOTE2NUM examples
收藏DataCite Commons2025-12-04 更新2026-05-04 收录
下载链接:
https://physionet.org/content/echograph-note2num-annotations/
下载链接
链接失效反馈官方服务:
资源简介:
This repository releases the EchoGraph-annotated ECHO-NOTE2NUM dataset,
containing 45,794 echocardiography reports with comprehensive entity and
relation annotations. Each report from the ECHO-NOTE2NUM dataset has been
automatically annotated using EchoGraph, a BERT-based information extraction
model specifically designed for echocardiography reports. The annotations
employ a tailored schema capturing clinical observations (definitely present,
definitely absent, uncertain), anatomical structures, measurements, and four
types of relations (Modify, Gauge, Located at, Suggestive of). The dataset
contains a total of 1,709,074 entities and 671,512 relations across all 45,794
reports. EchoGraph was developed using 600 densely annotated Mayo Clinic
reports (48,256 entities, 29,731 relations) and validated on 60 MIMIC-EchoNote
reports, demonstrating strong performance (entity F1 0.85 internal, 0.80
external; relation F1 0.70 internal, 0.52 external).
This annotated dataset enables research in clinical NLP, automated report
evaluation, and development of AI systems for echocardiography.
提供机构:
PhysioNet
创建时间:
2025-10-31



