
bigbio/twadrl

Source: Hugging Face. Updated 2022-12-22. Indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/bigbio/twadrl
Resource overview:
---
language:
- en
bigbio_language:
- English
license: cc-by-4.0
multilinguality: monolingual
bigbio_license_shortname: CC_BY_4p0
pretty_name: TwADR-L
homepage: https://zenodo.org/record/55013
bigbio_pubmed: False
bigbio_public: True
bigbio_tasks:
- NAMED_ENTITY_RECOGNITION
- NAMED_ENTITY_DISAMBIGUATION
---

# Dataset Card for TwADR-L

## Dataset Description

- **Homepage:** https://zenodo.org/record/55013
- **Pubmed:** False
- **Public:** True
- **Tasks:** NER, NED

The TwADR-L dataset contains medical concepts written on social media (Twitter) mapped to how they are formally written in medical ontologies (SIDER 4).

## Citation Information

```
@inproceedings{limsopatham-collier-2016-normalising,
    title = "Normalising Medical Concepts in Social Media Texts by Learning Semantic Representation",
    author = "Limsopatham, Nut and Collier, Nigel",
    booktitle = "Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = aug,
    year = "2016",
    address = "Berlin, Germany",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/P16-1096",
    doi = "10.18653/v1/P16-1096",
    pages = "1014--1023",
}
```
Provided by:
bigbio
Summary of Original Information

TwADR-L Dataset Overview

Basic Information

  • Language: English
  • License: CC-BY-4.0
  • Multilinguality: monolingual
  • Dataset name: TwADR-L
  • Homepage: https://zenodo.org/record/55013
  • Public: yes
  • Contains PubMed data: no

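Dataset Description

The TwADR-L dataset contains medical concepts as they are written on social media (Twitter), each mapped to its formal expression in a medical ontology (SIDER 4).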

Task Types

  • Named entity recognition (NER)
  • Named entity disambiguation (NED)
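
To make the disambiguation task concrete, the sketch below shows the simplest possible baseline: an exact-match lookup from an informal phrase to a formal concept label. It is only an illustration. The lexicon entries and the `normalise` function are invented for this example and are not taken from TwADR-L or SIDER 4, and the method described in the cited paper learns semantic representations rather than using a lookup table.

```python
from typing import Dict, Optional

# Hypothetical toy lexicon: phrases and labels are invented for illustration,
# not drawn from TwADR-L or SIDER 4.
TOY_LEXICON: Dict[str, str] = {
    "cant sleep": "insomnia",
    "head is pounding": "headache",
    "feel like throwing up": "nausea",
}


def normalise(phrase: str, lexicon: Dict[str, str] = TOY_LEXICON) -> Optional[str]:
    """Return the formal concept label for a phrase, or None if it is unknown."""
    # Lowercase and collapse runs of whitespace so trivial variants still match.
    key = " ".join(phrase.lower().split())
    return lexicon.get(key)


if __name__ == "__main__":
    for text in ["Cant  SLEEP", "head is pounding", "so tired"]:
        print(repr(text), "->", normalise(text))
```

An exact-match baseline like this fails on any phrasing it has not seen, which is why the task is usually treated as a learning problem over many phrase and concept pairs.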