five

biomed_NER

收藏
魔搭社区2025-12-03 更新2024-12-28 收录
下载链接:
https://modelscope.cn/datasets/knowledgator/biomed_NER
下载链接
链接失效反馈
官方服务:
资源简介:
### BioMed_general_NER This dataset consists of manually annotated biomedical abstracts from PubMed, drug descriptions from FDA and abstracts from patents. It was extracted 24 different entity types, including those specific to medicine and biology and general such as location and organization as well. This is one of the biggest datasets of such kind, which consists of 4840 annotated abstracts. ### Classes Here's a description for each of the labels: 1. **CHEMICALS** - Represents substances with distinct molecular composition, often involved in various biological or industrial processes. 2. **CLINICAL DRUG** - Refers to pharmaceutical substances developed for medical use, aimed at preventing, treating, or managing diseases. 3. **BODY SUBSTANCE** - Denotes materials or substances within the human body, including fluids, tissues, and other biological components. 4. **ANATOMICAL STRUCTURE** - Describes specific parts or structures within an organism's body, often related to anatomy and physiology. 5. **CELLS AND THEIR COMPONENTS** - Encompasses the basic structural and functional units of living organisms, along with their constituent elements. 6. **GENE AND GENE PRODUCTS** - Involves genetic information and the resultant products, such as proteins, that play a crucial role in biological processes. 7. **INTELLECTUAL PROPERTY** - Pertains to legal rights associated with creations of the mind, including inventions, literary and artistic works, and trademarks. 8. **LANGUAGE** - Relates to linguistic elements, including words, phrases, and language constructs, often in the context of communication or analysis. 9. **REGULATION OR LAW** - Represents rules, guidelines, or legal frameworks established by authorities to govern behavior, practices, or procedures. 10. **GEOGRAPHICAL AREAS** - Refers to specific regions, locations, or places on the Earth's surface, often associated with particular characteristics or significance. 11. **ORGANISM** - Denotes a living being, typically a plant, animal, or microorganism, as a distinct biological entity. 12. **GROUP** - Encompasses collections of individuals with shared characteristics, interests, or affiliations. 13. **PERSON** - Represents an individual human being, often considered as a distinct entity with personal attributes. 14. **ORGANIZATION** - Refers to structured entities, institutions, or companies formed for specific purposes or activities. 15. **PRODUCT** - Encompasses tangible or intangible items resulting from a process, often associated with manufacturing or creation. 16. **LOCATION** - Describes a specific place or position, whether physical or abstract, with potential relevance to various contexts. 17. **PHENOTYPE** - Represents the observable characteristics or traits of an organism, resulting from the interaction of its genotype with the environment. 18. **DISORDER** - Denotes abnormal conditions or disruptions in the normal functioning of a biological organism, often associated with diseases or medical conditions. 19. **SIGNALING MOLECULES** - Involves molecules that transmit signals within and between cells, playing a crucial role in various physiological processes. 20. **EVENT** - Describes occurrences or happenings at a specific time and place, often with significance or impact. 21. **MEDICAL PROCEDURE** - Involves specific actions or interventions conducted for medical purposes, such as surgeries, diagnostic tests, or therapeutic treatments. 22. **ACTIVITY** - Encompasses actions, behaviors, or processes undertaken by individuals, groups, or entities. 23. **FUNCTION** - Describes the purpose or role of a biological or mechanical entity, focusing on its intended or inherent activities. 24. **MONEY** - Represents currency or financial assets used as a medium of exchange, often in the context of economic transactions. ### Datasources * PubMed - biomedical articles abstracts; * FDA - drugs descriptions; * Patents - patents abstracts;

### BioMed_general_NER 本数据集包含经人工标注的PubMed生物医学文献摘要、美国食品药品监督管理局(FDA)药物说明文档以及专利文献摘要。 该数据集涵盖24种不同的实体类型,既包含医学与生物学专属实体,也涵盖地理位置、组织机构等通用实体。 本数据集是此类规模最大的数据集之一,共包含4840篇经标注的摘要。 ### 类别 以下为各标签的详细说明: 1. **化学物质(CHEMICALS)**:指代具有明确分子组成的物质,常参与各类生物或工业过程。 2. **临床药物(CLINICAL DRUG)**:指为医疗用途开发的药物物质,用于预防、治疗或管控疾病。 3. **机体物质(BODY SUBSTANCE)**:指人体内的各类物质,包括体液、组织及其他生物成分。 4. **解剖结构(ANATOMICAL STRUCTURE)**:描述生物体内的特定部位或结构,通常与解剖学和生理学相关。 5. **细胞及其组分(CELLS AND THEIR COMPONENTS)**:涵盖生物体的基本结构与功能单元,及其组成成分。 6. **基因及其产物(GENE AND GENE PRODUCTS)**:涉及遗传信息及其表达产物(如蛋白质),这些物质在生物过程中发挥关键作用。 7. **知识产权(INTELLECTUAL PROPERTY)**:指与智力创作相关的法定权利,包括发明、文学与艺术作品及商标等。 8. **语言(LANGUAGE)**:关联语言要素,包括词汇、短语及语言结构,通常用于沟通或分析场景。 9. **法规与法律(REGULATION OR LAW)**:指代权威机构制定的用于规范行为、实践或流程的规则、指南或法律框架。 10. **地理区域(GEOGRAPHICAL AREAS)**:指地球表面的特定区域、地点或场所,通常具有特定特征或意义。 11. **生物体(ORGANISM)**:指代具有独立生命属性的活体,通常为植物、动物或微生物。 12. **群体(GROUP)**:涵盖具有共同特征、兴趣或隶属关系的个体集合。 13. **人物(PERSON)**:指代独立的人类个体,通常被视为具有个人属性的独特实体。 14. **组织机构(ORGANIZATION)**:指为特定目标或活动组建的结构化实体、机构或公司。 15. **产品(PRODUCT)**:涵盖由某一过程产生的有形或无形物品,通常与制造或创作相关。 16. **位置(LOCATION)**:描述特定的地点或方位,包括物理或抽象层面,可应用于多种场景。 17. **表型(PHENOTYPE)**:指生物体可观测的特征或性状,由基因型与环境的相互作用共同决定。 18. **机能紊乱(DISORDER)**:指代生物机体正常功能出现异常或中断的状态,通常与疾病或医学病症相关。 19. **信号分子(SIGNALING MOLECULES)**:指在细胞内外传递信号的分子,在各类生理过程中发挥关键作用。 20. **事件(EVENT)**:描述特定时间与地点发生的事件或现象,通常具有重要意义或影响。 21. **医疗操作(MEDICAL PROCEDURE)**:指为医疗目的开展的特定行动或干预措施,例如手术、诊断检测或治疗手段。 22. **活动(ACTIVITY)**:涵盖个体、群体或实体所实施的行动、行为或过程。 23. **功能(FUNCTION)**:描述生物或机械实体的用途或角色,聚焦于其预期或固有活动。 24. **货币(MONEY)**:指代作为交换媒介的货币或金融资产,通常应用于经济交易场景。 ### 数据源 * PubMed:生物医学文献摘要; * FDA:药物说明文档; * 专利:专利文献摘要;
提供机构:
maas
创建时间:
2024-12-26
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作