FiscaAI/icd10gm-kodes
收藏Hugging Face2025-01-02 更新2025-11-01 收录
下载链接:
https://hf-mirror.com/datasets/FiscaAI/icd10gm-kodes
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: classification_level
dtype: string
- name: location_in_tree
dtype: string
- name: type_of_four_and_five_digit
dtype: string
- name: chapter_number
dtype: string
- name: first_three_digit_code
dtype: string
- name: code_without_cross
dtype: string
- name: code_without_dash_etc
dtype: string
- name: code_plain
dtype: string
- name: combined_class_title
dtype: string
- name: three_digit_code_title
dtype: string
- name: four_digit_code_title
dtype: string
- name: five_digit_code_title
dtype: string
- name: usage_paragraph_295
dtype: string
- name: usage_paragraph_301
dtype: string
- name: mortality_list_1_reference
dtype: string
- name: mortality_list_2_reference
dtype: string
- name: mortality_list_3_reference
dtype: string
- name: mortality_list_4_reference
dtype: string
- name: morbidity_list_reference
dtype: string
- name: gender_relation
dtype: string
- name: gender_error_type
dtype: string
- name: lower_age_limit
dtype: string
- name: upper_age_limit
dtype: string
- name: age_relation_error_type
dtype: string
- name: rare_in_central_europe
dtype: string
- name: code_content_occupied
dtype: string
- name: ifsg_notification
dtype: string
- name: ifsg_lab
dtype: string
splits:
- name: train
num_bytes: 5583760
num_examples: 16817
download_size: 961908
dataset_size: 5583760
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
```
icd10gm2025syst_kodes.txt - Codes of the ICD Systematics
Field 1: Classification level, 1 character
3 = Three-character
4 = Four-character
5 = Five-character
Field 2: Position of the key number in the classification tree, 1 character
T = Terminal key number (codable endpoint)
N = Non-terminal key number (not a codable endpoint)
Field 3: Type of four- and five-character codes
X = Explicitly listed (pre-combined)
S = By sub-classification (post-combined)
Field 4: Chapter number, max. 2 characters
Field 5: First three-character group code, 3 characters
Field 6: Key number without any cross, up to 7 characters
Field 7: Key number without hyphen, star, or exclamation mark, up to 6 characters
Field 8: Key number without dot, hyphen, star, or exclamation mark, up to 5 characters
Field 9: Class title, composed of parts of the titles of the three-character, four-character, and five-character codes, if available, up to 255 characters
Field 10: Title of the three-character code, up to 255 characters
Field 11: Title of the four-character code, if available, up to 255 characters
Field 12: Title of the five-character code, if available, up to 255 characters
Field 13: Use of the key number according to Paragraph 295
P = Allowed for primary coding
O = Allowed only as star key number
Z = Allowed only as exclamation key number
V = Not allowed for coding
Field 14: Use of the key number according to Paragraph 301
P = Allowed for primary coding
O = Allowed only as star key number
Z = Allowed only as exclamation key number
V = Not allowed for coding
Field 15: Reference to mortality list 1
Field 16: Reference to mortality list 2
Field 17: Reference to mortality list 3
Field 18: Reference to mortality list 4
Field 19: Reference to morbidity list
Field 20: Gender reference of the key number
9 = No gender reference
M = Male
W = Female
Field 21: Type of error related to gender reference
9 = Irrelevant
K = Optional error
Field 22: Lower age limit for a key number (A disease may occur from at least n completed days/years of life)
9999 = Irrelevant
t000 - t364 = From 0 days including fetal time to 364 days
e.g., t000 = From birth (1st day of life) including fetal time
t001 = From 1 completed day of life (from the 2nd day of life)
t002 = From 2 completed days of life (from the 3rd day of life)
etc. up to
t028 = From 28 completed days of life (from the 29th day, from the 2nd month)
etc. up to
t364 = From 364 completed days of life (from the 365th day of life)
j001 - j124 = From the 1st year to 124 years
e.g., j001 = From 1 completed year of life (from the 2nd year, from the 365th day of life)
j002 = From 2 completed years of life (from the 3rd year)
j003 = From 3 completed years of life (from the 4th year)
etc. up to
j124 = From 124 completed years of life (from the 125th year)
Field 23: Upper age limit for a key number (A disease may occur up to a maximum of m completed days/years of life)
9999 = Irrelevant
t000 - t364 = 0 days - up to 364 days
e.g., t000 = fetal, before birth
t001 = Up to 1 completed day of life (until the end of the 1st day of life)
t002 = Up to 2 completed days of life (until the end of the 2nd day of life)
etc. up to
t364 = Up to 364 completed days of life (until the end of the 364th day)
j001 - j124 = Up to 1 year – up to 124 years
e.g., j001 = Up to 1 completed year of life (until the end of the 1st year, until the end of the 365th day)
j002 = Up to 2 completed years of life (until the end of the 2nd year)
etc. up to
j124 = Up to 124 completed years of life (until the end of the 124th year)
Field 24: Type of error related to age reference
9 = Irrelevant
M = Must error
K = Optional error
Field 25: Disease very rare in Central Europe?
J = Yes (--> Can trigger optional error!)
N = No
Field 26: Key number populated with content?
J = Yes
N = No (--> Can trigger optional error!)
Field 27: IfSG report, indicates that diagnoses coded with this key number require special attention to the doctor’s reporting obligation under the Infection Protection Act (IfSG)
J = Yes
N = No
Field 28: IfSG laboratory, indicates that laboratory tests for these diagnoses may use the laboratory exclusion code of the EBM (32006)
J = Yes
N = No
```
数据集信息:
特征:
- 名称:分类层级(classification_level),数据类型:字符串
- 名称:分类树内位置(location_in_tree),数据类型:字符串
- 名称:四五位编码类型(type_of_four_and_five_digit),数据类型:字符串
- 名称:章节编号(chapter_number),数据类型:字符串
- 名称:首三位分组编码(first_three_digit_code),数据类型:字符串
- 名称:无交叉引用编码(code_without_cross),数据类型:字符串
- 名称:无连字符等符号编码(code_without_dash_etc),数据类型:字符串
- 名称:纯编码(code_plain),数据类型:字符串
- 名称:组合分类标题(combined_class_title),数据类型:字符串
- 名称:三位编码标题(three_digit_code_title),数据类型:字符串
- 名称:四位编码标题(four_digit_code_title),数据类型:字符串
- 名称:五位编码标题(five_digit_code_title),数据类型:字符串
- 名称:第295款使用说明(usage_paragraph_295),数据类型:字符串
- 名称:第301款使用说明(usage_paragraph_301),数据类型:字符串
- 名称:死亡列表1参考文献(mortality_list_1_reference),数据类型:字符串
- 名称:死亡列表2参考文献(mortality_list_2_reference),数据类型:字符串
- 名称:死亡列表3参考文献(mortality_list_3_reference),数据类型:字符串
- 名称:死亡列表4参考文献(mortality_list_4_reference),数据类型:字符串
- 名称:发病列表参考文献(morbidity_list_reference),数据类型:字符串
- 名称:性别关联(gender_relation),数据类型:字符串
- 名称:性别错误类型(gender_error_type),数据类型:字符串
- 名称:年龄下限(lower_age_limit),数据类型:字符串
- 名称:年龄上限(upper_age_limit),数据类型:字符串
- 名称:年龄关联错误类型(age_relation_error_type),数据类型:字符串
- 名称:中欧罕见病标识(rare_in_central_europe),数据类型:字符串
- 名称:编码内容占用标识(code_content_occupied),数据类型:字符串
- 名称:IfSG通报要求(ifsg_notification),数据类型:字符串
- 名称:IfSG实验室标识(ifsg_lab),数据类型:字符串
划分集:
- 名称:训练集(train),字节数:5583760,样本数:16817
下载大小:961908,数据集总大小:5583760
配置:
- 配置名称:默认(default),数据文件:
- 划分集:训练集,路径:data/train-*
`icd10gm2025syst_kodes.txt` - ICD分类系统编码
字段1:分类层级,1个字符
3 = 三位编码
4 = 四位编码
5 = 五位编码
字段2:编码在分类树中的位置,1个字符
T = 终端编码(可编码终点)
N = 非终端编码(不可作为编码终点)
字段3:四五位编码类型
X = 明确列出(预组合)
S = 通过子分类生成(后组合)
字段4:章节编号,最多2个字符
字段5:首三位分组编码,3个字符
字段6:无任何交叉引用的编码,最长7个字符
字段7:无连字符、星号或感叹号的编码,最长6个字符
字段8:无点号、连字符、星号或感叹号的编码,最长5个字符
字段9:组合分类标题,由三位、四位、五位编码的标题部分组成(若有),最长255个字符
字段10:三位编码的标题,最长255个字符
字段11:四位编码的标题(若有),最长255个字符
字段12:五位编码的标题(若有),最长255个字符
字段13:根据第295款的编码使用规则
P = 允许作为主要编码
O = 仅可作为星号编码
Z = 仅可作为感叹号编码
V = 不允许用于编码
字段14:根据第301款的编码使用规则
P = 允许作为主要编码
O = 仅可作为星号编码
Z = 仅可作为感叹号编码
V = 不允许用于编码
字段15:死亡列表1参考文献
字段16:死亡列表2参考文献
字段17:死亡列表3参考文献
字段18:死亡列表4参考文献
字段19:发病列表参考文献
字段20:编码的性别关联
9 = 无性别关联
M = 男性
W = 女性
字段21:与性别关联相关的错误类型
9 = 无关
K = 可选错误
字段22:编码对应的年龄下限(疾病可在完成至少n天/年的生命后发生)
9999 = 无关
t000 - t364 = 0天(含胎儿期)至364天
例如,t000 = 从出生(生命第1天)起,含胎儿期
t001 = 从生命满1天起(即生命第2天起)
t002 = 从生命满2天起(即生命第3天起)
依此类推至
t028 = 从生命满28天起(即第29天起,第2个月起)
依此类推至
t364 = 从生命满364天起(即生命第365天起)
j001 - j124 = 从1年至124年
例如,j001 = 从生命满1年起(即第2年起,生命第365天起)
j002 = 从生命满2年起(即第3年起)
j003 = 从生命满3年起(即第4年起)
依此类推至
j124 = 从生命满124年起(即第125年起)
字段23:编码对应的年龄上限(疾病最多可在完成m天/年的生命后发生)
9999 = 无关
t000 - t364 = 0天至364天
例如,t000 = 胎儿期,出生前
t001 = 至生命满1天(即生命第1天结束时)
t002 = 至生命满2天(即生命第2天结束时)
依此类推至
t364 = 至生命满364天(即第364天结束时)
j001 - j124 = 至1年至124年
例如,j001 = 至生命满1年(即第1年结束时,生命第365天结束时)
j002 = 至生命满2年(即第2年结束时)
依此类推至
j124 = 至生命满124年(即第124年结束时)
字段24:与年龄关联相关的错误类型
9 = 无关
M = 必现错误
K = 可选错误
字段25:该疾病在中欧是否极为罕见?
J = 是(→ 可触发可选错误!)
N = 否
字段26:编码是否已填充内容?
J = 是
N = 否(→ 可触发可选错误!)
字段27:IfSG通报要求:标识使用该编码的诊断需根据《感染保护法》(IfSG)履行医生通报义务
J = 是
N = 否
字段28:IfSG实验室标识:标识此类诊断的实验室检测可使用德国医疗收费系统(EBM)编码32006作为实验室排除编码
J = 是
N = 否
提供机构:
FiscaAI



