jarrydmartinx/metabric2
Source: Hugging Face · Updated 2023-04-22 · Indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/jarrydmartinx/metabric2
Resource description:
---
dataset_info:
features:
- name: patient_id
dtype: int64
- name: age_at_diagnosis
dtype: float64
- name: type_of_breast_surgery
dtype: string
- name: cancer_type
dtype: string
- name: cancer_type_detailed
dtype: string
- name: cellularity
dtype: string
- name: chemotherapy
dtype: int64
- name: pam50_+_claudin-low_subtype
dtype: string
- name: cohort
dtype: float64
- name: er_status_measured_by_ihc
dtype: string
- name: er_status
dtype: string
- name: neoplasm_histologic_grade
dtype: float64
- name: her2_status_measured_by_snp6
dtype: string
- name: her2_status
dtype: string
- name: tumor_other_histologic_subtype
dtype: string
- name: hormone_therapy
dtype: int64
- name: inferred_menopausal_state
dtype: string
- name: integrative_cluster
dtype: string
- name: primary_tumor_laterality
dtype: string
- name: lymph_nodes_examined_positive
dtype: float64
- name: nottingham_prognostic_index
dtype: float64
- name: oncotree_code
dtype: string
- name: pr_status
dtype: string
- name: radio_therapy
dtype: int64
- name: 3-gene_classifier_subtype
dtype: string
- name: tumor_size
dtype: float64
- name: tumor_stage
dtype: float64
- name: death_from_cancer
dtype: string
- name: brca1
dtype: float64
- name: brca2
dtype: float64
- name: palb2
dtype: float64
- name: pten
dtype: float64
- name: tp53
dtype: float64
- name: atm
dtype: float64
- name: cdh1
dtype: float64
- name: chek2
dtype: float64
- name: nbn
dtype: float64
- name: nf1
dtype: float64
- name: stk11
dtype: float64
- name: bard1
dtype: float64
- name: mlh1
dtype: float64
- name: msh2
dtype: float64
- name: msh6
dtype: float64
- name: pms2
dtype: float64
- name: epcam
dtype: float64
- name: rad51c
dtype: float64
- name: rad51d
dtype: float64
- name: rad50
dtype: float64
- name: rb1
dtype: float64
- name: rbl1
dtype: float64
- name: rbl2
dtype: float64
- name: ccna1
dtype: float64
- name: ccnb1
dtype: float64
- name: cdk1
dtype: float64
- name: ccne1
dtype: float64
- name: cdk2
dtype: float64
- name: cdc25a
dtype: float64
- name: ccnd1
dtype: float64
- name: cdk4
dtype: float64
- name: cdk6
dtype: float64
- name: ccnd2
dtype: float64
- name: cdkn2a
dtype: float64
- name: cdkn2b
dtype: float64
- name: myc
dtype: float64
- name: cdkn1a
dtype: float64
- name: cdkn1b
dtype: float64
- name: e2f1
dtype: float64
- name: e2f2
dtype: float64
- name: e2f3
dtype: float64
- name: e2f4
dtype: float64
- name: e2f5
dtype: float64
- name: e2f6
dtype: float64
- name: e2f7
dtype: float64
- name: e2f8
dtype: float64
- name: src
dtype: float64
- name: jak1
dtype: float64
- name: jak2
dtype: float64
- name: stat1
dtype: float64
- name: stat2
dtype: float64
- name: stat3
dtype: float64
- name: stat5a
dtype: float64
- name: stat5b
dtype: float64
- name: mdm2
dtype: float64
- name: tp53bp1
dtype: float64
- name: adam10
dtype: float64
- name: adam17
dtype: float64
- name: aph1a
dtype: float64
- name: aph1b
dtype: float64
- name: arrdc1
dtype: float64
- name: cir1
dtype: float64
- name: ctbp1
dtype: float64
- name: ctbp2
dtype: float64
- name: cul1
dtype: float64
- name: dll1
dtype: float64
- name: dll3
dtype: float64
- name: dll4
dtype: float64
- name: dtx1
dtype: float64
- name: dtx2
dtype: float64
- name: dtx3
dtype: float64
- name: dtx4
dtype: float64
- name: ep300
dtype: float64
- name: fbxw7
dtype: float64
- name: hdac1
dtype: float64
- name: hdac2
dtype: float64
- name: hes1
dtype: float64
- name: hes5
dtype: float64
- name: heyl
dtype: float64
- name: itch
dtype: float64
- name: jag1
dtype: float64
- name: jag2
dtype: float64
- name: kdm5a
dtype: float64
- name: lfng
dtype: float64
- name: maml1
dtype: float64
- name: maml2
dtype: float64
- name: maml3
dtype: float64
- name: ncor2
dtype: float64
- name: ncstn
dtype: float64
- name: notch1
dtype: float64
- name: notch2
dtype: float64
- name: notch3
dtype: float64
- name: nrarp
dtype: float64
- name: numb
dtype: float64
- name: numbl
dtype: float64
- name: psen1
dtype: float64
- name: psen2
dtype: float64
- name: psenen
dtype: float64
- name: rbpj
dtype: float64
- name: rbpjl
dtype: float64
- name: rfng
dtype: float64
- name: snw1
dtype: float64
- name: spen
dtype: float64
- name: hes2
dtype: float64
- name: hes4
dtype: float64
- name: hes7
dtype: float64
- name: hey1
dtype: float64
- name: hey2
dtype: float64
- name: acvr1
dtype: float64
- name: acvr1b
dtype: float64
- name: acvr1c
dtype: float64
- name: acvr2a
dtype: float64
- name: acvr2b
dtype: float64
- name: acvrl1
dtype: float64
- name: akt1
dtype: float64
- name: akt1s1
dtype: float64
- name: akt2
dtype: float64
- name: apaf1
dtype: float64
- name: arl11
dtype: float64
- name: atr
dtype: float64
- name: aurka
dtype: float64
- name: bad
dtype: float64
- name: bcl2
dtype: float64
- name: bcl2l1
dtype: float64
- name: bmp10
dtype: float64
- name: bmp15
dtype: float64
- name: bmp2
dtype: float64
- name: bmp3
dtype: float64
- name: bmp4
dtype: float64
- name: bmp5
dtype: float64
- name: bmp6
dtype: float64
- name: bmp7
dtype: float64
- name: bmpr1a
dtype: float64
- name: bmpr1b
dtype: float64
- name: bmpr2
dtype: float64
- name: braf
dtype: float64
- name: casp10
dtype: float64
- name: casp3
dtype: float64
- name: casp6
dtype: float64
- name: casp7
dtype: float64
- name: casp8
dtype: float64
- name: casp9
dtype: float64
- name: chek1
dtype: float64
- name: csf1
dtype: float64
- name: csf1r
dtype: float64
- name: cxcl8
dtype: float64
- name: cxcr1
dtype: float64
- name: cxcr2
dtype: float64
- name: dab2
dtype: float64
- name: diras3
dtype: float64
- name: dlec1
dtype: float64
- name: dph1
dtype: float64
- name: egfr
dtype: float64
- name: eif4e
dtype: float64
- name: eif4ebp1
dtype: float64
- name: eif5a2
dtype: float64
- name: erbb2
dtype: float64
- name: erbb3
dtype: float64
- name: erbb4
dtype: float64
- name: fas
dtype: float64
- name: fgf1
dtype: float64
- name: fgfr1
dtype: float64
- name: folr1
dtype: float64
- name: folr2
dtype: float64
- name: folr3
dtype: float64
- name: foxo1
dtype: float64
- name: foxo3
dtype: float64
- name: gdf11
dtype: float64
- name: gdf2
dtype: float64
- name: gsk3b
dtype: float64
- name: hif1a
dtype: float64
- name: hla-g
dtype: float64
- name: hras
dtype: float64
- name: igf1
dtype: float64
- name: igf1r
dtype: float64
- name: inha
dtype: float64
- name: inhba
dtype: float64
- name: inhbc
dtype: float64
- name: itgav
dtype: float64
- name: itgb3
dtype: float64
- name: izumo1r
dtype: float64
- name: kdr
dtype: float64
- name: kit
dtype: float64
- name: kras
dtype: float64
- name: map2k1
dtype: float64
- name: map2k2
dtype: float64
- name: map2k3
dtype: float64
- name: map2k4
dtype: float64
- name: map2k5
dtype: float64
- name: map3k1
dtype: float64
- name: map3k3
dtype: float64
- name: map3k4
dtype: float64
- name: map3k5
dtype: float64
- name: mapk1
dtype: float64
- name: mapk12
dtype: float64
- name: mapk14
dtype: float64
- name: mapk3
dtype: float64
- name: mapk4
dtype: float64
- name: mapk6
dtype: float64
- name: mapk7
dtype: float64
- name: mapk8
dtype: float64
- name: mapk9
dtype: float64
- name: mdc1
dtype: float64
- name: mlst8
dtype: float64
- name: mmp1
dtype: float64
- name: mmp10
dtype: float64
- name: mmp11
dtype: float64
- name: mmp12
dtype: float64
- name: mmp13
dtype: float64
- name: mmp14
dtype: float64
- name: mmp15
dtype: float64
- name: mmp16
dtype: float64
- name: mmp17
dtype: float64
- name: mmp19
dtype: float64
- name: mmp2
dtype: float64
- name: mmp21
dtype: float64
- name: mmp23b
dtype: float64
- name: mmp24
dtype: float64
- name: mmp25
dtype: float64
- name: mmp26
dtype: float64
- name: mmp27
dtype: float64
- name: mmp28
dtype: float64
- name: mmp3
dtype: float64
- name: mmp7
dtype: float64
- name: mmp9
dtype: float64
- name: mtor
dtype: float64
- name: nfkb1
dtype: float64
- name: nfkb2
dtype: float64
- name: opcml
dtype: float64
- name: pdgfa
dtype: float64
- name: pdgfb
dtype: float64
- name: pdgfra
dtype: float64
- name: pdgfrb
dtype: float64
- name: pdpk1
dtype: float64
- name: peg3
dtype: float64
- name: pik3ca
dtype: float64
- name: pik3r1
dtype: float64
- name: pik3r2
dtype: float64
- name: plagl1
dtype: float64
- name: ptk2
dtype: float64
- name: rab25
dtype: float64
- name: rad51
dtype: float64
- name: raf1
dtype: float64
- name: rassf1
dtype: float64
- name: rheb
dtype: float64
- name: rictor
dtype: float64
- name: rps6
dtype: float64
- name: rps6ka1
dtype: float64
- name: rps6ka2
dtype: float64
- name: rps6kb1
dtype: float64
- name: rps6kb2
dtype: float64
- name: rptor
dtype: float64
- name: slc19a1
dtype: float64
- name: smad1
dtype: float64
- name: smad2
dtype: float64
- name: smad3
dtype: float64
- name: smad4
dtype: float64
- name: smad5
dtype: float64
- name: smad6
dtype: float64
- name: smad7
dtype: float64
- name: smad9
dtype: float64
- name: sptbn1
dtype: float64
- name: terc
dtype: float64
- name: tert
dtype: float64
- name: tgfb1
dtype: float64
- name: tgfb2
dtype: float64
- name: tgfb3
dtype: float64
- name: tgfbr1
dtype: float64
- name: tgfbr2
dtype: float64
- name: tgfbr3
dtype: float64
- name: tsc1
dtype: float64
- name: tsc2
dtype: float64
- name: vegfa
dtype: float64
- name: vegfb
dtype: float64
- name: wfdc2
dtype: float64
- name: wwox
dtype: float64
- name: zfyve9
dtype: float64
- name: arid1a
dtype: float64
- name: arid1b
dtype: float64
- name: cbfb
dtype: float64
- name: gata3
dtype: float64
- name: kmt2c
dtype: float64
- name: kmt2d
dtype: float64
- name: myh9
dtype: float64
- name: ncor1
dtype: float64
- name: pde4dip
dtype: float64
- name: ptprd
dtype: float64
- name: ros1
dtype: float64
- name: runx1
dtype: float64
- name: tbx3
dtype: float64
- name: abcb1
dtype: float64
- name: abcb11
dtype: float64
- name: abcc1
dtype: float64
- name: abcc10
dtype: float64
- name: bbc3
dtype: float64
- name: bmf
dtype: float64
- name: cyp2c8
dtype: float64
- name: cyp3a4
dtype: float64
- name: fgf2
dtype: float64
- name: fn1
dtype: float64
- name: map2
dtype: float64
- name: map4
dtype: float64
- name: mapt
dtype: float64
- name: nr1i2
dtype: float64
- name: slco1b3
dtype: float64
- name: tubb1
dtype: float64
- name: tubb4a
dtype: float64
- name: tubb4b
dtype: float64
- name: twist1
dtype: float64
- name: adgra2
dtype: float64
- name: afdn
dtype: float64
- name: aff2
dtype: float64
- name: agmo
dtype: float64
- name: agtr2
dtype: float64
- name: ahnak
dtype: float64
- name: ahnak2
dtype: float64
- name: akap9
dtype: float64
- name: alk
dtype: float64
- name: apc
dtype: float64
- name: arid2
dtype: float64
- name: arid5b
dtype: float64
- name: asxl1
dtype: float64
- name: asxl2
dtype: float64
- name: bap1
dtype: float64
- name: bcas3
dtype: float64
- name: birc6
dtype: float64
- name: cacna2d3
dtype: float64
- name: ccnd3
dtype: float64
- name: chd1
dtype: float64
- name: clk3
dtype: float64
- name: clrn2
dtype: float64
- name: col12a1
dtype: float64
- name: col22a1
dtype: float64
- name: col6a3
dtype: float64
- name: ctcf
dtype: float64
- name: ctnna1
dtype: float64
- name: ctnna3
dtype: float64
- name: dnah11
dtype: float64
- name: dnah2
dtype: float64
- name: dnah5
dtype: float64
- name: dtwd2
dtype: float64
- name: fam20c
dtype: float64
- name: fanca
dtype: float64
- name: fancd2
dtype: float64
- name: flt3
dtype: float64
- name: foxp1
dtype: float64
- name: frmd3
dtype: float64
- name: gh1
dtype: float64
- name: gldc
dtype: float64
- name: gpr32
dtype: float64
- name: gps2
dtype: float64
- name: hdac9
dtype: float64
- name: herc2
dtype: float64
- name: hist1h2bc
dtype: float64
- name: kdm3a
dtype: float64
- name: kdm6a
dtype: float64
- name: klrg1
dtype: float64
- name: l1cam
dtype: float64
- name: lama2
dtype: float64
- name: lamb3
dtype: float64
- name: large1
dtype: float64
- name: ldlrap1
dtype: float64
- name: lifr
dtype: float64
- name: lipi
dtype: float64
- name: magea8
dtype: float64
- name: map3k10
dtype: float64
- name: map3k13
dtype: float64
- name: men1
dtype: float64
- name: mtap
dtype: float64
- name: muc16
dtype: float64
- name: myo1a
dtype: float64
- name: myo3a
dtype: float64
- name: ncoa3
dtype: float64
- name: nek1
dtype: float64
- name: nf2
dtype: float64
- name: npnt
dtype: float64
- name: nr2f1
dtype: float64
- name: nr3c1
dtype: float64
- name: nras
dtype: float64
- name: nrg3
dtype: float64
- name: nt5e
dtype: float64
- name: or6a2
dtype: float64
- name: palld
dtype: float64
- name: pbrm1
dtype: float64
- name: ppp2cb
dtype: float64
- name: ppp2r2a
dtype: float64
- name: prkacg
dtype: float64
- name: prkce
dtype: float64
- name: prkcq
dtype: float64
- name: prkcz
dtype: float64
- name: prkg1
dtype: float64
- name: prps2
dtype: float64
- name: prr16
dtype: float64
- name: ptpn22
dtype: float64
- name: ptprm
dtype: float64
- name: rasgef1b
dtype: float64
- name: rpgr
dtype: float64
- name: ryr2
dtype: float64
- name: sbno1
dtype: float64
- name: setd1a
dtype: float64
- name: setd2
dtype: float64
- name: setdb1
dtype: float64
- name: sf3b1
dtype: float64
- name: sgcd
dtype: float64
- name: shank2
dtype: float64
- name: siah1
dtype: float64
- name: sik1
dtype: float64
- name: sik2
dtype: float64
- name: smarcb1
dtype: float64
- name: smarcc1
dtype: float64
- name: smarcc2
dtype: float64
- name: smarcd1
dtype: float64
- name: spaca1
dtype: float64
- name: stab2
dtype: float64
- name: stmn2
dtype: float64
- name: syne1
dtype: float64
- name: taf1
dtype: float64
- name: taf4b
dtype: float64
- name: tbl1xr1
dtype: float64
- name: tg
dtype: float64
- name: thada
dtype: float64
- name: thsd7a
dtype: float64
- name: ttyh1
dtype: float64
- name: ubr5
dtype: float64
- name: ush2a
dtype: float64
- name: usp9x
dtype: float64
- name: utrn
dtype: float64
- name: zfp36l1
dtype: float64
- name: ackr3
dtype: float64
- name: akr1c1
dtype: float64
- name: akr1c2
dtype: float64
- name: akr1c3
dtype: float64
- name: akr1c4
dtype: float64
- name: akt3
dtype: float64
- name: ar
dtype: float64
- name: bche
dtype: float64
- name: cdk8
dtype: float64
- name: cdkn2c
dtype: float64
- name: cyb5a
dtype: float64
- name: cyp11a1
dtype: float64
- name: cyp11b2
dtype: float64
- name: cyp17a1
dtype: float64
- name: cyp19a1
dtype: float64
- name: cyp21a2
dtype: float64
- name: cyp3a43
dtype: float64
- name: cyp3a5
dtype: float64
- name: cyp3a7
dtype: float64
- name: ddc
dtype: float64
- name: hes6
dtype: float64
- name: hsd17b1
dtype: float64
- name: hsd17b10
dtype: float64
- name: hsd17b11
dtype: float64
- name: hsd17b12
dtype: float64
- name: hsd17b13
dtype: float64
- name: hsd17b14
dtype: float64
- name: hsd17b2
dtype: float64
- name: hsd17b3
dtype: float64
- name: hsd17b4
dtype: float64
- name: hsd17b6
dtype: float64
- name: hsd17b7
dtype: float64
- name: hsd17b8
dtype: float64
- name: hsd3b1
dtype: float64
- name: hsd3b2
dtype: float64
- name: hsd3b7
dtype: float64
- name: mecom
dtype: float64
- name: met
dtype: float64
- name: ncoa2
dtype: float64
- name: nrip1
dtype: float64
- name: pik3r3
dtype: float64
- name: prkci
dtype: float64
- name: prkd1
dtype: float64
- name: ran
dtype: float64
- name: rdh5
dtype: float64
- name: sdc4
dtype: float64
- name: serpini1
dtype: float64
- name: shbg
dtype: float64
- name: slc29a1
dtype: float64
- name: sox9
dtype: float64
- name: spry2
dtype: float64
- name: srd5a1
dtype: float64
- name: srd5a2
dtype: float64
- name: srd5a3
dtype: float64
- name: st7
dtype: float64
- name: star
dtype: float64
- name: tnk2
dtype: float64
- name: tulp4
dtype: float64
- name: ugt2b15
dtype: float64
- name: ugt2b17
dtype: float64
- name: ugt2b7
dtype: float64
- name: event_time
dtype: float64
- name: event_indicator
dtype: int64
splits:
- name: train
num_bytes: 8074440
num_examples: 1904
download_size: 7639518
dataset_size: 8074440
---
# Dataset Card for "metabric"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
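The `dataset_info` front matter above lists every feature as a `- name:` line followed by a `dtype:` line. As a minimal sketch, assuming the indentation-free form shown on this page, the pairs can be collected with the standard library alone. The excerpt below copies only a handful of the listed columns, and `parse_features` is a hypothetical helper name.

```python
from collections import Counter

# A short excerpt of the flattened front matter (indentation already lost).
SCHEMA_EXCERPT = """\
- name: patient_id
dtype: int64
- name: age_at_diagnosis
dtype: float64
- name: cancer_type
dtype: string
- name: brca1
dtype: float64
- name: event_time
dtype: float64
- name: event_indicator
dtype: int64
"""


def parse_features(text: str) -> dict[str, str]:
    """Pair each '- name:' line with the 'dtype:' line that follows it."""
    features: dict[str, str] = {}
    current_name = None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("- name:"):
            current_name = line[len("- name:"):].strip()
        elif line.startswith("dtype:") and current_name is not None:
            features[current_name] = line[len("dtype:"):].strip()
            current_name = None
    return features


features = parse_features(SCHEMA_EXCERPT)
dtype_counts = Counter(features.values())
print(dtype_counts)
```

A `- name:` line that is not followed by `dtype:` (such as the `train` entry under `splits`) is simply never paired, so the same function can be run over the whole front matter to count columns by type.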
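Provider:
jarrydmartinx
Summary of original information
Dataset overview
Dataset information
Feature descriptions
The dataset contains the following features and their data types: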
- patient_id: patient ID (int64)
- age_at_diagnosis: age at diagnosis (float64)
- type_of_breast_surgery: type of breast surgery (string)
- cancer_type: cancer type (string)
- cancer_type_detailed: detailed cancer type (string)
- cellularity: cellularity (string)
- chemotherapy: whether chemotherapy was received (int64)
- pam50_+_claudin-low_subtype: PAM50 and claudin-low subtype (string)
- cohort: cohort (float64)
- er_status_measured_by_ihc: ER status measured by IHC (string)
- er_status: ER status (string)
- neoplasm_histologic_grade: tumor histologic grade (float64)
- her2_status_measured_by_snp6: HER2 status measured by SNP6 (string)
- her2_status: HER2 status (string)
- tumor_other_histologic_subtype: other tumor histologic subtype (string)
- hormone_therapy: whether hormone therapy was received (int64)
- inferred_menopausal_state: inferred menopausal state (string)
- integrative_cluster: integrative cluster (string)
- primary_tumor_laterality: primary tumor laterality (string)
- lymph_nodes_examined_positive: number of examined lymph nodes that were positive (float64)
- nottingham_prognostic_index: Nottingham Prognostic Index (float64)
- oncotree_code: OncoTree code (string)
- pr_status: PR status (string)
- radio_therapy: whether radiotherapy was received (int64)
- 3-gene_classifier_subtype: 3-gene classifier subtype (string)
- tumor_size: tumor size (float64)
- tumor_stage: tumor stage (float64)
- death_from_cancer: death from cancer (string)
- brca1: BRCA1 gene (float64)
- brca2: BRCA2 gene (float64)
- palb2: PALB2 gene (float64)
- pten: PTEN gene (float64)
- tp53: TP53 gene (float64)
- atm: ATM gene (float64)
- cdh1: CDH1 gene (float64)
- chek2: CHEK2 gene (float64)
- nbn: NBN gene (float64)
- nf1: NF1 gene (float64)
- stk11: STK11 gene (float64)
- bard1: BARD1 gene (float64)
- mlh1: MLH1 gene (float64)
- msh2: MSH2 gene (float64)
- msh6: MSH6 gene (float64)
- pms2: PMS2 gene (float64)
- epcam: EPCAM gene (float64)
- rad51c: RAD51C gene (float64)
- rad51d: RAD51D gene (float64)
- rad50: RAD50 gene (float64)
- rb1: RB1 gene (float64)
- rbl1: RBL1 gene (float64)
- rbl2: RBL2 gene (float64)
- ccna1: CCNA1 gene (float64)
- ccnb1: CCNB1 gene (float64)
- cdk1: CDK1 gene (float64)
- ccne1: CCNE1 gene (float64)
- cdk2: CDK2 gene (float64)
- cdc25a: CDC25A gene (float64)
- ccnd1: CCND1 gene (float64)
- cdk4: CDK4 gene (float64)
- cdk6: CDK6 gene (float64)
- ccnd2: CCND2 gene (float64)
- cdkn2a: CDKN2A gene (float64)
- cdkn2b: CDKN2B gene (float64)
- myc: MYC gene (float64)
- cdkn1a: CDKN1A gene (float64)
- cdkn1b: CDKN1B gene (float64)
- e2f1: E2F1 gene (float64)
- e2f2: E2F2 gene (float64)
- e2f3: E2F3 gene (float64)
- e2f4: E2F4 gene (float64)
- e2f5: E2F5 gene (float64)
- e2f6: E2F6 gene (float64)
- e2f7: E2F7 gene (float64)
- e2f8: E2F8 gene (float64)
- src: SRC gene (float64)
- jak1: JAK1 gene (float64)
- jak2: JAK2 gene (float64)
- stat1: STAT1 gene (float64)
- stat2: STAT2 gene (float64)
- stat3: STAT3 gene (float64)
- stat5a: STAT5A gene (float64)
- stat5b: STAT5B gene (float64)
- mdm2: MDM2 gene (float64)
- tp53bp1: TP53BP1 gene (float64)
- adam10: ADAM10 gene (float64)
- adam17: ADAM17 gene (float64)
- aph1a: APH1A gene (float64)
- aph1b: APH1B gene (float64)
- arrdc1: ARRDC1 gene (float64)
- cir1: CIR1 gene (float64)
- ctbp1: CTBP1 gene (float64)
- ctbp2: CTBP2 gene (float64)
- cul1: CUL1 gene (float64)
- dll1: DLL1 gene (float64)
- dll3: DLL3 gene (float64)
- dll4: DLL4 gene (float64)
- dtx1: DTX1 gene (float64)
- dtx2: DTX2 gene (float64)
- dtx3: DTX3 gene (float64)
- dtx4: DTX4 gene (float64)
- ep300: EP300 gene (float64)
- fbxw7: FBXW7 gene (float64)
- hdac1: HDAC1 gene (float64)
- hdac2: HDAC2 gene (float64)
- hes1: HES1 gene (float64)
- hes5: HES5 gene (float64)
- heyl: HEYL gene (float64)
- itch: ITCH gene (float64)
- jag1: JAG1 gene (float64)
- jag2: JAG2 gene (float64)
- kdm5a: KDM5A gene (float64)
- lfng: LFNG gene (float64)
- maml1: MAML1 gene (float64)
- maml2: MAML2 gene (float64)
- maml3: MAML3 gene (float64)
- ncor2: NCOR2 gene (float64)
- ncstn: NCSTN gene (float64)
- notch1: NOTCH1 gene (float64)
- notch2: NOTCH2 gene (float64)
- notch3: NOTCH3 gene (float64)
- nrarp: NRARP gene (float64)
- numb: NUMB gene (float64)
- numbl: NUMBL gene (float64)
- psen1: PSEN1 gene (float64)
- psen2: PSEN2 gene (float64)
- psenen: PSENEN gene (float64)
- rbpj: RBPJ gene (float64)
- rbpjl: RBPJL gene (float64)
- rfng: RFNG gene (float64)
- snw1: SNW1 gene (float64)
- spen: SPEN gene (float64)
- hes2: HES2 gene (float64)
- hes4: HES4 gene (float64)
- hes7: HES7 gene (float64)
- hey1: HEY1 gene (float64)
- hey2: HEY2 gene (float64)
- acvr1: ACVR1 gene (float64)
- acvr1b: ACVR1B gene (float64)
- acvr1c: ACVR1C gene (float64)
- acvr2a: ACVR2A gene (float64)
- acvr2b: ACVR2B gene (float64)
- acvrl1: ACVRL1 gene (float64)
- akt1: AKT1 gene (float64)
- akt1s1: AKT1S1 gene (float64)
- akt2: AKT2 gene (float64)
- apaf1: APAF1 gene (float64)
- arl11: ARL11 gene (float64)
- atr: ATR gene (float64)
- aurka: AURKA gene (float64)
- bad: BAD gene (float64)
- bcl2: BCL2 gene (float64)
- bcl2l1: BCL2L1 gene (float64)
- bmp10: BMP10 gene (float64)
- bmp15: BMP15 gene (float64)
- bmp2: BMP2 gene (float64)
- bmp3: BMP3 gene (float64)
- bmp4: BMP4 gene (float64)
- bmp5: BMP5 gene (float64)
- bmp6: BMP6 gene (float64)
- bmp7: BMP7 gene (float64)
- bmpr1a: BMPR1A gene (float64)
- bmpr1b: BMPR1B gene (float64)
- bmpr2: BMPR2 gene (float64)
- braf: BRAF gene (float64)
- casp10: CASP10 gene (float64)
- casp3: CASP3 gene (float64)
- casp6: CASP6 gene (float64)
- casp7: CASP7 gene (float64)
- casp8: CASP8 gene (float64)
- casp9: CASP9 gene (float64)
- chek1: CHEK1 gene (float64)
- csf1: CSF1 gene (float64)
- csf1r: CSF1R gene (float64)
- cxcl8: CXCL8 gene (float64)
- cxcr1: CXCR1 gene (float64)
- cxcr2: CXCR2 gene (float64)
- dab2: DAB2 gene (float64)
- diras3: DIRAS3 gene (float64)
- dlec1: DLEC1 gene (float64)
- dph1: DPH1 gene (float64)
- egfr: EGFR gene (float64)
- eif4e: EIF4E gene (float64)
- eif4ebp1: EIF4EBP1 gene (float64)
- eif5a2: EIF5A2 gene (float64)
- erbb2: ERBB2 gene (float64)
- erbb3: ERBB3 gene (float64)
- erbb4: ERBB4 gene (float64)
- fas: FAS gene (float64)
- fgf1: FGF1 gene (float64)
- fgfr1: FGFR1 gene (float64)
- folr1: FOLR1 gene (float64)
- folr2: FOLR2 gene (float64)
- folr3: FOLR3 gene (float64)
- foxo1: FOXO1 gene (float64)
- foxo3: FOXO3 gene (float64)
- gdf11: GDF11 gene (float64)
- gdf2: GDF2 gene (float64)
- gsk3b: GSK3B gene (float64)
- hif1a: HIF1
Collected summary
Dataset introduction

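Construction
This dataset is built from the METABRIC (Molecular Taxonomy of Breast Cancer International Consortium) project, which pooled clinical and genomic data from a large number of breast cancer patients. Its features include the patient's age at diagnosis, type of surgery, cancer type, treatment information, and gene expression levels. Systematic collection and integration are intended to keep the data consistent and complete, which makes the dataset a useful resource for breast cancer research and clinical applications.
Characteristics
The METABRIC2 dataset is multidimensional, covering clinical information, treatment information, and gene expression. Its distinguishing feature is the large number of gene expression columns, which cover many breast-cancer-related genes such as BRCA1 and BRCA2. It also provides each patient's survival time and an event indicator (the event_time and event_indicator columns), which supports survival analysis and prognostic modeling. These features make the dataset useful for molecular subtyping, treatment optimization, and prognosis prediction in breast cancer.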
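To make the role of the survival time and event indicator concrete, here is a minimal sketch of a Kaplan-Meier estimate written with the standard library only. The Kaplan-Meier estimator is a standard survival-analysis technique chosen here for illustration; the source does not name it. The sketch assumes that `event_indicator` is coded 1 for an observed event and 0 for a censored record, which the page does not confirm, and it runs on made-up toy values rather than on rows from the dataset.

```python
from collections import Counter


def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct time with at least one event.

    times  : follow-up durations (the role played by `event_time`)
    events : 1 if the event was observed, 0 if censored (assumed coding
             for `event_indicator`; verify against the data before use)
    """
    deaths = Counter(t for t, e in zip(times, events) if e == 1)
    exits = Counter(times)  # every subject leaves the risk set at its own time
    at_risk = len(times)
    survival = 1.0
    curve = []
    for t in sorted(exits):
        d = deaths.get(t, 0)
        if d > 0:
            survival *= 1.0 - d / at_risk
            curve.append((t, survival))
        at_risk -= exits[t]
    return curve


# Toy data, not taken from the dataset.
toy_times = [5.0, 8.0, 8.0, 12.0, 15.0]
toy_events = [1, 1, 0, 1, 0]
curve = kaplan_meier(toy_times, toy_events)
for t, s in curve:
    print(f"t={t:5.1f}  S(t)={s:.3f}")
```

Censored records still count toward the number at risk until their own follow-up time, which is why the event indicator is needed alongside the time column.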
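Usage
The METABRIC2 dataset suits many kinds of breast cancer research, including but not limited to survival analysis, prognostic modeling, and association analysis between gene expression and clinical features. Researchers can load the dataset and extract the relevant features for statistical analysis or for training and validating machine learning models. Because the data are tabular, with one typed column per feature, they can be processed with a wide range of programming languages and analysis tools. With sensible preprocessing and feature selection, researchers can look for patterns in the data that may inform precision treatment of breast cancer.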
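The loading and feature-extraction step described above might look like the following sketch. It assumes the third-party `datasets` package is installed and that the repository id `jarrydmartinx/metabric2` and its `train` split are reachable from the Hugging Face Hub, so the download is wrapped in a function and not run at import time. `split_row` is a hypothetical helper, and the example record uses invented values.

```python
def load_metabric2():
    """Download the single `train` split from the Hugging Face Hub.

    Requires the third-party `datasets` package and network access, so the
    import is kept inside the function.
    """
    from datasets import load_dataset

    return load_dataset("jarrydmartinx/metabric2", split="train")


def split_row(row, target_columns=("event_time", "event_indicator")):
    """Separate one record (a dict) into covariates and survival targets."""
    covariates = {k: v for k, v in row.items() if k not in target_columns}
    targets = {k: row[k] for k in target_columns}
    return covariates, targets


# Hypothetical record with a few of the listed columns; values are made up.
example_row = {
    "patient_id": 0,
    "age_at_diagnosis": 61.2,
    "brca1": -0.4,
    "event_time": 140.5,
    "event_indicator": 1,
}
covariates, targets = split_row(example_row)
print(sorted(covariates), targets)
```

Iterating over the object returned by `load_metabric2()` yields one dict per patient, so `split_row` can be applied to each record in turn. Identifier columns such as `patient_id` would normally be dropped from the covariates before modeling.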
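Background and challenges
Background overview
The METABRIC2 dataset was uploaded to Hugging Face by the user jarrydmartinx (last updated 2023-04-22) and focuses on gene expression and clinical data from breast cancer patients. It contains records for 1904 patients, spanning dimensions from gene expression to treatment information. By integrating data from multiple sources, it targets core problems of breast cancer prognosis and treatment strategy optimization. The dataset gives researchers a rich resource for work on personalized medicine and precision treatment.
Current challenges
Building the METABRIC2 dataset involves several challenges. First, the diversity and heterogeneity of the data sources make integration and cleaning harder. Second, gene expression data are complex and high-dimensional, which complicates feature selection and model building. Third, missing and inconsistent clinical values complicate analysis. Finally, using such sensitive data effectively while protecting patient privacy is an important ethical and legal question.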
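One way to get a first look at the missing-value problem mentioned above is to measure the missing fraction of a column and, as a simple baseline, fill gaps with the column median. This is only a sketch under stated assumptions: the source does not prescribe any imputation method, the helper names are hypothetical, and the toy rows below are invented. It treats both `None` and floating-point NaN as missing.

```python
import math
from statistics import median


def is_missing(value):
    """Treat None and float NaN as missing."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def missing_fraction(rows, column):
    """Share of rows whose value in `column` is missing."""
    return sum(is_missing(r.get(column)) for r in rows) / len(rows)


def impute_median(rows, column):
    """Return copies of rows with missing values replaced by the column median."""
    observed = [r[column] for r in rows if not is_missing(r.get(column))]
    fill = median(observed)
    return [
        {**r, column: fill if is_missing(r.get(column)) else r[column]}
        for r in rows
    ]


# Toy rows, not taken from the dataset.
toy_rows = [
    {"tumor_size": 22.0},
    {"tumor_size": None},
    {"tumor_size": 30.0},
    {"tumor_size": float("nan")},
    {"tumor_size": 10.0},
]
frac = missing_fraction(toy_rows, "tumor_size")
filled = impute_median(toy_rows, "tumor_size")
print(frac, [r["tumor_size"] for r in filled])
```

When the imputed values feed a predictive model, the median would normally be computed on the training portion only and then reused for held-out rows, to avoid leaking information across the split.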
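Common scenarios
Classic use case
In breast cancer research, the jarrydmartinx/metabric2 dataset is used to predict patient survival and treatment response. By analyzing gene expression data, clinical features, and treatment regimens, researchers can build models that predict prognosis and so inform personalized treatment.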
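A prognostic model of the kind described above is often judged by how well its risk scores rank patients. The sketch below computes Harrell's concordance index, a common metric for this purpose that is introduced here for illustration and is not named in the source. It assumes the same 1 = event, 0 = censored coding for `event_indicator`, and the times, events, and risk scores are toy values.

```python
def concordance_index(times, events, risk_scores):
    """Harrell's C: fraction of comparable pairs ranked correctly.

    A pair (i, j) is comparable when subject i has the shorter time and an
    observed event. A higher risk score should go with the shorter time.
    Ties in risk count as half; pairs with tied times are skipped here.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            if times[i] < times[j] and events[i] == 1:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Toy data, not taken from the dataset.
toy_times = [5.0, 8.0, 12.0, 15.0]
toy_events = [1, 0, 1, 0]
toy_risk = [0.9, 0.2, 0.6, 0.1]
c_index = concordance_index(toy_times, toy_events, toy_risk)
print(c_index)
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking. The double loop is quadratic in the number of patients, which is acceptable for a table of 1904 rows but would need a faster algorithm for much larger cohorts.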
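Practical applications
In clinical practice, the main application of the jarrydmartinx/metabric2 dataset is in designing personalized treatment plans. Models built on the gene expression and clinical features in the dataset could help clinicians choose the treatment strategy most likely to be effective, with the goal of improving treatment outcomes and patient survival.
Derived and related work
Work based on the jarrydmartinx/metabric2 dataset includes developing new predictive models, validating existing models, and exploring new therapeutic targets. Such studies advance breast cancer research and can also serve as a reference for research on other cancer types.
The content above was collected and summarized by 遇见数据集.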



