PCSPF-Pancreatic Cancer Survival based on Preoperative Features
收藏ieee-dataport.org2025-03-22 收录
下载链接:
https://ieee-dataport.org/documents/pcspf-pancreatic-cancer-survival-based-preoperative-features
下载链接
链接失效反馈官方服务:
资源简介:
The prognostic survival dataset, Pancreatic Cancer Survival based on Preoperative Features (PCSPF), was constructed to explore the impact of key preoperative features on prognosis based on the follow-up data of patients with pancreatic cancer at Changhai Hospital, Shanghai, China. Based on the suggestions of doctors, the PCSPF contained 20 preoperative features that they considered important, including sex, abdominal pain, age, body mass index (BMI), C-reactive protein (CRP), albumin (ALB), CRP/ALB, leukocyte, neutrocyte, platelet, lymphocyte, neutrocyte lymphocyte ratio (NLR), platelet lymphocyte ratio (PLR), systemic immune-inflammation index (SII), lactic dehydrogenase, carbohydrate antigen 19-9 (CA19-9), carcinoembryonic antigen (CEA), prealbumin, total bilirubin, and directed bilirubin. The most critical preoperative features affecting individual patients were selected from this list. Patients that survived for less than 90 days, had samples with missing values, or were labeled as survivors but had less than 365 days of follow-up were excluded from the initial 2,257 samples. These steps eliminated potential confounders that could have arisen from surgical interventions or statistical data. In total, 878 samples were selected to construct the dataset. In addition, pancreatic cancer survival prediction was formulated as a binary classification task because survival for at least one year is an important threshold for the prognosis of pancreatic cancer patients. Based on the survival duration, 1 and 0 denoted patients surviving one year or more and less than one year, respectively.
预测生存数据集《基于术前特征的胰腺癌生存情况》(PCSPF)旨在探究关键术前特征对预后影响,该数据集基于中国上海市长海医院胰腺癌患者的随访数据构建。依据医生的建议,PCSPF 包含了他们认为重要的20项术前特征,包括性别、腹部疼痛、年龄、体重指数(BMI)、C反应蛋白(CRP)、白蛋白(ALB)、CRP/ALB、白细胞、中性粒细胞、血小板、淋巴细胞、中性粒细胞与淋巴细胞比率(NLR)、血小板与淋巴细胞比率(PLR)、全身免疫炎症指数(SII)、乳酸脱氢酶、碳水化合物抗原19-9(CA19-9)、癌胚抗原(CEA)、前白蛋白、总胆红素和直接胆红素。从该列表中选取了最关键的术前特征以影响个体患者。排除标准包括生存时间少于90天、样本存在缺失值或被标记为生存者但随访时间少于365天,这些步骤消除了可能由手术干预或统计数据引起的潜在混杂因素。总计选取了878个样本构建该数据集。此外,将胰腺癌生存预测任务构建为二元分类任务,因为至少一年的生存时间是胰腺癌患者预后中的一个重要阈值。基于生存时间,1和0分别代表生存一年或以上和少于一年。
提供机构:
IEEE Dataport



