Preoperative gut microbiota data of colorectal cancer patients
收藏DataCite Commons2025-05-22 更新2025-05-18 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=cbcc148c82d543d893130336bd2d0f7c
下载链接
链接失效反馈官方服务:
资源简介:
Here is the English translation of your text:This dataset was collected from preoperative fecal samples of patients undergoing radical surgery for colorectal cancer, with the aim of investigating the characteristics of the gut microbiota in these patients and its clinical correlations. Sample collection was conducted between September 2021 and December 2023 at the Colorectal Surgery Department of Zhejiang Cancer Hospital, involving 434 hospitalized patients. The cohort includes individuals of various genders, age groups, and cancer stages, providing a high degree of representativeness. All patients had not received antibiotics, probiotics, or other medications that might affect the composition of the gut microbiota prior to sample collection, ensuring the objectivity and scientific validity of the data.Fecal samples were immediately stored at −80°C after collection. DNA extraction was performed using the QIAamp Fast DNA Stool Mini Kit (QIAGEN) under sterile conditions to avoid contamination. The V3–V4 region of the microbial 16S rRNA gene was sequenced using the Illumina MiSeq high-throughput sequencing platform. Raw sequencing data were subjected to quality control using Trimmomatic software to remove low-quality reads and adapter sequences. USEARCH software was then used for sequence clustering and construction of Operational Taxonomic Units (OTUs), followed by taxonomic annotation.The dataset is primarily organized in CSV table format, including approximately 434 individual sample records, each representing the gut microbiota composition of a single patient. The row labels in the data table are “Sample ID,” and the column labels include fields such as “OTU ID,” “Genus Name,” and “Relative Abundance,” with the latter expressed as a percentage (%). Basic clinical information (e.g., gender, age, cancer stage) is also included to facilitate analysis of the relationships between microbial structure and clinical factors. Additionally, the microbial abundance matrix is provided in BIOM format, which is compatible with microbial analysis tools such as QIIME2 and R (phyloseq package). QIIME2 can be downloaded from https://qiime2.org/.Due to low-quality sequences or failed DNA extraction in some samples, the final number of samples included in the analysis is slightly less than the total initially collected. Missing data are clearly indicated in the accompanying metadata file, with missing fields marked as “NA.” Despite strict sample processing protocols, sequencing errors may still be present, primarily due to PCR amplification bias and platform-specific issues. The estimated error rate is around 1%, and quality control and filtering procedures have been implemented to minimize this impact.This dataset provides a reliable foundation for future studies on microbiota-related mechanisms in colorectal cancer and the development of preoperative risk assessment models.
提供机构:
Science Data Bank
创建时间:
2025-04-30



