five

Generation of Genome-Wide Methylation Matrix from Bismark Coverage Files Using methylKit

收藏
DataCite Commons2025-05-22 更新2025-09-08 收录
下载链接:
https://figshare.com/articles/dataset/Generation_of_Genome-Wide_Methylation_Matrix_from_Bismark_Coverage_Files_Using_methylKit/29126885/1
下载链接
链接失效反馈
官方服务:
资源简介:
This R script processes bisulfite sequencing coverage files generated by the Bismark methylation extractor and produces a comprehensive CpG methylation matrix using the <b>methylKit</b> package. The matrix reports the percentage methylation for each CpG site across all samples, formatted for downstream statistical analysis and visualization.The script includes two pipelines:Reads and filters raw <code>.cov</code> files from Bismark.Alternatively, loads a pre-processed methylKit object (<code>finalmeth.RData</code>) to regenerate the methylation matrix.<b>Process Overview:</b>Reads 40 <code>.cov</code> files (CpG coverage) using <code>methRead()</code>, specifying treatment group assignments and metadata.Applies coverage filtering to remove extreme or low-confidence sites.Unites the methylation calls across samples using <code>unite()</code>.Converts to a percent methylation matrix via <code>percMethylation()</code>.Exports the resulting matrix as a tab-delimited file <code>erindata.txt</code>.<b>Inputs:</b>40 Bismark coverage files named like <code>W1.CpG_report.merged_CpG_evidence.cov</code>.Optional input: <code>finalmeth.RData</code> containing a pre-constructed <code>methylBase</code> object (<code>methBase_ob</code>).<b>Outputs:</b><code>erindata.txt</code>: a tab-separated table with columns:Sample-specific percent methylation values<code>Row_ID</code>: unique CpG identifiers in the format <code>chr_pos</code><b>Software Requirements:</b>R (≥ 4.0)Package: <code>methylKit</code> (Bioconductor)<b>Usage Notes:</b>Ensure filenames in <code>file.list</code> match the actual file names in your working directory.The <code>treatment</code> vector must be aligned with sample IDs for correct experimental grouping.CpG context only is supported (<code>context = "CpG"</code>).The script is configured for Bismark coverage format (<code>pipeline = "bismarkCoverage"</code>).Percent methylation is reported without strand collapsing (<code>destrand = FALSE</code>).
提供机构:
figshare
创建时间:
2025-05-22
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作