five

Preprocessing and Binomial Filtering of Bisulfite Sequencing Coverage Data

收藏
Figshare2025-05-22 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Preprocessing_and_Binomial_Filtering_of_Bisulfite_Sequencing_Coverage_Data/29126867
下载链接
链接失效反馈
官方服务:
资源简介:
This R script performs data preprocessing and statistical filtering on bisulfite sequencing output files (*.cov format) using a binomial test approach. The primary objective is to prepare methylation count data in a format compatible with the DSS package for downstream differential methylation analysis.The script applies a coverage threshold, computes binomial p-values against a specified background rate (default: 0.007), adjusts p-values using the Benjamini–Hochberg method, and exports three structured output files per sample:_Binomial_Applied.txt: Full filtered data with FDR-adjusted p-values._DSS_Format.txt: Reformatted table with columns chr, pos, N, and X for DSS._w_Unique.txt: Same as DSS format, with a unique site identifier column (chr_pos).Inputs:Tab-delimited coverage files matching *evidence.cov (e.g., from Bismark methylation extractor).Columns must include chromosome, start, end, percent methylation, count methylated (C), and count unmethylated (T).Outputs:For each input file:[Sample]__Binomial_Applied.txt[Sample]__DSS_Format.txt[Sample]__w_Unique.txtSoftware Requirements:R (≥ 4.0)Packages: readr, sqldf, doBy, dplyr, foreach, doParallelUsage Notes:This script is intended to run in a directory containing .cov files.Parallel processing is used for speed; adjust the number of cores with doParallel::registerDoParallel().Downstream DSS analysis expects *_DSS_Format.txt files to be loaded using makeBSseqData().
创建时间:
2025-05-22
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作