five

Stra8-AVI ChIP-seq

收藏
DataCite Commons2024-08-05 更新2024-08-19 收录
下载链接:
https://figshare.com/articles/dataset/Stra8-AVI_ChIP-seq/26496871
下载链接
链接失效反馈
官方服务:
资源简介:
ChIP-seq samples comprise four distinct groups: Stra8-AVI males, Stra8-AVI females, a combination of wild-type males and wild-type females, and a pooled input sample from both Stra8-AVI males and Stra8-AVI females.In this ChIP-seq analysis workflow, the data preprocessing begins with Cutadapt, which is used to trim adapters and low-quality sequences; it's configured with parameters <code>--times 1</code> to trim once per read, <code>-e 0.1</code> for an allowable error rate of 0.1, <pre>-O 3</pre> for a minimum overlap with the adapter of 3 bases, and <code>--quality-cutoff 25</code> to remove bases below a quality threshold of 25. Following trimming, Bowtie2 aligns the cleaned reads to the reference genome, omitting unaligned reads (<code>--no-unal</code>) and restricting insert sizes to a maximum of 2000 bases (<code>--maxins 2000</code>). Post-alignment, Samtools filters the aligned reads to include only those that are properly paired and primary (<code>-f 3</code>, <pre>-F 12</pre>, <code>-F 256</code>), using 10 parallel threads (<pre>-@ 10</pre>) to enhance processing speed. Deeptools then analyzes the filtered alignments for coverage visualization, applying a scaling factor of 1 (<pre>--scaleFactor 1</pre>), setting the bin size to 1 for high-resolution coverage maps (<code>--binSize 1</code>), and utilizing 10 threads (<code>-p 10</code>). Finally, Macs2 identifies and calls peaks based on these alignments; it operates on BAM formatted files (<code>-f BAM</code>), references the mouse genome size (<code>-g mm</code>), highlights peak summits (<code>--call-summits</code>), builds models across a range of 5 to 50 bases (<code>-m 5 50</code>), and sets a p-value cutoff of 0.05 for peak detection. To manage memory efficiently during these operations, Picard adjusts Java heap sizes to 40 gigabytes (<code>Xms40g -Xmx40g</code>) and optimizes parallel garbage collection with 20 threads (<code>-XX:ParallelGCThreads=20</code>). This sequence of tools and parameters ensures a rigorous and precise approach to ChIP-seq data analysis.<br>
提供机构:
figshare
创建时间:
2024-08-05
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作