five

Raw count matrix

收藏
DataCite Commons2020-10-30 更新2024-07-28 收录
下载链接:
https://figshare.com/articles/dataset/Raw_count_matrix/12320693
下载链接
链接失效反馈
官方服务:
资源简介:
Reads from Bioproject PRJNA628886 where aligned against reference transcriptome (Bioproject PRJNA236528, https://doi.org/10.5061/dryad.11978) with BWA mem (http://bio-bwa.sourceforge.net/bwa.shtml). <br>Quantification was performed with SAMtools<sup>1</sup> idxstats to generate the quantification matrix. p { margin-bottom: 0.25cm; direction: ltr; line-height: 120%; text-align: left; orphans: 2; widows: 2 } a:link { color: #0000ff } The matrix was filtered with edgeR<sup>2</sup> and only contigs with more than 1 CPM (Count Per Million) in at least one sample were kept, providing a matrix of 76,550 contigs. <br>File <b>PRJNA628886_raw_quantification_206K.tsv.gz </b>is the raw count matrix of the whole transcriptome. <br> File <b>76k_ids_list.txt</b> is the identifier list of contigs expressed at 1 CPM in our conditions. <br><br> p { margin-bottom: 0.25cm; direction: ltr; line-height: 120%; text-align: left; orphans: 2; widows: 2 } a:link { color: #0000ff } <sup>1</sup>SAMtools programs (view, sort, index and idxStats, flagstat): version 1.8, standard parameters. <i>Ref: Li, H. </i><i>et al.</i><i> </i><i>The Sequence Alignment/Map format and SAMtools. </i><i>Bioinformatics</i><i> </i><i><b>25</b></i><i>, 2078–2079 (2009).</i> <sup>2</sup>EdgeR: version 3.26.5. <i>Ref: Robinson, M. D., McCarthy, D. J. &amp; Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. </i><i>Bioinformatics</i><i> </i><i><b>26</b></i><i>, 139–140 (2010).</i><i><br></i>Related to bioproject: https://www.ncbi.nlm.nih.gov/bioproject/PRJNA628886
提供机构:
figshare
创建时间:
2020-05-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作