five

mosstrap-masked-coding.tar

收藏
Mendeley Data2024-01-31 更新2024-06-28 收录
下载链接:
http://datadryad.org/resource/doi:10.5061/dryad.475g7/1
下载链接
链接失效反馈
官方服务:
资源简介:
RNA-Seq was conducted on Illumina Hi-Seq 2000 platform, and reads were quality trimmed using Trimmomatic. De novo transcriptome assemblies were generated using Trinity, which can produce many isoforms. We used a phylogenetic approach to decide whether to retain isoforms. Transcripts which contained a valid protein with a BLASTp hit to a known land plant proteome were clustered using mcl, and gene trees were built from sequences in each cluster. To generate the "masked" dataset, the gene trees were searched for monophyletic clades containing sequences from the same species. Only one transcript per clade was retained, and the others were pruned from further analysis. This archive contains a multi-FASTA file of coding (DNA) sequences for each of the transcriptomes generated for the study.
创建时间:
2024-01-31
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作