
Additional file 2 of Using RNA-Seq for gene identification, polymorphism detection and transcript profiling in two alfalfa genotypes with divergent cell wall composition in stems

Mendeley Data. Updated 2024-06-25; indexed 2024-06-28.
Download link:
https://springernature.figshare.com/articles/dataset/Additional_file_2_of_Using_RNA-Seq_for_gene_identification_polymorphism_detection_and_transcript_profiling_in_two_alfalfa_genotypes_with_divergent_cell_wall_composition_in_stems/12875820/1
Resource description:
Additional file 2: Alfalfa Gene Index 1.0 (MSGI 1.0). A FASTA file containing the Alfalfa Gene Index 1.0 (MSGI 1.0) sequences. MSGI 1.0 contains a total of 124,025 unique sequences: 22,729 tentative consensus sequences (TCs), 22,315 singletons and 78,981 pseudo-singletons. The average length of the unique sequences in MSGI 1.0 is 384 bp (100 bp minimum and 6,956 bp maximum), with more than 10,000 sequences longer than 800 bp. The total base count of the sequences in MSGI 1.0 is 47,628,953 bp. Unfortunately, the current pipeline of the DFCI Gene Index database (http://compbio.dfci.harvard.edu/tgi/) is not suited to short reads (personal communication with a DFCI Gene Index staff member). The Gene Index Project team has indicated that it plans to address this issue soon. When a gene index database is established for alfalfa, MSGI 1.0 will be uploaded to the DFCI Gene Index database. (ZIP, 15 MB)
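
The summary figures quoted above (sequence count, total bases, mean, minimum and maximum length, and the number of sequences longer than 800 bp) could be rechecked against the unzipped FASTA file with a short script. The following is only a minimal sketch in plain Python; the function names `fasta_lengths` and `summarize` are illustrative and are not part of the dataset or of any published pipeline, and the layout of the file inside the ZIP archive is not documented here.

```python
from typing import Iterable, Iterator


def fasta_lengths(lines: Iterable[str]) -> Iterator[int]:
    """Yield the sequence length of each record in FASTA-formatted lines."""
    length = None  # stays None until the first '>' header is seen
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if length is not None:
                yield length  # close the previous record
            length = 0
        elif line and length is not None:
            length += len(line)  # sequences may wrap over several lines
    if length is not None:
        yield length  # close the final record


def summarize(lines: Iterable[str], long_cutoff: int = 800) -> dict:
    """Compute the same kind of summary statistics the description reports."""
    lengths = list(fasta_lengths(lines))
    total = sum(lengths)
    return {
        "count": len(lengths),
        "total_bp": total,
        "mean_bp": total / len(lengths) if lengths else 0.0,
        "min_bp": min(lengths, default=0),
        "max_bp": max(lengths, default=0),
        "longer_than_cutoff": sum(1 for n in lengths if n > long_cutoff),
    }
```

Passing an open file handle for the extracted FASTA file to `summarize` returns a dictionary that can be compared with the reported values. The three reported categories are consistent with the reported total: 22,729 + 22,315 + 78,981 = 124,025.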

Created:
2023-06-28