Data for PhD Thesis on Next Generation Nematode Genomes
DataCite Commons. Updated: 2020-09-05. Indexed: 2024-07-25.
Download link:
https://figshare.com/articles/dataset/Data_for_PhD_Thesis__Next-generation_Nematode_Genomes__-_Sujai_Kumar/96089/5
Resource description:
Data for PhD thesis on "Next-generation Nematode Genomes" by Sujai Kumar.
(Note: The thesis itself will be made publicly available after the viva/oral examination is complete.)
Update: Thesis available at http://hdl.handle.net/1842/7609 (https://www.era.lib.ed.ac.uk/handle/1842/7609)
--------------------------------------------------------------------
Species Abbreviations:
<em>Trichinella spiralis</em> (ts)
<em>Ascaris suum</em> (as)
<em>Dirofilaria immitis</em> (di)
<em>Brugia malayi</em> (bm)
<em>Litomosoides sigmodontis</em> (ls)
<em>Acanthocheilonema viteae</em> (av)
<em>Strongyloides ratti</em> (sr)
<em>Bursaphelenchus xylophilus</em> (bx)
<em>Meloidogyne hapla</em> (mh)
<em>Meloidogyne incognita</em> (mi)
<em>Meloidogyne floridensis</em> (mf)
<em>Pristionchus pacificus</em> (pp)
<em>Caenorhabditis angaria</em> (ca)
<em>Caenorhabditis japonica</em> (cj)
<em>Caenorhabditis elegans</em> (ce)
<em>Caenorhabditis brenneri</em> (cbn)
<em>Caenorhabditis sp. 11</em> (csp11)
<em>Caenorhabditis remanei</em> (cr)
<em>Caenorhabditis briggsae</em> (cbg)
<em>Caenorhabditis sp. 5</em> (csp5)
--------------------------------------------------------------------
File descriptions:
--------------------------------------------------------------------
<strong>Chapter 3: Annotating nematode genomes</strong>
- 20_nematode_protein_files.tgz
  This tgz file has the 20 nematode protein fasta files used in Chapter 3 "Annotating nematode genomes". The original files were obtained from WormBase (WS230), http://nematod.es, and www.inra.fr/meloidogyne_incognita/genomic_resources. The fasta files have been cleaned up:
    a) all whitespace in sequence headers has been converted to spaces (otherwise NCBI's makeblastdb fails)
    b) multi-line sequences have been converted to single line
    c) sequence IDs have been prefixed with a species abbreviation
- 20_nematode_genome_files_part{1,2,3}.tgz
  These three tgz files contain the nematode genome nucleotide fasta files. The original files were obtained from WormBase (WS230), http://nematod.es, and www.inra.fr/meloidogyne_incognita/genomic_resources. The fasta files have been cleaned up:
    a) multi-line sequences have been converted to single line
    b) sequence IDs have been prefixed with a species abbreviation
- 20_nematode_blast2go.annot.goslim.tgz
  20 Blast2GO annotation files, one for each nematode proteome
- 20_nematode_iprscan.tgz
  20 proteomes with InterProScan annotations
- 20_nematode_tRNA_counts.xls
  tRNA counts for 20 nematode genomes
- 20_nematode_tRNAscan_gff.tgz
  tRNA locations for 20 nematode genomes (GFF format)
- 20_nematode_rfamscan_gff.tgz
  Rfamscan output for 20 nematode genomes (GFF format)
--------------------------------------------------------------------
<strong>Chapter 4: Lack of deeply conserved non-coding elements in nematodes</strong>
- tba.alignments.tar
  Whole-genome multiple alignment files for specific nodes in the nematode phylogeny: Clade III, Onchocercidae, Clade IV, Meloidogyne, Clade V, Caenorhabditis, Elegans group
- tba.alignments.CNEs.tar
  Conserved non-coding element (CNE) multiple alignment files for specific nodes in the nematode phylogeny (whole-genome multiple alignments with coding regions removed)
- tba.alignments.CNEs.stats.tgz
  Tab-delimited files with length and relative identity for each CNE
- pairwise.megablast.tar
  Pairwise MegaBLAST alignments for all 20 genomes
- megablast.cluster.tgz
  MegaBLAST-based clusters of CNEs
--------------------------------------------------------------------
<strong>Chapter 5: The <em>Meloidogyne floridensis</em> genome reveals complex hybrid origins of the root-knot nematodes</strong>
- protein.faa.tgz
  Protein sets used for M. hapla, M. incognita, and M. floridensis after truncating at stop codons and filtering short proteins (protein fasta files)
- cds.fna.tgz
  CDS transcript files corresponding to proteins in M. hapla, M. incognita, and M. floridensis (nucleotide fasta files)
- mhmimf.98.self.id
  Tab-delimited file with self-identity scores for each CDS in each species
- InParanoid-mh-mi-mf.tgz
  InParanoid results (pairwise clustering)
- QuickParanoid-mh-mi-mf.tgz
  QuickParanoid results (orthologous clusters across three species)
- raxml-mh-mi-mf.tgz
  Phylogenetic trees for each QuickParanoid cluster
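
The fasta clean-up described for the Chapter 3 files (whitespace in sequence headers converted to spaces, multi-line sequences joined onto a single line, sequence IDs prefixed with a species abbreviation) could be reproduced roughly as follows. This is a minimal Python sketch, not the script used for the thesis: the function name `clean_fasta` is hypothetical, and the `_` separator between the species abbreviation and the original ID is an assumption, since the description does not say how the prefix is joined.

```python
import re


def clean_fasta(lines, species_abbrev, sep="_"):
    """Yield cleaned fasta lines from an iterable of raw lines.

    - every whitespace character in a header becomes a single space
    - each record's sequence is emitted on one line
    - each sequence ID is prefixed with `species_abbrev`
    The separator `sep` is an assumption, not taken from the dataset.
    """
    seq_parts = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line.startswith(">"):
            # Flush the sequence collected for the previous record.
            if seq_parts:
                yield "".join(seq_parts)
                seq_parts = []
            # Replace tabs and other whitespace in the header with spaces.
            header = re.sub(r"\s", " ", line[1:])
            yield f">{species_abbrev}{sep}{header}"
        elif line.strip():
            # Collect sequence chunks; blank lines are skipped.
            seq_parts.append(line.strip())
    # Flush the final record.
    if seq_parts:
        yield "".join(seq_parts)


if __name__ == "__main__":
    raw = [">gene1\tsome description\n", "MKT\n", "LLV\n", ">gene2\n", "MAA\n"]
    for cleaned in clean_fasta(raw, "ce"):
        print(cleaned)
```

Because the function takes any iterable of lines, it can be pointed at an open file handle as easily as at a list, which keeps memory use low for whole-genome fasta files.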
Provider:
figshare
Created:
2016-01-11



