Data from: Genome reannotation of the lizard Anolis carolinensis based on 14 adult and embryonic deep transcriptomes
收藏DataCite Commons2025-06-01 更新2025-06-15 收录
下载链接:
https://datadryad.org/dataset/doi:10.5061/dryad.17bn1
下载链接
链接失效反馈官方服务:
资源简介:
Background: The green anole lizard, Anolis carolinensis, is a key species
for both laboratory and field-based studies of evolutionary genetics,
development, neurobiology, physiology, behavior, and ecology. As the first
non-avian reptilian genome sequenced, A. carolinesis is also a prime
reptilian model for comparison with other vertebrate genomes. The public
databases of Ensembl and NCBI have provided a first generation gene
annotation of the anole genome that relies primarily on sequence
conservation with related species. A second generation annotation based on
tissue-specific transcriptomes would provide a valuable resource for
molecular studies. Results: Here we provide an annotation of the A.
carolinensis genome based on de novo assembly of deep transcriptomes of 14
adult and embryonic tissues. This revised annotation describes 59,373
transcripts, compared to 16,533 and 18,939 currently for Ensembl and NCBI,
and 22,962 predicted protein-coding genes. A key improvement in this
revised annotation is coverage of untranslated region (UTR) sequences,
with 79% and 59% of transcripts containing 5' and 3' UTRs,
respectively. Gaps in genome sequence from the current A. carolinensis
build (Anocar2.0) are highlighted by our identification of 16,542 unmapped
transcripts, representing 6,695 orthologues, with less than 70% genomic
coverage. Conclusions: Incorporation of tissue-specific transcriptome
sequence into the A. carolinensis genome annotation has markedly improved
its utility for comparative and functional studies. Increased UTR coverage
allows for more accurate predicted protein sequence and regulatory
analysis. This revised annotation also provides an atlas of gene
expression specific to adult and embryonic tissues.
提供机构:
Dryad
创建时间:
2013-03-19



