five

Dataset: Combined Annotation Dependent Depletion (CADD) scores for turkey and chicken

收藏
DataCite Commons2025-11-06 更新2025-11-15 收录
下载链接:
https://data.4tu.nl/datasets/f2ff2a38-0766-48f0-99f1-65d875ba81d4
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains genome-wide CADD (Combined Annotation Dependent Depletion) scores for chicken and turkey, generated as part of research aimed at predicting the deleteriousness of genetic variants in non-model species. The objective of the study was to develop and apply a generic, species-agnostic pipeline that computes CADD scores using only a high-quality reference genome, corresponding gene annotation, and a multi-species alignment (MSA) to infer ancestral sequences. The research involved computational methods rather than experimental sample collection; genomic reference assemblies, available functional annotations, and an evolutionary MSA were used as input features to train a machine learning model that assigns PHRED-like CADD scores to all possible single nucleotide variants across the genome. The resulting data consist of chromosome-wise tab-delimited files containing CADD scores for chicken (<code>chr{chr}.tsv.gz</code>) and turkey (<code>Turkey_chr{chr}.tsv.gz</code>), which can be used for comparative genomics, evolutionary analyses, and prioritization of candidate variants in genomic and breeding studies. The work is described in the publication <em>“A generic pipeline for CADD Score generation: chickenCADD and turkeyCADD”</em>, accepted in <em>G3</em>.
提供机构:
4TU.ResearchData
创建时间:
2025-11-06
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作