人类基因染色体分类大模型数据集
收藏河北数据知识产权登记系统2025-09-06 收录
下载链接:
https://dataip.hebamr.cn/#/changeDetialCertical?pType=登记&cType=登记&id=3bcfa63e15c50dcfbd4fc1a883a4d351
下载链接
链接失效反馈官方服务:
资源简介:
该数据集涉及基因变异、基因功能、表型信息和测序技术等多个维度的数据,旨在为基因组学研究、疾病关联研究、个性化医疗和精准治疗提供强大的支持。具体包括:样本ID:该字段记录每个样本的唯一标识符,染色体带型:表示基因所处的染色体带型,染色体长度(Mb):记录染色体的总长度,着丝粒位置(Mb):记录染色体着丝粒的位置,基因编号:每个基因的唯一标识符,基因长度(kb):记录基因的长度,基因位置:基因在染色体上的具体位置,基因功能:描述基因在生物体内的功能,通路信息:描述基因所属的生物学通路,外显子数:该基因的外显子(编码区域)的数量,内含子数:该基因的内含子(非编码区域)的数量,是否存在变异点位:指示该基因是否包含的变异位点,变异点位信息:记录基因中的变异点位,表型信息:记录与基因变异相关的临床表型,测序技术描述:记录用于基因组分析的测序平台和技术,基因组功能注释:基因组功能的注释内容,实验条件:样本处理的实验条件;分类结果:基于基因功能和表型信息,结合其他调整系数(如基因长度、基因变异、实验条件等),对基因进行染色体分类。
This dataset encompasses multi-dimensional data including genetic variations, gene functions, phenotypic information, sequencing technologies and other related domains, aiming to provide robust support for genomics research, disease association studies, personalized medicine and precision therapy. Specifically, the dataset includes the following fields: Sample ID: This field records the unique identifier of each sample; chromosomal banding pattern: indicates the chromosomal banding pattern where the gene is located; chromosome length (Mb): records the total length of the chromosome; centromere position (Mb): records the position of the chromosome centromere; gene number: the unique identifier of each gene; gene length (kb): records the length of the gene; gene location: the specific position of the gene on the chromosome; gene function: describes the function of the gene in a living organism; pathway information: describes the biological pathway to which the gene belongs; number of exons: the count of exons (coding regions) of the gene; number of introns: the count of introns (non-coding regions) of the gene; presence of variant sites: indicates whether the gene harbors variant sites; variant site information: records the variant sites within the gene; phenotypic information: records clinical phenotypes associated with genetic variations; sequencing technology description: records the sequencing platforms and technologies utilized for genomic analysis; genomic functional annotation: annotation content of genomic functions; experimental conditions: experimental conditions applied during sample processing; classification results: chromosomal classification of genes based on gene function and phenotypic information, combined with other adjustment coefficients such as gene length, genetic variation, experimental conditions and other relevant factors.
提供机构:
河北水熊基因科技有限公司
创建时间:
2025-01-01
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集是一个专注于人类基因染色体分类的大规模数据集,包含基因组、变异、表型信息和基因功能注释等多维度数据,旨在支持染色体分类、基因变异数据库构建和群体遗传学研究。数据集以Excel格式提供,涵盖样本ID、染色体带型、基因功能、变异点位等关键字段,适用于基因组学分析、疾病关联研究和个性化医疗应用。
以上内容由遇见数据集搜集并总结生成



