
Inter-individual gene expression variability implies stable regulation of brain-biased genes across organs

Indexed by NIAID Data Ecosystem on 2026-05-02
Download link:
https://zenodo.org/record/14063786
Resource description:
Abstract

Phenotypic variation among individuals plays a key role in evolution, since variation provides the material on which natural selection can act. One important link between genetic and phenotypic variation is gene expression. As for other phenotypes, the range of accessible expression variation is limited and biased by different evolutionary and developmental constraints. Gene expression variability broadly refers to the tendency of a gene to vary in expression (i.e., between individuals or cells) due to stochastic fluctuations or differences in genetic, epigenetic, or environmental factors, separately from the differences between e.g. organs. Variability due to biomolecular stochasticity (transcriptional 'noise') and cell-to-cell heterogeneity has been well-studied in isogenic populations of unicellular organisms such as bacteria and yeasts. However, for more complex organisms with multiple cells, tissues, and organs sharing the same genetic background, the interplay between inter-individual expression variability, gene and organ function, and gene regulation remains an open question. In this study, we used highly multiplexed 3'-end Bulk RNA Barcoding and sequencing (BRB-seq) to generate transcriptome profiles spanning at least nine organs in outbred individuals of three ray-finned fish species: zebrafish, Northern pike, and spotted gar. For each condition, we measured expression variation per gene independent of mean expression level. We observed that lowly variable genes are enriched in cellular housekeeping functions whereas highly variable genes are enriched in stimulus-response functions. Furthermore, genes with highly variable expression between individuals evolve under weaker purifying selection at the coding sequence level, indicating that intra-species gene expression variability predicts inter-species protein sequence divergence.
Genes that are broadly expressed across organs tend to be both highly expressed and lowly variable between individuals, whereas organ-biased genes are typically highly variable within their top organ of expression. For genes with organ-biased expression profiles, we inferred differences in selective pressure on gene regulation depending on their top organ. We found that genes with peak expression in the brain have low inter-individual expression variability across non-nervous organs, suggesting stabilizing selection on regulatory evolution of brain-biased genes. Conversely, liver-biased genes have highly variable expression across organs, implying weaker regulatory constraints. These patterns show that gene regulatory mechanisms evolved differently based on constraints on the primary organ.

Directory Structure

config/: Contains YAML file indicating package versions for conda environment
data/: Contains input data
counts/: Contains counts and UMI-deduplicated counts. Currently under embargo and will be made available upon acceptance for publication.
gene_metadata/: Contains gene biotype information from Ensembl
sample_metadata/: Contains sample metadata files for each species
selectome/: Contains selection statistics from the Selectome database
results/: Contains output files sorted by subfolders labeled after each step of the analysis pipeline. Only R notebook HTML files are available on the Git repository, please check Zenodo for R data files.
run_pipeline.Rdata: Contains all parameters used for each step of the analysis pipeline
workflow/: Contains scripts used for the analysis pipeline
analysis/: Contains all steps of the analysis pipeline, available as .Rmd files
functions/: Contains all functions used for analysis/
renv/: Used for package management in R
run_pipeline.R: Runs all the steps under analysis/
run_go_figure.sh: Runs GO-Figure! 1.0.0 (downloaded separately)
demultiplex_brbseq_fastq.sh: Used for demultiplexing BRB-seq fastq files using BRB-seqTools 1.6.1 (downloaded separately) for uploading to NCBI SRA
rename_fastq_files.sh: Used for renaming demultiplexed fastq files by mapping each barcode to their corresponding sample name
renv.lock: Lockfile for managing R package versions. Run renv::restore() to set up the R environment based on packages specified in the lockfile. All package versions used are also specified in the output HTML files under results/.

Species Codes

LOC: Lepisosteus oculatus (spotted gar)
ELU: Esox lucius (Northern pike)
DRE: Danio rerio (zebrafish)
Created: 2024-12-13