five

The cacao gene atlas: A transcriptome developmental atlas reveals highly tissue-specific and dynamically-regulated gene networks in Theobroma cacao L

收藏
DataONE2024-09-23 更新2025-08-23 收录
下载链接:
https://search.dataone.org/view/sha256:b9896979ec93a6f6d3b76046d1dad96416fe16b6b7c96ebb446a443ca5bc51a4
下载链接
链接失效反馈
官方服务:
资源简介:
A large dataset of replicated transcriptomes was developed to accelerate Theobroma cocoa genomics research with the long-term goal of progressing breeding towards developing high-yielding elite varieties of cacao. RNAs were extracted and transcriptomes were sequenced from 123 different tissues and stages of development representing major organs and developmental stages of the cacao lifecycle. In addition, several experimental treatments and time courses were performed to measure gene expression in tissues responding to biotic and abiotic stressors. Samples were collected in replicates (3-5) to enable statistical analysis of gene expression levels for a total of 390 transcriptomes.  We describe the creation of the atlas,and its global characterization and define sets of genes co-regulated in highly organ- and temporally-specific manners. To promote wider use of these data, all raw sequencing data, expression read mapping matrices, scripts, and other information used to create the resourc..., RNA was extracted form about 400 different tissues/treatments and replicates. Transcriptome sequencing was performed by Quant Seq (Lexogen). Raw QuantSeq reads were first examined with FASTQC (v0.11.9 https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) to assess the overall data quality before processing. Reads were then processed using bbduk (BBMap tools v37.76; https://jgi.doe.gov/data-and-tools/software-tools/bbtools/bb-tools-user-guide/bbduk-guide/) to trim the adapter sequences, poly-A tails, and low-quality bases and to discard fragments less than 20 bp in length after trimming. Trimmed reads were mapped to the CCN-51 and SCA6 Theobroma cacao genotype reference genomes using the STAR Aligner version 2.7.5b (Dobin et al. 2013). Expression quantification was performed with featureCounts from the Subread package version 2.0.1 (Liao et al. 2013) in a fractional read-counting mode to prop distribute muti-mapping reads among features using gene annotation GFF3 files modified wit..., Excel or any text editor or spreadsheet program., # The cacao gene atlas: A transcriptome developmental atlas reveals highly tissue-specific and dynamically-regulated gene networks in Theobroma cacao L ## Description of the Data and file structure 1. The first row lists all tissues, replicates and time points for each sample. The first column lists each cacao gene that was detected. All other cells contain the number of transcripts that were mapped for each gene/sample combination. 2. CPM counts are normalized by counts per million, they are used on the BAR website 3. To compare values in the gene expression matrix with the BAR website, be sure to use the CPM read counts 4. Fractional reads are unnormalized raw reads to be used for downsteam analysis such as DESeq2, do not compare these counts with the counts on BAR 5. Genotype of the tissue is indicated in the metadata, the data was mapped to multiple genomes which may differ from the genotype of the tissue 6. Ex: CCN51 tissues were mapped to the CCN51 genome AND SCA6 genome 7. The ...
创建时间:
2025-08-05
二维码
社区交流群
二维码
科研交流群
商业服务