Genotyping analysis of over 130,000 CIMMYT bread wheat breeding lines: A decade-long effort in optimizing wheat genotyping
收藏DataONE2026-01-19 更新2026-01-24 收录
下载链接:
https://search.dataone.org/view/sha256:50d87f755847198c1a16bfb5cceefc831814fac4fcc690a32a6c9f2aa9addafb
下载链接
链接失效反馈官方服务:
资源简介:
A total of 130,247 bread wheat breeding lines from the year 2013-2023 developed by the International Maize and Wheat Improvement Center (CIMMYT) were genotyped. We used genotyping-by-sequencing (GBS) to construct 636 GBS libraries and sequenced them in the Illumina platform to generate FASTQ files. The key file consists of metadata such as sample name, flowcell, lane number, and barcode used for multiplexing samples. The FASTQ file of corresponding samples can be identified based on the library. The raw reads are available at the National Center for Biotechnology Information (NCBI) with BioProject accessions PRJNA498085 (2013 â 2020 data), PRJNA901877 (2021), PRJNA901925 and PRJNA901462 (2022) and PRJNA1044425 (2023).
, , , # Data from: Genotyping analysis of over 130,000 CIMMYT bread wheat breeding lines: A decade-long effort in optimizing wheat genotyping
[https://doi.org/10.5061/dryad.37pvmcvjq](https://doi.org/10.5061/dryad.37pvmcvjq)
## Description of the data and file structure:
1\. This file consists of the name of FASTQ files generated after sequencing respective GBS libraries and also the SRA accession of the files. The Tassel GBS pipeline requires specific naming of FASTQ files for the analysis. So, the downloaded FASTQ files from NCBI can be renamed to the respective file names as listed in column 'file_name_for_Tassel'.
`SRA_fastq_files_CIMMYT_bread_wheat_breeding_lines_2013-2023.xlsx`
2\. This file has information about the samples. The Tassel GBS pipeline uses the first four columns to identify sequencing reads present in the FASTQ files.
`key_file_of_CIMMYT_bread_wheat_breeding_lines_from_years_2013-2023.xlsx`
3\. The steps below shows on how to extract a subset of samples for the ana...,
创建时间:
2026-01-20



