five

Dataset. FASTA file containing filtered and clustered NLR baits

收藏
Figshare2024-11-08 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Sweetpotato_NLR_baits_designed_for_RenSeq_experiment_N_38_694_baits_/25303204
下载链接
链接失效反馈
官方服务:
资源简介:
The file baits-moderatre-RM25pc-noPT.fas.clust-83-100 contains filtered baits that have been clustered after applying moderate filtering criteria, ensuring they have ≤25% repeat masking and exclude matches to plastid genomes. These baits were further collapsed by removing sequences that were 100% identical over 83% of their length. The resulting file contains a reduced set of 38,694 baits, effectively capturing the targeted regions while minimizing redundancy.

文件baits-moderatre-RM25pc-noPT.fas.clust-83-100 包含经筛选的诱饵序列:该数据集先通过中等强度筛选标准完成预处理并完成聚类,确保序列的重复序列屏蔽(repeat masking)比例不超过25%,且不匹配任何质体基因组(plastid genomes)。随后通过移除在83%及以上序列区域内完全一致的片段,对该批诱饵序列进行进一步冗余压缩。最终生成的文件共包含38694条诱饵序列,可在有效覆盖目标区域的同时最大限度降低序列冗余度。
创建时间:
2024-11-08
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作