five

Customized R scripts and a data table for recombining genes assignments.

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://figshare.com/articles/dataset/Customized_R_scripts_and_a_data_table_for_recombining_genes_assignments_/23618667
下载链接
链接失效反馈
官方服务:
资源简介:
This repository contains files that complement the work by Pfeifer and Rocha on the recombination and conversion of phages, plasmids and phage-plasmids (preprint is here available: https://doi.org/10.1101/2023.08.08.552325). Following files are listed: A table (.zip) with information on the assignments of recombining genes between different types of mobile genetic elements. Recombining genes were defined as genes between dissimilar elements (gene repertoire relatedness, wGRR < 0.1), but with high gene similarity (>80% identity, >80% sequence coverage, and no more than 25 highly related genes between two elements). The table includes recombining genes (assignments: "From-To") among 3585 phages, 20274 plasmids, and 1416 phage-plasmids. The IDs of genes, proteins, and genomes were sourced from the NCBI database. Protein percentage identity (pident), E-value, and bitscore were computed using MMseqs2 (see Methods), while alignment fractions (sequence coverage) were determined by dividing alignment lengths by sequence lengths (coverage_from, coverage_to). Customized R scripts are provided for computing wGRR (which may be memory-intensive), gene clustering using single linkage, quantification of gene flow, and enrichment tests (including Fisher tests).
创建时间:
2024-01-14
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作