Prediction of multi-drug resistance transporters dataset
收藏f1000.figshare.com2023-05-31 更新2025-03-25 收录
下载链接:
https://f1000.figshare.com/articles/dataset/Prediction_of_multi_drug_resistance_transporters_dataset/1415804/1
下载链接
链接失效反馈官方服务:
资源简介:
Data file 1Title: Data File PROSITE_positives_PS000125.fasta.Legend: Sequence file in FASTA format of all positive examples for the ser/thr phosphatase model.
Data file 2Data File PROSITE_negatives_PS000125.fasta.Sequence file in FASTA format of all randomly selected negative examples for the ser/thr phosphatase model."
Data file 3Data File PROSITE_positives_PS00028.fasta.Sequence file in FASTA format of all positive examples for the zinc finger model.
Data file 4Data File PROSITE_negatives_PS00028.fasta.Sequence file in FASTA format of all randomly selected negative examples for the zinc finger model.
Data file 5Data File PROSITE_PS00125.txt.PROSITE record used for the ser/thr phosphatase model.
Data file 6Data File PROSITE_PS00028.txt.PROSITE record used for the zinc finger model.
Data file 7Data File MDR_TCDB_positives.fasta.Sequence file of MDR transporters used for training. FASTA format file of positive examples used in this study derived from the TCDB.
Data file 8Data File MDR_TCDB_negatives.fasta.Sequence file of non-MDR transporters used for training. FASTA format file of negative examples used in this study derived from the TCDB.
Data file 9Data File PILGram_PATTERNS_PS00125.txt.Regular expression generated by PILGram for the ser/thr phosphatase model.
Data file 10Data File PS00125_alignments.out.Sequence alignments of PILGram model matches to the positive examples in the ser/thr phosphatase model.
Data file 11Data File PILGram_PATTERNS_PS00028.txt.Regular expressions generated by PILGram for the zinc finger model.
Data file 12Data File PS00028_alignments.out.Sequence alignments of PILGram model matches to the positive examples in the zinc finger model and a summary score line that represents the overlap of the 10 different models for each sequence.
Data file 13Data File PILGram_PATTERNS_MDRpred.txt.The 36 regular expressions and associated physiochemical properties (where applicable) generated by PILGram for the MDR model .
Data file 14Data File MDRpred_alignments.out.Alignments of 36 PILGram model matches on the MDR positive example sequences.
Data file 15Data File Pfam_transporters.txt.A list of Pfam families that were used to identify transporters in the Hot Lake metagenome.
Data file 16Data File HotLake_MDRpred_predictions.fasta.A FASTA format file of 63 protein sequences from the Hot Lake metagenome that are matched by 30 or more MDRpred individual models (high confidence predictions), match Pfam families for transporters (Pfam e-value less than 1e-20), but are not identified by Pfam as multidrug resistance transporters.
数据文件 1:标题:PROSITE_positives_PS000125.fasta。注释:包含针对丝/苏氨酸磷酸酶模型的全部正例序列的FASTA格式序列文件。
数据文件 2:数据文件 PROSITE_negatives_PS000125.fasta。序列文件,采用FASTA格式,包含针对丝/苏氨酸磷酸酶模型的全部随机选择的负例序列。
数据文件 3:数据文件 PROSITE_positives_PS00028.fasta。序列文件,采用FASTA格式,包含针对锌指模型的全部正例序列。
数据文件 4:数据文件 PROSITE_negatives_PS00028.fasta。序列文件,采用FASTA格式,包含针对锌指模型的全部随机选择的负例序列。
数据文件 5:数据文件 PROSITE_PS00125.txt。用于丝/苏氨酸磷酸酶模型的PROSITE记录。
数据文件 6:数据文件 PROSITE_PS00028.txt。用于锌指模型的PROSITE记录。
数据文件 7:数据文件 MDR_TCDB_positives.fasta。用于训练的MDR转运蛋白序列文件。本研究中使用的FASTA格式正例序列文件,源自TCDB。
数据文件 8:数据文件 MDR_TCDB_negatives.fasta。用于训练的非MDR转运蛋白序列文件。本研究中使用的FASTA格式负例序列文件,源自TCDB。
数据文件 9:数据文件 PILGram_PATTERNS_PS00125.txt。针对丝/苏氨酸磷酸酶模型由PILGram生成的正则表达式。
数据文件 10:数据文件 PS00125_alignments.out。PILGram模型匹配到丝/苏氨酸磷酸酶模型正例序列的序列比对。
数据文件 11:数据文件 PILGram_PATTERNS_PS00028.txt。针对锌指模型由PILGram生成的正则表达式。
数据文件 12:数据文件 PS00028_alignments.out。针对锌指模型,PILGram模型匹配到正例序列的序列比对,以及表示每个序列10种不同模型重叠的汇总评分行。
数据文件 13:数据文件 PILGram_PATTERNS_MDRpred.txt。由PILGram为MDR模型生成的36个正则表达式及其相关的物理化学性质(如适用)。
数据文件 14:数据文件 MDRpred_alignments.out。36个PILGram模型匹配在MDR正例序列上的比对。
数据文件 15:数据文件 Pfam_transporters.txt。用于在Hot Lake宏基因组中识别转运蛋白的Pfam家族列表。
数据文件 16:数据文件 HotLake_MDRpred_predictions.fasta。63个蛋白序列的FASTA格式文件,来自Hot Lake宏基因组,这些序列被30个或更多MDRpred单个模型匹配(高置信度预测),与转运蛋白的Pfam家族匹配(Pfam e-value小于1e-20),但未被Pfam识别为多药耐药转运蛋白。
提供机构:
f1000.figshare.com



