ASMC: sequence and result files
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/13483302
下载链接
链接失效反馈官方服务:
资源简介:
This repository contains the data specified in the application note entitled "ASMC: investigating the amino acid diversity of enzyme active sites". It includes:
1. *.fa.gz - The starting sequence sets in FASTA format.
2. *_removed_sequences.txt.gz - The list of sequences discarded due to important gaps.
3. *_groups_X_min_Y.tsv.gz - The ASMC results in TSV format. The columns must be read as follows: "Protein ID | Active Site sequences | ASMC group". The letters X and Y correspond to the DBSCAN parameters defined with the 'auto' option (eps and min_samples, respectively).
创建时间:
2024-12-31



