
Alignment of the Serine Protease family

Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://zenodo.org/record/14599585
Description:
Dataset containing the files for the alignment of the Serine Protease protein family.

Files:
- serprot_ref.faa: manually curated alignment of 1,390 members of the Serine Protease family for the Halabi Sector paper.
- serprot.hmm: HMM profile built with hmmbuild from the serprot_ref.faa alignment.
- serprot_matches.stk: Stockholm file containing the output of hmmsearch on the UniProt database using the serprot.hmm profile.
- new_aln0.faa: FASTA file with the MSA after conversion from Stockholm using the stk2fasta.pl script.
- new_aln.faa: FASTA file with the MSA after running the code of the 00_alignment_cleaning.ipynb notebook.
- iter_aln.faa: FASTA file with the MSA after running the code of the 01_compact_alignment.ipynb notebook.
- iter_aln_dedup.faa: FASTA file with the MSA without duplicate sequences.

| MSA file | MSA description | L (number of positions) | M (number of sequences) |
| --- | --- | --- | --- |
| serprot_ref.faa | Reference alignment of Serine Proteases, manually curated for the Halabi Sector paper. | 823 | 1,390 |
| new_aln0.faa | Alignment coming from the hmmsearch using the HMM profile made with serprot_ref.faa. | 823 | 189,304 |
| new_aln.faa | Alignment coming from the 00_alignment_cleaning.ipynb notebook. | 693 | 101,543 |
| iter_aln.faa | Alignment coming from the 01_compact_alignment.ipynb notebook. | 260 | 101,543 |
| iter_aln_dedup.faa | Alignment coming from the deduplication of iter_aln.faa. | 260 | 82,021 |
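The L and M columns, and the deduplication step, can be illustrated with a minimal Python sketch. It is not the code from the dataset's notebooks: the function names `read_fasta`, `msa_shape`, and `dedup_exact` are hypothetical, and it assumes that "duplicate" means two rows with exactly the same aligned sequence, which the description does not state. The toy alignment is invented for illustration only.

```python
from typing import Iterable, Iterator, List, Tuple

Record = Tuple[str, str]  # (header without '>', aligned sequence)


def read_fasta(lines: Iterable[str]) -> Iterator[Record]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header = None
    chunks: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            # Sequences may be wrapped over several lines.
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def msa_shape(records: List[Record]) -> Tuple[int, int]:
    """Return (L, M): number of aligned positions and number of sequences."""
    lengths = {len(seq) for _, seq in records}
    if len(lengths) > 1:
        raise ValueError(f"rows have different lengths: {sorted(lengths)}")
    n_positions = lengths.pop() if lengths else 0
    return n_positions, len(records)


def dedup_exact(records: List[Record]) -> List[Record]:
    """Keep the first occurrence of each distinct aligned sequence.

    Assumption: duplicates are rows that are character-for-character identical.
    """
    seen = set()
    unique: List[Record] = []
    for header, seq in records:
        if seq not in seen:
            seen.add(seq)
            unique.append((header, seq))
    return unique


# Toy alignment (invented): three rows, two of them identical.
toy = """>a
AC-D
>b
AC-D
>c
ACED
""".splitlines()

records = list(read_fasta(toy))
print(msa_shape(records))               # (4, 3)
print(msa_shape(dedup_exact(records)))  # (4, 2)
```

To apply the same functions to a file from the dataset, pass an open file handle, for example `records = list(read_fasta(open("iter_aln.faa")))`, and compare the result of `msa_shape` with the L and M values listed in the table.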
Created:
2025-03-26