AI4Protein/ssp_q8
收藏Hugging Face2025-11-21 更新2026-01-03 收录
下载链接:
https://hf-mirror.com/datasets/AI4Protein/ssp_q8
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
task_categories:
- token-classification
tags:
- chemistry
- biology
---
Dataset Summary
The study of a protein’s secondary structure (Sec. Struc. P.) forms a fundamental cornerstone in understanding its biological function. This secondary structure, comprising helices, strands, and various turns, bestows the protein with a specific three-dimensional configuration, which is critical for the formation of its tertiary structure. In the context of this work, a given protein sequence is classified into three distinct categories, each representing a different structural element: H - Alpha-helix, G - 3-10 helix, I - Pi helix, E - Beta-strand, B - Beta-bridge, T - Turn, S - Bend, C - Coil (or random coil).
Data Fields
seq: a string containing the protein sequence
label: a sequence containing the structural label of each residue.
Original Dataset Name: biomap-research/ssp_q8
Original Author / Organization: Biomap
Original URL: https://huggingface.co/datasets/biomap-research/ssp_q8
Original License: Apache License 2.0
No changes were made to the data except for the column name. All credit and rights belong to the original authors.
提供机构:
AI4Protein



