amirka20/peptag
收藏Hugging Face2026-03-25 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/amirka20/peptag
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
configs:
- config_name: peptag
data_files:
- split: train
path: peptag/train-*
- split: val
path: peptag/val-*
- split: test
path: peptag/test-*
- split: trainval
path: peptag/trainval-*
- config_name: stereo_pairs
data_files:
- split: stereo_pairs
path: stereo_pairs/stereo_pairs-*
- config_name: substitution_pairs
data_files:
- split: substitution_pairs
path: substitution_pairs/substitution_pairs-*
- config_name: tag_pairs
data_files:
- split: tag_pairs
path: tag_pairs/tag_pairs-*
dataset_info:
- config_name: peptag
features:
- name: Peptide
dtype: string
- name: B
dtype: float64
- name: M
dtype: float64
- name: Z
dtype: int64
- name: Length
dtype: int64
- name: SMILES
dtype: string
splits:
- name: train
num_bytes: 11965676
num_examples: 41455
- name: val
num_bytes: 1331730
num_examples: 4607
- name: test
num_bytes: 866767
num_examples: 2726
- name: trainval
num_bytes: 13297406
num_examples: 46062
download_size: 4249559
dataset_size: 27461579
- config_name: stereo_pairs
features:
- name: Sequence_f
dtype: string
- name: Sequence_F
dtype: string
- name: SMILES_f
dtype: string
- name: SMILES_F
dtype: string
- name: B_f
dtype: float64
- name: B_F
dtype: float64
- name: delta_B
dtype: float64
splits:
- name: stereo_pairs
num_bytes: 332784
num_examples: 542
download_size: 51973
dataset_size: 332784
- config_name: substitution_pairs
features:
- name: Sequence_1
dtype: string
- name: Sequence_2
dtype: string
- name: Position
dtype: int64
- name: Residue_1
dtype: string
- name: Residue_2
dtype: string
- name: SMILES_1
dtype: string
- name: SMILES_2
dtype: string
- name: B_1
dtype: float64
- name: B_2
dtype: float64
- name: delta_B
dtype: float64
splits:
- name: substitution_pairs
num_bytes: 4290171
num_examples: 6942
download_size: 318447
dataset_size: 4290171
- config_name: tag_pairs
features:
- name: Sequence_untagged
dtype: string
- name: Sequence_tagged
dtype: string
- name: Tag
dtype: string
- name: SMILES_untagged
dtype: string
- name: SMILES_tagged
dtype: string
- name: B_untagged
dtype: float64
- name: B_tagged
dtype: float64
- name: delta_B
dtype: float64
splits:
- name: tag_pairs
num_bytes: 911044
num_examples: 1549
download_size: 115469
dataset_size: 911044
---
提供机构:
amirka20



