protein_localization_dataset
收藏IEEE2026-04-17 收录
下载链接:
https://ieee-dataport.org/documents/proteinlocalizationdataset
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains protein sequences and precomputed ProtT5 embeddings used for subcellular localization prediction as presented in the paper: \Sequence to Location: Protein Subcellular Localization Driven by Deep Pretrained Language Model.\ It includes training, validation, and test sets in FASTA format, along with corresponding ProtT5 embeddings stored in HDF5 (.h5) files. This dataset supports reproducibility and enables researchers to replicate or extend the experiments in the paper.
提供机构:
Chen Song



