five

opsomerto/mini-protein-dataset

收藏
Hugging Face2026-04-05 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/opsomerto/mini-protein-dataset
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - en license: cc-by-4.0 tags: - protein - biology - uniprot - swiss-prot --- # Mini Protein Dataset A small subset of reviewed (Swiss-Prot) protein sequences from UniProt, created for the `minihf` toy project demonstrating a full HuggingFace workflow. ## Fields | Field | Type | Description | |---|---|---| | `id` | string | UniProt accession (e.g. `P12345`) | | `description` | string | Protein name from FASTA header | | `sequence` | string | Amino-acid sequence (single-letter codes) | | `length` | int | Sequence length in amino acids | ## Usage ```python from datasets import load_dataset ds = load_dataset("opsomerto/mini-protein-dataset") print(ds["train"][0]["sequence"]) ``` ## Source UniProt Swiss-Prot (reviewed), fetched via the UniProt REST API.
提供机构:
opsomerto
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作