neuralbioinfo/segmentdb_balanced__s_actL2k_tsh5m__plasmid10
收藏Hugging Face2026-02-04 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/neuralbioinfo/segmentdb_balanced__s_actL2k_tsh5m__plasmid10
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: segmentdb
features:
- name: segment_id
dtype: int64
- name: sequence_id
dtype: int64
- name: host_taxa_genera
dtype: float64
- name: script_method
dtype: large_string
- name: o_fasta_id
dtype: large_string
- name: asm_acc
dtype: large_string
- name: y
dtype: int64
- name: category
dtype: large_string
- name: labels
dtype: int64
- name: filtered_by_virus
dtype: bool
- name: seg_len
dtype: int64
- name: actL
dtype: float64
- name: start
dtype: int64
- name: end
dtype: float64
- name: strand
dtype: large_string
- name: segment_category
dtype: large_string
- name: segment
dtype: large_string
splits:
- name: train
num_bytes: 6297106869
num_examples: 2933836
download_size: 2835317534
dataset_size: 6297106869
- config_name: segments2host
features:
- name: segment_id
dtype: int64
- name: host_taxa_id
dtype: int64
splits:
- name: train
num_bytes: 69050832
num_examples: 4315677
download_size: 21640380
dataset_size: 69050832
configs:
- config_name: segmentdb
data_files:
- split: train
path: segmentdb/train-*
- config_name: segments2host
data_files:
- split: train
path: segments2host/train-*
---
提供机构:
neuralbioinfo



