hyperdemocracy/usc-vecs-v1-s1024-o256-BAAI-bge-large-en-v1.5
收藏Hugging Face2024-02-24 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/hyperdemocracy/usc-vecs-v1-s1024-o256-BAAI-bge-large-en-v1.5
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- path: data/usc-113-vecs-v1-s1024-o256-BAAI-bge-large-en-v1.5.parquet
split: '113'
- path: data/usc-114-vecs-v1-s1024-o256-BAAI-bge-large-en-v1.5.parquet
split: '114'
- path: data/usc-115-vecs-v1-s1024-o256-BAAI-bge-large-en-v1.5.parquet
split: '115'
- path: data/usc-116-vecs-v1-s1024-o256-BAAI-bge-large-en-v1.5.parquet
split: '116'
- path: data/usc-117-vecs-v1-s1024-o256-BAAI-bge-large-en-v1.5.parquet
split: '117'
- path: data/usc-118-vecs-v1-s1024-o256-BAAI-bge-large-en-v1.5.parquet
split: '118'
dataset_info:
features:
- dtype: string
name: chunk_id
- dtype: string
name: text_id
- dtype: string
name: legis_id
- dtype: string
name: text
- list:
dtype: float32
name: vec
- name: metadata
struct:
- dtype: string
name: chunk_id
- dtype: int32
name: chunk_index
- dtype: string
name: congress_num
- dtype: string
name: legis_class
- dtype: string
name: legis_id
- dtype: int32
name: legis_num
- dtype: string
name: legis_type
- dtype: string
name: legis_version
- dtype: int32
name: start_index
- dtype: string
name: text_date
- dtype: string
name: text_id
---
提供机构:
hyperdemocracy
原始信息汇总
数据集概述
数据文件配置
- 默认配置 (
default)- 数据文件路径及分割信息:
data/usc-113-vecs-v1-s1024-o256-BAAI-bge-large-en-v1.5.parquet,分割:113data/usc-114-vecs-v1-s1024-o256-BAAI-bge-large-en-v1.5.parquet,分割:114data/usc-115-vecs-v1-s1024-o256-BAAI-bge-large-en-v1.5.parquet,分割:115data/usc-116-vecs-v1-s1024-o256-BAAI-bge-large-en-v1.5.parquet,分割:116data/usc-117-vecs-v1-s1024-o256-BAAI-bge-large-en-v1.5.parquet,分割:117data/usc-118-vecs-v1-s1024-o256-BAAI-bge-large-en-v1.5.parquet,分割:118
- 数据文件路径及分割信息:
数据集特征信息
- 特征列表
chunk_id:字符串类型text_id:字符串类型legis_id:字符串类型text:字符串类型vec:浮点数列表类型metadata:结构类型chunk_id:字符串类型chunk_index:整数类型congress_num:字符串类型legis_class:字符串类型legis_id:字符串类型legis_num:整数类型legis_type:字符串类型legis_version:字符串类型start_index:整数类型text_date:字符串类型text_id:字符串类型



