
imperialwarrior/open-australian-legal-qa-paraphrased-easy-gpt-with-emb

Hugging Face · Updated 2024-03-13 · Indexed 2024-06-22
Download link:
https://hf-mirror.com/datasets/imperialwarrior/open-australian-legal-qa-paraphrased-easy-gpt-with-emb
Resource description:
---
dataset_info:
  features:
  - name: pipeline_1_result
    dtype: string
  - name: pipeline_1_result_r_embeddings
    sequence: float64
  - name: pipeline_1_result_nr_embeddings
    sequence: float64
  - name: pipeline_2_context
    dtype: string
  - name: pipeline_2_result
    dtype: string
  - name: pipeline_2_result_r_embeddings
    sequence: float64
  - name: pipeline_2_result_nr_embeddings
    sequence: float64
  - name: pipeline_3_context
    dtype: string
  - name: pipeline_3_result
    dtype: string
  - name: pipeline_3_result_r_embeddings
    sequence: float64
  - name: pipeline_3_result_nr_embeddings
    sequence: float64
  - name: pipeline_4_context
    dtype: string
  - name: pipeline_4_result
    dtype: string
  - name: pipeline_4_result_r_embeddings
    sequence: float64
  - name: pipeline_4_result_nr_embeddings
    sequence: float64
  - name: pipeline_5_context
    dtype: string
  - name: pipeline_5_result
    dtype: string
  - name: pipeline_5_result_r_embeddings
    sequence: float64
  - name: pipeline_5_result_nr_embeddings
    sequence: float64
  - name: pipeline_6_context
    dtype: string
  - name: pipeline_6_result
    dtype: string
  - name: pipeline_6_result_r_embeddings
    sequence: float64
  - name: pipeline_6_result_nr_embeddings
    sequence: float64
  - name: pipeline_7_context
    dtype: string
  - name: pipeline_7_result
    dtype: string
  - name: pipeline_7_result_r_embeddings
    sequence: float64
  - name: pipeline_7_result_nr_embeddings
    sequence: float64
  - name: referenced_question
    dtype: string
  - name: answer
    dtype: string
  - name: answer_non_retrieval_embeddings
    dtype: string
  - name: answer_retrieval_embeddings
    dtype: string
  - name: question
    dtype: string
  - name: question_retrieval_embeddings
    dtype: string
  - name: question_non_retrieval_embeddings
    dtype: string
  - name: __index_level_0__
    dtype: float64
  - name: case_index
    dtype: float64
  - name: pipeline_6_case_indexes
    sequence: int64
  - name: pipeline_7_case_indexes
    sequence: int64
  splits:
  - name: train
    num_bytes: 137944644
    num_examples: 208
  download_size: 32779364
  dataset_size: 137944644
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---

This dataset contains the results of seven pipelines (pipeline_1 through pipeline_7), together with the context used by pipelines 2 through 7 and the associated questions and answers. Each pipeline result is stored with two embedding vectors (the _r_embeddings and _nr_embeddings fields), and the question and answer each come with their own retrieval and non-retrieval embedding fields. A referenced question and case indexes are also included. The dataset has a single train split with 208 examples.
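The feature names in the metadata follow a regular pattern, so the expected column list can be generated rather than typed out. The sketch below is one way to do that; the helper names `expected_columns` and `missing_columns` are hypothetical and not part of the dataset or of any library. It relies only on the names listed in the metadata (note that pipeline_1 has no context field).

```python
# Sketch: rebuild the expected column names from the naming pattern
# in the dataset metadata. Helper names are illustrative only.

def expected_columns():
    """Return the 38 column names listed in the dataset metadata."""
    cols = []
    for i in range(1, 8):
        if i > 1:
            # pipeline_1 has no context column in the metadata
            cols.append(f"pipeline_{i}_context")
        cols.append(f"pipeline_{i}_result")
        cols.append(f"pipeline_{i}_result_r_embeddings")
        cols.append(f"pipeline_{i}_result_nr_embeddings")
    cols += [
        "referenced_question",
        "answer",
        "answer_non_retrieval_embeddings",
        "answer_retrieval_embeddings",
        "question",
        "question_retrieval_embeddings",
        "question_non_retrieval_embeddings",
        "__index_level_0__",
        "case_index",
        "pipeline_6_case_indexes",
        "pipeline_7_case_indexes",
    ]
    return cols


def missing_columns(actual_columns):
    """Return expected names that are absent from actual_columns, sorted."""
    return sorted(set(expected_columns()) - set(actual_columns))
```

After loading the data with a tool of your choice, passing its column names to `missing_columns` gives a quick check that the copy you downloaded matches the published schema.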
Provider:
imperialwarrior
Summary of original information

Dataset overview

Features

The dataset contains the following features:

  • pipeline_1_result: string
  • pipeline_1_result_r_embeddings: sequence of float64
  • pipeline_1_result_nr_embeddings: sequence of float64
  • pipeline_2_context: string
  • pipeline_2_result: string
  • pipeline_2_result_r_embeddings: sequence of float64
  • pipeline_2_result_nr_embeddings: sequence of float64
  • pipeline_3_context: string
  • pipeline_3_result: string
  • pipeline_3_result_r_embeddings: sequence of float64
  • pipeline_3_result_nr_embeddings: sequence of float64
  • pipeline_4_context: string
  • pipeline_4_result: string
  • pipeline_4_result_r_embeddings: sequence of float64
  • pipeline_4_result_nr_embeddings: sequence of float64
  • pipeline_5_context: string
  • pipeline_5_result: string
  • pipeline_5_result_r_embeddings: sequence of float64
  • pipeline_5_result_nr_embeddings: sequence of float64
  • pipeline_6_context: string
  • pipeline_6_result: string
  • pipeline_6_result_r_embeddings: sequence of float64
  • pipeline_6_result_nr_embeddings: sequence of float64
  • pipeline_7_context: string
  • pipeline_7_result: string
  • pipeline_7_result_r_embeddings: sequence of float64
  • pipeline_7_result_nr_embeddings: sequence of float64
  • referenced_question: string
  • answer: string
  • answer_non_retrieval_embeddings: string
  • answer_retrieval_embeddings: string
  • question: string
  • question_retrieval_embeddings: string
  • question_non_retrieval_embeddings: string
  • __index_level_0__: float64
  • case_index: float64
  • pipeline_6_case_indexes: sequence of int64
  • pipeline_7_case_indexes: sequence of int64
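The question and answer embedding fields are typed as strings, while the pipeline result embeddings are float sequences, so comparing them requires a parsing step. The sketch below shows one possible approach. It rests on two assumptions that the source does not confirm: that the string fields hold a JSON-style list such as "[0.1, 0.2]", and that comparing a pipeline's _r_embeddings vector with answer_retrieval_embeddings by cosine similarity is meaningful. The function names are hypothetical.

```python
import json
import math


def parse_embedding(value):
    """Return a list of floats from a list or a JSON-style string.

    ASSUMPTION: string-typed embedding fields look like "[0.1, 0.2]".
    The actual serialization is not documented in the source.
    """
    if isinstance(value, str):
        value = json.loads(value)
    return [float(x) for x in value]


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_pipelines(row):
    """Rank pipelines 1..7 by similarity of their result to the answer.

    ASSUMPTION: the _r_embeddings vectors and answer_retrieval_embeddings
    live in the same embedding space.
    """
    answer = parse_embedding(row["answer_retrieval_embeddings"])
    scores = {}
    for i in range(1, 8):
        result = parse_embedding(row[f"pipeline_{i}_result_r_embeddings"])
        scores[i] = cosine_similarity(answer, result)
    # Highest similarity first
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
```

If the string fields turn out to use a different serialization, only `parse_embedding` needs to change; the similarity and ranking logic is independent of the storage format.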

Splits

  • train: 208 examples, 137944644 bytes

Dataset size

  • Download size: 32779364 bytes
  • Dataset size: 137944644 bytes
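The byte counts above are easier to reason about as derived quantities. The short sketch below uses only the three numbers listed in the metadata; the variable and function names are illustrative.

```python
# Figures taken from the dataset metadata
DOWNLOAD_SIZE_BYTES = 32_779_364
DATASET_SIZE_BYTES = 137_944_644
NUM_EXAMPLES = 208


def to_mib(num_bytes):
    """Convert a byte count to mebibytes (1 MiB = 1024 * 1024 bytes)."""
    return num_bytes / (1024 * 1024)


# How much larger the dataset is once materialized, relative to the download
expansion_ratio = DATASET_SIZE_BYTES / DOWNLOAD_SIZE_BYTES

# Average size of a single example
bytes_per_example = DATASET_SIZE_BYTES / NUM_EXAMPLES

print(f"download: {to_mib(DOWNLOAD_SIZE_BYTES):.1f} MiB")
print(f"dataset:  {to_mib(DATASET_SIZE_BYTES):.1f} MiB")
print(f"expansion ratio: {expansion_ratio:.2f}x")
print(f"average example: {bytes_per_example / 1024:.0f} KiB")
```

Each example averages several hundred kilobytes, which is consistent with every row carrying many float64 embedding sequences, so the whole split is best kept out of memory as plain Python lists when that can be avoided.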

Configuration

  • Config name: default
  • Data file path: data/train-*
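With a single default configuration and a single train split, loading the data can be sketched as follows. This assumes the Hugging Face `datasets` library is installed and that the repository is reachable; the wrapper name `load_train_split` is hypothetical. The import is kept inside the function so that the definition itself does not require the library.

```python
REPO_ID = "imperialwarrior/open-australian-legal-qa-paraphrased-easy-gpt-with-emb"
CONFIG_NAME = "default"
SPLIT = "train"


def load_train_split():
    """Download and return the train split.

    ASSUMPTION: the `datasets` package is installed and the repository
    is accessible from this machine.
    """
    from datasets import load_dataset  # imported lazily on purpose

    return load_dataset(REPO_ID, CONFIG_NAME, split=SPLIT)
```

Calling `load_train_split()` triggers the download on first use. Per the metadata, the returned split should report 208 rows.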