TUDB-Labs/medical-qa

Name: TUDB-Labs/medical-qa
Creator: TUDB-Labs
Published: 2024-05-27 03:32:34
License: apache-2.0

Source: Hugging Face. Updated 2024-05-27. Indexed 2024-06-12.

Download link:

https://hf-mirror.com/datasets/TUDB-Labs/medical-qa

Resource description:

---
language:
- en
license: apache-2.0
size_categories:
- 1K<n<10K
task_categories:
- question-answering
dataset_info:
  features:
  - name: properties
    struct:
    - name: doi
      dtype: string
    - name: subject
      dtype: string
  - name: long_context
    dtype: string
  - name: question
    dtype: string
  - name: evaluation_35
    dtype: string
  - name: raw_context
    struct:
    - name: abstract
      dtype: string
    - name: introduction
      dtype: string
    - name: methods
      dtype: string
    - name: title
      dtype: string
  - name: short_answer
    dtype: string
  - name: short_context
    dtype: string
  - name: answer_format
    struct:
    - name: labels
      sequence: string
    - name: type
      dtype: string
  - name: raw_reason
    dtype: string
  - name: raw_answer
    dtype: string
  splits:
  - name: train
    num_bytes: 64281926.11785095
    num_examples: 6231
  - name: test
    num_bytes: 7149313.882149047
    num_examples: 693
  download_size: 36402332
  dataset_size: 71431240.0
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: test
    path: data/test-*
tags:
- medical
---

# Dataset Card for Dataset Name

This dataset card aims to be a base template for new datasets. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/datasetcard_template.md?plain=1).

## Dataset Details

### Dataset Description

- **Curated by:** [More Information Needed]
- **Funded by [optional]:** [More Information Needed]
- **Shared by [optional]:** [More Information Needed]
- **Language(s) (NLP):** [More Information Needed]
- **License:** [More Information Needed]

### Dataset Sources [optional]

- **Repository:** [More Information Needed]
- **Paper [optional]:** [More Information Needed]
- **Demo [optional]:** [More Information Needed]

## Uses

### Direct Use

[More Information Needed]

### Out-of-Scope Use

[More Information Needed]

## Dataset Structure

[More Information Needed]

## Dataset Creation

### Curation Rationale

[More Information Needed]

### Source Data

#### Data Collection and Processing

[More Information Needed]

#### Who are the source data producers?

[More Information Needed]

### Annotations [optional]

#### Annotation process

[More Information Needed]

#### Who are the annotators?

[More Information Needed]

#### Personal and Sensitive Information

[More Information Needed]

## Bias, Risks, and Limitations

[More Information Needed]

### Recommendations

Users should be made aware of the risks, biases and limitations of the dataset. More information needed for further recommendations.

## Citation [optional]

**BibTeX:**

[More Information Needed]

**APA:**

[More Information Needed]

## Glossary [optional]

[More Information Needed]

## More Information [optional]

[More Information Needed]

## Dataset Card Authors [optional]

[More Information Needed]

## Dataset Card Contact

[More Information Needed]

Provided by:

TUDB-Labs

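Summary of original information

Dataset overview

Basic information

Language: English
License: Apache-2.0
Size category: 1K<n<10K
Task category: Question answering

Dataset features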

Properties (properties):
- doi: string
- subject: string
Long context (long_context): string
Question (question): string
Evaluation metric (evaluation_35): string
Raw context (raw_context):
- abstract: string
- introduction: string
- methods: string
- title: string
Short answer (short_answer): string
Short context (short_context): string
Answer format (answer_format):
- labels: sequence of strings
- type: string
Raw reasoning (raw_reason): string
Raw answer (raw_answer): string
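
The feature list above can be read as a nested record type. The following is a minimal sketch, assuming each record arrives as a plain Python dict. The field names come from the dataset metadata; `EXPECTED_SCHEMA`, `validate_record`, and the placeholder values in `sample` are hypothetical and made up for illustration, not taken from the dataset.

```python
# Sketch: check that a record carries the fields listed in the metadata.
# Only the field names are from the source; the rest is an assumption.

EXPECTED_SCHEMA = {
    "properties": {"doi": str, "subject": str},
    "long_context": str,
    "question": str,
    "evaluation_35": str,
    "raw_context": {
        "abstract": str,
        "introduction": str,
        "methods": str,
        "title": str,
    },
    "short_answer": str,
    "short_context": str,
    "answer_format": {"labels": list, "type": str},
    "raw_reason": str,
    "raw_answer": str,
}


def validate_record(record, schema=EXPECTED_SCHEMA, prefix=""):
    """Return a list of problems; an empty list means the record matches."""
    problems = []
    for name, expected in schema.items():
        path = prefix + name
        if name not in record:
            problems.append(f"missing field: {path}")
        elif isinstance(expected, dict):
            # Nested struct: recurse with a dotted path prefix.
            if isinstance(record[name], dict):
                problems.extend(validate_record(record[name], expected, path + "."))
            else:
                problems.append(f"{path}: expected a struct")
        elif not isinstance(record[name], expected):
            problems.append(f"{path}: expected {expected.__name__}")
    return problems


# Made-up placeholder record, NOT real dataset content.
sample = {
    "properties": {"doi": "placeholder-doi", "subject": "placeholder-subject"},
    "long_context": "placeholder",
    "question": "placeholder",
    "evaluation_35": "placeholder",
    "raw_context": {
        "abstract": "placeholder",
        "introduction": "placeholder",
        "methods": "placeholder",
        "title": "placeholder",
    },
    "short_answer": "placeholder",
    "short_context": "placeholder",
    "answer_format": {"labels": ["placeholder"], "type": "placeholder"},
    "raw_reason": "placeholder",
    "raw_answer": "placeholder",
}

print(validate_record(sample))
```

Real records may contain null values in some fields; this sketch would report those as type mismatches, so loosen the checks if that turns out to be the case.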

Dataset splits

Training set (train):
- Bytes: 64281926.11785095
- Examples: 6231
Test set (test):
- Bytes: 7149313.882149047
- Examples: 693
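
As a quick sanity check on the split counts above, the arithmetic below totals the examples and computes the share held out for testing. The variable names are illustrative; only the two counts come from the metadata.

```python
# Example counts copied from the split metadata.
TRAIN_EXAMPLES = 6231
TEST_EXAMPLES = 693

total_examples = TRAIN_EXAMPLES + TEST_EXAMPLES
test_fraction = TEST_EXAMPLES / total_examples

print(total_examples)            # total number of examples across both splits
print(f"{test_fraction:.4f}")    # share of examples in the test split
```

The total falls inside the declared 1K<n<10K size category, and the test split holds roughly one tenth of the examples.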

Download and dataset size

Download size: 36402332 bytes
Dataset size: 71431240.0 bytes
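
The size figures can be cross-checked against the per-split byte counts. The sketch below is only arithmetic on numbers from the metadata. The observation that the fractional split sizes look like a proportional share of the total by example count is an inference from the numbers, not something the source states.

```python
# Figures copied from the metadata (bytes and example counts).
TRAIN_BYTES = 64281926.11785095
TEST_BYTES = 7149313.882149047
DATASET_SIZE = 71431240.0
DOWNLOAD_SIZE = 36402332
TRAIN_EXAMPLES, TEST_EXAMPLES = 6231, 693

# The two split sizes should add up to the reported dataset size.
split_total = TRAIN_BYTES + TEST_BYTES

# Ratio of the download size to the in-memory dataset size.
download_ratio = DOWNLOAD_SIZE / DATASET_SIZE

# Assumption being tested: train bytes = total bytes * train share of examples.
proportional_train = DATASET_SIZE * TRAIN_EXAMPLES / (TRAIN_EXAMPLES + TEST_EXAMPLES)

print(f"{split_total:.1f}")
print(f"{download_ratio:.2f}")
print(f"{proportional_train - TRAIN_BYTES:.3f}")
```

The download is about half the size of the loaded dataset, which is typical of compressed storage formats, though the source does not name the format.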

Configuration

Default configuration (default):
- Training data path: data/train-*
- Test data path: data/test-*
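
The path patterns above map files in the repository to splits. The sketch below illustrates that mapping with Python's `fnmatch`, which is an approximation: the Hub's own pattern resolution may differ in detail. The shard file names are hypothetical, since the source only gives the `data/train-*` and `data/test-*` patterns. `load_splits` shows one way to fetch the data with the third-party `datasets` library; it needs network access and is defined but not called here.

```python
from fnmatch import fnmatch

# Split-to-pattern mapping copied from the default configuration.
DATA_FILES = {"train": "data/train-*", "test": "data/test-*"}


def split_for(path):
    """Return the split whose pattern matches path, or None if none does."""
    for split, pattern in DATA_FILES.items():
        if fnmatch(path, pattern):
            return split
    return None


def load_splits():
    """Load the dataset from the Hub (requires network and `datasets`)."""
    from datasets import load_dataset  # third-party, imported lazily

    return load_dataset("TUDB-Labs/medical-qa")


# Hypothetical file names, used only to exercise the patterns.
example_files = [
    "data/train-00000-of-00001.parquet",
    "data/test-00000-of-00001.parquet",
    "README.md",
]

for name in example_files:
    print(name, split_for(name))
```

When called, `load_splits()` should return a mapping with `train` and `test` entries matching the splits listed in the metadata.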