missvector/asd-qa-train
收藏Hugging Face2023-09-13 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/missvector/asd-qa-train
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
dataset_info:
features:
- name: question
dtype: string
- name: answers
struct:
- name: answer_end
dtype: int64
- name: answer_start
dtype: int64
- name: text
dtype: string
- name: paragraph
dtype: string
splits:
- name: train
num_bytes: 3060746
num_examples: 2593
download_size: 450478
dataset_size: 3060746
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
# Dataset Card for The ASD QA Dataset (train set)
## Dataset Description
- **Repository:** https://github.com/vifirsanova/empi
### Dataset Summary
A dataset for question-answering used for building an informational Russian language chatbot for the inclusion of people with autism spectrum disorder and Asperger syndrome in particular, based on data from the following website: https://aspergers.ru.
### Languages
Russian
## Dataset Structure
The dataset inherits SQuAD 2.0 structure.
### Source Data
https://aspergers.ru
### Dataset Curators
Victoria Firsanova
提供机构:
missvector
原始信息汇总
数据集卡片 for The ASD QA Dataset (训练集)
数据集描述
数据集摘要
一个用于问答的数据集,用于构建一个信息性的俄语聊天机器人,特别针对自闭症谱系障碍和阿斯伯格综合症人群的融入,基于以下网站的数据:https://aspergers.ru。
语言
俄语
数据集结构
该数据集继承了 SQuAD 2.0 结构。
数据集信息
-
特征:
- question: 字符串类型
- answers: 结构体类型,包含以下字段:
- answer_end: 64位整数类型
- answer_start: 64位整数类型
- text: 字符串类型
- paragraph: 字符串类型
-
分割:
- train:
- 字节数: 3060746
- 样本数: 2593
- train:
-
下载大小: 450478
-
数据集大小: 3060746
配置
- config_name: default
- data_files:
- split: train
- path: data/train-*
- data_files:
数据集策展人
Victoria Firsanova



