ATIS Dataset
收藏ATIS (Airline Travel Information System) Dataset 概述
数据格式
- Python 3 Pickle 格式
- Rasa NLU JSON 格式
数据样本
原始格式
text 0: flight: BOS i want to fly from boston at 838 am and arrive in denver at 1110 in the morning EOS BOS O i O want O to O fly O from O boston B-fromloc.city_name at O 838 B-depart_time.time am I-depart_time.time and O arrive O in O denver B-toloc.city_name at O 1110 B-arrive_time.time in O the O morning B-arrive_time.period_of_day EOS O
Rasa NLU Json 格式
json { "rasa_nlu_data": { "common_examples": [ { "text": "i would like to find a flight from charlotte to las vegas that makes a stop in st. louis", "intent": "flight", "entities": [ { "start": 35, "end": 44, "value": "charlotte", "entity": "fromloc.city_name" }, { "start": 48, "end": 57, "value": "las vegas", "entity": "toloc.city_name" }, { "start": 79, "end": 88, "value": "st. louis", "entity": "stoploc.city_name" } ] }, ... ] } }
数据统计
| 样本数 | 词汇数 | 实体数 | 意图数 |
|---|---|---|---|
| 4978(训练集)+893(测试集) | 943 | 129 | 26 |
下载
| 数据格式 | 训练集 | 测试集 |
|---|---|---|
| Python 3 Pickle 格式 | atis.train.pkl | atis.test.pkl |
| Rasa NLU JSON 格式 | train.json | test.json |
数据来源
- 原始数据集:ATIS DataSet by siddhadev
- 部分代码:MicroSoft CNTK Examples




