NoVAGraphS FSA User-Agent Corpus

Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://zenodo.org/record/10822732
Description:
Paper: Di Nuovo E., Sanguinetti M., Balestrucci P.F., Anselma L., Bernareggi C., Mazzei A. (2024). Educational Dialogue Systems for Visually Impaired Students: Introducing a Task-Oriented User-Agent Corpus. Accepted paper at the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024).

Contact person: Elisa Di Nuovo, elisa.dinuovo@gmail.com

Dataset Summary
Collection of user-agent interactions revolving around the description of Finite State Automata.

Dataset Description
The corpus consists of a UTF-8 encoded CSV file with the following columns:
CODE_ID: the id of the interaction
Turn: the turn number within the interaction
Participant: the sender of the utterance (U for the user, S for the agent)
Text: the utterance content
VIP: whether the user is a Visually-Impaired Person
Token count: the number of tokens in the utterance (counted using the spaCy tokenizer)
DAs_GOLD and Errors_GOLD: the labels assigned for Dialog Acts and Errors, respectively
FSA_ID: the id of the Finite State Automaton referred to within the conversation (it corresponds to the names of the PNG and HTML files containing the relevant information on the FSA)

Additional Data
Two PNG files with the graphical representation of the automata
Two HTML files containing the state tables of the automata
RASA configuration files used to train the DIET classifier on the DAs

Access Request
To access the data, users need to fill out the following Google form
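As a rough illustration of how the columns described above might be used, the following Python sketch groups rows into interactions by CODE_ID and extracts the user turns. It is only a sketch under stated assumptions: the sample rows are made up and are not taken from the corpus, the comma delimiter and the value formats (for example "yes"/"no" for VIP, or the label strings in DAs_GOLD) are guesses, and the helper names group_by_interaction and user_utterances are hypothetical.

```python
import csv
import io
from collections import defaultdict

# Made-up rows for illustration only; they are NOT taken from the corpus.
# Column names follow the dataset description; the comma delimiter and the
# value formats (e.g. "yes"/"no" for VIP, the DAs_GOLD labels) are assumptions.
SAMPLE_CSV = """CODE_ID,Turn,Participant,Text,VIP,Token count,DAs_GOLD,Errors_GOLD,FSA_ID
d01,2,S,The automaton has four states.,no,6,answer,,fsa1
d01,1,U,How many states are there?,no,6,question,,fsa1
d02,1,U,Is there a transition from q0 to q1?,yes,9,question,,fsa2
"""


def group_by_interaction(csv_file):
    """Group rows by CODE_ID and sort each interaction by turn number."""
    interactions = defaultdict(list)
    for row in csv.DictReader(csv_file):
        interactions[row["CODE_ID"]].append(row)
    for turns in interactions.values():
        turns.sort(key=lambda r: int(r["Turn"]))
    return dict(interactions)


def user_utterances(interactions):
    """Return the text of every user turn (Participant == 'U')."""
    return [
        row["Text"]
        for turns in interactions.values()
        for row in turns
        if row["Participant"] == "U"
    ]


interactions = group_by_interaction(io.StringIO(SAMPLE_CSV))
print(len(interactions), "interactions")
print(user_utterances(interactions))
```

To run the same functions on the real file, one could pass a file object opened with `open(path, encoding="utf-8", newline="")`, where `path` is wherever the CSV was saved after access is granted.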
Created:
2024-07-06