five

SINAI/Spanish-QC

收藏
Hugging Face2024-03-22 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/SINAI/Spanish-QC
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-nc-sa-4.0 language: - es tags: - Answer Search classification pretty_name: Spanish-QC --- ### Dataset Description **Paper**: [BRUJA: Question Classification for Spanish. Using Machine Translation and an English Classifier.](https://aclanthology.org/W06-1906.pdf) **Point of Contact**: magc@ujaen.es This resource is 6305 questions in Spanish labeled for Answer Search classification, following the taxonomy defined in the article "X. Li and D. Roth. Learning Question Classifiers", which has the following general and detailed categories: - ABBR: abbreviation, expansion - DESC: definition, description, mode, motif - ENTY: animal, body, color, creation, currency, disease/medical, event, food, instrument, language, letter, other, plant, product, religion, sport, substance, symbol, technique, term, vehicle, word - HUM: description, group, individual, title - LOC: city, country, mountain, other, state, other, state - NUM: code, count, date, distance, distance, money, order, other, percentage, period, speed, temperature, size, weight Starting from a set of labeled questions for English, this resource has been generated with various questions in Spanish labeled and reviewed by 3 people. ### Acknowledgments This work has been supported by the Spanish Government (MCYT) with grant TIC2003-07158-C04-04. ### Licensing Information Spanish-QC is released under the [Apache-2.0 License](http://www.apache.org/licenses/LICENSE-2.0). ### Citation Information ```bibtex @inproceedings{a-garcia-cumbreras-etal-2006-bruja, title = "{BRUJA}: Question Classification for {S}panish. Using Machine Translationand an {E}nglish Classifier", author = "Garc{\'\i}a Cumbreras, Miguel {\'A}. and Ure{\~n}a L{\'o}pez, L. Alfonso and Mart{\'\i}nez Santiago, Fernando", booktitle = "Proceedings of the Workshop on Multilingual Question Answering - {MLQA} {`}06", year = "2006", url = "https://aclanthology.org/W06-1906", } ```
提供机构:
SINAI
原始信息汇总

数据集描述

论文: BRUJA: Question Classification for Spanish. Using Machine Translation and an English Classifier.

联系人: magc@ujaen.es

该资源包含6305个西班牙语问题,用于答案搜索分类,遵循文章“X. Li and D. Roth. Learning Question Classifiers”中定义的分类法,包括以下一般和详细类别:

  • ABBR: 缩写, 扩展
  • DESC: 定义, 描述, 模式, 主题
  • ENTY: 动物, 身体, 颜色, 创造, 货币, 疾病/医疗, 事件, 食物, 乐器, 语言, 字母, 其他, 植物, 产品, 宗教, 体育, 物质, 符号, 技术, 术语, 交通工具, 单词
  • HUM: 描述, 团体, 个人, 头衔
  • LOC: 城市, 国家, 山脉, 其他, 州, 其他, 州
  • NUM: 代码, 计数, 日期, 距离, 距离, 金钱, 顺序, 其他, 百分比, 时期, 速度, 温度, 大小, 重量

该资源从一组标记的英语问题开始,生成了多种西班牙语标记问题,并由3人进行了审查。

致谢

该工作得到了西班牙政府(MCYT)的资助,授予TIC2003-07158-C04-04。

许可信息

Spanish-QC 在 Apache-2.0 License 下发布。

引用信息

bibtex @inproceedings{a-garcia-cumbreras-etal-2006-bruja, title = "{BRUJA}: Question Classification for {S}panish. Using Machine Translationand an {E}nglish Classifier", author = "Garc{\i}a Cumbreras, Miguel {A}. and Ure{~n}a L{o}pez, L. Alfonso and Mart{\i}nez Santiago, Fernando", booktitle = "Proceedings of the Workshop on Multilingual Question Answering - {MLQA} {`}06", year = "2006", url = "https://aclanthology.org/W06-1906", }

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作