five

dhfbk/modafact-ita

收藏
Hugging Face2025-01-21 更新2025-04-19 收录
下载链接:
https://hf-mirror.com/datasets/dhfbk/modafact-ita
下载链接
链接失效反馈
官方服务:
资源简介:
ModaFact是一个意大利语的文本数据集,标注有事件的事实性和情态性。该数据集的目的是以联合的方式建模文本中事件表达的事实性和情态性值。数据来源于EventNet-ITA,包含3039个句子,73784个单词和10445个标注。数据集在词级别进行标注,并提供两种事实性表示:细粒度表示和粗粒度表示。数据集分为训练集、验证集和测试集,每个集合大约包含相同比例的类别分布。

ModaFact is a textual dataset in Italian annotated with Event Factuality and Modality. The goal of ModaFact is to model in a joint way the factuality and modality values of event-denoting expressions in text. The data is sourced from EventNet-ITA and includes 3,039 sentences, 73,784 words, and 10,445 annotations. The dataset is annotated at the token level and provides both fine-grained and coarse-grained representations of factuality. The dataset is split into training, validation, and test sets, each containing an approximately equal distribution of class proportions.
提供机构:
dhfbk
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作