five

audibeal/fr-echr

收藏
Hugging Face2024-03-13 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/audibeal/fr-echr
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - fr pretty_name: "French version of ECtHR dataset" task_categories: - text-classification --- # French European Court of Human Rights Dataset ## Description The European Court of Human Rights (ECtHR) adjudicates claims concerning infringements on human rights provisions outlined in the European Convention on Human Rights (ECHR) by European states. The Convention can be accessed at https://www.echr.coe.int/Documents/Convention_ENG.pdf. The dataset construction followed the methodology of Chalkidis et al. (2019), but focused on decisions available in French. This dataset is a multi-label text classification dataset, aiming to predict the violation of one of the ten most violated articles based on given facts. ## Dataset Details - Features : ['facts', '10', '11', '13', '14', '2', '3', '5', '6', '8', 'p1-1'] - Train: 7756 - Dev: 862 - Test: 957 ## Usage You can download this dataset with the "datasets" library. Here's an example of how to load and use it in Python: ```python from datasets import load_dataset dataset = load_dataset("audibeal/fr-echr") ``` ## Cite If you use this dataset in the context of a publication, please cite: ```latex Jargon: A Suite of Language Models and Evaluation Tasks for French Specialized Domains Vincent Segonne, Aidan Mannion, Laura Cristina Alonzo Canul, Alexandre Audibert, Xingyu Liu, Cécile Macaire, Adrien Pupier, Yongxin Zhou, Mathilde Aguiar, Felix Herron, Magali Norré, Massih-Reza Amini, Pierrette Bouillon, Iris Eshkol-Taravella, Emmanuelle Esperança-Rodier, Thomas François, Lorraine Goeuriot, Jérôme Goulian, Mathieu Lafourcade, Benjamin Lecouteux, François Portet, Fabien Ringeval, Vincent Vandeghinste, Maximin Coavoux, Marco Dinarelli and Didier Schwab To appear at LREC-COLING 2024 ```
提供机构:
audibeal
原始信息汇总

French European Court of Human Rights Dataset 概述

数据集描述

  • 语言:法语
  • 任务类别:文本分类
  • 数据集目的:预测基于给定事实的欧洲人权法院裁决中,是否违反了十大最常违反的人权条款。
  • 构建方法:遵循 Chalkidis et al. (2019) 的方法,专注于法语裁决。

数据集详情

  • 特征:[facts, 10, 11, 13, 14, 2, 3, 5, 6, 8, p1-1]
  • 训练集大小:7756
  • 开发集大小:862
  • 测试集大小:957

引用信息

若在出版物中使用此数据集,请引用: latex Jargon: A Suite of Language Models and Evaluation Tasks for French Specialized Domains Vincent Segonne, Aidan Mannion, Laura Cristina Alonzo Canul, Alexandre Audibert, Xingyu Liu, Cécile Macaire, Adrien Pupier, Yongxin Zhou, Mathilde Aguiar, Felix Herron, Magali Norré, Massih-Reza Amini, Pierrette Bouillon, Iris Eshkol-Taravella, Emmanuelle Esperança-Rodier, Thomas François, Lorraine Goeuriot, Jérôme Goulian, Mathieu Lafourcade, Benjamin Lecouteux, François Portet, Fabien Ringeval, Vincent Vandeghinste, Maximin Coavoux, Marco Dinarelli and Didier Schwab
To appear at LREC-COLING 2024

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作