Jargon: A Suite of Language Models and Evaluation Tasks for French Specialized Domains: Data
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/10865546
下载链接
链接失效反馈官方服务:
资源简介:
This repository contains the data for the following paper:
Vincent Segonne, Aidan Mannion, Laura Cristina Alonzo Canul, Alexandre Daniel Audibert, Xingyu Liu, Cécile Macaire, Adrien Pupier, Yongxin Zhou, Mathilde Aguiar, Felix E. Herron, Magali Norré, Massih R Amini, Pierrette Bouillon, Iris Eshkol-Taravella, Emmanuelle Esperança-Rodier, Thomas François, Lorraine Goeuriot, Jérôme Goulian, Mathieu Lafourcade, et al.. 2024. Jargon: A Suite of Language Models and Evaluation Tasks for French Specialized Domains. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 9463–9476, Torino, Italia. ELRA and ICCL.
1) pretraining data for the Jargon specialized language models
2) ECTHR_FR dataset for text classification in the French legal domain
创建时间:
2024-10-14



