five

higgood/BioWMT22_zh2en

收藏
Hugging Face2024-09-06 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/higgood/BioWMT22_zh2en
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: zh dtype: string - name: en dtype: string splits: - name: test num_bytes: 114235 num_examples: 264 download_size: 66111 dataset_size: 114235 configs: - config_name: default data_files: - split: test path: data/test-* task_categories: - translation language: - zh - en tags: - biology - medical size_categories: - n<1K modalities: - Text --- # Dataset Card for BioWMT'22 ZH-EN Test Set Test set that was compiled for the [Biomedical Translation Task](https://www.statmt.org/wmt22/biomedical-translation-task.html) 2022 at [WMT](https://machinetranslate.org/wmt). - **Language(s) (NLP):** English, Chinese; ## Citation ```bibtex @InProceedings {neves-EtAl:2022:WMT, author = {Neves, Mariana and Jimeno Yepes, Antonio and Siu, Amy and Roller, Roland and Thomas, Philippe and Vicente Navarro, Maika and Yeganova, Lana and Wiemann, Dina and Di Nunzio, Giorgio Maria and Vezzani, Federica and Gerardin, Christel and Bawden, Rachel and Estrada, Darryl Johan and Lima-Lopez, Salvador and Farre-Maduel, Eulalia and Krallinger, Martin and Grozea, Cristian and Neveol, Aurelie}, title = {Findings of the WMT 2022 Biomedical Translation Shared Task: Monolingual Clinical Case Reports}, booktitle = {Proceedings of the Seventh Conference on Machine Translation}, month = {December}, year = {2022}, address = {Abu Dhabi}, publisher = {Association for Computational Linguistics}, pages = {694--723}, abstract = {In the seventh edition of the WMT Biomedical Task, we addressed a total of seven language pairs, namely English/German, English/French, English/Spanish, English/Portuguese, English/Chinese, English/Russian, English/Italian. This year's test sets covered three types of biomedical text genre. In addition to scientific abstracts and terminology items used in previous editions, we released test sets of clinical cases. The evaluation of clinical cases translations were given special attention by involving clinicians in the preparation of reference translations and manual evaluation. For the main MEDLINE test sets, we received a total of 609 submissions from 37 teams. For the ClinSpEn sub-task, we had the participation of five teams.}, url = {https://aclanthology.org/2022.wmt-1.69} } ```
提供机构:
higgood
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作