five

higgood/BioWMT19_zh2en

收藏
Hugging Face2024-09-06 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/higgood/BioWMT19_zh2en
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: zh dtype: string - name: en dtype: string splits: - name: test num_bytes: 86840 num_examples: 243 download_size: 57554 dataset_size: 86840 configs: - config_name: default data_files: - split: test path: data/test-* task_categories: - translation language: - zh - en tags: - biology - medical size_categories: - n<1K modalities: - Text --- # Dataset Card for BioWMT'19 ZH-EN Test Set Test set that was compiled for the [Biomedical Translation Task](https://www.statmt.org/wmt19/biomedical-translation-task.html) 2019 at [WMT](https://machinetranslate.org/wmt). - **Language(s) (NLP):** English, Chinese; ## Citation ```bibtex @inproceedings{bawden-etal-2019-findings, title = "Findings of the {WMT} 2019 Biomedical Translation Shared Task: Evaluation for {MEDLINE} Abstracts and Biomedical Terminologies", author = "Bawden, Rachel and Bretonnel Cohen, Kevin and Grozea, Cristian and Jimeno Yepes, Antonio and Kittner, Madeleine and Krallinger, Martin and Mah, Nancy and Neveol, Aurelie and Neves, Mariana and Soares, Felipe and Siu, Amy and Verspoor, Karin and Vicente Navarro, Maika", booktitle = "Proceedings of the Fourth Conference on Machine Translation (Volume 3: Shared Task Papers, Day 2)", month = aug, year = "2019", address = "Florence, Italy", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/W19-5403", doi = "10.18653/v1/W19-5403", pages = "29--53", } ```

数据集信息: 特征: - 名称:zh,数据类型:字符串 - 名称:en,数据类型:字符串 数据集划分: - 划分名称:test,字节数:86840,样本数:243 下载大小:57554,数据集总大小:86840 配置项: - 配置名称:default,数据文件: - 划分:test,路径:data/test-* 任务类别: - 翻译 涉及语言: - 中文 - 英文 标签: - 生物学 - 医学 样本规模分类: - 样本数少于1000 模态: - 文本 # BioWMT'19 汉英测试集数据集卡片 本测试集专为2019年**机器翻译研讨会(WMT)**举办的[生物医学翻译任务(Biomedical Translation Task)](https://www.statmt.org/wmt19/biomedical-translation-task.html)构建。 - **自然语言处理涉及语言**:英语、中文 ## 引用 bibtex @inproceedings{bawden-etal-2019-findings, title = "2019年WMT生物医学翻译共享任务结果:针对MEDLINE摘要与生物医学术语的评估", author = "Bawden, Rachel and Bretonnel Cohen, Kevin and Grozea, Cristian and Jimeno Yepes, Antonio and Kittner, Madeleine and Krallinger, Martin and Mah, Nancy and Neveol, Aurelie and Neves, Mariana and Soares, Felipe and Siu, Amy and Verspoor, Karin and Vicente Navarro, Maika", booktitle = "第四届机器翻译会议论文集(第3卷:共享任务论文,第2日)", month = "八月", year = "2019", address = "意大利佛罗伦萨", publisher = "国际计算语言学协会", url = "https://aclanthology.org/W19-5403", doi = "10.18653/v1/W19-5403", pages = "29--53", }
提供机构:
higgood
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作