higgood/BioWMT19_zh2en
收藏Hugging Face2024-09-06 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/higgood/BioWMT19_zh2en
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: zh
dtype: string
- name: en
dtype: string
splits:
- name: test
num_bytes: 86840
num_examples: 243
download_size: 57554
dataset_size: 86840
configs:
- config_name: default
data_files:
- split: test
path: data/test-*
task_categories:
- translation
language:
- zh
- en
tags:
- biology
- medical
size_categories:
- n<1K
modalities:
- Text
---
# Dataset Card for BioWMT'19 ZH-EN Test Set
Test set that was compiled for the [Biomedical Translation Task](https://www.statmt.org/wmt19/biomedical-translation-task.html) 2019 at [WMT](https://machinetranslate.org/wmt).
- **Language(s) (NLP):** English, Chinese;
## Citation
```bibtex
@inproceedings{bawden-etal-2019-findings,
title = "Findings of the {WMT} 2019 Biomedical Translation Shared Task: Evaluation for {MEDLINE} Abstracts and Biomedical Terminologies",
author = "Bawden, Rachel and
Bretonnel Cohen, Kevin and
Grozea, Cristian and
Jimeno Yepes, Antonio and
Kittner, Madeleine and
Krallinger, Martin and
Mah, Nancy and
Neveol, Aurelie and
Neves, Mariana and
Soares, Felipe and
Siu, Amy and
Verspoor, Karin and
Vicente Navarro, Maika",
booktitle = "Proceedings of the Fourth Conference on Machine Translation (Volume 3: Shared Task Papers, Day 2)",
month = aug,
year = "2019",
address = "Florence, Italy",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/W19-5403",
doi = "10.18653/v1/W19-5403",
pages = "29--53",
}
```
数据集信息:
特征:
- 名称:zh,数据类型:字符串
- 名称:en,数据类型:字符串
数据集划分:
- 划分名称:test,字节数:86840,样本数:243
下载大小:57554,数据集总大小:86840
配置项:
- 配置名称:default,数据文件:
- 划分:test,路径:data/test-*
任务类别:
- 翻译
涉及语言:
- 中文
- 英文
标签:
- 生物学
- 医学
样本规模分类:
- 样本数少于1000
模态:
- 文本
# BioWMT'19 汉英测试集数据集卡片
本测试集专为2019年**机器翻译研讨会(WMT)**举办的[生物医学翻译任务(Biomedical Translation Task)](https://www.statmt.org/wmt19/biomedical-translation-task.html)构建。
- **自然语言处理涉及语言**:英语、中文
## 引用
bibtex
@inproceedings{bawden-etal-2019-findings,
title = "2019年WMT生物医学翻译共享任务结果:针对MEDLINE摘要与生物医学术语的评估",
author = "Bawden, Rachel and
Bretonnel Cohen, Kevin and
Grozea, Cristian and
Jimeno Yepes, Antonio and
Kittner, Madeleine and
Krallinger, Martin and
Mah, Nancy and
Neveol, Aurelie and
Neves, Mariana and
Soares, Felipe and
Siu, Amy and
Verspoor, Karin and
Vicente Navarro, Maika",
booktitle = "第四届机器翻译会议论文集(第3卷:共享任务论文,第2日)",
month = "八月",
year = "2019",
address = "意大利佛罗伦萨",
publisher = "国际计算语言学协会",
url = "https://aclanthology.org/W19-5403",
doi = "10.18653/v1/W19-5403",
pages = "29--53",
}
提供机构:
higgood



