mteb/VieMedEVBitextMining
收藏Hugging Face2025-06-20 更新2025-07-05 收录
下载链接:
https://hf-mirror.com/datasets/mteb/VieMedEVBitextMining
下载链接
链接失效反馈官方服务:
资源简介:
VieMedEVBitextMining是一个高质量的越南语-英语医疗领域平行语料库,用于机器翻译任务。该数据集是MTEB基准的一部分,来源于nhuvo/MedEV数据集。数据集包含两个特征:sentence1和sentence2,均为字符串类型。测试集包含2048个样本。数据集遵循cc-by-nc-4.0许可,并标记有mteb和text标签。
VieMedEVBitextMining is a high-quality Vietnamese-English parallel corpus from the medical domain for machine translation tasks. This dataset is a part of the MTEB benchmark and is sourced from the nhuvo/MedEV dataset. It includes two features: sentence1 and sentence2, both of which are strings. The test split contains 2048 examples. The dataset is licensed under cc-by-nc-4.0 and is tagged with mteb and text.
提供机构:
mteb



