mteb/VieMedEVBitextMining

Name: mteb/VieMedEVBitextMining
Creator: mteb
Published: 2025-06-20 19:11:42
License: cc-by-nc-4.0

Source: Hugging Face. Updated 2025-06-20; indexed 2025-07-05.

Download link:

https://hf-mirror.com/datasets/mteb/VieMedEVBitextMining

Description:

VieMedEVBitextMining is a high-quality Vietnamese-English parallel corpus from the medical domain, intended for machine translation tasks. It is part of the MTEB benchmark and is derived from the nhuvo/MedEV dataset. Each record has two string features, sentence1 and sentence2. The test split contains 2,048 examples. The dataset is licensed under cc-by-nc-4.0 and carries the tags mteb and text.
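
As an illustration of how a parallel corpus with sentence1 and sentence2 columns is typically used for bitext mining, the sketch below matches each source vector to its most similar target vector by cosine similarity and reports how often row i is matched to row i. This is a minimal sketch under stated assumptions, not the MTEB evaluation code: the helper names (cosine, mine_bitext, mining_accuracy, load_pairs) and the toy vectors are hypothetical, and load_pairs assumes the Hugging Face datasets library is installed and that the default configuration exposes the test split described above.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def mine_bitext(src_vecs, tgt_vecs):
    """For each source vector, return the index of the most similar target vector."""
    return [
        max(range(len(tgt_vecs)), key=lambda j: cosine(s, tgt_vecs[j]))
        for s in src_vecs
    ]


def mining_accuracy(src_vecs, tgt_vecs):
    """Fraction of source rows whose best match is the target row with the same index."""
    preds = mine_bitext(src_vecs, tgt_vecs)
    return sum(p == i for i, p in enumerate(preds)) / len(preds)


def load_pairs():
    """Hypothetical loader: needs network access and the `datasets` package.

    Assumes the default configuration provides a `test` split with
    `sentence1` and `sentence2` string columns.
    """
    from datasets import load_dataset

    ds = load_dataset("mteb/VieMedEVBitextMining", split="test")
    return ds["sentence1"], ds["sentence2"]


# Toy stand-ins for sentence embeddings; a real run would encode the
# sentence1 and sentence2 columns with an embedding model of your choice.
src = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
tgt = [[0.9, 0.1, 0.0], [0.1, 0.8, 0.1], [0.0, 0.2, 0.9]]

if __name__ == "__main__":
    print(mine_bitext(src, tgt))
    print(mining_accuracy(src, tgt))
```

The brute-force search is quadratic in the number of pairs, which is acceptable for a split of about two thousand examples; a larger corpus would usually call for a vectorized or approximate nearest-neighbor search instead.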

Provider:

mteb