mteb/WebFAQBitextMiningQuestions
Source: Hugging Face. Updated 2025-06-25; indexed 2025-07-05.
Download link:
https://hf-mirror.com/datasets/mteb/WebFAQBitextMiningQuestions
Description:
This dataset contains translation data organized by language pair. Each configuration covers one specific language pair and records the dataset name, the features (sentence1 and sentence2, both strings), the number of examples, the dataset size in bytes, the download size, and the number of examples in the default split. The pairs are drawn from a wide range of languages: Arabic, Azerbaijani, Bengali, Bulgarian, Catalan, Czech, Danish, German, English, Estonian, Finnish, French, Gujarati, Hindi, Croatian, Indonesian, Icelandic, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Malayalam, Marathi, Norwegian, Dutch, Portuguese, Romanian, Russian, Slovak, Slovenian, Serbian, Spanish, Swedish, Tamil, Telugu, Thai, Turkish, Ukrainian, Urdu, Vietnamese, and Chinese.
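As a rough illustration of the layout described above, the following Python sketch loads one language-pair configuration and checks that each row carries the two string fields sentence1 and sentence2. It assumes the third-party `datasets` library is installed and that network access is available. The helper names (`load_pairs`, `is_valid_pair`, `count_valid`) are hypothetical, and both the configuration name and the split name "default" are assumptions that should be checked against the dataset card.

```python
from typing import Iterable, Mapping

# Repository name as given on this page.
REPO_ID = "mteb/WebFAQBitextMiningQuestions"


def is_valid_pair(row: Mapping[str, object]) -> bool:
    """Return True if the row has non-empty string sentence1 and sentence2 fields."""
    return all(
        isinstance(row.get(key), str) and row[key].strip() != ""
        for key in ("sentence1", "sentence2")
    )


def count_valid(rows: Iterable[Mapping[str, object]]) -> int:
    """Count the rows that pass is_valid_pair."""
    return sum(1 for row in rows if is_valid_pair(row))


def load_pairs(config: str, split: str = "default"):
    """Load one language-pair configuration from the Hugging Face Hub.

    Assumption: `config` must match a configuration name listed on the
    dataset card, and the split name "default" is inferred from the
    description above rather than verified.
    """
    # Imported lazily so the helpers above work without the library installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, config, split=split)
```

A call might look like `pairs = load_pairs("deu-eng")` followed by `count_valid(pairs)`, where "deu-eng" is only a placeholder for whatever configuration names the dataset actually uses.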
Provider:
mteb



