mah92/Khadijah-FA_EN-Public-Phone-Audio-Dataset
收藏Hugging Face2025-02-10 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/mah92/Khadijah-FA_EN-Public-Phone-Audio-Dataset
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个多语言文本数据集,包含波斯语和英语文本。数据集经过处理,移除或替换了无法被特定阅读器正确读取的字符和行,包括中文字符、波斯语和英语的单个字母和一些特殊符号。此外,还移除了包含特定标识符或文本的行,以及阿拉伯语部分的文本。但是,具体的数据集用途和内容描述没有在README中提供。
This dataset is a multilingual text corpus containing Persian and English texts. The dataset has been processed to remove or replace characters and lines that are not correctly read by a specific reader, including Chinese characters, single Persian and English alphabets, and special symbols. Additionally, lines containing specific identifiers or texts, as well as Arabic parts of texts, have been removed. However, the README does not provide a specific description of the datasets purpose and content.
提供机构:
mah92



