five

Flux9665/BibleMMS

收藏
Hugging Face2024-06-16 更新2024-06-15 收录
下载链接:
https://hf-mirror.com/datasets/Flux9665/BibleMMS
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集与论文《Meta Learning Text-to-Speech Synthesis in over 7000 Languages》相关联,使用了eBible数据集的子集作为文本输入,生成了超过7000种语言的2000个语音样本。数据集包含了多种语言的ISO-639-3代码,并详细列出了这些代码。数据集的特征包括音频、转录文本和语言代码,数据集大小为508120568184.992字节,下载大小为597640766127字节。

The dataset is associated with the paper Meta Learning Text-to-Speech Synthesis in over 7000 Languages and uses subsets of the eBible dataset as text input to generate 2000 spoken utterances per language across over 7000 languages. The dataset includes ISO-639-3 codes for various languages and lists these codes in detail. The features of the dataset include audio, transcript, and language code, with a dataset size of 508120568184.992 bytes and a download size of 597640766127 bytes.
提供机构:
Flux9665
原始信息汇总

数据集概述

许可证

  • MIT许可证

任务类别

  • 文本转语音

数据集信息

  • 特征

    • 音频 (audio)
    • 转录文本 (string)
    • 语言代码 (string)
  • 数据分割

    • 训练集 (train)
      • 字节数: 508120568184.992
      • 样本数: 736272
  • 下载大小

    • 597640766127 字节
  • 数据集大小

    • 508120568184.992 字节

配置

  • 默认配置 (default)
    • 数据文件路径: data/train-*

语言代码

  • 数据集包含以下ISO-639-3语言代码:

    acf, bss, deu, inb, nca, quh, wap, acr, bus, dgr, ind, maz, nch, qul, tav, wmw, acu, byr, dik, iou, mbb, ncj, qvc, tbc, xed, agd, bzh, djk, ipi, mbc, ncl, qve, tbg, xon, agg, bzj, dop, jac, mbh, ncu, qvh, tbl, xtd, agn, caa, jic, mbj, ndj, qvm, tbz, xtm, agr, cab, emp, jiv, mca, ngp, qvs, tcs, yaa, agu, cap, eng, jvn, mca, ngp, qvs, tcs, yaa, aia, car, ese, mcb, ngu, qvw, yal, cax, kaq, mcd, nhe, qvz, tee, ycn, ake, cbc, far, mco, qwh, yka, alp, cbi, fra, kdc, mcp, nhu, qxh, ame, cbr, gai, kde, mcq, nhw, qxn, tew, yre, amf, cbs, gam, kdl, mdy, nhy, qxo, tfr, yva, amk, cbt, geb, kek, med, nin, rai, zaa, apb, cbu, glk, ken, mee, nko, rgu, zab, apr, cbv, meq, nld, tgo, zac, arl, cco, gng, kje, met, nlg, rop, tgp, zad, grc, klv, mgh, nnq, rro, zai, ata, cek, gub, kmu, mib, noa, ruf, tna, zam, atb, cgc, guh, kne, mie, not, rug, tnk, zao, atg, chf, knf, mih, npl, rus, tnn, zar, awb, chz, gum, knj, mil, sab, tnp, zas, cjo, guo, ksr, mio, obo, seh, toc, zav, azg, cle, gux, kue, mit, omw, sey, tos, zaw, azz, cme, gvc, kvn, miz, ood, sgb, tpi, zca, bao, cni, gwi, kwd, mkl, shp, tpt, zga, bba, cnl, gym, kwf, mkn, ote, sja, trc, ziw, bbb, cnt, gyr, kwi, mop, otq, snn, ttc, zlm, cof, hat, kyc, mox, pab, snp, tte, zos, bgt, con, kyf, mpm, pad, som, tue, zpc, bjr, cot, heb, kyg, mpp, soy, tuf, zpl, bjv, cpa, kyq, mpx, pao, spa, tuo, zpm, bjz, cpb, hlt, kyz, mqb, pib, spp, tur, zpo, bkd, cpu, hns, lac, mqj, pir, spy, txq, zpu, blz, crn, hto, lat, msy, pjt, sri, txu, zpz, bmr, cso, hub, lex, mto, pls, srm, udu, ztq, bmu, ctu, lgl, muy, poi, srn, ukr, zty, bnp, cuc, lid, mxb, pol, stp, upv, zyp, boa, cui, huu, mxq, por, sus, ura, boj, cuk, huv, llg, mxt, poy, suz, urb, box, cwe, hvn, prf, swe, urt, bpr, cya, ign, lww, myk, ptu, swh, usp, bps, daa, ikk, maj, myy, sxb, vid, bqc, dah, nab, qub, tac, vie, bqp, ded, imo, maq, nas, quf, taj, vmy

搜集汇总
数据集介绍
main_image_url
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作