AnnaWegmann/Training-Misc
Source: Hugging Face. Updated 2025-07-15; indexed 2025-10-25.
Download link:
https://hf-mirror.com/datasets/AnnaWegmann/Training-Misc
Description:
This dataset is the training corpus for the paper "Tokenization is Sensitive to Language Variation". Its README does not describe the dataset's specific contents or composition.
Provided by:
AnnaWegmann



