zeroMN/hanlp_date-zh
收藏Hugging Face2025-01-20 更新2025-08-30 收录
下载链接:
https://hf-mirror.com/datasets/zeroMN/hanlp_date-zh
下载链接
链接失效反馈官方服务:
资源简介:
这是一个用于中文分词任务的训练和测试数据集,包含黄金标准分词结果和未分词的测试数据。同时,该数据集还包含用于评分的脚本和一个简单的分词器。此外,还提到了一个名为 Synthetic Multimodal Dataset 的合成多模态数据集,用于视觉问答、自动语音识别和图像标题生成任务,但没有提供详细描述。
This is a training and testing dataset for Chinese word segmentation, containing gold standard segmentation results and unsegmented test data. It also includes a scoring script and a simple segmenter. In addition, a Synthetic Multimodal Dataset is mentioned for tasks such as Visual Question Answering, Automatic Speech Recognition, and Image Captioning, but no detailed description is provided.
提供机构:
zeroMN



