michsethowusu/lvl_5_vital_wikipedia_articles_tokenised
Source: Hugging Face. Updated: 2025-03-30. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/michsethowusu/lvl_5_vital_wikipedia_articles_tokenised
Description:
This dataset is a modified version of the Level 5 Vital Wikipedia Articles dataset in which the article text has been split into individual sentences. It is intended for sentence-level NLP tasks such as summarization, sentence classification, and language modeling.
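As an illustration of what sentence-level splitting involves, here is a minimal sketch in Python. The listing does not say which tokenizer produced the dataset, so the regular expression and the `split_sentences` helper below are assumptions made for illustration only, not the dataset author's method. The rule is deliberately naive and will mis-split abbreviations such as "Dr. Smith".

```python
import re
from typing import List

# Hypothetical rule (not taken from the dataset): a sentence ends at '.', '!' or '?'
# when followed by whitespace and then an uppercase letter, a digit, or a quote.
_SENTENCE_BOUNDARY = re.compile(r"(?<=[.!?])\s+(?=[A-Z0-9\"'])")


def split_sentences(text: str) -> List[str]:
    """Split a paragraph into sentences using the naive boundary rule above."""
    text = text.strip()
    if not text:
        return []
    # The whitespace at each boundary is consumed by the split.
    return [part.strip() for part in _SENTENCE_BOUNDARY.split(text) if part.strip()]


if __name__ == "__main__":
    paragraph = "Paris is the capital of France. It lies on the Seine! Is it large?"
    for sentence in split_sentences(paragraph):
        print(sentence)
```

For real corpora, a trained sentence segmenter would likely handle abbreviations and quotations better than a single regular expression.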
Provider:
michsethowusu



