TextZoo
收藏arXiv2018-03-19 更新2024-06-21 收录
下载链接:
https://github.com/wabyking/TextClassificationBenchmark
下载链接
链接失效反馈官方服务:
资源简介:
TextZoo是由腾讯创建的一个文本分类基准数据集,包含超过20种模型和10个数据集,旨在重新考虑文本分类任务。数据集涵盖了多个领域,如新闻组、电影评论等,用于评估不同模型的性能。创建过程中,研究者们重新实现了多种流行的文本表示模型,并通过这些数据集进行系统性评估。该数据集主要用于解决文本分类中的模型比较和性能评估问题,帮助研究者理解不同模型在特定任务上的表现。
TextZoo is a text classification benchmark dataset created by Tencent, which includes over 20 models and 10 datasets, and is designed to revisit the text classification task. The dataset covers multiple domains such as newsgroups and movie reviews, and is used to evaluate the performance of different models. During its development, researchers re-implemented a variety of popular text representation models and conducted systematic evaluations using these datasets. This dataset is primarily used to address the issues of model comparison and performance evaluation in text classification, helping researchers understand the performance of various models on specific tasks.
提供机构:
腾讯
创建时间:
2018-02-11



