brainer/legal_word_dataset
收藏Hugging Face2025-10-24 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/brainer/legal_word_dataset
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含专业书籍数据的语料库,其中包括书籍ID、类别、流行度、关键词、文本内容、分词信息、出版日期以及命名实体信息。数据集分为两种配置:长配置包含更详细的信息,而短配置则仅包含文本内容。提供了训练集和验证集,可用于文本处理和实体识别等NLP任务。
This is a corpus of professional book data, including book ID, category, popularity, keywords, text content, word segmentation, publication date, and named entity information. The dataset is divided into two configurations: the long configuration contains more detailed information, while the short configuration only includes text content. Training and validation sets are provided, which can be used for NLP tasks such as text processing and entity recognition.
提供机构:
brainer



