jganzabalseenka/noun_phrases_elastic_2024-05-01_2024-05-30

Name: jganzabalseenka/noun_phrases_elastic_2024-05-01_2024-05-30
Creator: jganzabalseenka
Published: 2024-07-01 14:09:51
License: Not specified

Source: Hugging Face. Updated 2024-07-01; indexed 2024-07-06.

Download link:

https://hf-mirror.com/datasets/jganzabalseenka/noun_phrases_elastic_2024-05-01_2024-05-30

Dataset overview:

The dataset contains several fields describing noun phrases: the noun phrase itself (noun_phrase), its count (count), first word (first_word), total word count (total_words), last words (last_words), normalized form (normalized), and whether it carries accents (with_accents). It provides a training split with 648,253 samples totaling 66,952,109 bytes; the download size is 33,910,808 bytes. The fields may be useful for natural language processing tasks such as text analysis and language model training.
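As a rough illustration of the figures above, the following Python sketch computes the average size per sample and the download-to-dataset size ratio, and shows one way the training split might be loaded. The helper names `size_summary` and `load_train_split` are hypothetical, the split name "train" is assumed from the description, and loading requires the Hugging Face `datasets` library and network access.

```python
# Figures quoted in the dataset description.
NUM_SAMPLES = 648_253
DATASET_BYTES = 66_952_109
DOWNLOAD_BYTES = 33_910_808

# Field names listed in the description; their types are not stated there.
FIELDS = [
    "noun_phrase", "count", "first_word", "total_words",
    "last_words", "normalized", "with_accents",
]


def size_summary(num_samples, dataset_bytes, download_bytes):
    """Return average bytes per sample and the download/dataset size ratio."""
    return {
        "avg_bytes_per_sample": dataset_bytes / num_samples,
        "download_ratio": download_bytes / dataset_bytes,
    }


def load_train_split():
    """Load the training split (assumed to be named "train"); needs network access."""
    # Imported lazily so the rest of the sketch runs without the library installed.
    from datasets import load_dataset

    return load_dataset(
        "jganzabalseenka/noun_phrases_elastic_2024-05-01_2024-05-30",
        split="train",
    )


if __name__ == "__main__":
    summary = size_summary(NUM_SAMPLES, DATASET_BYTES, DOWNLOAD_BYTES)
    print(f"average bytes per sample: {summary['avg_bytes_per_sample']:.1f}")
    print(f"download/dataset ratio:   {summary['download_ratio']:.2f}")
```

Calling `load_train_split()` and comparing the returned column names against `FIELDS` would be a quick way to check the listing against the actual data.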

Provider:

jganzabalseenka

Summary of original information