hungphongtrn/vietmed_ner_v5
Source: Hugging Face | Updated: 2024-06-17 | Indexed: 2024-07-06
Download link:
https://hf-mirror.com/datasets/hungphongtrn/vietmed_ner_v5
Description:
This dataset is intended for natural language processing tasks, in particular named entity recognition (NER). It has three features: words, tags, and labels. The words feature is a sequence of strings holding the words of the text. The tags feature is a sequence of class labels marking entity types such as organizations, preventive medicine, and disease symptoms. The labels feature is a sequence of strings, possibly used for further classification or annotation tasks. The dataset is split into training, validation, and test sets of 4616, 1154, and 3497 examples respectively. The download size is 606431 bytes and the total size is 5339324 bytes.
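The record layout described above can be sketched in Python. This is a minimal illustration, not the publisher's code. The split names `train`, `validation`, and `test` are assumed. The sample record and its `SYMPTOM` label are invented. It is also assumed that `labels` lines up one-to-one with `words`, which the description does not confirm. The loader uses the Hugging Face `datasets` library and needs network access.

```python
from typing import Dict, List, Tuple

# Split sizes as stated in the description; the split names are assumed.
SPLIT_SIZES: Dict[str, int] = {"train": 4616, "validation": 1154, "test": 3497}


def pair_words_with_labels(example: Dict[str, list]) -> List[Tuple[str, str]]:
    """Zip each word with its string label (assumes one label per word)."""
    words, labels = example["words"], example["labels"]
    if len(words) != len(labels):
        raise ValueError("words and labels must have equal length")
    return list(zip(words, labels))


def load_vietmed_ner():
    """Load the dataset from the Hugging Face Hub (requires network access)."""
    # Imported lazily so the helper above works without `datasets` installed.
    from datasets import load_dataset
    return load_dataset("hungphongtrn/vietmed_ner_v5")


# Hypothetical record, invented only to show the shape of one example.
sample = {
    "words": ["Bệnh", "nhân", "bị", "sốt"],
    "tags": [0, 0, 0, 1],                    # integer class ids (values assumed)
    "labels": ["O", "O", "O", "SYMPTOM"],    # string labels (names assumed)
}
pairs = pair_words_with_labels(sample)
```

Calling `load_vietmed_ner()` and indexing the result by split name should return records with the same three fields, if the assumed split names match the published ones.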
Provided by:
hungphongtrn



