ferrazzipietro/LS_Llama-2-13b-hf_e3c-sentences_NoQuant_64_64_0.05_16_BestF1

Name: ferrazzipietro/LS_Llama-2-13b-hf_e3c-sentences_NoQuant_64_64_0.05_16_BestF1
Creator: ferrazzipietro
Published: 2024-07-04 13:08:51
License: 暂无描述

Hugging Face2024-07-04 更新2024-07-06 收录

下载链接：

https://hf-mirror.com/datasets/ferrazzipietro/LS_Llama-2-13b-hf_e3c-sentences_NoQuant_64_64_0.05_16_BestF1

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含多个特征字段，如句子、实体、原始文本等，每个字段都有其特定的数据类型。数据集被分割为测试集，包含170个样本，总大小为663918字节。数据集的下载大小为133246字节。

The dataset includes multiple feature fields such as sentence, entities, original text, etc., each with its specific data type. The dataset is split into a test set containing 170 samples with a total size of 663918 bytes. The download size of the dataset is 133246 bytes.

提供机构：

ferrazzipietro

原始信息汇总

数据集概述

数据集特征

sentence: 字符串类型，表示句子。
entities: 列表类型，包含以下子特征：
- id: 字符串类型，表示实体ID。
- offsets: 整数序列类型，表示偏移量。
- role: 字符串类型，表示角色。
- semantic_type_id: 字符串类型，表示语义类型ID。
- text: 字符串类型，表示文本。
- type: 字符串类型，表示类型。
original_text: 字符串类型，表示原始文本。
original_id: 字符串类型，表示原始ID。
tokens: 字符串序列类型，表示分词结果。
ner_tags: 整数序列类型，表示命名实体识别标签。
input_ids: 整数序列类型，表示输入ID。
attention_mask: 整数序列类型，表示注意力掩码。
labels: 整数序列类型，表示标签。
predictions: 字符串序列类型，表示预测结果。
ground_truth_labels: 字符串序列类型，表示真实标签。

数据集分割

test: 包含170个样本，占用663918字节。

数据集大小

下载大小: 133246字节
数据集大小: 663918字节

配置

default: 包含测试数据文件，路径为data/test-*。