cicero-im/log
收藏Hugging Face2025-02-09 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/cicero-im/log
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了文本数据及其处理形式,包括去除个人身份信息(PII)的文本、带有遮罩的个人身份信息的文本和清理后的文本。它还包含了时间戳、UUID、任务类型、原始文本、模型输出、实体识别信息、文件类型、创建时间和模型错误等字段。数据集分为训练集,包含1356个样本,大小为1105961字节。
The dataset includes text data and its processed forms, such as text with personal identity information (PII) removed, text with masked PII, and cleaned text. It also contains fields like timestamp, UUID, task type, original text, model output, entity recognition information, file type, creation time, and model error. The dataset is split into a training set with 1356 samples, totaling 1105961 bytes in size.
提供机构:
cicero-im



