kshitijthakkar/loggenix-merged-dataset
收藏Hugging Face2025-07-17 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/kshitijthakkar/loggenix-merged-dataset
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个包含文本和相应元信息的集合,具体字段包括编码文本、格式化文本、输入文本、映射任务、消息内容、消息角色、消息长度、对话轮数、输出文本、源数据集、任务类型和总token数。数据集分为训练集、测试集和评估集,每个集合的大小和样本数量不同,提供了丰富的文本数据用于不同的NLP任务。
This dataset is a collection of text and associated metadata, including fields such as encoded text, formatted text, input text, mapped task, message content, message role, message length, number of turns, output text, source dataset, task type, and total token count. The dataset is split into training, testing, and evaluation sets, each with different sizes and number of samples, providing a rich set of text data for various NLP tasks.
提供机构:
kshitijthakkar



