37channel/news-dataset-20240506-g-rss-step-2-debug-1

Name: 37channel/news-dataset-20240506-g-rss-step-2-debug-1
Creator: 37channel
Published: 2024-06-25 09:27:33
License: 暂无描述

Hugging Face2024-06-25 更新2024-06-29 收录

下载链接：

https://hf-mirror.com/datasets/37channel/news-dataset-20240506-g-rss-step-2-debug-1

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含多个字段，如标题前后的上下文信息（after_inf_title2context_1和before_inf_title2context）、内容（content）、日期（date）、感兴趣的LLM（interested-llm）、步骤（step）、标题（title）、URL（url）和索引（index）。这些字段可能用于分析文本数据，特别是与标题和上下文相关的信息。数据集分为训练集，包含425个样本，总大小为1466442字节。

The dataset includes multiple fields such as context before and after the title (after_inf_title2context_1 and before_inf_title2context), content, date, interested LLM, step, title, URL, and index. These fields may be used for analyzing text data, particularly information related to titles and contexts. The dataset is divided into a training set containing 425 samples, with a total size of 1466442 bytes.

提供机构：

37channel

原始信息汇总

数据集概述

数据集信息

特征

after_inf_title2context_1: 字符串类型
before_inf_title2context: 字符串类型
content: 字符串类型
date: 字符串类型
interested-llm: 字符串类型
step: 字符串类型
title: 字符串类型
url: 字符串类型
index: 整数类型

数据分割

train:
- 样本数量: 425
- 数据大小: 1466442 字节

数据集大小

下载大小: 847460 字节
数据集大小: 1466442 字节

配置

default:
- 数据文件路径: data/train-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集