james-burton/news_channel_ordinal
收藏Hugging Face2023-05-25 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/james-burton/news_channel_ordinal
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: ' n_tokens_content'
dtype: float64
- name: ' n_unique_tokens'
dtype: float64
- name: ' n_non_stop_words'
dtype: float64
- name: ' n_non_stop_unique_tokens'
dtype: float64
- name: ' num_hrefs'
dtype: float64
- name: ' num_self_hrefs'
dtype: float64
- name: ' num_imgs'
dtype: float64
- name: ' num_videos'
dtype: float64
- name: ' average_token_length'
dtype: float64
- name: ' num_keywords'
dtype: float64
- name: ' global_subjectivity'
dtype: float64
- name: ' global_sentiment_polarity'
dtype: float64
- name: ' global_rate_positive_words'
dtype: float64
- name: ' global_rate_negative_words'
dtype: float64
- name: ' rate_positive_words'
dtype: float64
- name: ' rate_negative_words'
dtype: float64
- name: article_title
dtype: string
- name: channel
dtype: int64
splits:
- name: train
num_bytes: 3354492
num_examples: 17241
- name: validation
num_bytes: 591868
num_examples: 3043
- name: test
num_bytes: 987135
num_examples: 5071
download_size: 3376135
dataset_size: 4933495
---
# Dataset Card for "news_channel_ordinal"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
james-burton
原始信息汇总
数据集概述
数据集特征
- n_tokens_content: float64
- n_unique_tokens: float64
- n_non_stop_words: float64
- n_non_stop_unique_tokens: float64
- num_hrefs: float64
- num_self_hrefs: float64
- num_imgs: float64
- num_videos: float64
- average_token_length: float64
- num_keywords: float64
- global_subjectivity: float64
- global_sentiment_polarity: float64
- global_rate_positive_words: float64
- global_rate_negative_words: float64
- rate_positive_words: float64
- rate_negative_words: float64
- article_title: string
- channel: int64
数据集分割
- 训练集:
- 大小: 3354492字节
- 示例数: 17241
- 验证集:
- 大小: 591868字节
- 示例数: 3043
- 测试集:
- 大小: 987135字节
- 示例数: 5071
数据集大小
- 下载大小: 3376135字节
- 数据集总大小: 4933495字节



