pszemraj/NYTWritingStyleGuide-parsed

Name: pszemraj/NYTWritingStyleGuide-parsed
Creator: pszemraj
Published: 2024-01-26 23:26:54
License: 暂无描述

Hugging Face2024-01-26 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/pszemraj/NYTWritingStyleGuide-parsed

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集是对TuringsSolutions/NYTWritingStyleGuide数据集的解析版本，解决了原数据集格式难以解析的问题。数据集包含两个配置：default和raw。default配置包含标题、章节和部分信息，每个部分又包含多个子部分，子部分中包含了各种写作建议、方法、最佳实践等内容。raw配置则包含指南的完整结构，包括标题、章节和部分信息，每个部分也包含多个子部分，子部分中包含了各种写作相关的主题、内容、示例、技巧等。

提供机构：

pszemraj

原始信息汇总

数据集概述

基本信息

语言: 英语
许可证: MIT
来源: TuringsSolutions/NYTWritingStyleGuide

数据集配置

默认配置 (`default`)

特征:
- title: 字符串
- chapter: 64位整数
- sections: 列表
  - number: 64位整数
  - section: 64位整数
  - subsections: 列表
    - advice: 字符串
    - approach: 字符串
    - benefit: 字符串
    - best_practice: 字符串
    - best_practices: 字符串
    - caution: 字符串
    - content: 字符串
    - editing: 字符串
    - example: 字符串
    - exercise: 字符串
    - guideline: 字符串
    - guidelines: 字符串
    - insight: 字符串
    - methodology: 字符串
    - note: 字符串
    - number: 64位整数
    - perspective: 字符串
    - practice: 字符串
    - principle: 字符串
    - procedure: 字符串
    - process: 字符串
    - rhetoric: 字符串
    - rule_of_thumb: 字符串
    - strategy: 字符串
    - style: 字符串
    - styleguide: 字符串
    - subsection: 64位整数
    - technique: 字符串
    - tip: 字符串
    - tips: 字符串
    - topic: 字符串
分割:
- train:
  - num_bytes: 33332
  - num_examples: 21
下载大小: 62604
数据集大小: 33332

原始配置 (`raw`)

特征:
- guide: 结构体
  - title: 字符串
  - chapters: 列表
    - chapter: 64位整数
    - title: 字符串
    - sections: 列表
      - number: 64位整数
      - subsections: 列表
        
        number: 64位整数
        
        topic: 字符串
        
        content: 字符串
        
        subsection: 64位整数
        
        example: 字符串
        
        tip: 字符串
        
        note: 字符串
        
        exercise: 字符串
        
        practice: 字符串
        
        technique: 字符串
        
        strategy: 字符串
        
        best_practice: 字符串
        
        approach: 字符串
        
        rule_of_thumb: 字符串
        
        guideline: 字符串
        
        process: 字符串
        
        advice: 字符串
        
        guidelines: 字符串
        
        style: 字符串
        
        tips: 字符串
        
        caution: 字符串
        
        benefit: 字符串
        
        insight: 字符串
        
        best_practices: 字符串
        
        principle: 字符串
        
        methodology: 字符串
        
        procedure: 字符串
        
        styleguide: 字符串
        
        editing: 字符串
        
        perspective: 字符串
        
        rhetoric: 字符串
      - section: 64位整数
分割:
- train:
  - num_bytes: 33377
  - num_examples: 1
下载大小: 65012
数据集大小: 33377

数据文件配置

默认配置 (default):
- train: data/train-*
原始配置 (raw):
- train: raw/train-*

搜集汇总

数据集介绍

背景与挑战

背景概述

该数据集是纽约时报写作风格指南的解析版本，包含21条结构化数据，涵盖写作技巧、伦理标准等内容，适用于文本分析和自然语言处理任务。

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集