five

wayne0019/autotrain-data-lwf-summarization

收藏
Hugging Face2023-07-26 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/wayne0019/autotrain-data-lwf-summarization
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集由AutoTrain自动处理,用于项目lwf-summarization。数据集的语言为中文,任务类别为摘要生成。数据集包含对话文本及其对应的摘要,字段包括feat_id、target和text。数据集被划分为训练集和验证集,分别包含655和164个样本。

该数据集由AutoTrain自动处理,用于项目lwf-summarization。数据集的语言为中文,任务类别为摘要生成。数据集包含对话文本及其对应的摘要,字段包括feat_id、target和text。数据集被划分为训练集和验证集,分别包含655和164个样本。
提供机构:
wayne0019
原始信息汇总

AutoTrain Dataset for project: lwf-summarization

Dataset Description

  • Languages: The dataset is in Chinese (BCP-47 code: zh).
  • Task Categories: The dataset is designed for summarization tasks.

Dataset Structure

Data Instances

  • Sample Instance: json { "feat_id": "13716782", "target": "The scariest place for Jessica was the Capuchin Catacombs in Palermo.", "text": "Kelly: Oh! Oh! Can I pick the first question? Jessica: Sure. Go for it! Kelly: Whats the scariest place youve been to! Jessica: Ill start: Palermo in Italy. Mickey: And whats so scary about that? Did you break your nail? :P Jessica: Shut it, Mickey! No, there are the Capuchin Catacombs with 8000 corpses! Kelly: Ewwww! Corpses? Rly? Jessica: Yeah! And you can look at them like museum exhibits. I think theyre divided somehow, but have no clue how! Ollie: Thats so cool! Do you get to see the bones or are they covered up? Jessica: Well, partly. Most of them were exhibited in their clothes. Basically only skulls and hands. Mickey: Im writing this one down! Thats so precious! Ollie: Me too!" }

Dataset Fields

  • Fields:
    • feat_id: String identifier for the feature.
    • target: String containing the summary or target information.
    • text: String containing the original text.

Dataset Splits

  • Splits:
    • Train: 655 samples.
    • Validation: 164 samples.
搜集汇总
数据集介绍
main_image_url
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作