recapper/Course_summaries_dataset
收藏Hugging Face2022-10-25 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/recapper/Course_summaries_dataset
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- en
license: apache-2.0
size_categories:
- 1M<n<10M
task_categories:
- summarization
- text2text-generation
task_ids: []
tags:
- conditional-text-generation
---
# About Dataset
The dataset consists of data from a bunch of youtube videos ranging from videos from fastai lessons, FSDL lesson to random videos teaching something.
In total this dataset contains 600 chapter markers in youtube and contains 25, 000 lesson transcript.
This dataset can be used for NLP tasks like summarization, topic segmentation etc. You can refer to some of the models we have trained with this dataset
in [github repo link](https://github.com/ohmeow/fsdl_2022_course_project) for Full stack deep learning 2022 projects.
提供机构:
recapper
原始信息汇总
数据集概述
基本信息
- 语言: 英语
- 许可证: Apache 2.0
- 数据规模: 1M<n<10M
- 任务类别:
- 摘要生成
- 文本到文本生成
- 标签: 条件文本生成
数据内容
- 数据集包含来自多个YouTube视频的章节标记和课程转录,涵盖fastai课程、FSDL课程以及其他教学视频。
- 总计包含600个章节标记和25,000条课程转录。
应用场景
- 适用于自然语言处理任务,如摘要生成、主题分割等。



