Dataset Russian Booktube collected Aug-Oct. 2021
Mendeley Data; updated 2024-03-27; indexed 2024-06-28
Download link:
https://jlupub.ub.uni-giessen.de//handle/jlupub/2203
Description:
The data set was created in August-October 2021 as part of a research project on Russian Booktube. Research results will be published in Russian in the 14th issue of the peer-reviewed Apparatus Journal (2022:14). Data on videos were collected with youtube-dl (development status as of August 2021). Only official, publicly accessible data were collected. Before collection, video channels were selected through field observation and through qualitative and quantitative evaluation of relevant tags. The data for each video from these channels were downloaded in three phases: on 23 August, 14 September, and 7 October 2021. The files EHAMIDY_RUSBOOKTUBE_WITH CLASSIFICATION.xlsx (31.5 MB) and EHAMIDY_RUSBOOKTUBE_WITH CLASSIFICATION.csv (93.1 MB) contain basic video information downloaded on 23 August 2021, and view counts downloaded on 14 September and 7 October 2021. In addition, the files contain a classification of the videos by title and tags. Two visualizations are published with the dataset: an interactive table of the dataset (EHAMIDY_Booktube_dataset_interactive_table.html) and an interactive graph (EHAMIDY_Booktube_genres.html) showing the number of videos by date and category. Because both the Excel and the CSV file are large and contain Cyrillic letters, they may cause errors when opened in Excel. In Python, for example in a Jupyter Notebook, they load without problems.
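Since the description recommends Python over Excel for these files, the following is a minimal sketch of how the CSV might be inspected without loading all 93.1 MB at once. It uses only the standard library. The encoding (`utf-8-sig`) and the comma delimiter are assumptions, as the description does not state them, and the helper names `preview_csv` and `count_rows` are hypothetical.

```python
import csv
from itertools import islice

# File name as given in the dataset description; adjust the path to your local copy.
CSV_PATH = "EHAMIDY_RUSBOOKTUBE_WITH CLASSIFICATION.csv"


def preview_csv(path, n_rows=5, encoding="utf-8-sig", delimiter=","):
    """Return (header, first n_rows data rows) without reading the whole file.

    The encoding and delimiter defaults are assumptions, not documented
    facts about the dataset; change them if the Cyrillic text looks garbled.
    """
    # newline="" is the setting the csv module recommends for input files.
    with open(path, encoding=encoding, newline="") as handle:
        reader = csv.reader(handle, delimiter=delimiter)
        header = next(reader, [])
        rows = list(islice(reader, n_rows))
    return header, rows


def count_rows(path, encoding="utf-8-sig", delimiter=","):
    """Count data rows (header excluded) by streaming through the file."""
    with open(path, encoding=encoding, newline="") as handle:
        reader = csv.reader(handle, delimiter=delimiter)
        next(reader, None)  # skip the header line
        return sum(1 for _ in reader)
```

Calling `preview_csv(CSV_PATH)` on a local copy should return the column names and the first five rows, which is usually enough to check that the Cyrillic text decodes correctly before doing any heavier analysis.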
Created:
2023-06-28



