Embeddings and topic vectors for MOOC lectures dataset
Source: DataCite Commons. Updated: 2025-05-01. Indexed: 2025-04-16.
Download link:
https://data.mendeley.com/datasets/xknjp8pxbj
Description:
This dataset comprises word embeddings and document topic distribution vectors generated from the transcripts of 12,032 video lectures from 200 courses collected from the Coursera learning platform. Two well-known natural language processing techniques, Word2Vec and Latent Dirichlet Allocation (LDA), both as implemented in the Gensim package for Python, were used to generate the word embeddings and the topic vectors, respectively.
Provider:
Mendeley
Created:
2019-12-06



