chriswolfram/embeddings
收藏Hugging Face2025-04-04 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/chriswolfram/embeddings
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个不同配置的子数据集,每个子数据集都包含文本数据及其对应的嵌入表示。文本数据包括书籍翻译、常见单词、IMDb电影评论等。嵌入表示包括最后一层嵌入和平均嵌入。部分数据集还包含测试集,用于评估模型性能。
The dataset consists of multiple sub-datasets with different configuration names, each containing text data and their corresponding embedded representations. The text data includes book translations, common words, IMDb movie reviews, etc. The embedded representations include last layer embeddings and mean embeddings. Some datasets also contain a test set for evaluating model performance.
提供机构:
chriswolfram



