Embeddings of Wikipedia entities
Source: DataCite Commons. Updated 2023-02-08. Indexed 2024-08-18.
Download link:
https://figshare.com/articles/dataset/Embeddings_of_Wikipedia_entities/21666881/2
Description:
Embeddings for 2.8 million entities in YAGO3, a large knowledge base derived from Wikipedia and other sources.
These 200-dimensional embeddings can be used to enrich data analyses on common entities (cities, people, companies...). See: https://github.com/soda-inria/ken_embeddings
The data contains the following files:
- emb_all.parquet: embeddings for all entities of YAGO3 (2.8M)
- emb_location.parquet: embeddings of geographical entities
- emb_person.parquet: embeddings of people
- emb_company.parquet: embeddings of companies
- emb_movies.parquet: embeddings of movies
- emb_album.parquet: embeddings of albums
- emb_school.parquet: embeddings of schools and universities
- entity_types.parquet: simple type for each entity in YAGO3
- entity_detailed_types.parquet: detailed types for each entity in YAGO3
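As a rough illustration of what "enriching a data analysis" could look like, the sketch below left-joins embedding columns onto a small table with pandas. The listing does not document the schema of the parquet files, so the column names used here (`Entity`, `X0`, `X1`, ...), the entity names, and the helper `enrich_with_embeddings` are all assumptions made for the example; inspect the real files before relying on them. Reading parquet with pandas also requires a parquet engine such as pyarrow.

```python
import pandas as pd


def enrich_with_embeddings(table, embeddings, key, entity_col="Entity"):
    """Left-join embedding columns onto `table`, matching `key` to `entity_col`.

    Hypothetical helper: `entity_col` is an assumed column name, not a
    documented part of the dataset.
    """
    merged = table.merge(embeddings, how="left", left_on=key, right_on=entity_col)
    # Drop the duplicated join column that came from the embeddings table.
    if entity_col != key:
        merged = merged.drop(columns=[entity_col])
    return merged


# With the downloaded data one would load a file first, for example:
#   embeddings = pd.read_parquet("emb_company.parquet")
# Here a tiny synthetic table with 3 dimensions stands in for it
# (entity names, column names, and values are made up).
embeddings = pd.DataFrame(
    {
        "Entity": ["Acme_Corp", "Globex"],
        "X0": [0.1, 0.4],
        "X1": [0.2, 0.5],
        "X2": [0.3, 0.6],
    }
)

companies = pd.DataFrame(
    {"company": ["Globex", "Acme_Corp", "Initech"], "revenue": [10, 20, 30]}
)

enriched = enrich_with_embeddings(companies, embeddings, key="company")
print(enriched)
```

A left join keeps every row of the original table; rows whose key has no matching entity (here "Initech") end up with missing values in the embedding columns, which makes unmatched names easy to spot.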
Provider:
figshare
Created:
2023-01-18



