An innovative approach to scalable semantic embedding
收藏IFLA Repository2026-03-24 更新2026-05-16 收录
下载链接:
https://repository.ifla.org/items/d4dfb48d-ecc3-44b4-8b28-06e4f8108683
下载链接
链接失效反馈官方服务:
资源简介:
Embedding words, entities and documents in compact, semantically meaningful vector spaces allows for computable semantic similarity/relatedness which could make search more intelligent and benefit other tasks conducted in libraries, such as entity disambiguation, de-duplication, clustering, recommendation, subject prediction, etc. Deep learning models are powerful but require high computing power and careful tuning hyperparameters for optimal performance. In our quest for practical solutions to support libraries in this field, we revisit the global co-occurrence based embedding methods and propose a conceptually simple and computationally lightweight approach. Our experiments show highly competitive results with a few state-of-the-art embedding methods on different tasks, including the standard STS benchmark and a subject prediction task, at a fraction of the computational cost. We will show the potentials of this scalable semantic embedding method for other applications such as entity disambiguation, citation recommendation, clustering and collection exploration.
提供机构:
International Federation of Library Associations and Institutions
创建时间:
2025-09-24



