
Empowering a search engine indexes via enhancing the semantic level of the keywords

Source: Figshare. Updated: 2022-06-24. Indexed: 2026-04-28.
Download link:
https://figshare.com/articles/dataset/Empowering_a_search_engine_indexes_via_enhancing_the_semantic_level_of_the_keywords/20151302
Description:
This set contains 84,398 annotated sentences. For named entities, the following categories were adopted: Place (PLC), Organization (ORG), Person (PER) and Chemistry (CHE). The file is in Extensible Markup Language (XML) format and is divided by sentences. We also provide the file used in Apache Solr.

We provide four files:
1-AnnotatedOriginalDataset: The original dataset, downloaded from Kaggle and modeled in .xml format.
2-AnnotatedOriginalDataset_Solr: The original dataset, downloaded from Kaggle and modeled in Apache Solr format.
3-AnnotatedUpdatedDataset: The updated dataset, derived from the original dataset. We added 2,433 entities and modeled them in .xml format.
4-AnnotatedUpdatedDataset_Solr: The updated dataset, derived from the original dataset, with 2,433 entities added, modeled in Apache Solr format.

Link to the original file, before the modifications: https://www.kaggle.com/datasets/fernandojvdasilva/dbpedia-with-entity-relations-in-portuguese
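As a rough illustration of how the sentence-divided XML files might be inspected, the sketch below counts occurrences of the four entity categories (PLC, ORG, PER, CHE). The description does not specify the XML schema, so everything about the markup here is an assumption: the element names `corpus`, `sentence` and `entity`, the `type` attribute, and the inline sample are hypothetical. To stay independent of the real layout, the function only assumes that a category code appears either as an element's tag name or as one of its attribute values.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Category codes taken from the dataset description.
CATEGORIES = {"PLC", "ORG", "PER", "CHE"}


def count_entity_labels(xml_text: str) -> Counter:
    """Count elements whose tag or any attribute value is a category code.

    Assumption: the real files mark entities in one of these two ways.
    Adjust the matching logic once the actual schema is known.
    """
    counts = Counter()
    root = ET.fromstring(xml_text)
    for elem in root.iter():
        labels = {elem.tag.upper()} | {v.upper() for v in elem.attrib.values()}
        for label in labels & CATEGORIES:
            counts[label] += 1
    return counts


# Hypothetical sample; NOT the actual format of the published files.
SAMPLE = """<corpus>
  <sentence id="1">
    <entity type="PER">Ada</entity> visited <entity type="PLC">Lisboa</entity>.
  </sentence>
  <sentence id="2">
    <entity type="ORG">Figshare</entity> hosts the files.
  </sentence>
</corpus>"""

if __name__ == "__main__":
    print(count_entity_labels(SAMPLE))
```

For the full files, `ET.iterparse` may be a better fit than `ET.fromstring`, since it avoids loading the entire document into memory at once.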
Created:
2022-06-24