five

Empowering a search engine indexes via enhancing the semantic level of the keywords

收藏
DataCite Commons2022-06-24 更新2024-07-29 收录
下载链接:
https://figshare.com/articles/dataset/Empowering_a_search_engine_indexes_via_enhancing_the_semantic_level_of_the_keywords/20151302/1
下载链接
链接失效反馈
官方服务:
资源简介:
This set contains 84,398 sentences annotated. Regarding Named Entities, the following categories were adopted: Place (PLC), Organization (ORG), Person (PER) and Chemistry (CHE). The file is in Extensible Markup Language (XML) format and divided by sentences. We also provide the file used in Apache Solr. <br> We provide four files: 1-AnnotatedOriginalDataset: The file with the original dataset, downloaded from Kaggle and modeled in .xml format 2-AnnotatedOriginalDataset_Solr: The file with the original dataset, downloaded from Kaggle and modeled to Apache Solr format 3-AnnotatedUpdatedDataset: The file with the updated dataset from the original dataset. We added 2,433 entities and modeled them in .xml format 4-AnnotatedUpdatedDataset_Solr: The file with the updated dataset from the original dataset. Added 2,433 entities and modeled to Apache Solr format Link do arquivo original, antes das modificações: https://www.kaggle.com/datasets/fernandojvdasilva/dbpedia-with-entity-relations-in-portuguese
提供机构:
figshare
创建时间:
2022-06-24
二维码
社区交流群
二维码
科研交流群
商业服务