five

Event-Dataset: Temporal information retrieval and text classification dataset

收藏
IEEE2020-11-06 更新2026-04-17 收录
下载链接:
https://ieee-dataport.org/documents/event-dataset-temporal-information-retrieval-and-text-classification-dataset
下载链接
链接失效反馈
官方服务:
资源简介:
Recently, Temporal Information Retrieval (TIR) has grabbed the major attention of the information retrieval community. TIR exploits the temporal dynamics in the information retrieval process and harnesses both textual relevance and temporal relevance to fulfill the temporal information requirements of a user Ur Rehman Khan etal., 2018. The focus time of document is an important temporal aspect which is defined as the time to which the content of the document refers Jatowt etal., 2015; Jatowt etal., 2013; Morbidoni etal., 2018, Khan etal., 2018. To the best of our knowledge, there does not exist any standard benchmark data set (publicly available) that holds the potential to comprehensively evaluate the performance of focus time assessment strategies. Considering these aspects, we have produced the Event-dataset, which is comprised of 35 queries and set of news articles for each query. Each query in the dataset represents a popular event. To annotate these articles into relevant and non-relevant, we have employed a user-study based evaluation method wherein a group of postgraduate students manually annotate the articles into the aforementioned categories. We believe that the generation of such dataset can provide an opportunity for the information retrieval researchers to use it as a benchmark to evaluate focus time assessment methods specifically and information retrieval methods generically.
提供机构:
Islam, Muhammad Arshad; Khan, Shafiq Ur Rehman
创建时间:
2020-11-06
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作