
TamperedNews & News400 (IJMIR'21 Update)

DataCite Commons · Updated 2022-05-17 · Indexed 2025-04-15
Download link:
https://data.uni-hannover.de/dataset/256653e6-1392-4e77-ab16-b7ea71ff05b0
Description:

# Multimodal Analytics for Real-world News using Measures of Cross-modal Entity Consistency

This repository contains the *TamperedNews* and *News400* datasets introduced in the paper:

> Eric Müller-Budack, Jonas Theiner, Sebastian Diering, Maximilian Idahl, Sherzod Hakimov, and Ralph Ewerth. "Multimodal news analytics using measures of cross-modal entity and context consistency". In: _International Journal of Multimedia Information Retrieval_ 10.2 (2021), Springer, pp. 111–125. DOI: https://doi.org/10.1007/s13735-021-00207-4

## Content

For both datasets, *TamperedNews* and *News400*, we provide:

- `*dataset*.tar.gz` containing the `*dataset*.jsonl` with
  - Web links to the news texts
  - Web links to the news images
  - Outputs of the named entity recognition and disambiguation (NERD) approach
  - Untampered and tampered entities
- `*dataset*_features.tar.gz` with visual features for events, locations, and persons
- `news400_wordembeddings.tar.gz`: Word embeddings of all nouns in the news texts of the News400 dataset

Please note that the word embeddings of the *TamperedNews* dataset (`tamperednews_wordembeddings.tar.gz`) were already provided in the first version ([Link](https://data.uni-hannover.de/dataset/tamperednews)).

For all entities detected in both datasets, we provide:

- `entities.tar.gz` containing an `*entity_type*.jsonl` for all entity types (events, locations, and persons) with:
  - Wikidata ID
  - Wikidata label
  - Meta information used for tampering
  - Web links to all reference images crawled from Google, Bing, and Wikidata
- `entities_features.tar.gz` containing the visual features of the reference images for all entities

## Source Code

The source code to reproduce our results, as well as download scripts to crawl news texts and images, can be found on our GitHub page: https://github.com/TIBHannover/cross-modal_entity_consistency
Provider:
LUIS
Created:
2022-05-17