five

Weakly supervised visual-auditory fixation prediction with multigranularity perception

收藏
中国科学数据2026-04-20 更新2026-04-25 收录
下载链接:
https://www.sciengine.com/AA/doi/10.1007/s11432-024-4744-5
下载链接
链接失效反馈
官方服务:
资源简介:
Video saliency detection models have been achieving steady, significant improvements thanks to rapid advances indeep learning and the wide availability of large-scale training sets. However, deep learning-based visual-audio fixationprediction is still in its infancy. At present, only a few visual-audio sequences have been furnished, with real fixationsbeing recorded in real visual-audio environments. Hence, it would neither be efficient nor necessary to recollect realfixations under the same visual-audio circumstances. To address this problem, this paper promotes a novel weaklysupervised approach that alleviates the demand for large-scale training sets for visual-audio model training. By usingonly the video category tags, we propose the selectivec
创建时间:
2026-01-05
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作