OW-JRD (Object-wise Just Recognizable Distortion) dataset
收藏数据集概述
基本信息
- 名称: Object-Wise Just Recognizable Distortion (OW-JRD) Dataset
- 用途: 用于图像和视频压缩中的对象级可识别失真预测
- 发布年份: 2023
- 发布机构: IEEE Transactions on Multimedia (TMM)
- 许可证: 未明确说明(但提及了License标识)
- 相关论文: Learning to Predict Object-Wise Just Recognizable Distortion for Image and Video Compression
数据集内容
- 数据量: 29,218张原始图像
- 对象类别: 80类
- 数据格式:
- 原始图像
- 使用VVC(Versatile Video Coding)压缩的64种失真版本
- 标注信息:
objects_infos.jsoncoco80_indices.jsonJRD_info.jsontrain.jsonval.jsontest.json
数据集结构
Project ├── jsonfiles/ │ ├── objects_infos.json │ ├── coco80_indices.json │ ├── JRD_info.json │ ├── train.json │ ├── val.json │ └── test.json ├── data/ │ ├── original/ │ └── distorted/
下载链接
技术指标
- 预测性能:
- Mean Absolute Errors (MAEs) 为4.90和5.92(不同类别数量下)
- 基准对比: 显著优于现有最先进的JRD预测模型
相关资源
- 预训练模型:
./pre_weights/pre_efficientnetv2-s.pth./pre_weights/Eff/Eff.pth
- 代码库: GitHub代码库
引用格式
bibtex @ARTICLE{zhang2023learning, author={Zhang, Yun and Lin, Haoqin and Sun, Jing and Zhu, Linwei and Kwong, Sam}, journal={IEEE Transactions on Multimedia}, title={Learning to Predict Object-Wise Just Recognizable Distortion for Image and Video Compression}, year={2024}, volume={26}, number={}, pages={5925-5938}, keywords={Image coding;Machine vision;Distortion;Visualization;Predictive models;Image recognition;Task analysis;Deep learning;just recognizable distortion;object detection;video coding for machine}, doi={10.1109/TMM.2023.3340882}}




