Instruction-Based Social Media Caption Evaluation and Enhancement Dataset

Name: Instruction-Based Social Media Caption Evaluation and Enhancement Dataset
Creator: Mendeley Data
Published: 2026-04-27 09:02:54
License: 暂无描述

DataCite Commons2026-04-27 更新2026-05-04 收录

下载链接：

https://data.mendeley.com/datasets/fz28c2ghv4/1

下载链接

链接失效反馈

官方服务：

资源简介：

This dataset contains 1,698 records, consisting of social media advertising captions in Arabic and descriptions of products sold online, all of which is evaluated in detail by experts. It is specifically designed to help in NLP (natural language processing) research, especially in Arabic text generation, copywriting evaluation (content writing), and sentiment analysis in marketing. We collected 1,698 captures of real and active ads on Facebook (collected manually by copying and pasting). Its goal is to evaluate the quality of written advertisements, and provide improved versions of them that will attract the customer and generate serious interaction. How was the evaluation done? (Our standards) Each caption was comprehensively analyzed, and the evaluation was divided into a point system (out of 100) distributed as follows: - Planning (P): 15 points - Interaction (E): 20 points - Quality (Q): 20 points - Reach and CTA (R): 20 points - Influence (I): 25 points Output Structure: Each row in the data will show you the following: - Score: X/100 (based on the distribution above). - Why?: Two sentences explaining the reason for the evaluation and the problems in the original caption. - Two quick edits: practical tips to get the ad right. - Improved version: After the caption has been refreshed and is ready, it can be downloaded and sold. How did we work? (Methodology & Tools): In order to get the work done with this accuracy, we relied on more than one evaluation tool, and our maestro was Generative AI Models. But because the AI sometimes hallucinates, all evaluations and improved versions were done under complete human supervision and careful review by us, in order to ensure that the words are 100% logical and that there are no “hallucinations” appearing in the results. Potential Use Cases: - Fine-tuning Large Language Models (LLMs) for Arabic marketing text generation. - Training models for automated copywriting assessment and scoring.

提供机构：

Mendeley Data

创建时间：

2026-04-27

5,000+

优质数据集

54 个

任务类型

进入经典数据集