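Monitoring and Early-Warning Data on Illegal Medical Aesthetics Live-Stream Sales on Meituan (医疗美容类美团直播带货违法监测预警数据)

Name: Monitoring and Early-Warning Data on Illegal Medical Aesthetics Live-Stream Sales on Meituan (医疗美容类美团直播带货违法监测预警数据)
Creator: Zhuji Municipal Market Supervision Administration (诸暨市市场监督管理局), 浙江富润数链科技有限公司
Published: 2024-07-27 15:42:04
License: no description provided

Zhejiang Data Intellectual Property Registration Platform (浙江省数据知识产权登记平台), updated 2024-07-27, indexed 2024-07-28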

Download link:

https://www.zjip.org.cn/home/announce/trends/44215

Resource description:

This dataset is built from live-stream videos collected from the Meituan platform in which hosts promote goods in the medical aesthetics category. The videos are transcribed, and the hosts' spoken content is processed and analyzed. Based on how many times and how often a host uses pre-set prohibited sensitive terms during a stream, warnings or enforcement actions are proposed according to trigger-condition rules. Examples of such terms include "certified physician" (认证医师), "anti-wrinkle injection" (注射除皱), "minimally invasive anti-aging injection" (微创注射逆龄), "mesotherapy" (中胚层美塑治疗), "Botox" (保妥适), "repairs and regenerates cells" (修复细胞再生), "dermis layer" (真皮层), "subcutaneous fat layer" (皮下脂肪层), "imported 100U" (进口100u), and "guaranteed to work" (一定有效果). The data supports the Zhuji Municipal Market Supervision Administration in regulating enterprises' live-streaming conduct on Meituan within its jurisdiction.

The collected live-stream videos are preprocessed in five steps:
Step 1: Slice each original video file into segments of at most 10 minutes.
Step 2: Extract the audio from each sliced segment.
Step 3: Convert the extracted audio of each segment to text.
Step 4: Use OCR to extract text from images captured from the original video.
Step 5: Match the resulting text against the violation-warning keyword library.

Finally, a multi-criteria decision analysis (MCDA) model is applied to the violating statements made by the host during the stream, yielding a violation warning value and a decision on whether to issue a warning.
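Steps 1 and 5 can be illustrated with a minimal Python sketch. It is not the registrants' implementation: the function names are hypothetical, and the source does not define how a "combined keyword" is matched, so the sketch assumes a combination counts as one hit when all of its terms appear in the same text segment. Audio extraction, speech-to-text, and OCR (Steps 2 to 4) are left out because the source does not name the tools used.

```python
import math

MAX_SLICE_SECONDS = 600  # "at most 10 minutes" per Step 1


def slice_bounds(duration_s, max_len_s=MAX_SLICE_SECONDS):
    """Return (start, end) second offsets that cover the whole video."""
    if duration_s <= 0:
        return []
    n = math.ceil(duration_s / max_len_s)
    return [(i * max_len_s, min((i + 1) * max_len_s, duration_s)) for i in range(n)]


def count_single_hits(text, keywords):
    """Count non-overlapping occurrences of every single keyword in the text."""
    return sum(text.count(kw) for kw in keywords)


def count_combo_hits(segments, combos):
    """Count (segment, combo) pairs in which every term of the combo appears.

    Assumption: a combined keyword is a tuple of terms that must all occur
    in the same segment; the source does not specify the matching rule.
    """
    return sum(
        1
        for seg in segments
        for combo in combos
        if all(term in seg for term in combo)
    )
```

For a 25-minute video (1500 seconds), `slice_bounds(1500)` yields three segments: two of 10 minutes and one of 5 minutes.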
The violation warning value is calculated as follows:

Violation warning value = (number of single-keyword hits × 0.25) + (number of combined-keyword hits × 0.3) + (number of combined warning keywords matched by image recognition × 0.35) + (number of historical violation records of the live-stream room in the past month × 0.1)

No warning is triggered when the violation warning value is ≤ 1; a violation warning is triggered when the value is > 1.
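The weighted sum and threshold can be written as a short Python sketch. The weights and the threshold of 1 come from the formula above; the function and parameter names are hypothetical. As a design choice of this sketch (not something the source specifies), the weights are scaled by 100 so the threshold comparison is done in integers and a score of exactly 1 is never misclassified by floating-point rounding.

```python
# Weights from the published formula, scaled by 100 to keep the arithmetic in integers.
WEIGHT_SINGLE_X100 = 25       # single-keyword hits
WEIGHT_COMBO_X100 = 30        # combined-keyword hits
WEIGHT_IMAGE_COMBO_X100 = 35  # combined keywords matched by image recognition
WEIGHT_HISTORY_X100 = 10      # violation records in the past month
THRESHOLD_X100 = 100          # warning triggers only when the value is > 1


def warning_score_x100(single_hits, combo_hits, image_combo_hits, history_count):
    """Return the warning value multiplied by 100, as an exact integer."""
    return (
        WEIGHT_SINGLE_X100 * single_hits
        + WEIGHT_COMBO_X100 * combo_hits
        + WEIGHT_IMAGE_COMBO_X100 * image_combo_hits
        + WEIGHT_HISTORY_X100 * history_count
    )


def warning_value(single_hits, combo_hits, image_combo_hits, history_count):
    """Return the warning value on the scale used in the description."""
    return warning_score_x100(single_hits, combo_hits, image_combo_hits, history_count) / 100


def should_warn(single_hits, combo_hits, image_combo_hits, history_count):
    """Return True when the warning value is strictly greater than 1."""
    return (
        warning_score_x100(single_hits, combo_hits, image_combo_hits, history_count)
        > THRESHOLD_X100
    )
```

For example, 2 single-keyword hits, 1 combined-keyword hit, 1 image-recognition hit, and 1 prior record give 0.5 + 0.3 + 0.35 + 0.1 = 1.25, which exceeds 1 and triggers a warning. Four single-keyword hits alone give exactly 1, which does not.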

Provider:

Zhuji Municipal Market Supervision Administration (诸暨市市场监督管理局), 浙江富润数链科技有限公司

Created:

2024-06-27

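Collected summary

Dataset introduction

Features

This dataset is mainly used to monitor illegal conduct in medical aesthetics live-stream sales on the Meituan platform. It contains 1,880 records and is updated quarterly. By analyzing prohibited sensitive terms in the live-stream content and computing a violation warning value with rule-based algorithms, it provides data support for the market supervision administration.

The above content was collected and summarized by 遇见数据集.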