药食同源类快手直播带货违法监测预警数据
收藏浙江省数据知识产权登记平台2024-11-01 更新2024-11-02 收录
下载链接:
https://www.zjip.org.cn/home/announce/trends/79668
下载链接
链接失效反馈官方服务:
资源简介:
对采集的快手平台带货品类为药食同源类的达人直播视频内容进行转译分析,对达人口播语言内容进行处理、分析,根据达人在直播过程中对预先设置的违规敏感词(比如:彻底根治、绝对安全、无任何副作用、快速见效、立竿见影、包治百病、100%纯天然、预防疾病、抗癌、抗衰老、政府推荐等)违反的次数和频率,依据触发条件规则提出警告或处理。为服务辖区市场监督局管理区域内规范企业快手直播行为,提供数据支持。将采集完成的直播视频进行进行预处理,第一步:基于原始视频文件,以最大10分钟单位对原始视频进行切片。第二步:对于已完成的切片视频,进行视频内容转语音操作。第三步:对于已完成视频转语音操作的切片,进行语音转文本操作。第四步:使用OCR技术对原始视频中抓取的图片进行文字提取操作。第五步:将所得到的文字内容与违法预警关键词库进行匹配。最终运用多标准决策分析模型,对主播在直播过程中出现的违规语句进行分析计算,得出违法预警值和是否预警判断。违法预警值 ≤1 时,不触发预警提示,违法预警值 >1 时触发违法预警提示。
This dataset conducts translation and analysis on live streaming video content of influencers selling medicinal and edible homologous products on the Kuaishou platform, and processes and analyzes the verbal content uttered by the influencers. Based on the frequency and number of violations of pre-set sensitive prohibited keywords (such as completely cure, absolutely safe, no side effects whatsoever, quick effect, immediate effect, cure all diseases, 100% pure natural, prevent diseases, anti-cancer, anti-aging, government recommendation, etc.) during the live broadcast, warnings or penalties will be issued in accordance with the trigger condition rules. This dataset provides data support for market supervision bureaus in their jurisdictions to standardize the live streaming behaviors of enterprises on Kuaishou.
First, preprocess the collected live streaming videos:
Step 1: Slice the original video files into clips with a maximum duration of 10 minutes each.
Step 2: Perform video-to-speech conversion on the completed sliced videos.
Step 3: Perform speech-to-text conversion on the sliced videos that have undergone video-to-speech conversion.
Step 4: Extract text from images captured from the original videos using OCR technology.
Step 5: Match the obtained text content with the prohibited keyword database for violation warnings.
Finally, a multi-criteria decision analysis (MCDA) model is used to analyze and calculate the prohibited utterances made by the streamer during the live broadcast, to derive the violation warning score and the warning judgment. When the violation warning score is ≤1, no warning prompt will be triggered; when the violation warning score is >1, a violation warning prompt will be triggered.
提供机构:
浙江富润数链科技有限公司
创建时间:
2024-10-11
搜集汇总
数据集介绍

特点
该数据集名为'药食同源类快手直播带货违法监测预警数据',由浙江富润数链科技有限公司自行产生,包含1380条数据,每季度更新一次。数据集通过分析快手直播中的违规敏感词,为市场监督局提供违法预警支持,应用场景为规范企业快手直播行为。
以上内容由遇见数据集搜集并总结生成



