Content Safety API

RapidAPI | Updated 2026-03-25 | Indexed 2026-03-26

Link:

https://rapidapi.com/fzaffarana/api/content-safety-api1

Description:

AI-powered content moderation. Classify text for toxicity across 10 safety categories in multiple languages. Detect violence, hate speech, profanity, harassment, and more with confidence scores.

Created:

2026-03-25

Summary of original information

Content Safety API overview

Basic information

API name: Content Safety API
Provider: Fabrizio Zaffarana
Category: Text Analysis
Popularity: 9
Service level: 100%
Latency: 430 ms
Test status: 100%

Pricing plans

BASIC: $0.00 / month
PRO: $9.00 / month
ULTRA: $25.00 / month
MEGA: $49.00 / month

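Feature overview

AI-powered text moderation that classifies text in multiple languages for safety violations across 10 safety categories. Detection relies on contextual understanding rather than simple keyword matching.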

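Core features

10 safety categories

Violence: threats, incitement, graphic descriptions
Hate Speech: slurs, discrimination against protected groups
Profanity: swearing, vulgar language, crude expressions
Harassment: targeted insults, bullying, personal attacks
Sexual Content: explicit sexual content
Self-Harm: promotion of self-harm or suicide
Criminal Activity: fraud, theft, hacking
Child Exploitation: CSAM detection
Weapons: instructions for weapons of mass destruction
Privacy: exposure of personal private information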

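Context-aware detection

The API distinguishes between contexts, for example:

“This is fucking amazing” → flags profanity only (not harassment)
“You are a fucking idiot” → flags profanity + harassment (targeted insult)
“Te voy a matar” → flags violence (Spanish-language threat detection)
“The function returns null” → marked safe (technical content)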

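Multilingual support

Analyzes text in English, Spanish, French, German, Portuguese, Italian, Hindi, Thai, and other languages. The language of the input text is detected automatically.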

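Structured output

Each response contains:

Per-category detection result and confidence score (0-1)
Overall risk score (0-100) and risk level
List of flagged categories
Human-readable summary of the detection results
Detected language code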
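
The response fields listed above could be modeled in client code roughly as follows. This is a minimal sketch, not part of the API itself: the class and function names (`CategoryResult`, `ModerationResult`, `parse_moderation`) are hypothetical, and the JSON key names are taken from the example response later in this listing.

```python
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class CategoryResult:
    detected: bool
    confidence: float  # documented range: 0-1


@dataclass(frozen=True)
class ModerationResult:
    safe: bool
    risk_score: int  # documented range: 0-100
    risk_level: str  # none, low, medium, high, or critical
    categories: dict[str, CategoryResult]
    flagged_categories: list[str]
    summary: str
    language: str


def parse_moderation(payload: dict[str, Any]) -> ModerationResult:
    """Convert a decoded JSON response into typed objects (hypothetical helper)."""
    categories = {
        name: CategoryResult(bool(item["detected"]), float(item["confidence"]))
        for name, item in payload["categories"].items()
    }
    return ModerationResult(
        safe=bool(payload["safe"]),
        risk_score=int(payload["riskScore"]),
        risk_level=str(payload["riskLevel"]),
        categories=categories,
        flagged_categories=list(payload["flaggedCategories"]),
        summary=str(payload["summary"]),
        language=str(payload["language"]),
    )
```

Typed objects like these make it harder to misspell a key such as `riskScore` at a call site, at the cost of failing with a `KeyError` if the service ever omits a field.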

Technical details

Endpoint

Method: POST
Path: /moderate
Purpose: analyze text for safety violations

Request body format

```json
{
  "text": "The text to analyze"
}
```
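
A call to the endpoint might be assembled as in the sketch below, using only the Python standard library. Only the method, the `/moderate` path, and the `text` field come from this listing. The host name `content-safety-api1.p.rapidapi.com` is an assumption based on the usual RapidAPI `<slug>.p.rapidapi.com` pattern, and the `X-RapidAPI-Key` / `X-RapidAPI-Host` headers are the general RapidAPI convention rather than something this page states. The function names are hypothetical.

```python
import json
import urllib.request

# ASSUMPTION: host derived from the listing URL slug; confirm it on the API page.
ASSUMED_HOST = "content-safety-api1.p.rapidapi.com"


def build_moderate_request(
    text: str, api_key: str, host: str = ASSUMED_HOST
) -> urllib.request.Request:
    """Build (but do not send) a POST /moderate request."""
    body = json.dumps({"text": text}).encode("utf-8")
    return urllib.request.Request(
        url=f"https://{host}/moderate",
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            # ASSUMPTION: standard RapidAPI authentication headers.
            "X-RapidAPI-Key": api_key,
            "X-RapidAPI-Host": host,
        },
    )


def moderate(text: str, api_key: str) -> dict:
    """Send the request and decode the JSON response (performs network I/O)."""
    request = build_moderate_request(text, api_key)
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.loads(response.read().decode("utf-8"))
```

Keeping request construction separate from the network call lets the request be inspected or unit-tested without an API key.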

Example response format

```json
{
  "safe": false,
  "riskScore": 85,
  "riskLevel": "critical",
  "categories": {
    "violence": { "detected": false, "confidence": 0 },
    "hate": { "detected": false, "confidence": 0 },
    "sexual": { "detected": false, "confidence": 0 },
    "selfHarm": { "detected": false, "confidence": 0 },
    "profanity": { "detected": true, "confidence": 0.95 },
    "harassment": { "detected": true, "confidence": 0.9 },
    "criminal": { "detected": false, "confidence": 0 },
    "childExploitation": { "detected": false, "confidence": 0 },
    "weapons": { "detected": false, "confidence": 0 },
    "privacy": { "detected": false, "confidence": 0 }
  },
  "flaggedCategories": ["profanity", "harassment"],
  "summary": "The text contains profanity and directed harassment.",
  "language": "en"
}
```
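
A caller that wants a stricter cutoff than the service's own `detected` flag could filter on the per-category confidence. The sketch below assumes the response shape shown above. The helper name `categories_above` and the default threshold of 0.5 are illustrative choices, not values documented by the API.

```python
def categories_above(response: dict, threshold: float = 0.5) -> list[str]:
    """Return detected category names with confidence >= threshold, highest first."""
    hits = [
        (name, result["confidence"])
        for name, result in response["categories"].items()
        if result["detected"] and result["confidence"] >= threshold
    ]
    hits.sort(key=lambda item: item[1], reverse=True)
    return [name for name, _ in hits]


# Trimmed copy of the example response (only three categories kept).
sample = {
    "categories": {
        "violence": {"detected": False, "confidence": 0},
        "profanity": {"detected": True, "confidence": 0.95},
        "harassment": {"detected": True, "confidence": 0.9},
    }
}

print(categories_above(sample))        # ['profanity', 'harassment']
print(categories_above(sample, 0.92))  # ['profanity']
```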

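Risk level definitions

Level	Score range	Meaning
none	0	Content is completely safe
low	1-25	Minor issues
medium	26-50	Moderate safety concerns
high	51-75	Serious safety violation
critical	76-100	Extremely serious safety violation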
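
The ranges in the table can be written as a small lookup, which is useful for checking that a returned `riskLevel` agrees with its `riskScore`. This is a sketch that assumes integer scores. The function name `risk_level` is hypothetical, and how the service treats fractional scores is not documented here.

```python
def risk_level(score: int) -> str:
    """Map a 0-100 risk score to the level name from the table."""
    if not 0 <= score <= 100:
        raise ValueError(f"score must be between 0 and 100, got {score}")
    if score == 0:
        return "none"
    if score <= 25:
        return "low"
    if score <= 50:
        return "medium"
    if score <= 75:
        return "high"
    return "critical"


print(risk_level(85))  # critical
```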

Detection examples

Input	Result	Flagged categories
“The weather is nice”	safe	-
“This is fucking amazing”	unsafe	profanity
“you are an idiot”	unsafe	harassment
“fuck you”	unsafe	profanity, harassment
“I want to kill you”	unsafe	violence, harassment
“Te voy a matar”	unsafe	violence (lang: es)
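
The rows above could also serve as a quick regression check against a live or stubbed client. The sketch below is hypothetical: `check_examples` takes any callable that returns a dict with `safe` and `flaggedCategories` keys, and it assumes a safe response carries an empty `flaggedCategories` list, which this listing does not confirm.

```python
from typing import Callable

# Rows from the detection examples table:
# (input text, expected "safe" flag, expected flagged categories)
EXAMPLES = [
    ("The weather is nice", True, set()),
    ("This is fucking amazing", False, {"profanity"}),
    ("you are an idiot", False, {"harassment"}),
    ("fuck you", False, {"profanity", "harassment"}),
    ("I want to kill you", False, {"violence", "harassment"}),
    ("Te voy a matar", False, {"violence"}),
]


def check_examples(moderate: Callable[[str], dict]) -> list[str]:
    """Return one description per row whose response differs from the table."""
    failures = []
    for text, expected_safe, expected_flags in EXAMPLES:
        response = moderate(text)
        got_flags = set(response["flaggedCategories"])
        if response["safe"] != expected_safe or got_flags != expected_flags:
            failures.append(
                f"{text!r}: safe={response['safe']}, flagged={sorted(got_flags)}"
            )
    return failures
```

Passing the client in as a callable keeps the check independent of how requests are sent, so a stub can stand in for the service during tests.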