Automated Moderation
Set your policies once. The platform enforces them automatically — no manual intervention needed.
Define your moderation policies
Create rules that automatically enforce your moderation policies based on AI scores. Rules are evaluated in priority order: the first rule that matches wins, and its actions are executed.
if hate ≥ 0.8 → hide + flag_for_review
if sexual/minors ≥ 0.5 → delete + notify_webhook
if spam ≥ 0.7 → mark_as_spam
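The first-match-wins behavior described above can be sketched in a few lines of Python. This is only an illustration, not the platform's implementation. It assumes that a lower priority number is evaluated first, that all conditions of a rule must hold, and that a missing category scores 0.0. The names Rule, Condition, and evaluate are hypothetical, as are the priorities 20 and 30 and the names of the second and third rules.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Optional

# Only "gte" appears in the source; other operators would be added here.
OPERATORS = {
    "gte": lambda score, threshold: score >= threshold,
}


@dataclass
class Condition:
    category: str
    operator: str
    score: float


@dataclass
class Rule:
    name: str
    priority: int
    conditions: list[Condition]
    actions: list[str]


def matches(rule: Rule, scores: dict[str, float]) -> bool:
    # Assumption: every condition must hold; an unscored category counts as 0.0.
    return all(
        OPERATORS[c.operator](scores.get(c.category, 0.0), c.score)
        for c in rule.conditions
    )


def evaluate(rules: list[Rule], scores: dict[str, float]) -> Optional[Rule]:
    # Assumption: a lower priority number is evaluated first.
    for rule in sorted(rules, key=lambda r: r.priority):
        if matches(rule, scores):
            return rule  # first match wins; later rules are not consulted
    return None


# The three example rules from the list above (priorities 20 and 30 are made up).
RULES = [
    Rule("Hide hate speech", 10, [Condition("hate", "gte", 0.8)],
         ["hide", "flag_for_review"]),
    Rule("Delete sexual/minors", 20, [Condition("sexual/minors", "gte", 0.5)],
         ["delete", "notify_webhook"]),
    Rule("Mark spam", 30, [Condition("spam", "gte", 0.7)],
         ["mark_as_spam"]),
]

winner = evaluate(RULES, {"hate": 0.9, "spam": 0.95})
print(winner.name if winner else "no rule matched")
```

In this sketch, content that scores high on both hate and spam triggers only the hate rule, because that rule comes first in priority order.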
Create a rule (API)
POST /v1/admin/rules
{
  "name": "Hide hate speech",
  "priority": 10,
  "conditions": [
    { "category": "hate", "operator": "gte", "score": 0.8 }
  ],
  "actions": [
    { "type": "hide" },
    { "type": "flag_for_review" }
  ]
}
Available actions
Each rule can trigger one or more actions.
Delete
Permanently remove violating content from your community. The removal is carried out through the native platform API (Facebook, YouTube, Discord).
Hide
Make violating content invisible to your users. The content is preserved for review and audit purposes, but end users will not see it.
Flag for Review
Send questionable content to your team's manual review queue. Your moderators can then approve or reject it.
Allow
Whitelist specific content, authors, or channels so they are never flagged or removed.
Notify Webhook
Send real-time alerts to your team via webhook when specific content is flagged, enabling custom workflows or third-party integrations.
Mark as Spam
Apply the platform's own spam classification. On YouTube, the comment is flagged as spam to reduce its visibility.
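As a rough sketch of how a rule with several actions might be assembled and submitted to POST /v1/admin/rules, the Python below builds the request body and checks the action types before sending. Several details are assumptions rather than documented behavior: the identifier "allow" for the Allow action (the other five identifiers appear in the examples on this page), the base URL, Bearer-token authentication, the rule name, and the priority value. The helpers build_rule and build_request are hypothetical.

```python
import json
import urllib.request

# Five identifiers come from the examples on this page; "allow" is assumed.
KNOWN_ACTIONS = {
    "delete", "hide", "flag_for_review",
    "allow", "notify_webhook", "mark_as_spam",
}


def build_rule(name, priority, category, threshold, actions):
    """Build a rule body in the shape of the create-rule example."""
    if not actions:
        raise ValueError("a rule needs at least one action")
    unknown = [a for a in actions if a not in KNOWN_ACTIONS]
    if unknown:
        raise ValueError(f"unknown action types: {unknown}")
    return {
        "name": name,
        "priority": priority,
        "conditions": [
            {"category": category, "operator": "gte", "score": threshold}
        ],
        "actions": [{"type": a} for a in actions],
    }


def build_request(base_url, token, rule):
    """Prepare the POST request without sending it."""
    return urllib.request.Request(
        url=f"{base_url}/v1/admin/rules",
        data=json.dumps(rule).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Assumption: the API accepts a Bearer token.
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


# Mirrors the example "if sexual/minors >= 0.5 -> delete + notify_webhook".
rule = build_rule("Delete sexual/minors content", 1, "sexual/minors", 0.5,
                  ["delete", "notify_webhook"])
request = build_request("https://moderation.example.com", "YOUR_API_TOKEN", rule)

# urllib.request.urlopen(request) would send it; omitted so the sketch runs offline.
print(request.get_method(), request.full_url)
```

Validating action types on the client is optional, but it catches a misspelled action before the request is sent.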
How it works
Audit trail — action log
{
  "content_id": "cnt_01JXK...",
  "rule": "Hide hate speech",
  "action_type": "hide",
  "connector": "facebook",
  "status": "executed",
  "executed_at": "2026-06-08T14:30:00Z"
}
Ready to simplify your moderation stack?
Deploy in minutes with Docker or start a free trial. One API for every moderation provider, with no vendor lock-in.