How Moderation in TrollWall AI Works

How TrollWall AI moderation works: platform timing, reversible actions, feature compatibility, and managing hidden comments effectively.

Written by Filip Strycko
Updated yesterday

Overview

TrollWall AI provides automated comment moderation across Facebook, Instagram, YouTube, and TikTok, as well as direct messages in Facebook Messenger. Our system is designed to protect your brand while maintaining transparency and control over moderated content.

Key Moderation Principles

Hate speech gets hidden automatically before your community sees it - no manual review is needed on your end.

Our agents are purpose-built for social media moderation and trained per language by native speakers, so they catch culturally specific insults and slurs that generic tools miss.

You keep full control: every hidden comment is visible in your TrollWall dashboard, and you can unhide anything with one click. We never permanently delete a comment or a message - that decision always stays with you.

Comments Are Hidden, Never Deleted

Important: TrollWall never permanently deletes comments. Instead, we hide inappropriate content, which means:

  • Reversible actions: All moderation decisions can be undone through the TrollWall interface

  • Complete control: You can make any hidden comment visible again at any time

  • Content preservation: Original comments remain intact for review and analysis
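As a rough illustration of what "hidden, never deleted" means, the sketch below models a comment as a record with a visibility flag. The `Comment` class and the `hide` and `show` functions are hypothetical names chosen for this example; they are not TrollWall's data model or API.

```python
from dataclasses import dataclass


@dataclass
class Comment:
    """Hypothetical record; field names are illustrative only."""
    text: str
    hidden: bool = False


def hide(comment: Comment) -> None:
    # Hiding only flips a flag; the original text is left untouched.
    comment.hidden = True


def show(comment: Comment) -> None:
    # Reversing the decision makes the comment publicly visible again.
    comment.hidden = False


comment = Comment(text="example comment")
hide(comment)
hidden_after_hide = comment.hidden
show(comment)
print(hidden_after_hide, comment.hidden, comment.text)
```

Because nothing is removed, the original text is still available for review after any number of hide and show actions.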

User Experience After Moderation

When TrollWall hides a comment:

  • Comment author and their friends: Can still see the comment (it appears normal to them)

  • Public viewers: Cannot see the hidden comment

  • Page owners: Can review and manage hidden comments in the TrollWall dashboard

When TrollWall moderates a direct message:

  • Page owners: Can review and manage messages in the TrollWall dashboard by opening the Junk folder

Verifying comments are truly hidden: If you think a hidden comment is still visible, remember that you as a Page admin will always see it. To verify it's hidden from the public, view the post from an account that is not:

  • An admin of your Page

  • Friends with the comment author

Note: The exact user experience may vary slightly between platforms, but the core principle remains consistent across all social media channels.
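The visibility rules above can be summarized in a small sketch. It assumes the simplified model described in this article (Page admins, the author, and the author's friends still see a hidden comment); the `Viewer` class and function names are hypothetical, and individual platforms may differ in detail.

```python
from dataclasses import dataclass


@dataclass
class Viewer:
    """Hypothetical description of the account used to view a post."""
    is_page_admin: bool = False
    is_author: bool = False
    is_friend_of_author: bool = False


def can_see_hidden_comment(viewer: Viewer) -> bool:
    # Admins, the author, and the author's friends still see a hidden comment.
    return viewer.is_page_admin or viewer.is_author or viewer.is_friend_of_author


def is_suitable_test_account(viewer: Viewer) -> bool:
    # A good verification account behaves like a member of the public.
    return not can_see_hidden_comment(viewer)


print(is_suitable_test_account(Viewer()))                    # unrelated account
print(is_suitable_test_account(Viewer(is_page_admin=True)))  # Page admin
```

In practice this means checking from a neutral account that has no connection to either the Page or the comment author.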

Platform-Specific Moderation Timing

Meta Platforms (Facebook, Messenger & Instagram)

  • Moderation speed: Almost immediate

  • Typical delay: Just a few seconds

  • Real-time protection: Comments are moderated as soon as they appear

YouTube

  • Moderation frequency: Every 15 minutes

  • Reason for delay: YouTube API limitations and quota restrictions

  • Process: TrollWall regularly checks for new comments and moderates them in batches

TikTok

  • Moderation frequency: Every 15 minutes

  • Reason for delay: TikTok API limitations and quota restrictions

  • Process: TrollWall regularly polls for new comments and applies moderation rules
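To make the timing differences concrete, here is a minimal sketch that estimates how long a new comment may wait before the next moderation pass. The 15-minute intervals come from this article; the function name, the dictionary layout, and the idea of measuring time since the last check are assumptions for illustration, and processing time after the check is not included.

```python
# Polling intervals stated in the article; the dictionary itself is illustrative.
POLL_INTERVAL_MINUTES = {"youtube": 15, "tiktok": 15}
# Meta platforms are described as near-immediate (a few seconds).
NEAR_REAL_TIME = {"facebook", "messenger", "instagram"}


def minutes_until_next_check(platform: str, minutes_since_last_check: float = 0.0) -> float:
    """Estimate how long a new comment waits before the next moderation pass."""
    if platform in NEAR_REAL_TIME:
        return 0.0  # seconds-level delay, treated as zero minutes here
    interval = POLL_INTERVAL_MINUTES[platform]
    if not 0 <= minutes_since_last_check <= interval:
        raise ValueError("minutes_since_last_check must be within one interval")
    return interval - minutes_since_last_check


print(minutes_until_next_check("youtube", 4))
print(minutes_until_next_check("instagram"))
```

A comment posted right after a check sits at the worst case of roughly 15 minutes on YouTube and TikTok, which is worth keeping in mind when planning coverage for high-traffic posts.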

Special Cases

Facebook Collaborative Posts

Facebook Collaborative Posts (Collab Posts) have unique moderation requirements:

How Collab Posts Work:

  • Multiple Facebook Pages can create content together

  • The post "belongs" to the Page that initiated the collaboration

  • Only the initiating Page controls comment moderation

TrollWall Moderation Rules:

  • Visible in TrollWall: Collab posts only appear in the initiating Page's feed

  • Moderation control: Only works if the initiating Page uses TrollWall

  • Important limitation: If the initiating Page doesn't have TrollWall:

    • The post won't appear in TrollWall at all

    • No moderation is possible, even if other collaborating Pages use TrollWall

Note: This behavior is determined by Facebook's collaborative post structure and applies to all multi-Page collaborations.
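The Collab Post rule reduces to a single check on the initiating Page. The sketch below expresses it with hypothetical Page names and a hypothetical function; it illustrates the rule as described here and is not part of any TrollWall or Facebook API.

```python
def collab_post_is_moderated(initiating_page: str, pages_with_trollwall: set[str]) -> bool:
    # Only the initiating Page matters; collaborators do not change the outcome.
    return initiating_page in pages_with_trollwall


# Hypothetical example: "Brand B" uses TrollWall, "Brand A" does not.
pages_with_trollwall = {"Brand B"}

print(collab_post_is_moderated("Brand A", pages_with_trollwall))  # initiated by Brand A
print(collab_post_is_moderated("Brand B", pages_with_trollwall))  # initiated by Brand B
```

When planning a collaboration, the simplest approach is to have the Page that uses TrollWall initiate the post.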

Platform Compatibility & Features

TrollWall AI supports different features across platforms based on API limitations:

Facebook

Instagram

TikTok

YouTube

Facebook Messenger

API Integration

Hide comments

✅ - Move to Junk

❌ - Not available

Delete comments

❌ - API limitation

❌ - API limitation

❌ - Not available

Like comments

❌ - API limitation

❌ - API limitation

❌ - Not available

Block users

Reply

Reply with images

❌ - API limitation

❌ - API limitation

❌ - API limitation

⚠️ - Limited availability

Sentiment analysis

Recommended actions

Post comments analysis

AI comment reply generation

Limitations

TikTok

The limit for replies is 150 characters. This limitation is due to TikTok's API (https://business-api.tiktok.com/portal/docs/create-a-new-comment-on-an-owned-video/v1.3).
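If you draft replies outside TrollWall, a quick length check can catch texts that exceed the limit before you paste them in. This is a minimal sketch: the helper names are hypothetical, and it assumes the limit counts characters the way Python's `len()` does, which may not match TikTok's counting for emoji or other multi-byte characters.

```python
TIKTOK_REPLY_LIMIT = 150  # character limit stated in the article


def fits_tiktok_reply(text: str) -> bool:
    # Assumption: one Python character counts as one character for TikTok.
    return len(text) <= TIKTOK_REPLY_LIMIT


def shorten_for_tiktok(text: str, suffix: str = "...") -> str:
    """Trim a draft so that it fits the limit, marking the cut with a suffix."""
    if fits_tiktok_reply(text):
        return text
    keep = TIKTOK_REPLY_LIMIT - len(suffix)
    return text[:keep].rstrip() + suffix


draft = "Thank you for your feedback! " * 10
print(fits_tiktok_reply(draft))
print(len(shorten_for_tiktok(draft)))
```

Automatic trimming can cut a sentence mid-thought, so it is usually better to rewrite a long reply by hand and use the check only as a guard.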

Managing Moderated Content

Reviewing Hidden Comments

  1. Access the Comments section in your TrollWall dashboard

  2. Filter for hidden comments to review moderation decisions

  3. Click on any comment to see detailed moderation reasoning

  4. Use the "Show" action to make comments visible again if needed

Reviewing Social Inbox Conversations Moved to Junk

  1. Access the Junk tab in your Social Inbox to see messages that were classified as toxic/spam

  2. Use the bin icon to move the message thread back to the main inbox

Note: Junk classifications in TrollWall (applied manually or by AI) remain internal to TrollWall and do not affect the conversation's status on Facebook. Due to Facebook API limitations, third-party apps cannot mark conversations as spam or report them to Facebook.

Understanding Comment Status Messages

When reviewing comments in TrollWall AI, you may see the following status messages explaining why content is hidden. Note that a comment is re-evaluated every time it is edited on the social media platform.

TrollWall AI Moderation:

  • Shows specific detection reasons (hate speech, spam, etc.)

  • All decisions can be manually reversed

External Platform Actions:

  • "Social media platform": The comment was hidden by the platform itself (Facebook, Instagram, etc.) either automatically through their systems or manually by platform moderators, not by TrollWall AI. Our AI does not evaluate comments that arrive already hidden.
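The two behaviors described above (re-evaluation on every edit, and no evaluation of comments that arrive already hidden by the platform) can be sketched as follows. Everything here is hypothetical: the `classify` function is a stand-in that flags a placeholder word, not TrollWall's AI, and the field names are illustrative only.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Comment:
    """Hypothetical record; not TrollWall's actual schema."""
    text: str
    hidden_by_platform: bool = False
    status: Optional[str] = None  # reason shown next to a hidden comment


def classify(text: str) -> Optional[str]:
    # Stand-in for the AI: flags only the placeholder word "spamword".
    return "spam" if "spamword" in text.lower() else None


def evaluate(comment: Comment) -> None:
    if comment.hidden_by_platform:
        # Already hidden by the platform, so the AI does not evaluate it.
        comment.status = "Social media platform"
        return
    comment.status = classify(comment.text)


def on_edit(comment: Comment, new_text: str) -> None:
    # Every edit on the platform triggers a fresh evaluation.
    comment.text = new_text
    evaluate(comment)


comment = Comment(text="Nice post!")
evaluate(comment)
on_edit(comment, "Buy SPAMWORD now")
print(comment.status)
```

This is why a comment that was initially left visible can become hidden later: an edit may introduce content that the new evaluation flags.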

Sort Comments by Toxicity Type

A new filter lets you sort comments by specific toxicity types, including spam, hate speech, and disinformation. This makes it easier to prioritize harmful content and respond faster, so your brand’s community stays safe and on-topic.
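Conceptually, the filter narrows the comment list to one toxicity type at a time. The sketch below shows the idea on made-up data; the dictionary keys and the function name are assumptions for illustration and do not reflect TrollWall's export format or API.

```python
# Made-up sample data; keys are illustrative only.
comments = [
    {"id": 1, "toxicity": "spam"},
    {"id": 2, "toxicity": "hate speech"},
    {"id": 3, "toxicity": "disinformation"},
    {"id": 4, "toxicity": "spam"},
]


def filter_by_toxicity(items: list[dict], toxicity_type: str) -> list[dict]:
    # Keep only the comments labeled with the requested toxicity type.
    return [item for item in items if item["toxicity"] == toxicity_type]


spam_comments = filter_by_toxicity(comments, "spam")
print([item["id"] for item in spam_comments])
```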

Best Practices

Optimizing Moderation Effectiveness

  • Regular reviews: Check hidden comments periodically to ensure accuracy

  • Platform awareness: Understand timing differences between platforms

  • Collab post planning: Ensure the initiating Page has TrollWall for collaborative content

  • Custom rules: Work with your account manager to fine-tune moderation sensitivity

Managing False Positives

If TrollWall hides legitimate comments:

  1. Review the comment in your dashboard

  2. Click "Show" to make it visible to the public

  3. Contact support if you notice patterns of over-moderation

  4. Request custom rule adjustments if needed

Getting Support

For questions about moderation behavior or to request custom moderation rules, contact your TrollWall account manager or email us at [email protected].

Summary

TrollWall's moderation system prioritizes brand protection while maintaining complete transparency and control. With platform-optimized timing and reversible actions, you can trust that your community management remains both automated and flexible.