A newer version of the Gradio SDK is available:
6.2.0
metadata
title: Multilingual Hate Speech Detector
emoji: π‘οΈ
colorFrom: red
colorTo: blue
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit
short_description: Hate speech detector
models:
- xlm-roberta-base
datasets:
- hate-speech
π‘οΈ Multilingual Hate Speech Detector
Advanced AI system for detecting hate speech in English and Serbian text with innovative contextual analysis
π¬ Key Innovations
1. Contextual Analysis π
- Word-level importance highlighting using transformer attention weights
- Visual explanation showing which words most influenced the classification decision
- Color-coded highlighting: π΄ Red (high influence) β π Orange β π‘ Yellow β βͺ Gray (low influence)
2. Confidence Visualization π
- Interactive Plotly charts showing model confidence across all 8 categories
- Real-time confidence distribution analysis
- Color-coded bars distinguishing hate speech categories from appropriate content
3. Interactive Feedback System π¬
- User rating system (1-5 stars) for continuous model improvement
- Feedback collection for enhancing accuracy
- Community-driven model refinement
π Hate Speech Categories
The system detects 8 categories:
- Race: Racial discrimination and slurs
- Sexual Orientation: Homophobic content, LGBTQ+ discrimination
- Gender: Sexist content, misogyny, gender-based harassment
- Physical Appearance: Body shaming, lookism, appearance-based harassment
- Religion: Religious discrimination, islamophobia, antisemitism
- Class: Classist content, economic discrimination
- Disability: Ableist content, discrimination against disabled people
- Appropriate: Non-hateful, normal conversation
π Multilingual Support
- English: Comprehensive hate speech detection
- Serbian: Native Serbian language support with Cyrillic and Latin scripts
- Cross-lingual: XLM-RoBERTa architecture enables robust multilingual understanding
π§ Technical Architecture
- Base Model: XLM-RoBERTa (Cross-lingual Language Model)
- Training: Fine-tuned on multilingual hate speech datasets
- Attention Mechanism: Transformer attention weights for explainable AI
- Real-time Processing: Optimized for instant classification
- GPU Acceleration: CUDA support for faster inference
π How to Use
- Input Text: Enter any text in English or Serbian
- Analyze: Click "Analyze Text" for instant classification
- Review Results: See category prediction with confidence score
- Examine Context: Check word-level highlighting to understand the decision
- View Confidence: Analyze the confidence distribution chart
- Provide Feedback: Rate the analysis to help improve the model
π― Example Analyses
Appropriate Content
"I really enjoyed that movie last night! Great acting and storyline."
β β
Appropriate (95% confidence)
Hate Speech Detection
"You people are all the same, always causing problems everywhere."
β β οΈ Race (87% confidence)
Serbian Language
"Ovaj film je bio odliΔan, preporuΔujem svima!"
β β
Appropriate (92% confidence)
β‘ Performance
- Accuracy: High-confidence predictions with detailed explanations
- Speed: Real-time processing (< 2 seconds per analysis)
- Languages: English and Serbian with cross-lingual capabilities
- Explainability: Visual attention analysis for transparent decisions
π οΈ Local Development
# Clone the repository
git clone <repository-url>
cd hate-speech-detector
# Install dependencies
pip install -r requirements.txt
# Run the application
python app.py
π Research & Education
This AI system is designed for:
- Research purposes: Understanding hate speech patterns
- Educational use: Learning about AI explainability
- Content moderation: Assisting human moderators
- Linguistic analysis: Cross-lingual hate speech research
β οΈ Important Notes
- Results should be interpreted carefully
- Human judgment should always be applied for critical decisions
- The system is designed to assist, not replace, human moderation
- Continuous improvement through user feedback
π€ Contributing
We welcome feedback and contributions! Please use the interactive feedback system within the application to help improve model accuracy.
π License
MIT License - See LICENSE file for details
β‘ Powered by: Transformer Neural Networks | π Languages: English, Serbian | π― Focus: Explainable AI