البحث
Arabic Sentiment Analysis
🏆 NLP Competition – Arabic Sentiment Analysis
Dataset: HARD (Hotel Arabic Reviews) • Language: Arabic (MSA + dialectal) • Task: 3-class sentiment
Focus: Preprocessing + Modeling + Error Analysis
Deliverables: Predictions + Code + Report
Level: Intermediate–Advanced
Submit Your Work
How it works
- Download: the dataset (link below).
- Build: a solid sentiment classifier (3 classes: positive / negative / neutral).
- Submit: predictions for the test split, your notebook/code, and a short report.
Dataset
HARD – Hotel Arabic Reviews Dataset (Arabic reviews from booking platforms; includes Modern Standard Arabic and dialectal Arabic).
- Labels:
positive,negative,neutral - Fields (typical):
id,review,rating/label(naming may vary per file) - Dataset link: HARD Arabic Dataset
- Create your own train/dev/test splits and document how you did it.
What you need to do
- Preprocessing: Arabic normalization (remove diacritics; normalize alef/ya/ta marbuta; remove Tatweel), basic punctuation/URL cleanup.
- Baselines: TF-IDF (char 3–5 + word 1–2) + Logistic Regression / Linear SVM (with class weights).
- Advanced (optional): fine-tune AraBERT/ARBERT or XLM-R for 3-class classification.
- Evaluation: use Macro F1 on your dev split; report per-class P/R/F1.
- Error Analysis: highlight failure cases (sarcasm, short texts, mixed dialect).
Submission Format
- Predictions file:
predictions.csvwith two columns:id,label(labels in {positive, negative, neutral}). - Code/Notebook: Jupyter notebook or repo link (data prep → training → eval → inference).
- Short Report (PDF/DOC, ≤3 pages): preprocessing, models tried, metrics, insights & error analysis, conclusion.
Evaluation
- Macro F1 on test set — 70%
- Report quality & insights — 20%
- Code clarity & reproducibility — 10%
Ties broken by higher F1 on minority class, then report completeness.
Timeline
- Start: on publish
- Evaluation: within 7 days after review
Submit your work
Upload your predictions, code, and report via the form below. You will receive a confirmation email.
Open Submission Form