Open-source, culturally aware LLMs for Arabic tasks
Unit for Research in Arabic Social and Digital Spaces
Part of the Arab Center for Research and Policy Studies (ACRPS)
Empowering Arabic Digital Spaces with NLP, ML, and Computational Linguistics.
Projects & Products
Visualization framework for AOI
Linguistic expert agent
Knowledge graph toolset to power content generation for Arabica
Automating editorial tasks for Arabic research and other domains
Chat with your favorite authors using AI on Mastodon
Custom URL shortener for ACRPS
Creating benchmarks for Arabic NLP tasks, cultural awareness, and local value systems
Curation of large multimodal datasets across different topics and domains
Shaghle
Coming Soon: WhatsApp agent that helps you find jobs
Micless Communication System
Coming Soon: Solution for awkward silences in live conferences
Publications
"NeoAraBERT: A Modern Foundation Model for Arabic Embeddings with Diacritics-Aware Tokenization and POS-Targeted Masking"
Findings of the Association for Computational Linguistics: ACL 2026, 2026
"Back-of-the-Book Index Automation for Arabic Documents"
Proceedings of the 2nd Workshop on NLP for Languages Using Arabic Script, 2026
"Arabic Citation Parsing using Part of Speech and Named Entity Recognition"
Proceedings of the 2nd Workshop on NLP for Languages Using Arabic Script, 2026
"MASRAD: Arabic Terminology Management Corpora with Semi-Automatic Construction"
LREC 2026, 2026
"R-BPE: Improving BPE-Tokenizers with Token Reuse"
EMNLP, 2025
"ImageEval 2025: The First Arabic Image Captioning Shared Task"
ArabicNLP Shared Tasks, 2025
"AREEj: Arabic Relation Extraction with Evidence"
ArabicNLP, 2024
"DRU at WojoodNER 2024: A Multi-level Method Approach"
ArabicNLP, 2024
"DRU at WojoodNER 2024: ICL LLM for Arabic NER"
ArabicNLP, 2024
"Arabic Topic Classification in the Generative and AutoML Era"
ArabicNLP, 2023
Resources
Relation extraction with evidence
With Wojood and Camel
With BERTopic
Shifts in research discourse on Palestine