care4lang

Publications
Edited Conference/Workshop Proceedings
  • Diab, Mona, Pascale Fung, Julia Hirschberg, Thamar Solorio, Editors. (2016) 2nd Workshop on Computational Approaches to Linguistic Code Switching. In Proceedings of Empirical Methods of Natural Language Processing (EMNLP).

  • Diab, Mona, Houda Bouamor, Ahmed ElKholy, Yuval Marton, Mahmoud Ghoneim, Editors. (2016) Workshop on Machine Translation for Semitic Languages (SEMAT). In Proceedings of Automatic Machine Translation in the Americas (AMTA).

  • Diab, Mona, Pascale Fung, Julia Hirschberg, Thamar Solorio, Editors. (2014) 1st Workshop on Computational Approaches to Linguistic Code Switching. In Proceedings of Empirical Methods of Natural Language Processing (EMNLP).

  • Diab, Mona, Timothy Baldwin, Marco Baroni, Editors. (2013) 2nd International Joint Conference on Semantics (*SEM). Proceedings of *SEM 2013.
Invited Articles
  • Diab, Mona. (2016) Overview of Arabic Computational Linguistics, Routledge Handbook on Arabic Linguistics. Editors Reem Bassiouney and Abbas Benmamoun.

  • Diab, Mona. (2015) Tharawat: A Vision for a Comprehensive Resource for Arabic Computational Processing. Journal for Computational Linguistics and Intelligent Text Processing, pp. 85-97.

  • Diab, Mona and Yuval Marton. (2014) Semitic Semantics. Book Chapter in Natural Language Processing for Semitic Languages, Editor Imed Zitouni, Springer Publishers, pp. 129-159.

  • ElFardy, Heba, Mohamed AlBadrashiny, Mona Diab. (2014) A Hybrid System for Code Switch Point Detection in Informal Arabic Text. XRDS: Crossroads, The ACM Magazine for Students 21. (201), pp. 52-57.

  • Bar, Kfir, Mona Diab, Abdelati Hawwari. (2013) Arabic Multiword Expressions: Resource and Tool Creation. Book Chapter in Natural Language Processing for Semitic Languages, Editors Nachum Dershowitz and Ephraim Nissan, Vol.3, in honor of Yaacov Choueka, Springer Publishers.
Journal Articles
2016
  • Al Aqeel S, Abanmy N, Abeer Aldayel, Hend S. Al-Khalifa, Maha Al-Yahya, Mona Diab. (Submitted). Readability of medication information materials in Saudi Arabia: expert and non-expert evaluation. Journal of Methods of Medical Information. [pdf] [bibtex]

  • Zaghouani, Wajdi, Abdelati Hawwari, Mona Diab. (2016) AMPN: A Lexical Semantic Resource for Arabic Morphological Patterns. International Journal of Speech Technologies, Springer Publishers. [pdf] [bibtex]

2014
  • Abdul-Mageed, Mohammad, Mona Diab and Sandra Kuebler. (2014) SAMAR: A System for Subjectivity and Sentiment Analysis for Arabic Social Media. Computer Speech and Language, 28 (1), pp. 20-37. [pdf] [bibtex]

  • Agirre, Eneko, Daniel Cer, Mona Diab, Aitor Gonzalez-Agirre, and Weiwei Guo. (2014) Semantic Textual Similarity. Journal for Language And Resource Evaluation, in preparation. [pdf] [bibtex]

  • Zirikly, Aya and Mona Diab. (2014) ANEAR: Automatic Named Entity Aliasing Resolution. Transactions for Computational Linguistics, in preparation. [pdf] [bibtex]

Peer Reviewed Conference Papers
2016
  • AlQahtani, Sawsan, Mahmoud Ghoneim, Mona Diab. (2016) Impact of Explicit encoding of vowelization in Arabic Machine translation. Proceedings of Automatic Machine Translation Association (AMTA) 2016, Austin, Texas, USA, Nov. [pdf] [bibtex]

  • Hawwari, Abdelati, Mohammed Attia, Mahmoud Ghoneim, Mona Diab. (2016) Explicit Fine grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic. In Proceedings of LREC 2016, Slovenia, May. [pdf] [bibtex]

  • Diab, Mona, Mahmoud Ghoneim, Abdelati Hawwari, Fahad AlGhamdi, Nada AlMarwani, Mohamed Al-Badrashiny. (2016) Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data. In Proceedings of LREC 2016, Slovenia, May. [pdf] [bibtex]

  • Zaghouani, Wajdi, Houda Bouamor, Abdelati Hawwari, Mona Diab, Ossama Obeid, Mahmoud Ghoneim, Sawsan Alqahtani, Kemal Oflazer. (2016) Large Scale Arabic Diacritized Corpus: Guidelines and Framework. In Proceedings of LREC 2016, Slovenia, May. [pdf] [bibtex]

  • Al-Badrashiny, Mohamed, Arfath Pasha, Mona Diab, Nizar Habash, Owen Rambow, Wael Salloum, and Ramy Eskander. (2016) SPLIT: Smart Preprocessing (Quasi) Language Independent Tool. In Proceedings of LREC 2016, Slovenia, May. [pdf] [bibtex]
2015
  • Attia, Mohamed, Mohamed Al-Badrashiny, Mona Diab. (2015) GWU-HASP-2015@QALB-2015 Shared Task: Priming Spelling Candidates with Probability. Proceedings of the Second Workshop on Arabic Natural Language Processing, pages 138–143, Beijing, China, July 26-31, 2015. [pdf] [bibtex]

  • Agirre, Eneko, Carmen Banea, Claire Cardie, Daniel Cer, Mona Diab, Aitor Gonzalez-Agirre, Weiwei Guo, Inigo Lopez-Gazpio, Montse Maritxalar, Rada Mihalcea, German Rigau, Larraitz Uria and Janyce Wiebe. (2015) SemEval-2015 Task 2: Semantic Textual Similarity, English, Spanish and Pilot on Interpretability. Proceedings of NAACL SEMEVAL 2015, Denver CO, Jun’15. [pdf] [bibtex]

  • Aminian, Maryam, Mahmoud Ghoneim, Mona Diab. (2015) Unsupervised False Friend Disambiguation Using Contextual Word Clusters and Parallel Word Alignments. Proceedings of NAACL Workshop 9th SSST. Denver CO, Jun’15. BEST PAPER AWARD. [pdf] [bibtex]

  • Prabhakaran, Vinodkumar, Tomas By, Julia Hirschberg, Owen Rambow, Samira Shaikh, Tomek Strzalkowski, Jennifer Tracey, Michael Arrigo, Rupayan Basu, Micah Clark, Adam Dalton, Mona Diab, Louise Guthrie, Anna Prokofieva, Stephanie Strassel, Gregory Werner, Yorick Wilks and Janyce Wiebe. (2015) A New Dataset and Evaluation for Belief/Factuality. Proceedings of 4th *SEM Conference, Denver CO, Jun 2015. [pdf] [bibtex]

  • Elfardy, Heba, Mona Diab and Chris Callison-Burch. (2015) Ideological Perspective Detection Using Semantic Features. Proceedings of 4th *SEM Conference, Denver CO, Jun 2015. [pdf] [bibtex]

  • Werner, Gregory, Vinodkumar Prabhakaran, Mona Diab and Owen Rambow. (2015) Committed Belief Tagging on the Factbank and LU Corpora: A Comparative Study. Proceedings of NAACL Workshop EXPROM, Denver CO, Jun’15. [pdf] [bibtex]

  • Zirikly, Ayah, Mona Diab. (2015) Named Entity Recognition for Arabic Social Media. Proceedings of NAACL Workshop on Vector Space Models for NLP, Denver, CO. Jun ’15. [pdf] [bibtex]

  • Zirikly, Ayah and Masato Hagiwara. (2015) Cross-Lingual Transfer of Named Entity Recognizers without Parallel Corpora. In Proceedings of the Association for Computational Linguistics (ACL), Beijing, China. [pdf] [bibtex]
2014
  • Abdul-Mageed, Muhammad, Mona Diab. (2014) SANA: A Large Scale Multi-Genre, Multi-Dialect Lexicon for Arabic Subjectivity and Sentiment Analysis. In Proceedings of Language Resources and Evaluation Conference (LREC 2014). May, Reykjavik, Iceland. [pdf] [bibtex]

  • Agirre, Eneko, Carmen Banea, Claire Cardie, Daniel Cer, Mona Diab, Aitor Gonzalez-Agirre, Weiwei Guo, Rada Mihalcea, German Rigau, Janyce Wiebe. (2014) SemEval-2014 Task 10: Multilingual Semantic Textual Similarity. In Proceedings of SEMEVAL 2014, COLING 2014, August, pp. 81, Dublin, Ireland. [pdf] [bibtex]

  • Aminian, Maryam, Mahmoud Ghoneim, Mona Diab. (2014) Handling OOV Words in Dialectal Arabic to English Machine Translation. In Proceedings of Workshop on Language Technology for Closely-Related Languages and Language Variants (LT4CloseLang), EMNLP 2014, pp. 97. [pdf] [bibtex]

  • Attia, Mohamed, Mohamed AlBadrashiny, Mona Diab. (2014) GWU-HASP: Hybrid Arabic Spelling and Punctuation Correction. In Proceedings of Workshop on Arabic Natural Language Processing (ANLP), EMNLP 2014, page 148. [pdf] [bibtex]

  • Diab, Mona, Mohamed AlBadrashiny, Maryam Aminian, Mohamed Attia, Heba ElFardy, Nizar Habash, Abdelati Hawwari, Wael Salloum, Pradeep Dasigi, Ramy Eskander. (2014) Tharwa: A Large Scale Dialectal Arabic – Standard Arabic – English Lexicon. In Proceedings of Language Resources and Evaluation Conference (LREC 2014). May, Reykjavik, Iceland. [pdf] [bibtex]

  • ElFardy, Heba, Mohamed AlBadrashiny, Mona Diab. (2014) AIDA: Identifying Code Switching in Informal Arabic Text. In Proceedings of First Workshop on Computational Approaches to Linguistic Code Switching (CodeSwitch), EMNLP 2014, pp. 94. [pdf] [bibtex]

  • Guo, Weiwei, Wei Liu, Mona Diab. (2014) Fast Tweet Retrieval with Compact Binary Codes. In Proceedings of COLING 2014, August, Dublin, Ireland. [pdf] [bibtex]

  • Hawwari, Abdelati, Mohamed Attia, Mona Diab. (2014). A Framework for the Classification and Annotation of Multiword Expressions in Dialectal Arabic. In Proceedings of Workshop on Arabic Natural Language Processing (ANLP), EMNLP 2014, pp. 48. [pdf] [bibtex]

  • Pasha, Arfath, Mohamed AlBadrashiny, Mona Diab, Ahmed ElKholy, Ramy Eskander, Nizar Habash, Manoj Poolery, Owen Rambow, Ryan Roth. (2014) MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic. In Proceedings of Language Resources and Evaluation Conference (LREC 2014). May, Reykjavik, Iceland. [pdf] [bibtex]

  • Salloum, Wael, Heba ElFardy, Linda Alamir-Salloum, Nizar Habash, Mona Diab. (2014). Sentence Level Dialect Identification for Machine Translation System Selection. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL 2014), June, Baltimore, MD. [pdf] [bibtex]

  • Solorio, Thamar, E. Blair, S. Maharjan, S. Bethard, M. Diab, M. Ghoneim, A. Hawwari, F. AlGhamdi, J. Hirschberg, A. Chang, P. Fung. (2014) Overview for the First Shared Task on Language Identification in Code Switched Data. In Proceedings of First Workshop on Computational Approaches to Linguistic Code Switching (CodeSwitch), EMNLP 2014, pp. 62. [pdf] [bibtex]

  • Zirikly, Ayah, Mona Diab. (2014) Named Entity Recognition for Dialectal Arabic. In Proceedings of Workshop on Arabic Natural Language Processing (ANLP), EMNLP 2014, pp. 78. [pdf] [bibtex]

2013
  • Abdul-Mageed, Muhammad, Mona Diab, Sandra Kubler. (2013) ASMA: A system for Automatic Segmentation and Morpho-syntactic Disambiguation of Modern Standard Arabic. In the Proceedings of Recent Advances in Natural Language Processing (RANLP 2013), September, Bulgaria. [pdf] [bibtex]

  • Abu-Jbara, Amjad, Ben King, Mona Diab and Dragomir Radev. (2013) Identifying Opinion Subgroups in Arabic Online Discussions. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, ACL 2013, Sofia, Bulgaria. [pdf] [bibtex]

  • Agirre, Eneko, Daniel Cer, Mona Diab, Aitor Gonzalez-Agirre and Weiwei Guo. (2013) *SEM 2013 shared task: Semantic Textual Similarity. In Proceedings of *SEM, 2013, Atlanta, Georgia, USA. [pdf] [bibtex]

  • ElFardy, Heba, and Mona Diab. (2013) Sentence-Level Dialect Identification in Arabic. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, ACL 2013, Sofia, Bulgaria. [pdf] [bibtex]

  • ElFardy, Heba, Mohamed AlBadrashiny, Mona Diab. (2013). Code-Switch Point Detection in Arabic. In Proceedings of the 18th International Conference on Application of Natural Language to Information Systems (NLDB 2013), MediaCity, UK. [pdf] [bibtex]

  • Ghoneim, Mahmoud and Mona Diab. (2013) Multiword Expressions in the context of Statistical Machine Translation. In the Proceedings of IJCNLP 2013, October, Nagoya, Japan. [pdf] [bibtex]

  • Guo, Weiwei, and Mona Diab. (2013) Improving Lexical Semantics for Sentential Semantics: Modeling Selectional Preference and Similar Words in a Latent Variable Model. In Proceedings of NAACL, 2013, Atlanta, Georgia, USA. [pdf] [bibtex]

  • Guo, Weiwei, Hao Li, Heng Ji and Mona Diab. (2013) Linking Tweets to News: A Framework to Enrich Online Short Text Data in Social Media. In Proceedings of ACL, 2013, Sofia, Bulgaria. [pdf] [bibtex]

  • Hawwari, Abdelati, Wajdi Zaghouani, Tim O'Gorman, Mona Diab, and Ahmed Badran. (2013) Building a Lexical Semantic Resource for Arabic Morphological Patterns. Proceedings of ICCSPA13, Sharjah, UAE. February 2013. [pdf] [bibtex]

  • Tomeh, Nadi, Nizar Habash, Ryan Roth, Noura Farra, Pradeep Dasigi and Mona Diab. (2013) Ensemble Reranking with Linguistic and Semantic Features for Arabic Character Recognition. In Proceedings of ACL, 2013, Sofia, Bulgaria. [pdf] [bibtex]

  • Zirikly, Aya and Mona Diab. (2013) ANEAR: Automatic Named Entity Aliasing Resolution. In Proceedings of the 18th International Conference on Application of Natural Language to Information Systems (NLDB 2013), MediaCity, UK. [pdf] [bibtex]
Peer Reviewed Workshop Papers
2016
  • Solorio, Thamar, Mona Diab, Fahad AlGhamdi, Mahmoud Ghoneim, Giovanni Molina, Julia Hirschberg, Victor Sotto. (2016) An overview of the shared task on multilingual linguistic code switching. Proceedings of EMNLP Workshop on Computational Approaches to Linguistic Code Switching (CALCS 2016), EMNLP, Austin TX, USA, Nov. [pdf] [bibtex]

  • AlBadrashiny, Mohamed and Mona Diab. (2016) A Simple Efficient Language Independent Framework for Linguistic Code Switch Point Detection. Proceedings of EMNLP Workshop on Computational Approaches to Linguistic Code Switching (CALCS 2016), EMNLP, Austin TX, USA, Nov. [pdf] [bibtex]

  • AlBadrashiny, Mohamed and Mona Diab. (2016) The George Washington University System for the Code-Switching Workshop Shared Task 2016. Proceedings of EMNLP Workshop on Computational Approaches to Linguistic Code Switching (CALCS 2016), EMNLP, Austin TX, USA, Nov. [pdf] [bibtex]

  • Ossama Obeid, Houda Bouamor, Wajdi Zaghouani, Mahmoud Ghoneim, Abdelati Hawwari, Mona Diab, Kemal Oflazer. (2016) MANDIAC: A Web-based Annotation System For Manual Arabic Diacritization. Proceedings of the 2nd Workshop on Arabic Corpora and Processing Tools, LREC 2016. BEST POSTER AWARD FROM QNRF. [pdf] [bibtex]

  • Abdul-Mageed, Muhammad, Hassan AlHuzliy, Duaa’ Abu Elhija, Mona Diab. (2016) DINA: A Multi-Dialect Dataset for Arabic Emotion Analysis. Proceedings of the 2nd Workshop on Arabic Corpora and Processing Tools, LREC 2016. [pdf] [bibtex]

  • Elfardy, Heba and Mona Diab. (2016) CU-GW Perspective at SemEval-2016 Task 6: Ideological Stance Detection in Informal Text. In Proceedings of the International Workshop on Semantic Evaluation (SemEval 2016), NAACL 2016. San Diego, CA, USA. [pdf] [bibtex]

  • Elfardy, Heba and Mona Diab. (2016) Annotation Complexity: The Case of Annotating Ideological Perspective in Egyptian Social Media. In Proceedings of the 10th Linguistic Annotation Workshop (LAW X), ACL 2016. Berlin, Germany. [pdf] [bibtex]

  • AlDarmaki, Hanan, and Mona Diab. (2016) GW-NLP at SemEval Task 1: Matrix Factorization for Cross Lingual STS. In Proceedings of the International Workshop on Semantic Evaluation (SemEval 2016), NAACL 2016. San Diego, CA, USA. [pdf] [bibtex]

  • AlDarmaki, Hanan, and Mona Diab. (2016) Learning Cross-Lingual Representations with Matrix Factorization. Proceedings of Multilingual and Cross-Lingual Methods in Computational Linguistics (MLCL), NAACL, San Diego CA, Jun. [pdf] [bibtex]

  • Hamidian, Sardar, and Mona Diab. (2016) Rumor Identification and Belief Investigation on Twitter. In Proceedings of Workshop on Computational Approaches to Subjectivity, Sentiment & Social Media Analysis (WASSA), NAACL 2016, San Diego CA, Jun. [pdf] [bibtex]

  • Agirre, Eneko, Carmen Banea, Daniel Cer, Mona Diab, Aitor Gonzalez-Agirre, Rada Mihalcea, German Rigau, Janyce Wiebe. (2016) SemEval-16 Task 1: Semantic Textual Similarity, Monolingual and Cross Lingual Evaluation. In Proceedings of SEMEVAL, North American Association for Computational Linguistics, NAACL 2016, San Diego CA, Jun. [pdf] [bibtex]
2015
  • Agirre, Eneko, Carmen Banea, Claire Cardie, Daniel Cer, Mona Diab, Aitor Gonzalez-Agirre, Weiwei Guo, Inigo Lopez-Gazpio, Montse Maritxalar, Rada Mihalcea, German Rigau, Larraitz Uria, Janyce Wiebe. (2015) SemEval-15 Task 2: Semantic Textual Similarity, English, Spanish and Pilot on Interpretability. In Proceedings of SEMEVAL, North American Association for Computational Linguistics (NAACL), Denver CO, USA. [pdf] [bibtex]

  • Aminian, Maryam, Mahmoud Ghoneim, Mona Diab. (2015) Unsupervised False Friend Disambiguation Using Contextual Word Clusters and Parallel Word Alignments. In Proceedings of Workshop 9th Semantics Syntax Statistical Translation (SSST), North American Association for Computational Linguistics (NAACL), Denver CO, USA. BEST PAPER AWARD. [pdf] [bibtex]

  • Werner, Gregory, Vinodkumar Prabhakaran, Mona Diab and Owen Rambow. (2015) Committed Belief Tagging on the Factbank and LU Corpora: A Comparative Study. In Proceedings of Workshop EXPROM, North American Association for Computational Linguistics (NAACL), Denver CO, USA. [pdf] [bibtex]

  • Zirikly, Ayah, Mona Diab. (2015) Named Entity Recognition for Arabic Social Media. In Proceedings of Workshop on Vector Space Models for NLP, North American Association for Computational Linguistics (NAACL), Denver CO, USA. [pdf] [bibtex]

  • Bouamor, Houda, Wajdi Zaghouani, Mona Diab, Ossama Obeid, Kemal Oflazer, Mahmoud Ghoneim, Abdelati Hawwari. (2015) A Pilot Study on Arabic Multi-Genre Corpus Diacritization. In Proceedings of Second Workshop on Arabic Natural Language Processing (ANLP), Association for Computational Linguistics (ACL), Beijing, China. [pdf] [bibtex]

  • Attia, Mohammed, Mohamed Al-Badrashiny, Mona Diab. (2015) GW-HASP-15@QALB-15 Shared Task: Priming Spelling Candidates with Probability. In Proceedings of Second Workshop on Arabic Natural Language Processing (ANLP), Association for Computational Linguistics (ACL), Beijing, China. [pdf] [bibtex]

  • Aldarmaki, Hanan and Mona Diab. (2015) Robust Part-of-speech Tagging of Arabic Text. In Proceedings of Second Workshop on Arabic Natural Language Processing (ANLP), Association for Computational Linguistics (ACL), Beijing, China. [pdf] [bibtex]

  • Hamidian, Sardar, and Mona Diab. (2015) Improved Automatic Rumor Detection. In Proceedings of The Fifth International Conference on Social Media Technologies, Communication, and Informatics (SOTICS), Barcelona, Spain. [pdf] [bibtex]

2014
  • Agirre, Eneko, Carmen Banea, Claire Cardie, Daniel Cer, Mona Diab, Aitor Gonzalez-Agirre, Weiwei Guo, Rada Mihalcea, German Rigau, Janyce Wiebe. (2014) SemEval-14 Task 10: Multilingual Semantic Textual Similarity. In Proceedings of SEMEVAL, Conference of Computational Linguistics (COLING), Dublin, Ireland. [pdf] [bibtex]

  • Aminian, Maryam, Mahmoud Ghoneim, Mona Diab. (2014) Handling OOV Words in Dialectal Arabic to English Machine Translation. In Proceedings of Workshop on Language Technology for Closely Related Languages and Language Variants (LT4CloseLang), Empirical Methods For Natural Language Processing (EMNLP), Doha, Qatar. [pdf] [bibtex]

  • Attia, Mohamed, Mohamed AlBadrashiny, Mona Diab. (2014) GW-HASP: Hybrid Arabic Spelling and Punctuation Correction. In Proceedings of Workshop on Arabic Natural Language Processing (ANLP), Empirical Methods For Natural Language Processing (EMNLP), Doha, Qatar. [pdf] [bibtex]

  • ElFardy, Heba, Mohamed AlBadrashiny, Mona Diab. (2014) AIDA: Identifying Code Switching in Informal Arabic Text. In Proceedings of First Workshop on Computational Approaches to Linguistic Code Switching (CodeSwitch), Empirical Methods For Natural Language Processing (EMNLP), Doha, Qatar. [pdf] [bibtex]

  • Hawwari, Abdelati, Mohamed Attia, Mona Diab. (2014) A Framework for the Classification and Annotation of Multiword Expressions in Dialectal Arabic. In Proceedings of Workshop on Arabic Natural Language Processing (ANLP), Empirical Methods For Natural Language Processing (EMNLP), Doha, Qatar. [pdf] [bibtex]

  • Solorio, Thamar, E. Blair, S. Maharjan, S. Bethard, M. Diab, M. Ghoneim, A. Hawwari, F. AlGhamdi, J. Hirschberg, A. Chang, P. Fung. (2014) Overview for the First Shared Task on Language Identification in Code Switched Data. In Proceedings of First Workshop on Computational Approaches to Linguistic Code Switching (CodeSwitch), Empirical Methods For Natural Language Processing (EMNLP), Doha, Qatar. [pdf] [bibtex]

  • Zirikly, Ayah, Mona Diab. (2014) Named Entity Recognition for Dialectal Arabic. In Proceedings of Workshop on Arabic Natural Language Processing (ANLP), Empirical Methods For Natural Language Processing (EMNLP), Doha, Qatar. [pdf] [bibtex]
George Washington University
Natural Language Processing lab
800 22nd St NW Suite 4934, Washington DC 20036