Skip to main content
User Image

مها بنت محمد بن عبدالعزيز اليحيى Maha Al-Yahya

Associate Professor

عضو هيئة تدريس في قسم تقنية المعلومات

علوم الحاسب والمعلومات
مبنى رقم ٦- الدور الأرضي
publication
Journal Article
2022

AraConv: Developing an Arabic Task-Oriented Dialogue System Using Multi-Lingual Transformer Model mT5

Applied Sciences

Abstract: Task-oriented dialogue systems (DS) are designed to help users perform daily activities using natural language. Task-oriented DS for English language have demonstrated promising performance outcomes; however, developing such systems to support Arabic remains a challenge. This challenge is mainly due to the lack of Arabic dialogue datasets. This study introduces the first Arabic end-to-end generative model for task-oriented DS (AraConv), which uses the multi- lingual transformer model mT5 with different settings. We also present an Arabic dialogue dataset (Arabic-TOD) and used it to train and test the proposed AraConv model. The results obtained are reasonable compared to those reported in the studies of English and Chinese using the same mono-lingual settings. To avoid problems associated with a small training dataset and to improve the AraConv model’s results, we suggest joint-training, in which the model is jointly trained on Arabic dialogue data and data from one or two high-resource languages such as English and Chinese. The findings indicate the AraConv model performed better in the joint-training setting than in the mono-lingual setting. The results obtained from AraConv on the Arabic dialogue dataset provide a baseline for other researchers to build robust end-to-end Arabic task-oriented DS that can engage with complex scenarios.
Keywords: task-oriented dialogue systems; Arabic; multi-lingual transformer model; mT5; natural language processing

Publication Work Type
Original Article
more of publication
publications

Fake news detection (FND) remains a challenge due to its vast and varied sources, especially on social media platforms. While numerous attempts have been made by academia and the industry to…

by Lama Al-Zahrani , Maha Al-Yahya
2024
publications

Authorship attribution (AA) is a field of natural language processing that aims to attribute text to its author. Although the literature includes several studies on Arabic AA in general, applying…

by AlZahrani, F.M.; , Al-Yahya, M.
2023
publications

Abstract: In the domain of law and legal systems, jurisprudence principles (JPs) are considered major sources of legislative reasoning by jurisprudence scholars. Generally accepted JPs are often…

by Nafla AlRumayyan, Maha Al-Yahya
2022