University of Bahrain
Scientific Journals

Exploring Sentence Embedding Representation for Arabic Question/Answering

Show simple item record

dc.contributor.author Lahbari, Imane
dc.contributor.author El Alaou, Sa¨ıd Ouatik
dc.date.accessioned 2023-07-17T04:06:12Z
dc.date.available 2023-07-17T04:06:12Z
dc.date.issued 2024-03-1
dc.identifier.issn 2210-142X
dc.identifier.uri https://journal.uob.edu.bh:443/handle/123456789/5012
dc.description.abstract Question Answering Systems (QAS) are made to automatically respond with precise information to user questions that are phrased in natural language. Due to its intricate and rich morphology, Arabic QAS poses a significant problem. Information retrieval, text summarization, and question-answering systems all fall under the category of natural language processing activities where text representation is a critical step. Comparing SE representation to more traditional approaches like bag-of-words and word embedding, it has demonstrated encouraging results. In this study, we introduce a novel QA approach for the Arabic language that is based on passage retrieval and SE representation. It consists of three steps: ”Question classification and query formulation”, ”Documents and passages retrieval”, and then ”Answers extraction”. In this work, we adopt the AraBert pre-trained model to compute vector representation. It allows us to consider implicit semantics and the words’ context within the text. Furthermore, in order to collect potential passages for user questions, we investigate a method for retrieving Arabic passages using the BM25 model, a query expansion process, and SE representation. The final answer is extracted by fine-tuning AraBERT parameters by ranking passages and extracting the most relevant ones. We carry out a number of tests with the CLEF and TREC datasets by following two different taxonomies. The outcomes demonstrate the efficacy of our methodology en_US
dc.language.iso en en_US
dc.publisher University of Bahrain en_US
dc.subject Question Answering en_US
dc.subject Arabic en_US
dc.subject NLP en_US
dc.subject Word Embedding en_US
dc.subject Sentence Embedding en_US
dc.subject AraBERT en_US
dc.subject Elmo en_US
dc.subject FastText en_US
dc.subject Fine-tuning en_US
dc.title Exploring Sentence Embedding Representation for Arabic Question/Answering en_US
dc.identifier.doi https://dx.doi.org/10.12785/ijcds/150187
dc.volume 15 en_US
dc.issue 1 en_US
dc.pagestart 1229 en_US
dc.pageend 1241 en_US
dc.contributor.authorcountry Morocco en_US
dc.contributor.authoraffiliation Ibn Tofail University en_US
dc.source.title International Journal of Computing and Digital Systems en_US
dc.abbreviatedsourcetitle IJCDS en_US


Files in this item

This item appears in the following Issue(s)

Show simple item record

All Journals


Advanced Search

Browse

Administrator Account