University of Bahrain
Scientific Journals

Global Spelling Correction in Context using Language Models: Application to the Arabic Language

Show simple item record

dc.contributor.author Laaroussi, Saida
dc.contributor.author Yousf, Abdellah
dc.contributor.author Aouragh, Si Lhoussain
dc.contributor.author Alaoui, Said Ouatik El
dc.date.accessioned 2023-01-29T19:50:02Z
dc.date.available 2023-01-29T19:50:02Z
dc.date.issued 2023-01-29
dc.identifier.issn 2210-142X
dc.identifier.uri https://journal.uob.edu.bh:443/handle/123456789/4748
dc.description.abstract Automatic spelling correction is a very important task used in many Natural Language Processing (NLP) applications such as Optical Character Recognition (OCR), Information retrieval, etc. There are many approaches able to detect and correct misspelled words. These approaches can be divided into two main categories: contextual and context-free approaches. In this paper, we propose a new contextual spelling correction method applied to the Arabic language, without loss of generality for other languages. The method is based on both the Viterbi algorithm and a probabilistic model built with a new estimate of n-gram language models combined with the edit distance. The probabilistic model is learned with an Arabic multipurpose corpus. The originality of our work consists in handling up global and simultaneous correction of a set of many erroneous words within sentences. The experiments carried out prove the performance of our proposal, giving encouraging results for the correction of several spelling errors in a given context. The method achieves a correction accuracy of up to 93.6% by evaluating the first given correction suggestion. It is able to take into account strong links between distant words carrying meaning in a given context. The high-level correction accuracy of our method allows for its integration into many applications. en_US
dc.language.iso en en_US
dc.publisher University of Bahrain en_US
dc.subject Global Contextual Correction, Single Error Correction, Misspelling, Viterbi Algorithm, n-gram Language Model, Edit Distance, Arabic NLP en_US
dc.title Global Spelling Correction in Context using Language Models: Application to the Arabic Language en_US
dc.type Article en_US
dc.identifier.doi http://dx.doi.org/10.12785/ijcds/130129
dc.volume 13 en_US
dc.issue 1 en_US
dc.pagestart 361 en_US
dc.pageend 370 en_US
dc.contributor.authoraffiliation ES-Lab, ENSA, Ibn Tofail University, Kenitra, Morocco en_US
dc.contributor.authoraffiliation FSJES Souissi, Mohamed V University in Rabat, Morocco en_US
dc.contributor.authoraffiliation ICES Team, ENSIAS, Mohamed V University in Rabat, Morocco en_US
dc.source.title International Journal of Computing and Digital Systems en_US
dc.abbreviatedsourcetitle IJCDS en_US


Files in this item

This item appears in the following Issue(s)

Show simple item record

All Journals


Advanced Search

Browse

Administrator Account