dc.contributor.author |
Laaroussi, Saida |
|
dc.contributor.author |
Yousf, Abdellah |
|
dc.contributor.author |
Aouragh, Si Lhoussain |
|
dc.contributor.author |
Alaoui, Said Ouatik El |
|
dc.date.accessioned |
2023-01-29T19:50:02Z |
|
dc.date.available |
2023-01-29T19:50:02Z |
|
dc.date.issued |
2023-01-29 |
|
dc.identifier.issn |
2210-142X |
|
dc.identifier.uri |
https://journal.uob.edu.bh:443/handle/123456789/4748 |
|
dc.description.abstract |
Automatic spelling correction is an important task in many Natural Language Processing (NLP) applications, such
as Optical Character Recognition (OCR) and information retrieval. Many approaches can detect and correct misspelled
words; they fall into two main categories: contextual and context-free approaches. In this paper, we propose a
new contextual spelling correction method applied to the Arabic language, without loss of generality for other languages. The method
is based on the Viterbi algorithm and a probabilistic model built from a new estimate of n-gram language models combined
with the edit distance. The probabilistic model is learned from a multipurpose Arabic corpus. The originality of our work lies
in handling the global and simultaneous correction of several erroneous words within a sentence. The experiments carried out
give encouraging results for the correction of several spelling errors in a given context. The
method achieves a correction accuracy of up to 93.6% when only the first correction suggestion is evaluated. It is able to take into account
strong links between distant, meaning-bearing words in a given context. The high correction accuracy of our method allows for
its integration into many applications. |
en_US |
dc.language.iso |
en |
en_US |
dc.publisher |
University of Bahrain |
en_US |
dc.subject |
Global Contextual Correction, Single Error Correction, Misspelling, Viterbi Algorithm, n-gram Language Model, Edit Distance, Arabic NLP |
en_US |
dc.title |
Global Spelling Correction in Context using Language Models: Application to the Arabic Language |
en_US |
dc.type |
Article |
en_US |
dc.identifier.doi |
http://dx.doi.org/10.12785/ijcds/130129 |
|
dc.volume |
13 |
en_US |
dc.issue |
1 |
en_US |
dc.pagestart |
361 |
en_US |
dc.pageend |
370 |
en_US |
dc.contributor.authoraffiliation |
ES-Lab, ENSA, Ibn Tofail University, Kenitra, Morocco |
en_US |
dc.contributor.authoraffiliation |
FSJES Souissi, Mohamed V University in Rabat, Morocco |
en_US |
dc.contributor.authoraffiliation |
ICES Team, ENSIAS, Mohamed V University in Rabat, Morocco |
en_US |
dc.source.title |
International Journal of Computing and Digital Systems |
en_US |
dc.abbreviatedsourcetitle |
IJCDS |
en_US |
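
The abstract above describes combining the Viterbi algorithm, an n-gram language model, and the edit distance to correct several misspelled words in a sentence at once. The following is only a minimal sketch of that general idea, not the authors' method: it assumes a bigram model with add-one smoothing (the paper's own n-gram estimate is not given in this record), a hypothetical `penalty` weight per edit, plain Levenshtein distance, and a tiny English toy corpus standing in for the Arabic training corpus. All function and parameter names are illustrative.

```python
import math
from collections import Counter


def edit_distance(a, b):
    """Plain Levenshtein distance (insert, delete, substitute, each cost 1)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution or match
        prev = cur
    return prev[-1]


def train_bigrams(sentences):
    """Count unigrams and bigrams, with a sentence-start marker '<s>'."""
    unigrams, bigrams = Counter(), Counter()
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split()
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))
    return unigrams, bigrams


def viterbi_correct(words, unigrams, bigrams, max_dist=2, penalty=2.0):
    """Pick the jointly most likely sequence of vocabulary words.

    Assumption: each candidate's score is its smoothed bigram log-probability
    minus `penalty` times its edit distance to the observed word.
    """
    vocab = [w for w in unigrams if w != "<s>"]
    vocab_size = len(vocab)

    def log_bigram(prev, word):
        # Add-one (Laplace) smoothing so unseen bigrams keep a nonzero score.
        return math.log((bigrams[(prev, word)] + 1) / (unigrams[prev] + vocab_size))

    def candidates(word):
        scored = [(v, edit_distance(word, v)) for v in vocab]
        near = [(v, d) for v, d in scored if d <= max_dist]
        return near or [(word, 0)]  # keep the word itself if nothing is close

    # best maps the last chosen word to (log-score, path so far).
    best = {"<s>": (0.0, [])}
    for word in words:
        nxt = {}
        for cand, dist in candidates(word):
            score, path = max(
                ((s + log_bigram(p, cand) - penalty * dist, pth)
                 for p, (s, pth) in best.items()),
                key=lambda t: t[0],
            )
            nxt[cand] = (score, path + [cand])
        best = nxt
    return max(best.values(), key=lambda t: t[0])[1]


# Toy English corpus used purely for illustration.
corpus = [
    "the cat sat on the mat",
    "the dog sat on the rug",
    "a cat ate the fish",
]
unigrams, bigrams = train_bigrams(corpus)
corrected = viterbi_correct("the cta sat on teh mat".split(), unigrams, bigrams)
print(" ".join(corrected))
```

Because every position is scored jointly along a single path, the two errors in the toy sentence ("cta" and "teh") are resolved together rather than one at a time, which is the sense of "global and simultaneous correction" this sketch tries to convey.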