Repository logo
Article

Diacritic-aware Yorùbá spell checker

creativeworkseries.issn1508-2806
dc.contributor.authorAsahiah, Franklin Oládiípò
dc.contributor.authorOnífádé, Mary Taiwo
dc.contributor.authorAsahiah, Adekemisola Olufunmilayo
dc.contributor.authorAdegunlehin, Abayomi Emmanuel
dc.contributor.authorAmoo, Adekemi Olawunmi
dc.date.available2025-06-20T08:09:40Z
dc.date.issued2023
dc.descriptionBibliogr. s. 49-50.
dc.description.abstractSpell checking and correction is still in its infancy for the Yorùbá language, existing tools cannot be directly applied to address the problem, as Yorùbá uses diacritics extensively for distinguishing phonemes and for marking tone. A model was formulated as a parallel combination of a unigram language model and a diacritic model to form a dictionary sub-model that can be used by error-detection and candidate-generation modules. The candidate-generation module was implemented as a reverse Levensthein edit-distance algorithm. The system was evaluated by using detection accuracy (calculated from the precision and recall) and suggestion accuracy (SA) as metrics. Our experimental setups compared the performance of the component subsystems when used alone and with their combination into a unified model. The detection accuracies for the different models range from 93.23 to 95.01%, and the suggestion accuracies range from 26.94 to 72.10%. The results indicated that each of the sub-models in the dictionary played different roles.en
dc.description.placeOfPublicationKraków
dc.description.versionwersja wydawnicza
dc.identifier.doihttps://doi.org/10.7494/csci.2023.24.1.4494
dc.identifier.eissn2300-7036
dc.identifier.issn1508-2806
dc.identifier.urihttps://repo.agh.edu.pl/handle/AGH/113321
dc.language.isoeng
dc.publisherWydawnictwa AGH
dc.relation.ispartofComputer Science
dc.rightsAttribution 4.0 International
dc.rights.accessotwarty dostęp
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/legalcode
dc.subjecttoneen
dc.subjectphonemesen
dc.subjectdiacriticen
dc.subjectunigramen
dc.subjecttoolsen
dc.titleDiacritic-aware Yorùbá spell checkeren
dc.title.relatedComputer Scienceen
dc.typeartykuł
dspace.entity.typePublication
publicationissue.issueNumberNo. 1
publicationissue.paginationpp. 31-51
publicationvolume.volumeNumberVol. 24
relation.isJournalIssueOfPublication4520476e-e012-47d7-90e5-2cd72419c0f2
relation.isJournalIssueOfPublication.latestForDiscovery4520476e-e012-47d7-90e5-2cd72419c0f2
relation.isJournalOfPublication020291ee-249b-4dcf-98a3-276a2f7981aa

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
csci.2023.24.1.31.pdf
Size:
928.49 KB
Format:
Adobe Portable Document Format