Home > CSC-OpenAccess Library > Manuscript Information
EXPLORE PUBLICATIONS BY COUNTRIES |
EUROPE | |
MIDDLE EAST | |
ASIA | |
AFRICA | |
............................. | |
United States of America | |
United Kingdom | |
Canada | |
Australia | |
Italy | |
France | |
Brazil | |
Germany | |
Malaysia | |
Turkey | |
China | |
Taiwan | |
Japan | |
Saudi Arabia | |
Jordan | |
Egypt | |
United Arab Emirates | |
India | |
Nigeria |
Design of A Spell Corrector For Hausa Language
Lawaly Salifou, Harouna  Naroua
Pages - 14 - 26 | Revised - 10-05-2014 | Published - 01-06-2014
MORE INFORMATION
KEYWORDS
Natural Language Processing, Spell Checker, Spell Corrector, Computerization of Hausa, African Languages.
ABSTRACT
In this article, a spell corrector has been designed for the Hausa language which is the second most spoken language in Africa and do not yet have processing tools. This study is a contribution to the automatic processing of the Hausa language. We used existing techniques for other languages and adapted them to the special case of the Hausa language. The corrector designed operates essentially on Mijinguini’s dictionary and characteristics of the Hausa alphabet. After a brief review on spell checking and spell correcting techniques and the state of art in the Hausa language processing, we opted for the data structures trie and hash table to represent the dictionary. The edit distance and the specificities of the Hausa alphabet have been used to detect and correct spelling errors. The implementation of the spell corrector has been made on a special editor developed for that purpose (LyTexEditor) but also as an extension (add-on) for OpenOffice.org. A comparison was made on the performance of the two data structures used.
1 | Al-Bakry, A. M., & Al-Rikaby, M. K. Enhanced Levenshtein Edit Distance Method functioning as a String-to-String Similarity Measure. Distributed Agents for Web Content Filtering, 48. |
2 | Negesse, F. (2015). Classification of Oromo Dialects: A Computational Approach. |
3 | Nijim, A., El Shenawy, A., Mostafa, M. T., & Alez, R. A. Anovel Approach for recognizing text in arabic ancient manuscripts. |
A. Mijiguin and H. Naroua. "Règles de formation des noms en haoussa", Actes de la conférence conjointe JEP-TALN-RECITAL 2012, Atelier TALAf 2012: Traitement Automatique des Langues Africaines, pp. 63-74, 2012. | |
A. Minjinguini. "Dictionnaire élémentaire hausa-français", les éditions GG, 2003. | |
A.V. Aho and M.J. Corasick. "Efficient String Matching: An Aid to Bibliographic Search",Communications of the ACM, vol. 18, No. 6, pp. 333-340, June 1975. | |
A.V. Van Der and D.S. Gilles-Maurice. " The African Languages on the Internet: Case Studies for Hausa, Somali, Lingala and isiXhosa", Cahiers Du Rifal, vol. 23, pp. 33–45,2003. | |
Arrêté N°0212 MEN/SP-CNRE du 19 oct. 1999 modifiant et complétant l'arrêté n°01/MEN/SCNRE/MJSC/MESR/M.INF/MDR/MI du 15 mars 1981 relatif à l'orthographe de la langue hausa. | |
B. Horst. "A Fast Algorithm for Finding the Nearest Neighbor of a Word in a Dictionary",IAM-93-025, November 1993. | |
B. Laurent. "Production de logiciels et d'utilitaires pour le traitement informatique de langues africaines dans un contexte de NTIC multilingues", 2nd World Congress of Community Networks, Buenos Aires, Argentine, du 5 au 7 décembre 2001. | |
C. Bernard. "Les langues au Nigeria", Notre Librairie, Revue des littératures du Sud, Littératures du Nigéria et du Ghana, vol. 2, no. 141, pp. 8-15, 2000 | |
C. Chanard and A. Popescu-Belis. "Encodage informatique multilingue : application au contexte du Niger", Les Cahiers du Rifal, No. 22, pp. 33-45, 2001.[33]Christophe D.,"Apprendre à programmer, algorithmes et conception objet", 2e ed., Eyrolles, 2008. | |
C. Enguehard and C. Mbodj. "Des correcteurs orthographiques pour les langues africaines", Bulletin de Linguistique Appliquée et Générale, 2004. | |
C. Enguehard and H. Naroua. "Evaluation of Virtual Keyboards for West-African Languages", Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakech, Morocco, 28-30 May 2008. | |
D. Christophe. "Apprendre à programmer, algorithmes et conception objet", 2e ed.,Eyrolles, 2008. | |
D.S. Gilles-Maurice. "Web for/as Corpus: A Perspective for the African Languages", Nordic Journal of African Studies, vol. 11, No. 2, pp. 266-282, 2002. | |
E.M. Zamora. Pollock J. J. and Antonio Z., "The use of trigram analysis for spelling error detection", Information Processing & Management, vol. 17. No. 6, pp. 305-316, 1981. | |
F.J. Damerau. "A technique for computer detection and correction of spelling errors",Comm. ACM 7, vol. 3 , pp. 171-176, March 1964. | |
G.P. Bargery. "A Hausa-English Dictionary and English-Hausa Vocabulary", Oxford University Press, London, 1934. | |
G.S. Lehal and K. Singh. "A Comparative Study of Data Structures for Punjabi Dictionary",5th International Conference on Cognitive Systems, reviews & previews, ICCS’99, pp. 489-497, 2000. | |
http://www.humnet.ucla.edu/humnet/aflang/Hausa/hausa.html. | |
J. Daniel and H.M. James, "Speech and Language Processing", Prentice Hall, Englewood Cliffs, Inc., 2000. | |
J.L. Peterson . "Computer Programs for Detecting and Correcting Spelling Errors", Comm.ACM, vol. 23, No. 12, December 1980. | |
K. Donald. "The Art of Computer Programming", Addison-Wesley Publishing Co.,Philippines, vol. 3, 1973. | |
K. Kukich. "Techniques for automatically correcting words in text", ACM Computing Surveys, vol. 24, No. 4, 1992. | |
L.L. Hsuan. "Spell Checkers and Correctors: a unified treatment", Master dissertation,November 2008 | |
M. Brett, P. Gary and W. David. "Head First Object-Oriented Analysis and Design",O'Reilly, Nov. 2006. | |
M.G. Maman and H.H. Seydou. "Les Langues de scolarisation dans l’enseignement fondamental en Afrique subsaharienne francophone : cas du Niger", Rapport d’étude pays,2010. | |
M.N. Pierre. "An introduction to language processing with Perl and Prolog", Springer-Verlag Berlin Heidelberg, p.2-3, 2006. | |
M.N. Roxana and N. Paul. "The Hausa Lexicographic Tradition", Lexikos11, AFRILEXreeks,series 11, pp. 263-286, 2001 | |
N. Ahmed. "Adaptation des écritures et de la lecture des langues étrangères au pays Haoussa de l’Afrique de l’Ouest", Synergies Algérie n°6 – 2009, pp. 61-69, 2009. | |
N. Paul. "The Hausa Language An Encyclopedic Reference Grammar", Yale University Press, New Haven, 2000. | |
N.A. Cyril. "String similarity and misspellings", Communications of the A.C.M., vol. 10, no.5, pp. 302-313, May 1967. | |
O. Don. "Les langues africaines a l’ère du numerique, défis et opportunités de l’informatisation des langues autochtones", Les Presses de l’Université Laval, CRDI 2011. | |
OpenOffice.org 3.1 Developer's Guide, https://wiki.openoffice.org. | |
P.N. Mark. "A Comparison of Dictionary Implementations", April 10, 2009. | |
S. Robert and W. Kevin. "Algorithms", 4e ed., Addison Wisley, 2011 | |
The Online Encyclopedia of Writing Systems & Language, 16/12/2013 09:31,http://www.omniglot.com/writing/definition.htm. | |
V. Suzan. "Context-sensitive spell checking based on word trigram probabilities", Master thesis, February - August 2002. | |
Mr. Lawaly Salifou
Université Abdou Moumouni de Niamey - Niger
Associate Professor Harouna  Naroua
Université Abdou Moumouni de Niamey - Niger
hnaroua@yahoo.com
|
|
|
|
View all special issues >> | |
|
|