Spelling correction for kazakh

Aibek Makazhanov, Olzhas Makhambetov, Islam Sabyrgaliyev, Zhandos Yessenbayev

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Citations (Scopus)

Abstract

Being an agglutinative language Kazakh imposes certain difficulties on both recognition of correct words and generation of candidate corrections for misspelled words. In this paper we describe a spelling correction method for Kazakh that takes advantage of both morphological analysis and noisy channel-based model. Our method outperforms both open source and commercial analogues in terms of the overall accuracy. We performed a comparative analysis of the spelling correction tools and pointed out some problems of spelling correction for agglutinative languages in general and for Kazakh in particular.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherSpringer Verlag
Pages533-541
Number of pages9
Volume8404 LNCS
EditionPART 2
ISBN (Print)9783642549021
DOIs
Publication statusPublished - 2014
Event15th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2014 - Kathmandu, Nepal
Duration: Apr 6 2014Apr 12 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 2
Volume8404 LNCS
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other15th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2014
CountryNepal
CityKathmandu
Period4/6/144/12/14

Fingerprint

Morphological Analysis
Comparative Analysis
Open Source
Analogue
Language
Model

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Makazhanov, A., Makhambetov, O., Sabyrgaliyev, I., & Yessenbayev, Z. (2014). Spelling correction for kazakh. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (PART 2 ed., Vol. 8404 LNCS, pp. 533-541). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 8404 LNCS, No. PART 2). Springer Verlag. https://doi.org/10.1007/978-3-642-54903-8_44

Spelling correction for kazakh. / Makazhanov, Aibek; Makhambetov, Olzhas; Sabyrgaliyev, Islam; Yessenbayev, Zhandos.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 8404 LNCS PART 2. ed. Springer Verlag, 2014. p. 533-541 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 8404 LNCS, No. PART 2).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Makazhanov, A, Makhambetov, O, Sabyrgaliyev, I & Yessenbayev, Z 2014, Spelling correction for kazakh. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). PART 2 edn, vol. 8404 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), no. PART 2, vol. 8404 LNCS, Springer Verlag, pp. 533-541, 15th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2014, Kathmandu, Nepal, 4/6/14. https://doi.org/10.1007/978-3-642-54903-8_44
Makazhanov A, Makhambetov O, Sabyrgaliyev I, Yessenbayev Z. Spelling correction for kazakh. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). PART 2 ed. Vol. 8404 LNCS. Springer Verlag. 2014. p. 533-541. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 2). https://doi.org/10.1007/978-3-642-54903-8_44
Makazhanov, Aibek ; Makhambetov, Olzhas ; Sabyrgaliyev, Islam ; Yessenbayev, Zhandos. / Spelling correction for kazakh. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 8404 LNCS PART 2. ed. Springer Verlag, 2014. pp. 533-541 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 2).
@inproceedings{8e2250acae584e39b15e392210a8746d,
title = "Spelling correction for kazakh",
abstract = "Being an agglutinative language Kazakh imposes certain difficulties on both recognition of correct words and generation of candidate corrections for misspelled words. In this paper we describe a spelling correction method for Kazakh that takes advantage of both morphological analysis and noisy channel-based model. Our method outperforms both open source and commercial analogues in terms of the overall accuracy. We performed a comparative analysis of the spelling correction tools and pointed out some problems of spelling correction for agglutinative languages in general and for Kazakh in particular.",
author = "Aibek Makazhanov and Olzhas Makhambetov and Islam Sabyrgaliyev and Zhandos Yessenbayev",
year = "2014",
doi = "10.1007/978-3-642-54903-8_44",
language = "English",
isbn = "9783642549021",
volume = "8404 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
number = "PART 2",
pages = "533--541",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
address = "Germany",
edition = "PART 2",

}

TY - GEN

T1 - Spelling correction for kazakh

AU - Makazhanov, Aibek

AU - Makhambetov, Olzhas

AU - Sabyrgaliyev, Islam

AU - Yessenbayev, Zhandos

PY - 2014

Y1 - 2014

N2 - Being an agglutinative language Kazakh imposes certain difficulties on both recognition of correct words and generation of candidate corrections for misspelled words. In this paper we describe a spelling correction method for Kazakh that takes advantage of both morphological analysis and noisy channel-based model. Our method outperforms both open source and commercial analogues in terms of the overall accuracy. We performed a comparative analysis of the spelling correction tools and pointed out some problems of spelling correction for agglutinative languages in general and for Kazakh in particular.

AB - Being an agglutinative language Kazakh imposes certain difficulties on both recognition of correct words and generation of candidate corrections for misspelled words. In this paper we describe a spelling correction method for Kazakh that takes advantage of both morphological analysis and noisy channel-based model. Our method outperforms both open source and commercial analogues in terms of the overall accuracy. We performed a comparative analysis of the spelling correction tools and pointed out some problems of spelling correction for agglutinative languages in general and for Kazakh in particular.

UR - http://www.scopus.com/inward/record.url?scp=84958531991&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84958531991&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-54903-8_44

DO - 10.1007/978-3-642-54903-8_44

M3 - Conference contribution

AN - SCOPUS:84958531991

SN - 9783642549021

VL - 8404 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 533

EP - 541

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

PB - Springer Verlag

ER -