A free/open-source hybrid morphological disambiguation tool for Kazakh

Zhenisbek Assylbekov, Jonathan Washington, Francis Tyers, Assulan Nurkas, Aida Sundetova, Aidana Karibayeva, Balzhan Abduali, Dina Amirova

Research output: Contribution to conference (Paper)

Abstract

This paper presents the results of developing a morphological disambiguation tool for Kazakh. Starting with a previously developed rule-based approach, we tried to cope with the complex morphology of Kazakh by breaking up lexical forms across their derivational boundaries into inflectional groups and modeling their behavior with statistical methods. A hybrid rule-based/statistical approach appears to benefit morphological disambiguation, demonstrating a per-token accuracy of 91% in running text.
Original language: English
Publication status: Published - 2016
Event: The First International Conference on Turkic Computational Linguistics - Konya, Turkey
Duration: Apr 2 2016 to Apr 8 2016
http://nur.nu.edu.kz/bitstream/handle/123456789/1692/kaz-tagger.pdf?sequence=1

Conference

Conference: The First International Conference on Turkic Computational Linguistics
Abbreviated title: TurCLing 2016
Country: Turkey
City: Konya
Period: 4/2/16 to 4/8/16

Fingerprint

Statistical methods

Cite this

Assylbekov, Z., Washington, J., Tyers, F., Nurkas, A., Sundetova, A., Karibayeva, A., ... Amirova, D. (2016). A free/open-source hybrid morphological disambiguation tool for Kazakh. Paper presented at The First International Conference on Turkic Computational Linguistics, Konya, Turkey.

A free/open-source hybrid morphological disambiguation tool for Kazakh. / Assylbekov, Zhenisbek; Washington, Jonathan; Tyers, Francis; Nurkas, Assulan; Sundetova, Aida; Karibayeva, Aidana; Abduali, Balzhan; Amirova, Dina.

2016. Paper presented at The First International Conference on Turkic Computational Linguistics, Konya, Turkey.

Assylbekov, Z, Washington, J, Tyers, F, Nurkas, A, Sundetova, A, Karibayeva, A, Abduali, B & Amirova, D 2016, 'A free/open-source hybrid morphological disambiguation tool for Kazakh', Paper presented at The First International Conference on Turkic Computational Linguistics, Konya, Turkey, 4/2/16 - 4/8/16.
Assylbekov Z, Washington J, Tyers F, Nurkas A, Sundetova A, Karibayeva A et al. A free/open-source hybrid morphological disambiguation tool for Kazakh. 2016. Paper presented at The First International Conference on Turkic Computational Linguistics, Konya, Turkey.
Assylbekov, Zhenisbek ; Washington, Jonathan ; Tyers, Francis ; Nurkas, Assulan ; Sundetova, Aida ; Karibayeva, Aidana ; Abduali, Balzhan ; Amirova, Dina. / A free/open-source hybrid morphological disambiguation tool for Kazakh. Paper presented at The First International Conference on Turkic Computational Linguistics, Konya, Turkey.
@conference{2abcf1da2c8445e39cddd85ee475f0e8,
title = "A free/open-source hybrid morphological disambiguation tool for Kazakh",
abstract = "This paper presents the results of developing a morphological disambiguation tool for Kazakh. Starting with a previously developed rule-based approach, we tried to cope with the complex morphology of Kazakh by breaking up lexical forms across their derivational boundaries into inflectional groups and modeling their behavior with statistical methods. A hybrid rule-based/statistical approach appears to benefit morphological disambiguation demonstrating a per-token accuracy of 91{\%} in running text.",
author = "Zhenisbek Assylbekov and Jonathan Washington and Francis Tyers and Assulan Nurkas and Aida Sundetova and Aidana Karibayeva and Balzhan Abduali and Dina Amirova",
year = "2016",
language = "English",
note = "The First International Conference on Turkic Computational Linguistics, TurCLing 2016 ; Conference date: 02-04-2016 Through 08-04-2016",
url = "http://nur.nu.edu.kz/bitstream/handle/123456789/1692/kaz-tagger.pdf?sequence=1",

}

TY - CONF

T1 - A free/open-source hybrid morphological disambiguation tool for Kazakh

AU - Assylbekov, Zhenisbek

AU - Washington, Jonathan

AU - Tyers, Francis

AU - Nurkas, Assulan

AU - Sundetova, Aida

AU - Karibayeva, Aidana

AU - Abduali, Balzhan

AU - Amirova, Dina

PY - 2016

Y1 - 2016

N2 - This paper presents the results of developing a morphological disambiguation tool for Kazakh. Starting with a previously developed rule-based approach, we tried to cope with the complex morphology of Kazakh by breaking up lexical forms across their derivational boundaries into inflectional groups and modeling their behavior with statistical methods. A hybrid rule-based/statistical approach appears to benefit morphological disambiguation demonstrating a per-token accuracy of 91% in running text.

AB - This paper presents the results of developing a morphological disambiguation tool for Kazakh. Starting with a previously developed rule-based approach, we tried to cope with the complex morphology of Kazakh by breaking up lexical forms across their derivational boundaries into inflectional groups and modeling their behavior with statistical methods. A hybrid rule-based/statistical approach appears to benefit morphological disambiguation demonstrating a per-token accuracy of 91% in running text.

M3 - Paper

ER -