Do LLMs Speak Kazakh? A Pilot Evaluation of Seven Models

Akylbek Maxutov, Ayan Myrzakhmet, Pavel Braslavski

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Citation (Scopus)

Abstract

We conducted a systematic evaluation of seven large language models (LLMs) on tasks in Kazakh, a Turkic language spoken by approximately 13 million native speakers in Kazakhstan and abroad. We used six datasets corresponding to different tasks: question answering, causal reasoning, middle school math problems, machine translation, and spelling correction. Three of the datasets were prepared for this study. As expected, the quality of the LLMs on the Kazakh tasks is lower than on the parallel English tasks. GPT-4 shows the best results, followed by Gemini and AYA. In general, LLMs perform better on classification tasks and struggle with generative tasks. Our results provide valuable insights into the applicability of currently available LLMs for Kazakh. We made the data collected for this study publicly available: https://github.com/akylbekmaxutov/LLM-eval-using-Kazakh.

Original language: English
Title of host publication: SIGTURK 2024 - 1st Workshop on Natural Language Processing for Turkic Languages, Proceedings of the Workshop
Editors: Duygu Ataman, Mehmet Oguz Derin, Sardana Ivanova, Abdullatif Koksal, Jonne Saleva, Deniz Zeyrek
Publisher: Association for Computational Linguistics (ACL)
Pages: 81-91
Number of pages: 11
ISBN (Electronic): 9798891761407
Publication status: Published - 2024
Event: 1st Workshop on Natural Language Processing for Turkic Languages, SIGTURK 2024 - Hybrid, Bangkok, Thailand
Duration: Aug 15 2024 → …

Publication series

Name: SIGTURK 2024 - 1st Workshop on Natural Language Processing for Turkic Languages, Proceedings of the Workshop

Conference

Conference: 1st Workshop on Natural Language Processing for Turkic Languages, SIGTURK 2024
Country/Territory: Thailand
City: Hybrid, Bangkok
Period: 8/15/24 → …

ASJC Scopus subject areas

  • Language and Linguistics
  • Computational Theory and Mathematics
  • Software
  • Linguistics and Language
