TY - GEN
T1 - Retrospective of Kazakh-Russian Sign Language Corpus Formation
AU - Imashev, Alfarabi
AU - Kydyrbekova, Aigerim
AU - Mukushev, Medet
AU - Sandygulova, Anara
AU - Islam, Shynggys
AU - Israilov, Khassan
AU - Makazhanov, Aibek
AU - Yessenbayev, Zhandos
N1 - Publisher Copyright:
© 2024 ELRA Language Resources Association: CC BY-NC 4.0.
PY - 2024
Y1 - 2024
N2 - Sign language (SL) is a mode of communication that, in most cases, relies on visual perception exclusively and uses the visual-gestural modality. The advent of machine learning techniques has expanded the range of potential applications, not only in industry but also in addressing societal needs. Previous research has already demonstrated encouraging outcomes in developing sign language recognition systems that are both quite accurate and resilient. Nevertheless, the effectiveness and use of algorithms are impacted not only by their accessibility but also, at times to a greater extent, by the presence of substantial quantities of pertinent data. At the start of the local sign language corpus collection in 2015, there was a notable deficit of local Kazakh-Russian Sign Language data available for computer vision and machine-learning tasks. There were already corpora of another lexically close language, Russian Sign Language, but they were aimed at and tailored for lingustic research. We initiated the procedure by collecting data appropriate for machine-learning purposes. The subsets have been incorporated into the principal corpus and will be subject to future enhancements and refinements. This paper provides an overview of the collected components of the Kazakh-Russian Sign Language Corpus and the resulting outcomes derived from them.
AB - Sign language (SL) is a mode of communication that, in most cases, relies on visual perception exclusively and uses the visual-gestural modality. The advent of machine learning techniques has expanded the range of potential applications, not only in industry but also in addressing societal needs. Previous research has already demonstrated encouraging outcomes in developing sign language recognition systems that are both quite accurate and resilient. Nevertheless, the effectiveness and use of algorithms are impacted not only by their accessibility but also, at times to a greater extent, by the presence of substantial quantities of pertinent data. At the start of the local sign language corpus collection in 2015, there was a notable deficit of local Kazakh-Russian Sign Language data available for computer vision and machine-learning tasks. There were already corpora of another lexically close language, Russian Sign Language, but they were aimed at and tailored for lingustic research. We initiated the procedure by collecting data appropriate for machine-learning purposes. The subsets have been incorporated into the principal corpus and will be subject to future enhancements and refinements. This paper provides an overview of the collected components of the Kazakh-Russian Sign Language Corpus and the resulting outcomes derived from them.
KW - dataset collection
KW - overview
KW - sign language
UR - http://www.scopus.com/inward/record.url?scp=85197484677&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85197484677&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85197484677
T3 - 11th Workshop on the Representation and Processing of Sign Languages: Evaluation of Sign Language Resources, sign-lang@LREC-COLING 2024
SP - 111
EP - 122
BT - 11th Workshop on the Representation and Processing of Sign Languages
A2 - Efthimiou, Eleni
A2 - Fotinea, Stavroula-Evita
A2 - Hanke, Thomas
A2 - Hochgesang, Julie A.
A2 - Mesch, Johanna
A2 - Schulder, Marc
PB - Association for Computational Linguistics (ACL)
T2 - 11th Workshop on the Representation and Processing of Sign Languages: Evaluation of Sign Language Resources, sign-lang@LREC-COLING 2024
Y2 - 25 May 2024
ER -