TY - GEN
T1 - Kazakh and Russian languages identification using long short-term memory recurrent neural networks
AU - Kozhirbayev, Zhanibek
AU - Yessenbayev, Zhandos
AU - Karabalayeva, Muslima
PY - 2019/4/10
Y1 - 2019/4/10
N2 - Automatic language identification (LID) belongs to the automatic process whereby the identity of the language spoken in a speech sample can be distinguished. In recent decades, LID has made significant advancement in spoken language identification which received an advantage from technological achievements in related areas, such as signal processing, pattern recognition, machine learning and neural networks. This work investigates the employment of Long Short-Term Memory (LSTM) recurrent neural networks (RNNs) for automatic language identification. The main reason of applying LSTM RNNs to the current task is their reasonable capacity in handling sequences. This study shows that LSTM RNNs can efficiently take advantage of temporal dependencies in acoustic data in order to learn relevant features for language recognition tasks. In this paper, we show results for conducted language identification experiments for Kazakh and Russian languages and the presented LSTM RNN model can deal with short utterances (2s). The model was trained using open-source high-level neural networks API Keras on limited computational resources.
AB - Automatic language identification (LID) belongs to the automatic process whereby the identity of the language spoken in a speech sample can be distinguished. In recent decades, LID has made significant advancement in spoken language identification which received an advantage from technological achievements in related areas, such as signal processing, pattern recognition, machine learning and neural networks. This work investigates the employment of Long Short-Term Memory (LSTM) recurrent neural networks (RNNs) for automatic language identification. The main reason of applying LSTM RNNs to the current task is their reasonable capacity in handling sequences. This study shows that LSTM RNNs can efficiently take advantage of temporal dependencies in acoustic data in order to learn relevant features for language recognition tasks. In this paper, we show results for conducted language identification experiments for Kazakh and Russian languages and the presented LSTM RNN model can deal with short utterances (2s). The model was trained using open-source high-level neural networks API Keras on limited computational resources.
KW - Language identification
KW - Long Short-Term Memory Recurrent Neural Networks
UR - http://www.scopus.com/inward/record.url?scp=85052065195&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85052065195&partnerID=8YFLogxK
U2 - 10.1109/ICAICT.2017.8687095
DO - 10.1109/ICAICT.2017.8687095
M3 - Conference contribution
AN - SCOPUS:85052065195
T3 - 11th IEEE International Conference on Application of Information and Communication Technologies, AICT 2017 - Proceedings
BT - 11th IEEE International Conference on Application of Information and Communication Technologies, AICT 2017 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 11th IEEE International Conference on Application of Information and Communication Technologies, AICT 2017
Y2 - 20 September 2017 through 22 September 2017
ER -