Speaker Recognition for Robotic Control via an IoT Device

Zhanibek Kozhirbayev, Berat A. Erol, Altynbek Sharipbay, Mo Jamshidi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Speaker Recognition is considered as one of the primary tasks in speech processing. Nowadays, the speaker identification method has been extensively appealing for its broad application in many fields, such as smart environments, securing the cyber-physical systems, speech communications, and robotic controls. Researchers are targeting to perform an effective method that makes it possible to obtain the recognition ability that is close to the hearing of human. In order to get high accuracy, challenges of large-scale applications of speaker identification are overcome through applying techniques not only traditional models based on the GMM, but also deep learning methods. Aiming at effectively dealing with this challenge, in this paper, we present a novel model to increase the recognition accuracy of the short utterance speaker recognition system. We developed a technique to train a Neural Network (NN) on the extracted Mel-Frequency Cepstral Coefficient (MFCC) features from audio samples. Therefore, the recognition system gains the significant accuracy. The model was trained using open-source high-level neural networks API Keras.

Original languageEnglish
Title of host publication2018 World Automation Congress, WAC 2018
PublisherIEEE Computer Society
Pages259-264
Number of pages6
Volume2018-June
ISBN (Print)9781532377914
DOIs
Publication statusPublished - Aug 8 2018
Event2018 World Automation Congress, WAC 2018 - Stevenson, United States
Duration: Jun 3 2018Jun 6 2018

Conference

Conference2018 World Automation Congress, WAC 2018
CountryUnited States
CityStevenson
Period6/3/186/6/18

Fingerprint

Robotics
Neural networks
Speech processing
Speech communication
Audition
Application programming interfaces (API)
Internet of things
Cyber Physical System
Deep learning

Keywords

  • Amazon Echo
  • Human robot interactions
  • Internet of robotic things
  • Neural networks
  • Speaker identification and recognition

ASJC Scopus subject areas

  • Control and Systems Engineering

Cite this

Kozhirbayev, Z., Erol, B. A., Sharipbay, A., & Jamshidi, M. (2018). Speaker Recognition for Robotic Control via an IoT Device. In 2018 World Automation Congress, WAC 2018 (Vol. 2018-June, pp. 259-264). [8430295] IEEE Computer Society. https://doi.org/10.23919/WAC.2018.8430295

Speaker Recognition for Robotic Control via an IoT Device. / Kozhirbayev, Zhanibek; Erol, Berat A.; Sharipbay, Altynbek; Jamshidi, Mo.

2018 World Automation Congress, WAC 2018. Vol. 2018-June IEEE Computer Society, 2018. p. 259-264 8430295.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Kozhirbayev, Z, Erol, BA, Sharipbay, A & Jamshidi, M 2018, Speaker Recognition for Robotic Control via an IoT Device. in 2018 World Automation Congress, WAC 2018. vol. 2018-June, 8430295, IEEE Computer Society, pp. 259-264, 2018 World Automation Congress, WAC 2018, Stevenson, United States, 6/3/18. https://doi.org/10.23919/WAC.2018.8430295
Kozhirbayev Z, Erol BA, Sharipbay A, Jamshidi M. Speaker Recognition for Robotic Control via an IoT Device. In 2018 World Automation Congress, WAC 2018. Vol. 2018-June. IEEE Computer Society. 2018. p. 259-264. 8430295 https://doi.org/10.23919/WAC.2018.8430295
Kozhirbayev, Zhanibek ; Erol, Berat A. ; Sharipbay, Altynbek ; Jamshidi, Mo. / Speaker Recognition for Robotic Control via an IoT Device. 2018 World Automation Congress, WAC 2018. Vol. 2018-June IEEE Computer Society, 2018. pp. 259-264
@inproceedings{13c08ba29f3f485a9c1d19d3897e4ccb,
title = "Speaker Recognition for Robotic Control via an IoT Device",
abstract = "Speaker Recognition is considered as one of the primary tasks in speech processing. Nowadays, the speaker identification method has been extensively appealing for its broad application in many fields, such as smart environments, securing the cyber-physical systems, speech communications, and robotic controls. Researchers are targeting to perform an effective method that makes it possible to obtain the recognition ability that is close to the hearing of human. In order to get high accuracy, challenges of large-scale applications of speaker identification are overcome through applying techniques not only traditional models based on the GMM, but also deep learning methods. Aiming at effectively dealing with this challenge, in this paper, we present a novel model to increase the recognition accuracy of the short utterance speaker recognition system. We developed a technique to train a Neural Network (NN) on the extracted Mel-Frequency Cepstral Coefficient (MFCC) features from audio samples. Therefore, the recognition system gains the significant accuracy. The model was trained using open-source high-level neural networks API Keras.",
keywords = "Amazon Echo, Human robot interactions, Internet of robotic things, Neural networks, Speaker identification and recognition",
author = "Zhanibek Kozhirbayev and Erol, {Berat A.} and Altynbek Sharipbay and Mo Jamshidi",
year = "2018",
month = "8",
day = "8",
doi = "10.23919/WAC.2018.8430295",
language = "English",
isbn = "9781532377914",
volume = "2018-June",
pages = "259--264",
booktitle = "2018 World Automation Congress, WAC 2018",
publisher = "IEEE Computer Society",
address = "United States",

}

TY - GEN

T1 - Speaker Recognition for Robotic Control via an IoT Device

AU - Kozhirbayev, Zhanibek

AU - Erol, Berat A.

AU - Sharipbay, Altynbek

AU - Jamshidi, Mo

PY - 2018/8/8

Y1 - 2018/8/8

N2 - Speaker Recognition is considered as one of the primary tasks in speech processing. Nowadays, the speaker identification method has been extensively appealing for its broad application in many fields, such as smart environments, securing the cyber-physical systems, speech communications, and robotic controls. Researchers are targeting to perform an effective method that makes it possible to obtain the recognition ability that is close to the hearing of human. In order to get high accuracy, challenges of large-scale applications of speaker identification are overcome through applying techniques not only traditional models based on the GMM, but also deep learning methods. Aiming at effectively dealing with this challenge, in this paper, we present a novel model to increase the recognition accuracy of the short utterance speaker recognition system. We developed a technique to train a Neural Network (NN) on the extracted Mel-Frequency Cepstral Coefficient (MFCC) features from audio samples. Therefore, the recognition system gains the significant accuracy. The model was trained using open-source high-level neural networks API Keras.

AB - Speaker Recognition is considered as one of the primary tasks in speech processing. Nowadays, the speaker identification method has been extensively appealing for its broad application in many fields, such as smart environments, securing the cyber-physical systems, speech communications, and robotic controls. Researchers are targeting to perform an effective method that makes it possible to obtain the recognition ability that is close to the hearing of human. In order to get high accuracy, challenges of large-scale applications of speaker identification are overcome through applying techniques not only traditional models based on the GMM, but also deep learning methods. Aiming at effectively dealing with this challenge, in this paper, we present a novel model to increase the recognition accuracy of the short utterance speaker recognition system. We developed a technique to train a Neural Network (NN) on the extracted Mel-Frequency Cepstral Coefficient (MFCC) features from audio samples. Therefore, the recognition system gains the significant accuracy. The model was trained using open-source high-level neural networks API Keras.

KW - Amazon Echo

KW - Human robot interactions

KW - Internet of robotic things

KW - Neural networks

KW - Speaker identification and recognition

UR - http://www.scopus.com/inward/record.url?scp=85052081976&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85052081976&partnerID=8YFLogxK

U2 - 10.23919/WAC.2018.8430295

DO - 10.23919/WAC.2018.8430295

M3 - Conference contribution

SN - 9781532377914

VL - 2018-June

SP - 259

EP - 264

BT - 2018 World Automation Congress, WAC 2018

PB - IEEE Computer Society

ER -