TY - JOUR
T1 - A comprehensive voice dataset for Hindko digit recognition
AU - Ahmed, Tanveer
AU - Khan, Maqbool
AU - Khan, Khalil
AU - Syed, Ikram
AU - Ullah, Syed Sajid
N1 - Publisher Copyright:
© 2024 The Author(s)
PY - 2025/2
Y1 - 2025/2
N2 - Hindko is a language primarily spoken in Northwestern areas of Pakistan. Approximately eight million people speak the Hindko language. According to its native speakers, it is 7th largest language of Pakistan and 2nd largest language of Khyber Pakhtunkhwa. The Hazara region is the cultural hub of Hindko language. About 80% of the population in districts like Haripur, Abbotabad and Mansehra speak Hindko. The spoken content of Hindko covers a wide range of subjects, including religion, education, poetry, politics, theater, and more. Despite all this, Hindko lacks a voice recognition system that could enhance accessibility, preserve the language, and promote digital inclusion for its speakers. This paper presents a voice recognition dataset that consists of 17,597 voice samples, and is accessible to the public for academic and research purposes. The dataset consists of 20 Hindko digits ranging from 1 to 20 and all the voice samples are taken from the students and staff and faculty of Pak-Austria Fachhochschule Institute of Applied Science and Technology.
AB - Hindko is a language primarily spoken in Northwestern areas of Pakistan. Approximately eight million people speak the Hindko language. According to its native speakers, it is 7th largest language of Pakistan and 2nd largest language of Khyber Pakhtunkhwa. The Hazara region is the cultural hub of Hindko language. About 80% of the population in districts like Haripur, Abbotabad and Mansehra speak Hindko. The spoken content of Hindko covers a wide range of subjects, including religion, education, poetry, politics, theater, and more. Despite all this, Hindko lacks a voice recognition system that could enhance accessibility, preserve the language, and promote digital inclusion for its speakers. This paper presents a voice recognition dataset that consists of 17,597 voice samples, and is accessible to the public for academic and research purposes. The dataset consists of 20 Hindko digits ranging from 1 to 20 and all the voice samples are taken from the students and staff and faculty of Pak-Austria Fachhochschule Institute of Applied Science and Technology.
KW - Artificial intelligence
KW - Machine learning
KW - Natural language processing
KW - Signal processing
KW - Voice recognition
UR - http://www.scopus.com/inward/record.url?scp=85212412664&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85212412664&partnerID=8YFLogxK
U2 - 10.1016/j.dib.2024.111220
DO - 10.1016/j.dib.2024.111220
M3 - Article
AN - SCOPUS:85212412664
SN - 2352-3409
VL - 58
JO - Data in Brief
JF - Data in Brief
M1 - 111220
ER -