TY - GEN
T1 - ChatGPT for Visually Impaired and Blind
AU - Kuzdeuov, Askat
AU - Mukayev, Olzhas
AU - Nurgaliyev, Shakhizat
AU - Kunbolsyn, Alisher
AU - Varol, Huseyin Atakan
N1 - Publisher Copyright: © 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - According to the World Health Organization (WHO), hundreds of millions of people have some type of visual disability. Vision impairment has a personal impact with lifelong consequences because more than 80% of our perception, cognition, learning, and daily activities are mediated through vision. Moreover, in the era of rapid advancements in artificial intelligence (AI), visually impaired and blind people face challenges at work and in education because of the inaccessibility of AI technologies. In this regard, we present an assistive mobile application with an intuitive user interface (UI) for visually impaired and blind people to interact with ChatGPT via natural conversation. The app employs automatic speech recognition (ASR), text-to-speech (TTS), keyword spotting (KWS), voice activity detection (VAD), and a convenient UI to interact with ChatGPT effortlessly. We have made the source code, pre-trained models, and VI publicly available at https://github.com/IS2AI/talk-llm to stimulate the development of assistive mobile applications.
AB - According to the World Health Organization (WHO), hundreds of millions of people have some type of visual disability. Vision impairment has a personal impact with lifelong consequences because more than 80% of our perception, cognition, learning, and daily activities are mediated through vision. Moreover, in the era of rapid advancements in artificial intelligence (AI), visually impaired and blind people face challenges at work and in education because of the inaccessibility of AI technologies. In this regard, we present an assistive mobile application with an intuitive user interface (UI) for visually impaired and blind people to interact with ChatGPT via natural conversation. The app employs automatic speech recognition (ASR), text-to-speech (TTS), keyword spotting (KWS), voice activity detection (VAD), and a convenient UI to interact with ChatGPT effortlessly. We have made the source code, pre-trained models, and VI publicly available at https://github.com/IS2AI/talk-llm to stimulate the development of assistive mobile applications.
UR - http://www.scopus.com/inward/record.url?scp=85189932847&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85189932847&partnerID=8YFLogxK
U2 - 10.1109/ICAIIC60209.2024.10463430
DO - 10.1109/ICAIIC60209.2024.10463430
M3 - Conference contribution
AN - SCOPUS:85189932847
T3 - 6th International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2024
SP - 722
EP - 727
BT - 6th International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 6th International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2024
Y2 - 19 February 2024 through 22 February 2024
ER -