ChatGPT for Visually Impaired and Blind

Askat Kuzdeuov, Olzhas Mukayev, Shakhizat Nurgaliyev, Alisher Kunbolsyn, Huseyin Atakan Varol

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

According to the World Health Organization (WHO), hundreds of million people have some type of visual disability. Vision impairment has a personal impact with lifelong consequences because more than 80 % of our perception, cognition, learning, and daily activities are mediated through vision. Moreover, in the era of rapid advancements in artificial intelligence (AI), visually impaired and blind people face challenges at work and in education because of inaccessibility to AI technologies. In this regard, we present an assistive mobile application with an intuitive user interface (UI) for visually impaired and blind people to interact with ChatGPT via natural conversation. The app employs automatic speech recognition (ASR), text-To-speech (TTS), keyword spotting (KWS), voice activity detection (VAD), and a convenient UI to interact with ChatGPT effortlessly. We have made the source code, pre-Trained models, and VI publicly available at https://github.com/IS2AI/talk-llm to stimulate the development of assistive mobile applications.

Original languageEnglish
Title of host publication6th International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages722-727
Number of pages6
ISBN (Electronic)9798350344349
DOIs
Publication statusPublished - 2024
Event6th International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2024 - Osaka, Japan
Duration: Feb 19 2024Feb 22 2024

Publication series

Name6th International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2024

Conference

Conference6th International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2024
Country/TerritoryJapan
CityOsaka
Period2/19/242/22/24

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Computer Vision and Pattern Recognition
  • Information Systems
  • Safety, Risk, Reliability and Quality
  • Health Informatics

Fingerprint

Dive into the research topics of 'ChatGPT for Visually Impaired and Blind'. Together they form a unique fingerprint.

Cite this