Skip to main navigation Skip to search Skip to main content

Character-based feature extraction with LSTM networks for POS-tagging task

  • Aibek Makazhanov
  • , Zhandos Yessenbayev

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper we describe a work in progress on designing the continuous vector space word representations able to map unseen data adequately. We propose a LSTM-based feature extraction layer that reads in a sequence of characters corresponding to a word and outputs a single fixed-length real-valued vector. We then test our model on a POS tagging task on four typologically different languages. The results of the experiments suggest that the model can offer a solution to the out-of-vocabulary words problem, as in a comparable setting its OOV accuracy improves over that of a state of the art tagger.

Original languageEnglish
Title of host publicationApplication of Information and Communication Technologies, AICT 2016 - Conference Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781509018406
DOIs
Publication statusPublished - Jul 25 2017
Event10th IEEE International Conference on Application of Information and Communication Technologies, AICT 2016 - Baku, Azerbaijan
Duration: Oct 12 2016Oct 14 2016

Publication series

NameApplication of Information and Communication Technologies, AICT 2016 - Conference Proceedings

Conference

Conference10th IEEE International Conference on Application of Information and Communication Technologies, AICT 2016
Country/TerritoryAzerbaijan
CityBaku
Period10/12/1610/14/16

Funding

ACKNOWLEDGMENT This work was supported by the Nazarbayev University under the grant number 064-2016/013-2016 and the Ministry of Education of Science of the Republic of Kazakhstan under the research program number 349//049-2016.

Keywords

  • character-based features
  • continuous vector space word representations
  • LSTM networks
  • out-of-vocabulary words
  • POS-tagging

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Computer Science Applications
  • Computer Networks and Communications
  • Information Systems
  • Modelling and Simulation

Fingerprint

Dive into the research topics of 'Character-based feature extraction with LSTM networks for POS-tagging task'. Together they form a unique fingerprint.

Cite this