Abstract
In this paper we describe a work in progress on designing the continuous vector space word representations able to map unseen data adequately. We propose a LSTM-based feature extraction layer that reads in a sequence of characters corresponding to a word and outputs a single fixed-length real-valued vector. We then test our model on a POS tagging task on four typologically different languages. The results of the experiments suggest that the model can offer a solution to the out-of-vocabulary words problem, as in a comparable setting its OOV accuracy improves over that of a state of the art tagger.
| Original language | English |
|---|---|
| Title of host publication | Application of Information and Communication Technologies, AICT 2016 - Conference Proceedings |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9781509018406 |
| DOIs | |
| Publication status | Published - Jul 25 2017 |
| Event | 10th IEEE International Conference on Application of Information and Communication Technologies, AICT 2016 - Baku, Azerbaijan Duration: Oct 12 2016 → Oct 14 2016 |
Publication series
| Name | Application of Information and Communication Technologies, AICT 2016 - Conference Proceedings |
|---|
Conference
| Conference | 10th IEEE International Conference on Application of Information and Communication Technologies, AICT 2016 |
|---|---|
| Country/Territory | Azerbaijan |
| City | Baku |
| Period | 10/12/16 → 10/14/16 |
Funding
ACKNOWLEDGMENT This work was supported by the Nazarbayev University under the grant number 064-2016/013-2016 and the Ministry of Education of Science of the Republic of Kazakhstan under the research program number 349//049-2016.
Keywords
- character-based features
- continuous vector space word representations
- LSTM networks
- out-of-vocabulary words
- POS-tagging
ASJC Scopus subject areas
- Computer Vision and Pattern Recognition
- Computer Science Applications
- Computer Networks and Communications
- Information Systems
- Modelling and Simulation
Fingerprint
Dive into the research topics of 'Character-based feature extraction with LSTM networks for POS-tagging task'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS