Zelalem Fantahun
iCog-Labs, Ethiopia
Posters & Accepted Abstracts: Adv Robot Autom
This paper presents the development of Amharic part-of-speech (POS) tagger using machine learning algorithm. Since Amharic is one of the under-resourced languages, the availability of tagged corpus is the train bottleneck problem for natural language processing, such as POS tagging. The promising direction to tackle this problem is a learning model that does not require annotated dataset. In this study, k-means clustering algorithm is used to group words that have similar features together. This paper also identifies morphological and positional word features that can distinguish the words into different clusters. The experimental result shows that the prototype achieves 81% accuracy. zelalemfantahun@gmail.com
Advances in Robotics & Automation received 1127 citations as per Google Scholar report