IJKE 2016 Vol.2(3): 115-121 ISSN: 2382-6185
doi: 10.18178/ijke.2016.2.3.064

Benchmarking Mi-POS: Malay Part-of-Speech Tagger

Benjamin Chu Min Xian, Mohamed Lubani, Liew Kwei Ping, Khalil Bouzekri, Rohana Mahmud, and Dickson Lukose
Abstract—A part-of-speech tagger assigns the correct grammatical category to each word in a given text based on the context surrounding the word. This paper presents Mi-POS, a Malay part-of-speech tagger developed using a probabilistic approach that incorporates contextual information. It also reports the results of benchmarking Mi-POS against several similar systems and highlights the lessons learnt. The evaluation dataset consists of manually annotated texts, and accuracy and time are used as the evaluation measures. The final results show that Mi-POS outperforms other Malay part-of-speech taggers in accuracy: it achieves 95.16% when tagging new words from the same corpus type as the training data and 81.12% when tagging words from different corpus types.

Index Terms—Benchmarking, Malay language, natural language processing, part-of-speech tagging.
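
The abstract describes a probabilistic tagger that uses context and is evaluated by accuracy and time. The sketch below illustrates that general idea only; it is not the Mi-POS implementation, whose model is not specified on this page. It assumes a simple count-based tagger that conditions on the previous tag and backs off to per-word counts. The function names (train, tag, evaluate), the tag labels (PRON, VERB, NOUN), and the toy Malay sentences are hypothetical and chosen purely for illustration.

```python
import time
from collections import Counter, defaultdict

# Illustrative sketch only: NOT the Mi-POS model. All names, tags, and data
# below are hypothetical assumptions made for this example.

def train(tagged_sentences):
    """Collect tag counts conditioned on (previous tag, word) and on word alone."""
    context_counts = defaultdict(Counter)  # (prev_tag, word) -> Counter of tags
    word_counts = defaultdict(Counter)     # word -> Counter of tags
    tag_counts = Counter()                 # overall tag frequencies
    for sentence in tagged_sentences:
        prev = "<s>"  # sentence-start marker
        for word, tag_label in sentence:
            w = word.lower()
            context_counts[(prev, w)][tag_label] += 1
            word_counts[w][tag_label] += 1
            tag_counts[tag_label] += 1
            prev = tag_label
    return context_counts, word_counts, tag_counts

def tag(words, model):
    """Greedily pick the most frequent tag given the previous predicted tag."""
    context_counts, word_counts, tag_counts = model
    default = tag_counts.most_common(1)[0][0]  # fallback for unseen words
    prev, output = "<s>", []
    for word in words:
        w = word.lower()
        if (prev, w) in context_counts:
            best = context_counts[(prev, w)].most_common(1)[0][0]
        elif w in word_counts:
            best = word_counts[w].most_common(1)[0][0]
        else:
            best = default
        output.append(best)
        prev = best
    return output

def evaluate(model, gold_sentences):
    """Return (token-level accuracy, elapsed seconds) on annotated sentences."""
    correct = total = 0
    start = time.perf_counter()
    for sentence in gold_sentences:
        predicted = tag([w for w, _ in sentence], model)
        for (_, gold), pred in zip(sentence, predicted):
            correct += int(gold == pred)
            total += 1
    elapsed = time.perf_counter() - start
    return correct / total, elapsed

# Toy, hand-made data (hypothetical; not from the paper's corpus).
train_data = [
    [("saya", "PRON"), ("makan", "VERB"), ("nasi", "NOUN")],
    [("dia", "PRON"), ("minum", "VERB"), ("air", "NOUN")],
]
test_data = [[("dia", "PRON"), ("makan", "VERB"), ("air", "NOUN")]]

model = train(train_data)
accuracy, elapsed = evaluate(model, test_data)
print(f"accuracy={accuracy:.2%}, time={elapsed:.6f}s")
```

Separating train, tag, and evaluate mirrors the benchmarking setup the abstract outlines: the same evaluate routine could be run on test sets drawn from the training corpus type and from other corpus types to compare accuracy across the two conditions.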

Dickson Lukose, Khalil Bouzekri and Benjamin Chu Min Xian are with the Artificial Intelligence Lab at MIMOS Berhad, Kuala Lumpur, 57000 Malaysia (e-mail: dickson.lukose@mimos.my, khalil.ben@mimos.my, mx.chu@mimos.my).
Mohamed Lubani and Liew Kwei Ping are with the University of Malaya, Faculty of Computer Science and Information Technology, Kuala Lumpur, 50603 Malaysia (e-mail: mohamed.lubani@siswa.um.edu.my, liewkweiping@siswa.um.edu.my).
Rohana Mahmud is with the Department of Artificial Intelligence, Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, 50603 Malaysia (e-mail: rohanamahmud@um.edu.my).


Cite: Benjamin Chu Min Xian, Mohamed Lubani, Liew Kwei Ping, Khalil Bouzekri, Rohana Mahmud, and Dickson Lukose, "Benchmarking Mi-POS: Malay Part-of-Speech Tagger," International Journal of Knowledge Engineering, vol. 2, no. 3, pp. 115-121, 2016.

Copyright © 2008-2016. International Journal of Knowledge Engineering. All rights reserved.
E-mail: ijke@ejournal.net