Comparison of Effectiveness of Stemming Algorithms in Indonesian Documents

Mustikasari, Dyah, Widaningrum, Ida, Arifin, Rizal and Putri, Wahyu Henggal Eka (2020) Comparison of Effectiveness of Stemming Algorithms in Indonesian Documents. In: Proceedings of the 2nd Borobudur International Symposium on Science and Technology (BIS-STE 2020), 11 August 2021.

[img] Text
10. cp_Comparison of Effectiveness of Stemming Algorithms.pdf

Download (1MB)
Official URL: https://www.atlantis-press.com/proceedings/bis-ste...

Abstract

Stemming is a process to determine basic word with some rules. In Bahasa Indonesia, the way is to eliminate prefixes, infixes, suffixes, or combination of prefixes and suffixes in derivative words. Several stemming algorithms for Bahasa Indonesia have been developed. But their effectiveness has not been studied. In this study, these three stemming algorithms will be compared. We used 900 affixes to conduct the comparison. Each word is searched for their basic words using the three algorithms. The basic word resulted then referred to KBBI or Indonesian dictionary to see whether they are right. Comparison process of stemming show that Sastrawi’s could do the best stemming that 95,2% of the affix words tested could be root words. The Nazief & Adriani Algorithm resulted 92,4%, while Arifin Setiono’s finished at 89%. It could state that Arifin Setiono’s needs a lot of improvement because many affixed words could not return to the root word.

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: Effectiveness, Stemming, Indonesian, Document
Subjects: T Technology > T Technology (General)
Divisions: Faculty of Engineering
Depositing User: Library Umpo
Date Deposited: 20 Sep 2023 03:29
Last Modified: 20 Sep 2023 03:29
URI: http://eprints.umpo.ac.id/id/eprint/12871

Actions (login required)

View Item View Item