Text Mining in Healthcare for Disease Classification using Machine Learning Algorithm

Buntoro, Ghulam Asrofi, Wibawa, Adhi Dharma and Purnomo, Mauridhi Hery (2021) Text Mining in Healthcare for Disease Classification using Machine Learning Algorithm. In: 2021 International Electronics Symposium (IES), 29-30 September 2021, Institut Teknologi Sepuluh Nopember.

[img] Text
16. Text Mining in Healthcare for Disease Classification using Machine Learning Algorithm.pdf

Download (362kB)
[img] Text
16. Cek Plagiasi_Text Mining in Healthcare for Disease Classification using Machine Learning Algorithm.pdf

Download (1MB)
Official URL: https://ieeexplore.ieee.org/document/9593998

Abstract

The development of information technology and smartphones has caused production of many data around us. In every second million of new data is created in the form of text, audio, image and even videos. This environment then has triggered big data analytics demand. One of big data that is produced daily is data on the history of healthcare services in hospitals. Important new information can be retrieved through this huge dataset, especially concerning the patient symptoms, drug usage and new diseases report. In this study, text processing technique is applied on text data of patient medical record data from public hospital during 2017 till 2019 regarding the patient symptoms and the disease classification. Naïve Bayes Classifier and Random Forest algorithms are used to classify diseases in medical record data with 19 diseases in preprocessing data. A list of modified Indonesian stop words was used to filter the symptom sentences. The result indicates that the Random Forest classification algorithm can achieve the highest accuracy of around 99.9%, better and more accurate than the Naïve Bayes classification algorithm. This experiment shows that our proposed method provides a robust system and good accuracy for classifying medical record data with many diseases.

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: text mining, healthcare, disease, naïve Bayes classification, random forest
Subjects: T Technology > T Technology (General)
Divisions: Faculty of Engineering > Department of Informatic Engineering
Depositing User: Library Umpo
Date Deposited: 15 Feb 2023 04:06
Last Modified: 15 Feb 2023 04:06
URI: http://eprints.umpo.ac.id/id/eprint/10847

Actions (login required)

View Item View Item