Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model
Main Author: | |
---|---|
Publication Date: | 2021 |
Other Authors: | , |
Format: | Article |
Language: | eng |
Source: | Brazilian Archives of Biology and Technology |
Download full: | http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1516-89132021000100624 |
Summary: | Abstract Diabetes mellitus (DM) is a category of metabolic disorders caused by high blood sugar. The DM affects human metabolism, and this disease causes many complications like Heart disease, Neuropathy, Diabetic retinopathy, kidney problems, skin disorder and slow healing. It is therefore essential to predict the presence of DM using an automated diabetes diagnosis system, which can be implemented using machine learning algorithms. A variety of automated diabetes prediction systems have been proposed in previous studies. Even so, the low prediction accuracy of DM prediction systems is a major issue. This proposed work developed a diabetes mellitus prediction system to improve the diabetes mellitus prediction accuracy using Optimized Gaussian Naive Bayes algorithm. This proposed model using the Pima Indians diabetes dataset as an input to build the DM predictive model. The missing values of an input dataset are imputed using regression imputation method. The sequential backward feature elimination method is used in this proposed model for selecting the relevant risk factors of diabetes disease. The proposed machine learning classifier named Optimized Gaussian Naïve Bayes (OGNB) is applied to the selected risk factors to create an enhanced Diabetes diagnostic system which predicts Diabetes in an individual. The performance analysis of this prediction architecture shows that, over other traditional machine learning classifiers, the Optimized Gaussian Naïve Bayes achieves an 81.85% classifier accuracy. This proposed DM prediction system is effective as compared to other diabetes prediction systems found in the literature. According to our experimental study, the OGNB based diabetes mellitus prediction system is more appropriate for DM disease prediction. |
id |
TECPAR-1_2ef7c2669a9c65ab3c97bb7eafe03355 |
---|---|
oai_identifier_str |
oai:scielo:S1516-89132021000100624 |
network_acronym_str |
TECPAR-1 |
network_name_str |
Brazilian Archives of Biology and Technology |
repository_id_str |
|
spelling |
Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction ModelOptimized Gaussian Naïve Bayes classifierRegression imputationSequential backward feature eliminationDiabetes mellitus diagnosisAbstract Diabetes mellitus (DM) is a category of metabolic disorders caused by high blood sugar. The DM affects human metabolism, and this disease causes many complications like Heart disease, Neuropathy, Diabetic retinopathy, kidney problems, skin disorder and slow healing. It is therefore essential to predict the presence of DM using an automated diabetes diagnosis system, which can be implemented using machine learning algorithms. A variety of automated diabetes prediction systems have been proposed in previous studies. Even so, the low prediction accuracy of DM prediction systems is a major issue. This proposed work developed a diabetes mellitus prediction system to improve the diabetes mellitus prediction accuracy using Optimized Gaussian Naive Bayes algorithm. This proposed model using the Pima Indians diabetes dataset as an input to build the DM predictive model. The missing values of an input dataset are imputed using regression imputation method. The sequential backward feature elimination method is used in this proposed model for selecting the relevant risk factors of diabetes disease. The proposed machine learning classifier named Optimized Gaussian Naïve Bayes (OGNB) is applied to the selected risk factors to create an enhanced Diabetes diagnostic system which predicts Diabetes in an individual. The performance analysis of this prediction architecture shows that, over other traditional machine learning classifiers, the Optimized Gaussian Naïve Bayes achieves an 81.85% classifier accuracy. This proposed DM prediction system is effective as compared to other diabetes prediction systems found in the literature. According to our experimental study, the OGNB based diabetes mellitus prediction system is more appropriate for DM disease prediction.Instituto de Tecnologia do Paraná - Tecpar2021-01-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S1516-89132021000100624Brazilian Archives of Biology and Technology v.64 2021reponame:Brazilian Archives of Biology and Technologyinstname:Instituto de Tecnologia do Paraná (Tecpar)instacron:TECPAR10.1590/1678-4324-2021210181info:eu-repo/semantics/openAccessMohideen,Dhilsath Fathima MohammedRaj,Justin Samuel SavariRaj,Raja Soosaimarian Petereng2022-02-18T00:00:00Zoai:scielo:S1516-89132021000100624Revistahttps://www.scielo.br/j/babt/https://old.scielo.br/oai/scielo-oai.phpbabt@tecpar.br||babt@tecpar.br1678-43241516-8913opendoar:2022-02-18T00:00Brazilian Archives of Biology and Technology - Instituto de Tecnologia do Paraná (Tecpar)false |
dc.title.none.fl_str_mv |
Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model |
title |
Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model |
spellingShingle |
Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model Mohideen,Dhilsath Fathima Mohammed Optimized Gaussian Naïve Bayes classifier Regression imputation Sequential backward feature elimination Diabetes mellitus diagnosis |
title_short |
Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model |
title_full |
Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model |
title_fullStr |
Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model |
title_full_unstemmed |
Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model |
title_sort |
Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model |
author |
Mohideen,Dhilsath Fathima Mohammed |
author_facet |
Mohideen,Dhilsath Fathima Mohammed Raj,Justin Samuel Savari Raj,Raja Soosaimarian Peter |
author_role |
author |
author2 |
Raj,Justin Samuel Savari Raj,Raja Soosaimarian Peter |
author2_role |
author author |
dc.contributor.author.fl_str_mv |
Mohideen,Dhilsath Fathima Mohammed Raj,Justin Samuel Savari Raj,Raja Soosaimarian Peter |
dc.subject.por.fl_str_mv |
Optimized Gaussian Naïve Bayes classifier Regression imputation Sequential backward feature elimination Diabetes mellitus diagnosis |
topic |
Optimized Gaussian Naïve Bayes classifier Regression imputation Sequential backward feature elimination Diabetes mellitus diagnosis |
description |
Abstract Diabetes mellitus (DM) is a category of metabolic disorders caused by high blood sugar. The DM affects human metabolism, and this disease causes many complications like Heart disease, Neuropathy, Diabetic retinopathy, kidney problems, skin disorder and slow healing. It is therefore essential to predict the presence of DM using an automated diabetes diagnosis system, which can be implemented using machine learning algorithms. A variety of automated diabetes prediction systems have been proposed in previous studies. Even so, the low prediction accuracy of DM prediction systems is a major issue. This proposed work developed a diabetes mellitus prediction system to improve the diabetes mellitus prediction accuracy using Optimized Gaussian Naive Bayes algorithm. This proposed model using the Pima Indians diabetes dataset as an input to build the DM predictive model. The missing values of an input dataset are imputed using regression imputation method. The sequential backward feature elimination method is used in this proposed model for selecting the relevant risk factors of diabetes disease. The proposed machine learning classifier named Optimized Gaussian Naïve Bayes (OGNB) is applied to the selected risk factors to create an enhanced Diabetes diagnostic system which predicts Diabetes in an individual. The performance analysis of this prediction architecture shows that, over other traditional machine learning classifiers, the Optimized Gaussian Naïve Bayes achieves an 81.85% classifier accuracy. This proposed DM prediction system is effective as compared to other diabetes prediction systems found in the literature. According to our experimental study, the OGNB based diabetes mellitus prediction system is more appropriate for DM disease prediction. |
publishDate |
2021 |
dc.date.none.fl_str_mv |
2021-01-01 |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1516-89132021000100624 |
url |
http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1516-89132021000100624 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
10.1590/1678-4324-2021210181 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
text/html |
dc.publisher.none.fl_str_mv |
Instituto de Tecnologia do Paraná - Tecpar |
publisher.none.fl_str_mv |
Instituto de Tecnologia do Paraná - Tecpar |
dc.source.none.fl_str_mv |
Brazilian Archives of Biology and Technology v.64 2021 reponame:Brazilian Archives of Biology and Technology instname:Instituto de Tecnologia do Paraná (Tecpar) instacron:TECPAR |
instname_str |
Instituto de Tecnologia do Paraná (Tecpar) |
instacron_str |
TECPAR |
institution |
TECPAR |
reponame_str |
Brazilian Archives of Biology and Technology |
collection |
Brazilian Archives of Biology and Technology |
repository.name.fl_str_mv |
Brazilian Archives of Biology and Technology - Instituto de Tecnologia do Paraná (Tecpar) |
repository.mail.fl_str_mv |
babt@tecpar.br||babt@tecpar.br |
_version_ |
1750318280907685888 |