Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model

Bibliographic Details
Main Author: Mohideen,Dhilsath Fathima Mohammed
Publication Date: 2021
Other Authors: Raj,Justin Samuel Savari, Raj,Raja Soosaimarian Peter
Format: Article
Language: eng
Source: Brazilian Archives of Biology and Technology
Download full: http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1516-89132021000100624
Summary: Abstract Diabetes mellitus (DM) is a category of metabolic disorders caused by high blood sugar. The DM affects human metabolism, and this disease causes many complications like Heart disease, Neuropathy, Diabetic retinopathy, kidney problems, skin disorder and slow healing. It is therefore essential to predict the presence of DM using an automated diabetes diagnosis system, which can be implemented using machine learning algorithms. A variety of automated diabetes prediction systems have been proposed in previous studies. Even so, the low prediction accuracy of DM prediction systems is a major issue. This proposed work developed a diabetes mellitus prediction system to improve the diabetes mellitus prediction accuracy using Optimized Gaussian Naive Bayes algorithm. This proposed model using the Pima Indians diabetes dataset as an input to build the DM predictive model. The missing values of an input dataset are imputed using regression imputation method. The sequential backward feature elimination method is used in this proposed model for selecting the relevant risk factors of diabetes disease. The proposed machine learning classifier named Optimized Gaussian Naïve Bayes (OGNB) is applied to the selected risk factors to create an enhanced Diabetes diagnostic system which predicts Diabetes in an individual. The performance analysis of this prediction architecture shows that, over other traditional machine learning classifiers, the Optimized Gaussian Naïve Bayes achieves an 81.85% classifier accuracy. This proposed DM prediction system is effective as compared to other diabetes prediction systems found in the literature. According to our experimental study, the OGNB based diabetes mellitus prediction system is more appropriate for DM disease prediction.
id TECPAR-1_2ef7c2669a9c65ab3c97bb7eafe03355
oai_identifier_str oai:scielo:S1516-89132021000100624
network_acronym_str TECPAR-1
network_name_str Brazilian Archives of Biology and Technology
repository_id_str
spelling Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction ModelOptimized Gaussian Naïve Bayes classifierRegression imputationSequential backward feature eliminationDiabetes mellitus diagnosisAbstract Diabetes mellitus (DM) is a category of metabolic disorders caused by high blood sugar. The DM affects human metabolism, and this disease causes many complications like Heart disease, Neuropathy, Diabetic retinopathy, kidney problems, skin disorder and slow healing. It is therefore essential to predict the presence of DM using an automated diabetes diagnosis system, which can be implemented using machine learning algorithms. A variety of automated diabetes prediction systems have been proposed in previous studies. Even so, the low prediction accuracy of DM prediction systems is a major issue. This proposed work developed a diabetes mellitus prediction system to improve the diabetes mellitus prediction accuracy using Optimized Gaussian Naive Bayes algorithm. This proposed model using the Pima Indians diabetes dataset as an input to build the DM predictive model. The missing values of an input dataset are imputed using regression imputation method. The sequential backward feature elimination method is used in this proposed model for selecting the relevant risk factors of diabetes disease. The proposed machine learning classifier named Optimized Gaussian Naïve Bayes (OGNB) is applied to the selected risk factors to create an enhanced Diabetes diagnostic system which predicts Diabetes in an individual. The performance analysis of this prediction architecture shows that, over other traditional machine learning classifiers, the Optimized Gaussian Naïve Bayes achieves an 81.85% classifier accuracy. This proposed DM prediction system is effective as compared to other diabetes prediction systems found in the literature. According to our experimental study, the OGNB based diabetes mellitus prediction system is more appropriate for DM disease prediction.Instituto de Tecnologia do Paraná - Tecpar2021-01-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S1516-89132021000100624Brazilian Archives of Biology and Technology v.64 2021reponame:Brazilian Archives of Biology and Technologyinstname:Instituto de Tecnologia do Paraná (Tecpar)instacron:TECPAR10.1590/1678-4324-2021210181info:eu-repo/semantics/openAccessMohideen,Dhilsath Fathima MohammedRaj,Justin Samuel SavariRaj,Raja Soosaimarian Petereng2022-02-18T00:00:00Zoai:scielo:S1516-89132021000100624Revistahttps://www.scielo.br/j/babt/https://old.scielo.br/oai/scielo-oai.phpbabt@tecpar.br||babt@tecpar.br1678-43241516-8913opendoar:2022-02-18T00:00Brazilian Archives of Biology and Technology - Instituto de Tecnologia do Paraná (Tecpar)false
dc.title.none.fl_str_mv Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model
title Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model
spellingShingle Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model
Mohideen,Dhilsath Fathima Mohammed
Optimized Gaussian Naïve Bayes classifier
Regression imputation
Sequential backward feature elimination
Diabetes mellitus diagnosis
title_short Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model
title_full Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model
title_fullStr Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model
title_full_unstemmed Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model
title_sort Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model
author Mohideen,Dhilsath Fathima Mohammed
author_facet Mohideen,Dhilsath Fathima Mohammed
Raj,Justin Samuel Savari
Raj,Raja Soosaimarian Peter
author_role author
author2 Raj,Justin Samuel Savari
Raj,Raja Soosaimarian Peter
author2_role author
author
dc.contributor.author.fl_str_mv Mohideen,Dhilsath Fathima Mohammed
Raj,Justin Samuel Savari
Raj,Raja Soosaimarian Peter
dc.subject.por.fl_str_mv Optimized Gaussian Naïve Bayes classifier
Regression imputation
Sequential backward feature elimination
Diabetes mellitus diagnosis
topic Optimized Gaussian Naïve Bayes classifier
Regression imputation
Sequential backward feature elimination
Diabetes mellitus diagnosis
description Abstract Diabetes mellitus (DM) is a category of metabolic disorders caused by high blood sugar. The DM affects human metabolism, and this disease causes many complications like Heart disease, Neuropathy, Diabetic retinopathy, kidney problems, skin disorder and slow healing. It is therefore essential to predict the presence of DM using an automated diabetes diagnosis system, which can be implemented using machine learning algorithms. A variety of automated diabetes prediction systems have been proposed in previous studies. Even so, the low prediction accuracy of DM prediction systems is a major issue. This proposed work developed a diabetes mellitus prediction system to improve the diabetes mellitus prediction accuracy using Optimized Gaussian Naive Bayes algorithm. This proposed model using the Pima Indians diabetes dataset as an input to build the DM predictive model. The missing values of an input dataset are imputed using regression imputation method. The sequential backward feature elimination method is used in this proposed model for selecting the relevant risk factors of diabetes disease. The proposed machine learning classifier named Optimized Gaussian Naïve Bayes (OGNB) is applied to the selected risk factors to create an enhanced Diabetes diagnostic system which predicts Diabetes in an individual. The performance analysis of this prediction architecture shows that, over other traditional machine learning classifiers, the Optimized Gaussian Naïve Bayes achieves an 81.85% classifier accuracy. This proposed DM prediction system is effective as compared to other diabetes prediction systems found in the literature. According to our experimental study, the OGNB based diabetes mellitus prediction system is more appropriate for DM disease prediction.
publishDate 2021
dc.date.none.fl_str_mv 2021-01-01
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1516-89132021000100624
url http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1516-89132021000100624
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 10.1590/1678-4324-2021210181
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv text/html
dc.publisher.none.fl_str_mv Instituto de Tecnologia do Paraná - Tecpar
publisher.none.fl_str_mv Instituto de Tecnologia do Paraná - Tecpar
dc.source.none.fl_str_mv Brazilian Archives of Biology and Technology v.64 2021
reponame:Brazilian Archives of Biology and Technology
instname:Instituto de Tecnologia do Paraná (Tecpar)
instacron:TECPAR
instname_str Instituto de Tecnologia do Paraná (Tecpar)
instacron_str TECPAR
institution TECPAR
reponame_str Brazilian Archives of Biology and Technology
collection Brazilian Archives of Biology and Technology
repository.name.fl_str_mv Brazilian Archives of Biology and Technology - Instituto de Tecnologia do Paraná (Tecpar)
repository.mail.fl_str_mv babt@tecpar.br||babt@tecpar.br
_version_ 1750318280907685888