<doi_batch xmlns="http://www.crossref.org/schema/4.4.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" version="4.4.0"><head><doi_batch_id>c1b70a5f-3299-41df-b167-af3093bff8e2</doi_batch_id><timestamp>20250807132835828</timestamp><depositor><depositor_name>wseas:wseas</depositor_name><email_address>mdt@crossref.org</email_address></depositor><registrant>MDT Deposit</registrant></head><body><journal><journal_metadata language="en"><full_title>WSEAS TRANSACTIONS ON COMPUTER RESEARCH</full_title><issn media_type="electronic">2415-1521</issn><issn media_type="print">1991-8755</issn><archive_locations><archive name="Portico"/></archive_locations><doi_data><doi>10.37394/232018</doi><resource>http://wseas.org/wseas/cms.action?id=13372</resource></doi_data></journal_metadata><journal_issue><publication_date media_type="online"><month>1</month><day>10</day><year>2025</year></publication_date><publication_date media_type="print"><month>1</month><day>10</day><year>2025</year></publication_date><journal_volume><volume>13</volume><doi_data><doi>10.37394/232018.2025.13</doi><resource>https://wseas.com/journals/cr/2025.php</resource></doi_data></journal_volume></journal_issue><journal_article language="en"><titles><title>Machine Learning-based Prediction of Diabetes for Improved Healthcare</title></titles><contributors><person_name sequence="first" contributor_role="author"><given_name>Rong</given_name><surname>Zhao</surname><affiliation>Institute of Computer Science and Digital Innovation, UCSI University, WP Kuala Lumpur 56000, MALAYSIA</affiliation></person_name><person_name sequence="additional" contributor_role="author"><given_name>Ghassan Saleh</given_name><surname>Aldharhani</surname><affiliation>Institute of Computer Science and Digital Innovation, UCSI University, WP Kuala Lumpur 56000, MALAYSIA</affiliation></person_name><person_name sequence="additional" contributor_role="author"><given_name>Kurunathan</given_name><surname>Ratnavelu</surname><affiliation>Institute of Computer Science and Digital Innovation, UCSI University, WP Kuala Lumpur 56000, MALAYSIA</affiliation></person_name></contributors><jats:abstract xmlns:jats="http://www.ncbi.nlm.nih.gov/JATS1"><jats:p>Diabetes is a disease that can lead to severe tissue damage and dysfunction, and to improve the accuracy of one's prediction of early diabetes, patient datasets can be used to build Machine Learning (ML) and Deep Learning (DL) models to make the results more accurate and valid. There have been impressive advances in the integration of Artificial Intelligence (AI) and Machine Learning techniques in healthcare systems. This paper presents a comparative analysis of machine learning and deep learning algorithms for diabetes. The dataset used in the experiment is available at www.kaggle.com. In our experiments, we compared and analyzed the classification accuracies of each dataset under different classification algorithms and compared and analyzed the results with the accuracies of the corresponding algorithms listed in the references. The results show that in most cases the proposed algorithm outperforms the references in terms of classification accuracy, and the difference in this result is due to different data preprocessing. The original dataset will be further improved in the data preprocessing section and feature engineering will be further investigated at a later stage. Preprocessing the data and adjusting the model parameters can lead to better classification results. The accuracy of each model varies, and by comparing the results of the various algorithms, it is found that the random forest algorithm and the multilayer perceptron (MLP) algorithm have better accuracy than the other methods, and this finding lays the foundation for subsequent related research.</jats:p></jats:abstract><publication_date media_type="online"><month>8</month><day>7</day><year>2025</year></publication_date><publication_date media_type="print"><month>8</month><day>7</day><year>2025</year></publication_date><pages><first_page>593</first_page><last_page>607</last_page></pages><publisher_item><item_number item_number_type="article_number">53</item_number></publisher_item><ai:program xmlns:ai="http://www.crossref.org/AccessIndicators.xsd" name="AccessIndicators"><ai:free_to_read start_date="2025-08-07"/><ai:license_ref applies_to="am" start_date="2025-08-07">https://wseas.com/journals/cr/2025/b085118-358.pdf</ai:license_ref></ai:program><archive_locations><archive name="Portico"/></archive_locations><doi_data><doi>10.37394/232018.2025.13.53</doi><resource>https://wseas.com/journals/cr/2025/b085118-358.pdf</resource></doi_data><citation_list><citation key="ref0"><unstructured_citation>Centers for Disease Control and Prevention. National Diabetes Statistics Report, Methods for the National Diabetes Statistics Report, [Online]. https://www.cdc.gov/diabetes/data/statisticsreport/index.html (Accessed Date: December 13, 2024). </unstructured_citation></citation><citation key="ref1"><doi>10.3390/jpm13030406</doi><unstructured_citation>C.Y. Chou, D.Y. Hsu, and C.H. Chou, Predicting the onset of diabetes with machine learning methods. Journal of Personalized Medicine, Vol.13, No.3, 2023, pp.406-423. https://doi.org/10.3390/jpm13030406. </unstructured_citation></citation><citation key="ref2"><doi>10.2337/diacare.28.5.1187</doi><unstructured_citation>M. Franciosi, G. De Berardis, M.C. Rossi, M. Sacco, M. Belfiglio, F. Pellegrini, G. Tognoni, M. Valentini, A. Nicolucci, and IGLOO Study Group, Use of the diabetes risk score for opportunistic screening of undiagnosed diabetes and impaired glucose tolerance: the IGLOO (Impaired Glucose Tolerance and Long-Term Outcomes Observational) study. Diabetes Care, Vol.28, No.5, 2005, pp. 1187- 1194. https://doi.org/10.2337/diacare.28.5.1187. </unstructured_citation></citation><citation key="ref3"><doi>10.1111/j.1464-5491.2007.02157.x</doi><unstructured_citation>K.G.M.M. Alberti, P. Zimmet, and J. Shaw, International Diabetes Federation: a consensus on Type 2 diabetes prevention. Diabetic Medicine, Vol.24, No.5, 2007, pp. 451-463. https://doi.org/10.1111/j.1464- 5491.2007.02157.x. </unstructured_citation></citation><citation key="ref4"><doi>10.1007/s00500-023-08390-8</doi><unstructured_citation>N.S. Gnanadesigan, N. Dhanasegar, M.D. Ramasamy, S. Muthusamy, O.P. Mishra, G.K. Pugalendhi, S.C.M. Sundararajan, and A. Ravindaran, An integrated network topology and deep learning model for prediction of Alzheimer disease candidate genes. Soft Computing, Vol.27, No.19, 2023, pp. 14189- 14203. https://doi.org/10.1007/s00500-023- 08390-8. </unstructured_citation></citation><citation key="ref5"><unstructured_citation>N. Girdhar, A. Sinha, and S. Gupta, DenseNet-II: An improved deep convolutional neural network for melanoma cancer detection. Soft Computing, Vol.27 No.18, 2023 pp. 13285-13304. https://doi.org/10.1007/s00500-022-07406-z. </unstructured_citation></citation><citation key="ref6"><doi>10.1016/j.prime.2024.100661</doi><unstructured_citation>N. Katiyar, H.K. Thakur, and A. Ghatak, Recent Advancements using Machine Learning &amp; Deep Learning Approaches for Diabetes Detection: A Systematic Review. ePrime-Advances in Electrical Engineering, Electronics and Energy, Vol.9, 2024, pp. 100661-100678. https://doi.org/10.1016/j.prime.2024.100661. </unstructured_citation></citation><citation key="ref7"><doi>10.1021/ci950063e</doi><unstructured_citation>M.S. Shanker, Using neural networks to predict the onset of diabetes mellitus. Journal of Chemical Information and Computer Sciences, Vol.36 No.1, 1996, pp. 35-41. https://doi.org/10.1021/ci950063e. </unstructured_citation></citation><citation key="ref8"><doi>10.3390/app9214604</doi><unstructured_citation>S. Larabi-Marie-Sainte, L. Aburahmah, R. Almohaini, and T. Saba, Current techniques for diabetes prediction: review and case study. Applied Sciences, Vol.9, No.21, 2019, pp. 4604-4622. https://doi.org/10.3390/app9214604. </unstructured_citation></citation><citation key="ref9"><unstructured_citation>K. Kapadia, H. Abdel-Jaber, F. Thabtah, and W. Hadi, Sport analytics for cricket game results using machine learning: An experimental study. Applied Computing and Informatics, Vol.18 No.3/4, 2020, pp. 256- 266. https://doi.org/10.1016/j.aci.2019.11.006. </unstructured_citation></citation><citation key="ref10"><unstructured_citation>A. Graves, and A. Graves, Supervised sequence labelling, Springer Berlin Heidelberg, 2012. </unstructured_citation></citation><citation key="ref11"><doi>10.46430/phpt0048</doi><unstructured_citation>DataCamp. Support Vector Machines with Scikit-learn Tutorial, [Online]. https://www.datacamp.com/tutorial/svmclassification-scikit-learn-python (Accessed Date: December 13, 2024). </unstructured_citation></citation><citation key="ref12"><unstructured_citation>R. Birjais, A.K. Mourya, R. Chauhan, and H. Kaur, Prediction and diagnosis of future diabetes risk: a machine learning approach. SN Applied Sciences, Vol.1, No.9, 2019, pp. :1112-1119. https://doi.org/10.1007/s42452- 019-1117-9. </unstructured_citation></citation><citation key="ref13"><unstructured_citation>Analytics Vidhya, A Complete Guide to KNearest Neighbors, [Online]. https://www.analyticsvidhya.com/blog/2018/0 3/introduction-k-neighbours-algorithmclustering/ (Accessed Date: December 13, 2024). </unstructured_citation></citation><citation key="ref14"><doi>10.1016/j.imu.2019.100204</doi><unstructured_citation>T.M. Alam, M.A. Iqbal, Y. Ali, A. Wahab, S. Ijaz, T.I. Baig, A. Hussain, M.A. Malik, M.M. Raza, S. Ibrar, and Z. Abbas, A model for early prediction of diabetes. Informatics in Medicine Unlocked, Vol.16, 2019, pp. 100204-100209. https://doi.org/10.1016/j.imu.2019.100204. </unstructured_citation></citation><citation key="ref15"><doi>10.1016/j.icte.2021.02.004</doi><unstructured_citation>J.J. Khanam, and S.Y. Foo, A comparison of machine learning algorithms for diabetes prediction. Ict Express, Vol.7, No.4, 2021, pp. 432-439. https://doi.org/10.1016/j.icte.2021.02.004. </unstructured_citation></citation><citation key="ref16"><doi>10.1109/access.2020.3041355</doi><unstructured_citation>W. Wang, M. Tong, and M. Yu, Blood glucose prediction with VMD and LSTM optimized by improved particle swarm optimization. IEEE Access, Vol.8, 2020, pp. 217908-217916. https://doi.org/10.1109/ACCESS.2020.30413 55. </unstructured_citation></citation><citation key="ref17"><doi>10.1007/s42979-020-00250-8</doi><unstructured_citation>L.J. Muhammad, E.A. Algehyne, and S.S. Usman, Predictive supervised machine learning models for diabetes mellitus. SN Computer Science, Vol.1, No.5, 2020, pp. 240-249. https://doi.org/10.1007/s42979-020- 00250-8. </unstructured_citation></citation><citation key="ref18"><doi>10.1016/j.procs.2020.03.336</doi><unstructured_citation>N.P. Tigga, and S. Garg, Prediction of type 2 diabetes using machine learning classification methods. Procedia Computer Science, Vol.167, 2020, pp. 706-716. https://doi.org/10.1016/j.procs.2020.03.336. </unstructured_citation></citation><citation key="ref19"><doi>10.1016/j.aci.2018.12.004</doi><unstructured_citation>H. Kaur, and V. Kumari, Predictive modelling and analytics for diabetes using a machine learning approach. Applied Computing and Informatics, Vol.18, No.1/2, 2020, pp. 90- 100. https://doi.org/10.1016/j.aci.2018.12.004. </unstructured_citation></citation><citation key="ref20"><doi>10.1007/s00530-021-00817-2</doi><unstructured_citation>C. Azad, B. Bhushan, R. Sharma, A. Shankar, K.K. Singh, and A. Khamparia, Prediction model using SMOTE, genetic algorithm and decision tree (PMSGD) for classification of diabetes mellitus. Multimedia Systems, Vol.28, No.4, 2022, pp. 1289-1307. https://doi.org/10.1007/s00530-021-00817-2. </unstructured_citation></citation><citation key="ref21"><doi>10.1016/j.imu.2020.100299</doi><unstructured_citation>H. Das, B. Naik, and H.S. Behera, Medical disease analysis using neuro-fuzzy with feature extraction model for classification. Informatics in Medicine Unlocked, Vol.18, 2020, pp. 100288-100301. https://doi.org/10.1016/j.imu.2019.100288. </unstructured_citation></citation><citation key="ref22"><unstructured_citation>S. Saru, and S. Subashree, Analysis and prediction of diabetes using machine learning. International Journal of Emerging Technology and Innovative Engineering, Vol.5, No.4, 2019, 167-175. </unstructured_citation></citation><citation key="ref23"><doi>10.31436/iiumej.v21i1.1206</doi><unstructured_citation>A. Ali, M.A. Alrubei, L.F.M. Hassan, M.A. Al-Ja'afari, and S.H. Abdulwahed, Diabetes classification based on KNN. IIUM Engineering Journal, Vol.21, No.1, 2020, pp. 175-181. https://doi.org/10.31436/iiumej.v21i1.1206. </unstructured_citation></citation><citation key="ref24"><doi>10.1016/j.eswa.2018.08.002</doi><unstructured_citation>J.A. Carter, C.S. Long, B.P. Smith, T.L. Smith, and G.L. Donati, Combining elemental analysis of toenails and machine learning techniques as a non-invasive diagnostic tool for the robust classification of type-2 diabetes. Expert Systems with Applications, Vol.115, 2019, pp. 245-255. https://doi.org/10.1016/j.eswa.2018.08.002. </unstructured_citation></citation><citation key="ref25"><doi>10.1109/ubmyk48245.2019.8965542</doi><unstructured_citation>A. Al-Zebari, and A. Sengur, Performance comparison of machine learning techniques on diabetes disease detection. In 2019 1st international informatics and software engineering conference (UBMYK), Ankara, Turkey, 2019, pp. 1-4. IEEE. https://doi.org/10.1109/UBMYK48245.2019.8 965542. </unstructured_citation></citation><citation key="ref26"><doi>10.1007/978-981-13-1280-9_6</doi><unstructured_citation>A. Choudhury, and D. Gupta, A survey on medical diagnosis of diabetes using machine learning techniques. In Recent Developments in Machine Learning and Data Analytics: IC3 2018, 2019, pp. 67-78. Springer Singapore. https://doi.org/10.1007/978-981-13-1280-9_6. </unstructured_citation></citation><citation key="ref27"><doi>10.1155/2022/3820360</doi><unstructured_citation>R. Saxena, S.K. Sharma, M. Gupta, and G.C. Sampada, A novel approach for feature selection and classification of diabetes mellitus: Machine learning methods. Computational Intelligence and Neuroscience, Vol.2022, No.1, 2022, pp. 3820360-3820371. https://doi.org/10.1155/2022/3820360. </unstructured_citation></citation><citation key="ref28"><doi>10.2147/dmso.s312787</doi><unstructured_citation>P. Nagaraj, P. Deepalakshmi, R.F. Mansour, and A. Almazroa, Artificial flora algorithmbased feature selection with gradient boosted tree model for diabetes classification. Diabetes, Metabolic Syndrome and Obesity: Targets and Therapy, Vol.14, 2021, pp. 2789- 2806. https://doi.org/10.2147/DMSO.S312787. </unstructured_citation></citation><citation key="ref29"><doi>10.1016/j.health.2024.100301</doi><unstructured_citation>G. Dharmarathne, T.N. Jayasinghe, M. Bogahawaththa, D.P. Meddage, and U. Rathnayake, A novel machine learning approach for diagnosing diabetes with a selfexplainable interface. Healthcare Analytics, Vol.5, 2024, pp. 100301-100313. https://doi.org/10.1016/j.health.2024.100301. </unstructured_citation></citation><citation key="ref30"><doi>10.3390/buildings12060734</doi><unstructured_citation>P. Meddage, I. Ekanayake, U.S. Perera, H.M. Azamathulla, M.A. Md Said, and U. Rathnayake, Interpretation of machinelearning-based (black-box) wind pressure predictions for low-rise gable-roofed buildings using Shapley additive explanations (SHAP). Buildings, Vol.12, No.6, 2022, pp. 734-760. https://doi.org/10.3390/buildings12060734. </unstructured_citation></citation><citation key="ref31"><doi>10.1016/j.rineng.2023.101388</doi><unstructured_citation>P. Thisovithan, H. Aththanayake, D.P. Meddage, I.U. Ekanayake, and U. Rathnayake, A novel explainable AI-based approach to estimate the natural period of vibration of masonry infill reinforced concrete frame structures using different machine learning techniques. Results in Engineering, Vol.19, 2023, pp. 101388-101408. https://doi.org/10.1016/j.rineng.2023.101388. </unstructured_citation></citation><citation key="ref32"><doi>10.3390/s22124398</doi><unstructured_citation>D.P. Meddage, I.U. Ekanayake, S. Herath, R. Gobirahavan, N. Muttil, and U. Rathnayake, Predicting bulk average velocity with rigid vegetation in open channels using tree-based machine learning: a novel approach using explainable artificial intelligence. Sensors, Vol.22, No.12, 2022, pp. 4398-4426. https://doi.org/10.3390/s22124398. </unstructured_citation></citation><citation key="ref33"><unstructured_citation>N. Gandhi, and S. Mishra, Explainable AI for healthcare: a study for interpreting diabetes prediction. In Machine Learning and Big Data Analytics (Proceedings of International Conference on Machine Learning and Big Data Analytics (ICMLBDA) 2021), Patna, India, 2022 (pp. 95-105). Springer International Publishing. https://doi.org/10.1007/978-3-030-82469-3_9. </unstructured_citation></citation><citation key="ref34"><doi>10.1016/j.scitotenv.2021.150674</doi><unstructured_citation>H. Wei, J. Sun, W. Shan, W. Xiao, B. Wang, X. Ma, W. Hu, X. Wang, and Y. Xia, Environmental chemical exposure dynamics and machine learning-based prediction of diabetes mellitus. Science of The Total Environment. Vol.806, 2022 ， pp.150674. https://doi.org/10.1016/j.scitotenv.2021.15067 4. </unstructured_citation></citation><citation key="ref35"><doi>10.1155/2021/9930985</doi><unstructured_citation>U.M. Butt, S. Letchmunan, M. Ali, F.H. Hassan, A. Baqir, and H.H. Sherazi, Machine learning based diabetes classification and prediction for healthcare applications. Journal of healthcare engineering. Vol.2021, No.1, 2021, pp. 9930985-9931001. https://doi.org/10.1155/2021/9930985. </unstructured_citation></citation><citation key="ref36"><unstructured_citation>J.M. Yin, Y. Li, J.T. Xue, G.W. Zong, Z.Z. Fang, and L. Zou, Explainable Machine Learning-Based Prediction Model for Diabetic Nephropathy. Journal of Diabetes Research. Vol.2024, No.1, 2024, pp. 8857453-8857465. https://doi.org/10.1155/2024/8857453. </unstructured_citation></citation><citation key="ref37"><doi>10.23919/indiacom61295.2024.10498997</doi><unstructured_citation>A. Khanam, and F.S. Masoodi, Early Detection of Type-2 Diabetes Mellitus using Machine Learning based Prediction Models. In 2024 11th International Conference on Computing for Sustainable Global Development (INDIACom), New Delhi, India, 2024 (pp. 1398-1403). IEEE. https://doi.org/10.23919/INDIACom61295.20 24.10498997. </unstructured_citation></citation><citation key="ref38"><unstructured_citation>B.F. Wee, S. Sivakumar, K.H. Lim, W.K. Wong, and F.H. Juwono, Diabetes detection based on machine learning and deep learning approaches. Multimedia Tools and Applications. Vol.83, No.3, 2024, pp. 24153- 24185. https://doi.org/10.1007/s11042-023- 16407-5. </unstructured_citation></citation><citation key="ref39"><doi>10.4018/979-8-3693-2359-5.ch005</doi><unstructured_citation>H. Vardhan, and V. Kumar, Comparative Analysis of Machine Learning-Based Diabetes Prediction Approaches. In Future of AI in Medical Imaging, 2024 (pp. 65-75). IGI Global. https://doi.org/10.4018/979-8-3693- 2359-5.ch005. </unstructured_citation></citation><citation key="ref40"><unstructured_citation>UCI Machine Learning, Pima Indians Diabetes Database, [Online]. https://www.kaggle.com/datasets/uciml/pimaindians-diabetes-database (Accessed Date: December 13, 2024). </unstructured_citation></citation><citation key="ref41"><unstructured_citation>Alex T., Diabetes Health Indicators Dataset, [Online]. https://www.kaggle.com/datasets/alexteboul/d iabetes-health-indicators-dataset (Accessed Date: December 13, 2024). </unstructured_citation></citation><citation key="ref42"><unstructured_citation>Neha P.T., Diabetes Dataset 2019, [Online]. https://www.kaggle.com/datasets/tigganeha4/ diabetes-dataset-2019 (Accessed Date: December 13, 2024). </unstructured_citation></citation><citation key="ref43"><unstructured_citation>B. Humberto, Diabetes 130 US hospitals for years 1999-2008, [Online]. https://www.kaggle.com/datasets/brandao/dia betes (Accessed Date: December 13, 2024). </unstructured_citation></citation><citation key="ref44"><unstructured_citation>Larxel. Early Classification of Diabetes, [Online]. https://www.kaggle.com/datasets/andrewmvd/ early-diabetes-classification (Accessed Date: December 13, 2024). </unstructured_citation></citation><citation key="ref45"><doi>10.1016/j.ijcce.2021.01.001</doi><unstructured_citation>S. Kumari, D. Kumar, and M. Mittal, An ensemble approach for classification and prediction of diabetes mellitus using soft voting classifier. International Journal of Cognitive Computing in Engineering, Vol.2, No.1, 2021, pp. 40-46. https://doi.org/10.1016/j.ijcce.2021.01.001. </unstructured_citation></citation><citation key="ref46"><doi>10.1016/j.procs.2022.12.107</doi><unstructured_citation>M.E. Febrian, F.X. Ferdinan, G.P. Sendani, K.M. Suryanigrum, and R. Yunanda, Diabetes prediction using supervised machine learning. Procedia Computer Science, Vol.216, 2023, pp. 21-30. https://doi.org/10.1016/j.procs.2022.12.107. </unstructured_citation></citation><citation key="ref47"><unstructured_citation>N. Ahmed, R. Ahammed, M.M. Islam, M.A. Uddin, A. Akhter, M.A. Talukder, and B.K. Paul, Machine learning based diabetes prediction and development of smart web application. International Journal of Cognitive Computing in Engineering, Vol.2, 2021, pp. 229-241. https://doi.org/10.1016/j.ijcce.2021.12.001. </unstructured_citation></citation><citation key="ref48"><doi>10.1016/j.imed.2023.01.003</doi><unstructured_citation>N. Nipa, M.H. Riyad, S. Satu, K.C. Howlader, and M.A. Moni, Clinically adaptable machine learning model to identify early appreciable features of diabetes in Bangladesh. Intelligent Medicine, Vol.4, No.1, 2024, pp. 22-32. https://doi.org/10.1016/j.imed.2023.01.003. </unstructured_citation></citation><citation key="ref49"><doi>10.1016/j.health.2022.100118</doi><unstructured_citation>V. Chang, M.A. Ganatra, K. Hall, L. Golightly, and Q.A. Xu, An assessment of machine learning models and algorithms for early prediction and diagnosis of diabetes using health indicators. Healthcare Analytics, Vol.2, 2022, pp. 100118-100131. https://doi.org/10.1016/j.health.2022.100118. </unstructured_citation></citation><citation key="ref50"><doi>10.1016/j.procs.2022.09.195</doi><unstructured_citation>M.S.A. Magboo, and V.P.C. Magboo, Feature Importance Measures as Explanation for Classification Applied to Hospital Readmission Prediction. Procedia Computer Science, Vol.207, 2022, pp. 1388-1397. https://doi.org/10.1016/j.procs.2022.09.195.</unstructured_citation></citation></citation_list></journal_article></journal></body></doi_batch>