Show simple item record

dc.contributor.authorLu, Haohui
dc.contributor.authorUddin, Shahadat
dc.contributor.authorHajati, Farshid
dc.contributor.authorMoni, Mohammad Ali
dc.contributor.authorKhushi, Matloob
dc.date.accessioned2021-07-05T04:11:32Z
dc.date.available2021-07-05T04:11:32Z
dc.date.issued2021
dc.identifier.issn0924-669Xen_US
dc.identifier.doi10.1007/s10489-021-02533-wen_US
dc.identifier.urihttp://hdl.handle.net/10072/405666
dc.description.abstractIn recent years, the prevalence of chronic diseases such as type 2 diabetes mellitus (T2DM) has increased, bringing a heavy burden to healthcare systems. While regular monitoring of patients is expensive and impractical, understanding chronic disease progressions and identifying patients at risk of developing comorbidities are crucial. This research used a real-world administrative claim dataset of T2DM to develop an ensemble of innovative patient network and machine learning approach for disease prediction. The healthcare data of 1,028 T2DM patients and 1,028 non-T2DM patients are extracted from the de-identified data to predict the risk of T2DM. The proposed model is based on the ‘patient network’, which represents the underlying relationships among health conditions for a group of patients diagnosed with the same disease using the graph theory. Besides patients’ socio-demographic and behaviour characteristics, the attributes of the ‘patient network’ (e.g., centrality measure) discover patients’ latent features, which are effective in risk prediction. We apply eight machine learning models (Logistic Regression, K-Nearest Neighbours, Support Vector Machine, Naïve Bayes, Decision Tree, Random Forest, XGBoost and Artificial Neural Network) to the extracted features to predict the chronic disease risk. The extensive experiments show that the proposed framework with machine learning classifiers performance with the Area Under Curve (AUC) ranged from 0.79 to 0.91. The Random Forest model outperformed the other models; whereas, eigenvector centrality and closeness centrality of the network and patient age are the most important features for the model. The outstanding performance of our model provides promising potential applications in healthcare services. Also, we provide strong evidence that the extracted latent features are essential in the disease risk prediction. The proposed approach offers vital insight into chronic disease risk prediction that could benefit healthcare service providers and their stakeholders.en_US
dc.description.peerreviewedYesen_US
dc.languageEnglishen_US
dc.publisherSpringeren_US
dc.relation.ispartofjournalApplied Intelligenceen_US
dc.subject.fieldofresearchArtificial Intelligence and Image Processingen_US
dc.subject.fieldofresearchcode0801en_US
dc.subject.keywordsScience & Technologyen_US
dc.subject.keywordsComputer Scienceen_US
dc.subject.keywordsDisease predictionen_US
dc.titleA patient network-based machine learning model for disease prediction: The case of type 2 diabetes mellitusen_US
dc.typeJournal articleen_US
dc.type.descriptionC1 - Articlesen_US
dcterms.bibliographicCitationLu, H; Uddin, S; Hajati, F; Moni, MA; Khushi, M, A patient network-based machine learning model for disease prediction: The case of type 2 diabetes mellitus, Applied Intelligence, 2021en_US
dc.date.updated2021-07-02T05:35:42Z
gro.description.notepublicThis publication has been entered in Griffith Research Online as an advanced online version.en_US
gro.hasfulltextNo Full Text
gro.griffith.authorHajati, Farshid


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

  • Journal articles
    Contains articles published by Griffith authors in scholarly journals.

Show simple item record