Decoding Insomnia: Machine Learning Model Predicts Sleep Disorders From Patient Records

Summary: By analyzing demographics, lifestyle data, physical exam results, and laboratory values, a new machine-learning algorithm can accurately predict whether a person is at risk of sleep disorders. Age, weight, and depression are three factors the AI technology identified as being significant predictors of insomnia.

Source: PLOS

A machine learning model can effectively predict a patient’s risk for a sleep disorder using demographic and lifestyle data, physical exam results and laboratory values, according to a new study published this week in the open-access journal PLOS ONE by Samuel Y. Huang of Virginia Commonwealth University School of Medicine, and Alexander A. Huang of Northwestern Feinberg University School of Medicine, US.

The prevalence of diagnosed sleep disorders among American patients has significantly increased over the past decade. This trend is important to better understand and reverse since sleep disorders are a significant risk factor for diabetes, heart disease, obesity, and depression.

In the new work, the researchers used the machine learning model XGBoost to analyze publicly available data on 7,929 patients in the US who completed the National Health and Nutrition Examination Survey. The data contained 684 variables for each patient, including demographic, dietary, exercise and mental health questionnaire responses, as well as laboratory and physical exam information.

This is a drawing of a woman laying in bed
The prevalence of diagnosed sleep disorders among American patients has significantly increased over the past decade. Image is in the public domain

Overall, 2,302 patients in the study had a physician diagnosis of a sleep disorder. XGBoost could predict the risk of sleep disorder diagnosis with a strong accuracy (AUROC=0.87, sensitivity=0.74, specificity=0.77), using 64 of the total variables included in the full dataset. The greatest predictors for a sleep disorder, based on the machine learning model, were depression, weight, age and waist circumference.

The authors conclude that machine learning methods may be effective first steps in screening patients for sleep disorder risk without relying on physician judgement or bias. 

Samuel Y. Huang adds: “What sets this study on the risk factors for insomnia apart from others is seeing not only that depressive symptoms, age, caffeine use, history of congestive heart failure, chest pain, coronary artery disease, liver disease, and 57 other variables are associated with insomnia, but also visualizing the contribution of each in a very predictive model.”

About this machine learning and sleep disorder research news

Author: Hanna Abdallah
Source: PLOS
Contact: Hanna Abdallah – PLOS
Image: The image is in the public domain

Original Research: Open access.
Use of machine learning to identify risk factors for insomnia” by Samuel Y. Huang et al. PLOS ONE


Abstract

Use of machine learning to identify risk factors for insomnia

Importance

Sleep is critical to a person’s physical and mental health, but there are few studies systematically assessing risk factors for sleep disorders.

Objective

The objective of this study was to identify risk factors for a sleep disorder through machine-learning and assess this methodology.

Design, setting, and participants

A retrospective, cross-sectional cohort study using the publicly available National Health and Nutrition Examination Survey (NHANES) was conducted in patients who completed the demographic, dietary, exercise, and mental health questionnaire and had laboratory and physical exam data.

Methods

A physician diagnosis of insomnia was the outcome of this study. Univariate logistic models, with insomnia as the outcome, were used to identify covariates that were associated with insomnia. Covariates that had a p<0.0001 on univariate analysis were included within the final machine-learning model. The machine learning model XGBoost was used due to its prevalence within the literature as well as its increased predictive accuracy in healthcare prediction. Model covariates were ranked according to the cover statistic to identify risk factors for insomnia. Shapely Additive Explanations (SHAP) were utilized to visualize the relationship between these potential risk factors and insomnia.

Results

Of the 7,929 patients that met the inclusion criteria in this study, 4,055 (51% were female, 3,874 (49%) were male. The mean age was 49.2 (SD = 18.4), with 2,885 (36%) White patients, 2,144 (27%) Black patients, 1,639 (21%) Hispanic patients, and 1,261 (16%) patients of another race. The machine learning model had 64 out of a total of 684 features that were found to be significant on univariate analysis (P<0.0001 used). These were fitted into the XGBoost model and an AUROC = 0.87, Sensitivity = 0.77, Specificity = 0.77 were observed. The top four highest ranked features by cover, a measure of the percentage contribution of the covariate to the overall model prediction, were the Patient Health Questionnaire depression survey (PHQ-9) (Cover = 31.1%), age (Cover = 7.54%), physician recommendation of exercise (Cover = 3.86%), weight (Cover = 2.99%), and waist circumference (Cover = 2.70%).

Conclusion

Machine learning models can effectively predict risk for a sleep disorder using demographic, laboratory, physical exam, and lifestyle covariates and identify key risk factors.

Join our Newsletter
I agree to have my personal information transferred to AWeber for Neuroscience Newsletter ( more information )
Sign up to receive our recent neuroscience headlines and summaries sent to your email once a day, totally free.
We hate spam and only use your email to contact you about newsletters. You can cancel your subscription any time.