Novel Pediatric Height Outlier Detection Methodology for Electronic Health Records via Machine Learning With Monotonic Bayesian Additive Regression Trees. J Pediatr Gastroenterol Nutr 2022 Aug 01;75(2):210-214
Date
06/02/2022Pubmed ID
35641892DOI
10.1097/MPG.0000000000003492Scopus ID
2-s2.0-85135203067 (requires institutional sign-in at Scopus site)Abstract
OBJECTIVE: To create a new methodology that has a single simple rule to identify height outliers in the electronic health records (EHR) of children.
METHODS: We constructed 2 independent cohorts of children 2 to 8 years old to train and validate a model predicting heights from age, gender, race and weight with monotonic Bayesian additive regression trees. The training cohort consisted of 1376 children where outliers were unknown. The testing cohort consisted of 318 patients that were manually reviewed retrospectively to identify height outliers.
RESULTS: The amount of variation explained in height values by our model, R2 , was 82.2% and 75.3% in the training and testing cohorts, respectively. The discriminatory ability to assess height outliers in the testing cohort as assessed by the area under the receiver operating characteristic curve was excellent, 0.841. Based on a relatively aggressive cutoff of 0.075, the outlier sensitivity is 0.713, the specificity 0.793; the positive predictive value 0.615 and the negative predictive value is 0.856.
CONCLUSIONS: We have developed a new reliable, largely automated, outlier detection method which is applicable to the identification of height outliers in the pediatric EHR. This methodology can be applied to assess the veracity of height measurements ensuring reliable indices of body proportionality such as body mass index.
Author List
Sparapani RA, Teng BQ, Hilbrands J, Pipkorn R, Feuling MB, Goday PSAuthors
Rodney Sparapani PhD Associate Professor in the Institute for Health and Equity department at Medical College of WisconsinBi Qing Teng Biostatistician II in the Institute for Health and Equity department at Medical College of Wisconsin
MESH terms used to index this publication - Major topics in bold
Bayes TheoremChild
Child, Preschool
Electronic Health Records
Humans
Machine Learning
ROC Curve
Retrospective Studies