Back to Search Start Over

Machine learning to improve false-positive results in the Dutch newborn screening for congenital hypothyroidism.

Authors :
Stroek, Kevin
Visser, Allerdien
van der Ploeg, Catharina P.B.
Zwaveling-Soonawala, Nitash
Heijboer, Annemieke C.
Bosch, Annet M.
van Trotsenburg, A.S. Paul
Boelen, Anita
Hoogendoorn, Mark
de Jonge, Robert
Source :
Clinical Biochemistry. Jun2023, Vol. 116, p7-10. 4p.
Publication Year :
2023

Abstract

• Machine learning has the potential to improve the positive predictive value of neonatal screening for primary and central congenital hypothyroidism. • Currently, detection of central congenital hypothyroidism is accompanied by a high number of false-positives. • In order of importance, variables determining detection of congenital hypothyroidism were TSH, T4/TBG, (gestational) age, TBG, T4, and age at NBS sampling. • From the current routine screening parameters, T4/TBG is the most important parameter to detect central congenital hypothyroidism with a low positive predictive value. • New screening parameters are needed to improve the detection of central congenital hypothyroidism. The Dutch Congenital hypothyroidism (CH) Newborn Screening (NBS) algorithm for thyroidal and central congenital hypothyroidism (CH-T and CH-C, respectively) is primarily based on determination of thyroxine (T4) concentrations in dried blood spots, followed by thyroid-stimulating hormone (TSH) and thyroxine-binding globulin (TBG) measurements enabling detection of both CH-T and CH-C, with a positive predictive value (PPV) of 21%. A calculated T4/TBG ratio serves as an indirect measure for free T4. The aim of this study is to investigate whether machine learning techniques can help to improve the PPV of the algorithm without missing the positive cases that should have been detected with the current algorithm. NBS data and parameters of CH patients and false-positive referrals in the period 2007–2017 and of a healthy reference population were included in the study. A random forest model was trained and tested using a stratified split and improved using synthetic minority oversampling technique (SMOTE). NBS data of 4668 newborns were included, containing 458 CH-T and 82 CH-C patients, 2332 false-positive referrals and 1670 healthy newborns. Variables determining identification of CH were (in order of importance) TSH, T4/TBG ratio, gestational age, TBG, T4 and age at NBS sampling. In a Receiver-Operating Characteristic (ROC) analysis on the test set, current sensitivity could be maintained, while increasing the PPV to 26%. Machine learning techniques have the potential to improve the PPV of the Dutch CH NBS. However, improved detection of currently missed cases is only possible with new, better predictors of especially CH-C and a better registration and inclusion of these cases in future models. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00099120
Volume :
116
Database :
Academic Search Index
Journal :
Clinical Biochemistry
Publication Type :
Academic Journal
Accession number :
163795501
Full Text :
https://doi.org/10.1016/j.clinbiochem.2023.03.001