Diagnostic accuracy of the WHO tuberculosis treatment decision algorithms for children with presumptive tuberculosis: An individual participant data meta-analysis.

Olbrich L Larsson L Dunbar R Dodd PJ Palmer M Huyen Ton Nu Nguyet M d'Elbée M Hesseling AC Heinrich N Zar HJ Ntinginya NE Khosa C Nliwasa M Verghese VP Bonnet M Wobudeya E Nduna B Moh R Mwanga-Amumpere J Mustapha A Breton G Taguebue JV Borand L Chabala C Marcy O Seddon JA van der Zalm MM
PLoS medicine 2025 Nov ; 22(11); . doi: 10.1371/journal.pmed.1004610. Epub 2025 11 18

Abstract

INTRODUCTION: In 2023, almost 200,000 children under 15 years died from tuberculosis, most without appropriate treatment. Treatment decision algorithms (TDAs), developed to facilitate rapid anti-tuberculosis treatment initiation in children, were recommended by the World Health Organization (WHO) in 2022, conditional on validation in different cohorts and settings. We performed a retrospective external evaluation of WHO TDAs using an individual participant dataset (IPD).

METHODS AND FINDINGS: The IPD comprised four paediatric cohorts, restricted to children with presumptive pulmonary TB  10 years, and including children in high-risk groups (children living with HIV "CLHIV", children with severe acute malnutrition "SAM", and children 2 years). All children in the IPD were retrospectively evaluated using both TDA A (an algorithm including chest X-ray) and TDA B (without chest X-ray), excluding the triage step. The diagnostic accuracy against a composite reference standard (confirmed and unconfirmed tuberculosis versus unlikely tuberculosis) was determined and reported as sensitivities and specificities. Of 1,886 children included (RaPaed-TB: n = 740, Umoya: n = 474, TB-Speed HIV: n = 204, TB-Speed Decentralisation: n = 468), the median age was 2.9 years (interquartile range [IQR]:1.3,5.5), 741 (39.3%) were 2 years, 382 (20.3%) were CLHIV, and 284 (15.1%) had SAM. 281 (14.9%) had confirmed tuberculosis, 672 (35.6%) were classified as unconfirmed tuberculosis (clinically diagnosed, microbiological investigations negative), and 933 (49.5%) as unlikely tuberculosis. For TDAs A and B, algorithm sensitivity was 84.3% (95% CI: 74.8, 90.6) and 90.6% (95% CI: 83.8, 94.7), respectively, with a specificity of 50.6% (95% CI: 30.4, 70.7) and 30.8% (95% CI: 21.5, 42.0), respectively. For TDA A, estimated sensitivity in children in high-risk groups was lower than those with low-risk (83.0%, 95% CI: 79.4%, 86.1%; versus 88.0%, 95% CI: 84.8%, 90.6%), while having a gain in specificity (50.0%, 95% CI: 44.9%, 55.1%; versus 36.6%, 95% CI: 32.7%, 40.7%). Trends were similar for TDA B. As for limitations, most diagnostic tuberculosis studies in children, including two of those included in the IPD, are performed at secondary or tertiary hospitals with higher levels of healthcare and thus the target population might differ somewhat from the IPD, potentially limiting the generalisability of our results.

CONCLUSIONS: This retrospective external evaluation of WHO TDAs in a large IPD shows high sensitivity but sub-optimal specificity for both TDAs, in line with the meta-analyses that generated the algorithms. Prospective studies that evaluate the entire TDA, including triage step are needed. Additionally, the integration of novel diagnostic tools within the TDAs should aim to enhance the accuracy, especially the specificity.

Copyright: © 2025 Olbrich et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.