Smart Data Analysis and Statistics

Publication

Research

Transparent reporting of multivariable prediction models developed or validated using clustered data: TRIPOD-Cluster checklist

Debray TPA, Collins GS, Riley RD, Snell KIE, Van Calster B, Reitsma JB, Moons KGM

The increasing availability of large combined datasets (or big data), such as those from electronic health records and from individual participant data meta-analyses, provides new opportunities and challenges for researchers developing and validating (including updating) prediction models. These datasets typically include individuals from multiple clusters (such as multiple centres, geographical locations, or different studies). Accounting for clustering is important to avoid misleading conclusions and enables researchers to explore heterogeneity in prediction model performance across multiple centres, regions, or countries, to better tailor or match them to these different clusters, and thus to develop prediction models that are more generalisable. However, this requires prediction model researchers to adopt more specific design, analysis, and reporting methods than standard prediction model studies that do not have any inherent substantial clustering. Therefore, prediction model studies based on clustered data need to be reported differently so that readers can appraise the study methods and findings, further increasing the use and implementation of such prediction models developed or validated from clustered datasets.

Full Text

ABOUT THE AUTHORS

Thomas Debray

Founder & Managing Director

ABOUT THE JOURNAL

Journal Name: The BMJ
Journal Abbreviation: BMJ
Twitter
RSS Feed

KEYWORDS

hemodiafiltration hemodialysis treatment effect prediction treatment effect heterogeneity

ALTMETRIC

PUMX

Transparent reporting of multivariable prediction models developed or validated using clustered data: TRIPOD-Cluster checklist