Revistes Publicacions URV: SORT - Statistics and Operations Research Transactions> 2019

On the optimism correction of the area under the receiver operating characteristic curve in logistic prediction models

  • Datos identificativos

    Identificador: RP:3412
    Autores:
    Rodríguez-Álvarez, María XoséBarrio, IrantzuIparragirre, Amaia
    Resumen:
    When the same data are used to fit a model and estimate its predictive performance, this estimate may be optimistic, and its correction is required. The aim of this work is to compare the behaviour of different methods proposed in the literature when correcting for the optimism of the estimated area under the receiver operating characteristic curve in logistic regression models. A simulation study (where the theoretical model is known) is conducted considering different number of covariates, sample size, prevalence and correlation among covariates. The results suggest the use of k-fold cross-validation with replication and bootstrap.
  • Otros:

    Autor/es de la URV: Rodríguez-Álvarez, María Xosé Barrio, Irantzu Iparragirre, Amaia
    Palabras clave: Prediction models, logistic regression, area under the receiver operating characteristic curve, validation, bootstrap
    Resumen: When the same data are used to fit a model and estimate its predictive performance, this estimate may be optimistic, and its correction is required. The aim of this work is to compare the behaviour of different methods proposed in the literature when correcting for the optimism of the estimated area under the receiver operating characteristic curve in logistic regression models. A simulation study (where the theoretical model is known) is conducted considering different number of covariates, sample size, prevalence and correlation among covariates. The results suggest the use of k-fold cross-validation with replication and bootstrap.
    Año de publicación de la revista: 2019
    Tipo de publicación: info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/article
  • Palabras clave:

    Prediction models, logistic regression, area under the receiver operating characteristic curve, validation, bootstrap
  • Documentos:

  • Cerca a google

    Search to google scholar