Revistes Publicacions URV: SORT - Statistics and Operations Research Transactions> 2019

Automatic regrouping of strata in the goodness-of-fit chi-square test

  • Identification data

    Identifier: RP:3411
    Authors:
    Vidal-Meliá, CarlosVentura-Marco, ManuelRegúlez-Castillo, MartaPérez-Salamero, Juan ManuelNúñez-Antón, Vicente
    Abstract:
    Pearson’s chi-square test is widely employed in social and health sciences to analyse categorical data and contingency tables. For the test to be valid, the sample size must be large enough to provide a minimum number of expected elements per category. This paper develops functions for regrouping strata automatically, thus enabling the goodness-of-fit test to be performed within an iterative procedure. The usefulness and performance of these functions is illustrated by means of a simulation study and the application to different datasets. Finally, the iterative use of the functions is applied to the Continuous Sample of Working Lives, a dataset that has been used in a considerable number of studies, especially on labour economics and the Spanish public pension system.
  • Others:

    URV's Author/s: Vidal-Meliá, Carlos Ventura-Marco, Manuel Regúlez-Castillo, Marta Pérez-Salamero, Juan Manuel Núñez-Antón, Vicente
    Keywords: Goodness-of-fit chi-square test, statistical software, Visual Basic for Applications, Mathematica, Continuous Sample of Working Lives
    Abstract: Pearson’s chi-square test is widely employed in social and health sciences to analyse categorical data and contingency tables. For the test to be valid, the sample size must be large enough to provide a minimum number of expected elements per category. This paper develops functions for regrouping strata automatically, thus enabling the goodness-of-fit test to be performed within an iterative procedure. The usefulness and performance of these functions is illustrated by means of a simulation study and the application to different datasets. Finally, the iterative use of the functions is applied to the Continuous Sample of Working Lives, a dataset that has been used in a considerable number of studies, especially on labour economics and the Spanish public pension system.
    Journal publication year: 2019
    Publication Type: info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/article
  • Keywords:

    Goodness-of-fit chi-square test, statistical software, Visual Basic for Applications, Mathematica, Continuous Sample of Working Lives
  • Documents:

  • Cerca a google

    Search to google scholar