1. Calibration and Partial Calibration on Principal Components
- Author
-
Camelia Goga, Hervé Cardot, Muhammad Shehzad, Institut de Mathématiques de Bourgogne [Dijon] (IMB), Centre National de la Recherche Scientifique (CNRS)-Université de Franche-Comté (UFC), Université Bourgogne Franche-Comté [COMUE] (UBFC)-Université Bourgogne Franche-Comté [COMUE] (UBFC)-Université de Bourgogne (UB), Bahauddin Zakariya University (BZU), Institut de Mathématiques de Bourgogne [Dijon] ( IMB ), Université de Bourgogne ( UB ) -Centre National de la Recherche Scientifique ( CNRS ), Bahauddin Zakariya University - BZU (PAKISTAN), Laboratoire Chrono-environnement ( LCE ), and Université Bourgogne Franche-Comté ( UBFC ) -Centre National de la Recherche Scientifique ( CNRS ) -Université de Franche-Comté ( UFC )
- Subjects
[ MATH ] Mathematics [math] ,Statistics and Probability ,multipurpose surveys ,Calibration curve ,Calibration (statistics) ,partial calibration ,01 natural sciences ,010104 statistics & probability ,[MATH.MATH-ST]Mathematics [math]/Statistics [math.ST] ,Estimators ,partial least squares ,ridge regression ,model-assisted estimation ,[ MATH.MATH-ST ] Mathematics [math]/Statistics [math.ST] ,[MATH]Mathematics [math] ,0101 mathematics ,survey sampling ,Selection ,Mathematics ,Remote sensing ,Penalized Calibration ,Regression ,[MATH.MATH-PR]Mathematics [math]/Probability [math.PR] ,Principal component analysis ,Dimension reduction ,Statistics, Probability and Uncertainty ,[ MATH.MATH-PR ] Mathematics [math]/Probability [math.PR] ,variance approximation - Abstract
International audience; In survey sampling, calibration is a popular tool used to make total estimators consistent with known totals of auxiliary variables and to reduce variance. When the number of auxiliary variables is large, calibration on all the variables may lead to estimators of totals whose mean squared error (MSE) is larger than the MSE of the Horvitz-Thompson estimator even if this simple estimator does not take account of the available auxiliary information. We study a new technique based on dimension reduction through principal components that can be useful in this large dimension context. Calibration is performed on the first principal components, which can be viewed as the synthetic variables containing the most important part of the variability of the auxiliary variables. When some auxiliary variables play a more important role than others, the method can be adapted to provide an exact calibration on these variables. Some asymptotic properties are given in which the number of variables is allowed to tend to infinity with the population size. A data driven selection criterion of the number of principal components ensuring that all the sampling weights remain positive is discussed. The methodology of the paper is illustrated, in a multipurpose context, by an application to the estimation of electricity consumption with the help of 336 auxiliary variables.
- Published
- 2017