New Methods for Small Area Estimation with Linkage Uncertainty


Multiple Data Sources, Data Linkage

Dario Briscolini (Sapienza University of Rome) (Speaker)
Loredana Di Consiglio (Istat, Rome)
Brunero Liseo (Sapienza University of Rome)
Andrea Tancredi (Sapienza Università di Roma)
Tiziana Tuoto (Istat, Rome)

In Official Statistics, interest for data integration has been increasingly growing, due to the need of extracting information from different sources. However, the effects of these procedures on the validity of the resulting statistical analyses has been disregarded for a long time.
In recent years, it has been largely recognized that linkage is not an error-free procedure and linkage errors, as false links and/or missed links can invalidate the reliability of estimates even in standard statistical models.
In this paper we consider the general problem of making inference using data that have been probabilistically linked and we explore the effect of potential linkage errors on the production of small area estimates.
We describe the existing methods and propose and compare new approaches both from a classical and from a Bayesian perspective.
We present results based on a pseudo population whose values $Y$ and $X$ are obtained from the survey on Household Income and Wealth, Bank of Italy and the person identifiers coming from the fictitious population census data created for the ESSnet DI 2011.

Only logged in users can see slides when author's permission was given. Please register to have access.