Post on 09-Feb-2016
description
New technologies used in 2010 Census Round – Polish case study
Janusz DygaszewiczDirector of Central Census BureauGeneva, 30 September – 03 October 2013
Data collection channels in 2010 Census Round
3
•Including spatial data reference registers
Administrative Sources
•CAII (CAWI) – Computer Assisted Internet Interview
Self-enumeration by Internet
•CATI - Computer Assisted Telephone Interview
Telephone Interview
•Registered on hand-held terminals with usage GPS and GIS serviceCAPI - Computer Assisted Personal Interview
Interview with respondents realized by the census enumerator
•CAII - Computer Assisted Internet Interview,•CAPI - Computer Assisted Personal Interview,•CATI - Computer Assisted Telephone Interviewing.
CAxI
CAxI
4
CAXI
Mixed model - a combination of data fromadministrative sources with data acquired from
statistical surveys
Mixed Model for Census
5
– as a direct source of census data (personalisation of questionnaires)
– to create:
• compilations of buildings, dwellings and persons list
• an address-residence register
• a sampling frame
The use of administrative sources in census
6
Data from administrative systems was used in the census:
Census architecture - The IT Census System
For the purposes of census design and conduction, the CentralStatistical Office of Poland implemented the IT Census System(ISS)
The ISS integrated various technologies (from Internet data collection portal, applications installed on mobile terminals, through applications managing field works and assisting in telephone interviews, to specialist microdata bases, data warehouses up to analytical and reporting tools)
7
Register A
Register B
Register C
RegisterN
Sample Survey
Converted Register A
Converted Register B
Converted Register C
Converted Register N
Administrative Sources including spatial dataReference Registers
Edition & Normalization
Integrated MicroData
BASE
Data Integration(Estimation, Calibration)
Anonymisation
Un-Personalization Personal Data Recoding Table
MicroData BASE
for statistical
processing
ImputationEstimation,Definition,
ClasificationMETADATA
Public METADATA
Dissemination
WeightingEstimation
Geo-SpatialAnalysis
Internet
Janusz Dygaszewicz, 19.06.2007
Analysis
PublicArea
9
XML
TXT
Registry 1
Metadata server
Operational Microdata
Base
Registry 2
Registry nAnalitycalMicrodata
Base
ETL Tools
Portal
CAXI
Data processing infrastructure
XML
FilesStatistical
FilesGolden Record
Metadata MetadataMetadata
SDMX
Questionaries
• The Metainformation Subsystem gathered indispensable metainformation describing data and census processes. The task of the Metainformation Subsystem was to ensure the coherent definition of statistical objects for the OMB and AMB.
• The Operational Micro Data Base (OMB) - system included hardware-system-tool infrastructure (computer hardware, system software, software tools) and applications (computer programs that are the result of programming work). This base enabled the inclusion of data transmitted in electronic form through four informational channels by entities and to conduct further data processing. In the OMB there took place processes connected with the control, correction, and linking of data, up to their complete cleansing. Next, depersonalised data were transferred to the Analytical Microdata Base (AMB).
Census architecture - The IT Census System
10
For the census purposes were implemented the following systems:
• The role of the Analytical Microdata Base is to store depersonalised census data in their final form. In this dataset every type of statistical analyses is carried out to acquire results for publication, i.e. the census products. The AMB allows all the recipients of statistical information to quickly acquire data in the form of tables, aggregates and hypercubes. The AMB system constitutes an analytical and reporting platform that currently enables the statistical preparation of the outcome data from the National Population and Housing Census 2011. The results of analyses in the form of documents, reports and breakdowns are shared with internal and external users.The AMB also allows the calculation of spatial analysis and aggregates available in the Geostatistics Portal as maps (cartograms and cartodiagrams).
Census architecture - The IT Census System
11
Electronic media
CAII method system
12
OBMZKSOnline
questionnaire system
Internet
Online method Offline method
Offline questionnaire
Downloading the application file
Online
Browser
• Identification Used to confirm the identity of the respondent. • Entering identification data in a questionnaire (PIN code –
(PESEL, NIP) first name, last name) or additional authentication qualities (f.ex. a place of birth, mother’s maiden name)
• Establishing a new password which jointly with PESEL was the basis of authentication within 14 days
Self-enumeration by Internet filling the questionnaire by the respondent
13
14
The most significant functionality of Call Center
Hotline
InterviewingArranging visits by
census enumerators
Confirming the identity of the
interviewer/census enumerator
Dedicated APNMobile network
CAPI method system
15
OBMZKSDispatching application
- server -Communication
server
WAN CSO
Dispatching application
- client -
Statistical Office
Map server
Mobile application
Management of a terminal
Cryptographic SIM cardModule GPS
In 2010 Census Round a combination of data coming from administrative registers and geographic sources containing spatial data was linked with statistical data for the first time.
GIS Technology
16
2010 Census Round
registers containing spatial
data
administrative sources
Address point identification system
Spatial address point Identification system
Statistical adress points• address points for
residential buildings
Statistical distribution boundaries• statistical regions• enumeration areas
Spatial data compiled in the statistics today
19
The digital maps based on the GIS technology were used during:
20
community update
pre-census field check
census survey
Enumerator – GIS technology
Map module - GIS– Ortophotomap– Cadastral Data– Assigned Tasks– Started Tasks– Completed Tasks
21
The portal will allow statistical data presentation in form of any spatial unit:
Geostatistics Portalgeo.stat.gov.pl
22
• 5’ grid• 1 km2 grid• administrative division• urban division• statistical division• any other polygon
23
Thematic map – region (NUTS1) level
24
Thematic map – voivodeship (NUTS2) level
25
Thematic map – subregion (NUTS3) level
26
Thematic map – county (LAU1) level
27
Thematic map – municipality (LAU2) level
28
Cartodiagrams- Choropleths and Proportional symbols
Census organisation support systems
• Implementation of the Notification System and the Knowledge Base facilitated effective communication with all members of the census organizations:– In the Notification System, several thematic sections, so-called
projects, were created. Each project encompassed a selected thematic area.
– In the Knowledge Base were gathered up-to-date versions of documents, including instructions, training materials, and the operational schedule.
• To ensure effective management of the remote enumerator instruction process, the m-learning application was employed. It was installed on mobile terminals.
29
Census staff organisation
30
Central Census Office
Census Management CentreCentral Controllers
Cenus Data Processing CentreAdministrators of census systems
Summary
– simultaneous data collection, without paper, from four different channels (i.e. administrative registers, Internet self-enumeration (CAII), direct interviews conducted by census enumerators, using electronic questionnaires (CAPI), and telephone interviews conducted by statistical interviewers (CATI)) was used and implemented on such a large scale for the first time in Europe,
– data from 27 administrative registers and 3 non-administrative systems were effectively integrated,
– paper questionnaires were completely eliminated, and were replaced by ICT solutions,
31
The census in Poland turned out to be innovative project not only countrywide but also worldwide on grounds of the following facts and figures:
Summary
– the use of GIS technology helped to conduct the census preparatory work and an ongoing census process monitoring and give possibility to compile and present census results based on multi-dimensional spatial analyses
– IT Census System comprised a number of solutions ensuring the high level of security of the processed data,
– the modern statistical data processing technologies have been developed – they will have a considerable influence on the methodology of on going and future statistical surveys,
32
Instead of a conclusion
Census in 2002
• 180 thousands of census enumerators
• 120 mln of questionnaires
• 1 000 tons of papers
• At the end shredding census questionnaires
Census 2011
• 18 thousands of census enumerators
• 0 questionnaires
• 0 tons of papers
• ca. 40 mln € less• better data• the more reliable results• Infrastructure for statistical
surveys in the future
33
Central Statistical Office - Poland
Thank you for your attention
Janusz Dygaszewicz Director of Central Census Bureau POLAND