Operational Issues : Technical Session 19bUse of technology for field data capture and compilation
Data Capture Technology
description
Transcript of Data Capture Technology
![Page 1: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/1.jpg)
Data Capture TechnologyData Capture Technology
Statistical Centre Of IRAN
Presented by :
MS. SOMAYE AHANGAR
Vice – Presidency forStrategic Planning and Supervision
Statistical Centre Of Iran
![Page 2: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/2.jpg)
Data Capture TechnologiesData Capture Technologies
Personal Digital Assistant (PDA)
Intelligent Character Recognition (ICR)
![Page 3: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/3.jpg)
PDA TechnologyPDA Technology
Instances of the PDA applications in the SCI Surveys and Censuses
General Advantages Application specification Localization and customization Activities for conducting the survey Limitations
![Page 4: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/4.jpg)
Instance of the PDA application in the SCI Instance of the PDA application in the SCI
Collecting images and multimedia contents on tribes in the Census of Nomadic Tribes of Iran in order to be used in Encyclopedia of Iranian Tribes.
In sampling surveys including monthly and periodical surveys PDAs are frequently used.
![Page 5: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/5.jpg)
General Advantages of PDAGeneral Advantages of PDA
They are convenient tools for Personal Information Management (PIM).
They offer various kinds of input keyboard (Hardware keyboard, Virtual Keyboard on LCD, Finger- touch screen)
Communication via USB, Infrared and Bluetooth
Accurate, rapid, single- time data entry.
They could lead to elimination of paper questionnaires in surveys/census.
There will be no need for verification step.
PDA application will reduce the need for human resources leading to more efficiency and cost effectiveness in projects
![Page 6: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/6.jpg)
Application specificationApplication specification
The current devices’ model in use are : HP iPAQ HW6945 HP iPAQ HW-6955
Developed software packages consists of 3 sections: The mobile software edition used in PDA device The desktop application for central management system in provinces The desktop application for central management system in the SCI
Developed in .NET technologies and SQL Server 2005 CE
![Page 7: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/7.jpg)
Localization and customizationLocalization and customization Application of GPS devices to take spatial Information
Assignment of sample working scopes to enumerators in monthly survey is dynamic. It should be mentioned that this dynamic process may causes some problems and concerns.
Measuring and evaluating spent time in each place under enumeration.
Simultaneous error validations to maintain data consistency.
Flexible navigation tool enabling browsing between different pages for filling their blank fields.
Possibility of data transmission thought internet
![Page 8: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/8.jpg)
Activities for conducting the surveyActivities for conducting the survey
Collecting Data by interviewers
Moving collected data from devices to central management system in provinces
Reviewing and verifying data by experts of provinces
Sending database files (.sdf files) by data communication from provinces to the SCI
Converting and processing received files
Generate new .sdf file for using in next period
![Page 9: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/9.jpg)
LimitationsLimitations Requires trained staff
In order to find direction and paths by using digital geographical maps, a more comprehensive database of maps of roads is necessary which itself requires updating the current data.
For using GSM features proper coverage of mobile communication network is necessary in all areas under enumeration.
Hardware problems like short battery life of some PDAs and their failure to operate properly as well as problem in synchronizing of PDA with pc were among other problems.
a comprehensive management systems– particularly in monthly surveys- is necessary to process the received files in timely manner in order to generate the required data file for next month. Sometimes, time constraint is a major problem.
![Page 10: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/10.jpg)
ICR TechnologyICR Technology
Overview Top Operational Reasons Accuracy ICR Stages Instances of the ICR usages in the SCI
Surveys and Censuses Application specification and consideration The Strength Points using ICR in the SCI
![Page 11: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/11.jpg)
ICR Overview ICR Overview
ICR is an advanced automatic data entry system
Captures the images of document forms
Reads handwritten, machine-printed, barcodes and other data fields
The Interpret engines convert the precious data of the images into digital format
Recognizes printed letters with one letter in each box
![Page 12: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/12.jpg)
Top Operational ReasonsTop Operational Reasons
Cost effectiveness
Improved accuracy
Shorter cycle time
Improved work distribution (skill, location)
Audit trails and reporting and tracking tools
Data privacy
![Page 13: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/13.jpg)
How Accurate? It dependsHow Accurate? It depends
Quality of incoming documents is a very critical issue depending on :
Form design Scan
Original vs. photocopy Dropout vs. non-dropout
![Page 14: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/14.jpg)
ICR StagesICR Stages
Template/Form creation
Image capture & scan
Image pre-processing
Batch processing
Recognition and data extraction
Data validation and Review
![Page 15: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/15.jpg)
ICR Process SchemaICR Process Schema
![Page 16: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/16.jpg)
Character VerificationCharacter Verification
![Page 17: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/17.jpg)
Field VerificationField Verification
![Page 18: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/18.jpg)
Instance of the ICR usagesInstance of the ICR usages
Conducting a pilot census in some provinces in 2005
Conducting 2006 National Population and Housing Census
Using it as a major data capture tool in Iranian Households Economic Data Collection Project in 2008
![Page 19: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/19.jpg)
Application specification & considerationApplication specification & consideration
Various kind of forms with different layouts and contents
High complex structure of Farsi writing in comparison to Latin
Scan forms for about 45 pages per-minute for double sides in A3 size forms with 200 DPI resolution
Distribute the tasks using completely automated management systems
Developed modules for rule and batch editing, within the ICR software packages
![Page 20: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/20.jpg)
Batch & Rule VerificationBatch & Rule Verification
![Page 21: Data Capture Technology](https://reader038.fdocuments.us/reader038/viewer/2022110214/56815d29550346895dcb207f/html5/thumbnails/21.jpg)
The Strength PointsThe Strength Points
Using the product of the same company’s software in 2006 Census, which was used in 2005 Pilot Census because of its accuracy and reliability.
Improving quality of the developed software based on feedbacks and experiences gained from the Pilot Census and experimental versions.
A good practice in designing and preparing of the questionnaire forms
A good practice in offering an acceptable estimation of scanning and reading times.