Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US...
-
Upload
toby-medlock -
Category
Documents
-
view
215 -
download
0
Transcript of Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US...
![Page 1: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/1.jpg)
Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-
System
Robert M. GrovesUS Census Bureau
![Page 2: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/2.jpg)
5 Observations
1. The difficulties of measuring the busy, diverse, and independent modern society and economy are increasing every year (that is, it costs more money to do the same things the statistical agencies have done for years).
2
![Page 3: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/3.jpg)
2. The demands by business, state, local, and community leaders for timely statistics on their populations are continually increasing.
3
![Page 4: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/4.jpg)
3. New technologies are being invented almost daily that can be used to make it more convenient for the public to participate in these efforts to inform us about the status of the country.
4
![Page 5: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/5.jpg)
4. New digital data resources are being created both from national-state-local government programs, private sector transactions, and internet-related activities.
5
![Page 6: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/6.jpg)
5. Near-term central government budgets are likely to be flat or declining.
6
![Page 7: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/7.jpg)
Profound Conclusion
1. Higher costs
2. More demand for timely statistics
3. New technologies
4. New data resources
5. No new money
Conclusion: current practices are unsustainable
7
![Page 8: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/8.jpg)
Outline
• The sample survey as a scientific instrument
• The rise of “organic” data • Organic data and new statistical
information• A possible future
![Page 9: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/9.jpg)
The Sample Survey
• Sometimes called the most important invention of the social sciences in 20th century
• Challenged on rigidity, blunt nature, lean measurement
• However, the principal source of social science research data
![Page 10: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/10.jpg)
Three Eras of Survey Research
• Pre-1940’s– No common use universe frames for
household populations– No probability sampling– Face to face
![Page 11: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/11.jpg)
• 1940-1990– Area frames RDD frames – Probability sampling & quota– Face to face Telephone– Decline of response rates
![Page 12: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/12.jpg)
• Post 1990– RDD frames ABS Cell – Probability sampling & volunteers– Telephone Web, mail– Decline of response rates
![Page 13: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/13.jpg)
Relative Sizes of Digital Data Production, c.1960
Information Processing
Science
Gov’tStatistics
JournalismPrivate SectorResearch
![Page 14: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/14.jpg)
Relative Sizes of Digital Data Production, 2010
Information Processing
Private SectorResearch
Science
Journalism
Gov’tStatistics
![Page 15: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/15.jpg)
![Page 16: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/16.jpg)
GovernmentPrivate Sector
6,040 Petabytes848
Petabytes
![Page 17: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/17.jpg)
Changes in the Data World
• Digitization of administrative data• Improved record-matching• Continuous time process data
![Page 18: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/18.jpg)
A Self-Monitoring Social and Economic Eco-System
• Organic data– Those produced auxiliary to processes, to
record the process
• Designed Data– Those produced to discover the unmeasured
![Page 19: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/19.jpg)
Examples of Organic Data
• Google searches (Google Flu)• “Scraped” data from websites• Tweets• CCTV, traffic camera data• Retail scanner data• Credit card transaction data• Data.gov
![Page 20: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/20.jpg)
Common Features of Organic Data
• New data sources provide looks at interesting new phenomena
• They tend to be behaviors, not direct measures of internalized states
• The data are near real-time relative to the behaviors measured
• The data tend to be lean in variables• They are grossly incomplete on coverage of
usual target populations
![Page 21: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/21.jpg)
![Page 22: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/22.jpg)
![Page 23: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/23.jpg)
![Page 24: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/24.jpg)
Approaching a Mixed-Source Future from the Survey Side
• In contrast to societies that have register systems, Canada-Mexico-USA might approach a blended data world by building on top of existing surveys
![Page 25: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/25.jpg)
11
ConstructInferential Population
Measurement
Response
Target Population
Sampling Frame
Sample
Validity
MeasurementError
CoverageError
SamplingError
Measurement Representation
Respondents
NonresponseError
Edited Data
ProcessingError
Survey Statistic
![Page 26: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/26.jpg)
11
ConstructInferential Population
Measurement
Response
Target Population
Sampling Frame
Sample
Validity
MeasurementError
CoverageError
SamplingError
Measurement Representation
Respondents
NonresponseError
Edited Data
ProcessingError
Survey Statistic
![Page 27: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/27.jpg)
11
ConstructInferential Population
Measurement
Response
Target Population
Sampling Frame
Sample
Validity
MeasurementError
CoverageError
SamplingError
Measurement Representation
Respondents
NonresponseError
Edited Data
ProcessingError
Survey Statistic
![Page 28: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/28.jpg)
A Vision of the Future
• Multiple modes of data collection/acquisition– Internet behaviors– Administrative records – Internet self-report– Telephone, face-to-face, paper
• Real-time mode switch to “fill-in” missing data
• Real-time estimation
![Page 29: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/29.jpg)
Administrative Records
Frame Variables on Sample
Collection Engine
Internet Self-Report
Telephone Self-Report
Face to Face Self-Report
Mail Self-Report
Decision-Rule Engine
ImputationEngine
EstimationEngine
Ecological attributes of Sample
Nonlinked Digital Data
![Page 30: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/30.jpg)
Administrative Records
Frame Variables on Sample
Collection Engine
Internet Self-Report
Telephone Self-Report
Face to Face Self-Report
Mail Self-Report
Decision-Rule Engine
ImputationEngine
EstimationEngine
Ecological attributes of Sample
Nonlinked Digital Data
![Page 31: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/31.jpg)
Administrative Records
Frame Variables on Sample
Collection Engine
Internet Self-Report
Telephone Self-Report
Face to Face Self-Report
Mail Self-Report
Decision-Rule Engine
ImputationEngine
EstimationEngine
Ecological attributes of Sample
Nonlinked Digital Data
![Page 32: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/32.jpg)
Administrative Records
Frame Variables on Sample
Collection Engine
Internet Self-Report
Telephone Self-Report
Face to Face Self-Report
Mail Self-Report
Decision-Rule Engine
ImputationEngine
EstimationEngine
Ecological attributes of Sample
Nonlinked Digital Data
![Page 33: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/33.jpg)
Attributes of the Vision
• 24-hour cycles on mode-switch, imputation, estimation
• Empirical stopping rules for continued self-report efforts
• Statistical modeling to combine survey data with external, relevant other digital data
• Reduced cost, increased timeliness
![Page 34: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/34.jpg)
Implications for Survey/Census Operations
• Mixed-modes create dynamic workloads• Mixed-modes create more preloaded data
for questionnaire movement• Field work automation becomes more
advantageous• Real-time estimation requires software
integration
![Page 35: Federal Statistics in an Age of a Self-Monitoring Social and Economic Eco-System Robert M. Groves US Census Bureau.](https://reader036.fdocuments.us/reader036/viewer/2022062511/551709d6550346f5558b5279/html5/thumbnails/35.jpg)
Key Questions Facing Social Science
• Will “organic data” replace the designed data of surveys, given their low cost?
• Will new blends of organic data and designed data emerge?
• Will survey researchers blend or will IT masters add “designed data” to organic data?
• Whither “designed data”?