Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.
description
Transcript of Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.
![Page 1: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/1.jpg)
From Question to Query: An Intelligent Strategy for Making Complex Data Accessible to Novice Users
Albert Anderson, Public Data Queries, Inc.
Edward Brent, Idea Works, Inc.
Pawel Slusarz, Idea Works, Inc.
Heather Branton, Public Data Queries, Inc;.
IASSIST 2003 Ottawa Strength in Numbers May 30, 2003
![Page 2: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/2.jpg)
Acknowledgements
PDQ-Expert is based on an integration of:
Qualrus, a qualitative analysis system from The Idea Works;
PDQ-Explore, an information retrieval system from Public Data Queries, with contributions from:
Paul Anderson John Vidolich
Patrick Rady Marc Williams
Lisa Neidert, Consultant
![Page 3: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/3.jpg)
And “Thank You” to NSF and NIH
The Idea Works has developed the Qualrus system for qualitative analysis in part with the support of small business funding from the National Science Foundation (NSF).
Public Data Queries, Inc., has developed PDQ-Explore in part with the support of small business funding from the National Institute of Child Health and Human Development (NICHD) and the National Institute on Aging (NIA) of the National Institutes of Health (NIH).
![Page 4: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/4.jpg)
Homepages
www.ideaworks.com www.pdq.com
![Page 5: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/5.jpg)
Setting the Context
As we grow, we learn to make sense of the world through inductive and deductive processes:– Generalizing from observations to concepts and
relationships;– Then seeing and experiencing the world in
terms of those generalizations. These processes can be enabling or
disabling.
![Page 6: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/6.jpg)
“Makes a Man Proud to Be a Frog” Many years ago, Walt Kelly offered this punch-
line to an old nursery rhyme:– Hi diddle, diddle;
– The cat and the fiddle;
– The dish ran away with the spoon;
– The little dog laughed to see such sport;
– And the cow jumped over the moon.
And the three frogs stood at attention, saluted, and said: “It makes a man proud to be a frog.”
![Page 7: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/7.jpg)
“Seeing Is Believing and Believing Is Seeing”
More recently, Agnes, the creation of T. Cochran and a precocious little girl who can pontificate on any topic, responds to her friend’s suggestion that “a little research might be appropriate” with:
“Research is for the faithless.”
![Page 8: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/8.jpg)
Access to Data Is Not Enough
Intelligent use of data requires that users:– Know data and metadata;
– Know how to manage and analyze data;
– Know how to interpret results;
– Know how to apply results to problems;
– Know the limitations of their data, tools, and selves,
This knowledge comes through experience and the understanding that there are few simple answers to life’s problems.
![Page 9: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/9.jpg)
Making Sense of Data For inexperienced users of complex data sets,
mastering the data and metadata can be a formidable task:– Which data sets are relevant to a given concern? – How “good” are the data?– Are the items acceptable measures of more general
concepts? – What are the nuances and “gotchas?”
Our objective is to speed and augment the acquisition of the experience needed to make informed and wise use of data.
![Page 10: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/10.jpg)
Introduction to PDQ-Expert
Digital representations of data make it possible to have intelligent interactive social science data capable of helping users formulate questions, specify analyses, and interpret results.
This presentation describes our progress on developing an intelligent user interface for the PDQ-Explore information system that strives to achieve some of these objectives.
The interface uses the Idea Works’ Qualrus Intelligent Qualitative Analysis Program to imbed case-based reasoning within a system of logical rules and semantic networks.
![Page 11: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/11.jpg)
Qualrus
Qualrus provides tools for creating a semantic network:
– Concepts, operations, and empirical measureslinked in a network by logical rules.
These are used with case-based reasoning to point the user to relevant examples.
The examples serve as starting points for analysis and re-analysis.
![Page 12: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/12.jpg)
Qualrus Link View
![Page 13: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/13.jpg)
Qualrus Code Editor
![Page 14: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/14.jpg)
Qualrus at WorkWhere do women earn the most money?
![Page 15: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/15.jpg)
Qualrus at Work: EncoreAre whites better off than blacks?
![Page 16: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/16.jpg)
PDQ-Explore The PDQ-Explore information system
combines paralleled high performance processors, data cached in random access memory, and efficient retrieval algorithms to process, in effect, tens or even hundreds of millions of records per second.
Complex queries can be defined and executed in real time to produce tabulations, summary statistics, correlation matrices, and data extracts.
![Page 17: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/17.jpg)
PDQ-Explore at Work--Setup
Where do women earn the most money?
![Page 18: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/18.jpg)
PDQ-Explore at Work--Results
Where do women earn the most money?
![Page 19: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/19.jpg)
PDQ-Explore at Work
Are whites better off than blacks?
![Page 20: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/20.jpg)
PDQ-Explore
Quality of Housing
![Page 21: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/21.jpg)
PDQ-Explore at Work
State-to-State Migration
![Page 22: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/22.jpg)
The PDQ-Expert WWW Interface
The WWW Interface for PDQ-Expert lets users type in a free-form question based on U.S. Census Microdata (IPUMS).
Users enter their question in the “Query” field then click on the “Submit Query” button.
![Page 23: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/23.jpg)
The System’s Understanding of Your Query
The first part of PDQ-Expert’s response to the user is a restatement of the user’s question as understood by the system.
The purpose of this is to help the user see what the program thinks they are asking so they can identify any areas where the program may be going astray.
![Page 24: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/24.jpg)
Refining Your Query
After reviewing what the program thinks the user was asking, the next step is to consider key concepts from their question they may want to change or clarify.
For example, here the original question suggests the concepts, “sex,” “total personal income,” and “1990.”
![Page 25: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/25.jpg)
Modifying the Query to Address Related Concepts
Clicking the “Hide/Show” button associated with a concept shows a list of that concept along with other similar concepts that the user may want to examine.
Users can check additional related concepts or substitute them for the original by unchecking it.
![Page 26: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/26.jpg)
Similar Cases
The next part of the feedback to the user shows a list of previous questions ordered with the ones most like the current displayed at the top of the list.
Clicking the “Hide/Show” button displays details for that previous question.
Users can incorporate elements of previous similar queries into the current one.
![Page 27: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/27.jpg)
User Assessment and Saving of the Result in the Case File
The final step in the CBR process is to assess the user’s satisfaction with the result, then save the resulting query along with the original question and all of its relevant parameters as a case in the database.
“Successful” cases will be given scores that encourage them to be displayed in the future, while “unsuccessful” cases will be used to help avoid repeating past mistakes.
![Page 28: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/28.jpg)
The Database of Previous Cases
Each query, including both the text of the question and all codes associated with the question describing the resulting query, is saved in the database for consideration when the user enters future questions.
![Page 29: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/29.jpg)
Summary
This approach to helping users overcome their lack of knowledge about data and metadata appears to be a fruitful strategy that promises to provide a versatile and powerful interface to census and similar microdata.
The approach has three specific strengths:
![Page 30: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/30.jpg)
(1) Clarifying the Unknown
Users (not just novice users) often do not know precisely what they want to ask, which data sets are relevant and appropriate to their concerns, and the characteristics of the items in the data sets.
Users can be helped and their thinking clarified by viewing examples of similar questions and the queries they generate.
![Page 31: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/31.jpg)
(2) Serving Diverse Users
Novice and experienced users tend to ask very different kinds of questions and to need different kinds of help.
This CBR system provides an effective way to supply diverse users with informative feedback and suggestions.
![Page 32: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/32.jpg)
(3) Quick Implementation along with
Continuing Improvement
This CBR strategy can be put into place relatively quickly.
It provides a framework for continued improvement in the knowledge of the system as new cases are added.
![Page 33: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/33.jpg)
Current Status The development of PDQ-Expert has
continued over the past year. Our focus has been on handling large
semantic arrays more efficiently. We are now working on the link between
the PDQ-Expert Interface and the PDQ-Explore interface and backend.
![Page 34: Albert Anderson, Public Data Queries, Inc. Edward Brent, Idea Works, Inc.](https://reader033.fdocuments.us/reader033/viewer/2022061517/568144e8550346895db1b84d/html5/thumbnails/34.jpg)
Thank You
Albert Anderson, Public Data Queries, Inc.– www.pdq.com
Edward Brent, Idea Works, Inc. – www.ideaworks.com
Pawel Slusarz, Idea Works, Inc. Heather Branton, Public Data Queries, Inc.
IASSIST 2003 Ottawa May 30, 2003