Post on 17-Dec-2015
Creating Relevance and Reuse withTargeted Semantics
Michael C. DacontaMetadata Program ManagerNovember 16, 2004
2Michael C. Daconta November 16, 2004
Welcome & IntroductionMichael C. Daconta
Author/co-author of 10 Technical books C, C++, Java, XML, Semantic Web
XML experience: First XML project: 1996 Wrote XML Courses: 1998 Lead for the DOD/IC Core Taxonomy Chief Architect of the DIA’s Virtual
Knowledge Base Inventor of the Fannie Mae Electronic
Mortgage Standard http://www.daconta.net
High Five! XML has won!
3Michael C. Daconta November 16, 2004
The Department of Homeland Security
The “Mother of all Integrations”CIO Office and CIO Council leading the
charge… Lee Holcomb, Steve Cooper Martin Smith, Ryan Cast, Lee Smith,
Sean Thrash … many others!
The Information Sharing Imperative!Metadata Center of Excellence
Expose, Standardize and Federate DHS Metadata Working Group
Presidential Orders EO 13356 and HSPD 11
We will aggressively execute this mission!
The Train has left the station! Are you on board?
4Michael C. Daconta November 16, 2004
An “Aggressive” AgendaModeling and the State of Things
Evolution and Revolution
Entity-based XML
XML Profile of FEA DRM
The Three R’s of Semantics
Targeted Semantics
Semantic Bootstrapping
DHS CIO Efforts
Other Efforts
The Challenge
A lot to cover so …Hang On!
5Michael C. Daconta November 16, 2004
Modeling and the State of Things… The “Messy Reality Scale” affects
how you view modeling.My View: There are stable subsets of reality that can be
effectively modeled. We all use internal “maps”. Maps are not supposed to be reality, just
effective.
And … the need is great!
And … Hardware is still improving: GPS, cell-phones and voice recognition will
ratchet up the requirements for “real-time relevance!”
64 Bit Computing
So …
SubjectiveUnpredictable
XML is necessary but not sufficient!
ObjectiveStable
6Michael C. Daconta November 16, 2004
Data Evolution Timeline
Age of Programs
Age of Proprietary
Data
Age of OpenData
Age of Open
Metadata
Age of SemanticModels
Program-Data
GIGO/minis/micros www / Netscape Web services OWL
Text, Office DocsDatabases
(proprietary schema)
HTML,XML
(open schema)
Namespaces,Taxonomies,
RDF
Ontologies&
Inference
1945 -1970 2000 - 20031994 - 20001970 - 1994 2003 -
ProceduralProgramming
Object-OrientedProgramming
Model-DrivenProgramming
“Data is lesslessimportant
than code”
“Data is asasimportantas code”
“Data is moremoreimportant
than code”
7Michael C. Daconta November 16, 2004
Entity Based XMLLessons Learned:
Separate entities from transactions
Lesson: Use RDF data model to design your XML schemas.
Examples:
DDMS
Global Justice XML Data Model (GJXDM)
8Michael C. Daconta November 16, 2004
Entity-Based XML(2) Biometrics Automated Toolset
Army Fielded System (i.e. Centcom)
Functions
Biometrics: Fingerprints, Iris, Facial Recognition
GUID
“Dossiers”XML Driven. Below is “entities.xml”
<ENTITIES> <ENTITY name="Person" tablename="PersonalData" XMLBaseName="FORMS" representativeFields="LNAME;FNAME;MNAME;SVCIDNO;ALERT"> <RELATIONSHIP withentity="Account" relationshipname="Related Account" tablename="Pers_Acct_Junct" entity1="Person" entity1key="PERS_GUID" entity2="Account" entity2key="ACCT_GUID"></RELATIONSHIP>…</ENTITY><!-- other entities Account, Facility, Equipment, Location, etc. … -->…</ENTITIES>
9Michael C. Daconta November 16, 2004
FEA DRM Structure
10Michael C. Daconta November 16, 2004
Revised DRM Structure
Subject Context Security ContextService Context
Information Access
Data Element Description Resource Description
Subject Area
Association
Query ClassAssociation
Resource Class
AssociationData Property
Data Class
Unique Identifiers
Association
Security Class
AssociationSubject ClassAssociation
Input/OutputService Class
Who What When Where Why
Context
Information Exchange
Exchange PayloadAssociation
Exchange Class
Sharing
Description
Structured
Semi-Structured
Unstructured
11Michael C. Daconta November 16, 2004
XML Profile of the DRM
Subject Context Security ContextService Context
Information Access
Data Element Description Resource Description
Who What When Where Why
Context
Information ExchangeSharing
DescriptionStructured
Semi-Structured
Unstructured
XML Topic Map (XTM) Web ServiceEntry (UDDI)
IC InformationSecurity Marking
“Document” XMLSchema
FederatedQuery & RSS
DublinCore/DDMS
XML SchemaTypes
XLINK
XLINK/XPointer
XLINK
12Michael C. Daconta November 16, 2004
The Road to Revolution Done with Graphics Fidelity … shifting to Data Fidelity
Smart Data Continuum
“Smart” = Application Independent, Contextual and enables Inference
Relevance
Reuse
© Microsoft
13Michael C. Daconta November 16, 2004
Revolution (2): Copernican Shift
Apps
Data
We had it all wrong…In the beginning…
Put the “smarts” in the data… watch what happens…
Apps
Apps
Apps
14Michael C. Daconta November 16, 2004
Three R’s of SemanticsRelevance
The intersection of what the user wants and what is available.
Reuse
The repurposing of the smallest semantic chunk of content.
Relation
First-class Associations between things.
© 2003 Stanford Medical Informatics
15Michael C. Daconta November 16, 2004
Relevance: Right Information to the Right Person at the Right Time
User Context&
Requirements
RelevanceContent
& Services
- 5W Search Axes- Taxonomy- Query- Visualization- Question/Answer- Profiles- Feedback Loop
- Entity/Associations- Class/Property- Class/Instance- Predicate/Inferences- System and Domain Metamodels
- Structured- Semi-Structured- Unstructured- Web-Services- RSS
Bottom Line: You have to model what you mean by “Right”
16Michael C. Daconta November 16, 2004
Personal Observations on RelevanceMissing the Mark
Traveling to Las Cruces, New Mexico
Off by one in a known email address
What should have happened
Relevance should enable “What is best for me…”
Context should enable “What I mean versus What I say…” Tucson
Arizona
DenverColorado
AlbuquerqueNew Mexico
Las CrucesNew Mexico
Literal path
Optimal path
17Michael C. Daconta November 16, 2004
Solution Set: XML
Taxonomy
Wordnet
Entity Extraction
Multiple Search Axes: Time, Geography, People, Subject…
3R’s: Reuse Problem: Reusable, Modular
ContentSemantic Chunks
Doctrine, Video, EntitiesDigital production
Workflow / Transactions
Linchpin Granularity
<topic id="topic"><title>Some topic</title><body><p><person>Joe Smith</person> is …</p></body></topic>
people
18Michael C. Daconta November 16, 2004
3R’s: Relation
Standard Associations Example: PartOf versus SubclassOf
Car
Sports Car SedanSUV
Corvette Mustang
Car
Engine Wheel
Transmission Carburetor
19Michael C. Daconta November 16, 2004
Targeted Semantics Precedent:
general natural language recognition is far-off, domain-specific is here today (e.g. airline reservations)
Focus: 5W Search Axes: Who, What, When, Where, Why Entities, Associations, Events Community of Interest modeling
Go with the Stable Subset Precedent: Programming Language Features Stick with Standards
Sequence is Important Semantic Bootstrapping
20Michael C. Daconta November 16, 2004
Semantic Bootstrapping
Shared Identity(Naming & Addressing)
Shared Metamodels(Domain & System)
Shared Business Logic(Services & Rules)
Shared Transactions(Containers & Context)
Targeted Semantics(Definitions & Scope)
21Michael C. Daconta November 16, 2004
Current DHS Efforts Introduction to Metadata
Flash
DHS Taxonomy
Enterprise Metadata Registry
DHS Metadata Working Group
Core XML Schema Types
Standards Adoption Process
DHS Data Strategy
22Michael C. Daconta November 16, 2004
Semantic Web Centralhttp://www.semwebcentral.org
DARPA sponsored
Tools!
Tutorials!
Resources!
23Michael C. Daconta November 16, 2004
Your Challenge
Get Positive! The only security is action!
Get Involved! Semantic Interoperability Community of Practice (SiCOP)
http://www.web-services.gov
Get Going!After the Semantic Web is the Pragmatic Web
Syntax
Pragmatics
Semantics