Registry Services Bringing Value to US EPA, States, and Tribes Exchange Network Vendors Meeting...
-
Upload
alexandra-richardson -
Category
Documents
-
view
215 -
download
0
Transcript of Registry Services Bringing Value to US EPA, States, and Tribes Exchange Network Vendors Meeting...
Registry ServicesBringing Value to US EPA, States, and Tribes
Exchange Network Vendors MeetingApril 24, 2007
Cynthia Dickinson EPA/OEI/OIC
Data Standards Branch Chief
2data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
Purpose
To outline a new approach to registries based on Service Oriented Architecture
To show how the registries can be used during project implementation
3data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
Overview
The Future: SOA
Terminology Management Services
Code Set Management Services
Enterprise Architecture Support Services
Quality Assurance Services
Managing Change
Ensuring Success
Appendix A: Data Standards Services
Appendix B: Semantic Vision for the Registries
4data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
The Future: SOA
US EPA and its partners are moving toward seamless machine to machine data transactions to increase accuracy, efficiency, reliability, and reduce cost
Registries support: Terminology management services Code set management and translation services Enterprise architecture services Quality assurance Change management
5data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
Terminology Management Services
Current services Access to environmental terms and definitions Repository and support for stewards managing environmental term sets such as
glossaries, taxonomies (hierarchies), keyword lists, or thesauri (operational April 16) Create new terminology structures (such as glossaries) from existing terms
Future services (Summer 2008) Automated download and update of term sets from Environmental Terminology System
and Services (ETSS) to any system or web page (support for refresh of web-displayed glossaries will be first)
New front end for EPA and partners to retrieve terminology (currently in design phase) Collaborative area and support services for communities of interest developing term lists
and taxonomies Vision
Storage and maintenance of ontologies to support future semantic development in the IT industry
6data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
Code Set Management Services
Current services Easy access to code sets/value sets related to data standards and other
commonly used lists in the Substance Registry System (SRS) and the Facility Registry System (FRS)
As of February 2007 Enhanced SRS allows exchange and management of federal and state chemicals with substance web services for:
Discover and compare (single substance query) Load (submit new substances that are in SRS but not in programmatic
list) Maintain system accuracy (solicit bulk loads to refresh programmatic
systems) Future services
Enhanced user front end for SRS and reporting capability (by November 1) New Environmental Data Registry (EDR) will allow code set management
and mapping for program offices and EN partners Web services for translation between code sets traveling on the Exchange
Network
7data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
Code Set Management Services (cont.)
Business need for managing multiple code set values States and Tribes have their own “environmental interest” values
US EPA has its “environmental interest” values
Use value meaning as connector reduce mappings (Case 3)
State A
Tribe A
State B
Tribe B
Tribe ATribe A
State BState B State AState A
Tribe B Tribe BState C
EPAEPA
EPA
Case 2 – Limited Use Case 3 - SolutionCase 1 - Undesirable
8data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
Cross Reference Between Code Sets
Via Value MeaningValue Meaning for Environmental Interest
(Part of Facility Identification Standard)
US EPA Value Connecticut DEP Value
Facilities classified as a Clean Air Act Stationary Source Major discharger of air pollutants according to the Alabama power decision's definition of a major source or the 1993 EPA Compliance Monitoring Branch Classification Guidance.
Air Major Title V Stationary Sources
Facilities classified as a CAA Stationary Source Minor discharger of air pollutants.
Air Minor GPLPE
Environmental programs that maintain a national emission inventory which characterizes emissions of hazardous air pollutants (HAPS). HAPS, which are also known as air toxics, are defined in Section 112(b) of the 1990 Clean Air Act Amendments.
Hazardous Air Pollutants Inventory
HAPS
The EPCRA program providing the public with knowledge of and access to information regarding the use, storage, production, and release of hazardous chemicals to the environment, and encourages and supports response planning for environmental emergencies.
EPCRA Tier II Reporting
9data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
Enterprise Architecture Support Services
Current services Locate US EPA systems/applications in the Registry of US EPA
Applications and Databases (READ) Access to Exchange Network XML schema (XML Registry)
Future services (late 2008) Find US EPA system/application data dictionaries in EDR Enhanced compare tool will allow developers to find:
Similar data elements across US EPA systems US EPA data elements that match Exchange Network data standards
Vision Service Component Registry and Repository (SCRR)
10data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
Service Component Registry and Repository (SCRR)
An integrated vehicle for outreach and discovery of different types of reusable system components designed by US EPA and Exchange Network partners
Registry (and sometimes repository) for: Web services Reusable software designs and templates Metadata and data services XML data flows and shared schema components Other data flows Data models (conceptual, logical, physical) Reusable pieces of code, in various languages Development models and guidance, such as the Core Reference Model (CRM)
Will leverage existing registries and repositories (UDDI, ENDS) and may add new ones. Requirements effort beginning by June.
11data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
Quality Assurance Services
Quality assurance oversight for registration Improved stewardship processes (built into the new registries) – data
maintained by people that know it best. Oversight by others. Automated quality review of metadata – increasing dependence on
web services Maintenance of:
Authoritative source of chemicals and facility information for US EPA Authoritative US EPA system inventory Comprehensive catalog of XML schema for Exchange Network use
Other related quality services CDX Schematron for program system business rules Generic XML document parsing service
12data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
Managing Change
Shifting from operating registries to delivering services
Transferring stewardship to those responsible for the data while improving quality assurance
Identifying cost recovery/avoidance opportunities
Increasing the emphasis on SOA
Addressing business needs for classifying information
13data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
Ensuring Success
This approach will allow US EPA and its Exchange Network partners to better
– Document and store;
– Identify and locate;
– Translate and use; and
– Enable integration of
environmental information for all users
14data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
Services Enabled by Registries
Service RegistryEDR READ ETSS SRS and
FRSXML/
SCRR
Enterprise Architecture Support
Data Standards
Code Set Management
Terminology Management
Quality Assurance
15data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
Appendix A
Data Standards Services
16data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
Data Standards Services
Development - Collaborative process Standards development procedures and processes enable partners to
develop standards collaboratively. ENLC approves all data standard development initiatives and products. Action teams of subject matter experts are convened to develop the
standards. Public comment is solicited. Subject matter experts also review. Comments
are resolved by action teams Maintenance - Periodic or as needed
Standards are reviewed periodically or when users request changes Some standards may include standardized code sets (these changed as
new codes become available or in response to user requests) ENLC approves changes
17data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
Data Standards Services (cont.)
Access - Information about EN data standards is available as: Text documents on Environmental Data Standards Council (now
superseded by the ENLC) website at http://www.envdatastandards.net Also available as data elements within the US EPA Environmental Data
Registry at http://www.epa.gov/edr Support for Implementation –
ECOS and US EPA staff are available to answer questions about standards and assist developers implementing standards
Implementation guidance is specific to each state, tribe, or US EPA US EPA is offering a training course in Data Standards Implementation (EN
partners are welcome to attend)
18data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
Appendix B
Semantic Vision for Registries
19data sets program system dataapplications
substances value domains facilitiesterms
XMLdata setsmetadata
System of Registries Semantic Vision
XML Registry
Data Element Concepts,
Value Meanings
Environmental Data
Registry(EDR)
Data Dictionaries
ThesauriClassification
Schemes (Taxonomies)
SystemOntologies
Reference and Data Models
Code Sets
Business Area
OntologiesData ElementNames and Definitions Controlled
Vocabularies
Data Standards
Terms
XML Schema
Black – Current CapabilityRed – Future Capability
andRelationships
XML Tags
EnvironmentalTerminology
System and
Services (ETSS)