IRS XML Initiatives Sol Safran Enterprise Data Management Organization 21 April 2004

14
1 IRS XML Initiatives Sol Safran Enterprise Data Management Organization 21 April 2004

description

IRS XML Initiatives Sol Safran Enterprise Data Management Organization 21 April 2004. AGENDA. Introduction EDMO Vision & Mission Background Managing IRS XML Standards Managing IRS XML Users – Stakeholders Managing IRS Data. Partnered Organization IRS / PRIME / MITRE. - PowerPoint PPT Presentation

Transcript of IRS XML Initiatives Sol Safran Enterprise Data Management Organization 21 April 2004

Page 1: IRS XML Initiatives Sol Safran Enterprise Data  Management Organization 21 April 2004

1

IRS XML Initiatives

Sol Safran

Enterprise Data Management Organization

21 April 2004

IRS XML Initiatives

Sol Safran

Enterprise Data Management Organization

21 April 2004

Page 2: IRS XML Initiatives Sol Safran Enterprise Data  Management Organization 21 April 2004

2

AGENDA

• Introduction– EDMO Vision & Mission– Background

• Managing IRS XML Standards

• Managing IRS XML Users – Stakeholders

• Managing IRS Data

Page 3: IRS XML Initiatives Sol Safran Enterprise Data  Management Organization 21 April 2004

3

Enterprise Data Management Office – Mission & Vision

Enterprise Data Manager

Data EngineeringWorking Group

(DEWG)

Data ManagementIssues Resolution

Team (DMIRT)

Data Policy & Strategic Planning

Data Administration

Data Architecture

Database Administration

• Provide the necessary infrastructure, policies, standards and tools which provide for common integrated, consistent and effective data management practices• Develop, promote and oversee compliance with policies • Develop and maintain the Enterprise Conceptual and Logical data models

• Coordinate and ensure the optimal use of IRS agency resources and assets in the physical manipulation of data and data artifacts • Develop and implement plans for data transition, data migration, data conversion, COTS data stores, and custom data stores • Facilitate the creation, storage, manipulation, integration, distribution, use and management of enterprise level shared metadata and XML Registry• Support application and engineering projects in developing enterprise compliant data solutions.

• Establish and implement an Enterprise-wide data management program. • Define long term objectives and identify projects and activities that help attain those objectives. • Recruit and maintain project staff and integrate services and deliverables into the overall environment. • Develop a strategic plan, management reports, schedules, finance and budget • Coordinate the support, planning prioritization and monitoring of activities that manage IRS data at the enterprise level to meet the business needs of the agency

Partnered OrganizationIRS / PRIME / MITRE

Page 4: IRS XML Initiatives Sol Safran Enterprise Data  Management Organization 21 April 2004

4

IRS Enterprise Data Management Office – Background

• Data management work and XML standards being performed by PRIME Enterprise Data Management (EDM)– EDM is EDMO support contractor– CSC is PRIME integration contractor for IRS

Systems Modernization– Coalition of CSC, IBM, Northrop Grumann,

and others

Page 5: IRS XML Initiatives Sol Safran Enterprise Data  Management Organization 21 April 2004

5

IRS Enterprise Data Management Office – Background

• XML plays large role in Systems Modernization– Tax returns and forms from external

providers– Storage of tax returns as received, plus

modifications– Inter-system messaging

• IRS had many independent XML initiatives– Inter-operability was jeopardized– Wheels were being re-invented– Each project learned as it went

Page 6: IRS XML Initiatives Sol Safran Enterprise Data  Management Organization 21 April 2004

6

IRS Enterprise Data Management Office – Background

• Tax data in XML– Also increasingly: tax returns are converted to XML and processed/stored – All tax forms must be retained as originally submitted … increasingly in

XML from 3rd party providers and large corporations– Eventually most tax forms and returns will be in XML format in IRS– Tax Year 2004 return projections:

• 1120 (Corp): 6M• 94x (Employment): 30M• 1040: 134M

• Impact of XML– 2004 projections

• 1120 returns (Corp): 100k

• 94x returns (Employment): 29M– XML schemas can amplify data

• Some 1120 schema instances up to 350 MB

Page 7: IRS XML Initiatives Sol Safran Enterprise Data  Management Organization 21 April 2004

7

Managing IRS XML Standards - Benefits

– Improved Systems Interoperability• Common business vocabulary• Common XML message content• Reduce interface development costs• Facilitate re-use and interoperability

– Improved Data Quality• Known sources and locations of data• Contradiction between databases removed• Typing to ensure values

– Improved Taxpayer Interaction• More accurate, timely information

– IRS commitment to states/external developers

Page 8: IRS XML Initiatives Sol Safran Enterprise Data  Management Organization 21 April 2004

8

Managing IRS XML Standards - Goals

• Organize stakeholders• Pull together internal and external knowledge and best practices

– Identify existing industry best practices

– Liaison with external organizations – TIGERS, OASIS, FED XML Working Groups, etc.

– Leverage IRS project knowledge and lessons learned across the IRS

• Provide direction, standards and guidelines– Define XML specific standards and guidelines

– Define appropriate uses of XML

• Plan for XML architecture/infrastructure support (e.g. XML Registry, XML tools, etc.)

Page 9: IRS XML Initiatives Sol Safran Enterprise Data  Management Organization 21 April 2004

9

Managing IRS XML Standards - Process

• Fact finding

• Develop initial standards to assist team

• Conduct workshops

• Important considerations– Maintain enterprise perspective– Try to minimize impact to projects

• Communicate process and findings to stakeholders

• Formalize

• On-going

Page 10: IRS XML Initiatives Sol Safran Enterprise Data  Management Organization 21 April 2004

10

Managing IRS XML Standards – Results & Where Next

• Formalized XML standards & guidelines - ongoing– Enterprise Data Standards & Guidelines– Stakeholder Working groups

• Architectural Issues– What and how to XMLize, and what not to

• E.g., messaging, OLTP vs. reporting/analytics, etc.– Structural commonalities and differences

• XML for messaging vs. processing vs. data storage– Structural variations for different kinds of processing (SAX vs DOM, vs Custom)– Architecting for data sharing, re-use, and interoperability– Architecting for performance and scalability

• Technology/performance implications of data volume amplification

• Standard XML schema based on Enterprise Logical Data Model

• XML Registry

• Organization outreach (TIGERS, OASIS, XML Working Group, etc.)

• Taxonomy

• SCORM

Page 11: IRS XML Initiatives Sol Safran Enterprise Data  Management Organization 21 April 2004

11

Managing IRS XML Users - Stakeholders

• XML Stakeholders:– Large, diverse group (100+)– Major IRS organizations, all levels of projects,

external organizations– Members participate in XML working groups– Recently expanded scope to include

stakeholder presentations

IRS OrganizationsChief CounselElectronic Tax AdminMedia & PublishingTaxpayer AdvocateLMSBTEGEW&IBSMOBSDEUESSecurity ServicesWeb Services

IRS Projects1120 e-FileCADEHR ConnectInternet EINMDAModernized E-FileM-TRDBTREES...

DoD XML RegistryDoT XML standardsFed XML Working GroupMITRENISTPRIME

Other Organizations

Page 12: IRS XML Initiatives Sol Safran Enterprise Data  Management Organization 21 April 2004

12

Managing IRS Data - IRS XML Registry

• Registry benefits

• Registry analysis underway

• Stakeholders involved

• Meta Data – Meta Data Management

– Meta Data Strategy

Page 13: IRS XML Initiatives Sol Safran Enterprise Data  Management Organization 21 April 2004

13

Questions ??

Page 14: IRS XML Initiatives Sol Safran Enterprise Data  Management Organization 21 April 2004

14

Contact Info

Karla Tropea Program Manager, IRS Enterprise Data Management Office

202-283-5976 [email protected]

Sol Safran Senior Data Engineer – XML Lead, Prime Enterprise Data Management

301-429-7543 [email protected]