Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar,...

24
CRIMEWATCH CONSULTING FIRM Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala

Transcript of Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar,...

Page 1: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

CRIMEWATCH CONSULTING FIRM

Presentation 2: Data Warehouse Design Discussion

Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala

Page 2: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Brief Review of Business Needs

“Value You’ll See Consulting” provides decision support services to clientele from various industries.City of Houston – Staffing and

Resource PlanningRealtors – Neighborhood Crime

StatisticsSchool Districts – Land PurchasesBusiness Owners – Location Decisions

Page 3: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.
Page 4: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Brief Review of Business Needs

Additionally, the use of a data warehouse allows our firm to compile and re-assemble raw publicly available crime data into specific decision supporting material tailored to our clients’ information needs.

Page 5: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Tonight’s Discussion Overview

Tonight’s discussions will address the structure of the Data Warehousing system for our consulting firm.Database & Table Structure Discussion

○ Facts HPD Crime Data: June 01 – December 31, 2009

○ Dimensions [Date, Type of Crime, Police Beat, Premises]

Dimensional Modeling Discussion○ Star Schema○ Snowflake: Dimensional Hierarchies

Page 6: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Database and Table Structures

Page 7: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Database & Table Structure Takeaways

The fact table is publicly available data from the City of Houston website.

Dimensional tables are a joint effort by both the City of Houston and our consulting firm.

City of Houston data revealed natural dimensions based on hierarchies found in the HPD organizational chart.

Future Dimensions in the works:○ DimTimeOfDay: Morning, Mid-Day, Evening, Overnight○ DimSceneOfCrime: Based on Premises node, we see a

pattern emerging in that table for rolling up, or drilling down.

Part of the “Data Cleansing Process” involved simple tasks such as changing field names, and removing five orphaned records.

Page 8: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Fact Table: HPD Crime Data Jun – Dec 2009

*The data originates from the Houston Police Department’s OLTP systems.

Page 9: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Fact Table: HPD Crime Data Jun – Dec 2009

*The data originates from the Houston Police Department’s OLTP systems.

Page 10: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Dimension Table: Police Beats

Page 11: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Dimension Table: Police Beats

Page 12: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Dimension Table: DimOffenseTypes

Page 13: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Dimension Table: DimOffenseTypes

Page 14: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Dimension Table: DimDates

Page 15: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Dimension Table: DimDates

Page 16: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Dimension Table: DimPremisesCodes

Page 17: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Dimension Table: DimPremisesCodes

Page 18: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Dimensional Modeling

Page 19: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Snowflake Schema

Page 20: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Snowflake Schema

Page 21: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Dimensional Hierarchies

Page 22: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.

Police Beat Hierarchy

Structure is similar to the HPD organizational chart.Divisions: Treat these as Police Station

Locations, as this trend emerges from the fact tables, and later discovered on the HPD website. (see map)

Districts: A Division can have authority over multiple Districts. (e.g. Airport Division covers Hobby and Bush Airport districts.

Police Beats: A District has jurisdictional authority over many police patrol beats.

Page 23: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.
Page 24: Presentation 2: Data Warehouse Design Discussion Adwait Mulye, Yuga Pawar, Floyd J. Srubar, Vidyasagar Velamala.