Tamr Launch with Andy Palmer
-
Upload
tamrinc -
Category
Technology
-
view
150 -
download
1
description
Transcript of Tamr Launch with Andy Palmer
tamr
New tech is great, but the quality and connectedness of enterprise data often sucks
the dirty data secret
scientific freedom
Good for research creativity, bad for data connectivity
scientific freedom
Good for research creativity, bad for data connectivity
the integrated view Collaborative R&D through open data sharing
Good for research creativity, bad for data connectivity
the integrated view Collaborative R&D through open data sharing
the source challengeFifteen thousand strong…and in need of a new approach
scientific freedom
top down integrationNeat, clean…
Neat, clean…and relatively inflexible
top down integration
Neat, clean…and relatively inflexible
The Choice:Ignore itOr start all over!
The Consequences: Missed opportunity Ballooning costs
top down integration
An exponential challenge
the missing capability
Connecting and curating in an automated way
semi-structured data: JSON sources
Embrace the reality of data variety across the entire enterprise
bottom-up curation
Probabilistic approach as primary design pattern — some semantic web mojo
the time has come
1990’s web:probabilistic search and website connection!
2020’s enterprise:probabilistic data source connection & curation
back to the future
Can we remove the ceiling on the number of data sources that can be dynamically integrated?
hypothesis
NEA
®
early production results
15K sources integrated into one view
Tamr unified view
early production results
Over 90% reduction in manual reviews
Records
90%reduction
Unique
Manual ReviewMatched
3% to manually review
Proprietary
Tamr
!key design point!
• Continuous bottom-up/ probabilistic approach Combination of Machine Learning and Expert SourcingIntegrated data and metadata through APIs
•
•
NEA
®