Data Synthesis—Addressing Small Data Problems Faced by Big Data
![Page 1: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/1.jpg)
©2013 Primal Fusion Inc.
Data Synthesis
The Big Problem with Small Data
![Page 2: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/2.jpg)
Treat your customers as individuals.
MASS MARKETS OF INDIVIDUALS
• Media and advertising
• Healthcare and medicine
• Education
• Ecommerce and marketing
• Etc.
![Page 3: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/3.jpg)
The Long Tail of Big Data
Figure (axes: EXPRESSIVENESS, simple schema to complex schema; DATA, small to big):
• Implicit semantics: statistical approaches
• Explicit semantics: ontological approaches
• Hybrid approaches
• Cost-performance barrier
• Mass markets of individuals: statistical methods lose significance; ontological methods prohibitively expensive
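A minimal sketch of why statistical methods lose significance as the data per individual shrinks. It uses the standard normal-approximation confidence interval for a proportion; the 10% rate and the sample sizes are hypothetical values chosen for illustration, not figures from the slides.

```python
import math

def margin_of_error(p_hat: float, n: int, z: float = 1.96) -> float:
    """Half-width of the normal-approximation 95% confidence interval for a proportion."""
    return z * math.sqrt(p_hat * (1.0 - p_hat) / n)

# Hypothetical observed rate of 10%, measured on samples of shrinking size:
# from a mass-market aggregate down to a single individual's handful of events.
sample_sizes = [1_000_000, 10_000, 100, 10]
margins = {n: margin_of_error(0.10, n) for n in sample_sizes}

for n, m in margins.items():
    print(f"n={n:>9,}  estimate=0.10 +/- {m:.4f}")
```

The margin grows as 1/sqrt(n), so at n=10 it is larger than the estimate itself. The normal approximation is itself unreliable at such small n, which only reinforces the point.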
![Page 4: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/4.jpg)
Example: Expertise Search
Source: James Cridland
![Page 5: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/5.jpg)
Statistical Approaches
![Page 6: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/6.jpg)
Manual Approaches
![Page 7: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/7.jpg)
![Page 8: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/8.jpg)
![Page 9: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/9.jpg)
![Page 10: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/10.jpg)
Primal’s approach: Modeling knowledge generation, not modeling knowledge
Natural language: Words + Grammatical rules = Statements and queries
Primal semantics: Atomic semantics + Constructive rules = Semantic representations
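A toy sketch of the "atomic semantics + constructive rules" idea above: representations are generated on demand from a small inventory of atoms rather than stored in a hand-built ontology. Everything here is an assumption made for illustration (the atoms, the `synthesize` rule, and the `matches` test); the slides do not describe Primal's actual rules or data structures.

```python
from itertools import combinations

# Hypothetical atomic concepts; not drawn from Primal's actual data.
ATOMS = ["jazz", "guitar", "teaching", "history"]

def synthesize(atoms, max_size=2):
    """Toy constructive rule: any set of up to max_size distinct atoms forms a concept."""
    concepts = []
    for size in range(1, max_size + 1):
        for combo in combinations(atoms, size):
            concepts.append(frozenset(combo))
    return concepts

def matches(concept, interest_terms):
    """A concept is relevant to an individual if all of its atoms appear in their terms."""
    return concept <= set(interest_terms)

all_concepts = synthesize(ATOMS)

# Hypothetical interest terms for one individual; "cooking" has no atom and is ignored.
personal = [c for c in all_concepts if matches(c, ["jazz", "guitar", "cooking"])]

print(len(ATOMS), "atoms ->", len(all_concepts), "concepts")
print(sorted(sorted(c) for c in personal))
```

The cost lives in the atoms and the rule, not in enumerating every concept by hand, which is the kind of different cost structure the later slides argue for.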
![Page 11: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/11.jpg)
![Page 12: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/12.jpg)
![Page 13: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/13.jpg)
![Page 14: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/14.jpg)
![Page 15: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/15.jpg)
![Page 16: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/16.jpg)
Treat your customers as individuals.
Massive opportunities in truly individualized services, but...
…huge challenges in the long tail of big data.
The cost-performance barrier requires solutions with fundamentally different cost structures.
Primal’s semantic synthesis technology is one such solution.
![Page 17: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/17.jpg)
About Primal
Primal powers the rapid development of personalized and intelligent systems.
Cloud-based data service (DaaS). Software and IP licensing opportunities are available for larger companies.
Professional services available, with expertise in knowledge representation, statistical computing, information retrieval and extraction, database, and cloud computing.
More info: primal.com
![Page 18: Data Synthesis—Addressing Small Data Problems Faced by Big Data](https://reader033.fdocuments.us/reader033/viewer/2022061214/547e4309b4af9faa158b5616/html5/thumbnails/18.jpg)
Contact Info
Peter Sweeney, Founder & President
[email protected]
@petersweeney
Further reading: blog.primal.com