Dario Marrocchelli Insight Demo
-
Upload
dario-marrocchelli -
Category
Data & Analytics
-
view
231 -
download
1
Transcript of Dario Marrocchelli Insight Demo
![Page 1: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/1.jpg)
RISKPREDICTORIdentify health risk before it is too late
Dario Marrocchelli
Insight Health Data Fellow
![Page 2: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/2.jpg)
The problem
Wellness companies do not know which programs will be most effective for a specific population
![Page 3: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/3.jpg)
My solution
RiskPredictor A tool that identifies those people at risk1 and their underlying conditions
1 Risk is defined as an individual’s predicted healthcare cost
![Page 4: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/4.jpg)
The data: unique, rich and messy
Unique de-identified data set from Zakipoint that combines three types of information:
Claims: ICD-9 codes, medical costs, gender, age
Biometric: BMI, BP, cholesterol, A1C, etc. Behavioral: HRA and wellness program
participation
There are about 250,000 rows and 2,000 people in these datasets
![Page 5: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/5.jpg)
Cannot discuss feature engineering
![Page 6: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/6.jpg)
Model performs very well
1 http://us.milliman.com/mara/
Model performance in line with proprietary programs which cost $100,000+ There is room for improvement (more data, more features, etc.)
R2Model
RiskPredictor - Random Forest ACG1 (Commercial) RiskPredictor – Linear (Ridge) MARA1 (Commercial)
20.5%
29.7%
34.4%
57.9%
![Page 8: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/8.jpg)
PhD in Chemistry2006-2010
Postdoctoral Associate2010-2011
Postdoctoral Fellow
2011-2013
Research Scientist & Instructor
2013-Present
Short Bio Fun FactI designed and taught at MIT a course on the Science of Cooking (rated 9/10)
Computational Materials Scientist working on renewable energy
30+ papers, 900+ citations
![Page 9: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/9.jpg)
Extra slides
![Page 10: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/10.jpg)
![Page 11: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/11.jpg)
![Page 12: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/12.jpg)
Model performs very well
1 http://us.milliman.com/mara/
There is room for improvement (more data, more features, etc.)…
… but model performance is in line with proprietary programs which cost $100,000+
R2Model
RiskPredictor - Random Forest ACG1 (Commercial) RiskPredictor – Linear (Ridge) MARA1 (Commercial)
20.5%
29.7%
34.4%
57.9%
![Page 13: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/13.jpg)
![Page 14: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/14.jpg)
No diabetes
Diabetes (59)
(972)
![Page 15: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/15.jpg)
No hypertension
Hypertension (210)
(821)
![Page 16: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/16.jpg)
Data
Claims Biometric Behavioral
![Page 17: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/17.jpg)
Communication with Zakipoint
Ramesh Kumar, CEO Heather Richie,VP Product Management
Several emails Google Hangout
![Page 18: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/18.jpg)
DataUnique data set from zph that combines:
1) Claim information (ICD-9 codes, medical costs, gender, age)2) Biometric information (BMI, BP, cholesterol, A1C, etc.)3) Behavioral (HRA and wellness program participation)
The dataset contains 2k lives and is in csv format (masked)
Obese
Overweight
Normal
Underweight (5)
(245)
(430)
(411)
![Page 19: Dario Marrocchelli Insight Demo](https://reader033.fdocuments.us/reader033/viewer/2022042907/58877ef91a28ab63208b573b/html5/thumbnails/19.jpg)
Algorithm anatomy
Raw data(icd-9 codes, BMI)
1000s diagnostic features(e.g. diabetes, obesity, etc.)
Regression (linear & random forest)
Predicted cost (risk scores)