(BIG) DATA IN CREDIT SCORING - ADM · Company confidential –Do not distribute without notice ©AE...
Transcript of (BIG) DATA IN CREDIT SCORING - ADM · Company confidential –Do not distribute without notice ©AE...
Company confidential – Do not distribute without notice ©AE 2014 1
(BIG) DATA IN CREDIT SCORING
Value and Approach
Bram Vanschoenwinkel
ADM 04/10/2016
Company confidential – Do not distribute without notice ©AE 2014 2
PASSIONATE DECISION SUPPORT ARCHITECT
Master in Computer Science, Ph.D. in Science (Machine Learning, Data Mining)
6y Management Consulting @ MÖBIUS
5y Information Management & Analytics @ AE
(different sectors: finance, retail, utilities, postal services, industry, insurance, HR services,…)
SOLUTION LEAD ANALYTICS
Responsible for the Analytics offering @ AE
Strategy, presales, recruitment, team lead, project lead
+ inspire other people
To acknowledge the potential and joy of analytics
Company confidential – Do not distribute without notice ©AE 2014 3
In a world where technology is radically changing the way your customers interact with the world, you want to be able to make better decisions faster.
AE conceptualizes, designs and builds intelligent decision support systems to
optimize operational excellence
support the best
customer experience
drive your strategy by enhanced
insights
develop innovating
business models
Company confidential – Do not distribute without notice ©AE 2014 4
Credit Scoring
Possibilities and value introduced by (big) data
Project Approach
Company confidential – Do not distribute without notice ©AE 2014 5
CREDIT SCORING
Credit Risk:
uncertainty or likelihood that a negative event takes place.
Risk management is the broad term to
control and regulate the risk to the extent possible.
In financial terms, a negative event is a loss.
Risk of default, most commonly defined as 90 days past due
Credit is vital to theeconomy,
but not without
RISK
Company confidential – Do not distribute without notice ©AE 2014 6
To be honest, I amnot that
creditworthy
I already have a lot of debt
elsewhere but I am going to try
here
I am extremelycareful, will payoff straight away
I am not too suremyself whether I can pay off my
loan
ADM
CREDIT SCORING
Company confidential – Do not distribute without notice ©AE 2014 7
NOW EOL
We can only usedata generated bythe applicant up to
this point!
PREDICT
Probability of default
1%
67%
20%
11%
78%
CREDIT SCORING
Company confidential – Do not distribute without notice ©AE 2014 8
Credit Scoring
Possibilities and value introduced by (big) data
Project Approach
Company confidential – Do not distribute without notice ©AE 2014 9
POSSIBILITIES INTRODUCED BY (BIG) DATA
InternalTraditional
Same methods used for over decades
VariablesUsually length of relationship with the bankPrevious credit informationTotal amount of resources at the bankStill a lot of subjective interpretation involved to make the final decision
Classification TechniqueLogistic regression
Company confidential – Do not distribute without notice ©AE 2014 10
InternalNon Exploited
POSSIBILITIES INTRODUCED BY (BIG) DATA
New variables that are already available internally
• current and savings account informatione.g. pseudo social network built on credit and debittransactions
• derived variables on account movementse.g. preferred day of week for transactions
• data collected for marketing purposes• unstructured data
e.g. calls to call center, complaints, emails• etc.
“New” classification techniquesSupport Vector Machines, Random Forests, …
Company confidential – Do not distribute without notice ©AE 2014 11
External
POSSIBILITIES INTRODUCED BY (BIG) DATA
Variables sourced externallySocial Media informationKBO sector informationTelecomGovernment
“New” classification techniquesSupport Vector Machines, Random Forests,…
Company confidential – Do not distribute without notice ©AE 2014 12
POSSIBILITIES INTRODUCED BY (BIG) DATA
InternalTraditional
InternalNon
ExploitedExternal
New
classification
techniques
Company confidential – Do not distribute without notice ©AE 2014 13
For the riskiest 5% of thepopulation, we now have an 8 times better chance of pickinga real defaulter than a random
pick
3 times better than the
original traditional model
Lift chart
VALUE INTRODUCED BY (BIG) DATA
Company confidential – Do not distribute without notice ©AE 2014 14
Big improvementin risky segments
VALUE INTRODUCED BY (BIG) DATA
Company confidential – Do not distribute without notice ©AE 2014 15
Credit Scoring
Possibilities and value introduced by (big) data
Project Approach
Company confidential – Do not distribute without notice ©AE 2014 16
Agile Analytics
Iterative and Incremental
cost
value
cost value
PROJECT APPROACH
Company confidential – Do not distribute without notice ©AE 2014 17
Scrum
Was last option valuable?If so, continue. If not, ..
Check next option?
Ability to showcase results at each iteration + check if you are on right track
What went well last sprint?What didn’t?
Ability to redirect at each iteration/sprintAbility to tackle problems before they grow
PROJECT APPROACH
Company confidential – Do not distribute without notice ©AE 2014 18
Profiles involved
Risk Business experts
External Data Scientists
PROGRAM MANAGER
PRODUCT OWNER
SCRUM MASTER
BUSINESS STAKEHOLDER
PROJECT APPROACH
Company confidential – Do not distribute without notice ©AE 2014 19
We live in a data flooded world,
agile is a great way to
scratch the surface
so that we can take a
direction
that is guided by business value
ae.be