Data Science - IITGN Event Calendarevents.iitgn.ac.in/2017/CLSTL/wp-content/uploads/... · Data...
Transcript of Data Science - IITGN Event Calendarevents.iitgn.ac.in/2017/CLSTL/wp-content/uploads/... · Data...
1 | Copyright © 2015 Tata Consultancy Services Limited
Data Science
A radical approach for Information Professionals
TCS PUBLIC
Smitha P. | Sukeshini Horannavar Information Resource Center
Tata Consultancy Services Ltd.
2
27% of companies report that they successfully integrate new analytics talent with more traditional data workers.
- Forbes
Data Science – the buzz word !
TCS Public
The ability to take data, to be able to understand it, to process it, to extract value from it, to visualize it, to communicate it, that's going to be a hugely important skill.
- Hal Varian, Google
Businesses will need one million data scientists by 2018.
- IDC
3
Advances in data science are sparking more creative business opportunities – Gartner
TCS Public
4
Applications for data science cropping up in new industries
TCS Public
Healthcare startups are using data science to move ever-closer to personalized medicine and using artificial intelligence to examine images like x-rays and MRIs to diagnose problems quickly and accurately.
Companies in consumer lending are refining how to assess creditworthiness using non-traditional factors like social media networks.
Traffic management agencies are able to use real-time traffic and weather data to predict traffic flows and manage emergency response.
. . .
5
What is Data Science ?
TCS Public
The study of where information comes from, what it represents and how it can be turned into a valuable resource in the creation of business and IT strategies.
Mining large amounts of structured and unstructured data to identify patterns can help an organization rein in costs, increase efficiencies, recognize new market opportunities and increase the organization's competitive advantage.
- CIO Magazine
“ Storytelling
Data Engineering
Business Analysis
DATA SCIENCE
6
History of Data Science
TCS Public
7
Data Scientist - The detective
Collects all evidences (Data)
Identifies the missing links (in the available data)
Establish new relationship with the involved characters (Data sets)
Unlocks the case (insights from the data)
TCS Public
“Data! Data! Data!” he cried impatiently. “I can’t make bricks without clay.”
8
What do they do ?
Use the ability to find and interpret rich data sources
Manage large amounts of data
Overcome constraints around hardware, software, and bandwidth
Merge data sources
Ensure consistency of data sets
Create visualizations to aid in understanding data
Build mathematical models using the data
Present and communicate the data insights / findings to specialists and
scientists in their team and if required to a non-expert audience.
TCS Public
9
The workflow
TCS Public
Write reports
Deploy online
Archive experiment
Share experiment
Explore alternatives
Edit analysis scripts
Debug
Inspect outputs
Execute scripts
Make comparisons
Take notes
Hold meetings
ANALYSIS PREPARATION
Acquire data
Reformat and clean data
REFLECTION
DISSEMINATION
Sou
rce
: Co
mm
un
icat
ion
s o
f th
e A
CM
Data scientist has that unique blend of skills that can both unlock the insights of data and tell a fantastic story via the data.
- DJ Patil & Thomas Davenport
10
The Age Of The Citizen Data Scientist Is Dawning - Gartner
TCS Public
“A person who creates or generates models that use advanced diagnostic analytics or predictive and prescriptive capabilities, but whose primary job function is outside the field of statistics and analytics.” – Gartner
Gartner believes that by 2019, citizen data scientists will surpass data scientists in the amount of advanced analysis produced. “
Citizen data science is a branch of data science that allows users to extract advanced insights from data while not requiring the users to be highly skilled.
11
Librarian Data Scientist
Proficient in data collation
Information search skills
Knowledge of data sources
Identifying the relevant and non-relevant
TCS Public
PR
EPA
RA
TIO
N
Acquire data
Reformat and clean data
Edit analysis scripts
Debug
Inspect outputs
Execute scripts
AN
ALY
SIS
Aware of basic analysis tools
Programming skills
Advanced coding
Analytics tools
Handling unstructured data
Make comparisons
Take notes
Hold meetings REF
LEC
TIO
N
Keen eye for detailing
Observation skills
Enhanced industry knowledge
Problem solving
Write reports
Deploy online
Archive experiment
Share experiment DIS
SEM
INA
TIO
N
Documentation
Archiving and retrieval
Business acumen
Deriving quantitative insights
Technical writing / infographics
12
Bridging the gap
TCS Public
T E C H N I C A L S K I L L S
• Database Management • Data blending • Querying
• Basic descriptive statistics • Advanced analytics • Predictive modeling
• Data visualizations • Report design • Insights presentation
• Big Data analytics • Machine learning • Unstructured data analysis
S O F T S K I L L S
• Curious • Explorative mindset • Devise alternatives
• Effective communication • Diverse audience handling
• Team management • Cross-cultural flexibility
• Industry Knowledge • Business problem-solving
13
Acquiring skills
DST4L
Data Scientist Training for Librarians (DST4L) is an experimental course, started at the Harvard-Smithsonian Center for Astrophysics John G. Wolbach Library and the Harvard Library to train librarians to respond to the growing data needs of their communities. In this hands-on course, librarians learn the latest tools for extracting, wrangling, storing, analyzing, and visualizing data.
edx
Data Science Academy
CS109 Data Science
Coursera
Udacity
TCS Public
14
What's in store ?
TCS Public
Repositioning in the information lifecycle
Meet the expectations of the customers
Embrace new technologies
Indispensable research partner
Contributor to open science
Competitive advantage
15 | Copyright © 2015 Tata Consultancy Services Limited
Thank You