Data Prep 101
Alex Ziko, Data Analyst
James Cousins, Senior Statistical Analyst
January 15th, 2019
Itinerary
• Who We Are
• Define Data Prep
• Data Prep Challenges
• Overcoming the Challenges
• Software Demonstration
• Upcoming Events
About Rapid Insight
Founded in 2002 and headquartered in Conway, NH
Predictive analytics and data preparation software company empowering professionals of all skill levels to turn raw data into actionable insights
Serving hundreds of customers worldwide, ranging from healthcare to higher education
The Veera platform enables users to easily build predictive models, perform advanced data analysis, and share insights
Code-free (but code-friendly) self-service analytics platform
Meet Your Presenters
Alexander Ziko
Data Analyst and Customer Success Manager
As a Data Analyst and Customer Success Manager, Alex works closely with customers to help them understand their data using the Veera platform. Alex holds a B.S. from Green Mountain College and an MBA from Franklin Pierce University. When not in the office, Alex is usually found on the coast of Maine or in the mountains of New Hampshire.
James Cousins
Senior Statistical Analyst
As a Senior Statistical Analyst, James works directly with organizations bringing data to bear in decision-making, building analytic capacity along the way. His work has involved hundreds of organizations, from a single analyst to teams of more than ten. James holds a B.S. in Mathematics from Dickinson College, and is pursuing his M.S. in Data Analytics from Johnson and Wales University.
Data Prep /da·ta prep/ [noun]
1. The process of transforming data from its raw form into information that is useful for reporting, analysis, and predictive analytics
2. The duty that often occupies 80% of a data analyst’s workload
Projects Reliant on Data Prep
• Reporting
• Predictive Modeling
• Ad Hoc Analysis
Data Prep Challenges
• Disparate Data Sources
• Messy Data Entry
• Manual Processes
• Varying Expertise
The Ideal Data Analyst Toolkit
• Data Access
• Intuitive Tools
• Scheduled Processes
Merging
When your data is scattered across multiple datasets, merging allows you to combine the relevant parts of those data sources to create a new dataset.
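Outside a drag-and-drop tool, a merge might be sketched in plain Python as below. This is only an illustration, not how the Veera platform implements it; the sample data, the `student_id` key, and the `left_merge` helper are all hypothetical, and the sketch assumes the key is unique in the right-hand dataset.

```python
# Hypothetical sample data: two sources that share a student_id key.
students = [
    {"student_id": 1, "name": "Ana"},
    {"student_id": 2, "name": "Ben"},
    {"student_id": 3, "name": "Cy"},
]
gifts = [
    {"student_id": 1, "gift_total": 250},
    {"student_id": 3, "gift_total": 75},
]


def left_merge(left, right, key):
    """Keep every left row; add the right row's fields when the key matches.

    Assumes `key` is unique in `right` (a later duplicate would win).
    """
    lookup = {row[key]: row for row in right}
    return [{**row, **lookup.get(row[key], {})} for row in left]


merged = left_merge(students, gifts, "student_id")
```

Rows without a match (here, student 2) are kept but gain no new fields, which is the behavior of a left join.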
Appending
Stacking two datasets to create one larger dataset is called appending. When appending data, the datasets typically contain the same (or very similar) fields.
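As a rough sketch of appending in plain Python (the datasets and the `append_rows` helper are hypothetical), rows are stacked and any field missing from one dataset is filled with `None`:

```python
# Hypothetical term extracts with nearly identical fields.
fall = [{"id": 1, "term": "Fall", "credits": 15}]
spring = [{"id": 1, "term": "Spring", "credits": 12, "advisor": "Lee"}]


def append_rows(*datasets):
    """Stack datasets; fields missing from a row are filled with None."""
    # Collect the union of field names in first-seen order.
    fields = []
    for dataset in datasets:
        for row in dataset:
            for name in row:
                if name not in fields:
                    fields.append(name)
    return [
        {name: row.get(name) for name in fields}
        for dataset in datasets
        for row in dataset
    ]


stacked = append_rows(fall, spring)
```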
Filtering
By filtering a dataset, you are narrowing it down to just a specific group of records.
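A filter can be illustrated with a single list comprehension. The patient records and the age cutoff below are made up for the example:

```python
# Hypothetical patient records.
patients = [
    {"patient_id": "A1", "age": 72},
    {"patient_id": "B2", "age": 41},
    {"patient_id": "C3", "age": 65},
]

# Keep only the records that satisfy the condition (age 65 or older).
aged_65_plus = [row for row in patients if row["age"] >= 65]
```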
DeDuping
To dedupe is to remove duplicates from a dataset. Selection rules can be made to dedupe on specific conditions.
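One possible selection rule is "keep the most recently updated record per key." A minimal sketch, assuming hypothetical contact records and a hypothetical `dedupe_keep_latest` helper:

```python
from datetime import date

# Hypothetical contact records; the first two share an email address.
records = [
    {"email": "a@example.com", "updated": date(2018, 5, 1), "city": "Conway"},
    {"email": "a@example.com", "updated": date(2019, 1, 10), "city": "Portland"},
    {"email": "b@example.com", "updated": date(2018, 11, 3), "city": "Concord"},
]


def dedupe_keep_latest(rows, key, order_by):
    """Keep one row per key value: the one with the largest order_by value."""
    best = {}
    for row in rows:
        current = best.get(row[key])
        if current is None or row[order_by] > current[order_by]:
            best[row[key]] = row
    return list(best.values())


deduped = dedupe_keep_latest(records, key="email", order_by="updated")
```

Other rules (keep the first record seen, keep the most complete record) would only change the comparison inside the loop.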
Data Cleansing
To cleanse a column is to edit or replace values within the column's cells.
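For example, messy state entries might be standardized as sketched below. The `STATE_FIXES` lookup and the `cleanse_state` helper are hypothetical, and a real project would need a fuller mapping:

```python
# Hypothetical lookup of known variants to a standard code.
STATE_FIXES = {
    "n.h.": "NH",
    "new hampshire": "NH",
    "nh": "NH",
    "maine": "ME",
    "me": "ME",
}


def cleanse_state(value):
    """Trim whitespace, treat blanks as missing, and standardize known variants."""
    if value is None:
        return None
    trimmed = value.strip()
    if not trimmed:
        return None
    # Unknown values pass through unchanged (apart from trimming).
    return STATE_FIXES.get(trimmed.lower(), trimmed)


raw_states = [" NH", "New Hampshire", "n.h.", "Maine ", "", "VT"]
clean_states = [cleanse_state(value) for value in raw_states]
```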
Renaming
Renaming allows you to enter a new name for your columns.
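In code, renaming amounts to mapping old column names to new ones. A small sketch with made-up column names and a hypothetical `rename_columns` helper:

```python
# Hypothetical export with terse source-system column names.
rows = [{"STU_ID": 1, "LNAME": "Ziko"}, {"STU_ID": 2, "LNAME": "Cousins"}]


def rename_columns(rows, mapping):
    """Return new rows with columns renamed; unmapped columns keep their names."""
    return [
        {mapping.get(name, name): value for name, value in row.items()}
        for row in rows
    ]


renamed = rename_columns(rows, {"STU_ID": "student_id", "LNAME": "last_name"})
```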
Transforming
To transform a column is to perform an operation that creates a new outcome. This could be a new variable entirely, or a different version of the original column.
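Both kinds of transformation can be sketched together: a new variable derived from a column, and a different version of the column itself. The GPA data, the band cutoffs, and the `transform` helper are illustrative assumptions only:

```python
# Hypothetical student records.
students = [
    {"name": "Ana", "gpa": 3.84},
    {"name": "Ben", "gpa": 2.61},
    {"name": "Cy", "gpa": 1.97},
]


def transform(row):
    """Add a banded variable and a rounded version of the gpa column."""
    gpa = row["gpa"]
    # New variable entirely: a category derived from the numeric value.
    band = "high" if gpa >= 3.5 else "mid" if gpa >= 2.5 else "low"
    # Different version of the original column: rounded to one decimal place.
    return {**row, "gpa_band": band, "gpa_1dp": round(gpa, 1)}


transformed = [transform(row) for row in students]
```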
Aggregating
Aggregating allows you to select specific variables and calculate summary statistics.
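A minimal group-and-summarize sketch follows. The gift data and the `aggregate` helper are hypothetical, and count, sum, and mean stand in for whatever summary statistics a project needs:

```python
from collections import defaultdict
from statistics import mean

# Hypothetical gift records.
gifts = [
    {"region": "NH", "amount": 100},
    {"region": "NH", "amount": 300},
    {"region": "ME", "amount": 50},
]


def aggregate(rows, group_by, value):
    """Group rows by one field and summarize another field per group."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[group_by]].append(row[value])
    return {
        group: {"count": len(values), "sum": sum(values), "mean": mean(values)}
        for group, values in groups.items()
    }


summary = aggregate(gifts, group_by="region", value="amount")
```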
Transposing
By transposing, you can turn your rows into columns (and your columns into rows).
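For a table stored as a list of rows, Python's built-in `zip` can illustrate a transpose. The enrollment figures below are invented for the example:

```python
# Hypothetical table: each inner list is one row.
table = [
    ["metric", "2017", "2018"],
    ["applicants", 1200, 1350],
    ["enrolled", 400, 420],
]

# zip(*table) pairs up the first items of every row, then the second, and so on,
# so each original column becomes a row.
transposed = [list(column) for column in zip(*table)]
```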
Connect
Integrate data in any format, from virtually any source
Prepare
Create step-by-step processes using easy, drag-and-drop visual workflows with no coding required
Analyze
Build and schedule jobs to run automatically, or run on-demand analyses
Share
Write back to databases, create and disseminate reports, publish dashboards to visual analytics tools such as Tableau, or output datasets for predictive modeling
Software Demonstration
Upcoming Events
Expert Tips on the Data Prep Process
January 29, 2 PM ET / 11 AM PT
Join Senior Statistical Analyst James Cousins as he discusses some of the most common data preparation projects. He will explore ways you can make your data tasks more reliable, more accurate, and faster. While these tips benefit all industries, you can expect real use cases from healthcare, higher education, and fundraising.
www.rapidinsightinc.com/blog/webinars/
Questions?