Cleaning and Wrangling Data
beginner
data wrangling
It is essential to have good tools to wrangle data to fit our needs. Cleaning data a necessary step before your analysis.
Image credit: tidydatatutor.com
Description
In general, the data we get to work with is not in the format we need to do our analyses. The dplyr
and tidy
R packages provide a fairly complete and extremely powerful set of functions for us to do this reformatting quickly. This lesson aims to provide tools for reformatting data to meet your analysis needs. Learning these tools well will significantly increase your efficiency as an analyst.
Prerequisites
- Basic knowledge of R and RStudio.
- If you are new to R, consider taking our Intro to Programming in R using RStudio lesson.
Learning Goals
- Understand the basics of data wrangling conceptually
- Introduce
dplyr
andtidyr
functions to clean and wrangle data for analysis - Learn about the pipe operator as a tool to pass on argument(s) through a sequence of functions
- Describe the difference between wide vs. long table formats and how to convert between them
Duration
2 hours