Working with text data in R
beginner
text data
Apply data wrangling and data visualization methods to text, making text mining easier, more effective, and consistent.
Image credit: displayr.com
Description
Text mining is the process of transforming unstructured text into a structured format to prepare it for analysis. The goal of this lesson is to introduce tools that allow us to move from unstructured text to a structured layout to perform analysis.
Prerequisites
- Basic knowledge of R and RStudio
- Familiarity with the packages
dyplr
,tidyr
, andggplot2
Learning Goals
- Describe what a token is and how they are used
- Demonstrate how to use stop words
- Create customized stop words
- Provide tools to give structure to unstructured text
Duration
2 hours