Working with text data in R

beginner
text data

Apply data wrangling and data visualization methods to text, making text mining easier, more effective, and consistent.
Image credit: displayr.com

Description

Text mining is the process of transforming unstructured text into a structured format to prepare it for analysis. The goal of this lesson is to introduce tools that allow us to move from unstructured text to a structured layout to perform analysis.

Prerequisites

  • Basic knowledge of R and RStudio
  • Familiarity with the packages dyplr, tidyr, and ggplot2

Learning Goals

  • Describe what a token is and how they are used
  • Demonstrate how to use stop words
  • Create customized stop words
  • Provide tools to give structure to unstructured text

Duration

2 hours