Data Analysis

R visualization: making boxplot with ggplot

A boxplot is a great visualization tool for the EDA (exploratory data analysis) stage, because it makes outliers easy to spot. In this post I’ll use features such as jitter and annotations to pack even more information into the plot.
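As a rough sketch of the idea (not the code from the post itself), the snippet below layers jittered points and a text annotation on a boxplot. It assumes the ggplot2 package is installed and uses the built-in mtcars dataset purely for illustration; the annotation text and its position are made up.

```r
library(ggplot2)

# mtcars ships with R; it stands in for whatever data you are exploring
p <- ggplot(mtcars, aes(x = factor(cyl), y = mpg)) +
  # colour the outliers so they stand out from the jittered points
  geom_boxplot(outlier.colour = "red") +
  # overlay every observation with a little horizontal noise
  geom_jitter(width = 0.15, alpha = 0.4) +
  # illustrative annotation; x = 1 is the first category on the discrete axis
  annotate("text", x = 1, y = 35, label = "example note") +
  labs(x = "Cylinders", y = "Miles per gallon")

print(p)
```

Because geom_jitter draws all observations, each outlier appears twice: once in red at its true x position and once as a jittered point.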

R string split by pattern

Raw data is often messy when you first obtain it. In this article, I’ll give some examples of how to split a string or a column into multiple strings or multiple columns, using either a regex or a fixed pattern.
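A minimal sketch of both approaches in base R, assuming made-up input values and column names (none of them come from the article):

```r
# Hypothetical messy values, invented for illustration
raw <- c("apple_10-red", "banana_25-yellow")

# Regex split: the character class [_-] matches either "_" or "-"
parts <- strsplit(raw, "[_-]")

# Stack the pieces into a data frame with one column per piece
df <- as.data.frame(do.call(rbind, parts), stringsAsFactors = FALSE)
names(df) <- c("fruit", "count", "colour")
df$count <- as.integer(df$count)

# Fixed-pattern split: fixed = TRUE treats "." literally, not as a regex wildcard
pieces <- strsplit("2020.01.15", ".", fixed = TRUE)[[1]]
```

The do.call(rbind, ...) step only works cleanly when every string splits into the same number of pieces; ragged input would need padding first.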