Data comes in many shapes and forms from all kinds of sources. Before any statistical analysis can be done, the data must first be brought into a suitable format.
Advanced Data Transformation covers the most popular ways of transforming data into all kinds of shapes and forms. In R, there are three ecosystems for transforming data: base R, tidyverse, and data.table.
- base R is built into the R language itself
- tidyverse provides many packages for data manipulation, most importantly dplyr and tidyr
- data.table is a highly optimized, in-memory transformation and query interface for tabular data
There is no one-size-fits-all solution to a problem, so in Advanced Data Transformation you will learn how to use the right tool for your data use cases. For each ecosystem, it covers the essentials, including:
- Data Filtering
- Grouping and Aggregating
- and more!
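
To give a flavor of how the three ecosystems compare, here is a minimal sketch of the same filter-then-aggregate task written three ways. The built-in `mtcars` dataset, the `hp > 100` threshold, and the variable names are assumptions chosen purely for illustration. The dplyr and data.table parts run only if those packages are installed.

```r
# Illustrative data: the built-in mtcars dataset (an assumption for this sketch)
cars <- mtcars

# base R: keep cars with more than 100 hp, then average mpg per cylinder count
fast <- cars[cars$hp > 100, ]
base_res <- aggregate(mpg ~ cyl, data = fast, FUN = mean)

# tidyverse (dplyr): the same steps, run only if dplyr is installed
if (requireNamespace("dplyr", quietly = TRUE)) {
  filtered  <- dplyr::filter(cars, hp > 100)
  grouped   <- dplyr::group_by(filtered, cyl)
  dplyr_res <- dplyr::summarise(grouped, mpg = mean(mpg))
}

# data.table: filter, aggregate, and group in a single call, run only if installed
if (requireNamespace("data.table", quietly = TRUE)) {
  library(data.table)
  dt <- as.data.table(cars)
  dt_res <- dt[hp > 100, list(mpg = mean(mpg)), by = cyl]
}

print(base_res)
```

All three variants compute the same group means; they differ mainly in syntax, and in the order and type of the returned rows.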