R Class-Conditional Distributions

March 12, 2015

A demo of some basic R capabilities, including plotting class-conditional distributions of variables and adding new columns to a dataset.
This simple example shows some of the capabilities of the Execute R Script module. I use ggplot2 to plot class-conditional distributions for some of the variables in the Adult dataset. I also use data.table to add derived variables to the dataset; specifically, I calculate a feature called "relative age", defined as a person's age relative to the mean age of others in the same occupation. To see the results, visualize the right-hand output of the Execute R Script module.
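
The two steps described above might look roughly like the sketch below. It is not the experiment's actual script: the small table stands in for the Adult dataset, the column names (age, occupation, income) are assumptions, and "relative age" is taken here to mean the difference from the occupation mean, although a ratio would be an equally plausible reading.

```r
library(data.table)
library(ggplot2)

# Hypothetical stand-in for the Adult dataset; the column names are assumptions.
adult <- data.table(
  age        = c(25, 35, 45, 30, 50, 40),
  occupation = c("Sales", "Sales", "Sales",
                 "Tech-support", "Tech-support", "Tech-support"),
  income     = c("<=50K", "<=50K", ">50K", "<=50K", ">50K", ">50K")
)

# Derived feature: age minus the mean age within the same occupation
# (assumed definition of "relative age"). := adds the column in place.
adult[, relative_age := age - mean(age), by = occupation]

# Class-conditional distribution: one density curve of age per income class.
p <- ggplot(adult, aes(x = age, fill = income)) +
  geom_density(alpha = 0.5) +
  labs(title = "Age by income class", x = "Age", y = "Density")

# Printing the plot sends it to the active graphics device.
print(p)
```

Grouping with by = occupation lets data.table compute the per-occupation mean and assign the new column in a single pass, without a separate aggregate-and-merge step.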