How to create a set of indicator (booleans / onehot ) variables from a categorical (factor) variables in R

Here is an example of a categorical variable (factor in R) .

 data = cbind(data,model.matrix( ~ 0 + user_state, data))

Here user_state is a variable containing 51 values (1 for each state in US).. After the operation, we end up with the data variable containing 51 indicator variables, 1 for each state

Advertisements
This entry was posted in Uncategorized. Bookmark the permalink.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s