Order the rows of a data set randomly
Takes a data set and an optional seed, and returns the data with its observations in random order.
randomise(data, seed = NULL)
data | a matrix, data.frame or data.table |
seed | an integer value |
Some modeling algorithms take the top p percent of the observations as the training set instead of sampling them at random. If the rows happen to be sorted, for example by the response variable, this can lead to skewed predictions. This function addresses that problem by randomly ordering the observations, so that the response variable has roughly the same distribution in the training portion as in the full data, even when the algorithm does not pick training observations randomly.
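The idea can be illustrated with a minimal base R sketch. The name `randomise_sketch` is hypothetical and this is not the package's actual source; it only assumes that the shuffle amounts to permuting row indices, optionally after fixing the seed. It is written for data.frame and matrix inputs only, so data.table handling is left out.

```r
# Hypothetical stand-in for randomise(); NOT the package's implementation.
randomise_sketch <- function(data, seed = NULL) {
  # Fix the random number generator only when a seed is supplied
  if (!is.null(seed)) set.seed(seed)
  # Draw a random permutation of the row indices
  idx <- sample.int(nrow(data))
  # Reorder the rows; drop = FALSE keeps one-column inputs in their class
  data[idx, , drop = FALSE]
}

# Rows sorted by the response: the first half holds only class "a"
df <- data.frame(x = 1:10, y = rep(c("a", "b"), each = 5))
table(head(df$y, 5))

# After shuffling, the first half is a random draw from all rows
shuffled <- randomise_sketch(df, seed = 150)
table(head(shuffled$y, 5))
```

Calling the sketch twice with the same seed gives the same row order, which is the usual reason for exposing a `seed` argument: the shuffle stays reproducible across runs.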
data
of the same class as the input, with the observations in random order
Akash Jain
# A 'data.frame'
df <- data.frame(x = c(1, 2, 3, 4, 5), y = c('a', 'b', 'c', 'd', 'e'))

# Change the order of the observations randomly
dfRan <- randomise(data = df)
dfRan <- randomise(data = df, seed = 150)