Become an expert in R — Interactive courses, Cheat Sheets, certificates and more!
Get Started for Free

hacide

Half circle filled data


Description

Simulated training and test set for imbalanced binary classification. The rare class may be described as a half circle depleted filled with the prevalent class, which is normally distributed and has elliptical contours.

Usage

data(hacide)

Format

Data represent 2 real features (denoted as x1, x2) and a binary label class (denoted as cls). Positive examples occur in about 2% of the data.

hacide.train

Includes 1000 rows and 20 positive examples.

hacide.test

Includes 250 rows and 5 positive examples.

Data have been simulated as follows:

-

if cls = 0 then (x1, x2)\sim \mathbf{N}_{2} ≤ft(\mathbf{0}_{2}, (1/4, 1) \mathbf{I}_{2}\right)

-

if cls = 1 then (x1, x2)\sim \mathbf{N}_{2} ≤ft(\mathbf{0}_{2}, \mathbf{I}_{2}\right) \cap ≤ft\|\mathbf{x}\right\|^2>4 \cap x_2 ≤q 0

References

Lunardon, N., Menardi, G., and Torelli, N. (2014). ROSE: a Package for Binary Imbalanced Learning. R Jorunal, 6:82–92.

Menardi, G. and Torelli, N. (2014). Training and assessing classification rules with imbalanced data. Data Mining and Knowledge Discovery, 28:92–122.

Examples

data(hacide)
summary(hacide.train)
summary(hacide.test)

ROSE

ROSE: Random Over-Sampling Examples

v0.0-3
GPL-2
Authors
Nicola Lunardon, Giovanna Menardi, Nicola Torelli
Initial release
2014-01-30

We don't support your browser anymore

Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.