brca

Breast Cancer Wisconsin Diagnostic Dataset from UCI Machine Learning Repository


Description

Biopsy features for classification of 569 malignant (cancer) and benign (not cancer) breast masses.

Usage

data(brca)

Format

An object of class list.

Details

Features were computationally extracted from digital images of fine needle aspirate biopsy slides. Features correspond to properties of cell nuclei, such as size, shape and regularity. The mean, standard error, and worst value of each of 10 nuclear parameters are reported, for a total of 30 features.

This is a classic dataset for training and benchmarking machine learning algorithms. The brca list has two components:

  • y. The outcomes. A factor with two levels denoting whether a mass is malignant ("M") or benign ("B").

  • x. The predictors. A matrix with the mean, standard error and worst value of each of 10 nuclear measurements on the slide, for 30 total features per biopsy:

    • radius. Nucleus radius (mean of distances from center to points on perimeter).

    • texture. Nucleus texture (standard deviation of grayscale values).

    • perimeter. Nucleus perimeter.

    • area. Nucleus area.

    • smoothness. Nucleus smoothness (local variation in radius lengths).

    • compactness. Nucleus compactness (perimeter^2/area - 1).

    • concavity. Nucleus concavity (severity of concave portions of the contour).

    • concave_pts. Number of concave portions of the nucleus contour.

    • symmetry. Nucleus symmetry.

    • fractal_dim. Nucleus fractal dimension ("coastline approximation" - 1).
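
The layout of 10 measurements by 3 summary statistics described above can be checked from the column names of brca$x. The following is a minimal sketch, assuming the columns are named with the measurement followed by a `_mean`, `_se`, or `_worst` suffix (for example `radius_mean`). That naming pattern is not stated on this page, so inspect colnames(brca$x) before relying on it. The helper `count_by_stat` is a hypothetical name introduced here for illustration only.

```r
# Hypothetical helper: count columns per summary statistic,
# ASSUMING every column name ends in "_mean", "_se", or "_worst".
count_by_stat <- function(col_names) {
  # Greedy match removes everything up to and including the last underscore
  stat <- sub("^.*_", "", col_names)
  table(factor(stat, levels = c("mean", "se", "worst")))
}

# Only runs if the dslabs package is installed
if (requireNamespace("dslabs", quietly = TRUE)) {
  data(brca, package = "dslabs")
  print(count_by_stat(colnames(brca$x)))
}
```

If the naming assumption holds, each of the three counts should equal 10, matching the 30 features described in Details.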

Source

Examples

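library(dslabs)   # brca ships with the dslabs package
data(brca)
table(brca$y)     # number of benign ("B") and malignant ("M") masses
dim(brca$x)       # rows are biopsies, columns are features
head(brca$x)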
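
A common first step before fitting a classifier is to compare feature averages between the two outcome classes. The sketch below is one possible way to do that in base R and is not part of the package's own examples. The helper `class_means` is a hypothetical name, and it assumes only what the Details section states: x is a numeric matrix and y is a factor with one entry per row of x.

```r
# Hypothetical helper: mean of every feature within each outcome class.
# Returns a matrix with one row per class and one column per feature.
class_means <- function(x, y) {
  stopifnot(is.matrix(x), nrow(x) == length(y))
  # For each column, tapply() averages the values within each level of y
  apply(x, 2, function(feature) tapply(feature, y, mean))
}

# Only runs if the dslabs package is installed
if (requireNamespace("dslabs", quietly = TRUE)) {
  data(brca, package = "dslabs")
  m <- class_means(brca$x, brca$y)
  print(round(m[, 1:3], 2))   # first three features, rows "B" and "M"
}
```

Features whose class means differ widely relative to their spread are natural candidates for a closer look, although this summary alone says nothing about predictive performance.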

dslabs

Data Science Labs

v0.7.4
Artistic-2.0
Authors
Rafael A. Irizarry, Amy Gill
Initial release
