Spam E-mail Data
The data consist of 4601 email items, of which 1813 items were identified as spam.
spam7
This data frame contains the following columns:
total length of words in capitals
number of occurrences of the \$ symbol
number of occurrences of the ! symbol
number of occurrences of the word ‘money’
number of occurrences of the string ‘000’
number of occurrences of the word ‘make’
outcome variable, a factor with levels
n
not spam,
y
spam
George Forman, Hewlett-Packard Laboratories
These data are available from the University of California at Irvine Repository of Machine Learning Databases and Domain Theories. The address is: http://www.ics.uci.edu/~Here
require(rpart) spam.rpart <- rpart(formula = yesno ~ crl.tot + dollar + bang + money + n000 + make, data=spam7) plot(spam.rpart) text(spam.rpart)
Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.