grep for nonstandard characters
Return the indices of elements of x
containing characters that are not in
standardCharacters.
grepNonStandardCharacters(x, value=FALSE,
standardCharacters=c(letters, LETTERS, ' ',
'.', ',', 0:9, '\"', "\'", '-', '_', '(',
')', '[', ']', '\n'),
... )
x |
character vector in which it is desired to
identify elements containing characters not
in standardCharacters |
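value |
logical: if TRUE, return the elements of x
that contain nonstandard characters; if FALSE
(the default), return their indices
|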
standardCharacters |
Characters to overlook in x |
... |
optional arguments for
|
1. x. <- strsplit(x, ''): convert
the input character vector to a list of
character vectors, one per element of x,
with all(nchar(x.[[i]]) == 1) for i in
1:length(x).
2. sapply(x., ...) to identify all
elements for which any element of x.[[i]] is
not in standardCharacters.
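The two steps above can be sketched as follows. This is an illustration only, not the package source: the name sketchNonStandard is hypothetical, the "..." argument is omitted, and vapply() is used in place of sapply() so that an empty input still yields a logical vector.

```r
# Hypothetical re-implementation for illustration; not the package code.
sketchNonStandard <- function(x, value = FALSE,
    standardCharacters = c(letters, LETTERS, ' ', '.', ',', 0:9,
                           '"', "'", '-', '_', '(', ')', '[', ']', '\n')) {
  # Step 1: split each string into a vector of single characters
  x. <- strsplit(x, '')
  # Step 2: flag elements having any character outside the standard set
  bad <- vapply(x., function(chars) any(!(chars %in% standardCharacters)),
                logical(1))
  idx <- which(bad)
  # Return the offending elements or their indices, as with grep()
  if (value) x[idx] else idx
}

Names <- c('Raul', 'Ra`l', 'Torres,Raul', 'Torres, Raul')
sketchNonStandard(Names)                # index of the element with a backtick
sketchNonStandard(Names, value = TRUE)  # the element itself
```

Testing membership with %in% avoids building a regular expression, so characters such as '[' and ']' in standardCharacters need no escaping.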
an integer vector identifying all elements
of x containing a character not in
standardCharacters.
Spencer Graves
Names <- c('Raul', 'Ra`l', 'Torres,Raul', 'Torres, Raul')
# confusion in character sets can create
# names like Names[2]
chk <- grepNonStandardCharacters(Names)
all.equal(chk, 2)
chkv <- grepNonStandardCharacters(Names, TRUE)
all.equal(chkv, Names[2])