Extract a subset of a corpus
Returns the subset of a corpus whose documents meet the given conditions, including direct
logical operations on docvars (document-level variables). corpus_subset()
works like subset.data.frame(): it uses non-standard evaluation to evaluate
the conditions against the docvars in the corpus.
corpus_subset(x, subset, drop_docid = TRUE, ...)
x | corpus object to be subsetted
subset | logical expression indicating the documents to keep: missing values are taken as false
drop_docid | if
... | not used
corpus object, with a subset of documents (and docvars) selected according to arguments
summary(corpus_subset(data_corpus_inaugural, Year > 1980))
summary(corpus_subset(data_corpus_inaugural, Year > 1930 & President == "Roosevelt"))
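The note that missing values in the subset expression are taken as false can be illustrated with a small self-contained sketch. The toy corpus, its texts, and the docvar names party and year are invented for this illustration and are not part of the package's data; the sketch assumes quanteda is installed.

```r
library(quanteda)

# Hypothetical toy corpus: texts and docvars (party, year) are made up
# for illustration only.
toy <- corpus(
  c(d1 = "First text.", d2 = "Second text.",
    d3 = "Third text.", d4 = "Fourth text."),
  docvars = data.frame(
    party = c("A", "B", "A", NA),
    year  = c(1990, 2000, 2010, 2020)
  )
)

# Docvars are referenced by bare name inside the condition
# (non-standard evaluation), as with subset() on a data frame.
recent_a <- corpus_subset(toy, party == "A" & year > 1995)
docnames(recent_a)

# For d4 the condition evaluates to NA, which is treated as FALSE,
# so d4 is dropped rather than kept or raising an error.
not_b <- corpus_subset(toy, party != "B")
docnames(not_b)
```

If documents with a missing docvar should be retained, the condition has to say so explicitly, for example with is.na(party) | party != "B".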