corpus: corpus-package – R documentation


The Corpus Package

Description

Text corpus analysis functions

Details

This package contains functions for text corpus analysis. To create a text object, use the read_ndjson or as_corpus_text function. To split text into sentences or token blocks, use text_split. To specify preprocessing behavior for transforming a text into a token sequence, use text_filter. To tokenize text or compute term frequencies, use text_tokens, term_stats or term_matrix. To search for or count specific terms, use text_locate, text_count, or text_detect.
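The workflow described above can be sketched roughly as follows. This is a minimal illustration, not an excerpt from the package's own examples: it assumes the corpus package is installed, the sample sentences are made up, and the filter settings (dropping punctuation and English stop words) are just one possible choice.

```r
# Assumes the corpus package is installed: install.packages("corpus")
library(corpus)

# Hypothetical sample data, not taken from the package documentation
x <- as_corpus_text(c(
  "The quick brown fox. It jumps over the lazy dog.",
  "A rose is a rose is a rose."
))

# Split each text into sentences; the result has one row per sentence
sents <- text_split(x, units = "sentences")

# Preprocessing behavior: drop punctuation and English stop words
f <- text_filter(drop_punct = TRUE, drop = stopwords_en)

# Token sequences and term frequencies computed under that filter
toks <- text_tokens(x, filter = f)
stats <- term_stats(x, filter = f)
mat <- term_matrix(x, filter = f)

# Search for, count, and detect a specific term (default filter)
hits <- text_locate(x, "rose")
n_rose <- text_count(x, "rose")
has_rose <- text_detect(x, "rose")
```

Passing the same filter object to each tokenizing function keeps the preprocessing consistent across token lists, term statistics, and the term matrix.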

For a complete list of functions, use library(help = "corpus").

Author(s)

Patrick O. Perry

corpus

Text Corpus Analysis

v0.10.2

Apache License (== 2.0) | file LICENSE

Authors

Leslie Huang [cre, ctb], Patrick O. Perry [aut, cph], Finn Årup Nielsen [cph, dtc] (AFINN Sentiment Lexicon), Martin Porter and Richard Boulton [ctb, cph, dtc] (Snowball Stemmer and Stopword Lists), The Regents of the University of California [ctb, cph] (Strtod Library Procedure), Carlo Strapparava and Alessandro Valitutti [cph, dtc] (WordNet-Affect Lexicon), Unicode, Inc. [cph, dtc] (Unicode Character Database)

Initial release
