
process_document

Tokenize text using spaCy


Description

Tokenize text using spaCy. The result of tokenization is stored as a Python object. To obtain the token results in R, use get_tokens(). http://spacy.io.
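The round trip described above can be sketched as follows. This is a minimal, hedged example: it assumes spaCy and a language model are installed, and that get_tokens() accepts the marker object returned by process_document() (the exact signatures may differ across spacyr versions).

```r
library(spacyr)
spacy_initialize()                # start the spaCy backend in Python

# Tokenize on the Python side; only a result marker object comes back to R
res <- process_document("This is a sentence.", multithread = FALSE)

# Retrieve the token results into R (per the Description above)
tokens <- get_tokens(res)

spacy_finalize()                  # shut down the Python process when done
```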

Usage

process_document(x, multithread, ...)

Arguments

x

input text

multithread

logical; if TRUE, parallelize the tokenization using spaCy's multithreading

...

arguments passed to specific methods

Value

a result marker object pointing to the tokenization results stored on the Python side; retrieve them in R with get_tokens()

Examples

spacy_initialize()
# spacy_initialize() must report "tag() is ready to run" before the following will work
txt <- c(text1 = "This is the first sentence.\nHere is the second sentence.", 
         text2 = "This is the second document.")
results <- spacy_parse(txt)
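The parsed result can then be inspected like an ordinary R data object, and the backend shut down when finished. A brief follow-on sketch (the column names are those documented for spacy_parse(); output shape may vary by version):

```r
# One row per token, with columns such as doc_id, sentence_id, token_id,
# token, lemma, and pos
head(results)

spacy_finalize()   # release the spaCy Python process
```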

spacyr

Wrapper to the 'spaCy' 'NLP' Library

v1.2.1
GPL-3
Authors
Kenneth Benoit [cre, aut, cph] (<https://orcid.org/0000-0002-0797-564X>)
Akitaka Matsuo [aut] (<https://orcid.org/0000-0002-3323-6330>)
European Research Council [fnd] (ERC-2011-StG 283794-QUANTESS)
Initial release
