Convert text to a sequence of words (or tokens).
text_to_word_sequence(
  text,
  filters = "!\"#$%&()*+,-./:;<=>?@[\\]^_`{|}~\t\n",
  lower = TRUE,
  split = " "
)
text | Input text (string). |
filters | Sequence of characters to filter out, such as punctuation. The default includes basic punctuation, tabs, and newlines. |
lower | Whether to convert the input to lowercase. |
split | Separator used to split the text into words (string). |
Returns the words (or tokens) extracted from the text.
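To make the roles of the arguments concrete, here is a minimal base-R sketch that approximates the documented behavior: lowercase the text, replace every character in filters with the split marker, split on that marker, and drop empty tokens. The helper name sketch_text_to_word_sequence is hypothetical and not part of keras; it is an illustration under these assumptions, not the package's implementation, and edge cases may differ from the real function.

```r
# Hypothetical helper that mimics the documented arguments of
# text_to_word_sequence() using base R only (no keras required).
sketch_text_to_word_sequence <- function(text,
                                         filters = "!\"#$%&()*+,-./:;<=>?@[\\]^_`{|}~\t\n",
                                         lower = TRUE,
                                         split = " ") {
  # Optionally lowercase the input.
  if (lower) text <- tolower(text)

  # Break the text and the filter set into single characters.
  chars <- strsplit(text, "", fixed = TRUE)[[1]]
  filter_chars <- strsplit(filters, "", fixed = TRUE)[[1]]

  # Replace every filtered character with the split marker.
  chars[chars %in% filter_chars] <- split
  cleaned <- paste(chars, collapse = "")

  # Split on the marker and drop the empty strings left by repeated markers.
  tokens <- strsplit(cleaned, split, fixed = TRUE)[[1]]
  tokens[nzchar(tokens)]
}

tokens <- sketch_text_to_word_sequence("Hello, World! It's 2024.")
print(tokens)
# expected: "hello" "world" "it's" "2024"
```

The apostrophe survives because it is not in the default filters string. With keras installed and configured, the equivalent call would be text_to_word_sequence("Hello, World! It's 2024.").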
Other text preprocessing:
make_sampling_table(),
pad_sequences(),
skipgrams(),
text_hashing_trick(),
text_one_hot()