Tokenize_df
Tokenize texts in 'df[text_cols]' in parallel using 'n_workers'
tokenize_df( df, text_cols, n_workers = 6, rules = NULL, mark_fields = NULL, tok = NULL, res_col_name = "text" )
df |
data frame |
text_cols |
text columns |
n_workers |
number of workers |
rules |
rules |
mark_fields |
mark_fields |
tok |
tokenizer |
res_col_name |
res_col_name |
None
Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.