Choose and add multitoken strings based on multitoken categories
Given a multitoken category (e.g., named entity ids), this function finds the most frequently occuring string in this category and adds it as a label for the category
add_multitoken_label( tc, colloc_id, feature = "token", new_feature = sprintf("%s_l", colloc_id), pref_subset = NULL )
tc |
a tcorpus object |
colloc_id |
the data column containing the unique id for multitoken tokens |
feature |
the name of the feature column |
new_feature |
the name of the new feature column |
pref_subset |
Optionally, a subset call, to specify a subset that has priority for finding the most frequently occuring string |
Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.