Grady Ward's Moby Parts of Speech
A dataset containing a hash lookup of Grady Ward's parts of speech from the Moby project. The words with non-ASCII characters removed.
grady_pos_feature
- A function for augmenting hash_grady_pos
with 3 additional columns: (1) n_pos
- the number of parts of speech
a word has, (2) space
- logical; indicating if a word contains a space,
& (3) primary
- logical; indicating if this is the most likely part of
speech given the word.
data(hash_grady_pos) grady_pos_feature(data)
data |
This should be |
A data frame with 246,691 rows and 3 variables
word. The word.
pos. The part of speech; one of :Adjective
, Adverb
, Conjunction
, Definite Article
, Interjection
, Noun
, Noun Phrase
, Plural
, Preposition
, Pronoun
, Verb (intransitive)
, Verb (transitive)
, or Verb (usu participle)
. Note that the first part of speech for a word is its primary use; all other uses are secondary.
Originally downloaded from: http://icon.shef.ac.uk/Moby
## Not run: library(data.table) hash_grady_pos <- grady_pos_feature(hash_grady_pos) hash_grady_pos['dog'] hash_grady_pos[primary == TRUE, ] hash_grady_pos[primary == TRUE & space == FALSE, ] ## End(Not run)
Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.