Expected Vocabulary Growth by Binomial Interpolation (zipfR)
vgc.interp
computes the expected vocabulary growth curve for
random sample taken from a data set described by the frequency
spectrum object obj
.
vgc.interp(obj, N, m.max=0, allow.extrapolation=FALSE)
obj |
an object of class |
N |
a vector of increasing non-negative integers specifying the sample sizes for the expected vocabulary size is calculated (as well as expected spectrum elements if requested) |
m.max |
an integer in the range 1 … 9, specifying the number of spectrum elements to be included in the vocabulary growth curve (default: none) |
allow.extrapolation |
if |
See the EV.spc
manpage for more information, especially
concerning binomial extrapolation.
Note that the result of vgc.interp
is an object of class
vgc
(a vocabulary growth curve), but its input is an
object of class spc
(a frequency spectrum).
An object of class vgc
, representing the expected vocabulary
growth curves for random samples taken from the data set described by
obj
. Data points will be generated for the specified sample
sizes N
.
spc.interp
computes the expected frequency spectrum for
a random sample by binomial interpolation.
## load the Tiger PP expansion spectrum ## (sample size: about 91k tokens) data(TigerPP.spc) ## binomially interpolated curve TigerPP.bin.vgc <- vgc.interp(TigerPP.spc,(1:100)*910) summary(TigerPP.bin.vgc) ## let's also add growth of V_1 to V_5 and plot TigerPP.bin.vgc <- vgc.interp(TigerPP.spc,(1:100)*910,m.max=5) plot(TigerPP.bin.vgc,add.m=c(1:5))
Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.