Estimated relvariance components for 3-stage sample
Estimate components of relvariance for a sample design where primary sampling units (PSUs) are selected with probability proportional to size with replacement (ppswr) and secondary sampling units (SSUs) and elements within SSUs are selected via simple random sampling (srs). The input is a sample selected in this way.
BW3stagePPSe(dat, v, Ni, Qi, Qij, m)
dat |
data frame for sample elements with PSU and SSU identifiers, weights, and analysis variable(s). The data frame should be sorted in hierarchical order: by PSU and SSU within PSU.
Required names for columns:
|
v |
Name or number of column in data frame |
Ni |
|
Qi |
|
Qij |
vector of numbers of elements in the population in the sample SSUs |
m |
number of sample PSUs |
BW3stagePPSe
computes the between and within population relvariance components appropriate
for a three-stage sample in which PSUs are selected with varying probabilities and with replacement.
SSUs and elements within SSUs are selected by simple random sampling.
The estimated components are appropriate for approximating the relvariance of the
pwr-estimator of a total when the same number of SSUs are selected within each PSU,
and the same number of elements are selected within each sample SSU.
List with values:
Vpsu |
estimated between PSU unit variance |
Vssu |
estimated second-stage unit variance among SSU totals |
Vtsu |
estimated third-stage unit variance |
B |
estimated between PSU unit relvariance |
W |
estimated within PSU unit relvariance computed as if the sample were two-stage |
k1 |
estimated ratio of |
W2 |
estimated unit relvariance among SSU totals |
W3 |
estimated third-stage unit relvariance among elements within PSU/SSUs |
k2 |
estimated ratio of |
delta1 |
homogeneity measure among elements within PSUs estimated as B^2/(B^2+W^2) |
delta2 |
homogeneity measure among elements within SSUs estimated as W_{2}^2/(W_{2}^2 + W_{3}^2) |
Richard Valliant, Jill A. Dever, Frauke Kreuter
Hansen, M.H., Hurwitz, W.N., and Madow, W.G. (1953, chap. 9, sect. 10). Sample Survey Methods and Theory, Vol.II. New York: John Wiley & Sons.
Valliant, R., Dever, J., Kreuter, F. (2013, sect. 9.4.2). Practical Tools for Designing and Weighting Survey Samples. New York: Springer.
## Not run: # select 3-stage sample from Maryland population data(MDarea.pop) MDpop <- MDarea.pop require(sampling) require(reshape) # has function that allows renaming variables # make counts of SSUs and elements per PSU xx <- do.call("rbind",list(by(1:nrow(MDpop),MDpop$SSU,head,1))) pop.tmp <- MDpop[xx,] Ni <- table(pop.tmp$PSU) Qi <- table(MDarea.pop$PSU) Qij <- table(MDpop$SSU) m <- 30 # no. of PSUs to select probi <- m*Qi / sum(Qi) # select sample of clusters sam <- cluster(data=MDpop, clustername="PSU", size=m, method="systematic", pik=probi, description=TRUE) # extract data for the sample clusters samclus <- getdata(MDpop, sam) samclus <- rename(samclus, c(Prob = "p1i")) samclus <- samclus[order(samclus$PSU),] # treat sample clusters as strata and select srswor of block groups from each # identify psu IDs for 1st instance of each ssuID xx <- do.call("rbind",list(by(1:nrow(samclus),samclus$SSU,head,1))) SSUs <- cbind(PSU=samclus$PSU[xx], SSU=samclus$SSU[xx]) # select 2 SSUs per tract n <- 2 s <- strata(data = as.data.frame(SSUs), stratanames = "PSU", size = rep(n,m), method="srswor") s <- rename(s, c(Prob = "p2i")) # extract the SSU data # s contains selection probs of SSUs, need to get those onto data file SSUsam <- SSUs[s$ID_unit, ] SSUsam <- cbind(SSUsam, s[, 2:3]) # identify rows in PSU sample that correspond to sample SSUs tmp <- samclus$SSU %in% SSUsam$SSU SSUdat <- samclus[tmp,] SSUdat <- merge(SSUdat, SSUsam[, c("p2i","SSU")], by="SSU") # select srswor from each sample SSU n.SSU <- m*n s <- strata(data = as.data.frame(SSUdat), stratanames = "SSU", size = rep(50,n.SSU), method="srswor") s <- rename(s, c(Prob = "p3i")) samclus <- getdata(SSUdat, s) del <- (1:ncol(samclus))[dimnames(samclus)[[2]] %in% c("ID_unit","Stratum")] samclus <- samclus[, -del] # extract pop counts for PSUs in sample pick <- names(Qi) %in% sort(unique(samclus$PSU)) Qi.sam <- Qi[pick] # extract pop counts of SSUs for PSUs in sample pick <- names(Ni) %in% sort(unique(samclus$PSU)) Ni.sam <- Ni[pick] # extract pop counts for SSUs in sample pick <- names(Qij) %in% sort(unique(samclus$SSU)) Qij.sam <- Qij[pick] # compute full sample weight and wts for PSUs and SSUs wt <- 1 / samclus$p1i / samclus$p2i / samclus$p3i w1i <- 1 / samclus$p1i w2ij <- 1 / samclus$p1i / samclus$p2i samdat <- data.frame(psuID = samclus$PSU, ssuID = samclus$SSU, w1i = w1i, w2ij = w2ij, w = wt, samclus[, c("y1","y2","y3","ins.cov", "hosp.stay")]) BW3stagePPSe(dat=samdat, v="y1", Ni=Ni.sam, Qi=Qi.sam, Qij=Qij.sam, m) ## End(Not run)
Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.