Directional: dirknn – R documentation

Pricing

Become an expert in R — Interactive courses, Cheat Sheets, certificates and more!

Get Started for Free

Documentation

Directional

dirknn

k-NN algorithm using the arc cosinus distance

Description

It classifies new observations to some known groups via the k-NN algorithm.

Usage

dirknn(x, xnew, k = 5, ina, type = "S", mesos = TRUE, parallel = FALSE, rann = FALSE)

Arguments

`x`	The data, a numeric matrix with unit vectors.
`xnew`	The new data whose membership is to be predicted, a numeric matrix with unit vectors.
`k`	The number of nearest neighbours, set to 5 by default. It can also be a vector with many values.
`ina`	A variable indicating the groups of the data x.
`type`	If type is "S", the standard k-NN algorithm is to be used, else "NS" for the non standard one. See below (details) for more information.
`mesos`	A boolean variable used only in the case of the non standard algorithm (type="NS"). Should the average of the distances be calculated (TRUE) or not (FALSE)? If it is FALSE, the harmonic mean is calculated.
`parallel`	If you want the standard -NN algorithm to take place in parallel set this equal to TRUE.
`rann`	If you have large scale datasets and want a faster k-NN search, you can use kd-trees implemented in the R package "RANN". In this case you must set this argument equal to TRUE.

Details

The standard algorithm is to keep the k nearest observations and see the groups of these observations. The new observation is allocated to the most frequent seen group. The non standard algorithm is to calculate the classical mean or the harmonic mean of the k nearest observations for each group. The new observation is allocated to the group with the smallest mean distance.

Value

A vector including:

`g`	A matrix with the predicted group(s). It has as many columns as the values of k.

Author(s)

Michail Tsagris

R implementation and documentation: Michail Tsagris mtsagris@uoc.gr.

References

Tsagris M. and Alenazi A. (2019). Comparison of discriminant analysis methods on the sphere. Communications in Statistics: Case Studies, Data Analysis and Applications, 5(4), 467–491.

Examples

k <- runif(4, 4, 20)
prob <- c(0.2, 0.4, 0.3, 0.1)
mu <- matrix(rnorm(16), ncol = 4)
mu <- mu / sqrt( rowSums(mu^2) )
da <- rmixvmf(200, prob, mu, k)
nu <- sample(1:200, 180)
x <- da$x[nu, ]
ina <- da$id[nu]
xx <- da$x[-nu, ]
id <- da$id[-nu]
a1 <- dirknn(x, xx, k = 5,  ina, type = "S", mesos = TRUE)
a2 <- dirknn(x, xx, k = 5,ina, type = "NS", mesos = TRUE)
a3 <- dirknn(x, xx, k = 5, ina, type = "S", mesos = FALSE)
a4 <- dirknn(x, xx, k = 5, ina, type = "NS", mesos = FALSE)
b <- vmfda.pred(xx, x, ina)
table(id, a1)
table(id, a2)
table(id, a3)
table(id, a4)

Directional

A Collection of R Functions for Directional Data Analysis

v4.9

GPL-2

Authors

Michail Tsagris, Giorgos Athineou, Anamul Sajib, Eli Amson, Micah J. Waldstein

Initial release

2021-03-26

dirknn

Description

Usage

Arguments

Details

Value

Author(s)

References

See Also

Examples

Directional

We don't support your browser anymore