Become an expert in R — Interactive courses, Cheat Sheets, certificates and more!
Get Started for Free

sim2

Synthetic dataset used in section 5.1.2 of the reference paper.


Description

Dataset used for testing clustering with HMM-VB. The data dimension is 5. Data points were drawn from a 10-component Gaussian Mixture Model. By specific choice of the means, the data contains 10 distinct clusters. For details see the references.

Usage

sim2

Format

A data frame with 5000 rows and 5 variables. Last column contains ground truth cluster labels.

References

Lin Lin and Jia Li, "Clustering with hidden Markov model on variable blocks," Journal of Machine Learning Research, 18(110):1-49, 2017.


HDclust

Clustering High Dimensional Data with Hidden Markov Model on Variable Blocks

v1.0.3
GPL (>= 2)
Authors
Yevhen Tupikov [aut], Lin Lin [aut], Lixiang Zhang [aut], Jia Li [aut, cre]
Initial release
2019-04-05

We don't support your browser anymore

Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.