Add New Points to an Existing Embedding
umap_transform( X = NULL, model = NULL, nn_method = NULL, init_weighted = TRUE, search_k = NULL, tmpdir = tempdir(), n_epochs = NULL, n_threads = NULL, n_sgd_threads = 0, grain_size = 1, verbose = FALSE, init = "weighted" )
X |
The new data to be transformed, either a matrix of data frame. Must
have the same columns in the same order as the input data used to generate
the |
model |
Data associated with an existing embedding. |
nn_method |
Optional pre-calculated nearest neighbor data. It must be a list consisting of two elements:
Multiple nearest neighbor data (e.g. from two different pre-calculated
metrics) can be passed by passing a list containing the nearest neighbor
data lists as items.
The |
init_weighted |
If |
search_k |
Number of nodes to search during the neighbor retrieval. The
larger k, the more the accurate results, but the longer the search takes.
Default is the value used in building the |
tmpdir |
Temporary directory to store nearest neighbor indexes during
nearest neighbor search. Default is |
n_epochs |
Number of epochs to use during the optimization of the
embedded coordinates. A value between |
n_threads |
Number of threads to use, (except during stochastic gradient descent). Default is half the number of concurrent threads supported by the system. |
n_sgd_threads |
Number of threads to use during stochastic gradient descent. If set to > 1, then results will not be reproducible, even if 'set.seed' is called with a fixed seed before running. |
grain_size |
Minimum batch size for multithreading. If the number of
items to process in a thread falls below this number, then no threads will
be used. Used in conjunction with |
verbose |
If |
init |
how to initialize the transformed coordinates. One of:
This parameter should be used in preference to |
Note that some settings are incompatible with the production of a UMAP model
via umap
: external neighbor data (passed via a list to the
argument of the nn_method
parameter), and factor columns that were
included in the UMAP calculation via the metric
parameter. In the
latter case, the model produced is based only on the numeric data.
A transformation is possible, but factor columns in the new data are ignored.
A matrix of coordinates for X
transformed into the space
of the model
.
iris_train <- iris[1:100, ] iris_test <- iris[101:150, ] # You must set ret_model = TRUE to return extra data needed iris_train_umap <- umap(iris_train, ret_model = TRUE) iris_test_umap <- umap_transform(iris_test, iris_train_umap)
Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.