mlr3fselect: FSelector – R documentation

Become an expert in R — Interactive courses, Cheat Sheets, certificates and more!

FSelector

Description

Abstract FSelector class that implements the base functionality each fselector must provide. A FSelector object describes the feature selection strategy, i.e. how to optimize the black-box function and its feasible set defined by the FSelectInstanceSingleCrit / FSelectInstanceMultiCrit object.

A fselector must write its result into the FSelectInstanceSingleCrit / FSelectInstanceMultiCrit using the assign_result method of the bbotk::OptimInstance at the end of its selection in order to store the best selected feature subset and its estimated performance vector.

Private Methods

.optimize(instance) -> NULL
Abstract base method. Implement to specify feature selection of your subclass. See technical details sections.
.assign_result(instance) -> NULL
Abstract base method. Implement to specify how the final feature subset is selected. See technical details sections.

Technical Details and Subclasses

A subclass is implemented in the following way:

Inherit from FSelector.
Specify the private abstract method $.optimize() and use it to call into your optimizer.
You need to call instance$eval_batch() to evaluate feature subsets.
The batch evaluation is requested at the FSelectInstanceSingleCrit / FSelectInstanceMultiCrit object instance, so each batch is possibly executed in parallel via mlr3::benchmark(), and all evaluations are stored inside of instance$archive.
Before the batch evaluation, the bbotk::Terminator is checked, and if it is positive, an exception of class "terminated_error" is generated. In the later case the current batch of evaluations is still stored in instance, but the numeric scores are not sent back to the handling optimizer as it has lost execution control.
After such an exception was caught we select the best feature subset from instance$archive and return it.
Note that therefore more points than specified by the bbotk::Terminator may be evaluated, as the Terminator is only checked before a batch evaluation, and not in-between evaluation in a batch. How many more depends on the setting of the batch size.
Overwrite the private super-method .assign_result() if you want to decide yourself how to estimate the final feature subset in the instance and its estimated performance. The default behavior is: We pick the best resample-experiment, regarding the given measure, then assign its feature subset and aggregated performance to the instance.

Public fields

param_set: (paradox::ParamSet).
param_classes: (character()).
properties: (character()).
packages: (character()).

Methods

Method `new()`

Creates a new instance of this R6 class.

Usage

FSelector$new(param_set, properties, packages = character(0))

Arguments

param_set: paradox::ParamSet
Set of control parameters for fselector.
properties: (character())
Set of properties of the fselector. Must be a subset of mlr_reflections$fselect_properties.
packages: (character())
Set of required packages. Note that these packages will be loaded via requireNamespace(), and are not attached.

Method `format()`

Helper for print outputs.

Usage

FSelector$format()

Returns

(character()).

Method `print()`

Print method.

Usage

FSelector$print()

Returns

(character()).

Method `optimize()`

Performs the feature selection on a FSelectInstanceSingleCrit or FSelectInstanceMultiCrit until termination. The single evaluations will be written into the ArchiveFSelect that resides in the FSelectInstanceSingleCrit / FSelectInstanceMultiCrit. The result will be written into the instance object.

Usage

FSelector$optimize(inst)

Arguments

inst: (FSelectInstanceSingleCrit|FSelectInstanceMultiCrit).

Returns

data.table::data.table.

Method `clone()`

The objects of this class are cloneable with this method.

Usage

FSelector$clone(deep = FALSE)

Arguments

deep: Whether to make a deep clone.

Examples

library(mlr3)

terminator = trm("evals", n_evals = 3)

instance = FSelectInstanceSingleCrit$new(
  task = tsk("iris"),
  learner = lrn("classif.rpart"),
  resampling = rsmp("holdout"),
  measure = msr("classif.ce"),
  terminator = terminator
)

# swap this line to use a different FSelector
fselector = fs("random_search")

# modifies the instance by reference
fselector$optimize(instance)

# returns best feature subset and best performance
instance$result

# allows access of data.table / benchmark result of full path of all evaluations
instance$archive

mlr3fselect

Feature Selection for 'mlr3'

v0.5.1

LGPL-3

Authors

Marc Becker [aut, cre] (<https://orcid.org/0000-0002-8115-0400>), Patrick Schratz [aut] (<https://orcid.org/0000-0003-0748-6624>), Michel Lang [aut] (<https://orcid.org/0000-0001-9754-0393>), Bernd Bischl [aut] (<https://orcid.org/0000-0001-6002-6980>)

Initial release

FSelector

Description

Private Methods

Technical Details and Subclasses

Public fields

Methods

Public methods

Method new()

Usage

Arguments

Method format()

Usage

Returns

Method print()

Usage

Returns

Method optimize()

Usage

Arguments

Returns

Method clone()

Usage

Arguments

Examples

mlr3fselect

We don't support your browser anymore

Method `new()`

Method `format()`

Method `print()`

Method `optimize()`

Method `clone()`