mlr3resampling

Resampling Algorithms for 'mlr3' Framework

About

A supervised learning algorithm takes a train set as input and outputs a prediction function, which can then be used on a test set. If each data point belongs to a group (such as a geographic region or a year), how do we know whether it is possible to train on one group and predict accurately on another? Cross-validation can measure the extent to which this is possible. First, assign fold IDs from 1 to K to all data (possibly using stratification, usually by group and label). Then loop over test sets (group/fold combinations) and train sets (same group, other groups, all groups), and compute test/prediction accuracy for each combination. Comparing test accuracy between same and other shows how well training transfers across groups: transfer is perfect if same and other have similar test accuracy for each group; other is usually somewhat less accurate than same; and other can be just as bad as a featureless baseline when the groups have different patterns. For more information, see the vignettes listed under Vignettes.

How many train samples are required to get accurate predictions on a test set? Cross-validation can also answer this question, using train sets of variable size.
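
The two cross-validation schemes described above can be sketched in a few lines. The block below is a language-agnostic illustration in plain Python, not the mlr3resampling API: the function names (`assign_folds`, `same_other_splits`, `variable_size_train_sets`) are hypothetical, stratification is by group only, and two details are assumptions: the test fold is excluded from every train set, and variable-size train sets are nested prefixes of one shuffled order.

```python
import random
from collections import defaultdict


def assign_folds(groups, n_folds, seed=0):
    """Assign fold IDs 1..n_folds to every row, stratified by group."""
    rng = random.Random(seed)
    rows_by_group = defaultdict(list)
    for row, group in enumerate(groups):
        rows_by_group[group].append(row)
    folds = [None] * len(groups)
    for rows in rows_by_group.values():
        rng.shuffle(rows)
        # Deal the shuffled rows of this group round-robin into folds.
        for position, row in enumerate(rows):
            folds[row] = position % n_folds + 1
    return folds


def same_other_splits(groups, folds):
    """Yield (test_group, test_fold, train_kind, train_rows, test_rows)."""
    rows = range(len(groups))
    for test_group in sorted(set(groups)):
        for test_fold in sorted(set(folds)):
            test_rows = [
                i for i in rows
                if groups[i] == test_group and folds[i] == test_fold
            ]
            # Assumption: rows in the test fold are never used for training.
            candidates = [i for i in rows if folds[i] != test_fold]
            train_sets = {
                "same": [i for i in candidates if groups[i] == test_group],
                "other": [i for i in candidates if groups[i] != test_group],
                "all": candidates,
            }
            for train_kind, train_rows in train_sets.items():
                yield test_group, test_fold, train_kind, train_rows, test_rows


def variable_size_train_sets(folds, test_fold, sizes, seed=0):
    """Map each feasible train size to a nested subset of non-test rows."""
    rng = random.Random(seed)
    pool = [i for i, fold in enumerate(folds) if fold != test_fold]
    rng.shuffle(pool)
    # Assumption: smaller train sets are prefixes of the same shuffled order.
    return {n: sorted(pool[:n]) for n in sizes if n <= len(pool)}


if __name__ == "__main__":
    groups = ["A"] * 6 + ["B"] * 6
    folds = assign_folds(groups, n_folds=3)
    for split in same_other_splits(groups, folds):
        test_group, test_fold, train_kind, train_rows, test_rows = split
        print(test_group, test_fold, train_kind, len(train_rows), len(test_rows))
    print(variable_size_train_sets(folds, test_fold=1, sizes=[2, 4, 8]))
```

To complete the comparison, one would fit a learner on each `train_rows`, score it on the matching `test_rows`, and then compare the scores for same, other, and all within each test group and fold.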

github.com/tdhock/mlr3resampling
Bug report File report

Key Metrics

Version 2024.1.23
Published 2024-02-01 (88 days ago)
Needs compilation? no
License GPL-3
CRAN checks mlr3resampling results

Downloads

Yesterday 8 0%
Last 7 days 107 -14%
Last 30 days 231 +381%
Last 90 days 621 +9%
Last 365 days 1,170

Maintainer

Toby Hocking

toby.hocking@r-project.org

Authors

Toby Hocking (aut / cre)
Michel Lang (ctb; author of mlr3 when Resampling/ResamplingCV was copied/modified)
Bernd Bischl (ctb; author of mlr3 when Resampling/ResamplingCV was copied/modified)
Jakob Richter (ctb; author of mlr3 when Resampling/ResamplingCV was copied/modified)
Patrick Schratz (ctb; author of mlr3 when Resampling/ResamplingCV was copied/modified)
Giuseppe Casalicchio (ctb; author of mlr3 when Resampling/ResamplingCV was copied/modified)
Stefan Coors (ctb; author of mlr3 when Resampling/ResamplingCV was copied/modified)
Quay Au (ctb; author of mlr3 when Resampling/ResamplingCV was copied/modified)
Martin Binder (ctb)
Florian Pfisterer (ctb; author of mlr3 when Resampling/ResamplingCV was copied/modified)
Raphael Sonabend (ctb; author of mlr3 when Resampling/ResamplingCV was copied/modified)
Lennart Schneider (ctb; author of mlr3 when Resampling/ResamplingCV was copied/modified)
Marc Becker (ctb; author of mlr3 when Resampling/ResamplingCV was copied/modified)
Sebastian Fischer (ctb; author of mlr3 when Resampling/ResamplingCV was copied/modified)

Material

NEWS
Reference manual
Package source

Vignettes

Comparing training on same or other groups
Comparing train set sizes

macOS

r-release arm64
r-oldrel arm64
r-release x86_64

Windows

r-devel x86_64
r-release x86_64
r-oldrel x86_64

Old Sources

mlr3resampling archive

Imports

data.table
R6
checkmate
paradox
mlr3
mlr3misc

Suggests

animint2
lgr
future
testthat
knitr
rmarkdown
nc
rpart