CRAN/E | vtreat

vtreat

A Statistically Sound 'data.frame' Processor/Conditioner

Installation

About

A 'data.frame' processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. 'vtreat' prepares variables so that data has fewer exceptional cases, making it easier to safely use models in production. Common problems 'vtreat' defends against: 'Inf', 'NA', too many categorical levels, rare categorical levels, and new categorical levels (levels seen during application, but not during training). Reference: "'vtreat': a data.frame Processor for Predictive Modeling", Zumel, Mount, 2016, doi:10.5281/zenodo.1173313.

github.com/WinVector/vtreat/
winvector.github.io/vtreat/
Bug report File report

Key Metrics

Version 1.6.4
R ≥ 3.4.0
Published 2023-08-19 253 days ago
Needs compilation? no
License GPL-2
License GPL-3
CRAN checks vtreat results

Downloads

Yesterday 87 +26%
Last 7 days 612 +9%
Last 30 days 2.202 -6%
Last 90 days 7.277 -32%
Last 365 days 31.493 -14%

Maintainer

Maintainer

John Mount

jmount@win-vector.com

Authors

John Mount

aut / cre

Nina Zumel

aut

Win-Vector LLC

cph

Material

README
NEWS
Reference manual
Package source

Vignettes

Multi Class vtreat
Saving Treatment Plans
vtreat Variable Importance
vtreat package
vtreat cross frames
vtreat grouping example
vtreat overfit
vtreat Rare Levels
vtreat scale mode
vtreat significance
vtreat data splitting
Variable Types
vtreat Formal Article

macOS

r-release

arm64

r-oldrel

arm64

r-release

x86_64

r-oldrel

x86_64

Windows

r-devel

x86_64

r-release

x86_64

r-oldrel

x86_64

Old Sources

vtreat archive

Depends

R ≥ 3.4.0
wrapr ≥ 2.0.9

Imports

stats
digest

Suggests

rquery ≥ 1.4.9
rqdatatable ≥ 1.3.2
data.table ≥1.12.2
isotone
lme4
knitr
rmarkdown
parallel
DBI
RSQLite
datasets
R.rsp
tinytest

Reverse Imports

crispRdesignR

Reverse Suggests

mlr3pipelines