vtreat

A Statistically Sound 'data.frame' Processor/Conditioner

A 'data.frame' processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. 'vtreat' prepares variables so that data has fewer exceptional cases, making it easier to safely use models in production. Common problems 'vtreat' defends against: 'Inf', 'NA', too many categorical levels, rare categorical levels, and new categorical levels (levels seen during application, but not during training). Reference: "'vtreat': a data.frame Processor for Predictive Modeling", 'Zumel', 'Mount', 2016, DOI:10.5281/zenodo.1173314.

Total

20,726

Last month

746

Last week

147

Average per day

25

Daily downloads

Total downloads

Description file content

Package
vtreat
Type
Package
Title
A Statistically Sound 'data.frame' Processor/Conditioner
Version
1.0.4
Date
2018-05-05
URL
BugReports
https://github.com/WinVector/vtreat/issues
Maintainer
John Mount
Description
A 'data.frame' processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. 'vtreat' prepares variables so that data has fewer exceptional cases, making it easier to safely use models in production. Common problems 'vtreat' defends against: 'Inf', 'NA', too many categorical levels, rare categorical levels, and new categorical levels (levels seen during application, but not during training). Reference: "'vtreat': a data.frame Processor for Predictive Modeling", 'Zumel', 'Mount', 2016, DOI:10.5281/zenodo.1173314.
License
GPL-3
Depends
R (>= 3.0.0)
Imports
stats
Suggests
testthat, knitr, parallel, rmarkdown, data.table, dplyr, ggplot2, DBI, RSQLite, datasets
LazyData
true
VignetteBuilder
knitr
RoxygenNote
6.0.1
ByteCompile
true
NeedsCompilation
no
Packaged
2018-05-05 17:30:11 UTC; johnmount
Author
John Mount [aut, cre], Nina Zumel [aut], Win-Vector LLC [cph]
Repository
CRAN
Date/Publication
2018-05-05 17:42:53 UTC

install.packages('vtreat')

1.0.4

a month ago

https://github.com/WinVector/vtreat/

John Mount

GPL-3

Depends on

R (>= 3.0.0)

Imports

stats

Suggests

testthat, knitr, parallel, rmarkdown, data.table, dplyr, ggplot2, DBI, RSQLite, datasets

Discussions