
Feature idea - provide custom validation sets for early stopping #48

@dfsnow

Thanks for creating this excellent package. I created a similar fork of treesnip but am planning to replace it with {bonsai} in all our production models.

One feature that I think would be incredibly useful in {bonsai} is the ability to provide custom validation sets during early stopping (instead of using a random split of the training data). This would have a few potential benefits:

  1. More training data. In many cases, you already have a validation set set aside from a classic train/validate/test split. Currently, {bonsai} further splits the training data into a training subset and a validation set used only for early stopping. Instead, it would be ideal to pass the existing validation set directly, so that all of the training data is actually used for training (see the sketch after this list).
  2. Ability to do more complex cross-validation. Certain cross-validation techniques (rolling origin, spatial, etc.) don't rely on a random sample of the training data and instead use some sort of partitioning (time or geographic). Allowing custom validation data would let users use the "correct" validation set for early stopping when using these more complex methods.
  3. Better integration with tidymodels. tidymodels supports k-fold and other types of cross-validation. Using the assessment set already created for each fold, rather than splitting off yet another validation set just for early stopping, would be much simpler.
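To make this concrete, here's a rough sketch of what the interface could look like. The `validation` and `stop_iter` arguments below are part of bonsai's existing lightgbm engine; `validation_data` is just a placeholder name for the proposed argument, not something that exists today.

```r
library(parsnip)
library(bonsai)
library(rsample)

# Example data and a classic train/validate/test split
data(ames, package = "modeldata")
splits   <- initial_validation_split(ames)
train_df <- training(splits)
val_df   <- validation(splits)

# Current behavior: bonsai carves a random proportion out of `train_df`
# (here 20%) to monitor performance for early stopping.
spec_current <- boost_tree(trees = 500, stop_iter = 10) |>
  set_engine("lightgbm", validation = 0.2) |>
  set_mode("regression")

fit_current <- fit(spec_current, Sale_Price ~ ., data = train_df)

# Proposed behavior (hypothetical argument name): hand the existing
# validation set to the engine, so all of `train_df` is used for training.
# spec_proposed <- boost_tree(trees = 500, stop_iter = 10) |>
#   set_engine("lightgbm", validation_data = val_df) |>
#   set_mode("regression")
#
# fit_proposed <- fit(spec_proposed, Sale_Price ~ ., data = train_df)
```

The exact argument name, and whether it should live in `set_engine()` or somewhere else, is of course up for discussion.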

Let me know if this is out of scope for this project. If not, I'm happy to contribute if needed.

Metadata

Labels: feature (a feature request or enhancement)
