2.5 Watching an evaluation set

Racket

2.5 Watching an evaluation set🔗ℹ

Rather than letting train run the whole boosting loop, you can drive it one round at a time and inspect a held-out metric after each — the basis for logging, early stopping, or custom schedules. This example trains a regressor while watching RMSE on both the training and an evaluation split.

The two pieces are booster-update-one-iter! (do one boosting round) and eval-one-iter (return XGBoost’s metric line, which parse-eval-line turns into a hash).

<r05-require> ::=

(require ffi/vector
xgboost)

<r05-provide> ::=

(provide run-example)

The data. Two non-overlapping splits of a y ≈ 2·x₀ + x₁ − x₂ dataset — eight training rows, four evaluation rows:

<r05-data> ::=

(define dtrain
  (make-dmatrix (f32vector 1.0 2.0 0.5   2.0 1.0 1.5   3.0 0.5 0.0
                           0.5 3.0 2.0   4.0 2.0 1.0   1.5 1.5 0.5
                           2.5 3.5 1.5   0.0 1.0 0.0)
                #:nrow 8 #:ncol 3
                #:labels (f32vector 3.5 3.5 6.5 2.0 9.0 4.0 7.0 1.0)))
(define deval
  (make-dmatrix (f32vector 2.0 0.5 0.5   1.0 1.0 1.0
                           3.5 1.0 0.5   0.5 0.5 0.5)
                #:nrow 4 #:ncol 3
                #:labels (f32vector 4.0 2.0 7.5 1.0)))

Set up the booster. Training with #:rounds 0 and an #:evals list builds the booster and binds both matrices into its cache (so the GC keeps them alive) without doing any boosting yet:

<r05-setup> ::=

(define booster
  (train dtrain
         #:evals (list (cons "eval" deval))
         #:objective "reg:squarederror"
         #:max-depth 3
         #:eta 0.1
         #:verbosity 0
         #:rounds 0))

The loop. Each round, advance the booster and record the parsed metrics for both watched matrices. run-example returns the booster and the per-round history:

<r05-loop> ::=

(define eval-set (list (cons "train" dtrain) (cons "eval" deval)))
(define history
  (for/list ([iter (in-range 30)])
    (booster-update-one-iter! booster iter dtrain)
    (parse-eval-line (eval-one-iter booster iter eval-set))))

The harness "test/05-train-with-eval.rkt" prints the per-round table and the final metrics, and asserts the evaluation RMSE falls over training:

; iter train-rmse eval-rmse
; 0 3.8019 3.6960
; 29 0.0530 0.3327

<r05-run> ::=

(define (run-example)
  <r05-data>
  <r05-setup>
  <r05-loop>
  (values booster history))

<*> ::=

<r05-require>
<r05-provide>
<r05-run>

2.1	Building a DMatrix
2.2	Training a regressor
2.3	Binary classification
2.4	Multiclass classification
2.5	Watching an evaluation set
2.6	Iris: a full classification pipeline
2.7	Get Started
2.8	Robust regression
2.9	Quantile regression
2.10	Poisson count regression
2.11	Survival analysis (AFT)
2.12	Custom objective
2.13	Saving and loading models
2.14	Booster snapshots
2.15	DMatrix constructors
2.16	DMatrix metadata
2.17	Slicing and binary serialization
2.18	Quantile cuts
2.19	The high-level API end to end
2.20	Booster lifecycle and config
2.21	Booster attributes
2.22	Model dumps and feature importance
2.23	In-place prediction (dense)
2.24	In-place prediction (CSR)
2.25	In-place prediction (columnar)
2.26	Parameter recipes
2.27	Learning to rank
2.28	Global and process APIs
2.29	CUDA regression
2.30	CUDA classification