277. Validation

Tests for confirming that a system-dynamics model represents reality usefully. Sterman (2000, Ch. 21) gives a canonical 12-test battery, divided into structure-oriented and behavior-oriented tests.

277.1. Structure-oriented tests (does the model capture what it should?)

277.2. Behavior-oriented tests (does the model produce realistic output?)

277.3. Quantitative fit: Theil’s U decomposition

For predicted vs actual :

Decomposes MSE into three components:

. Ideal model: and small, large — model captures direction even if magnitudes slightly off.

277.4. Behavior modes matter more than point fits

For SD models, qualitative behavior (does it overshoot? oscillate? saturate?) is usually more valuable than precise fit. A model that predicts the right pattern of bullwhip with the wrong amplitude is better than one with right amplitude but no pattern.

Sterman emphasizes: don’t fixate on RMSE; check that the behavior modes match.

277.5. Calibration methods

For data-fitting:

For decision-rule parameters (e.g., beer-game ): typically estimated by least-squares from gameplay data.

277.6. Common pitfalls

277.7. Sensitivity analysis types

277.8. See also