Why develop a workflow?
Modules 1–5 introduced the major families of regression models—linear models, generalized linear models, mixed models, GEEs, splines, and GAMs. Each module focused on one modeling framework at a time. But real statistical analysis does not proceed one module at a time.
In real applications, analysts begin with a scientific question, explore the data, build an initial model, diagnose its limitations, and then iteratively refine the model until they reach a defensible, interpretable representation of the data-generating process.
This requires a workflow—a structured sequence of decisions that guides model building, model checking, and model improvement.
A workflow answers the following questions:
What we are trying to estimate?
Why we chose a particular modeling strategy?
How we checked whether that strategy was reasonable?
How we refined the model when diagnostics suggested problems?
How we visualized and communicated the final result?
The best guiding principle is simple but powerful:
Start with the simplest plausible model and build complexity only as needed.