course_model

Modeling to predict the future

Our prior course focused on visualizations as a means to understand. This course builds on that concept, but mainly focuses on creating models. A model is a simplified version of the real-world, expressed in mathematical terms (i.e., y = x + 1).

Outcomes

Links

Key Terms

Approaches to Model Building

We have two different approaches to building a model:

Statistical modeling aims to understand relationships and test hypotheses, while machine learning aims to make accurate predictions regardless of interpretability.

After we create a model, we want to evaluate its performance:

EDA Process

The EDA process (exploratory data analysis) is similar for both statistical modeling and machine learning. This is covered in our prior class, see EDA Chapter. If you did not take the class with me, please go back and review the data structure chapter.

We follow these steps:

Once the data is clean, we can build a model:

Finally, we communicate our results: