Course Notes for Data Modeling in Python
Nathan Garrett, PhD CPA
Department of Accounting and Information Systems
West Virginia University
This course introduces students to data analysis and modeling using Python programming language. Students will learn how to clean, analyze, visualize, and model data using popular Python libraries such as Pandas, NumPy, Matplotlib, Seaborn, and Scikit-learn. The course covers essential concepts in data science, including exploratory data analysis, statistical inference, regression analysis, and machine learning techniques.
Each week covers Python and Machine Learning concepts. Most modules have a deliverable that is due Monday night at 11:59pm.
Schedule
Week 1 - January 12, 2026
- py00: Set up your computer
- ml01: Introduction to modeling (with gems dataset)
Week 2 - January 19, 2026
- DataCamp: Introduction to Python
- py01: Python basic syntax
- py02: Python data structures
- ml02: Data and distributions
Week 3 - January 26, 2026
- DataCamp: Data manipulation with pandas
- py03: Pandas 1: load data from a file, filtering
- ml03: Predicting categories
Week 4 - February 2, 2026
- py04: Pandas 2: new columns with mutation
- ml04: Predicting numbers
Week 5 - February 9, 2026
Exam 1
Week 6 - February 16, 2026
- Datacamp: Regression in Python
- ml05: Classical Regression with OLS
Week 7 - February 23, 2026
- ml06: Regression 2, Logistic regression and data transformations
Week 8 - March 2, 2026
- DataCamp: Machine learning with Tree-based models in Python
- py06-pandas-group-join/index.md: Pandas 3: aggregation, joins
- ml07: Decision Trees
Exam 2
Week 9 - March 9, 2026
- DataCamp: Unsupervised learning in Python
- ml08: K-Means Clustering
- py07-pandas-reshaping/index.md: Pandas 4: reshaping
Week 10 - March 16, 2026
Spring Break - No activities
Week 11 - March 23, 2026
Week 12 - March 30, 2026
- Datacamp: Intermediate Python
- py09: Control flow
- py10: Functions
Exam 3
Week 13 - April 6, 2026
Datacamp: NLP in Python
- ml10-nlp: NLP with Python
- Optional readings:
Week 14 - April 13, 2026
- ml11-nn/index.md: Neural Networks with Python
Week 15 - April 20, 2026
Project week - No new material
Week 16 - April 27, 2026
Project, Exam 4
Week 17 - May 4, 2026
Final Exam Week, May 4-8
Further Reading