We will have a standard format for our EDA reports in this class. Refer to the sections below.
The first section should have your title and introduction paragraph.
# Title of Analysis
*Created by Nathan Garrett. Updated 1/1/11*
## 1. Introduction
Purpose of analysis. Data being analyzed. Main question of the analysis. Major finding.
Created by Nathan Garrett. Updated 1/1/11
This analysis uses holiday toy sales to test for the existence of Santa Claus. Its data includes holiday toy sales from Amazon.com and Target. If Santa exists, we would expect that most of holiday gifts would be from the North Pole. However, we see a significant increase in parent gift purchases, which is consistent with the Grinch hypothesis.
The second section should describe your data.
## 2. Data Description
Describe the data and its source. Summarize the number of observations, time period covered, and key variables.
Amazon and Target provided three million order records from the 2020-2021 holiday season. These included row-level details of each sale, purchaser, and item. Three-thousand rows related to Hanukkah were excluded.
Example:
“A linear regression model was employed to analyze the relationship
between advertising spend and sales revenue. Data was cleaned to remove
outliers and fill missing values using median imputation.”
Example:
- “Sales of Product X increased by 20% in February compared to January.”
- “There is a significant positive correlation (r = 0.85) between
advertising spend and sales revenue.”
Example:
“The analysis suggests that increased advertising spend is strongly
associated with higher sales revenue. It is recommended to maintain or
increase the advertising budget in the upcoming quarter to sustain this
growth trend.”
Example:
“This analysis did not account for external factors such as market
competition or economic conditions, which could influence sales
performance. Future research could incorporate these factors to enhance
the model’s accuracy.”
Example:
- “Company Sales Database, Q1 2024.” - “R Programming Language for
Statistical Computing.”