Assignment 2b

Download as xlsx, pdf, or txt
Download as xlsx, pdf, or txt
You are on page 1of 10

Average

Week Sales Advertising Temp


1 669 100 30
This dataset represents 13 weeks of sales (in 00
2 706 120 27 each of those thirteen weeks is captured. Furth
3 687 105 32 heat index, the average temperature (per week
4 699 90 30 in the class as we study forecasting, it is a good
visualize the data to make meaningful assumpti
5 759 140 32 Module 2.
6 674 100 29
7 677 105 28
8 709 110 32 1. Let us start with the "Sales" data. In the tab
9 684 90 34 * Create a histogram of the sales data using ex
10 678 85 31 * Create a simple stem and leaf plot from the s
11 686 90 32 * Finally, create a box and whisker plot for the
12 714 100 34 * Make sure that you properly explain the take
13 718 110 33
2. Now let us look at the way sales data behave
Mean (AVERAGE) 696.92 103.46 31.08 * Create a simple time plot of sales data by we
Median (MEDIAN) 687 100 32 * Now create another time plot to chart both s
Maximum (MAX) 759 140 34 * Make sure that you can properly explain the
Minimum (MIN) 669 85 27
Range (MAX-MIN) 90 55 7 3. Logically, weeks with higher temperature wi
Skewness (SKEW) 1.34 1.23 -0.46 * Create a scatter plot between sales and avera
Variance (VAR.P)
Standard Deviation To be discussed in detail when studying * Are there any outliers? Explain.
Coefficient of(STDEV.P)
Variation Learning Module 3 * Check for association of the scatter (form, dir
(STDEV.P/AVERAGE) * Draw the line of best fit and provide the equa

4. Complete the calculations for the six summa

5. DO NOT FORGET EXCEL ETIQUETTE IN YOUR


nts 13 weeks of sales (in 000s of $) for a suntan lotion. Advertising dollars spent on that product line during
en weeks is captured. Furthermore, since the usage of suntan lotion seems to be highly correlated to the
age temperature (per week, in degree celcius) is also captured. While we will use this dataset in detail later
udy forecasting, it is a good idea to start looking at the summary statistics from this dataset and ways to
make meaningful assumptions. Please complete each subsequent part exactly as discussed in Learning

he "Sales" data. In the tab labeled as "Sales Data Visualization", complete the following:
of the sales data using excel (not using the Data Analytics add-in) as discussed in Learning Module 2; create a frequncy table and explain
m and leaf plot from the sales data
x and whisker plot for the sales data; are there any outliers?
u properly explain the takeaways from the histogram as well as the box and whsiker plot in the context of the problem.

the way sales data behave over time (in the tab labeled as "Time Plots"):
me plot of sales data by week.
er time plot to chart both sales data and advertising dollars as a function of time. Use advertising dollars on the secondary axis
u can properly explain the takeaways from the time plots in the context of the problem.

ith higher temperature will induce more usage (sales) of suntan lotion, let us look at what the data says (tab labeled as "Scatter Plots"):
ot between sales and average weekly temperature. Ensure you understand which variable goes as "x" and which one goes as "y".
ers? Explain.
on of the scatter (form, direction, strength) and provide an analytical representation (correlation)
est fit and provide the equation and R-squared value on the chart; explain what the numbers mean in the context of the problem.

ulations for the six summary measures as discussed in Learning Module 1. Comment on the skewness of each column (sales, ad, avg. temp

EXCEL ETIQUETTE IN YOUR WORK.


a frequncy table and explain

e problem.

he secondary axis

labeled as "Scatter Plots"):


hich one goes as "y".

ntext of the problem.

h column (sales, ad, avg. temp.)


Sales Minimum: 669
669 Maximum: 759
706 Range: 90
687 # of Bins: 5
699 Bin Width: 18
759
674
677 Bins Frequency Rel. Freq. Cum. Freq.
709 687 7 54% 54%
684 705 1 8% 62%
678 723 4 31% 92%
686 741 0 0% 92%
714 759 1 8% 100%
718 13

Based on the dataset, there is a 8% chance that sales lie between 687 -705.

Based on the dataset, ther is a 62% chance that a sale will get 705 or below.

Sales Frequency Histogram


Frequency

8
7
7
6
5
4
4
3
2
1 1
1
0
0
1 2 3 4 5
Sales Bins
Sales_Array Stem Leaf
669 66 9
674 67 4 7 8
677 68 4 6 7
678 69 9
684 70 6 9
686 71 4 8
687 72
699 73
706 74
709 75 9
714
718 75 9 means 759 on the dataset
759

Sales This chart isn't available in your version of Excel.


669
706 Editing this shape or saving this workbook into a different Largest=759
687 file format will permanently break the chart.
699
759 Q3=third quartile=711.5
674
677 Average=697
709 Median=687
684
678 Q1=first quartile=677.5
686
714
718 Smallest=669

Interquartile range(IQR): Q3-Q1=711.5-677.5=34


1.5*IQR=34*1.5=51
The largest point 759, which is 47.5 above the third quartile and lower than 1.5 *IQR of 51.
Thus, there is not suspected outlier.
Largest=759

Q3=third quartile=711.5

Average=697
Median=687

Q1=first quartile=677.5

Smallest=669

than 1.5 *IQR of 51.


Sales ( In thousands)
Week Sales Advertising
1 669 100 Sales Over Time
2 706 120 780
3 687 105 760
4 699 90 740
5 759 140 720
6 674 100
700
7 677 105
680
8 709 110
660
9 684 90
640
10 678 85
620
11 686 90 0 2 4 6 8 10 12 14
12 714 100
Weeks
13 718 110

The trend is arise that persists over time, despite small irregularities.
Sales and Advertising Cost Over Time

Advertising Cost
780 160
760 140
740 120
Sales ( in thousands)

720 100
700 80
680 60
660 40
640 20
620 0
1 2 3 4 5 6 7 8 9 10 11 12 13
Weeks
Sales Advertising

all irregularities. The patten over time for the advertising cost closely resembles that for the sales,
which shows advertising cost is about 12%-18% of sales.
Average
Temp Sales Does Average Temperature Impact Sales?
27 706 780
28 677 760
29 674
740
30 669
30 699 720

Sales
31 678 700 f(x) = 3.56756756756757 x + 586.054054054054
R² = 0.099697029516019
32 686 680
32 687 660
32 709
640
32 759
620
33 718 26 27 28 29 30 31 32 33 34 35
34 684 Temperature
34 714

The points of sale of 706 at temperate 27 and sale of 759 at temperate 32


are deviate from the highly linear patten. Thus, there are outliers.
Characteristics of Association:

Correlation: 0.32

R2 means that the change in the temperature


only explain 10% of the change in the sales.

You might also like