Dataset of Mall Customers

CustomerID Age Annual Income (k$) Spending Score (1-100)


1 19 15 39
2 21 15 81
3 20 16 6
4 23 16 77
5 31 17 40
6 22 17 76
7 35 18 6
8 23 18 94
9 64 19 3
10 30 19 72
11 67 19 14
12 35 19 99
13 58 20 15
14 24 20 77
15 37 20 13
16 22 20 79
17 35 21 35
18 20 21 66
19 52 23 29
20 35 23 98
21 35 24 35
22 25 24 73
23 46 25 5
24 31 25 73
25 54 28 14
26 29 28 82
27 45 28 32
28 35 28 61
29 40 29 31
30 23 29 87
31 60 30 4
32 21 30 73
33 53 33 4
34 18 33 92
35 49 33 14
36 21 33 81
37 42 34 17
38 30 34 73
39 36 37 26
40 20 37 75
41 65 38 35
42 24 38 92
43 48 39 36
44 31 39 61
45 49 39 28
46 24 39 65
47 50 40 55
48 27 40 47
49 29 40 42
50 31 40 42
51 49 42 52
52 33 42 60
53 31 43 54
54 59 43 60
55 50 43 45
56 47 43 41
57 51 44 50
58 69 44 46
59 27 46 51
60 53 46 46
61 70 46 56
62 19 46 55
63 67 47 52
64 54 47 59
65 63 48 51
66 18 48 59
67 43 48 50
68 68 48 48
69 19 48 59
70 32 48 47
71 70 49 55
72 47 49 42
73 60 50 49
74 60 50 56
75 59 54 47
76 26 54 54
77 45 54 53
78 40 54 48
79 23 54 52
80 49 54 42
81 57 54 51
82 38 54 55
83 67 54 41
84 46 54 44
85 21 54 57
86 48 54 46
87 55 57 58
88 22 57 55
89 34 58 60
90 50 58 46
91 68 59 55
92 18 59 41
93 48 60 49
94 40 60 40
95 32 60 42
96 24 60 52
97 47 60 47
98 27 60 50
99 48 61 42
100 20 61 49
Variance of Annual Income
201.986262626263

Statistic            Age            Annual Income (k$)   Spending Score (1-100)
Geometric Mean       36.65995183    36.631302527         42.3865179
Mean                 39.75          39.56                49.93
Standard Error       1.562656053    1.4212187116         2.16562936
Median               36.5           41                   50
Mode                 35             54                   55
Standard Deviation   15.62656053    14.212187116         21.6562936
Sample Variance      244.1893939    201.98626263         468.995051
Kurtosis             -1.0902607     -1.228128111         0.13996282
Skewness             0.322613414    -0.201484686         -0.11148192
Range                52             46                   96
Minimum              18             15                   3
Maximum              70             61                   99
Sum                  3975           3956                 4993
Count                100            100                  100

Quartiles Output
1 24.75
2 37
3 50.75
4 70
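
The Annual Income (k$) column of the summary above can be re-derived with a few lines of code. The following is a minimal Python sketch, not the workbook's own method: it assumes the 100 income values from the dataset, encoded compactly as value: frequency pairs in a dictionary named `INCOME_COUNTS` (a name introduced here for illustration), and uses only the standard `statistics` module.

```python
import math
import statistics

# Annual Income (k$) values from the dataset, stored as value: frequency.
# INCOME_COUNTS is an illustrative name, not part of the workbook.
INCOME_COUNTS = {
    15: 2, 16: 2, 17: 2, 18: 2, 19: 4, 20: 4, 21: 2, 23: 2, 24: 2, 25: 2,
    28: 4, 29: 2, 30: 2, 33: 4, 34: 2, 37: 2, 38: 2, 39: 4, 40: 4, 42: 2,
    43: 4, 44: 2, 46: 4, 47: 2, 48: 6, 49: 2, 50: 2, 54: 12, 57: 2, 58: 2,
    59: 2, 60: 6, 61: 2,
}


def expand(counts):
    """Turn a value: frequency mapping back into a flat list of values."""
    return [value for value, n in counts.items() for _ in range(n)]


def describe(values):
    """Return the main descriptive statistics for a list of numbers."""
    n = len(values)
    sd = statistics.stdev(values)  # sample standard deviation (n - 1)
    return {
        "count": n,
        "sum": sum(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "mode": statistics.mode(values),
        "sample_variance": statistics.variance(values),  # divides by n - 1
        "standard_deviation": sd,
        "standard_error": sd / math.sqrt(n),
        "minimum": min(values),
        "maximum": max(values),
        "range": max(values) - min(values),
    }


income = expand(INCOME_COUNTS)
summary = describe(income)
for name, value in summary.items():
    print(f"{name}: {value}")
```

The sample variance divides by n - 1, which is the convention that reproduces the 201.986... figure reported for Annual Income.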
Histogram

Age Group   Annual Income
18-25       908
26-35       826
36-50       1172
51-65       684
66-70       366

Note:
1. Axis: Y axis represents the total annual income in $1000
         X axis represents the age range of individuals

2. Interpretation:
a. The histogram has been made to analyse the earnings of each age group.
b. The series plotted is the total annual income of each age group.
c. The lowest-earning age group is 66-70, whereas the highest-earning age group is 36-50.
d. A fair comparison can be made between age groups 18-25 and 36-50.
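
The grouping behind the table above can be sketched in Python. This is an illustrative sketch under stated assumptions, not the workbook's formula: the function name `income_by_age_band` and the constant `AGE_BANDS` are introduced here, the bands are treated as inclusive on both ends, and only the first five customers of the dataset are used as sample input.

```python
# Age bands from the histogram table, as inclusive (low, high) bounds.
# AGE_BANDS and income_by_age_band are illustrative names.
AGE_BANDS = [(18, 25), (26, 35), (36, 50), (51, 65), (66, 70)]


def income_by_age_band(rows, bands=AGE_BANDS):
    """Sum annual income (k$) per age band; rows are (age, income) pairs."""
    totals = {f"{low}-{high}": 0 for low, high in bands}
    for age, income in rows:
        for low, high in bands:
            if low <= age <= high:
                totals[f"{low}-{high}"] += income
                break  # each customer belongs to exactly one band
    return totals


# First five customers of the dataset as (Age, Annual Income (k$)) pairs.
sample = [(19, 15), (21, 15), (20, 16), (23, 16), (31, 17)]
sample_totals = income_by_age_band(sample)
print(sample_totals)
```

For this five-row sample, four customers fall in the 18-25 band and one in 26-35. Passing all 100 (Age, Annual Income) pairs instead is intended to give the per-band totals plotted in the chart.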
Line Graph

[Line charts: Age, Annual Income (k$), and Spending Score (1-100), each plotted against observation number]
BOX PLOTS
Scatter Plots

Annual Income & Spending Score (1-100)
[Scatter plot: x axis Annual Income (k$), y axis Spending Score (1-100)]

The correlation is very slightly negative (r = -0.004433134), so there is practically no linear relationship.

Age Group & Spending Score (1-100)
[Scatter plot: x axis Age Group, y axis Spending Score (1-100)]

Negative correlation (r = -0.432462).

Correlation
                         Age         Annual Income (k$)   Spending Score (1-100)
Age                      1
Annual Income (k$)       0.229366    1
Spending Score (1-100)   -0.432462   -0.004433134         1
CustomerID Age Annual Income (k$) Spending Score (1-100) x-mean(x) y-mean(y) (x-mean(x))(y-mean(y))
1 19 15 39 -20.75 -24.56 509.62
2 21 15 81 -18.75 -24.56 460.5
3 20 16 6 -19.75 -23.56 465.31
4 23 16 77 -16.75 -23.56 394.63
5 31 17 40 -8.75 -22.56 197.4
6 22 17 76 -17.75 -22.56 400.44
7 35 18 6 -4.75 -21.56 102.41
8 23 18 94 -16.75 -21.56 361.13
9 64 19 3 24.25 -20.56 -498.58
10 30 19 72 -9.75 -20.56 200.46
11 67 19 14 27.25 -20.56 -560.26
12 35 19 99 -4.75 -20.56 97.66
13 58 20 15 18.25 -19.56 -356.97
14 24 20 77 -15.75 -19.56 308.07
15 37 20 13 -2.75 -19.56 53.79
16 22 20 79 -17.75 -19.56 347.19
17 35 21 35 -4.75 -18.56 88.16
18 20 21 66 -19.75 -18.56 366.56
19 52 23 29 12.25 -16.56 -202.86
20 35 23 98 -4.75 -16.56 78.66
21 35 24 35 -4.75 -15.56 73.91
22 25 24 73 -14.75 -15.56 229.51
23 46 25 5 6.25 -14.56 -91
24 31 25 73 -8.75 -14.56 127.4
25 54 28 14 14.25 -11.56 -164.73
26 29 28 82 -10.75 -11.56 124.27
27 45 28 32 5.25 -11.56 -60.69
28 35 28 61 -4.75 -11.56 54.91
29 40 29 31 0.25 -10.56 -2.64
30 23 29 87 -16.75 -10.56 176.88
31 60 30 4 20.25 -9.56 -193.59
32 21 30 73 -18.75 -9.56 179.25
33 53 33 4 13.25 -6.56 -86.92
34 18 33 92 -21.75 -6.56 142.68
35 49 33 14 9.25 -6.56 -60.68
36 21 33 81 -18.75 -6.56 123
37 42 34 17 2.25 -5.56 -12.51
38 30 34 73 -9.75 -5.56 54.21
39 36 37 26 -3.75 -2.56 9.6
40 20 37 75 -19.75 -2.56 50.56
41 65 38 35 25.25 -1.56 -39.39
42 24 38 92 -15.75 -1.56 24.57
43 48 39 36 8.25 -0.56 -4.62
44 31 39 61 -8.75 -0.56 4.9
45 49 39 28 9.25 -0.56 -5.18
46 24 39 65 -15.75 -0.56 8.82
47 50 40 55 10.25 0.44 4.51
48 27 40 47 -12.75 0.44 -5.61
49 29 40 42 -10.75 0.44 -4.73
50 31 40 42 -8.75 0.44 -3.85
51 49 42 52 9.25 2.44 22.57
52 33 42 60 -6.75 2.44 -16.47
53 31 43 54 -8.75 3.44 -30.1
54 59 43 60 19.25 3.44 66.22
55 50 43 45 10.25 3.44 35.26
56 47 43 41 7.25 3.44 24.94
57 51 44 50 11.25 4.44 49.95
58 69 44 46 29.25 4.44 129.87
59 27 46 51 -12.75 6.44 -82.11
60 53 46 46 13.25 6.44 85.33
61 70 46 56 30.25 6.44 194.81
62 19 46 55 -20.75 6.44 -133.63
63 67 47 52 27.25 7.44 202.74
64 54 47 59 14.25 7.44 106.02
65 63 48 51 23.25 8.44 196.23
66 18 48 59 -21.75 8.44 -183.57
67 43 48 50 3.25 8.44 27.43
68 68 48 48 28.25 8.44 238.43
69 19 48 59 -20.75 8.44 -175.13
70 32 48 47 -7.75 8.44 -65.41
71 70 49 55 30.25 9.44 285.56
72 47 49 42 7.25 9.44 68.44
73 60 50 49 20.25 10.44 211.41
74 60 50 56 20.25 10.44 211.41
75 59 54 47 19.25 14.44 277.97
76 26 54 54 -13.75 14.44 -198.55
77 45 54 53 5.25 14.44 75.81
78 40 54 48 0.25 14.44 3.61
79 23 54 52 -16.75 14.44 -241.87
80 49 54 42 9.25 14.44 133.57
81 57 54 51 17.25 14.44 249.09
82 38 54 55 -1.75 14.44 -25.27
83 67 54 41 27.25 14.44 393.49
84 46 54 44 6.25 14.44 90.25
85 21 54 57 -18.75 14.44 -270.75
86 48 54 46 8.25 14.44 119.13
87 55 57 58 15.25 17.44 265.96
88 22 57 55 -17.75 17.44 -309.56
89 34 58 60 -5.75 18.44 -106.03
90 50 58 46 10.25 18.44 189.01
91 68 59 55 28.25 19.44 549.18
92 18 59 41 -21.75 19.44 -422.82
93 48 60 49 8.25 20.44 168.63
94 40 60 40 0.25 20.44 5.11
95 32 60 42 -7.75 20.44 -158.41
96 24 60 52 -15.75 20.44 -321.93
97 47 60 47 7.25 20.44 148.19
98 27 60 50 -12.75 20.44 -260.61
99 48 61 42 8.25 21.44 176.88
100 20 61 49 -19.75 21.44 -423.44
Sum of (x-mean(x))(y-mean(y)) = 5043
(x-mean(x))^2 (y-mean(y))^2
430.5625 603.1936
351.5625 603.1936
390.0625 555.0736
280.5625 555.0736
76.5625 508.9536
315.0625 508.9536
22.5625 464.8336
280.5625 464.8336
588.0625 422.7136
95.0625 422.7136
742.5625 422.7136
22.5625 422.7136
333.0625 382.5936
248.0625 382.5936
7.5625 382.5936
315.0625 382.5936
22.5625 344.4736
390.0625 344.4736
150.0625 274.2336
22.5625 274.2336
22.5625 242.1136
217.5625 242.1136
39.0625 211.9936
76.5625 211.9936
203.0625 133.6336
115.5625 133.6336
27.5625 133.6336
22.5625 133.6336
0.0625 111.5136
280.5625 111.5136
410.0625 91.3936
351.5625 91.3936
175.5625 43.0336
473.0625 43.0336
85.5625 43.0336
351.5625 43.0336
5.0625 30.9136
95.0625 30.9136
14.0625 6.5536
390.0625 6.5536
637.5625 2.4336
248.0625 2.4336
68.0625 0.3136
76.5625 0.3136
85.5625 0.3136
248.0625 0.3136
105.0625 0.1936
162.5625 0.1936
115.5625 0.1936
76.5625 0.1936
85.5625 5.9536
45.5625 5.9536
76.5625 11.8336
370.5625 11.8336
105.0625 11.8336
52.5625 11.8336
126.5625 19.7136
855.5625 19.7136
162.5625 41.4736
175.5625 41.4736
915.0625 41.4736
430.5625 41.4736
742.5625 55.3536
203.0625 55.3536
540.5625 71.2336
473.0625 71.2336
10.5625 71.2336
798.0625 71.2336
430.5625 71.2336
60.0625 71.2336
915.0625 89.1136
52.5625 89.1136
410.0625 108.9936
410.0625 108.9936
370.5625 208.5136
189.0625 208.5136
27.5625 208.5136
0.0625 208.5136
280.5625 208.5136
85.5625 208.5136
297.5625 208.5136
3.0625 208.5136
742.5625 208.5136
39.0625 208.5136
351.5625 208.5136
68.0625 208.5136
232.5625 304.1536
315.0625 304.1536
33.0625 340.0336
105.0625 340.0336
798.0625 377.9136
473.0625 377.9136
68.0625 417.7936
0.0625 417.7936
60.0625 417.7936
248.0625 417.7936
52.5625 417.7936
162.5625 417.7936
68.0625 459.6736
390.0625 459.6736
Sum 24174.75 19996.64

Correlation Coefficient

r = \frac{\sum (x-\bar{x})(y-\bar{y})}{\sqrt{\sum (x-\bar{x})^2 \, \sum (y-\bar{y})^2}}

r = 0.229366220503994

Since r = 0.229366221, the correlation between age and annual income is a weak positive linear relationship.
The relationship is positive because annual income tends to increase as age increases.

r         Correlation coefficient
x         Age
y         Annual Income
Mean(x)   Average age of the individuals
Mean(y)   Average annual income earned by the individuals
Σ         Summation

                         Age           Annual Income (k$)   Spending Score (1-100)
Age                      1
Annual Income (k$)       0.229366221   1
Spending Score (1-100)   -0.43246224   -0.004433134         1
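
The correlation coefficient can be checked directly from the column totals of the deviation tables. The sketch below is a minimal Python illustration, assuming the three sums reported in the tables (5043, 24174.75 and 19996.64); the function names `r_from_sums` and `pearson_r` are introduced here for illustration and are not part of the workbook.

```python
import math


def r_from_sums(sum_xy, sum_xx, sum_yy):
    """Pearson r from the sum of cross-products and the two sums of squares."""
    return sum_xy / math.sqrt(sum_xx * sum_yy)


def pearson_r(xs, ys):
    """Pearson r computed from raw paired data using deviations from the means."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sum_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sum_xx = sum((x - mean_x) ** 2 for x in xs)
    sum_yy = sum((y - mean_y) ** 2 for y in ys)
    return r_from_sums(sum_xy, sum_xx, sum_yy)


# Totals taken from the deviation tables (x = Age, y = Annual Income).
r_table = r_from_sums(5043, 24174.75, 19996.64)
print(round(r_table, 6))
```

`pearson_r` applies the same formula to raw columns, so passing the Age and Annual Income columns to it is a second way to arrive at the same coefficient.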
Regression of annual income and spending score
SUMMARY OUTPUT

Regression Statistics
Multiple R          0.00443313
R Square            1.9653E-05
Adjusted R Square   -0.0101842
Standard Error      21.7662905
Observations        100

ANOVA
             df   SS         MS         F          Significance F
Regression   1    0.912484   0.912484   0.001926   0.96508440149
Residual     98   46429.6    473.7714
Total        99   46430.51

               Coefficients   Standard Error   t Stat      P-value     Lower 95%        Upper 95%     Lower 95.0%   Upper 95.0%
Intercept      50.1972331     6.46656          7.762586    8.115E-12   37.3645543957    63.02991187   37.3645544    63.0299119
X Variable 1   -0.0067551     0.153924         -0.043886   0.9650844   -0.3122119986    0.298701729   -0.312212     0.29870173

Regression of Age & Spending Score
SUMMARY OUTPUT

Regression Statistics
Multiple R          0.4324622425
R Square            0.1870235912
Adjusted R Square   0.1787279136
Standard Error      19.625813197
Observations        100

ANOVA
             df   SS               MS         F         Significance F
Regression   1    8683.60072234    8683.601   22.5447   7.012818788E-06
Residual     98   37746.9092777    385.1725
Total        99   46430.51

               Coefficients    Standard Error   t Stat      P-value     Lower 95%         Upper 95%    Lower 95.0%   Upper 95.0%
Intercept      73.753527131    5.38763549927    13.68941    1.767E-24   63.06193982552    84.445114    63.0619398    84.44511
X Variable 1   -0.599334016    0.12622537367    -4.748126   7.013E-06   -0.849824161844   -0.3488439   -0.8498242    -0.348844
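
The entries of a regression summary are tied together by simple arithmetic, which gives a quick consistency check. The sketch below is a hedged Python illustration using the values reported for the Age & Spending Score regression; the function names are introduced here for illustration, and `predict_spending` assumes that "X Variable 1" is Age and the response is Spending Score (1-100).

```python
def r_squared(ss_regression, ss_total):
    """R Square is the share of the total sum of squares explained by the fit."""
    return ss_regression / ss_total


def f_statistic(ss_regression, df_regression, ss_residual, df_residual):
    """F is the regression mean square divided by the residual mean square."""
    return (ss_regression / df_regression) / (ss_residual / df_residual)


def t_statistic(coefficient, standard_error):
    """t Stat is the coefficient divided by its standard error."""
    return coefficient / standard_error


def predict_spending(age, intercept=73.753527131, slope=-0.599334016):
    """Fitted line: spending score = intercept + slope * age (assumed roles)."""
    return intercept + slope * age


# Values reported in the Age & Spending Score summary output.
r2 = r_squared(8683.60072234, 46430.51)
f_value = f_statistic(8683.60072234, 1, 37746.9092777, 98)
t_slope = t_statistic(-0.599334016, 0.12622537367)

print(round(r2, 7), round(f_value, 4), round(t_slope, 6))
print(round(predict_spending(40), 4))
```

With a single predictor, F equals the square of the slope's t statistic, which is why the Significance F and the slope's P-value agree in the output above.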
