Académique Documents
Professionnel Documents
Culture Documents
Devoir de Machine Learning
Devoir de Machine Learning
'''
df = pd.read_csv('D:\IA\data.csv')
#lecture de la base de donnée (dataset)
df.head(5)
#affichage des 5 premières données
In [25]: df.info()
#affichage de toutes les caractéristiques de la dataset
variables_qualitatives = df.select_dtypes(include=['object'])
variables_quantitatives = df.select_dtypes(exclude=['object'])
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 50 entries, 0 to 49
Data columns (total 12 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 layer_height 50 non-null float64
1 wall_thickness 50 non-null int64
2 infill_density 50 non-null int64
3 infill_pattern 50 non-null object
4 nozzle_temperature 50 non-null int64
5 bed_temperature 50 non-null int64
6 print_speed 50 non-null int64
7 material 50 non-null object
8 fan_speed 50 non-null int64
9 roughness 50 non-null int64
10 tension_strenght 50 non-null int64
11 elongation 50 non-null float64
dtypes: float64(2), int64(8), object(2)
memory usage: 4.8+ KB
Variables qualitatives :
['infill_pattern', 'material']
Variables quantitatives :
['layer_height', 'wall_thickness', 'infill_density', 'nozzle_temperatur
e', 'bed_temperature', 'print_speed', 'fan_speed', 'roughness', 'tensio
n_strenght', 'elongation']
In [57]: df['material'].value_counts()
#compte des différents types de matériaux
abs 25
Out[57]:
pla 25
Name: material, dtype: int64
In [26]: df['infill_pattern'].value_counts()
#identifier et dénombrer les différents types de motif de remplissage
grid 25
Out[26]:
honeycomb 25
Name: infill_pattern, dtype: int64
'''
In [ ]: _='''
layer_height :
wall_thickness :
infill_density :
nozzle_temperature :
bed_temperature :
-print_speed :
-fan_speed :
-roughness :
-tension_strenght :
- elongation :
'''
In [49]: df['infill_density'].plot.box()
#réprésentation d'un graphique en boîte (boîtes à moustaches) pour visua
<Axes: >
Out[49]:
In [54]: Q1 = df['infill_density'].quantile(0.25)
Q2 = df['infill_density'].quantile(0.75)
iqr = Q2 - Q1
coefficient = iqr * 1.5
li = Q1 - coefficient
ls = Q2 + coefficient
result = df[(df['infill_density'] > li) | (df['infill_density'] < ls)]
result.sort_values("infill_density")
In [55]: pg=df[(df['infill_density']<=ls)&(df['infill_density']>=li)]
pg['infill_density'].plot.box()
<Axes: >
Out[55]:
In [ ]: _='''
on constate que malgrés le fait qu'on ait ajuster les paramètres, donnée
notre nouveau graphique en boites tracé ne fait état d'aucun point abbé
une bonne cohérence des données
une précision lors de la collecte des données
une bonne distribution des données
'''
In [ ]: _='''
Le prochain dataset dont on fera l'etude descriptive des donnees, porte
Une série d’expériences d’usinage ont été menées sur des blocs de cire d
sur le banc d’essai de recherche sur la fabrication et l’automatisation
Les données d’usinage ont été collectées à partir d’une machine CNC pou
de la vitesse d’avance et de la pression de serrage. Chaque expérience a
gravée dans la face supérieure.
'''
gf.head(5)
#affichage des 5 premières données
In [19]: gf.info()
#affichage de toutes les caractéristiques de la dataset
variables_qualitatives = gf.select_dtypes(include=['object'])
variables_quantitatives = gf.select_dtypes(exclude=['object'])
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 18 entries, 0 to 17
Data columns (total 7 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 No 18 non-null int64
1 material 18 non-null object
2 feedrate 18 non-null int64
3 clamp_pressure 18 non-null float64
4 tool_condition 18 non-null object
5 machining_finalized 18 non-null object
6 passed_visual_inspection 14 non-null object
dtypes: float64(1), int64(2), object(4)
memory usage: 1.1+ KB
Variables qualitatives :
['material', 'tool_condition', 'machining_finalized', 'passed_visual_in
spection']
Variables quantitatives :
['No', 'feedrate', 'clamp_pressure']
In [20]: gf['material'].value_counts()
#compte des différents types de matériaux
wax 18
Out[20]:
Name: material, dtype: int64
In [52]: # d'après le graphe ci-dessus, on constate que tous les matériaux sont d
In [23]: gf['tool_condition'].value_counts()
#compte des différents types de matériaux
worn 10
Out[23]:
unworn 8
Name: tool_condition, dtype: int64
In [ ]: #il en ressort qu'il ya plus d'entites de type worn que de type unworn
In [25]: gf['machining_finalized'].value_counts()
#compte des différents types de matériaux
yes 14
Out[25]:
no 4
Name: machining_finalized, dtype: int64
In [ ]: # ceci montre que l'usinages est finalisé dans presque 75% des cas
In [27]: gf['passed_visual_inspection'].value_counts()
#compte des différents types de matériaux
yes 10
Out[27]:
no 4
Name: passed_visual_inspection, dtype: int64
La moyenne de No est:
9.5
La moyenne de feedrate est:
9.722222222222221
La moyenne de clamp_pressure est:
3.4166666666666665
La mediane de No est:
9.5
La mediane de feedrate est:
6.0
La mediane de clamp_pressure est:
3.5
'''
In [32]: gf['feedrate'].plot.box()
#réprésentation d'un graphique en boîte (boîtes à moustaches) pour visua
<Axes: >
Out[32]:
In [35]: Q1 = gf['feedrate'].quantile(0.25)
Q2 = gf['feedrate'].quantile(0.75)
iqr = Q2 - Q1
coefficient = iqr * 1.5
li = Q1 - coefficient
ls = Q2 + coefficient
result = gf[(gf['feedrate'] > li) | (gf['feedrate'] < ls)]
result.sort_values("feedrate")
In [36]: dg=gf[(gf['feedrate']<=ls)&(gf['feedrate']>=li)]
dg['feedrate'].plot.box()
<Axes: >
Out[36]:
In [ ]: _='''
on constate que malgrés le fait qu'on ait ajuster les paramètres, donnée
notre nouveau graphique en boites tracé ne fait état d'aucun point abbé
une bonne cohérence des données
une précision lors de la collecte des données
une bonne distribution des données
'''
In [ ]: _='''
Dataset HousePrediction
'''
5 rows × 81 columns
In [42]: ep.info()
#affichage de toutes les caractéristiques de la dataset
variables_qualitatives = ep.select_dtypes(include=['object'])
variables_quantitatives = ep.select_dtypes(exclude=['object'])
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 1460 entries, 0 to 1459
Data columns (total 81 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 Id 1460 non-null int64
1 MSSubClass 1460 non-null int64
2 MSZoning 1460 non-null object
3 LotFrontage 1201 non-null float64
4 LotArea 1460 non-null int64
5 Street 1460 non-null object
6 Alley 91 non-null object
7 LotShape 1460 non-null object
8 LandContour 1460 non-null object
9 Utilities 1460 non-null object
10 LotConfig 1460 non-null object
11 LandSlope 1460 non-null object
12 Neighborhood 1460 non-null object
13 Condition1 1460 non-null object
14 Condition2 1460 non-null object
15 BldgType 1460 non-null object
16 HouseStyle 1460 non-null object
17 OverallQual 1460 non-null int64
18 OverallCond 1460 non-null int64
19 YearBuilt 1460 non-null int64
20 YearRemodAdd 1460 non-null int64
21 RoofStyle 1460 non-null object
22 RoofMatl 1460 non-null object
23 Exterior1st 1460 non-null object
24 Exterior2nd 1460 non-null object
25 MasVnrType 1452 non-null object
26 MasVnrArea 1452 non-null float64
27 ExterQual 1460 non-null object
28 ExterCond 1460 non-null object
29 Foundation 1460 non-null object
30 BsmtQual 1423 non-null object
31 BsmtCond 1423 non-null object
32 BsmtExposure 1422 non-null object
33 BsmtFinType1 1423 non-null object
34 BsmtFinSF1 1460 non-null int64
35 BsmtFinType2 1422 non-null object
36 BsmtFinSF2 1460 non-null int64
37 BsmtUnfSF 1460 non-null int64
38 TotalBsmtSF 1460 non-null int64
39 Heating 1460 non-null object
40 HeatingQC 1460 non-null object
41 CentralAir 1460 non-null object
42 Electrical 1459 non-null object
43 1stFlrSF 1460 non-null int64
44 2ndFlrSF 1460 non-null int64
45 LowQualFinSF 1460 non-null int64
46 GrLivArea 1460 non-null int64
47 BsmtFullBath 1460 non-null int64
48 BsmtHalfBath 1460 non-null int64
49 FullBath 1460 non-null int64
50 HalfBath 1460 non-null int64
51 BedroomAbvGr 1460 non-null int64
52 KitchenAbvGr 1460 non-null int64
53 KitchenQual 1460 non-null object
54 TotRmsAbvGrd 1460 non-null int64
55 Functional 1460 non-null object
56 Fireplaces 1460 non-null int64
57 FireplaceQu 770 non-null object
58 GarageType 1379 non-null object
59 GarageYrBlt 1379 non-null float64
60 GarageFinish 1379 non-null object
61 GarageCars 1460 non-null int64
62 GarageArea 1460 non-null int64
63 GarageQual 1379 non-null object
64 GarageCond 1379 non-null object
65 PavedDrive 1460 non-null object
66 WoodDeckSF 1460 non-null int64
67 OpenPorchSF 1460 non-null int64
68 EnclosedPorch 1460 non-null int64
69 3SsnPorch 1460 non-null int64
70 ScreenPorch 1460 non-null int64
71 PoolArea 1460 non-null int64
72 PoolQC 7 non-null object
73 Fence 281 non-null object
74 MiscFeature 54 non-null object
75 MiscVal 1460 non-null int64
76 MoSold 1460 non-null int64
77 YrSold 1460 non-null int64
78 SaleType 1460 non-null object
79 SaleCondition 1460 non-null object
80 SalePrice 1460 non-null int64
dtypes: float64(3), int64(35), object(43)
memory usage: 924.0+ KB
Variables qualitatives :
['MSZoning', 'Street', 'Alley', 'LotShape', 'LandContour', 'Utilities',
'LotConfig', 'LandSlope', 'Neighborhood', 'Condition1', 'Condition2',
'BldgType', 'HouseStyle', 'RoofStyle', 'RoofMatl', 'Exterior1st', 'Exte
rior2nd', 'MasVnrType', 'ExterQual', 'ExterCond', 'Foundation', 'BsmtQu
al', 'BsmtCond', 'BsmtExposure', 'BsmtFinType1', 'BsmtFinType2', 'Heati
ng', 'HeatingQC', 'CentralAir', 'Electrical', 'KitchenQual', 'Functiona
l', 'FireplaceQu', 'GarageType', 'GarageFinish', 'GarageQual', 'GarageC
ond', 'PavedDrive', 'PoolQC', 'Fence', 'MiscFeature', 'SaleType', 'Sale
Condition']
Variables quantitatives :
['Id', 'MSSubClass', 'LotFrontage', 'LotArea', 'OverallQual', 'OverallC
ond', 'YearBuilt', 'YearRemodAdd', 'MasVnrArea', 'BsmtFinSF1', 'BsmtFin
SF2', 'BsmtUnfSF', 'TotalBsmtSF', '1stFlrSF', '2ndFlrSF', 'LowQualFinS
F', 'GrLivArea', 'BsmtFullBath', 'BsmtHalfBath', 'FullBath', 'HalfBat
h', 'BedroomAbvGr', 'KitchenAbvGr', 'TotRmsAbvGrd', 'Fireplaces', 'Gara
geYrBlt', 'GarageCars', 'GarageArea', 'WoodDeckSF', 'OpenPorchSF', 'Enc
losedPorch', '3SsnPorch', 'ScreenPorch', 'PoolArea', 'MiscVal', 'MoSol
d', 'YrSold', 'SalePrice']
La moyenne de Id est:
730.5
La moyenne de MSSubClass est:
56.897260273972606
La moyenne de LotFrontage est:
70.04995836802665
La moyenne de LotArea est:
10516.828082191782
La moyenne de OverallQual est:
6.0993150684931505
La moyenne de OverallCond est:
5.575342465753424
La moyenne de YearBuilt est:
1971.267808219178
La moyenne de YearRemodAdd est:
1984.8657534246574
La moyenne de MasVnrArea est:
103.68526170798899
La moyenne de BsmtFinSF1 est:
443.6397260273973
La moyenne de BsmtFinSF2 est:
46.54931506849315
La moyenne de BsmtUnfSF est:
567.2404109589041
La moyenne de TotalBsmtSF est:
1057.4294520547944
La moyenne de 1stFlrSF est:
1162.626712328767
La moyenne de 2ndFlrSF est:
346.99246575342465
La moyenne de LowQualFinSF est:
5.844520547945206
La moyenne de GrLivArea est:
1515.463698630137
La moyenne de BsmtFullBath est:
0.42534246575342466
La moyenne de BsmtHalfBath est:
0.057534246575342465
La moyenne de FullBath est:
1.5650684931506849
La moyenne de HalfBath est:
0.38287671232876713
La moyenne de BedroomAbvGr est:
2.8664383561643834
La moyenne de KitchenAbvGr est:
1.0465753424657533
La moyenne de TotRmsAbvGrd est:
6.517808219178082
La moyenne de Fireplaces est:
0.613013698630137
La moyenne de GarageYrBlt est:
1978.5061638868744
La moyenne de GarageCars est:
1.7671232876712328
La moyenne de GarageArea est:
472.9801369863014
La moyenne de WoodDeckSF est:
94.2445205479452
La moyenne de OpenPorchSF est:
46.66027397260274
La moyenne de EnclosedPorch est:
21.954109589041096
La moyenne de 3SsnPorch est:
3.4095890410958902
La moyenne de ScreenPorch est:
15.060958904109588
La moyenne de PoolArea est:
2.758904109589041
La moyenne de MiscVal est:
43.489041095890414
La moyenne de MoSold est:
6.321917808219178
La moyenne de YrSold est:
2007.8157534246575
La moyenne de SalePrice est:
180921.19589041095
La mediane de Id est:
730.5
La mediane de MSSubClass est:
50.0
La mediane de LotFrontage est:
69.0
La mediane de LotArea est:
9478.5
La mediane de OverallQual est:
6.0
La mediane de OverallCond est:
5.0
La mediane de YearBuilt est:
1973.0
La mediane de YearRemodAdd est:
1994.0
La mediane de MasVnrArea est:
0.0
La mediane de BsmtFinSF1 est:
383.5
La mediane de BsmtFinSF2 est:
0.0
La mediane de BsmtUnfSF est:
477.5
La mediane de TotalBsmtSF est:
991.5
La mediane de 1stFlrSF est:
1087.0
La mediane de 2ndFlrSF est:
0.0
La mediane de LowQualFinSF est:
0.0
La mediane de GrLivArea est:
1464.0
La mediane de BsmtFullBath est:
0.0
La mediane de BsmtHalfBath est:
0.0
La mediane de FullBath est:
2.0
La mediane de HalfBath est:
0.0
La mediane de BedroomAbvGr est:
3.0
La mediane de KitchenAbvGr est:
1.0
La mediane de TotRmsAbvGrd est:
6.0
La mediane de Fireplaces est:
1.0
La mediane de GarageYrBlt est:
1980.0
La mediane de GarageCars est:
2.0
La mediane de GarageArea est:
480.0
La mediane de WoodDeckSF est:
0.0
La mediane de OpenPorchSF est:
25.0
La mediane de EnclosedPorch est:
0.0
La mediane de 3SsnPorch est:
0.0
La mediane de ScreenPorch est:
0.0
La mediane de PoolArea est:
0.0
La mediane de MiscVal est:
0.0
La mediane de MoSold est:
6.0
La mediane de YrSold est:
2008.0
La mediane de SalePrice est:
163000.0
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
C:\Users\PC\AppData\Local\Temp\ipykernel_948\1844451391.py:4: FutureWar
ning: iteritems is deprecated and will be removed in a future version.
Use .items instead.
for value, count in counts.iteritems():
In [66]: ep['SalePrice'].plot.box()
#réprésentation d'un graphique en boîte (boîtes à moustaches) pour visua
<Axes: >
Out[66]:
In [73]: Q1 = ep['SalePrice'].quantile(0.25)
Q2 = ep['SalePrice'].quantile(0.75)
iqr = Q2 - Q1
coefficient = iqr * 1
li = Q1 - coefficient
ls = Q2 + coefficient
result = ep[(ep['SalePrice'] > li) | (ep['SalePrice'] < ls)]
result.sort_values("SalePrice")
In [74]: eg=ep[(ep['SalePrice']<=ls)&(ep['SalePrice']>=li)]
eg['SalePrice'].plot.box()
<Axes: >
Out[74]:
In [76]: _='''
En éliminant les points aberrants dans le diagramme en boîte, nous avon
la visualisation et nous nous concentrons davantage sur la tendance géné
Cela permet une meilleure interprétation des mesures centrales, de la di
des données sans être biaisé par des valeurs atypiques.
Cela peut nous aider à obtenir une image plus précise et représentative
'''
ensemble_X :
Id
MSSubClass
MSZoning
LotFrontage
LotArea
Street
Alley
LotShape
LandContour
Utilities
LotConfig
LandSlope
Neighborhood
Condition1
Condition2
BldgType
HouseStyle
OverallQual
OverallCond
YearBuilt
YearRemodAdd
RoofStyle
RoofMatl
Exterior1st
Exterior2nd
MasVnrType
MasVnrArea
ExterQual
ExterCond
Foundation
BsmtQual
BsmtCond
BsmtExposure
BsmtFinType1
BsmtFinSF1
BsmtFinType2
BsmtFinSF2
BsmtUnfSF
TotalBsmtSF
Heating
HeatingQC
CentralAir
Electrical
1stFlrSF
2ndFlrSF
LowQualFinSF
GrLivArea
BsmtFullBath
BsmtHalfBath
FullBath
HalfBath
BedroomAbvGr
KitchenAbvGr
KitchenQual
TotRmsAbvGrd
Functional
Fireplaces
FireplaceQu
GarageType
GarageYrBlt
GarageFinish
GarageCars
GarageArea
GarageQual
GarageCond
PavedDrive
WoodDeckSF
OpenPorchSF
EnclosedPorch
3SsnPorch
ScreenPorch
PoolArea
PoolQC
Fence
MiscFeature
MiscVal
MoSold
YrSold
SaleType
SaleCondition
ensemble_Y
SalePrice
In [ ]: _='''
Pour l'analyse de ce dataset, l'ensemble y (la cible) est le prix de ve
lorqu'on a tous les autres.
'''