Académique Documents
Professionnel Documents
Culture Documents
RESEARCH
UNIVERSITY OF DELHI
OBjECTIVE:
To establish the relationship between the temperature
change and the factors associated with it namely
month, greenhouse gases like CO2, N2O, CH4, CFC.11,
CFC.12, Aerosols, TSI and MEI.
INTRODUCTION
It has been observed that there is a constant increase in temperature every
year. This climate change, also known as greenhouse effect, is because of the
rise in average surface temperatures on Earth. Greenhouse effect, by
definition, is the process by which radiation from a planet's atmosphere warms
the planet's surface to a temperature above what it would be without its
atmosphere.
Most climate scientists agree the main cause of the current global warming
trend is human expansion of the "greenhouse effect" — warming that results
when the atmosphere traps heat radiating from Earth toward space.
Certain gases in the atmosphere block heat from escaping. Long-lived gases
that remain semi-permanently in the atmosphere and do not respond
physically or chemically to changes in temperature are described as "forcing"
climate change.
On average, Earth will become warmer. Some regions may welcome warmer
temperatures, but others may not.
A stronger greenhouse effect will warm the oceans and partially melt
glaciers and other ice, increasing sea level. Ocean water also will expand if it
warms, contributing further to sea level rise.
Aerosols: the mean stratospheric aerosol optical depth at 550 nm. This
variable is linked to volcanoes, as volcanic eruptions result in new particles
being added to the atmosphere, which affect how much of the sun's energy is
reflected back into space.
TSI: The total solar irradiance (TSI) in W/m2 (the rate at which the sun's
energy is deposited per unit area). Due to sunspots and other solar
phenomena, the amount of energy that is given off by the sun varies
substantially with time.
MEI: Multivariate El Nino Southern Oscillation index (MEI), a measure of
the strength of the El Nino/La Nina-Southern Oscillation (a weather effect in
the Pacific Ocean that affects global temperatures).
THE MODEL
Multiple regression is an extension of simple linear regression. It is used when
we want to predict the value of a variable based on the value of two or more
other variables. The variable we want to predict is called the dependent
variable (or sometimes, the outcome, target or criterion variable). The
variables we are using to predict the value of the dependent variable are called
the independent variables (or sometimes, the predictor, explanatory or
regressor variables).
Multiple regression also allows you to determine the overall fit (variance
explained) of the model and the relative contribution of each of the predictors
to the total variance explained. As a result, we have employed Multiple linear
regression model to helps us determine how are these major factors used to
predict the change in temperature of a region due to per unit change in these
parameters.
To establish the relationship between the temperature and the factors
associated with it namely month, CO2, CH4, N2O, CFC-11, CFC-12, total solar
irradiance (TSI), Aerosols, MEI
In this model, we have one dependent variable (Yi) that is temperature and 4
independent variables (Xi) or explanatory variables which are CO2, MEI, TSI
and Aerosol that contribute to our regression analysis.
Mathematically,
Yi= the difference in degrees Celsius between the average global
temperature in that period and a reference value.
X1= atmospheric concentrations of carbon dioxide (CO2) expressed in
ppmv (parts per million by volume i.e., 397 ppmv of CO2 means that
CO2 constitutes 397 millionths of the total volume of the atmosphere)
X2= Multivariate El Nino Southern Oscillation index (MEI) a measure of
the strength of the El Nino/La Nina-Southern Oscillation (a weather
effect in the Pacific Ocean that affects global temperatures).
X3= the mean stratospheric aerosol optical depth at 550 nm. This
variable is linked to volcanoes, as volcanic eruptions result in new
particles being added to the atmosphere, which affect how much of the
sun's energy is reflected back into space.
X4= the total solar irradiance (TSI) in W/m2 (the rate at which the sun's
energy is deposited per unit area). Due to sunspots and other solar
phenomena, the amount of energy that is given off by the sun varies
substantially with time.
The multiple linear regression equation is as follows:
BEST-FITTED MODEL:
We used the sequential approach (step-wise) for selecting the best regression
model. Out of the 9 variables, there were just 4variables contributing to the
model, namely- CO2, MEI, AEROSOLS and TSI.
This graph shows that there is a linear relationship between our dependent
variable-TEMP and independent variables- TSI, MEI, AREOSOL and CO2.
The histogram with a bell shaped curve shows that our error term
(disturbance) follows Normal Distribution, hence following the assumptions of
the model.
R: This is the correlation coefficient. It tells you how strong the linear
relationship is. The correlation coefficient between the observed and
predicted values. It ranges in value from 0 to 1. A small value indicates
that there is little or no linear relationship between the dependent
variable and the independent variables. In our result the m R value is
0.848 which is close to 1 hence, there is a strong linear relationship
between the dependent and independent variables.
This table indicates that the regression model predicts the dependent variable
significantly well. How do we know this? Look at the "Regression" row and go
to the "Sig." column. This indicates the statistical significance of the regression
model that was run. Here, p < 0.05, and indicates that, overall, the regression
model statistically significantly predicts the outcome variable (i.e., it is a good
fit for the data).
The table is the F-test, the linear regression's F-test has the null hypothesis
that there is no linear relationship between the variables (in other words
R²=0). The F-test is highly significant, thus we can assume that there is a linear
relationship between the variables in our model.
This column shows the predictor variables (CO2, MEI, AEROSOLS and
TSI). The first variable (constant) represents the constants as the Y
intercept, the height of the regression line when it crosses the Y axis. In
other words, this is the predicted value of TEMP when all other
variables are 0.
B - These are the values for the regression equation for predicting the
dependent variable from the independent variable. The regression
equation is presented in many different ways, for example:
Y(predicted) = b0 + b1*x1 + b2*x2 + b3*x3 + b4*x4
The column of estimates provides the values for b0, b1, b2, b3 and b4 for this
equation.
1. CO2 - The coefficient for CO2 is .010. So for every unit increase in CO2,
a 0.010 unit increase in temperature is predicted, holding all other
variables constant.
2. MEI - For every unit increase in MEI, we expect a 0.068 unit increase
in the temperature, holding all other variables constant.
3. Aerosol - The coefficient for aerosols is 1.721. So for every unit
increase in Aerosols, we expect an approximately 1.721 point decrease
in the temperature, holding all other variables constant.
4. TSI - The coefficient for TSI is .099. So for every unit increase in TSI, we
expect a .099 point increase in the temperature.
Std. Error - These are the standard errors associated with the
coefficients.
Where di =Yi^ - |ε i|
We set up the hypotheses as follows:
.
Calculate the p-value p = P (tn-2 > t) where tn-2 denotes a student’s t random
variable with n-2 degrees of freedom. If p < α/2 then reject the null hypothesis
at α level of significance, i.e. Heteroscedasticity is taken to be present.
For the model under study, we have
Thus we have p-value for the t-test is greater than 0.05. Hence we may
conclude that there is no heteroscedasticity in our model, i.e. our model is
homoscedastic in nature.
AUTOCORRELATION
We can see that the value of d under Durbin Watson D-test is d=0.911
We know that value of k=no of independent variables+1
Therefore, the value of k = 4+1=5.
Here, n= no of observations in our data
Therefore, n=308.
By using the significant table for finding the critical values of dL and dU
We find dL= 1.78766 and dU=1.83991
We can very well see that the value of d lies between 0 and the lower limit dL.
This implies that there is Positive autocorrelation present in our model.
REMOVAL:
Here we make use of the COCHRAN ORCUTT ITERATIVE METHOD
Consider the model-