Artificial Neural Network Applications in Air Quality Monitoring

International
Volume Journal
II Number 1 2011for Environmental Rehabilitation and Conservation
[30-64]
Volume[ISSN
II No. 0975
1 2011 [30 – 64]
- 6272] [ISSN
Prachi 0975 - 6272]
et al.
Artificial neural network applications in air quality monitoring and

management
Prachi1, Kumar Nishant2 and Matta, Gagan3
Received: January 11, 2011 ⏐ Accepted: April 19, 2011 ⏐ Online: July 20, 2011
With the loss of biodiversity on an areas, especially in densely populated and

unprecedented scale, and addition of industrialized areas (Boznar and Mlakar,
pollutants with the potential of altering 2002). Indeed, many epidemiological studies
climates and poisoning environments on a have consistently shown an association
global scale, the pressure to understand and between particulate air pollution and
manage the natural environment are far cardiovascular and respiratory diseases. The
greater now than could ever have been decrease in respiratory efficiency and
conceived even 50 years ago (Lek and impaired capability to transport oxygen
Guegan, 1999). through the blood caused by a high
concentration of air pollutants may be
Air pollutants exert a wide range of impacts hazardous to certain sensitive groups in the
on biological, physical, and economic population i.e. those having pre-existing
systems. Sulfur dioxide (SO2); nitrogen oxide respiratory and coronary artery diseases such
(NOx); nitric oxide (NO), nitrogen dioxide as children, asthmatics and elderly people.
(NO2); carbon monoxide (CO); Ozone (O3); Consequently, it has become a vital task to
respirable suspended particulates (RSPs); etc. accurately keep track of the variation of
are some of the major airborne pollutants ambient air pollution levels in urban areas and
contributing to the quality of living in urban to take controlling measures (Barai et al.,
2000; Pasero and Mesin, 2006).
Keywords: Artificial Neural Network ⏐
Environmental Informatics ⏐ Air Pollution Air pollution control is needed to prevent the
monitoring ⏐ Pollution modeling ⏐ Modeling ⏐ situation from becoming worse in the long
For Correspondence: run. Forecasting of air quality is needed in
order to take preventive and evasive actions
1
Rotary Environmental Lab., Haridwar during episodes of airborne pollution. In this
2
Faculty of Engineering, Gurukula Kangri University, way, by influencing people's daily habits or
Haridwar, India by placing restrictions on traffic and industry
3
Department of Zoology and Environmental Science,
it should be possible to avoid excessive
Gurukula Kangri University, Haridwar, India.
Artificial Neural Network Applications in Air Quality Monitoring and Management

30

Volume II Number 1 2011 [30-64]
[ISSN 0975 - 6272] Prachi et al.
medication, reduce the need for hospital pollution prediction, flood prediction and so
treatment and even prevent premature deaths on. In the last two decades several methods
(Kolehmainen et al., 2001). based on artificial intelligence were proposed
by taken into account that they can offer more
Studies have shown that pollutants are usually informed methods that use domain specific
found entrapped into the planetary boundary knowledge and provide solutions faster than
layer (PBL), which is the lowest part of the the traditional methods, those based on a
atmosphere and has behavior directly mathematical formalism (Oprea and Matei).
influenced by its contact with the ground. In
this layer, physical quantities such as flow Numerous researches and studies have been
velocity, temperature, moisture and pollutants carried out on the subject of nature and
display rapid fluctuations (turbulence) and dynamic behavior of pollutants, emission,
vertical mixing is strong. Conclusively, propagation and effects of pollutants.
numerous studies have shown the existence of Predicting future dispersion of air pollution is
high correlation between air pollution and of immense importance since it can provide
meteorological variables (Cogliani, 2001; an effective decision making tool by giving
Pasero and Mesin, 2006). advance warning of excessive pollution
beyond the threshold. Also, it enables early
Other than the meteorological variables, the air quality control to mitigate the adverse
layout of the city, the existence of green and impacts. On the other hand, high dynamism
un-built areas, the geometry, the architectural and nonlinear behavior of air pollutant data
morphology and the thermal properties of the makes prediction difficult or inaccurate. High
buildings, the vehicular traffic, the stationary capabilities of artificial neural networks e.g.
thermal systems, all define the urban flexible structure and the use of dynamic
environment as a multi-dimensional, learning algorithm promote the application of
multivariable system. Thus, air pollution is a these intelligent systems in this domain
problem that cannot be treated independently (Abbaspour et al., 2005).
of the urban web. Concerning photochemical
pollutants, it should be noted that their Natural phenomena are mostly a time series
dynamic nature, accompanied by the strong with some degree of randomness. Pollutants
non-linearities in the underlying physical and in the atmosphere may disperse or concentrate
chemical mechanisms involved in their during varied time periods. Previous studies
creation, chemical transformation and (Giorgio and Piero, 1996) have indicated that
transportation-diffusion, is always among the the data of ambient air quality are stochastic
major challenges for the development of any time series, thereby making it possible to
modeling – forecasting method and tool make a short-term forecast on the basis of
(Athanasiadis et al., 2006). historical data. Though models may be
imperfect, they are the best tool for use in all
Admittedly, the main environmental problem aspect of air quality planning where
that need efficient software tool is the prediction is a major component such as for
prediction problem. More concrete, it can emission control (Melas et al., 2000),
mean meteorological prediction, air/soil/water accidental release of pollutant, land-use

31

planning, traffic planning (Hadjiiski and neural network was presented in 1958
Hopke, 2000), planning of measurement (Rosenblatt, 1958). So, their history goes back
programs (Rao and Rao, 2000), analyses of more than 50 years, but due to the availability
measurements/ trends and episode forecasting of modern computers from the 1980’s they
(Melas et al., 2000; Barai, et al. 2000). have grown to be a competitive tool that has
been applied widely since the mid 1990’s (Jef
The modeling and forecasting of et al., 2005). One of the reasons for their
environmental parameters involves a variety success is their capability to make regressive
of approaches. One approach is to use the approximations of non-linear functions in
atmospheric diffusion model to predict future high dimensional spaces, something that is
pollutant concentrations. A second is to missing in classical statistics. The flexibility
devise statistical models that attempt to of neural networks (NNs) led to their use in
determine the underlying relationship all possible scientific branches. In the last two
between a set of input variables (original data) decades, ANNs have been already explored in
and the targets (Shi and Harrison, 1997). various fields like chemical research
Statistical methods, either the time-series (Kvasnicka, 1990; Wythoff et al., 1990; Smits
methods, which do not use meteorological et al., 1992), physics research (Dekruger and
inputs, or regression and similar methods, Hunt, 1994), molecular biology, ecology and
which are based on multivariate linear environmental sciences and demonstrated
relationship between meteorological remarkable success.
conditions and air pollution concentrations,
are commonly used (Barai et al., 2000). ANNs imitate the learning process of the
animal brain (Lippmann, 1987) and can
In other words, though the statistical methods process problems involving very nonlinear
do provide reasonable results, these are and complex data even if the data are
essentially incapable of capturing complexity imprecise and noisy. ANNs can identify and
and non-linearity of pollution-weather learn correlated patterns between input data
relationships. To overcome this demerit of sets and corresponding target values. After
statistical methods, Artificial neural networks training, ANNs can be used to predict the
(ANN), the third approach developed in output of new independent input data. This is
recent years has become the focus of much regarded as an intelligent, cost-effective
attention, largely because they can handle the approach and has received much attention in
non-linearity and have been used to model environmental engineering. ANNs are also
pollutant concentrations with promising known as “universal function approximators”
results (Boznar et al., 1993; Comrie, 1997; because of their capacity to approximate
Gardner and Dorling, 1996, 1998; Hadjiiski virtually any continuous nonlinear function
and Hopke, 2000). with arbitrary accuracy. Thus, ANNs have the
ability to solve many complex problems in
Artificial Neural Networks (ANNs) which a priori knowledge is incomplete or
unavailable (Fabbian and Dear, 2007).
The concept of artificial neural networks was
established in 1943 (McCulloch and Pitts,
1943). Perceptron, the first practical artificial
32

It is true that as compared to traditional related to the important issue of different air
statistical techniques, a neural network (NN) pollutant concentration time series
excels by its flexibility. The main drawback is approximation and forecasting using NN that
that a NN which is trained by data from a appeared until this date. The aim is to provide
given measuring location can only forecast concise information regarding the NN types
for that specific location and it cannot give and architectures used for this purpose, their
insight into the physics behind the data: a NN results and a critical evaluation between
merely learns from examples and it is not different NN models to some crucial points.
suited to generalize to other situations (Jef et
al., 2005). Thus, there are certain questions on General overview of Artificial Neural
ANN which still remain unsolved and Network (ANN)
continue to challenge researchers; e.g., the
curse of dimensionality, local minima, The experiment with an idea to find a simple
overfitting, etc. (Lek and Guegan, 1999; Barai non-linear model for a real neuron led to the
et al., 2000; Boznar and Mlakar, 2002; Lu et development of artificial Neural Network
al., 2004). (ANN) theory by McCulloh and Pitts in 1943.
McCulloh and Pitts tried to model the bio-
Apart from some drawbacks, computing with systems using nets of simple logical
neural networks is one of the fastest growing operations. This innovation fetched a great
fields in the history of artificial intelligence interest from various researchers and
(Comrie, 1997; Gardner and Dorling, 1998; scientists all over the world. Thus several
Tecer, 2007). NNs have applications for both ANN-based models in different fields were
air pollutant time series modeling and air discovered. The technology though had lost
pollutant concentrations forecasting. A its momentum in the late 1969 till 1986 when
systematic flow of published papers on such the back-propagation of error was discovered.
applications starts in the early 1990s, boosted ANN-based models have been successfully
by the constantly improving performance and implemented in a number of disciplines
the decreasing cost of powerful computers in ranging from: Medical, automotive, defense,
one hand and by the fact that software electronics, aerospace, entertainment,
packages, commercial or open source, which financial and so on. Based on ANN
required a minimum programming effort by methodologies, Crawford, 2000 reported the
non-specialists in NN algorithm were acute appendicitis analysis. ANN is a part and
becoming widely available. parcel of intelligent based systems, designed
distinctively to improve the performance of
NNs, are developing constantly. In the conventional computing techniques. The
initiating phase of NNs, multilayer perceptron biggest drawback associated with the so
seemed to be an effective alternative to more called conventional methods is the inability to
traditional statistical techniques. But later on, learn and identify patterns in dynamic
many researches were performed using other systems. Thus the need to eliminate this
types and architectures of NNs for the same shortcoming through learning is proven
purpose of air pollution modeling. The essential.
present review focuses on the literature

33

ANN models are computer programs that are distributed computations. Figure 3 shows a
designed to emulate human knowledge taxonomy tree of NN which clearly depicts
processing, speech, prediction, classification the various NN types depending on their
and control. ANN is a cellular information architectures. The Architecture of a simple
processing system designed and developed on ANN i.e. feedforward neural network is
the basis of the perceived notion of the human shown in Figure 2B. The attraction of ANN-
brain and its neural system. The human brain based models comes with the network’s
has 100 billion biological neurons with about ability to learn, recognize data patterns, and
100 000 connections per neuron. Thus, neural adapt to a changing environment like the
networks are basically a collection of human brain. This adaptive characteristic is
interconnected neurons, each one with several often called “the human-like reasoning”.
inputs and one output. The inputs' weight
(importance) and numbers can vary. The
output is a function of all inputs. The figure
below shows how biological neurons interact
with each other.
Figure 1: A biological neuron

Above figure shows that a blue neuron is
sending an impulse to the yellow neuron. The
The architecture illustrated in Figure 2B,
yellow neuron may receive other impulses
presents a three layered feed-forward
(varying in strength) from other neurons, but
network. ANN has a remarkable capability to
is only sending one signal (a function of all
develop sense from convoluted or imprecise
the inputs). Humans and highly trained
data, extract patterns and detect trends that are
animals use the same configuration and
too complex often only noticeable by either
summing up to extremely complex networks.
humans or other computer techniques. A
Figure 2A shows that how a biological system
supervised neural network can be trained as a
can be transferred into a computer system via
"guru" in a given problem space. In broad
a sketchy representation of artificial neuron.
terms, ANN-based models offer a variety of
Similarly, an artificial network is made up of benefits namely: adaptive learning, self
simple interconnected processing elements organization, real time operation, fault
called neurons. The neurons are arranged in a tolerance via redundant information coding.
layered structure to complete a network Thus neural network processes information in
competent of executing parallel and the similar way the human brain does.

34

Voluume II Number
r 1 2011 [30-64
4]
[ISSN 0975 - 6272] P
Prachi et al.
A B
Figure 2:2 A) Sketch hy representtation of an artificial neeuron. B)

Example of a feedfo orward neuraal network, with a singlle hidden
layer andd a single outtput neuron (Pasero and Mesin, 20066).
Figurre 3: Taxxonomy off neural

netwoork architecctures (Garddner and
Dorliing, 1998).
Artificial
A Neural Network
N Appliccations in Air Quuality Monitoring and Managem
ment
35
5

Neural Network Topologies 2. Recurrent Neural Network
The most commonly used neural networks Unlike MLP network, recurrent structure
types are: radial basis function network introduces cycles or loops and backward links
(RBFN), multi-layered perceptron networks in the network. Feedback networks are
(MLP), and recurrent neural networks exceptionally dominant and can get extremely
(RNNs). The distinction in different network convoluted. The behavior of these types of
topologies can perhaps be attributed to the networks is known to be changing
arrangement of neurons and the connection continuously until they reach an equilibrium
patterns of the layers. point. This implies the state of the network
remains at the equilibrium point until the
The inner structure of the processing element input changes and a new equilibrium needs to
(neuron) in each network is interconnected be found. Feedback architectures are also
differently, and the configuration set-up is referred to as interactive or recurrent,
often referred to as network topology. The although the latter term is often used to
behavior of the network relies greatly on the denote feedback connections in single-layer
network topology. organizations.
1. Feedforward Network 3. Radial Basis Function (RBF) Network
MLP feed-forward network is referred to as a A radial basis function network is an artificial

directed cyclic graph in which the neural network that uses radial basis functions
connections are unidirectional and no loops as activation functions. RBF emerged as a
are introduced in the network, thus each variant of ANN in late 80’s and it is
neuron is linked only to neurons in the next commonly used as a pattern recognition
layer. This implies no backward links either. technique, function approximation, time
series prediction, and control. Architecture of
Multi-layer feed-forward neural networks a radial basis function network involves an
trained by backpropagation algorithm, i.e. input vector which is used as input to all
backpropagation network (BPN). The BPN is radial basis functions, each with different
one of the easiest networks to understand. Its parameters. The output of the network is a
learning and update procedure is based on a linear combination of the outputs from radial
relatively simple concept: if the network gives basis functions. A linear transfer function is
the wrong answer, then the weights are used in the output layer and a nonlinear
corrected, so the error is lessened so future transfer function (normally the Gaussian) for
responses of the network are more likely to be the hidden layer. The radial basis function
correct. The conceptual basis of the network is probably the second widely used
backpropagation algorithm was first presented type of Artificial Neural Network in contrast
in by Webos (1974), then independently to the standard Feedforward MLP network.
reinvented by Parker (1982), and presented to
a wide readership by Rumelhart et al., 1986
(Lek and Guegan, 1999).

36

Neural Network Models and its forecasting conditions with the wind blowing from the
or prediction applications in air pollution – power plant to Zavodnje when there is no
thermal inversion. So, the situation was the
Boznar et al., 1993 presented the method of same as the one appearing at Veliki vrh.
neural network for short-term air pollution These results show clearly that it is easy for a
prediction. Their study was on the prediction neural network to recognize inversion
of SO2 emissions around the Slovenian conditions, but the prediction of changeable
thermal power plant (TPP) at Sostanj. wind direction still needs improvement. For
Velenje station, the prediction found good
Air dispersion models and Cyclo-stationary because the peak SO2 concentration was
Auto Regressive (CSAR) predictors have recognized completely. Thus, the results
already been applied to the area but these obtained at all above stations were the best
models failed pertaining to the complexity of possible. They show, that with a better set of
the terrain. As input to the model, a historical training patterns, prediction of SO2
set of significant meteorological and concentrations with high accuracy would be
ecological data was used. The input possible.
parameters included the following: the wind
data, air temperature, SO2 concentrations This study set a great landmark in the field of
(actual and historical), relative humidity, solar air pollution prediction because of its
radiation, emissions from the TPP and time of inclination towards a new and better approach
day. The training set of patterns was taken of neural network. Along with this, some
from the beginning of February and the test questions also raised up like when to stop the
set of patterns from the later days of the same learning process to get the best results with
month and the method was tested on the data the unknown patterns, and how to set the back
collected by the Environmental Information propagation algorithm parameters to get the
System (EIS) of the Sostanj Thermal Power best possible convergence of the learning
Plant (TPP). The output feature was the SO2 process. The author also mentioned about the
concentration measured after half an hour. preparation of on-line implementation of the
Some predictions with good results were method in the software of the central unit of
made for the stations at Zavodnje, Veliki vrh the EIS. Their planning was to implement an
and Velenje. For the station Veliki vrh, wind automatic algorithm for selection of new
direction showed a major role in pollution. training patterns which could enable the
According to the results the network automatic updating and adapting of the neural
anticipated higher concentrations, but not network to the newly measured data.
high enough. The reason might be the
changeable wind direction at the site. For the Yi and Prybutok, 1996 describe a multilayer
Zavodnje station, unique terrain that perceptron that predicts surface ozone
surrounds it found to be significant, especially concentrations in an industrialised area of
because of the morning thermal inversions. North America. The model takes nine input
The results of the prediction on the testing set variables to predict the maximum daily
of patterns have shown that the peaks were surface ozone concentration. These variables
slightly shifted. These patterns belong to include the morning ozone concentration, the

37

maximum daily temperature, CO2, NO, NO2 dramatic improvements in the performance of
and NOx levels, and also wind speed and neural networks. Model comparison statistics
direction. Results from the multilayer clearly proved that neural networks are
perceptron were shown to be better than those somewhat, but not overwhelmingly, better
obtained from regression analysis (using the than multiple regression models for weather-
same input data). The authors also suggest based ozone forecasting. It is interesting to
that the multilayer perceptron outperforms an note that the relative improvement between
ARIMA time-series modelling approach, unlagged and lagged models is greater for
however such comparisons between regression models than neural networks (i.e.,
techniques must be made with care. For neural networks using lagged data do not have
example, to fairly compare an ARIMA time- to rely as much on persistence information in
series model with a multilayer perceptron those data). A general observation is that the
model, requires that both models are inclusion of lagged ozone data (the ozone
constructed using the same data. maximum from the previous day) improves
both kinds of model. Still, the paper suggests
Comrie, 1997 made a direct comparison of the scope of improvement in individual
the NN models and multiple regression regression and neural models. Likely
models for weather based ozone (O3) strategies suggested were using additional
forecasting in order to identify the best variables (e.g., extra weather elements,
general modeling approach. The ozone data synoptic patterns, traffic flow or day-of-the-
from eight cities around the United States week information) or changing the nature of
(Seattle, Pittsburg, Chicago, Atlanta, the model (e.g., incorporating a priori
Charlotte, Boston, Tucson and Phoenix) was curvilinear transformations of input data for
used. The data used were the daily maximum regressions, or changing the number of hidden
1-hour concentrations, for the months of May- nodes and layers in neural networks).
September, over the five-year period 1991-
1995. U.S. Environmental Protection Gardner and Dorling (1998), presented a
Agency’s (EPA) Aerometric Information and general introduction and discussion of recent
Retrieval System (AIRS) database was used applications of the multilayer perceptron
for this data. The variables selected for this (MLP), one type of ANN, in the atmospheric
study were those most closely related to sciences. Author’s main focus was on the
ozone behavior: daily maximum temperature architecture of MLP, its applications along
(TMAX), average daily dew point with a critical evaluation of back-propagation
temperature (DPTP), average daily wind algorithm. After their exhaustive study,
speed (AWND), and daily total sunshine Gordon and Dorling accepted the fact that
(TSUN). The neural network modeling was neural networks are difficult to implement
performed using the NevProp software and interpret due to some problems. First, the
package. Feedforward neural network non-availability of any rule to decide the
architecture was used along with the back- neural architecture, the number of layers and
propagation algorithm to minimize the error. nodes in those layers. Second, overfitting the
In the choice of a non-linear transfer function, training data which results in poor
sigmoid worked well. Results showed no generalization performance. Third, failure of

38

back-propagation algorithm in the case of less explicit information concerning the likely
number of nodes when it can not be able to emissions of NOx or other pollutants. MLP2
converge to a minimum during training. outperformed an autoregressive model
Fourth, curse of dimensionality which affects developed by Shi and Harrison (1997) which
the speed of backpropagation algorithm. included the previous hour lagged NOx
Strategies to reduce the complexity of NN concentrations as an input variable. It is
like feature selection and pattern selection apparent from the results that the lack of any
were suggested in the paper. So, properly information concerning diurnal emissions
trained multilayer perceptrons still shows the does not seriously hinder the MLP neural
potential to represent relationships, often with network model. This would be unlikely to
surprising accuracy that is not fully occur in other traditional models where such a
understood by the traditional theory. complex interaction between predictor
variables is not permitted. Two MLP models
In their follow up work, Gardner and Dorling, (MLP3 and MLP4) and two linear regression
1999 applied Multilayer perceptron (MLP) models (LR3 and LR4) were trained without
neural networks to model hourly NOx and time of day inputs and using a NOx emissions
NO2 pollutant concentrations in Central factor as an additional input. The performance
London from basic hourly meteorological of both models (MLP3 and MLP4) were
data. Results have shown that the models extremely similar to the performance of the
perform well when compared to previous models with no emissions factor (MLP1 and
attempts by Shi and Harrison (1997) to model MLP2). This leads to the conclusion that the
the same pollutants using regression based emissions factor could be replaced by time of
models. The data used were selected that were day inputs when using MLP neural networks
close to those used in their original study of without any detrimental effects in this work.
Shi and Harrison (1997). Hourly NOx and MLP models are able to make efficient use of
NO2 data were obtained from the Department proxy data when the optimum predictor
of the Environment, Transport and the variables are unavailable. It was shown when
Regions (DETR) automatic monitoring the addition of an hourly sunshine input to the
network between 1990 and 1991 for two MLP models did not significantly improve the
monitoring sites in Central London. Hourly models performance. Thus the lack of
meteorological data were included the Low intuitively important variables does not
cloud amount (LOW), Base of lowest cloud necessarily restrict the likelihood of
(BASE), Visibility (VIS), Dry bulb developing a reasonable model given the
temperature (DRY), Vapor pressure (VP), availability of a number of suitable proxies.
Wind speed (WS). The models were trained This is less likely to be the case when
on data from 1990. Data from 1991 were used developing regression models where
as both the validation and test data sets. In the interactions between input variables are not
study, Neural networks were trained with and permitted. Tests using previous lagged
without the emission factor (diurnal nature of concentrations illustrate that while the best
NOx emissions which is a predictor of both predictions are made by models with
NOx and NO2 concentrations). MLP 1 and additional 1 h lagged pollutant concentration
MLP2 models were generated without any inputs, the models with 24 h lagged pollutant

39

inputs manage to predict pollution episodes These data were daily average concentrations
reasonably well. Such models could be for last two years from 3/7/2000 to 20/8/2001.
implemented as air quality forecast models to This data set has been collected from Tata
provide 24 h forecasts of pollutant levels. Energy Research Institute web site
“www.teri.in”. However, only the data for
Spelmann, 1999 applied NN model to predict Ashram Chowk was used for carrying out
ozone concentrations for London, Harwell simulation studies. The dataset size had 110
and Birmingham cities. In this study, patterns. The models studied for this case, in
correlation coefficients were found 0.77, 0.72 general showed the prediction of air quality
and 0.53 respectively. The result clearly with modest accuracy. However, among all
depicts that neural networks are best as the models implemented, Self-organizing
compared to conventional methods. Feature Map (SOFM) based model has
performed extremely well in comparison to
Barai, et al., 2000 compared the performance other models. The performance of various
of neural network models: Recurrent Network models for different air quality parameters
Model (RNM), Change Point detection Model was estimated in terms of mean percentage
with RNM (CPDM), Sequential Network error (PE). The author concluded that models
Construction Model (SNCM), and Self in general have performed reasonably well
Organizing Feature Maps (SOFM) for air even with the limited historical data and the
quality forecasting. The investigation was improvement in the performance of models
carried out for long-term as well as short-term studied could be expected in case more
air quality data set. Two cases studies were available data.
performed. Case Study 1 demonstrated an
example of an annual average emission (long- Tadashi Kondo et al., 2000 dealt with the
term) data prediction using the above neural problem of environmental planning and
network models for a very limited dataset. impact assessment by estimating the spatial
The data was taken for 115 counties of distribution of each air pollution source at a
California State in United States of America large area. For the purpose, author
and collected from US EPA website implemented the GMDH (Group Method of
(www.epa.gov). This data is annual average Data Handling)-type neural network with a
data for 15 years from 1985 to 1999 for seven feedback loop in order to identify the large-
parameters namely VOC (volatile organic spatial air pollution patterns and compared it
carbon), NOX (oxides of nitrogen), CO with other identification methods. The
(carbon monoxide), SO2 (sulfur dioxide), GMDH-type neural network is a multilayered
PM10 (particulate matter with size less than neural network architecture which is
10 microns), PM2.5 (particulate matter with automatically organized by using the heuristic
size less than 2.5 microns) and NH3 self-organization method and the structural
(ammonia). In the Case Study 2, the data for parameters such as the useful input variables,
three parameters namely RPMA (Respiratory the number of layers, the number of neurons
Particulate Matter Average), SO2 (sulfur in a hidden layer and the optimum
architectures of the neurons in a hidden layer,
dioxide) and NO2 (nitrogen dioxide) was
are automatically determined in this
collected for Delhi State at nine locations.
40

algorithm. The given method was used to methods used for feature determination were
accurately estimate the source-receptor preprocessing, heuristic determination, feature
matrix. The source-receptor matrix presents a extraction and feature selection. The most
relationship between the multiple air pollution important methods for pattern selection were
sources and the air pollution concentration at meteorological knowledge-based cluster
the multiple monitoring stations. Then the air determination and Kohonen neural network
pollution concentration patterns at a large area based cluster determination. These feature
are identified by using estimated source- determination and pattern selection strategies
receptor matrix. The comparison shows that provided the guidelines for finding the most
GMDH-type neural networks are easy to relevant information in the data base available
apply for the identification problem of large- for model construction. The given paper
spatial air pollution patterns because the clearly shows the improvement of the model
optimum neural network architecture is depicted by correlation coefficient,
automatically organized. It is also more normalized mean square error or fractional
accurate than other identification methods. bias. Thus, the problems of feature
determination and pattern selection are the
Boznar et al., 2001, in their series of work essential ones that should be solved in order
from Boznar et al. (1993) presented their to obtain a good neural network-based air
work on the implementation of feature pollution prediction model. The data used for
determination and pattern selection methods neural network training (learning) should be
in the case of SO2 concentration prediction for free of patterns and features with low or no
half an hour advance around Slovenia’s information relevant to pollution prediction.
largest thermal power plant at Šoštanj (ŠTPP). The model will predict the output values of
The model was based on a multilayer the unknown (during the learning process)
perceptron neural network with two hidden patterns well enough in the case of above
layers and a sigmoid transfer function in the rules satisfied.
hidden layer and output neurons. Training
was done with backpropagation algorithm. Werner et al., 2001 tried to overcome the
Input features (data) used were major demerit of neural network models i.e.
meteorological measurements (ground level its ‘black-box’ nature. The ‘very black-box’
wind speed and direction, temperature, nature makes difficult the assessment of the
relative humidity) and ambient concentrations relevance of the different input variables
at some of the measuring station for the within the model. The neural networks lack
current half hour interval and for the previous the error calculation that some statistical
intervals. There was only one output feature – methods can do. Except for the generalization
the ambient concentration at a selected error, i.e. the RSME over all test inputs,
measuring station for the following half hour neural networks have not much to offer for
interval (Mlakar, 1972). The solutions of the measuring the quality of the net prediction.
problems regarding proper training of neural Author’s new approach was to develop a
network models suggested by Gardner and measure of how sure the network is about its
Dorling (1998) were finally experimented by answer. This paper presents an idea of using a
Boznar et al. (2001, 2002). The important two-segmented network, where the first

41

segment works as an input-oriented, (mostly the periodic components and for neural
trained by unsupervised methods) processing using the MLP was Matlab version
classification device, whereas the second 5.4. The neural processing using SOM and the
segment produces the output based on the pre-processing of the data were carried out
classification given by the first segment. In with the neural data analysis (NDA) software
the study, radial basis function self organized package (http://erin.math.jyu."). For data
map (RBFSOM) networks, which are RBF validation, statistical indicators of error,
networks using SOM as hidden layer and the namely the root mean square error (RMSE),
SOM-training to obtain the appropriate its systematic (RMSES) and unsystematic
centers for the Gaussian functions was used. (RMSEU) components, the coefficient of
For each input, the two parameters MA determination (R2), index of agreement (d),
‘maximum activity in the hidden layer’ and proportion of systematic error (PSE) and bias,
RA ‘maximum relative activity of distant were calculated for each of the five methods.
neurons’ provide a measure for the quality of Histograms of error residuals for the year
the net output. Hence the RBFSOM networks, 1998 were also plotted and shapes of the error
calculating variable bandwidths and the histograms for the periodic components and
parameters MA, RA, provide a neural for the SOM were least satisfactory, while the
network type that allows the user some best profile was achieved by MLP algorithm.
insight, how sure the network is about its However, all the methods gave fairly good
answer and how reliable the answer can be. estimates but the combination of a linear and
Thus, the new network approach found neural method did not yield any advantage
suitable for creating models that are capable over the direct application of neural networks.
to estimate the accuracy of their response
even in the situation where only few data for Vaziri, 2001 compared artificial neural
training are available. network (ANN) and regression modeling to
model motor vehicle air pollution emission
Kolehmainen et al., 2001 investigated the attributes for Tehran (Iran). The emission data
forecasting capability of the following five consisted of 2000 records extracted from the
methods: regression using periodic Tehran Air Pollution Database. The relevant
components at the year, week and day levels, emission information (consisted of variables
the MLP and SOM methods applied to the namely, emission density of CO,
original time series and the MLP and SOM hydrocarbons, oxygen and carbon dioxide),
methods applied to the residual of the periodic vehicle information and meteorological
components. An hourly time series of NO2 information were also used. The testing data
concentrations were used as an example and showed an average RMSE reduction of 90%,
basic meteorological variables from the city when four ANN’s predictions were compared.
of Stockholm were collected from years In the study, developed ANNs were found
1994-1998. The meteorological variables used superior to the developed regression models
were temperature, wind speed, wind direction as is clear from the RMSE. Thus, the
and solar radiation, in addition to which the regression modeling is ineffective in
hour of the day and month of the year were modeling the relationships among air
recorded. The software used for calculation of pollutant, meteorological and motor vehicle

42

characteristics but ANN seem to be a very al., 2001). It is remarkable that after training a
useful tool. three-layer neural network with 1995 data,
author was able to generate reasonably good
Perez et al., 2001 have shown that a three- predictions for 1996 data. Thus, the results
layer neural network may be a useful tool to clearly indicated the superiority of NN as
predict PM2.5 concentrations in the compared to others.
atmosphere of downtown Santiago, Chile
several hours in advance when hourly Lu et al., 2002 developed an improved neural
concentrations of the previous day were used network model which combines both the
as input. The paper also stressed on the fact principal component analysis (PCA)
that improvement of predictions is possible by technique and the radial basis function (RBF)
using another neural network for noise network to analyze and predict the
reduction on the original series which resulted concentration variations of six pollutants:
in significant prediction error. In the same Respirable suspended particle (RSP) , SO2,
year, Perez, 2001 predicted the SO2 NOx, NO, NO2 and CO measured hourly
concentrations in the atmosphere of Santiago, during 1999 at the Causeway Bay roadside
Chile and then showed the comparison of gaseous monitoring station. The concentration
predictions using persistence, linear of these six pollutants were used as original
regressions and feed forward neural networks. input variables and the corresponding RSP
concentration level was used as the output of
The neural network used here had 37 inputs, the models. In the study, PCA was used to
among which were the past values of SO2 reduce and orthogonalize the original
concentrations measured at different stations variables. Such orthogonalization make a
in the study area plus meteorological neural network more easily trained due to
variables as temperature, wind speed and filtering of the noise that exists in the data set.
wind direction. The data corresponded to The orthogonalized variables treated are then
hourly averages for the period that goes from used as input vectors in a RBF neural network
18 May 1995–30 September 1995, and from model to forecast the pollutant levels, e.g., the
18 May 1996–30 September 1996 which RSP level in the downtown area of Hong
include the winter seasons of years 1995 and Kong until the network training error achieves
1996. From the results shown, this paper the given error. This improved method was
concluded that knowledge of the sequence of evaluated based on hourly time series RSP
past values of sulfur dioxide concentrations is concentrations collected at the Causeway Bay
important in order to estimate its future roadside gaseous monitoring station in Hong
values. The relevance of these past values Kong during 1999. The simulation results
appears to be greater than in the case of NO showed the effectiveness of the model. The
and NO2 predictions (Perez and Trier, 2001). proposed algorithm involving the
Meteorological conditions at the time of the combination of RBF and PCA technique
intended prediction have also an important proved to be much more effective than the
effect. Being able to forecast the weather traditional RBF network and also acted as a
seems of greater importance as compared better alternative to multilayer feed-forward
with the case of PM2.5 prediction (Perez et neural networks because of simpler network

43

architecture and faster learning speed without predicted values were compared with the
any compromise with the generalization measured concentrations at three sites in
capability of the network. PCA technique Delhi. A multivariate regression model was
removes the curse of dimensionality faces by also used for comparison with the results
MLP. In this research, an adaptive method obtained by using the neural network model.
was also used to decide the number of hidden The study results indicated that the neural
nodes based on a trial-and-error procedure. network was able to give better predictions
Although it may give more nodes than with less residual mean square error than
actually needed, such an adaptive method is those given by multivariate regression
an effective potential RBF approach in models.
pollutant modeling. In this study, although the
performance of the PCA/ RBF network for Morabito et al., 2002 reported a study of
long-term prediction is not studied here, the estimation and short time prediction of
better properties expressed in the given case atmospheric pollutants using a multi-
study provide sufficient evidence to conclude resolution dynamic forecasting system
that it is worth pursuing further research using (MDFS), a combined approach of both neural
the model proposed. Although this work networks (NN) and wavelet analysis concepts.
ignored the impact of climatic conditions, Particularly, the prediction of the hydrocarbon
satisfactory results are still obtained using the (HC) in the air was carried out and an
improved PCA/RBF method. It can be environmental database that refers to the
foreseen that, if meteorological parameters Southern Italy area of the Messina Strait was
are considered, better prediction results can be used. The available database clearly showed
achieved. the characteristics of regularity (cyclicality) of
the atmospheric parameters: for example, the
Viotti et al., 2002 used ANN to forecast short temperature attains the maximum values
and middle long-term concentration levels for during mid-day, while during night and early
benzene, NOx, CO and ozone. The results morning it reaches its minimum values. The
shown a good accord with the monitored data same can be said for the atmospheric
and allowed the use of ANN as the pressure, whose regularity is strictly related to
forecasting model on a 24–48 h basis the period of the year the observations have
requiring only the meteorological conditions been taken. But given system is not able to
and the traffic level. generalize in other periods of the year due to
the effect that weather has on the presence
Abdul-Wahab and Al-Alawi, 2002 applied and stabilization of the pollutants in the air.
NN to predict ozone concentrations as a That means, a set of non-linear models need
function of meteorological conditions and to be designed in different background
various air quality parameters. The results of weather conditions. For evaluation of the
their study indicate that the ANN is a performance of the combined approach,
promising method for air pollution modeling. standard deviation was used as a reference
value. The proposed MDFS proved useful in
Chelani et al., 2002 predicted SO2 predicting experimental time series at an
concentration by using an ANN, and the improved accuracy level with respect to other

44

known approaches. The use of a most Lu et al., 2004 again applied the combined
complex system that utilizes the different time approach of PCA/RBF to forecast RSP, NOx
series derived from the multi-resolution and NO2 concentrations in an hourly time
decomposition permits us to achieve better series in Mong Kok urban area, Hong Kong.
resolution in prediction and finally, the best This case study also validated the superiority
accuracy results. The MDFS is also most of the PCA/RBF network over the simple
amenable to implementation in parallel RBF network as found in the earlier work.
processing.
Zhang et al., 2004 developed an idea of using
Dorling et al., 2003 worked on APPETISE adaptive neural network (AWNN) as a tool
project to quantitatively inter-compare the for prediction of air pollution abatement
performance of Deterministic (DET) and scenarios, and focused on avoiding overfit
Statistical Air Quality Models. For the same, during training of AWNN. The wavelet
a case study of Helsinki was used for the concept has already been applied by Morabito
prediction of NOx, NO2 and PM10 et al., 2002. The simplicity of the structure
concentration. The experimental data of and the algorithm of AWNN, improves its
Helsinki from 1996-1999 was used and efficiency of on-line learning. Author
included the concentration data of PM10, NOx obtained the data in the form of hourly NOx
and NO2, meteorological data and traffic flow and NO2 from the monitoring site of the
data. Output was recorded as the Bureau of the Environment of Heilong Jiang
concentration time series of PM10 and NO2 at in Jiamu Si City in 1995 and 1998. The
the stations of Toolo and Vallila of Helsinki. hourly metrological data were obtained for
Index of agreement as a statistical evaluation the same period from the Bureau of the
of model performance showed that NN model Weather of Heilongjiang. The meteorological
performance is better for NO2, compared with variables used in this work were similar to
that for PM10. The comparison of NN model that used by Gardner and Dorling, 1999.
and Deterministic model says that NN are About 39000 data were taken to train or test
computationally more effective but DET AWNN. Instead of the emission factors, the
models can be more easily extended to other network is given two additional time of day
locations and time periods. A tool named as inputs consisting of the sine and cosine of the
JANN (Java Artificial Neural Network) was time of day normalized between 0 and 24 h.
developed for air pollution modelling using The samples were divided into two sets—
Multi-layer Perceptron Neural Networks. training set (data of 1995) and testing set
(data of 1998). The MAE-mean absolute error
Gariazzo & Tirabassi, 2003 tested the was used as performance statistics calculated
combination of two methodologies i.e. ANN over the whole year. Results have shown that
and dispersion modeling for air pollutant the accuracy of AWNN is low in the
prediction. Results have shown better beginning of learning, but the errors gradually
performance (correlation factor, r=0.90) as decrease after about seven days learning. The
compared to earlier studies. predicted line derives from the actual line
between about 10-15 January because the
windy weather is very different from that of

45

early month, which forces AWNN to learn 65 sets for the second mill in 15 min interval
again to suit the new weather condition; in the time for several days. The data taken were
same time, this also indicates AWNN can then divided into two major parts: simulation
predict well again after several days learning (training and validation) and testing according
when it meets a new condition. Thus, the to the rule set by Environmental Protective
WNN can work similar to other static NN Agency of United States of America
when the condition of prediction is stable. (EPAUSA). The tool developed from this
Results have also shown that the new study can be utilized to predict and control
algorithm ensures good generalization of any pollution. Thus, GA was employed to
AWNN during online learning when the find the optimal operating conditions so that
condition of prediction is unstable, but the the overlimit release of emission is reduced to
classic algorithm cannot ensure the the allowable limit. This paper has shown a
generalization during the same course. new dimension of regulating air pollution
directly from industries.
Ahmad et al., 2004 utilized the combination
of ANN and genetic algorithm (GA) to find Niska et al., 2004 worked on the problem of
the optimal operating conditions in boiler neural network architecture selection which is
emission processes from the palm oil mill. In a time consuming task. The paper presented a
the study, the procedure adopted was as study where a parallel genetic algorithm (GA)
follows. First GA writes the selected input was employed for selecting the inputs and
parameters that are written in the text file. The designing the high-level architecture of a
text file is then read by the ANN and received multi-layer perceptron model for forecasting
as a new input parameter. Then, ANN will hourly concentrations of nitrogen dioxide at a
predict the output value. The output generated busy urban traffic station in Helsinki.
from this prediction is compared with the In this context, the evolutionary and genetic
pollutant limit. If the value exceeds the limit, algorithms (GA) have proven to be powerful
GA generates new input parameters from the techniques due to their ability to solve linear
GA operator, i.e. mutation and crossover. and non-linear problems by exploring all
These steps are repeated until the optimal regions of the state space and exploiting
input values of fuel are found. This was an promising areas through genetic operations.
iterative process at the end of which the GA The main drawbacks related to the using of
arrives at the optimum set of fuel parameters GAs for optimizing NNs have been high
which produce emission within acceptable computational requirement and complex
limit. Commercial software of MATLAB search space.
Version 5.3 and Neural Network Toolbox
from Mathworks Inc. (MATLAB User’s
Guides, 1998) were used to generate neural
network Modeling. Input data consisted: the concentration data
comprising of hourly concentrations of NO2,
By considering the equipment provided and NOx, O3, PM10, SO2 and CO monitored at the
the operation time for palm oil mill, the data urban air quality monitoring station in Toolo
collected were 120 sets for the first mill and (in Helsinki central) and meteorological data.

46

The test data set was extracted from the using neural networks, fuzzy logic,
APPETISE (Air pollution Episodes: generalized additive techniques, and other
Modelling Tools for Improved Smog recently proposed statistical approaches. They
Management, found that artificial neural network-based
http://www.uea.ac.uk/env/appetise/) database. models and neuro-fuzzy models were the
The data set comprised the concentrations of most promising.
airborne pollutants and meteorological
soundings and observations, monitored in Abbaspour et al., 2005 presented a new
Helsinki metropolitan area during the years structure based on neural networks e.g.
1996–1999. Out of which, the data from the cerebellar model (CMAC model). This model
years 1996-1998 was used as training data can be used as the black box approach which
and data from the year 1999 as model shows superior ability to memorize and
validation data. The fitness of model was predict unknown time series data. The CMAC
assessed from the observed and predicted model was modified into TD-CMAC to
values by calculating the index of agreement predict future behavior of air pollutant density
(IA), which is a dimensionless measure (carbon monoxide) by using the current and
limited to the range of 0–1 and thus, allows past values of observed and collected data.
the comparison of different models. The The simulations were conducted for 1-hour-
results showed that GA is an applicable ahead prediction and for 24-hour-ahead
technique in this domain; it is capable of prediction (one-day-ahead) of air pollutants
searching feasible high-level architectures and time series to compare the prediction
particularly reducing the need of capability of TD-CMAC, conventional
computational efforts by eliminating CMAC (TW-CMAC) and a multi-layer
irrelevant inputs. In the case of air quality perceptron (MLP) neural network. Time
forecasting this can also imply smaller costs series data used in this evaluation were taken
due to the smaller amount of measurements from CO values recorded and averaged at
required. No clear connection between Villa station in Tehran, Iran from October 3rd
architectural issues and performance was 2001 to March 14th 2002 at one-hour
found, so it was suggested to use somewhat intervals (3912 samples). In these simulations,
simpler architecture instead of complex one in the models, learning algorithms, output
order to minimize the risk of noise over- mapping and inputs were the same, but the
fitting. outputs were different. The efficiency
evaluation revealed that under equal
Sahin et al., 2004 applied multi-layer conditions TD-CMAC has better prediction
perceptron NN model to predict daily CO capabilities than the TW-CMAC and MLP
concentrations using metereological variables neural network. Still the short-term prediction
as predictors for the European part of provides better result than the long-term
Istanbul, Turkey. prediction. Features like usage of less
memory with lower hardware implementation
Nunnari et al., 2004 did inter-comparison costs, simplicity and ability to conduct online
between several statistical techniques for implementation, all makes TD-CMAC model
modeling SO2 concentration at a point by the better option than MLP and TW-CMAC.

47

Jef et al., 2005 examined the feasibility of a road (AQCR2), in the city of Delhi, India.
neural network short term forecasting model The eight hourly CO concentration data have
for ambient PM10 concentrations in Belgium. been collected from Central Pollution Control
In this research, the dataset consisted of half- Board (CPCB), New Delhi for a period of 3
hourly PM10 concentration from the ten years from January 1997 to December 1999,
measuring stations of Belgium for the period for both the AQCRs. The meteorological data
1997-2001. The two input parameters of including 8-h average observations of cloud
Model 1 were the forecasted boundary layer cover, pressure, mixing height, sunshine
height and the PM10 measurements of the hours, visibility, temperature, wind speed,
morning of day0. This model got a wind direction and humidity have been
reasonable accuracy and was tested in online collected from Indian Meteorological
operational mode (IRCELCELINE). By Department, New Delhi. The eight hourly
extending the model with other input average traffic characteristics data have been
parameters: cloud cover, day of week and collected from Central Road Research
wind direction, the increase in the Institute (CRRI), New Delhi for the respective
performance was noticed with more forecast AQCRs. The vehicles have been classified
accuracy because these parameters contained into four groups i.e. two wheelers, three
explanatory value for the PM10 phenomenon wheelers, four wheelers gasoline powered and
in Belgium, complementary to the two inputs four wheeler diesel powered, for which
of Model 1. The author clearly stated that gain emission factors, developed by Indian
in additional accuracy from the use of Institute of Petroleum, have been used for
temperature or the wind speed was not estimating CO and NO2 source strengths. The
marginal. Thus the features carrying useful performance of all the developed models was
information for the prediction of PM10 evaluated on the basis of index of agreement
concentrations were selected from the dataset. (d) and other statistical parameters. The
This brings the author to the conclusion that forecast performance of the developed
day to day fluctuations of PM10 models, with meteorological and traffic
concentrations in Belgian urban areas are to a characteristics (d=0.78 for AQCR1 and
large extent driven by meteorological d=0.69 for AQCR2) and with only
conditions and to a lesser extent by changes in meteorological inputs (d=0.77 for AQCR1
anthropogenic sources. and d=0.67 for AQCR2), were comparable
with the measured data. The results show that
Sharma et al., 2005 gave a review of ANN ANN-based CO models, with both
techniques in vehicular pollution modeling. meteorological and traffic characteristic
variables and with only meteorological
Nagendra and Khare, 2005 discussed the variables show best performance on the test
development and performance evaluation of data set at both the AQCRs. The study also
ANN-based vehicular exhaust emission shows that elimination of traffic characteristic
(VEE) models for predicting 8-h average CO variables from the model inputs causes
concentrations at two Air Quality Control negligible effect on model performance.
Regions (AQCRs), one representing a traffic However, the models developed with only
intersection (AQCR1) and other, an arterial traffic characteristic inputs showed poor

48

performance on the test data set at both the temperature, relative humidity, wind speed
AQCRs and reflected their inability to take and wind direction. The output represented
into account the ‘lag-effect’. PM10, CO, and NO2 concentrations. The
generalization of the model was tested by
Cigizoglu et al., 2005, studied the air correlation coefficient. Based on the results
pollution parameter estimation in Istanbul obtained, annual correlation coefficient was
based on two ANN methods, radial basis found to be 0.997, 0.969 and 0.978 for PM10,
function (RBF) and feed forward back CO and NO2 respectively. These results show
propagation (FFBP), and comparison was that approximately 98% of the variation in the
done with multi-linear regression (MLR) dependent variables (output parameters) can
method. In the study the SO2, NO and CO be explained by the independent variables
measurements in the Asian part of Istanbul, (input parameters) selected. Then the author
Ümraniye, for November 1999-April 2003 has used the mean bias error (MBE) to
period were used. One measurement was describe how much the ANN model
taken during a day. The data was not underestimates or overestimates the situation.
continuous and had gaps. The paper is divided MBE results indicate that, ANN model
in two parts. Firstly, FFBP and RBF methods always tend to over-estimate air pollutant
were employed to estimate SO2 using the past concentrations, but remain in a domain of
and present NO measurements. Secondly, the errors for which this model can be applied
study was repeated again to estimate SO2 but with good accuracy. Thus the overall results
this time using the past and present CO demonstrated that, ANN model can estimate
measurements. Both methods provided air pollutant concentrations in urban area, for
significantly superior estimations compared the given data set with an accuracy of
with the conventional MLR method. The approximately 96%. Elminir et al., 2006 also
performance evaluation criteria of FFBP for showed their concern for the inherent
the testing period was slightly better than limitations of ANN. The main limitation is
RBF. However, multiple FFBP simulations the extension of model in terms of time period
were required until obtaining satisfactory and location; this always requires training
performance criteria and this total duration with locally measured data. According to the
was longer than the unique RBF application. author, the ANN models cannot therefore be
The presented study has shown that both recommended for analyzing various air
ANN methods could be successfully pollution abatement scenarios for future
employed to infill the gaps in the air pollution years.
record.
Pasero et al., 2006 evaluated the performance
Elminir et al., 2006 applied ANN model to of MLP and Support Vector Machines (SVM)
estimate air pollutant concentrations (PM10, for the accurate prediction of the time
CO and NO2) in Egypt. The work was based evolution of air pollutant. The main aim of
upon measurements of different indicators this research was the medium-term
performed during the period 2000 to 2002 at forecasting of the air pollutants mean and
Abbassyia station, Egypt. The input variables maximum values by means of meteorological
used in the model were ambient air actual and forecasted data. Author focused on

49

the selection of features and the modeling and Neural networks (Voted Perceptron). All
processing techniques based on the theory of these algorithms are implemented in WEKA
ANN, using MLP and SVM. The study (open source software) and was used for the
provided an example of application based on data mining experiments in this work. In this
data measured every hour by a station located study, the overall forecasting performance of
in the urban area of the city of Goteborg, all algorithms was found within accepted
Sweden. The database considered was based range but the most efficient algorithm for the
on meteorological and air pollutant experiments conducted is C4.5(J48), which is
information sampled for the time period a decision tree learning algorithm. Author
January 2004 to October 2005. MLP managed to conclude that classification
performance, both for the samples under the methods should be considered as appropriate
threshold and for the samples above the for operational air quality forecasting
threshold, increased when the number of input applications. In addition to this, results clearly
features increased. Results were obtained with indicate that classification methods could be
5115 samples of days under the threshold and employed together with statistical methods
61 samples of days above the threshold. It can and other “fast” data analysis and prediction
be noted that the probability to have a false algorithms, for the creation of operational air
alarm is really low (0.82%) while the quality forecasting modules that may
capability to forecast when the concentrations effectively support operational air quality
are above the threshold is about 80%. In the management on a day-to-day basis, in line
case using SVM method, the probability to with contemporary EU environmental
have a false alarm was higher than in the legislation.
MLP (0.96%) but the capability to forecast
when the concentrations were above the Bianchini et al., 2006 proposed a novel
threshold was nearly 90%. Thus, the use of approach of using cyclostationary neural
Multi Layer Perceptrons and Support Vector network (CNN) architecture to model and
Machines were proposed as an efficient predict hourly NO2 concentration being
strategy to perform an accurate prediction of independent from exogenous data. With the
the time evolution of air pollutant supposition that the relevant meteorological
concentration. influence is already present in the time series,
it uses only the time series of NO and NO2
Athanasiadis et al, 2006 studied the concentrations for prediction and no
application of classification data mining meteorological data was taken into account.
algorithms for the development of an CNN is composed of a number of MLP
operationally efficient OF module and the blocks equal to the estimated cyclostationary
comparison of their performance with period in the analyzed phenomenon.
statistical analysis methods. Algorithms used Cyclostationary process is a subclass of non-
were iBK, Kstar, Nnge (Nearest Neighbor stationary processes which vary periodically
With Generalization), rule-based classifiers, with time. The experiment was done on the
as Conjunctive Rules, OneR, Decision Tables, data gathered from ARPA (Regional agency
decision trees (C4.5, ADTrees), along with for environmental protection), Lombardia
Bayesian Classifiers (NaiveBayes), and (Northern Italy). The task consisted in

50

modeling the NO2 time series, based on the relative magnitude and sign changes of each
past concentrations of NO and NO2. In this output. Results of this analysis showed that
case, it was evident that a strong correlation gas absorption temperature is one of the
exists between the past NO data and the important operating parameter affecting the
current value of the NO2, with a daily NOx emission and then an ANN model based
periodicity. This means that the NO2 pollution on the daily averages was trained, thus
at time t + 1 depends on the NO sampled at t eliminating the diurnal temperature change
− 24, t − 48, etc. Therefore, the considered effect. Overall, the 5 minute NOx
process has a cyclostationary period T = 24. concentration at the stack modeling results
Consequently, a CNN model composed by 24 have given a mean average error of 0.6%
MLP blocks was used to face the prediction between the actual measurements and the
task. Completely independent of exogenous ANN model, with a standard deviation of
data, such as weather condition (i.e. pressure, 6.7%. The daily average ANN model was
wind, humidity, etc.) and geographic trained to give the total NOx emission and
information, focus was only be on the NO2 gave a mean model error of 0.006 Kg/Hr
estimation and the resulting model expected NOx, with a standard deviation of 0.61 Kg/Hr.
to be much more robust against noise and Thus, the ANN models trained from daily
prediction errors. Preliminary experiments plant feature averages proved more
were very promising and have shown a informative than the 5 minute data, although
significant improvement in performance, it may be the results of the importance of the
together with a low computational cost and diurnal temperature changes in this plant.
faster speed for the CNN learning phase with
respect to standard statistical tools. CNN Ozcan et al., 2006 proposed a study of
performed better near the peak hours. estimation of methane values of Istanbul
(Turkey) landfill area using multi-layered
Boger, 2006 explained the modeling of NOx ANN approach for the measured data using 4
emission from the production plants in Iran parameters (CH4, Co2, CO, temperature) as
utilizing ANN model. For the training of inputs during 1 year (July 2002-April 2003).
ANN, the Guterman-Boger (GB) training In the present model, NN was trained and
algorithm set was used because this algorithm tested using Matlab 6.0. Statistical
can easily train large scale ANN models, as it performance indices: 1) Mean bias error
starts from non-random initial connection (Bias); 2) Mean absolute error (MAE); 3)
weights, obtained by the assumption that the Root mean square error (RMSE); 4)
inputs and outputs of the training data set are Correlation coefficient (r); and 5) index of
linearly related. Initially, the ANN model was agreement were calculated: Bias, 3.04; MAE,
trained with the database at 5 minutes interval 7.98; RMSE, 10.95; R, 0.81; and d, 0.88.
collected in the January-July 2005 period to Extraordinary performance of ANN was
predict the NOx concentration. Subsequent found indicating it as a promising technique
analysis of a trained ANN model has shown for parameter estimation of landfill areas.
that causal index algorithm for knowledge
extraction was found to be very useful in Agirre-Basurko et al., 2006 presented the
relating each input change influence on the ANN model to forecast O3 and NO2

51

concentrations in the Bilbao area, Spain. The provided by the LR model (Agirre, 2003).
primary goal of the work was to build an The cause of this uncertainty was unknown
accurate statistical model to forecast O3 and that time but suggested to get resolved by
NO2 levels k hours ahead (k = 1, 4, 5, 6, 7, 8) choosing new dataset for MLP.
in the Bilbao area. Meteorological variables,
air pollution variables and traffic variables Lira et al., 2007 investigated the forecasting
(the number of vehicles NV, the occupation capability of linear models (such as ARX,
percentage OP, and the variable KHZ ARMAX, output-error and Box-Jenkins), and
(NV/OP), which gives an idea of the velocity) neural networks. Input data included 24-h
were used to develop the model. The data PM10 concentration of the present day and
used in this work were hourly current data meteorological variables from the city of
and historical data from the air pollution Uberlandia (Brazil) during the period of
network and the traffic network of Bilbao 2003-2005 and the output foreseen by the
during years 1993–1994. The study was models obtained as 24-h PM10 concentration,
limited to four stations in Bilbao, namely with horizon of prediction of up to three days
Elorrieta, Txurdinaga, Mazarredo and Deusto. ahead. PM10 concentration data was collected
Two multilayer perceptron-based models by School of Chemical Engineering of the
(MLP1 and MLP2) and one multiple linear Federal University of Uberlândia (UFU). The
regression model (MLR) were developed. meteorological data used in study were
After introducing the appropriate inputs, the obtained in the climatic station of the Institute
outputs of the models were the forecasted O3 of Geography of the UFU. The software
or NO2 levels at time t+k (k = 1,., 8). Matlab was used for NN and linear models
were adjusted using the System Identification
The MLP neural networks used in this work toolbox. For the neural network model
were trained using the scaled conjugate (MLP), the learning algorithm used was
gradient (SCG) algorithm. At the same time, Levenberg-Marquardt back-propagation
in order to avoid overtraining, the early (Neural Network toolbox). The transfer
stopping technique was applied: the first 85% functions selected for the layers were
of the data from 1993 were the training set, hyperbolic tangent for the hidden layer and
the last 15% of the data from 1993 formed the linear for the output layer. For the evaluation
validation set and the data from 1994 were of the model’s performance, three statistical
chosen to be the test set. A joint study of the measures were selected, namely the root mean
values obtained from the statistics of the square error (RMSE), the coefficient of
Model Validation Kit showed that MLP determination (R2) and the index of
models performed better than the MLP based agreement (d). Results have shown that fairly
model in 75% of cases (in six forecasts up to good estimates can be achieved by all of the
eight) in the Bilbao area. Moreover, for k = 1, models, but Box-Jenkins model presented
4, 5, 6, 7, 8 h ahead the MLP2 model best fit and predictability.
provided the most accurate forecasts of O3
and NO2 at time t+k in the studied area.
Nevertheless, for k = 2, 3 the most accurate
forecasts of O3 and NO2 at time t+k were

52

Iliadis et al., 2007 developed a reliable ANN study, a three-layer recurrent network was
model for estimating the tropospheric ozone used. Results have shown that the predicted
concentrations at the site of Lykovryssi, pollutant concentrations were concordant with
Athens. Input variables included both the the observed concentrations on a majority of
pollution parameters (CO, NO, NO2 and the days. The best results were obtained in the
PM10) and meteorological parameters (mean training data set. At the Bahçelievler station,
air temperature, total solar radiation, mean the determination coefficient (R2) between
pressure at sea level, the relative humidity, observed and predicted values of SO2
mean wind speed, NW_SE direction wind concentrations for the training data was 0.829
component and SW-NE direction wind and for testing data was found to be 0.668.
component). Thus, in total, 4232 data records For PM concentrations, R2 was 0.820 for the
were used; 74% of them were used for the training set, and 0.808 for the testing set,
training procedure, while 26% of them were respectively. The determination coefficient
used for the testing procedure. Missing values indicates that the fitted model explains the
were excluded. Input Contribution of input percentage of the variability between
parameters was done during the testing phase observed values and the neural network
by the Neuralworks II Plus software and the model predictions. ANOVA analysis was
results have clearly shown that the CO and performed to check the fitting between
NO play the most important role in the observed values and ANN prediction. The P-
formulation of Ozone. The Modular ANN value in the ANOVA analysis was found to
gave the R2 = 0.8827 and the RMS be less than 0.01, which indicated a
Error=0.1164 in the training phase, hereas in statistically significant relationship between
the testing process the R2 = 0.742 and the the variables at the 99% confidence level.
RMS Error=0.156. Thus, the paper closed its Author concluded that knowledge of the
discussion with the conclusion that ANN has sequence of past values of air pollutant
proven to be a powerful tool offering a very concentrations is of considerable significance
reliable approach towards ozone in order to predict its future values.
concentration estimation. It can be applied Meteorological conditions at the intended
reliably towards the design of environmental time of prediction also have an important
protection policy and management. effect on air pollution modeling. It was also
shown that the neural network model provides
Tecer, 2007 proposed the application of ANN a good agreement with measured values of air
to predict the concentrations of SO2 and PM pollutants.
at two different stations in Zonguldak city
(Turkey). The 24‑hour SO2 and PM Caselli et al., 2008 presented the prediction of
concentration averages were used as pollutant PM10 concentration utilizing feed-forward
parameters for the period January-December neural network and its comparison with
2002. The input for the model included the multivariate regression model. Data used
meteorological variables provided by the were the meteorological data and data of
Governmental Meteorology Office and SO2 PM10 from San Nicola monitoring station
and PM concentrations pertaining to the day during the period January 2005 to March
the meteorological data was taken. In this 2006. The prediction of PM10 concentration

53

12h and 36h before gave an RMSE of 26% The models were also used to determine
and 30% respectively and the correlation meteorological conditions that most affect
coefficient between the observed and SO2 concentrations. To find the percent
predicted values was 0.7 which showed a contribution of each of the input variables
good accuracy by ANN. Comparison of with respect to the output variables, the
results obtained by neural network with those partitioning method of the connection weights
of multivariate regression for one and two proposed by Garson was used. The method
days forecasting was done and found in involves partitioning the hidden-output
concordance with previous researches till connection weights of each hidden neuron
date. Neural network outweighed multivariate into components associated with each input
regression modeling. neuron. Some of these methods are (1) Partial
Derivatives or ‘PaD’; (2) the “weights method
Abdul-Wahab et al., 2008 developed ANN or the partitioning method used in the current
models for the prediction of ground-level SO2 work; (3) the “perturb” method; (4) the
in the Sultanate of Oman. This study “Profile” method; (5) the “Classical
indicated the potential of the neural network Stepwise”; (6) the “improved stepwise a”; (7)
approach for capturing the non-linear the “improved stepwise”. Using this method,
interactions between SO2 levels and for Model I, it was found that the highest
meteorological variables and for the contributing variable that affects the SO2
identification of the relative importance of concentration level is the wind direction;
these variables. In this work, two ANN while in Model II, wind direction, stability
models were generated with different and wind speed are the highest contributors. It
objectives. The first model (Model I) was was found that the neural network model has
used to predict SO2 levels at certain receptors given superior predictions consistently and
from the Mina Al-Fahal refinery in Oman. the results produced were encouraging.
Second model (Model II) was used for the
prediction of first three maximum SO2 Popentiu et al., 2009 applied the artificial
concentrations and their corresponding neural network approach in order to monitor
locations with respect to the refinery. The the urban pollution level from “Moldocim”
Feed Forward network using the Back- cement plant, in Tasca (Romania). Their work
Propagation (BP) algorithm was used to stressed on the fact that the study of air
develop these two models. The training pollution should be done with regard to the
process was performed using the NeuroShell weather patterns of the local area because the
simulator. The R2 value for the training set pollutant concentration is strongly influenced
was 0.9693. The statistical analysis of these by the movements and characteristics of the
results indicated that the R2 value for the wind into which they are emitted. For this
testing set was 0.9666, the mean squared error purpose, a professional computer aided
was 8.902 and the mean absolute error was modeling and pollution control tool (referred
1.173. Thus, the model prediction was in to as PoLogCem Software- Pollution Logistic
good agreement with the actual results. Cement) was used. Results have clearly
shown the considerable impact of wind speed
and wind direction on pollution distribution.

54

Postolache et al., 2009 proposed a novel concentrations using KBNNs, namely Prior
strategy to assess the indoor air quality for the Knowledge Input (PKI), Source Difference
first time. This work presented a Wi-Fi Model (SDM), and Space-Mapped Neural
indoor-outdoor air quality monitoring Network (SMNN).
network along with the combination of
capabilities of tin oxide sensors using an Author compared the performance of these
advanced sensor data processing based on techniques to conventional interpolation
multilayer perceptron neural networks for an techniques and the MLP3 approach. The
accurate measurement of air quality. Neural SMNN has given the best performance in
network algorithm was implemented in terms of MAE, Fa2, RMSE, and FB; whereas
JavaScript for the calculation of several air the PKI method gave the best performance in
quality values. term of NMSE. Hence, the SMNN method
was found to be more reliable in the scenario
Arjun Akkala et al., 2010 for the first time of predicting missing radon concentrations in
demonstrated the use of ANN approach to Ohio. It could also be seen that all the
model Radon concentrations in Ohio, US. knowledge based approaches performed more
Along with ANN, a new technique named as efficiently as compared to rest of the
knowledge based neural networks (KBNNs) techniques such as RBF and Kriging and also
was used to improve the reliability of the simple MLP3 approach.
estimations from ANN. In the KBNNs used in
this work, knowledge in the form of radon Selvaraj et al., 2010 developed a method for
concentration data and uranium concentration short term prediction of ozone concentration
data is embedded into the neural network using the neural network technique. The study
structure. In the ANN approach, the fact that area Chenbagaramanputhur, a rural place in
radon is formed from uranium by the decay Kanyakumari district was used. The
chain has been considered. And the neural parameters used to develop the model were:
networks employed in this work were 3-layer mean surface ozone concentration as
multi-layer perceptrons. The databases of response, nitrogen dioxide concentration as
Radon concentration were requested from predictor, mean temperature as predictor and
laboratories, university researchers, and prevailing % relative humidity as predictor.
others to compile a unified database Results shown the mean square error of the
consisting of 145,996 measurements for 1492 data during testing: 0.15375 ppb and the
zip code areas in Ohio. It was observed that accuracy of testing data: 99.85 %. The above
the houses and other structures built above results validated the proposed model. Hence it
uranium-bearing rocks or sediments may have is concluded that the given model can be used
higher indoor radon levels. This fact is for predicting surface ozone concentration
corroborated by the uranium concentration with nitrogen dioxide, temperature, % relative
map of Ohio with the radon concentration humidity as predictors.
map. The observation represented knowledge,
which could be used for further improving In a conference Boznar et al., presented their
ANN model accuracies. Three approaches research work to construct a forecasting
were applied for modeling radon and uranium model of ozone concentration values of the

55

following day. The intention behind this it has been used to forecast the 24h average
project was to inform citizens that a day with PM10 concentration in Kozani. The daily
ozone alarm value is coming. Authors forecasts become available through the
concentrated their research on the problem of Laboratory’s website
maximal hourly value of ozone concentration (http://www.airlab.edu.gr). The MNN was
that would appear in the following day. The developed to receive several inputs from air
tools, multilayer perceptron neural network pollution and environment monitoring
and fuzzy logic both were used and all were stations at the center of Kozani and was
implemented in the Matlab software package. trained by the data during the period 1 June
The selected input features were: air 2004 and 31 May 2005 (12 months). The
temperature, global solar radiation, NO, NO2, prediction capacity of the presented system
NOx, CO, O3, prognostic vector wind speed has been demonstrated using observed and
for the day of prediction, sinus of prognostic predicted concentrations collected since it
vector wind direction for the day of became operational as part of the IAQMS.
prediction, prognostic maximal hourly air The study shows a mean absolute error:
temperature for the day of prediction (all three 13.7% on the validation set and 26.3% during
taken from the available measured database). application.
Crucial point was the features and patterns
used. Both models tested used the same input Conclusion
features and the same learning patterns set.
When the models were constructed they were The above studies and discussion provide us
tested on the same independent verification an understanding of artificial neural network
set of patterns. Although, the verification set (ANN) technique in air pollution modelling.
was relatively small, still MPNN and FL have ANN models are computer programs that are
given satisfactory results. So, the author designed to emulate human knowledge
concluded that both the models are good processing, speech, prediction, classification
enough to use the model for informing and control. ANN is a cellular information
citizens about possibilities of high and alarm processing system designed and developed on
concentrations. Nunnari et al., 2004 also the basis of perceived notion of the human
tested the use of fuzzy logic and NN for SO2 brain and its neural system. This concept of
prediction and came up with the same results. artificial neural networks was established in
1943 (McCulloch and Pitts, 1943). Later, in
Kapegeridis et al., presented their work on the 1958, the first practical artificial neural
development of a modular neural network network was presented: the perceptron
(MNN) based on RBF networks for the (Rosenblatt, 1958). In the last two decades,
prediction of 24h average PM10 ANNs have been already explored in various
concentrations for the next day at the city fields like chemical research (Kvasnicka,
center of Kozani. The developed system was 1990; Wythoff et al., 1990; Smits et al.,
converted to a computer code (a DLL 1992), physics research (Dekruger and Hunt,
application extension) that was integrated 1994), molecular biology, ecology and
with the Integrated Air Quality Management environmental sciences and demonstrated
System (IAQMS) in August 2007. Since then, remarkable success.

56

In environmental studies, air pollution is an the prediction of sulphur dioxide (SO2). Since
extremely significant issue that should be then several authors have developed different
focused on all around the globe as it affects neural network-based models to forecast air
human health, ecosystems and the pollutant concentrations.
environment, destroys aesthetical conditions,
gives harm to matters, buildings, and in turn Easiest in implementation, multilayer
economy. It is defined as a condition, in perceptron (MLP) is the most used ANN
which the substances that result from both method. Almost in all the findings where the
natural and anthropogenic activities are comparison between ANN and regression
present in the air at concentrations sufficiently method comes, the former takes the
high above their normal ambient levels superiority level, mainly because of its nature
producing considerable impacts on humans, to deal with non-linear relationships between
animals, vegetation, or materials. the predictors, the more flexibility and the
ability to make efficient use of proxy data
Literature survey shows a lot of methods for when the optimum predictor variables are
the prediction of air quality ranging from unavailable and the better predictions than
numerical, mathematical and statistical those given by other methods e.g. multivariate
methods (e.g., regression) to techniques based regression models, statistical linear models,
on artificial intelligence, particularly ANNs. etc. But, the main advantages of neural
All the meteorological variables and factors networks (NN) are that air pollution
have a non-linear relationship with air quality, concentration can be predicted with time
which can be accurately captured by non- series and basic meteorological variables.
linear models such as ANNs and Support This enables the models to be easily
Vector Machines. constructed. MLPs perform very well with
proxy data and are successful with both
The facts and the quality of results provided secondary (ozone) and primary pollutants.
by ANNs make them more attractive to apply Secondary pollutant like Ozone, its formation
than other models. The advantages of these is a complex, non-linear process that neural
models are that they do not require very networks are able to capture more accurately
exhaustive information about air pollutants, without many of the usual limiting
reaction mechanisms, meteorological assumptions of other statistical methods. In
parameters or traffic flow and that they have the case, a pollutant shows a serial correlation
the ability of allowing nonlinear relationships i.e. present day concentration of a pollutant
between very different predictor variables. depends on previous day’s concentration,
Due to the growing development of regression model fails because it assumes that
computer-aided analysis, its easy accessibility observations are statistically independent
to all researchers has also facilitated the events. Although the inclusion of lagged data
application of ANNs in air pollution in regression modeling has shown an
modeling. improvement in the accuracy of regression
predictions, still ANN serves a much better
Boznar et al. (1993) presented the earliest alternative. The performance of MLP found
paper based on the use of neural networks for superior to conventional statistical and

57

stochastic methods in continuous flow series in the model construction because the
forecasting. selection of proper information with which to
train the neural network is crucial for model’s
Instead of all the rewards, there are efficiency. Feature determination strategies
limitations with implementation of ANN. seek for the most important and significant
Some shortfalls faced when using multilayer independent variables that describe the
perceptron (MLP/FFBP) can be listed as the studied process. Pattern selection strategies
dependency on different random weight construct the group of training samples in
assignments in the beginning of each training, such a way that the model can learn the non-
their tendency to trap by the local minima, linear function behavior over the whole
network overfitting, and curse of domain. Both strategies in the ideal case
dimensionality and black box nature of NN. should ensure that the whole domain of the
Performance of a network also depends on the studied function is uniformly and densely
choice of network architecture, the number of enough covered with the training samples.
layers and nodes in those layers. There are no Kohonen neural network based and the expert
rules to help in this process and so, the knowledge based pattern selection strategies
number of input and output nodes is have shown significant improvement in
determined by the complexity of problem at comparison to the model trained with
hand. Different number of a particular layer unselected patterns. Approaches like Self
neurons need to be tried in a particular study organizing map (SOM) and two-segmented
and the number that gives a minimum mean network are able to remove black-box nature
square error (MSE) are chosen for the final. of neural networks.
This process increases the total duration of
NN functioning. The different possibilities of using ANN
variants in air pollution modeling includes
The reliability factor that ANNs are gaining in Radial basis function (RBF), Change point
the field of prediction, also draws more detection model with RNM (CPDM),
concentration for its advancement. Research Sequential network construction model
shows the testing of different combinations of (SNCM), Self organizing feature maps
ANN and other algorithms which can include (SOFM), Group methods of data handling
Genetic algorithm (GA) and Principal (GMDH), TD-CMAC (an extension to the
component analysis (PCA). GA technique is conventional Cerebellar Model Arithmetic
for the search of feasible high-level Computer), AWNN etc. Among the above
architectures of neural network and PCA ANN methods, RBF, SOFM and TD-CMAC
works in the reduction of the dimensionality came up as the best tools. RBF networks
of input data and its orthogonalization. PCA possess the property of best approximation
and RBF combination executes prediction and shows a clear improvement in the
task with much accuracy than the learning efficiency. A new algorithm,
conventional models. adaptive wavelet neural network (AWNN) in
which wavelet neural network (WNN) is a
Feature determination and pattern selection kind of RBF possesses many advantages than
strategies considers as the most important step the general networks, such as faster

58

convergence, avoiding local minimum, easy Athanasiadis, I.N., Karatzas, K.D. and
decision and adaptation of structure, so acts as Mitkas, P.A. (2006). Classification
an efficient tool for the prediction of air techniques for air quality forecasting.
pollutants concentration along with the ability In the fifth workshop on binding
of avoiding overfit during the training. environmental sciences and artificial
intelligence, 17th European conference
Consequently, neural network technique is on artificial intelligence, pp. 41-47.
regarded as a reliable and cost-effective
method for the task of prediction. In an Barai, S.V., Dikshit, A.K., Sharma, S.: Neural
operational setting for any specific place, Network Models for Air Quality
neural network models could be improved Prediction: A Comparative Study.
through experimentation and fine-tuning.
Likely strategies include using additional Basurko, E.A., Berastegi, G.I. and Madariaga,
variables (e.g., extra weather elements, I. (2006). Regression and multilayer
synoptic patterns, traffic flow or day-of-the- perceptron-based models to forecast
week information) or changing the nature of hourly O3 and NO2 levels in the
the model (e.g., changing the number of Bilbao area. Environmental Modelling
hidden nodes and layers in neural networks). & Software, 21, pp. 430–446.
Use of Fuzzy modeling independently and
with ANN is also showing brighter side of Bianchini, M., Iorio, E.D., Maggini, M.,
soft-computing application in air pollution Mocenni, C., Pucci, A. (2006). A
modeling. Cyclostationary Neural Network
Model for the Prediction of the NO2
References Concentration. In the proceedings of
European Symposium on Artificial
Abbaspour, M., Rahmani, A.M. and Neural Networks, pp. 67-72.
Teshnehlab, M. (2005). Carbon
monoxide prediction using novel Boger, Z. (2006). Artificial Neural Networks
intelligent network. International Modeling to Reduce Industrial Air
Journal of Environmental Science & Pollution.
Technology, 1, pp. 257-264.
Boznar, M.Z., Mlakar, P. and Grasic, B.
Adame, J.M.B., Januchs, M.G.C., Andina, D. Neural networks based ozone
and Corona, A.V. (2009). Monitoring forecasting. 9th Int. Conf. on
strategy to assessment the air pollution Harmonisation within Atmospheric
level in Salamanca (México). Dispersion Modelling for Regulatory
Purposes. Pp. 356-360.
Ahmad, A.L., Azid, I.A., Yusof, A.R.,
Seetharamu, K.N. (2004). Emission Boznar, M., Lesjak, M., and Mlakar, P.
control in palm oil mills using (1993). A neural network-based
artificial neural network and genetic method for short-term predictions of
algorithm. Computers and Chemical ambient So2 concentrations in highly
Engineering, 28, pp. 2709–2715. polluted industrial areas of complex
59

terrain. Atmospheric Environment, Ozone Forecasting. Air & Waste

Vol. 27B, pp. 221-230. Management Association, 47, pp. 653-
663.
Božnar, M.Z., and Primož Mlakar, P. (2001).
Artificial neural network-based Dorling, S., Cawley, G., Kukkonen, J. and
environmental models. Air Pollution Karppinen, A. (2003). Air quality
Modeling and Its Application XIV, pp. forecasting using neural network
483-492. approaches. Air quality forecasting
meeting, Culham.
Boznar, M.Z. and Mlakar, P. (2002). Use of
neural networks in the field of air Elminir, H.K. and Galil, H.A. (2006).
pollution modelling. Air Pollution Estimation of air pollutant
Modeling and Its Application XV, pp. concentrations from meteorological
375-383. parameters using artificial neural
network. Journal of Electrical
Caselli, M., Trizio, L., Gennaro, G.D. and Engineering, 57, pp. 105–110.
Ielpo, P. (2008). PM10 Forecasting by
a Feed Forward Neural Network and Fabbian, D. and Dear, R.D. (2007).
Comparison with Multivariate Linear Application of Artificial Neural
Regression Model. Network Forecasts to Predict Fog at
Canberra International Airport.
Chelani, A.B., Raoi, C.V., Phadke, K.M. and Weather and forecasting, 22, pp. 372-
Hasan, M.Z. (2002). Prediction of 381.
sulfur dioxide concentration using
artificial neural networks. Gardner, M. W. and Dorling, S. R. (1998):
Environmental Modelling & Software, Artificial neural networks (the
17, pp. 161-168. multilayer perceptron) – a review of
applications in the atmospheric
Cigizoglu, H.K., Alp, K. and Kömürcü, M. sciences. Atmospheric Environment,
(2005). Estimation of air pollution Vol. 32, No. 14/15, pp. 2627-2636.
parameters using artificial neural
networks. Advances in Air Pollution Gardner, M.W., Dorling, S.R. (1999): Neural
Modeling for Environmental Security, network modelling and prediction of
pp. 63–75. hourly NOx and NO2 concentrations in
urban air in London. Atmospheric
Cogliani, E. (2001). Air pollution forecast in Environment, 33, pp. 709-719.
cities by an air pollution index highly
correlated with meteorological Giorgio, F. and Piero, M. (1996).
variables, Atm. Env., 35, pp. 2871- Mathematical models for planning and
2877. control-ling air quality. Proceedings of
IIASA Workshop, 17.
Comrie, A.C. (1997). Comparing Neural
Networks and Regression Models for

60

Hadjiiski, L., and Hopke, P. (2000). Feedback Loop and their Application
Application of artificial neural to the Identification of Large-spatial
networks to modeling and prediction Air Pollution Patterns. In the
of ambient ozone concentrations. Proceedings of 39th SICE Annual
Journal of Air and Waste Management Conference International Session
Association, 50, 894-901. Papers, 112A-4, pp. 1-6.
Hall, T., Brooks, H.E. and Doswell, C.A. Lek, S., Guegan, J.F. (1999): Artificial neural
(1999). Precipitation forecasting using networks as a tool in ecological
a neural network. Whether modeling, an introduction. Ecological
Forecasting, 14, pp. 338–345. Modelling, 120, pp. 65–73.
Iliadis, L.S., Spartalis, S.I., Paschalidou, A.K. Lippmann, R. P. (1987). An introduction to

and Kassomenos, P. (2007). Artificial computing with neural nets. IEEE
Neural Network Modelling of the Acoustics, Speech Signal Process.
surface Ozone Concentration. Magazine, 4, pp. 4–22.
International Journal of Computational
and Applied Mathematics, 2, pp. 125– Lira, T.S., Barrozo, M.A.S. and Assis, A.J.
138. (2007). Air quality prediction in
Uberlândia, Brazil, using linear
Jef, H., Clemens, M., Gerwin, D., Frans, F. models and neural networks. 17th
and Olivier, B. (2005). A neural European Symposium on Computer
network forecast for daily average Aided Process Engineering –
PM10 concentrations in Belgium. ESCAPE17, pp 1-6.
Atmospheric Environment, 39, pp.
3279-3289. Lu, W.Z., Wang, W.J., Fan, H.Y., Leung,
A.Y.T, Xu, Z.B., Lo, S.M. and Wong,
Kapageridis, I., Evagelopoulos, V. and J.C.K. (2002). Prediction of Pollutant
Triantafyllou, A. Prediction of PM10 Levels in Causeway Bay Area of
Concentrations using a Modular Hong Kong Using an Improved
Neural Network System and Neural Network Model. Journal of
Integration with an Online Air Quality Environmental Engineering, 128, pp.
Management System. 1146–1157.
Kolehmainen, M., Martikainen, H., Lu, W.Z., Wang, W.J., Wang, X.K., Yan,
Ruuskanen, J. (2001). Neural S.H. and Lam, J.C. (2004). Potential
networks and periodic components assessment of a neural network model
used in air quality forecasting. with PCA/RBF approach for
Atmospheric Environment, 35, pp. forecasting pollutant trends in Mong
815-825. [15] Kok urban air, Hong Kong.
Environmental Research, 96, pp. 79–
Kondo, T., and Pandya, A.S. (2000): GMDH- 87.
type Neural Networks with a
61

Marzban, C. (2004). The ROC curve and the Environmental Modelling & Software,
area under it as a performance 19, pp. 887-905.
measure. Weather Forecasting, 19, pp.
1106–1114. Oprea, M. and Matei, A. Applying Artificial
Neural Networks in Environmental
Melas, D., Kioutsioukis, I., and Ziomas, I.C. Prediction Systems. Recent advances
(2000). Neural network models for in automation and information, pp.
predicting peak photochemical 110-115.
pollutant levels. Journal of Air and
Waste Management Association, 50, Osródka, L., Wojtylak, M. and Krajny, E.
495-501. Forecasting of high-level air pollution
in urban-industrial agglomeration by
Morabito, F.C. (2002). Wavelet neural means of numerical weather
network processing of urban air forecasting.
pollution. pp. 1-6.
Ozcan, H.K., Ucan, O.N., Sahin, U., Borat,
Nagendra, S.M.S. and Khare, M. (2005). M. and Bayat, C. (2006). Artificial
Modelling urban air quality using neural network modeling of methane
artificial neural network. Clean emissions at Istanbul Kemerburgaz-
Technology Environment Policy, 7, Odayeri landfill site. Journal of
pp. 116–126. Scientific & Industrial Research, 65,
pp. 128-134.
Nikov, A., Karaca, F., Alagha, O., Kurt, A.,
Hakkoymaz, H. (2005): AirPolTool: A Parker, D.B. (1982). Learning logic.
web-based tool for Istanbul air Invention report S81-64, File 1, Office
pollution forecasting and control, to be of Technology Licensing, Stanford
published in Proc. of 3rd International University.
Symposium on Air Quality
Management at Urban. Pasero, E. and Mesin, L. 2006. Artificial
Neural Networks to Forecast Air
Niska, H., Hiltunen, T., Karppinen, A., Pollution. pp. 221-240.
Ruuskanen, J., Mikko Kolehmainen,
M. (2004). Evolving the neural Perez, P. (2001): Prediction of sulfur dioxide
network model for forecasting air concentrations at a site near downtown
pollution time series. Engineering Santiago, Chile. Atmospheric
Applications of Artificial Intelligence, Environment, 35, pp. 4929-4935.
17, pp. 159–167.
Perez, P. and Reyes, J. (2001): Prediction of
Nunnari, G., Dorling, S., Schlink, U., Cawley, Particulate Air Pollution using Neural
G., Foxall, R. and Chatterton, T. Techniques. Neural Computing &
(2004). Modelling SO2 concentration Applications, 10, pp. 165–171.
at a point with statistical approaches.

62

Popentiu, F., Lungu, A. and Creosteanu, R. using artificial neural network

(2009). Modelling wind speed with technique: A review. Journal of
neural networks for monitoring the air Scientific & Industrial Research, 64,
pollution. Fascicula de Energetică, 15, pp. 637-647.
pp. 280-283.
Shi, J. P. and R.M. Harrison (1997):
Postolache, O.A., Pereira, J.M.D. and Girão, Regression modelling of hourly NOx
P.M.B.S. (2009). Smart Sensors and NO2 concentrations in urban air in
Network for Air Quality Monitoring London. Atmospheric Environment,
Applications. IEEE transactions on 31, pp. 4081-4094.
instrumentation and measurement, 58,
pp. 3253-3262. Spellman, G. (1999). An application of
artificial neural networks to the
Rao, M.N., and Rao, H.V. (2000). Air prediction of surface ozone
Pollution. Tata McGraw-Hill, New concentrations in the United Kingdom,
Delhi. Application Geography, 19, pp. 123-
136.
Rummelhart, D. E., and McClelland, J.L.
(1986). Parallel Distributed Tecer, L.H. (2007). Prediction of SO2 and PM
Processing. The MIT Press, 1. Concentrations in a Coastal Mining
Area (Zonguldak, Turkey) Using an
Sahin, U., Ucan, O.N., Soyhan, B. and Bayat, Artificial Neural Network. Polish
C. (2004). Modeling of CO Journal of Environmental Studies, 16,
distribution in Istanbul using artificial pp. 633-638.
neural networks. Fresen Environ Bull,
13, pp. 839-845. Tirabassi, T., Pelliccioni, A. and Garazzo, C.
(2003). Coupling of neural network
San, Y. and Zhang, Z. (2004). Adaptive and dispersion models: A novel
wavelet neural network for prediction methodology for air pollution models,
of hourly NO and NO2 concentrations. Journal of Environmental Pollution,
Proceedings of the 2004 Winter 20, pp. 136-146.
Simulation Conference, pp. 1-9.
Tresp, V., R. Neuneier, and S. Ahmad, 1995:
Selvaraj, R.S., Elampari, K., Gayathri, R. and Efficient methods for dealing with
Jeyakumar, S.J. (2010). A neural missing data in supervised learning.
network model for short term Advances in Neural Information
prediction os surface ozone at tropical Processing Systems 7, G. Tesauro, D.
city. International Journal of S. Touretzky, and T. K. Leen, Eds.,
Engineering Science and Technology, The MIT Press, 689–696.
2, pp. 5306-5312.
Vaziri, M. (2001): Modeling vehicle air
Sharma, N., Chaudhry, K.K. and Rao, C.V.C. pollution attributes by artificial neural
(2005). Vehicular pollution modeling

63

network. Scientia Iranica, 8, pp. 10- Journal of Environmental Sciences, 4,

15. pp. 473-481.
Votti, P. and Genova, P. D. (2002). Webos, P. (1974). Beyond regression: new

Atmospheric urban pollution: tools for prediction and analysis in the
applications of artificial neural behavioral sciences. Thesis, Harvard
network (ANN) to the city of Perugia, University.
Ecological Modeling, 148, pp. 27-46.
Werner, H., Obach, M. (2001): New neural
Wahab-Abdul, S.A. and Al-Alawi, S. M. network types estimating the accuracy
(2002). Assessment and prediction of of response for ecological modeling.
tropospheric ozone concentration Ecological Modelling, 146, pp. 289–
levels using artificial neural networks, 298.
Environ Model Softw, 17, pp. 219-
228. Yi, J. and Prybutok, R. (1996). A neural
network model forecasting for
Wahab, S.A.A. and Alawi, S.M.A. (2008). prediction of daily maximum ozone
Prediction of Sulfur Dioxide (SO2) concentration in an industrialised
Concentration Levels from the Mina urban area. Environmental Pollution,
Al-Fahal Refinery in Oman Using 92, pp. 349-357.
Artificial Neural Networks. American

64

Artificial Neural Network Applications in Air Quality Monitoring

Transféré par

Informations du document

Copyright

Formats disponibles

Partager ce document

Partager ou intégrer le document

Options de partage

Avez-vous trouvé ce document utile ?

Ce contenu est-il inapproprié ?

Droits d'auteur :

Formats disponibles

Artificial Neural Network Applications in Air Quality Monitoring

Transféré par

Droits d'auteur :

Formats disponibles

International

Artificial neural network applications in air quality monitoring and

Prachi1, Kumar Nishant2 and Matta, Gagan3

With the loss of biodiversity on an areas, especially in densely populated and

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Figure 1: A biological neuron

Artificial Neural Network Applications in Air Quality Monitoring and Management

Figure 2:2 A) Sketch hy representtation of an artificial neeuron. B)

Figurre 3: Taxxonomy off neural

Neural Network Topologies 2. Recurrent Neural Network

1. Feedforward Network 3. Radial Basis Function (RBF) Network

MLP feed-forward network is referred to as a A radial basis function network is an artificial

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

Artificial Neural Network Applications in Air Quality Monitoring and Management

terrain. Atmospheric Environment, Ozone Forecasting. Air & Waste

Artificial Neural Network Applications in Air Quality Monitoring and Management

Iliadis, L.S., Spartalis, S.I., Paschalidou, A.K. Lippmann, R. P. (1987). An introduction to

Artificial Neural Network Applications in Air Quality Monitoring and Management

Popentiu, F., Lungu, A. and Creosteanu, R. using artificial neural network

Artificial Neural Network Applications in Air Quality Monitoring and Management

network. Scientia Iranica, 8, pp. 10- Journal of Environmental Sciences, 4,

Votti, P. and Genova, P. D. (2002). Webos, P. (1974). Beyond regression: new

Artificial Neural Network Applications in Air Quality Monitoring and Management

Vous aimerez peut-être aussi