Académique Documents
Professionnel Documents
Culture Documents
5, MAY 2007
1303
I. INTRODUCTION
1304
(1)
is the smoothed image, at the spatial position
where
given by the coordinates vector
, at scale , defined on
, with boundary
. The remaining terms in
the domain
, which is a nonlinear
(1) are the diffusion coefficient
is the original, noisy
function of the image gradient
image and
is the derivative along the normal to
that defines the boundary conditions of the PDE.
By a convenient selection of the diffusion coefficient in (1),
the intensity of the image is allowed to diffuse within the image
structures, eliminating, thus, the intraobject variability, while
preventing diffusion on the edges, characterized by a high
intensity gradient. However, as Alvarez et al. showed [14],
[17], [18], the diffusion coefficients proposed by PeronaMalik
lead to ill posedness of (1). Another shortcoming of the
PeronaMalik equation is that noise on the edges may be
amplified by backward diffusion. Alvarez et al. [17] showed
that the PeronaMalik equation can be made well posed,
by smoothing isotropically the image, before computing the
image gradient used by the diffusion coefficient. Equation (2)
corresponds to the regularized version of the PeronaMalik
PDE, where, for simplicity, we have dropped the dependence
on the spatial coordinates, , and time , and
is a smoothed version of obtained by convolving the image
of variance . With the
with a zero-mean Gaussian kernel
same boundary conditions as (1), the regularized PeronaMalik
equation is given by
(2)
In our computations, we use the nonlinear diffusion coefficient
proposed by Weickert [19]
(3)
, and
D. Explicit Scheme
For a 2-D scalar image with
posed as
1305
the Thomas algorithm [19]. Additionally, we use several preconditioning techniques to accelerate the convergence of the conjugated gradient method, which is the optimum iterative method
, provided that
to solve large sparse linear systems,
is symmetric positive definite, which is regularly the case of discretized PDEs.
1) Additive Operator Splitting (AOS) Method: AOS approximates the solution of (7) as [19]
(4)
Let us call
, the number of pixels in the image along the
axis and
the number of pixels along the axis. Numbering
the pixels of the image in major column format, the explicit
discretization of (4), in matrix-vector notation, is given by [19]
(5)
, being
the discretization of time
where
and
the discretization of the spatial coordinates,
is
, corresponding to the image (taken in
a vector of length
and
are both matrices
major column format) at scale
, being the identity matrix and
of size
the matrix of diffusion coefficients at scale
given by
(8)
. Since,
and
where
are both tridiagonal matrices, and can be obtained in linear
time using the Thomas algorithm.
2) Alternating Direction Implicit (ADI) Methods: Here, we
will consider, the three most widely used ADI methods [20],
[21]: Locally one-dimensional (LOD), DouglasRachford, and
PeacemanRachford. The simplest approximation to (7) is
given by ADI-LOD
(9)
The DouglasRachford method solves (7) as
(10)
(6)
In (6),
and
are, respectively, the image intensity and difand scale
fusion coefficient at coordinates
with
being the indices of the image, in
major column format. Here, we make the usual assumption that
so that
is the scale step.
E. Semi-Implicit Schemes
From consistency and stability considerations [19], [20], the
, which
explicit scheme indicated in (5) requires that
constitutes a severe limitation on the step size. Alternatively, we
can use semi-implicit discretization schemes, given by [19]
(7)
and are defined as before. Semi-implicit schemes
where
are unconditionally stable for all values of [19], [20]. Howequaever, (7) requires us to solve a linear system with
tions and unknowns, at each iteration step. The extra computational cost required to update the solution is compensated by
the numerical stability of semi-implicit schemes that allow us to
choose much larger scale steps, limited only by the accuracy of
the computed solution.
We consider here the AOS and ADI semi-implicit methods
that decompose (7) as a sum (AOS) or a product (ADI) of two
tridiagonal systems, which can be solved, in linear time, using
AOS and the ADI schemes considered until now are only
first order accurate in scale. A scheme that is second order
accurate in scale, for the isotropic diffusion equation, is the
PeacemanRachford scheme given in (11) [20]. However, this
scheme does not achieve second order accuracy in scale when
it is used on the nonlinear anisotropic diffusion equation, since
the diffusion coefficients are computed at the previous step
[21], tough, better accuracies can be expected using this scheme
is close to , i.e., at small scale steps
if
(11)
3) Preconditioned Conjugated Gradient (PCG): The conjugated gradient (CG) can be considered as an acceleration of
steepest descent to solve the linear system
, when
is symmetric positive definite [20]. The basic idea of precondiby [22]
tioning is to replace the system
(12)
where matrix
is called the preconditioner of and
is a matrix with better condition number than , such that the
conjugated gradient method converges faster, and the operation
must be performed fast, for any vector .
1306
(13)
being a lower triangular matrix and
. For any vector , the product
is equivalent
as
to solve the system
where
(14)
(16)
can be computed in linear time using forward
where
and backward substitution, since
and
are lower
and upper tridiagonal triangular matrices, respectively.
Another preconditioner commonly used in practice is the inas the
complete Cholesky factorization that approximates
product
, where
and is the error in the
approximation. In our work, we use the incomplete Cholesky
factorization with 0 drop tolerance as indicated in [23], which
means that has the same sparcity pattern as the lower triangular part of .
AOS and ADI methods provide also an approximation to matrix , as given by (9)(11); hence, they can be used also as
preconditioners. In particular, ADI is usually run with a fixed
number of times as a preconditioning step as in [24].
III. EXTENSION TO HYPERSPECTRAL IMAGERY
A hyperspectral image is an especial case of multispectral images in the sense that we have now hundreds of bands, instead
of tents of bands as it is usual in multispectral images, providing
much more information about the physical nature of the underlying substrate.
The first problem one face trying to extend the methods used
in computer vision for grayscale image processing to vectorvalued images is to extend the concept of gradient. The first
formal treatment of gradient in vector-valued images is due to
Di Zenzo in 1986 [25].
be a vector-valued image, with
Let
components
. Hence, the
first fundamental form in differential geometry is given by [26]
(18)
(15)
For a unit vector
is a measure of the
rate of change in the image on the direction. The extrema
are obtained by the eigenvalues of the matrix
of
in the directions given by the eigenvectors. Let
be, respectively, the maximum and minimum values of the rate of
and
the respective directions of maximal
change in
(19)
1307
(22)
(23)
1308
Fig. 2. Superimposed spectra showing the spectral variability within each object and background on the synthetic image.
Fig. 1. Grayscale (spectral mean) representation of: (a) Indian Pines, (b) Synthetic, (c) Cuprite, and (d) Noisy False Leaves hyperspectral images.
The synthetic image is used to quantify the numerical performance in terms of the accuracy achieved by the different semiimplicit methods implemented, as the scale step increases, relative to a reference image generated using a very small scale step
and the semi-implicit CrankNicholson scheme
[20], which is a second order accurate scheme, both in scale and
space.
that preserves the edges in the
The highest value of
synthetic image, while reduce most of the internal variability
within the image objects is 0.015. Otherwise, the accuracy
of the explicit scheme at its maximum possible step size,
and the accuracy of AOS, ADI and PCG semi-imand
were all
plicit schemes for
compared to the reference image. We perform 1000 iterations
of the CrankNicholson scheme so that the real evolution in
; hence, the explicit
scale of the PDE is
should be run
times and the
scheme using
100, 20, 10, 5,
semi-implicit schemes should be run
and 2 times.
The best values for in the PCG-SSOR scheme were found,
simply by sweeping in the 0.01 to 2.0 range at intervals of
0.1. The values of found by this mean were 0.5, 0.4, 0.3, 0.15,
and 0.05 for
and
, respectively,
and they also correspond to the best values for the synthetic
and real hyperspectral images used. Finally, AOS and ADI-LOD
schemes used as preconditioners were implemented as indicated
1, 0.5,
in (27) and (28), where best results were found using
and
,
0.25, 0.125, and 0.025 for
respectively.
Fig. 2 shows the synthetic image and the spectral variability
within each image object and background, obtained by superposition of the spectrums of each pixel within each image region.
Fig. 3 shows the strong reduction on the spectral variability
within each image region, after nonlinear diffusion, while preserving the edges. Table I indicates the reduction on the variance within each image region. Fig. 4 shows the classification
map using the spectral angle mapper (SAM) in Multispec and
all image bands available.
Fig. 3. Superimposed spectra showing the spectral variability within each object and background on the smoothed synthetic image.
1309
Fig. 5. Square error on the computed solution of each algorithm, relative to the
CrankNicholson scheme.
TABLE I
REDUCTION IN THE SPATIAL/SPECTRAL VARIABILITY
Fig. 4. Classification of the synthetic image using SAM on (a) original and
(b) smoothed images.
1310
Fig. 6. Indian Pines image: (a) Ground truth and (b) training and testing samples (RGB shown corresponds to bands 29, 15, and 12).
Fig. 8. Training and testing samples on (a) Cuprite image (RGB corresponds
to bands 183, 193, and 207) and (b) False Leaves image (RGB corresponds to
bands 90, 68, and 29).
cedony, Na-Montmorillonite, Chlorite and Muscovite or Montmorillonite, High-Al Muscovite, and Med-Al Muscovite. We
consider the different kinds of Alunites as a single class, given
that it is extremely difficult to obtain pure training and testing
1311
TABLE II
CLASSIFICATION ACCURACIES, INDIAN PINES IMAGE
TABLE III
CLASSIFICATION ACCURACIES, CUPRITE IMAGE
very unstable when the region of the training samples is too uniform. This effect affects more the Cuprite and False Leaves im0.015) than in the
ages given that the smoothing is higher (
0.012) and also because
case of the Indian Pines image (
these images have more bands.
On the other hand, the FLL classifier benefits from the reduction in the variability within the image classes [31], and, hence,
it has the highest classification accuracies on all the images.
ECHO classifier is based in Multispec on either a quadratic
or Fisher linear spectral-spatial algorithm. The results indicated
on Tables IIIV for ECHO correspond to the maximum value
between the two possible classifiers, which was almost always
FLL for the smoothed images and quadratic for the original images. In general, ECHO is just a little superior than FLL in classification accuracy. The difference between ECHO and FLL reduces as the smoothing increases, as can be appreciated on Ta0.015, meanwhile the difference
bles III and IV, where
0.012. This is
is higher in the Indian Pines image, where
due to the fact that ECHO tries to homogenize the image before classifying it, by choosing a small window (2 2 pixels in
our simulations). Hence, if the region within the objects is already smooth, due to diffusion, the difference between ECHO
and FLL is reduced.
The remaining classifiers, ED, SAM, and MF are, in general,
very insensitive to the scale step, but, in general, they do not
achieve good classification accuracies, except for the SAM classifier on the Cuprite image. The relative good performance of
SAM on this image agrees with the reported studies on mineral classification using the spectral angle and a high number of
bands [33].
In terms of the implemented numerical methods, AOS and
achieving high speedups and
ADI are very insensitive to
on all the images
classification accuracies up to
analyzed here. On the other hand, the DouglasRachford and
PeacemanRachford methods are sensitive to the scale step,
,
achieving high classification accuracy only up to
which limits their speedup. Despite of this limitation, these
schemes are of higher accuracy than AOS and ADI and they
achieved the highest classification accuracies on all the images.
PCG methods are very insensitive to the scale step and all
behave similarly in terms of classification accuracy. The best
classification accuracies and speed-ups are achieved by PCGCholesky initialized by ADI-LOD. These results are similar to
the ones obtained in terms of the accuracy of the computed solution (Fig. 4). This means that the accuracy of the computed
solution affects the classification accuracy in the case of ML,
FLL, and ECHO classifiers.
It is noteworthy, though, that AOS has a better performance
than the expected from Fig. 4, for the Indian Pines image. We
believe that this occurs because K is small here; hence, the magnitude of the error is lower. AOS is also symmetric, and, hence,
the error introduced can be reduced by a classifier as ECHO,
which tends to average out random variations in a small window.
It is also fortunate that the Indian Pines image consists of large
1312
TABLE IV
CLASSIFICATION ACCURACIES, FALSE LEAVES IMAGE
Fig. 11. Fake Leaves classification map: (a) noisy; (b) smoothed.
V. CONCLUSION
PDE-based methods for image enhancement, segmentation, and restoration have a large history of success for scalar
and color images in computer vision, but it has been disregarded in segmentation and classification of hyperspectral
imagery. Recently, Lennon et al. [34], [35] implemented the
PeronaMalik nonlinear diffusion equation to smooth a hyperspectral image and classify it using support vector machines.
However, they used the original, unregularized explicit scheme
of PeronaMalik, given in (1) and used only 17 bands.
This work shows that PDE-based image processing methods
can improve significantly image enhancing, segmentation, and
classification in hyperspectral imagery at a low computational
cost, using semi-implicit schemes. Traditional statistical classification methods are very robust at low-dimensional spaces, but
they require an enormous amount of data at higher dimensional
spaces, as is the case of hyperspectral imagery. Otherwise, parabolic PDEs offer a well-sounded, common framework to perform image smoothing, restoration and object-based segmentation and classification, with accuracy and highly parallelizable discretizations that can speedup PDE image processing in
high-dimensional spaces.
In particular, this work shows that nonlinear diffusion can enhance significantly image classification accuracies by reducing
both, the spatial and spectral variability in hyperspectral imagery. AOS and ADI semi-implicit schemes offer high performance in terms of accuracy and speedup of the computed solution of the nonlinear PDE, when the complexity of the image
is not high in terms of highly variability within the image objects. When the complexity of the image increases, more accurate methods such as the Douglas and Peaceman schemes and
PCG methods can achieve accuracies and speedups superior to
the less accurate AOS and ADI-LOD methods, justifying their
higher computational cost.
PCG linear solvers are less sensitive to the scale step as the
approximated ADI and AOS schemes, which mean that higher
can be used. However, PCG methods also require
values of
more space, and finding a good preconditioner is still an art as
we could corroborate here. In fact, PCG-methods also depend
on the size of the image, making it difficult to generalize them
to all image sizes. Even though image complexity can reduce
sensibly the speedup that can be achieved with the numerical
methods presented here; we achieved significant speedups of
10 and higher on all the images used, over the explicit scheme,
which justifies their use in hyperspectral imagery.
REFERENCES
[1] G. Wilkinson, Are remotely sensed image classification techniques
improving? Results of a long term trend analysis, presented at the
IEEE Workshop on Advances in Techniques for Analysis of Remotely
Sensed Data, Oct. 2728, 2003.
[2] D. A. Landgrebe, Hyperspectral image data analysis, IEEE Signal
Process. Mag., vol. 19, no. 1, pp. 1728, Jan. 2002.
[3] R. L. Ketting and D. A. Landgrebe, Classification of multispectral
image data by extraction and classification of homogeneous objects,
IEEE Trans. Geosci. Electron., vol. GE-14, no. 1, pp. 1926, Jan.
1976.
[4] A. Marangoz, M. Oruc, and G. Buyuksalih, Object-oriented image
analysis and semantic network for extracting the roads and buildings
from IKONOS pan-sharpened images, Int. Arch. Photogramm. Remote Sens., vol. 35, no. B3, 2004.
[5] W. Tadesse, T. L. Coleman, and T. D. Tsegaye, Improvement of land
use and land cover classification of an urban area using image segmentation from Landsat ETM+ data, presented at the Proc. 30th Int. Symp.
Remote Sensing of the Environment, Honolulu, HI, 2003, PS I-35.
[6] J. Schiewe, L. Tufte, and M. Ehlers, Potential and problems of multiscale segmentation methods in remote sensing, GeoBIT/GIS J. Spatial
Inf. Decision Making, vol. 6, pp. 3439, 2001.
[7] U. C. Benz, P. Hofmann, G. Willhauck, I. Lingenfelder, and M.
Heynen, Multi-resolution, object-oriented fuzzy analysis of remote
sensing data for GIS-ready information, J. Photogramm. Remote
Sens., vol. 58, pp. 239258, Jan. 2004.
[8] T. Sziranyi, I. Kopilovic, and B. P. Toth, Anisotropic diffusion as a
preprocessing step for efficient image compression, in Proc. 14th Int.
Conf. Pattern Recognition, 1998, vol. 2, pp. 15651567.
[9] L. Alvarez, J. Weickert, and J. Sanchez, Scale-space approach to nonlocal optical flow calculations, in Proc. 2nd Int. Conf. Scale-Space
Theories in Computer Vision, 1999, vol. 1682, pp. 235246.
1313
1314