Académique Documents
Professionnel Documents
Culture Documents
Neurocomputing
journal homepage: www.elsevier.com/locate/neucom
a r t i c l e in f o
a b s t r a c t
Article history:
Received 28 October 2008
Received in revised form
20 January 2010
Accepted 12 February 2010
Communicated by A. Zobaa
Available online 1 August 2010
Nowadays, most factories rely on machines to help boost up their production and process. Therefore, an
effective machine condition monitoring system plays an important role in these factories to ensure that
their production and process are running smoothly all the time. In this paper, a new and effective
machine condition monitoring system using log-polar mapper, quaternion based thermal image
correlator and max-product fuzzy neural network classier is proposed. Two classication characteristics, namely peak to sidelobe ratio (PSR) and real to complex ratio of the discrete quaternion
correlation output (p-value) are applied in this proposed machine condition monitoring system. Large
PSR and p-value showed a good match among correlation of the input thermal image with a particular
reference image, but reversely for small PSR and p-value match. In the simulation, log-polar mapping is
found to have solved the rotation and scaling invariant problems in quaternion based thermal image
correlation. Besides, log-polar mapping can possess two fold data compression capability. Log-polar
mapping helps smoothen up the output correlation plane, hence making better measurement for PSR
and p-values. The simulation results have also proven that the proposed system is an efcient machine
condition monitoring system with an accuracy of more than 94%.
& 2010 Elsevier B.V. All rights reserved.
Keywords:
Thermal condition monitoring system
Log-polar mapping
Quaternion correlation
Max-product fuzzy neural network
Thermal imaging
1. Introduction
The recently developed concept of quaternion correlation,
based on quaternion algebra introduced by Hamilton in 1843 [1],
is found useful in color pattern recognition (e.g. human face
recognition [2], color alphanumeric words recognition [3]). A
quaternion array is generalized from the complex number
representation and it can be considered as a number with a real
and imaginary term consisting of three orthogonal components as
follows [4]:
f fr fi i fj j fk k
where fr, fi, fj, fk are real quaternion numbers and i, j, k, are
imaginary operators.
In conventional correlation for pattern recognition, patterns
have to be converted to gray level scale or by processing the three
color channels (R, G, B) separately before combining them again.
But for the quaternion correlation technique, all color channels
are processed together using the quaternion array. In quaternion
array, the R, G, B in color image can be represented by inserting
the value of three color channels into the three imaginary terms of
n
165
166
3. Log-polar mapping
Log-polar imaging geometry is a biological inspiration approach to human eye vision whereby it transforms the original
two dimensional captured image into a spatially variant (retinalike) representation [10]. Generally, the Cartesian image in any
vision system can be resampled to retina-like image through the
use of the mask, inspiring the retina. The retina can be roughly
divided into two distinct regions: fovea and periphery. The fovea
region (central portion of the retina) is formed by approximately
constant size receptive elds and organized in hexagonal form
[10]. In the periphery region, the receptive elds are circularly
distributed with an area exponentially increasing as a function of
the distance to the retina center. Overlapping may exist among
the receptive elds in order to prevent some Cartesian areas from
not being covered by the retina transformation (due to the
circular geometry of the receptive elds) [10].
In many vision systems, there has been a trend to design and use
true retina-like sensors [11,12] or simulate the log-polar images by
software conversion [13,14]. In the software conversion of log-polar
images, practitioners in pattern recognition usually named it as logpolar mapping [13,15]. Log-polar mapping has been used in
designing systems that are scale and rotation invariant [16,17],
and showed good performance. Another advantage is log-polar
image representation has data compression manner. Therefore, in
this proposed machine condition monitoring system, log-polar
mapping is used to convert the input thermal image captured by
thermal camera into log-polar image for solving the scaling and
rotational invariant due to the change of position in either thermal
camera or machines during monitoring process, and preserving ne
image quality in a higher data compression manner. An example
R
ro
Ny
y1
arctan
2p
x1
y2 r, y l ro sin
2py
Ny
where R is the distance between the given point and the center of
q
mapping x21 y21 , ro is a scaling factor dening the size of the
circle at r(x1, y1)0 and l is the base of the algorithm.
1 sinp=Ny
1sinp=Ny
Fig. 3. An example of log-polar mapping of a N rings retina containing 3 rings fovea. To simplify the gure, no overlapping was use. Note that periphery portion is log-polar.
167
l
l1
9
Fig. 6. Unwarping process.
10
11
12
M
1 N
1
X
X
9Im,n9
13
m0n0
14
M
1 N
1
X
X
9Im,n9
15
m0n0
M
1 N
1
X
X
t0Z0
18
168
where Pr(ms, ns) is the real part of P(ms, ns). Pi(ms, ns), Pj(ms, ns) and
Pk(ms, ns) are the i, j k parts of P(ms, ns), respectively. If p Zd1 and
c1 o9P(ms, ns)9 oc2, it implies that at location (ms, ns), there is an
object that has the same shape, size, color and brightness as the
reference image. If d1 o 1, c1 o1 oc2 and all the variables are near
to 1, the value of p decays faster with the color difference between
the matching image and the reference image.
Another classication characteristic used in quaternion based
thermal image correlation is the peak-to-sidelobe ratio (PSR), which
will be discussed in detail as below. The quaternion based thermal
image correlation consists of 2 stages: enrollment stage and
recognition stage. In the enrollment stage, one or multiple images
of each machine condition are acquired. These multiple reference
images have variety in color tones for different temperature
conditions of the machines. The DQFT of the reference images are
used to train fuzzy neural network and to determine the correlation
lter for each possible machines conditions. In recognition stage, the
thermal camera captures live machine thermal image and the DQFT
of the image is correlated with the one of the reference images, stored
in the database together with their corresponding lter coefcients.
The inverse DQFT of this product is the result in the correlation output
of that lter.
A strong peak can be observed in the correlation output if the
input image comes from imposter class. The peak-to-sidelobe
ratio (PSR) is a method for measuring the peak sharpness which is
dened as follows [2]:
peakmeansidelobe
PSR
ssidelobe
21
22
M
1 N
1
X
X
23
t0Z0
24
25
26
27
t2 m,n
M
1 N
1
X
X
t0Z0
I s(t1 )
I s(t1 )
28
169
t2 m,n
1 N
1
X
X
1 M
em1 2pmt=M gst1 ,
4p2 t 0 Z 0
t2 m,
nem2 2pnZ=N
29
30
31
p
1a2 C1 m
32
170
33
where i represents the input image, hsr(i) represents the real part
of quaternion array of sth machine section for input image i, s 1,
2, y, S represents the number of partitioned machines sections.
hsR(i), hsG(i) and hsB(i) each represents the i-, j-, k-imaginary parts of
sth machine section for input image i, respectively.
The quaternion array in (33) performs DQFT to transform the
quaternion image to the quaternion frequency domain. A twoside form of DQFT is used:
hsi m,n
M
1 N
1
X
X
34
t0Z0
M
1 N
1
X
X
35
t0Z0
M
1 N
1
X
X
1
em1 2pmt=M gsi,t2 m,n em2 2pnZ=N
4p2 t 0 Z 0
36
"
#
PSRst1 ,t2 12
s2
39
Xs training
Gs1,1
6G
6 s2,1
6
6 ^
4
GsT,1
Gs1,2
Gs2,2
&
GsT,2
Gs1,T
40
48
49
Gs2,T 7
7
7
^ 7
5
GsT,T
171
41
50
6. Experimental results
In this section, the application of log-polar mapper, quaternion
based thermal image correlator together with max-product fuzzy
neural network classier for machine condition monitoring
system is illustrated. Here, some experiments were used to prove
the algorithms introduced in Sections 35.
"
GPSRsi,t exp
PSRsi,t2 12
42
s2
Gpsi,t2 exp
"
#
PSRsi,t2 12
43
s2
(2) Calculate the product value for sth machine section of the
fuzzy neural network classier at input image on the training
images in the database:
Gsi,t2 GPSRsi,t Gpsi,t2
44
Gsi,2
Gsi,T
45
(4) Obtain the classication outcomes for each machine condition in the sth section by multiplying (45) with the weight
trained at (41):
Ys classification Xs classification ws
46
Gsi,2
...
Gsi,T
47
172
Fig. 11. Image on the site capture by thermal camera for all machines are in
overheats condition.
Fig. 14. Sample input thermal image (all machines in overheat condition).
173
Fig. 16. Samples correlation plane for input thermal image (for every machine is overheated) matching with one of the reference image of overheated class for both all the
machines in the database (authentic case) (a) section machine A, (b) section machine B, (c) section machine C.
Table 1
Normalized PSR and p-values for both authentic and imposter case.
Fig. 17. Sample input thermal image (all machines in overheat condition).
Authentic case
Normalized PSR
Normalized p-value
Section mac. A
Section mac. B
Section mac. C
0.95281
1.0000
0.9674
0.7971
0.8987
0.8317
Imposter case
Normalized PSR
Normalized p-value
Section mac. A
Section mac. B
Section mac. C
0.0338
0.0421
0.0371
0.0287
0.0373
0.0345
Fig. 18. Log polar mapping of Fig. 16 as divided into 3 partitioned machines
sections.
Table 1 shows the PSR and p-value for both authentic and
imposter case as shown in Fig. 16 and 19 for section machine A, B
and C. Note that the sharp correlation peak resulting in large
174
Fig. 19. Samples correlation plane for input thermal image (for every machine is not-overheated) matching with one of the reference image of overheated class for both all
the machines in the database (imposter case) (a) section machine A, (b) section machine B, (c) section machine C.
Fig. 20. An example case of rotational invariant (a) Cartesian input image capture
by digital camera (1.6 m), (b) the same image as in (a) capture using thermal
camera during the machine being overheated, (c) log polar mapping image of (b),
(d) rotational invariant of (a) capture by digital camera, (e) thermal image of (d) in
Cartesian form, (f) log-polar form of (e).
175
Fig. 21. Output correlation planes among: (a) Fig. 20b and b itself, (b) Fig. 20b and e, (c) Fig. 20c and c itself, (d) Fig. 20c and f.
Fig. 22. An example case of scaling invariant (a) Cartesian input image capture by
digital camera (3.2 m), (b) the same image as in (a) capture using thermal camera
during the machine is overheated, (c) log polar mapping image of (b).
176
Fig. 23. Output correlation planes among: (a) Fig. 20b and Fig. 22b, (b) Fig. 20c and Fig. 22c.
7. Conclusion
This paper presented a system capable of monitoring machine
condition using log-polar mapping method, quaternion based
thermal image correlator and max-product fuzzy neural network
classier. The results show that the proposed machine condition
monitoring system achieved very high accuracy, i.e. more than
94%. One of the advantages using quaternion correlation rather
than conventional correlation method is that quaternion correlation method deals with color images without converting those
images into grayscale images. Hence, important color information
can be preserved. Max-product fuzzy neural network provides
high level framework for approximate reasoning; hence it is best
suited for classication use. We also found that the use of logpolar mapping solves rotation and scaling problems in quaternion
based thermal image correlation. In addition, log-polar mapping
has data compression capability. Therefore the use of log-polar
mapping reduces the computation time and memory storage
needs. Log-polar mapping also smoothens the output correlation
plane, hence making better measurement for PSR and p-value.
Based on the experimental results, the use of PSR and p-value
provides higher accuracy in tracking thermal condition in the
proposed machine condition monitoring system. In future, there
is a plan to implement this machine condition monitoring system
in a wide area coverage using minimum hardware manner,
whereby the machines surrounded in an omnidirectional (3601)
view can be monitored by using a single thermal camera and a
specic design hyperbolic mirror. Therefore, a mathematical
model has to be formulated to design the geometry of such
mirror and an unwarping processing method also needs to be
researched for unwarping the omnidirectional image into panoramic forms for better machine partitioning purpose. These
topics will be addressed in future work.
References
[1] W.R. Hamilton, in: Elements of Quaternions, Longmans, Green, London, U.K, 1866.
[2] C. Xie, M. Savvides, B.V.K. Vijaya Kumar, Quaternion correlation lters for face
recognition in wavelet domain, in: Proceedings of the International Conference on
Acoustic, Speech and Signal Processing (ICASSP 2005), pp. II85II88.
[3] S.C. Pei, J.J. Ding, J. Chang, Color pattern recognition by quaternion correlation,
in: Proceedings of the International Conference on Image Processing, vol. 1,
2001, pp. 894897.
[4] C. Eddie Moxey, S.J. Sangwine, T.A. Ell, Hypercomplex correlation techniques
for vector images, IEEE Trans. Signal Process. 51 (7) (2003) 19411953.
[5] C. Weiman, Log-polar vision for mobile robot navigation, in: Proceedings of the
Electronic Imaging Conferences, November 1990Boston, USA, pp. 382385.
[6] B.V.K. Vijaya Kumar, M. Savvides, K. Venkataramani, C. Xie, Spatial frequency
domain image processing for biometric recognition, Proc. Int. Conf. Image
Process. 1 (2002) I53I56.
[7] R.O. Duda, P.E. Hart, D.G. Stork, in: Pattern Classication, second ed., Wiley,
N.Y, 2001.
[8] R.K. Brouwer, A fuzzy threshold max-product unit, with learning algorithm,
for classication of pattern vectors, in: Proceedings of the VI Brazillian
Symposium on Neural Networks, January 2000, pp. 208212.
[9] /http://www.aas2.com/products/axt100/S.
[10] H.M. Gomes, R.B. Fisher, Learning-based versus model-based log-polar
feature extraction operators: a comparative study, XVI Brazillian Symposium
on Computer Graphics and Image Processing, October 2003, pp. 299306.
[11] LIRA Lab, Document on specication, Technical Report, Espirit Project
n.31951-SVAVISCA-available at http://www.lira.dist.unige.it.
[12] R. Wodnicki, G.W. Roberts, M.D. Levine, A foveated image sensor in standard
CMOS technology, in: Proceedings of the Custom Integrated Circuits
Conference, May 1995Santa Clara, pp. 357360.
[13] F. Jurie, A new log-polar mapping for space variant imaging: application to
face detection and tracking, Pattern Recognition, 32, Elsevier Science, 1999,
pp. 865875.
[14] V.J. Traver, Motion estimation algorithms in log-polar images and application
to monocular active tracking, Ph.D. Thesis, Dep. Llenguatges I, Sistemes
Informatics, University Jaume I, Castellon, Spain, September 2002.
[15] H. Araujo, J.M. Dias, An introduction to the log-polar mapping, Proc. 2nd
Workshop Cybernatic Vision (1996) 139144.
[16] G. Sandidni, M. Tristarelli, Vision and space-variant sensing, Neural Networks
for Perception, vol. 1, Academic Press, 1992, pp. 398425.
[17] H. Gomes, R. Fisher, Structural learning from iconic representations, Lecture
Notes in Articial Intelligence, Springer, 2000, pp. 399408.
[18] F. Berton, A brief introduction to log-polar mapping, Technical Report, LIRALab, University of Genova, 2006.
[19] M. Bolduc, M.D. Levine, A review of biologically-motivated space variant data
reduction models for robotic vision, Comput. Vision Image Understanding 69
(2) (1998) 170184.
[20] C.G. Ho, R.C.D. Young, C.R. Chatwin, Sensor geometry and sampling methods
for space variant image processing, Pattern Analysis and Application,
Springer Verlag, 2002, pp. 369384.
[21] T.A. Ell, Quaternion-Fourier transforms for analysis of two-dimensional linear
time-invariant partial differential systems, Proc. 32nd Conf. Decision Contr.
(1993) 18301841 Dec.
[22] T.A. Ell, Hypercomplex spectral transforms, Ph.D. Dissertation, University of
Minnesota, Minneapolis, 1992.
[23] S.C. Pei, J.J. Ding, J.H. Chang, Efcient implementation of quaternion Fourier
transform convolution and correlation by 2-D complex FFT, IEEE Trans. Signal
Process. 49 (11) (2001) 27832797.
[24] S.J. Sangwine, T.A. Ell, Hypercomplex auto- and cross-correlation of colour
images, Proc. Int. Conf. Image Process., ICIP, 1999, pp. 319323.
[25] T.A. Ell, S.J. Sangwine, Colour-sensitive Edge Detection using Hypercomplex
Filters, EUSIPCO, 2000, pp. 151154.
[26] A. Vanderlugt, Signal detection by complex spatial ltering, IEEE Trans. Inf.
Theory 10 (1964) 139145.
[27] M. Saviddes, K. Venkataramani, B.V.K. Vijaya Kumar, Incremental updating of
advanced correlation lters for biometric authentication systems, Proc. Int.
Conf. Multimedia Expo, 3, ICME, 2003 p.p. 229232.
[28] A. Mahalanobis, B.V.K. Vijaya Kumar, D. Casasent, Minimum average
correlation energy lters, Appl. Opt. 26 (1987) 36333640.
[29] M. Savvides, B.V.K. Vijaya Kumar, P. Khosla, Face verication using
correlations lters, in: Proceedings of the Third IEEE Automatic Identication
Advanced Technologies, Tarrytown, NY, 2002) pp. 5661.
[30] A. Mahalanobis, B.V.K. Vijaya Kumar, S.R.F. Sims, J.F. Epperson, Unconstrained
correlation lters, Appl. Opt. 33 (1994) 37513759.
[31] P. Refreiger, Filter design for optical pattern recognition: multi-criteria
optimization approach, Opt. Lett. 15 (1990) 854856.
[32] B.V.K. Kumar, D.W. Carlson, A. Mahalanobis, Optimal trade-off synthetic
discriminant function lters for arbitrary devices, Opt. Lett. 19 (19) (1994)
15561558.
[33] S. Kumar, in: McGraw Hill Int. (Ed.), Neural Networks: A Classroom Approach,
2004.
[34] J.J. Buckley, Y. Hayashi, Fuzzy neural networks: a survey, Fuzzy Sets Syst. 66
(1994) 113.
[35] C.T. Lin, C.S.G. Lee, in: Neural Fuzzy Systems: A Neuro-fuzzy Synergism to
Intelligent Systems, Prentice Hall, Upper Saddle River, NJ, 1996.
[36] D. Nauck, F. Klawonn, R. Kurse, in: Foundations of Neuro-fuzzy Systems,
Wiley, Chichester, U.K, 1997.
[37] S.K. Pal, S. Mitra, in: Neuro-fuzzy Pattern Recognition: Methods in Soft
Computing, Wiley, Chichester, UK, 1999.
[38] R. Ostermark, A fuzzy neural network algorithm for multigroup classication,
Elsevier Science, Fuzzy Sets and Systems, 105 (1999) 113122.
[39] H.K. Kwan, Y. Cai, A fuzzy neural network and its application to pattern
recognition, IEEE Trans. Fuzzy Syst. 2 (3) (1997) 185193.
[40] G.Z. Li, S.C. Fang, Solving interval-valued fuzzy relation equations, IEEE Trans.
Fuzzy Syst. 6 (2) (1998) 321324.
[41] J. Leotamonphong, S. Fang, An efcient solution procedure for fuzzy relation
equations with max product composition, IEEE Trans. Fuzzy Syst. 7 (4) (1999)
441445.
[42] M.M. Bourke, D.G. Fisher, A predictive fuzzy relational controller, in:
Proceedings of the Fifth International Conference on Fuzzy Systems, 1996,
pp. 14641470.
177
[43] M.M. Bourke, D.G. Fisher, Solution algorithms for fuzzy relational equations
with max-product composition, Fuzzy Sets Syst. 94 (1998) 6169.
[44] P. Xiao, Y. Yu, Efcient learning algorithm for fuzzy max-product associative
memory network, SPIE 3077 (1997) 388395.
[45] J. Owen, A. Hunter, E. Fletcher, A fast model-free morphology-based object
tracking algorithm, Br. Mach. Vision Conf. (2002) 767776.
[46] H.V. Kennedy, Modeling noise in thermal imaging systems, Proc. SPIE 1969
(1993) 6670.