Académique Documents
Professionnel Documents
Culture Documents
Chapter 1
Introduction
Infrared (IR) cameras (also called thermal cameras) are passive sensors, which have a
spectral sensitivity to the 7µm to 14µm wavelength band. They are sensitive to the
radiation emitted, transmitted, and reflected by an object in the scene. The special
properties of the infrared cameras give us a new perspective to solve those vision
problems which are difficult or complex for conventional visible images. They have been
widely used in many fields: transportation, automation, law enforcement, firefighting,
research, building, security, etc. For example, in industry, workers use the IR cameras to
detect the stress and fault of the machine. IR sensors are even embedded on some mobile
phones, expanding IR imaging to people’s daily life. The sensor are supposed to use the
thermal map as augmented reality that help people find unusual stuffs at home as well as
act as a night vision camera.
However, the IR camera usually has low resolution, mostly only 160×120, and 320×240,
which is limited by the finite size of the infrared focal plane, the optics of the camera and
cooling of the sensor. Though to improve imaging device to obtain a high resolution IR
image, it would dramatically increase the production cost. Little work has been done to
enhance the low resolution IR imaging, either hardware or software. This limits the work
in high-level image processing tasks which require good quality of the imagery. The
application and user experience of the IR cameras are largely constrained due to their low
resolution while the color cameras are developing rapidly. A high quality visual camera
has become the common equipment on many devices on account of its giveaway prices
and high resolution. Thus, to reconstruct the low-resolution IR images guided by the
visible-light camera. Super-resolution is a technique to allow the construction of a higher
resolution image of an area, based on the merging of information from other related
images. It is related to many other image processing problems, such as image coding,
reconstruction, error concealment, and deblocking.
The fact that a registered high-quality texture image can provide significant information
to enhance the infrared image due to their strong correlation. The key method is the use
of a guided filter in the correlated region between IR image and color image and
construct a cost volume of IR image values probability based on the input IR image. A
guided filter is then applied to the cost volume in the correlated region. After that, two
steps including a best cost selecting and sub-pixel refinement are taken to produce a
refined IR image. The output image is finally got after a outlier detection. The algorithm
works fast and turns out to be suitable for IR image super resolution.
Chapter 2
In this Chapter, it introduces an apparatus and methodology to get the registered IR and
visual image sets. Firstly, describe system settings. Then, to show how to calibrate the
two cameras. Finally leverage a well performed algorithm to register images from two
different sensors, including the IR sensor and the RGB sensor. With this system,
contribute several video sequences for further research. Dataset provide HR color video
and corresponding IR map, including both indoor and outdoor scenes
(a)
(a) (b)
(c)
Figure 2.2 (a) Chess printed by 3D printer (b) the thermal image of chess after filled
with water (c) Some examples from datasets
(a)
Figure 3.1(a) Histogram of channels that most relevant to IR sensor
𝑐𝑤 1
𝐶𝑖(𝑖) =|𝜔| ∑𝑘:𝑖∈𝜔𝑘(𝑎𝑘 𝐻𝑟(𝑖) + 𝑏𝑘 ) (2, 2)
𝑏𝑘 =𝑝𝑘-𝑎𝑘 𝜇𝑘 (2, 4)
Here,𝜇𝑘 and 𝜎𝑘2 are the mean and variance of I in 𝜔𝑘 , |w| is the number of pixels in
1
window 𝜔𝑘 , and 𝑝𝑘 =|𝜔| ∑𝑘∈𝜔𝑖 𝑎𝑘 the mean of p in 𝜔𝑘 . After this, we can get sharp and
right edges while keeping the true values in smooth areas.
f(𝑥𝑚𝑖𝑛 ) is the minimum of function f(x). The parameters a, b and c of the function f(x)
can be calculated given d, f(d),f(𝑑− ), f(𝑑+ ). So xmin can be calculated:
𝑓 (𝑑+ )−𝑓(𝑑− )
𝑥𝑚𝑖𝑛 =d-2(𝑓(𝑑 (2, 6)
+ )+𝑓(𝑑− )−2𝑓(𝑑− ))
Chapter 4 Results
We test our approach using the real world examples collected in our dataset including
indoors and outdoors images. Like the other super-resolution methods, we use the
original IR image in our dataset as the ground truth. And we prepare our test in the
following manner: An input sequence is blurred using a 3×3 uniform mask, decimated by
a factor of 1 : 4 (in each axis), and then contaminated by an additive noise with std = 2.
For the main parameters in our algorithm, we set them as: ε = 0.012, η = 0.5, L = 20000,
T = 0.35 and k = 6 respectively, which makes the best performance in our experiments.
The system configuration for experiments is Quad-Core 3.4GHz CPU, 4GB RAM. We
implement our algorithm via Matlab. The bicubic interpolator, joint bilateral upsampling,
are used for comparison. Three experiment results in three different scenes are listed in
the Fig. 4.
Figure 4: From left to right are the suitable channel images, the ground truth IR
images, the bicubic interpolation results, the result using JBU and the results using
proposed algorithm. The result images has been up sampled by a factor of 1:4
(in each axis).
We can see that algorithm performs well in all the situations. Bicubic works well in
smooth areas but smooth the discontinuous areas. The guided filter sharpen the IR image
edges in true place by using the information provided by the aligned color image. It is
only applied in the correlated region to avoid wrong edges transfering. After processing,
the edges of the enhanced IR images are sharpen, and the added noise is removed. More
importantly, the over texture and over smooth problem is avoided.
Chapter 5 Conclusion
Bibliography