Académique Documents
Professionnel Documents
Culture Documents
ABSTRACT
This paper presents a novel approach based on multiwavelet transform in Web
information retrieval. The influence multiwavelet transform on feature extraction
and information retrieval ability of calibration model and solve problem of
selecting optimum wavelet transform for sentence query entered of any language
by internet user was investigated. The empirical results show that the proposed
method performs accurate retrieval of the multiwavelet transform than scalar
wavelet transform. The aptitude of multiwavelet transform to represent one
language (domain) of the multilingual world. Regardless of type, script, word
order, direction of writing and difficult font problems of the language. This work is
a step towards multilingual (English, Spanish, Danish, Dutch, German, Greek,
Portuguese, French, Italian, Russian, Arabic, Chinese (Simplified and Traditional),
Japanese and Korean (CJK)) search engine. We consider multiwavelet transform
as signal processing tool in Soft Computing (SC) as well.
Keywords: Multiwavelet Transform, Multilingual, Search Engine.
INTRODUCTION
Volume 3 Number 3
Page 175
www.ubicc.org
c1,-1,k
c1,0,k
c2,j-1,k
c2,-1,k
c2,0,k
G
d1,-1,k
d2,-1,k
d1, j-1,k
d2,j-1,k
Page
Here
denotes
the
(t ) = [1 (t )2 (t )... r 1 (t ) ]
satisfy
the
matrix
Volume 3 Number 3
M 1
Similarly
k =0
H k ( 2 t k )
for
the
(1)
multiwavelets
r 1
equation is obtain by
(t ) =
M 1
k =0
G k ( 2 t k )
(2)
176
www.ubicc.org
START
Initialize variables of query
Select the language
f (t) = C j 2
M 1
k =0
(2 j t k ) + Dj (k)2 2 (2 j t k) (3)
1
j0
2
j = j0
More
Language
YES
NO
END
2 G ( k )C j ( 2 t k )
k
(5)
3 5 2
45
1 20 3 10 2
3 5 2
0
H1=
9 20 1 2
0
0
0
0
H2 =
H3 =
9 20 3 10 2
1 20 0
H0 =
and
1 20 3 10 2
1 10 2
3 10
9 20
1 2
G1 =
9 10 2
G0=
9 20
3 10 2
9 10 2
3
10
0
1 20
G3 =
1 10 2 0
G2 =
Volume 3 Number 3
j 1 ( k ) =
Page 177
Methodologies
This paper is extended to our previous work [12]. The main objective of the proposed method is to
make the internet user easily access it using ones
language or any language for the required
information.
In our previous work, we applied our novel
method that the sentence queries entered of 14
languages of five language families by internet user
depend on ones own language convert to signal.
Then, the signal is processed by using Forty-two
wavelet functions (mother wavelets) of six families,
namely Haar (haar), Daubechies (db1-10),
Biorthogonal (bior1.1- 6.8 and rbior1.1 6.8),
Cofilets (coif 1-5), Symlets (sym 1 - 10), and Dmey
(dmey), of wavelet transform.
However, we can not determine one wavelet
function to be used for multilingual web information
retrieval perfectly. Due to, the wavelet function, type
and sentence query of language should have good
similarities. This means that the wavelet function
should have good similarities not just with type of
language, but also with sentence query of the same
language.
In the current paper, we have assumed that some
www.ubicc.org
Volume 3 Number 3
Page 178
www.ubicc.org
REFERENCES
Volume 3 Number 3
Page 179
www.ubicc.org
a) English Language.
b) Spanish Language.
c) Arabic Language.
d) Japanese Language.
g) Korean Language.
Figure 1: shows What is your name? query converted to signal for six languages of five language families.
TABLE 1:
The sentence query of What is your name? translates to 6 Languages with one level decomposition of FWT and DMWT.
The sentence of What is your name? translates to 6 languages(decomposition(D)) and reconstruction (R))
Compar.
D
R(DMWT)
DMWT
R(haar)
haar
R (bior2.2)
bior2.2
R (bior3.1)
bior3.1
Simplified Chinese
100%
94%
100%
94%
Traditional Chinese
100%
94%
100%
94%
Spanish
Cul es su nombre?
Cul es su nombre?
100
Cul essunombre?
89.5%
Cul es su nombre?
100
Cul es su nombre?
100%
English
What is your name?
What is your name?
100%
Whatisyourname?
83%
What is your name?
100%
What is your name?
100%
Arabic
100%
87.5%
100%
87.5%
Korean
?
?
100%
>
88%
?
100%
?
84%
Japanese
100%
96%
100%
96%
TABLE 2:
The sentence query of Good morning, Hi. translates to 10 Languages with one level decomposition of FWT and DMWT.
Sentence of Good morning, Hi. Translate to 10 languages(decomposition(D)) and reconstruction (R))
Compar.
D
R(haar)
haar
R (bior3.1)
bior3.1
R (DMWT)
DMWT
English
Good morning, Hi.
Good morning, Hi.
100%
Good morning, Hi.
100%
Good morning, Hi.
100%
Volume 3 Number 3
Arabic
.
.
88.2%
.
100%
.
100%
Russion
, .
, .
100%
, .
100%
, .
100%
Greek
, Hi.
, Hi.
100%
, Hi.
100%
, Hi.
100%
Page
Dutch
Goedemorgen, Hi.
Goedemorgen, Hi.
100%
Goedemorgen, Hi.
100%
Goedemorgen, Hi.
100%
180
German
Guten Morgen, Hi.
GutenMorgen, Hi.
94.1%
Guten Morgen, Hi.
100%
Guten Morgen, Hi.
100%
Portuguese
Bom dia, Hi.
Bom dia, Hi.
100%
Bom dia, Hi.
100%
Bom dia, Hi.
100%
French
Bonjour, Salut.
Bonjour, Salut.
100%
Bonjour, Salut.
100%
Bonjour, Salut.
100%
www.ubicc.org
Italian
Buongiorno, Hi.
Buongiorno, Hi.
100%
Buongiorno, Hi.
100%
Buongiorno, Hi.
100%
Danish
Goddag, Hej.
Goddag, Hej.
100%
Goddag, Hej.
100%
Goddag, Hej.
100%
TABLE 3:
The sentence query of What is your name? translates to Traditional Chinese Languages with four levels decomposition of
FWT (previous work results)
wavelets
haar
db 1
db2
db 3
db 4
db 5
db 6
db 7
db 8
db 9
db 10
coif 1
coif 2
coif 3
coif 4
coif 5
sym 1
sym 2
sym 3
sym 4
sym 5
sym 6
sym 7
sym 8
sym 9
sym 10
bior1.1
bior1.3
bior1.5
bior2.2
bior2.4
bior2.6
bior2.8
bior3.1
bior3.3
bior3.5
bior3.7
bior3.9
bior4.4
bior5.5
bior6.8
Dmey
Average %
Level 1
Volume 3 Number 3
%
94
94
61
72
67
83
72
67
78
67
78
78
78
56
61
61
94
61
72
78
67
78
61
67
72
67
94
94
83
100
100
89
83
94
94
100
89
89
78
67
78
83
78
Level 2
%
Level 3
%
100
94
100
94
72
61
78
89
61
50
83
89
89
83
72
83
83
89
78
67
72
44
56
72
72
56
61
61
72
61
66
72
100
94
72
61
78
89
61
78
78
61
61
78
72
67
61
50
61
61
61
89
100
94
94
100
83
94
100
100
94
94
94
94
100
100
94
94
89
94
94
94
94
94
83
89
61
50
78
89
78
50
83
89
79
79
Page
181
Level 4
www.ubicc.org
%
94
94
94
89
50
89
83
89
89
44
44
78
44
61
67
67
94
44
89
78
56
94
56
44
61
94
94
100
94
100
100
100
100
89
94
100
100
78
44
89
44
94
77
TABLE 4:
The sentence query of What is your name? translates to Korean Languages with four levels decomposition of FWT (previous work
results)
wavele
ts
Haar
db 1
db2
db 3
db 4
db 5
db 6
db 7
db 8
db 9
db 10
coif 1
coif 2
coif 3
coif 4
coif 5
sym 1
sym 2
sym 3
sym 4
sym 5
sym 6
sym 7
sym 8
sym 9
sym 10
bior1.1
bior1.3
bior1.5
bior2.2
bior2.4
bior2.6
bior2.8
bior3.1
bior3.3
bior3.5
bior3.7
bior3.9
bior4.4
bior5.5
bior6.8
Dmey
Avr %
Level 1
>
>
?
>
?
>
?
>
>
?
?
>
?
>
?
>
>
?
>
>
?
>
?
?
>
>
>
>
>
?
>
>
>
?
>
>
?
>
?
>
?
>
Volume 3 Number 3
%
88
88
60
68
52
76
60
76
76
60
64
64
52
64
68
64
88
60
68
64
64
64
68
64
60
72
88
88
76
100
92
84
88
84
88
80
88
92
52
80
60
60
72
Page
182
%
100
100
60
72
56
76
80
72
72
60
52
60
56
56
60
56
100
60
72
64
64
68
60
60
60
72
100
96
96
88
92
68
92
88
100
100
100
100
56
72
56
64
74
Level 4
?
?
?
>
?
>
>
>
>
?
?
>
?
>
?
?
?
?
>
>
?
>
?
?
>
>
?
?
?
?
?
?
>
?
?
?
?
?
?
>
?
>
www.ubicc.org
%
100
100
56
76
52
76
72
72
76
56
52
64
56
56
60
60
100
56
76
68
56
76
60
60
56
72
100
96
96
100
88
96
84
100
100
92
96
100
52
76
56
72
75
AUTHORS BIOGRAPHIES
Shawki A. Al-Dubaee was born in 1978, Taiz, Yemen. He received his B.E degree in
Computer Technology Engineering from Technical college and M.Sc degree in Computer
Engineering from Mosul University, Mosul, Iraq, in 2000 and 2003 respectively with the
thesis Feature Extraction of Signal Processed Using Wavelet and Artificial Neural
Networks. In 2008, he received his Post Graduate Diploma in Linguistics, Department of
Linguistics from Aligarh Muslim University, Aligarh, India. He is a lecturer on leave in
Department of Computer and Communication, College of Engineering and Information
Technology at Taiz University, Taiz, Yemen.
Currently, he is a research scholar in Department of Computer Engineering, Z. H.
College of Engineering and Technology, at Aligarh Muslim University, Aligarh, India.
His current research interests include Web Intelligence, Soft Computing, Computational
Linguistic, Question and Answering System, and Semantic Web. He is an IEEE student
member and Computational Intelligent Society member. Also, he is member in the
International Association of Engineers (IAENG).
Nesar Ahmad was born in 1961 in Patna, India. He received his B.Sc (Engg) degree
in Electronics & Communication Engineering from Bihar College of Engineering, Patna
University, India, and M.Sc degree in Information Engineering from City University,
London, U.K., in 1984 and 1989 respectively. In 1993, he received his Ph.D. degree from
the Indian Institute of Technology (IIT) Delhi, New Delhi, India.
He worked as an Assistant Professor in the Department of Electrical Engineering, IIT
Delhi till December 2004. He was with King Saud University, Riyadh during 1997-99 as
Assistant Professor. Currently, he is a Professor and Chairman of Computer Engineering
Department, Z. H. College of Engineering and Technology, at Aligarh Muslim
University, Aligarh, India. His current research interests mainly include Soft Computing,
Web Intelligence, and Embedded Systems Design.
Volume 3 Number 3
Page 183
www.ubicc.org