Search Engine
Web crawler
Search Engine
A Web search engine is a program designed to search for
information on the WWW on the basis of specified keywords
and to return a list of the documents in which the
keywords were found.
The search results are usually presented in a list and
are commonly called hits. The results may
consist of web pages, images and other
types of files.
Web Crawler
A Web crawler is a computer program that browses the
WWW in a methodical, automated manner.
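The "methodical, automated" browsing described above can be sketched as a breadth-first traversal of links. This is only an illustration under stated assumptions: `FAKE_WEB`, `crawl` and the `*.example` addresses are hypothetical names, and the "web" here is an in-memory dictionary rather than real pages fetched over HTTP.

```python
from collections import deque

# Hypothetical in-memory "web": each URL maps to the URLs it links to.
# A real crawler would fetch each page over HTTP and parse out its links.
FAKE_WEB = {
    "a.example": ["b.example", "c.example"],
    "b.example": ["c.example", "d.example"],
    "c.example": ["a.example"],
    "d.example": [],
}

def crawl(seed, get_links=FAKE_WEB.get):
    """Visit every page reachable from seed, breadth-first, once each."""
    visited = []          # pages in the order they were visited
    seen = {seed}         # pages already queued, so none is visited twice
    queue = deque([seed])
    while queue:
        url = queue.popleft()
        visited.append(url)
        for link in get_links(url) or []:
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return visited

print(crawl("a.example"))
# → ['a.example', 'b.example', 'c.example', 'd.example']
```

The `seen` set is what keeps the crawl from looping forever when pages link back to each other, as `c.example` does here.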
Archie
Veronica and Jughead
Excite
Yahoo
Lycos
Alta Vista
Keyword searching
The most common form of text search on the Web
The “keyword” specified by the user is searched for
Keywords tell the user something
about the subject and content of a page
It is up to the search engine to determine the type of keyword
Keywords may be the words in the title of a
document or in its first line, which are used for “MATCHING”
purposes
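A minimal sketch of matching a keyword against the title and first line of each document, as described above. The function name `keyword_search`, the document layout (`title` and `body` keys) and the sample documents are assumptions made for illustration only. The comparison is deliberately exact and case-sensitive.

```python
def keyword_search(keyword, documents):
    """Return the titles of documents whose title or first line contains keyword."""
    hits = []
    for doc in documents:
        # Only the title and the first line of the body are examined.
        first_line = doc["body"].splitlines()[0] if doc["body"] else ""
        searchable = (doc["title"] + " " + first_line).split()
        if keyword in searchable:
            hits.append(doc["title"])
    return hits

# Hypothetical sample documents.
DOCS = [
    {"title": "Hard drive basics", "body": "How a disk stores data.\nMore text."},
    {"title": "Geology notes", "body": "Hard stone is difficult to cut.\nSofter rock."},
    {"title": "Cooking", "body": "Boil the water.\nAdd a hard cheese."},
]

print(keyword_search("Hard", DOCS))
# → ['Hard drive basics', 'Geology notes']
```

The third document is not returned because its only occurrence of the word is on the second line, and in lower case.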
Problems with keyword searching
Identically spelled KEYWORDS with different meanings
Stemming Problem
Synonym Problem
Stemming Problem
[Figure: Search Engine XYZ search box containing “BIG_” with a GO! button]
It will return the following:
Hard drive    100 KB    www.Hddve
Hard Exam     115 KB    www.Hdex.com
Hard stone    105 KB    www.Hrdsto.com
Most of these are IRRELEVANT to the user.
There is also the problem of CASE SENSITIVITY.
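One common way to soften the stemming and case-sensitivity problems is to normalize both the query and the document words before comparing them. The sketch below is a deliberately crude illustration, not the method of any particular search engine: `normalize`, `matches` and the suffix list are hypothetical, and real stemmers are considerably more careful.

```python
# Suffixes stripped by this toy stemmer (an assumption for illustration).
SUFFIXES = ("ing", "es", "ed", "s")

def normalize(word):
    """Lower-case a word and strip one common suffix, if enough of a stem remains."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word

def matches(query, text):
    """True if any word of text normalizes to the same form as query."""
    target = normalize(query)
    return any(normalize(w) == target for w in text.split())

print(normalize("Searching"), normalize("searches"))  # → search search
print(matches("hard", "Hard drive"))                  # → True
print(matches("search", "Searching the Web"))         # → True
```

Suffix stripping this simple also produces wrong stems for many words, so it shows the idea rather than a usable stemmer.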
Refined Searching
ADVANCED SEARCH
The “criteria of searching” are given by the user
Uses BOOLEAN operators
Allows the user to
search for an entire phrase,
search within specific fields (field searching),
specify the form in which the results should appear,
restrict the search to certain parts of the Internet (e.g.,
Usenet or the Web)
BOOLEAN operators
Boolean AND
FCC AND WIRELESS AND COMMUNICATION
Boolean OR
FCC OR WIRELESS OR COMMUNICATION
Boolean AND NOT
- AND NOT
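The three Boolean operators map naturally onto set operations over an inverted index (a table from each word to the set of documents containing it). The following is a sketch under assumptions: `build_index` and the three sample documents are hypothetical, and the query terms reuse the FCC / WIRELESS / COMMUNICATION example.

```python
from collections import defaultdict

def build_index(documents):
    """Map each upper-cased word to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for word in text.upper().split():
            index[word].add(doc_id)
    return index

# Hypothetical sample documents.
DOCS = {
    1: "FCC wireless communication rules",
    2: "wireless communication devices",
    3: "FCC broadcast licences",
}
index = build_index(DOCS)

# AND: documents containing every term (set intersection).
and_hits = index["FCC"] & index["WIRELESS"] & index["COMMUNICATION"]
# OR: documents containing at least one term (set union).
or_hits = index["FCC"] | index["WIRELESS"] | index["COMMUNICATION"]
# AND NOT: documents containing the first term but not the second (set difference).
and_not_hits = index["FCC"] - index["WIRELESS"]

print(sorted(and_hits))      # → [1]
print(sorted(or_hits))       # → [1, 2, 3]
print(sorted(and_not_hits))  # → [3]
```

AND narrows the result set, OR widens it, and AND NOT excludes documents that mention an unwanted term.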
Phrase Search
Example
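A phrase search differs from a Boolean AND in that the words must appear next to each other and in the given order. A minimal sketch, assuming whitespace-separated words and case-insensitive comparison; `phrase_search` and the sample documents are hypothetical.

```python
def phrase_search(phrase, documents):
    """Return ids of documents that contain the words of phrase consecutively."""
    wanted = phrase.lower().split()
    if not wanted:
        return []
    hits = []
    for doc_id, text in documents.items():
        words = text.lower().split()
        # Slide a window of len(wanted) words across the document.
        for start in range(len(words) - len(wanted) + 1):
            if words[start:start + len(wanted)] == wanted:
                hits.append(doc_id)
                break
    return hits

# Hypothetical sample documents.
PHRASE_DOCS = {
    1: "wireless communication rules",
    2: "communication over wireless links",
    3: "Wireless Communication devices",
}

print(phrase_search("wireless communication", PHRASE_DOCS))
# → [1, 3]
```

Document 2 contains both words, so it would satisfy a Boolean AND, but it is not a phrase match because the words are neither adjacent nor in order.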
Concept-based searching
[Figure: Search Engine XYZ search box with a GO! button]