Before a search engine can tell you where a file or document is, it must be found.
To find information on the hundreds of millions of Web pages that exist, a search engine employs special software robots, called spiders, to build lists of the words found on Web sites. When a spider is building its lists, the process is called Web crawling. In order to build and maintain a useful list of words, a search engine’s spiders have to look at a lot of pages.
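As a rough illustration of the word lists a spider builds, here is a minimal Python sketch. It assumes the pages have already been downloaded; the `pages` dictionary, its example.com URLs, and the helper names are hypothetical and only stand in for what a real spider would fetch.

```python
from collections import defaultdict
from html.parser import HTMLParser
import re


class TextExtractor(HTMLParser):
    """Collects the text between tags, ignoring the tags themselves."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def words_on_page(html):
    """Return the lowercase words found in the text of one page."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return re.findall(r"[a-z0-9]+", " ".join(parser.chunks).lower())


def build_word_lists(pages):
    """Map each word to the set of URLs on which it appears."""
    index = defaultdict(set)
    for url, html in pages.items():
        for word in words_on_page(html):
            index[word].add(url)
    return index


# Hypothetical pages standing in for what a spider would download.
pages = {
    "http://example.com/a": "<html><body><h1>Web crawling</h1><p>Spiders read pages.</p></body></html>",
    "http://example.com/b": "<html><body><p>Spiders build lists of words.</p></body></html>",
}
index = build_word_lists(pages)
```

Looking up `index["spiders"]` then gives both URLs, while `index["crawling"]` gives only the first one. A real spider would also follow the links on each page to discover more pages, which this sketch leaves out.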
Here is a brief outline of how search engines work, as far as I understand it.
Some well-known search engines are Google, Yahoo and AltaVista.
Google indexes the important words on a website and excludes unimportant words such as 'a', 'an' and 'the'. AltaVista, on the other hand, indexes every word on a site. Spiders also use the meta tags of a site: they correlate the meta content with the page content and reject the page if the two don't match.
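The difference between the two indexing styles can be sketched in a few lines of Python. The `STOP_WORDS` set holds only the three words named above, and `index_terms` is a hypothetical helper, not how any particular search engine actually does it.

```python
# Only the three words named above; real stop-word lists are longer.
STOP_WORDS = {"a", "an", "the"}


def index_terms(text, skip_stop_words=True):
    """Return the words of `text` that would go into the index."""
    words = text.lower().split()
    if skip_stop_words:
        return [w for w in words if w not in STOP_WORDS]
    return words


sentence = "the spider reads a page"
selective = index_terms(sentence)                          # skips 'a', 'an', 'the'
everything = index_terms(sentence, skip_stop_words=False)  # keeps every word
```

Here `selective` keeps only "spider", "reads" and "page", while `everything` keeps all five words.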
Robot Exclusion Protocol - Some site owners may not want their site, or parts of it, to show up in search engines. In that case, they can use the Robot Exclusion Protocol (a robots.txt file at the root of the site) to ask spiders to leave those pages out.
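To see how a well-behaved spider might honour such a request, here is a small sketch using Python's standard `urllib.robotparser` module. The robots.txt contents, the spider name `ExampleSpider` and the example.com URLs are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# A made-up robots.txt asking every spider to stay out of /private/.
robots_txt = """\
User-agent: *
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(robots_txt.splitlines())

# "ExampleSpider" is a hypothetical user-agent name.
allowed = parser.can_fetch("ExampleSpider", "http://example.com/index.html")
blocked = parser.can_fetch("ExampleSpider", "http://example.com/private/notes.html")
```

With these rules, `allowed` is `True` and `blocked` is `False`. Note that the protocol is only a request: it relies on the spider choosing to check the file before it crawls.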
While building the index, ranking is needed so that the most useful results can be presented to users first. Weights are assigned to entries based on many factors.
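One simple way to picture weighted ranking is a weighted sum over a few factors. The factors and weights below (`in_title`, `in_meta`, `body_count`) are assumptions chosen for illustration; real search engines use many more factors and do not publish their exact formulas.

```python
# Hypothetical factors and weights, chosen only for illustration.
WEIGHTS = {"in_title": 3.0, "in_meta": 2.0, "body_count": 0.5}


def score(entry, weights):
    """Weighted sum of the factors recorded for one index entry."""
    return sum(weights[factor] * entry.get(factor, 0) for factor in weights)


# Made-up index entries for a single search word.
entries = {
    "http://example.com/a": {"in_title": 1, "in_meta": 0, "body_count": 2},
    "http://example.com/b": {"in_title": 0, "in_meta": 1, "body_count": 8},
    "http://example.com/c": {"in_title": 0, "in_meta": 0, "body_count": 1},
}

# Highest score first.
ranked = sorted(entries, key=lambda url: score(entries[url], WEIGHTS), reverse=True)
```

With these numbers the scores are 4.0, 6.0 and 0.5, so page b ranks first, then a, then c. Changing the weights changes the order, which is why the choice of factors matters so much.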
Encoding is done to save storage space.
The search engine takes all the words the user enters, checks them against the websites it has indexed, and shows the pages with the highest relevance first.
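A minimal Python sketch of that lookup, assuming the index is a plain dictionary from word to the set of pages containing it. The index contents are made up, and counting matching words is only one crude stand-in for relevance.

```python
def search(query, index):
    """Return URLs ordered by how many of the query words they contain."""
    counts = {}
    for word in query.lower().split():
        for url in index.get(word, ()):
            counts[url] = counts.get(url, 0) + 1
    # Most matching words first; ties broken alphabetically by URL.
    return sorted(counts, key=lambda url: (-counts[url], url))


# Made-up index: word -> pages on which the word appears.
index = {
    "web": {"http://example.com/a", "http://example.com/b"},
    "spider": {"http://example.com/a"},
    "index": {"http://example.com/c"},
}

results = search("web spider", index)
```

Here page a matches both words and page b matches one, so `results` lists a before b, and page c is left out because it matches neither word.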