A search engine operates in the following order: 1) crawling; 2) deep crawling, which follows links depth-first (DFS); 3) fresh crawling, which follows links breadth-first (BFS); 4) indexing; 5) searching.
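The difference between the two crawl orders can be shown on a toy link graph. This is only a minimal sketch: the `LINKS` graph, the page names, and the `crawl` function are made up for illustration, and a real crawler would fetch pages over HTTP rather than read a dictionary.

```python
from collections import deque

# Hypothetical link graph: page -> pages it links to.
LINKS = {
    "home": ["news", "docs"],
    "news": ["story"],
    "docs": ["guide"],
    "story": [],
    "guide": [],
}

def crawl(start, links, breadth_first=True):
    """Return the pages in the order a crawler would visit them."""
    frontier = deque([start])
    seen = {start}
    order = []
    while frontier:
        # A queue (popleft) gives breadth-first order,
        # a stack (pop) gives depth-first order.
        page = frontier.popleft() if breadth_first else frontier.pop()
        order.append(page)
        for target in links.get(page, []):
            if target not in seen:
                seen.add(target)
                frontier.append(target)
    return order

print(crawl("home", LINKS, breadth_first=True))
# ['home', 'news', 'docs', 'story', 'guide']
print(crawl("home", LINKS, breadth_first=False))
# ['home', 'docs', 'guide', 'news', 'story']
```

Breadth-first order reaches every page close to the start page before going deeper, while depth-first order follows one chain of links to its end before backing up. The stack-based version visits the last link on a page first.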
Web search engines work by storing information about a large number of web pages, which they retrieve from the WWW itself. These pages are retrieved by a web crawler (also known as a spider), an automated web browser that follows every link it sees; site owners can exclude pages by using robots.txt. The contents of each page are then analyzed to determine how it should be indexed. Data about web pages is stored in an index database for use in later queries. Some search engines, such as Google, store all or part of the source page (referred to as a cache) as well as information about the web pages, whereas others, such as AltaVista, store every word of every page they find. The cached page always holds the actual search text, since it is the copy that was indexed, so it can be very useful when the current page has been updated and no longer contains the search terms.
This problem might be considered a mild form of linkrot. Google's handling of it increases usability and satisfies the principle of least astonishment, since the user normally expects the search terms to be on the returned pages. Increased search relevance makes these cached pages very useful, even beyond the fact that they may contain data that is no longer available elsewhere.
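To make the role of the cache concrete, here is a small sketch of an index that stores a copy of each page at indexing time. The `TinyIndex` class, the `tokenize` helper, and the example URL are assumptions made for illustration and do not describe how any real search engine stores its data.

```python
import re
from collections import defaultdict

def tokenize(text):
    """Split text into lowercase words."""
    return re.findall(r"[a-z0-9]+", text.lower())

class TinyIndex:
    """Hypothetical index: word -> URLs, plus a cached copy of each page."""

    def __init__(self):
        self.postings = defaultdict(set)  # word -> set of URLs containing it
        self.cache = {}                   # URL -> page text as it was indexed

    def add(self, url, text):
        self.cache[url] = text
        for word in tokenize(text):
            self.postings[word].add(url)

    def lookup(self, word):
        return set(self.postings.get(word.lower(), set()))

index = TinyIndex()
url = "http://example.test/page"          # made-up URL
index.add(url, "History of the zipper")

# The live page changes after it was indexed.
live_page = "Welcome to our new home page"

print(index.lookup("zipper"))                  # still finds the URL
print("zipper" in tokenize(live_page))         # False: term is gone from the live page
print("zipper" in tokenize(index.cache[url]))  # True: term is still in the cached copy
```

The search term leads to the page because of what was indexed, so only the cached copy is guaranteed to contain it.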
When a user comes to the search engine and makes a query, typically by giving keywords, the engine looks up the index and provides a listing of best-matching web pages according to its criteria, usually with a short summary containing the document's title and sometimes parts of the text. Most search engines support the Boolean operators AND, OR and NOT to further specify the search query. An advanced feature is proximity search, which lets the user define the maximum distance between keywords.
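Boolean operators and proximity search map naturally onto set operations over an index that records word positions. The following is a rough sketch under that assumption; the sample documents and the function names `build_positions`, `docs_with`, and `near` are invented for the example.

```python
import re
from collections import defaultdict

def tokenize(text):
    """Split text into lowercase words."""
    return re.findall(r"[a-z0-9]+", text.lower())

def build_positions(docs):
    """Map each word to {doc_id: [positions where the word occurs]}."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for pos, word in enumerate(tokenize(text)):
            index[word].setdefault(doc_id, []).append(pos)
    return index

def docs_with(index, word):
    """Set of documents containing the word."""
    return set(index.get(word.lower(), {}))

def near(index, first, second, max_distance):
    """Documents where both words occur within max_distance positions."""
    a = index.get(first.lower(), {})
    b = index.get(second.lower(), {})
    return {
        doc_id
        for doc_id in a.keys() & b.keys()
        if any(abs(i - j) <= max_distance for i in a[doc_id] for j in b[doc_id])
    }

DOCS = {  # made-up sample documents
    1: "web crawler follows every link",
    2: "the search engine index stores web pages",
    3: "a crawler builds the index for the search engine",
}
INDEX = build_positions(DOCS)

print(docs_with(INDEX, "crawler") & docs_with(INDEX, "index"))  # crawler AND index -> {3}
print(docs_with(INDEX, "crawler") | docs_with(INDEX, "web"))    # crawler OR web -> {1, 2, 3}
print(docs_with(INDEX, "index") - docs_with(INDEX, "web"))      # index NOT web -> {3}
print(near(INDEX, "crawler", "index", 3))                       # within 3 words -> {3}
print(near(INDEX, "crawler", "index", 2))                       # within 2 words -> set()
```

AND becomes set intersection, OR becomes union, and NOT becomes set difference. Proximity search needs the stored positions, which is why the index keeps them rather than only a list of documents.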
The usefulness of a search engine depends on the relevance of the results it gives back. While there may be millions of Web pages that include a particular word or phrase, some pages may be more relevant, popular, or authoritative than others. Most search engines employ methods to rank the results to provide the "best" results first. How a search engine decides which pages are the best matches, and what order the results should be shown in, varies widely from one engine to another. The methods also change over time as Internet usage changes and new techniques evolve.
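As a very rough illustration of ranking, the sketch below orders pages by how often the query words appear in them. This is a deliberately naive stand-in, not the method of any particular engine; real engines combine many more signals, and the `rank` function and sample pages are made up for the example.

```python
import re
from collections import Counter

def tokenize(text):
    """Split text into lowercase words."""
    return re.findall(r"[a-z0-9]+", text.lower())

def rank(query, docs):
    """Return IDs of matching documents, highest term count first."""
    terms = tokenize(query)
    scored = []
    for doc_id, text in docs.items():
        counts = Counter(tokenize(text))
        score = sum(counts[term] for term in terms)
        if score > 0:
            scored.append((score, doc_id))
    # Sort by descending score; break ties by document ID for stable output.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for _, doc_id in scored]

PAGES = {  # made-up sample pages
    "a": "search engine tips for better search results",
    "b": "engine room maintenance",
    "c": "gardening for beginners",
}

print(rank("search engine", PAGES))  # ['a', 'b']
```

Page "a" matches three query-word occurrences and page "b" only one, so "a" is listed first; page "c" matches nothing and is left out.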
Most web search engines are commercial ventures supported by advertising revenue and, as a result, some employ the controversial practice of allowing advertisers to pay money to have their listings ranked higher in search results.
The vast majority of search engines are run by private companies using proprietary algorithms and closed databases, the most popular currently being Google, MSN Search, and Yahoo! Search. However, open-source search engine technology does exist, such as ht://Dig, Nutch, Senas, Egothor, OpenFTS, DataparkSearch, and many others.