Wednesday 11 September 2013

Search Engine Related Questions with Answers



Q: What is a spider?
Ans: A spider, also called a bot, crawler or robot, is a computer program that browses the World Wide Web in a methodical, automated fashion. It scans web pages and websites for new or updated content and downloads a copy to the search engine's data center for indexing.
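
To make the crawling idea concrete, here is a minimal sketch of a breadth-first spider. It does not touch the network: the dictionary SAMPLE_PAGES is a made-up three-page site standing in for the web, and the names LinkCollector and crawl are hypothetical. A real spider would also fetch pages over HTTP, respect robots.txt and revisit pages on a schedule.

```python
from collections import deque
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


def crawl(pages, start):
    """Breadth-first crawl over `pages`, a dict of URL -> HTML (stand-in for the web)."""
    index = {}                 # URL -> stored copy of the page
    queue = deque([start])
    while queue:
        url = queue.popleft()
        if url in index or url not in pages:
            continue           # already visited, or the page does not exist
        index[url] = pages[url]        # keep a copy for indexing
        parser = LinkCollector()
        parser.feed(pages[url])
        queue.extend(parser.links)     # schedule the links found on this page
    return index


# Hypothetical site used only for illustration.
SAMPLE_PAGES = {
    "/": '<a href="/about">About</a> <a href="/blog">Blog</a>',
    "/about": '<a href="/">Home</a>',
    "/blog": '<a href="/missing">Broken link</a>',
}

if __name__ == "__main__":
    for url in crawl(SAMPLE_PAGES, "/"):
        print("indexed", url)
```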

Q: Name the bots (spiders) of the major search engines.
Ans: Google's spider is Googlebot, Yahoo's is Yahoo Slurp and Bing's is Bingbot.

Q: Can you differentiate ‘nofollow’ and ‘dofollow’ links?
Ans: A nofollow link is the opposite of a dofollow link. It carries the rel="nofollow" attribute, which asks search engine bots not to follow the link or pass ranking credit through it, so it is generally not counted as a backlink for the target page. We use it when we do not want to vouch for a link.
A dofollow link is an ordinary hyperlink without that attribute ("dofollow" is an informal term, not an attribute value). Search engine crawlers pass through it, and search engines such as Google, Bing and Yahoo count it as a backlink for your website, which influences PageRank and can improve your site's ranking.
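
As a rough sketch of how a crawler might tell the two kinds of link apart, the snippet below sorts the links on a page by whether their rel attribute contains nofollow. The class and function names are hypothetical, and real search engines may weigh other signals as well.

```python
from html.parser import HTMLParser


class RelClassifier(HTMLParser):
    """Sorts <a href> links into followed and nofollow lists."""

    def __init__(self):
        super().__init__()
        self.followed = []
        self.nofollow = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        attrs = dict(attrs)
        href = attrs.get("href")
        if not href:
            return
        # rel may hold several space-separated tokens, e.g. "noopener nofollow"
        rel_tokens = (attrs.get("rel") or "").lower().split()
        if "nofollow" in rel_tokens:
            self.nofollow.append(href)
        else:
            self.followed.append(href)


def classify_links(html):
    """Return (followed, nofollow) lists of href values found in `html`."""
    parser = RelClassifier()
    parser.feed(html)
    return parser.followed, parser.nofollow
```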

Q: Define PageRank.
Ans: PageRank is a link analysis algorithm named after Larry Page and used by Google to assign a numerical weight to each element of a hyperlinked set of documents, such as the World Wide Web. The publicly displayed (Toolbar) PageRank is a whole number from 0 to 10; decimals are not shown. A page's PageRank is calculated from its inbound links.
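
To show how inbound links drive the score, here is a minimal sketch of the textbook PageRank iteration. The damping factor of 0.85, the function name and the tiny example graph are assumptions for illustration. This is not Google's production algorithm, and it yields fractional scores rather than the whole-number value described above.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Textbook PageRank by repeated iteration.

    links: dict mapping each page to the list of pages it links to.
    Returns a dict of page -> score; the scores sum to 1.
    """
    pages = list(links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page starts each round with the same small base score.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page, outlinks in links.items():
            targets = [t for t in outlinks if t in links]
            if not targets:
                # A page with no outbound links spreads its score evenly.
                for p in pages:
                    new_rank[p] += damping * rank[page] / n
            else:
                share = damping * rank[page] / len(targets)
                for t in targets:
                    new_rank[t] += share
        rank = new_rank
    return rank


if __name__ == "__main__":
    # Hypothetical graph: pages "a" and "b" link to "c", and "c" links back to "a".
    example = {"a": ["c"], "b": ["c"], "c": ["a"]}
    for page, score in sorted(pagerank(example).items()):
        print(page, round(score, 3))
```

In this example page "b" has no inbound links, so it ends up with the lowest score, while "c" collects links from both other pages and ends up with the highest.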

Q: What is the difference between PR and SERP?
Ans: PR is PageRank, a measure of the importance of a web page or website that is determined by the quality inbound links it receives from other websites or web pages.

SERP stands for Search Engine Result Page. It is the page a search engine returns in response to a search query, and a site's SERP position is its placement on that page.

Q: What is Cache?
Ans: Caching is a process performed by search engine crawlers at regular intervals. The crawler scans each page on the World Wide Web, takes a snapshot and stores it as a backup copy. Almost every search engine result page includes a cached link for each result. Clicking the cached link shows the last version of that page cached by Google rather than the current version. You can also prefix the desired URL with “cache:”, for example “cache:http://www.webgranth.com”, to view its cached version.

Q: What do you know about AdSense?
Ans: AdSense is a program run by Google that lets publishers of content websites automatically serve text, image, video and rich media advertisements that are relevant to the site's content and audience. These advertisements are administered, maintained and sorted by Google, and publishers earn money on either a per-click or a per-impression basis.

Q: What do you know about AdWords?
Ans: AdWords is Google's main advertising product. It makes your ads appear on Google and its partner websites, including Google Search. It offers PPC (Pay Per Click) advertising as its primary module, with CPC (Cost Per Click) as a sub-module: we bid a rate that is charged only when a user clicks the advertisement. Another sub-module is CPM (Cost Per Thousand Impressions) advertising, where the advertiser pays a flat rate for every thousand impressions. In addition, it includes site-targeted advertising with banner, text and rich media ads. The ads are shown mainly to people who are already looking for the type of product you offer, and you can choose particular sites and geographical areas in which to show them.

Q: What is PPC?
Ans: PPC stands for Pay Per Click, an advertising model that Google offers through AdWords. It can be seen as a primary module with two sub-modules, CPC (cost per click) and CPM (cost per thousand impressions), priced through bidding and a flat rate respectively. Under CPC, the advertiser is charged only when a user clicks the advert.
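
The two pricing models come down to simple arithmetic. The sketch below uses hypothetical function names and made-up rates purely for illustration; actual rates depend on the bidding outcome and the campaign.

```python
def cpc_cost(clicks, cost_per_click):
    """Total charge under CPC: the advertiser pays only for clicks."""
    return clicks * cost_per_click


def cpm_cost(impressions, cost_per_thousand):
    """Total charge under CPM: the advertiser pays per thousand impressions."""
    return impressions / 1000 * cost_per_thousand


if __name__ == "__main__":
    # Made-up campaign: 10,000 impressions that produced 40 clicks.
    print("CPC total:", cpc_cost(clicks=40, cost_per_click=0.50))
    print("CPM total:", cpm_cost(impressions=10_000, cost_per_thousand=2.00))
```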

Q: What do you know about RSS?
Ans: RSS stands for Really Simple Syndication. It is used to publish frequently updated work such as news headlines and blog entries. An RSS document, also known as a web feed, feed or channel, contains summarized text plus metadata such as authorship and publishing dates.
RSS feeds benefit publishers by syndicating content automatically. The standardized XML file format lets information be published once and read by many different programs. It also makes it easier for readers to get timely updates by subscribing to feeds from their favorite sites.
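
Because a feed is plain XML, any program can read it. Here is a minimal sketch that pulls the title, link and date out of each item using Python's standard library. SAMPLE_FEED is a made-up RSS 2.0 document and read_items is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET

# Made-up RSS 2.0 feed used only for illustration.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example Blog</title>
    <link>https://example.com/</link>
    <description>Hypothetical feed</description>
    <item>
      <title>First post</title>
      <link>https://example.com/first-post</link>
      <pubDate>Wed, 11 Sep 2013 09:00:00 GMT</pubDate>
    </item>
    <item>
      <title>Second post</title>
      <link>https://example.com/second-post</link>
      <pubDate>Thu, 12 Sep 2013 09:00:00 GMT</pubDate>
    </item>
  </channel>
</rss>"""


def read_items(feed_xml):
    """Return a list of (title, link, pubDate) tuples, one per <item>."""
    root = ET.fromstring(feed_xml)
    return [
        (item.findtext("title"), item.findtext("link"), item.findtext("pubDate"))
        for item in root.iter("item")
    ]


if __name__ == "__main__":
    for title, link, published in read_items(SAMPLE_FEED):
        print(published, title, link)
```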

Q: How would you define Alexa?
Ans: Alexa is a California-based subsidiary of Amazon.com that is widely known for its website and toolbar. The Alexa toolbar collects browsing behavior data and sends it to the Alexa website, where the data is stored and analyzed to create the company's web traffic reports. Alexa also provides traffic data, global rankings and other information about a website.

Q: What is a sitemap, and what is the difference between an HTML sitemap and an XML sitemap?
Ans: A sitemap is a list of the web pages of a site that are accessible to users or crawlers. It can be a document in any form used as a planning tool for web design, or a web page that lists the pages of a site, typically organized hierarchically. It helps search engine bots and users find the pages on a website. A sitemap makes our website more search engine friendly and improves the chances of frequent indexing.

An HTML sitemap is an ordinary web page built for users' convenience and can be designed like the rest of the site. An XML sitemap, on the other hand, is meant only for search engine crawlers or spiders and is not visible to users. It sits in the root of the website.
Which is better: an HTML site map or XML Sitemap? By Matt Cutts
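
For a feel of what an XML sitemap contains, here is a minimal sketch that builds one from a list of URLs. The namespace is the one defined by the sitemaps.org protocol; the function name and the example URLs are hypothetical, and optional fields such as lastmod are left out.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a bare-bones XML sitemap (as a string) listing the given URLs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url   # <loc> holds the page address
    return ET.tostring(urlset, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical pages used only for illustration.
    print(build_sitemap(["https://example.com/", "https://example.com/about"]))
```

The resulting text would normally be saved as sitemap.xml in the root of the website.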

Q: What is the significance of the robots.txt file in a website?
Ans: The robots.txt file is a convention for asking cooperating web robots and crawlers not to access all or part of a website that is publicly viewable but that we do not want crawled and indexed. Search engines, which use robots to archive and categorize websites, read it to learn which areas of our website they should not visit.
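
Here is a small sketch of how a cooperating crawler could check robots.txt rules before fetching a page, using Python's standard urllib.robotparser module. The rules in SAMPLE_ROBOTS_TXT, the bot name ExampleBot and the example.com URLs are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Made-up robots.txt that keeps all bots out of two directories.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/
Disallow: /scripts/
"""


def make_parser(robots_txt):
    """Build a parser from robots.txt text instead of downloading the file."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


if __name__ == "__main__":
    rules = make_parser(SAMPLE_ROBOTS_TXT)
    for url in ("https://example.com/blog/post", "https://example.com/admin/login"):
        print(url, "allowed" if rules.can_fetch("ExampleBot", url) else "blocked")
```

Keep in mind that robots.txt is only a request: well-behaved bots honor it, but it does not technically block access.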

Q: Is HTML case sensitive or case insensitive?
Ans: HTML is case insensitive. Tag and attribute names give identical results whether you write them in upper case or lower case.

Q: Under what circumstances would you remove pages from search engines through robots.txt rather than the Meta robots tag?
Ans: Generally, I would use robots.txt to keep search engines from indexing a whole directory on a website. This is often a directory that serves an admin function or that contains only scripts or an image gallery. In short, robots.txt is used to stop search engine bots from crawling a directory together with its sub-folders and files, while the Meta robots tag is used for a specific web page.
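
For the page-level case, here is a minimal sketch of how a crawler might read the Meta robots tag of a single page. The class and function names are hypothetical, and only the generic name="robots" tag is handled, not bot-specific variants.

```python
from html.parser import HTMLParser


class MetaRobotsParser(HTMLParser):
    """Collects the directives listed in <meta name="robots" content="...">."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attrs = dict(attrs)
        if (attrs.get("name") or "").lower() != "robots":
            return
        content = attrs.get("content") or ""
        # Directives are comma separated, e.g. "noindex, nofollow".
        for directive in content.split(","):
            directive = directive.strip().lower()
            if directive:
                self.directives.add(directive)


def robots_directives(html):
    """Return the set of Meta robots directives found in `html`."""
    parser = MetaRobotsParser()
    parser.feed(html)
    return parser.directives


if __name__ == "__main__":
    page = '<html><head><meta name="robots" content="noindex, follow"></head></html>'
    print(robots_directives(page))
```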
