Creating Robots.txt File and its Importance

Written by San Christopher


Continued from page 1

Let's suppose yours is a dynamic, database-driven site that holds information about your newsletter subscribers and customers: their addresses, phone numbers and so on. All this confidential information is kept in a separate directory called "admin". (Keeping such information in a separate directory is recommended. It makes the data easier to handle, and it makes it easier to keep the search engines away, as we will see shortly.) I am sure you would never want any unauthorized person to visit this area, let alone the search engines. Indexing it does not help the search engines either, since they have nothing to do with the data or files there. This is where the robots.txt file comes in. Write the following in the robots.txt file. (Ignore the horizontal rules; they are included only to separate the commands from the rest of the text.)

--------------------------------------------------------------------------------

User-agent: *
Disallow: /admin/

--------------------------------------------------------------------------------

This stops the spiders from indexing anything in the admin directory, including any sub-directories.
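As a rough way to check a rule like this before uploading it, the following sketch uses Python's standard urllib.robotparser module. Python itself, the helper name is_allowed and the page names subscribers.html and reports/2005.html are assumptions made for illustration; none of them comes from the article. Real spiders may interpret rules slightly differently from this module.

```python
from urllib.robotparser import RobotFileParser

# The rule from the article: keep every robot out of /admin/.
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if user_agent may fetch url under the given rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical pages on the placeholder domain used later in the article.
SITE = "http://www.your-domain.com"
for path in ["/index.html", "/admin/subscribers.html", "/admin/reports/2005.html"]:
    verdict = "allowed" if is_allowed(ROBOTS_TXT, "Googlebot", SITE + path) else "blocked"
    print(path, verdict)
```

Everything under /admin/, including the sub-directory, should be reported as blocked, while the index page stays open.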

The asterisk (*) stands for all search engines. How do you stop one particular search engine from spidering your files or directories?

Suppose you want to stop Excite from spidering this directory:

--------------------------------------------------------------------------------

User-agent: ArchitextSpider
Disallow: /admin/

--------------------------------------------------------------------------------

Suppose you want to stop Excite and Google from spidering this directory:

--------------------------------------------------------------------------------

User-agent: ArchitextSpider
Disallow: /admin/

User-agent: Googlebot
Disallow: /admin/

--------------------------------------------------------------------------------

Files are no different. Suppose you do not want the file datafile.html to be spidered by Excite:

--------------------------------------------------------------------------------

User-agent: ArchitextSpider
Disallow: /datafile.html

--------------------------------------------------------------------------------

Similarly, suppose you do not want it to be spidered by Google either:

--------------------------------------------------------------------------------

User-agent: ArchitextSpider
Disallow: /datafile.html

User-agent: Googlebot
Disallow: /datafile.html

--------------------------------------------------------------------------------

Suppose you do not want two files, datafile1.html and datafile2.html, to be spidered by Excite:

--------------------------------------------------------------------------------

User-agent: ArchitextSpider
Disallow: /datafile1.html
Disallow: /datafile2.html

--------------------------------------------------------------------------------
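Every example so far follows the same pattern: one User-agent line, then one Disallow line per blocked path, with a blank line between records. A minimal sketch of generating that text from a mapping is shown here. The function name build_robots_txt is made up for illustration, and the sketch only covers the two commands used in this article.

```python
def build_robots_txt(rules):
    """Build robots.txt text from a mapping of robot name -> blocked paths."""
    records = []
    for agent, paths in rules.items():
        lines = ["User-agent: " + agent]
        # One Disallow line for each path this robot must skip.
        lines.extend("Disallow: " + path for path in paths)
        records.append("\n".join(lines))
    # A blank line separates one robot's record from the next.
    return "\n\n".join(records) + "\n"


print(build_robots_txt({
    "ArchitextSpider": ["/datafile1.html", "/datafile2.html"],
    "Googlebot": ["/admin/"],
}))
```

Writing each command on its own line matters: a User-agent and a Disallow squeezed onto one line do not form a valid record.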

Can you guess what the following means?

--------------------------------------------------------------------------------

User-agent: ArchitextSpider
Disallow: /datafile1.html
Disallow: /datafile2.html

User-agent: Googlebot
Disallow: /datafile1.html

--------------------------------------------------------------------------------

Excite will not spider datafile1.html or datafile2.html, while Google will skip only datafile1.html. Google will still spider datafile2.html and the rest of the files in the directory.
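The answer can be double-checked with a short script. This is only a sketch, assuming Python's standard urllib.robotparser module; the helper access_table and the extra page index.html are hypothetical additions for illustration.

```python
from urllib.robotparser import RobotFileParser

# The two records from the quiz above, one command per line.
ROBOTS_TXT = """\
User-agent: ArchitextSpider
Disallow: /datafile1.html
Disallow: /datafile2.html

User-agent: Googlebot
Disallow: /datafile1.html
"""


def access_table(robots_txt, agents, paths, site="http://www.your-domain.com"):
    """Map each (agent, path) pair to True (allowed) or False (blocked)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {
        (agent, path): parser.can_fetch(agent, site + path)
        for agent in agents
        for path in paths
    }


table = access_table(
    ROBOTS_TXT,
    agents=["ArchitextSpider", "Googlebot"],
    paths=["/datafile1.html", "/datafile2.html", "/index.html"],
)
for (agent, path), allowed in table.items():
    print(agent, path, "allowed" if allowed else "blocked")
```

Only the Googlebot row for datafile2.html and the two index.html rows should come out as allowed.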

Imagine you have a file in a sub-directory that you do not want spidered. What do you do? Let's suppose the sub-directory is "official" and the file is "confidential.html".

--------------------------------------------------------------------------------

User-agent: *
Disallow: /official/confidential.html

--------------------------------------------------------------------------------

I hope that's enough. A little practice is of course required. If a command in your robots.txt file is not written with the correct syntax, the search engines will ignore that command. Before uploading the robots.txt file, double-check it for errors. You should upload the robots.txt file to the ROOT directory of your server. The search engines look for robots.txt only in the root directory; if it is anywhere else, they ignore it completely. In most cases the root directory is the directory where the index page is kept. In that case, keep the robots.txt file in the same directory as the index file.
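A few of the most common slips can be caught by a small script before the file is uploaded. The sketch below is a deliberately simple check, not a full validator. It assumes only the two commands used in this article are valid, treats text after "#" as a comment, and treats a blank line as the end of a record. The function name lint_robots_txt is hypothetical.

```python
KNOWN_FIELDS = ("user-agent", "disallow")


def lint_robots_txt(text):
    """Return (line_number, message) pairs for lines that look malformed."""
    problems = []
    in_record = False  # True once a User-agent line has opened a record
    for number, raw in enumerate(text.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()  # ignore comments
        if not line:
            if not raw.strip():
                in_record = False  # a truly blank line ends the record
            continue
        field, separator, value = line.partition(":")
        field = field.strip().lower()
        if not separator:
            problems.append((number, "missing ':' after the field name"))
        elif field not in KNOWN_FIELDS:
            problems.append((number, "unknown field: " + field))
        elif any(name + ":" in value.lower() for name in KNOWN_FIELDS):
            problems.append((number, "more than one command on the line"))
        elif field == "user-agent":
            in_record = True
        elif not in_record:
            problems.append((number, "Disallow without a preceding User-agent"))
    return problems


print(lint_robots_txt("User-agent: *\nDisallow: /admin/\n"))
print(lint_robots_txt("User-agent: * Disallow: /admin/\n"))
print(lint_robots_txt("Disalow: /admin/\n"))
```

The first call should report no problems; the other two should each flag line 1.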

I know of a user-friendly program that will write the robots commands for you (it is introduced at the beginning of this article). This program, RoboGen, is a great tool that makes it easy to produce an error-free robots.txt file. You never again need to check the syntax of your robots.txt file, or even write one yourself. RoboGen is a visual editor for Robot Exclusion Files and is easy to use. Just select the files you do or do not want the search engines to visit, and it creates the robots.txt file. You can also select the search engines of your choice: RoboGen maintains a database of over 180 search engine user-agents, which are selectable from a drop-down menu. It is the BEST and ONLY software on the Internet to write a robots.txt file correctly and effectively, and it is cheaper than you might expect.

Note: You should be able to see your robots.txt file if you type the following in the address bar of your Internet browser.

http://www.your-domain.com/robots.txt

(Here your-domain is the domain name of your website. If yours is not a .com site, replace .com with your site's own extension, e.g. .net, .us or .org.)
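Because the file always sits at the root, its address can be worked out from the address of any page on the site. Here is a small sketch using Python's standard urllib.parse module; the function name robots_txt_url is an assumption for illustration.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_txt_url(page_url):
    """Return the root-level robots.txt address for the site serving page_url."""
    parts = urlsplit(page_url)
    # Keep the scheme and host; replace the path and drop query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


print(robots_txt_url("http://www.your-domain.com/official/confidential.html"))
```

However deep the page is, the result points at the root directory, which is the only place the search engines look.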

You must be wondering whether to use the meta tag or robots.txt, and which of the two is more effective.

A correctly written robots.txt is more effective than the meta tag. All search engines support robots.txt, but not all search engines support robots commands written in meta tags. I recommend that you use both, so that your site is covered in both scenarios. RoboGen will help you to write both!
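For comparison, the meta tag goes inside the head section of each individual page rather than in one file at the root. The sketch below uses Python's standard html.parser module to read the robots commands out of a page. The sample page and the names RobotsMetaParser and robots_meta_directives are made up for illustration.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect the commands found in <meta name="robots" ...> tags."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = dict(attrs)
        if (attributes.get("name") or "").lower() == "robots":
            content = attributes.get("content") or ""
            # Commands are comma-separated, e.g. "noindex, nofollow".
            self.directives.extend(
                part.strip().lower() for part in content.split(",") if part.strip()
            )


def robots_meta_directives(html):
    """Return the list of robots meta commands in an HTML page."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives


# Hypothetical page that asks robots not to index it or follow its links.
PAGE = """<html><head>
<title>Confidential</title>
<meta name="robots" content="noindex, nofollow">
</head><body><p>Private page</p></body></html>"""

print(robots_meta_directives(PAGE))
```

A page with no robots meta tag yields an empty list, which means the page itself places no restriction on spiders.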

One last thing: you can look in your web server log files to see which search engine robots have visited. They all leave signatures that can be detected. These signatures are simply the names of their robots. For instance, if Google has spidered your site, the log will contain entries whose user-agent is Googlebot. This is how you know which search engine has spidered your pages, and when!
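A short script can do this counting for you. The sketch below assumes the server writes the common "combined" log format, in which the user-agent is the last quoted field on each line; your server may be configured differently. The sample log lines, IP addresses and the function name count_robot_visits are invented for illustration.

```python
import re
from collections import Counter

# Robot names used in this article; extend the list as needed.
ROBOT_NAMES = ["Googlebot", "ArchitextSpider"]

# Assumption: the user-agent is the last quoted field on the line.
USER_AGENT_FIELD = re.compile(r'"([^"]*)"\s*$')


def count_robot_visits(log_lines, robot_names=ROBOT_NAMES):
    """Count log entries whose user-agent contains a known robot name."""
    visits = Counter()
    for line in log_lines:
        match = USER_AGENT_FIELD.search(line)
        if not match:
            continue  # line is not in the expected format
        user_agent = match.group(1).lower()
        for name in robot_names:
            if name.lower() in user_agent:
                visits[name] += 1
    return visits


# Made-up sample entries: two robot visits and one ordinary browser visit.
SAMPLE_LOG = [
    '192.0.2.10 - - [10/Mar/2005:06:25:14 +0000] "GET /index.html HTTP/1.1" 200 5120 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
    '198.51.100.7 - - [10/Mar/2005:06:26:02 +0000] "GET /index.html HTTP/1.1" 200 5120 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"',
    '192.0.2.10 - - [10/Mar/2005:06:27:40 +0000] "GET /robots.txt HTTP/1.1" 200 68 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
]

print(count_robot_visits(SAMPLE_LOG))
```

The date inside the square brackets of each matching entry tells you when the robot came by.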

Senior Manager - Internet Promotions http://www.searchengineoptimizationpromotion.com


Search Engines: Different Types, Different Strategies

Written by Terry Nicholls


Continued from page 1

There's a catch, of course. The most popular keywords have become quite expensive at Overture.com (the first and biggest PPC engine) and are rising at the others.

Directories

Directories are different from Search Engines in that they do not spider pages. Humans review each submission, visit each site, and decide what gets in.

Search engines and directories provide search results for each other. If a search turns up nothing in the directory's own database of sites, the directory will show search results from one of the spidered engines. All the directories use one of the major engines.

The reverse is also true. Most Search Engines also provide directory results, in addition to their own search results. All of them use one of the "Big 3": Yahoo!, Open Directory, or LookSmart.

Adapt Or Disappear

The differences between the types of Search Engines require that you adapt your strategy to take maximum advantage of each engine. We'll help you with that.

For a more detailed explanation of these Search Engines, along with specific strategies and mistakes to avoid, please visit My Home-Based Business Advisor.

Terry Nicholls My Home-Based Business Advisor www.my-home-based-business-advisor.com

Copyright © by Terry Nicholls. All Rights Reserved.

Terry Nicholls writes from his own experiences in trying to start his own home-based business. To benefit from his success, visit My Home-Based Business Advisor - Helping YOUR Home Business Start and Succeed for free help for YOUR home business, including ideas, startup, and expansion advice.

ImproveHomeLife.com © 2005