Importance of robots.txt

Singapore SEO Specialist | September 20th, 2009 Leave a comment Go to comments

When search engine’s crawler visits your website, one special file it will look for on your website is the “robots.txt” file.

The robots.txt file is a simple text file that resides in the root directory of your domain. The content of a robots.txt file consists of information that inform the search engines which directories or pages of your website should be indexed and which should be ignored.

For blogs that use WordPress, a SEO friendly content management software, an example of the robots.txt is as follow:

User-agent: *
Disallow: /cgi-bin
Disallow: /wp-admin
Disallow: /wp-includes
Disallow: /wp-content/plugins
Disallow: /wp-content/cache
Disallow: /wp-content/themes
Disallow: /trackback
Disallow: /feed
Disallow: /comments
Disallow: /category
Disallow: /page
Disallow: /tag

# Google Image
User-agent: Googlebot-Image
Disallow:
Allow: /*

# Google AdSense
User-agent: Mediapartners-Google*
Disallow:
Allow: /*

Sitemap: http://www.yourdomain.com/sitemap.xml”

Share and Enjoy:
  • Digg
  • del.icio.us
  • Facebook
  • BlinkList
  • Propeller
  • Reddit
  • StumbleUpon
  • Technorati
  • Twitter
  • Yahoo! Buzz
  1. No comments yet.
You must be logged in to post a comment.