Drupal SEO – using robots.txt to avoid content duplication
Google really doesn't like duplicate content, so it is advisable to prevent the Google crawler from reaching the same content on your site through more than one URL. Because Drupal offers many ways of reaching the same content, you should block certain URLs and URL paths with a suitable robots.txt.
Before version 5, Drupal didn't ship with a built-in robots.txt, so review the bundled robots.txt if you're running Drupal 5 or 6, and add a robots.txt file yourself if you're running a version lower than 5.
Here's my suggested robots.txt, a lightly adjusted version of the one that comes with Drupal 5 (notice, for example, the rules disallowing feeds and taxonomy pages):
User-agent: Googlebot
Crawl-delay: 10
# Directories
Disallow: /tracker/
Disallow: /xtracker/
Disallow: /user/
Disallow: /book/export/
Disallow: /forward/
Disallow: /database/
Disallow: /includes/
Disallow: /misc/
Disallow: /modules/
Disallow: /sites/
Disallow: /themes/
Disallow: /scripts/
Disallow: /updates/
Disallow: /profiles/
Disallow: /comment/
Disallow: /taxonomy/
Disallow: /popular/
Disallow: */feed*
Disallow: */comment/reply/
Disallow: /popular/*
Disallow: /comments*
Disallow: /frontpage*
Disallow: /aggregator*
Disallow: /aggregator2*
Disallow: *?sort=asc&order=Time*
Disallow: *comment-*
# Files
Disallow: /rss.xml
Disallow: /feed
Disallow: /xmlrpc.php
Disallow: /cron.php
Disallow: /update.php
Disallow: /install.php
Disallow: /INSTALL.mysql.txt
Disallow: /INSTALL.pgsql.txt
Disallow: /CHANGELOG.txt
Disallow: /MAINTAINERS.txt
Disallow: /LICENSE.txt
Disallow: /UPGRADE.txt
# Paths (clean URLs)
Disallow: /admin/
Disallow: /comment/reply/
Disallow: /contact/
Disallow: /logout/
Disallow: /node/add/
Disallow: /search/
Disallow: /user/register/
Disallow: /user/password/
Disallow: /user/login/
# Paths (no clean URLs)
Disallow: /?q=admin/
Disallow: /?q=comment/reply/
Disallow: /?q=contact/
Disallow: /?q=logout/
Disallow: /?q=node/add/
Disallow: /?q=search/
Disallow: /?q=user/password/
Disallow: /?q=user/register/
Disallow: /?q=user/login/
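
Before uploading rules like the ones above, it can help to check which of your paths they would actually block. The following is a minimal Python sketch, not an official tool: it assumes Googlebot-style wildcard matching (`*` matches any run of characters, a trailing `$` anchors the end of the path), it only looks at Disallow lines, and it ignores User-agent groups and Allow rules. The function names and the sample paths are made up for illustration.

```python
import re


def rule_to_regex(path_pattern):
    """Turn one Disallow pattern into a compiled regex.

    Assumption: '*' means "any characters" and a trailing '$' means
    "end of path"; everything else is matched literally.
    """
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    body = ".*".join(re.escape(part) for part in path_pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def parse_disallows(robots_txt):
    """Collect Disallow patterns, skipping comments and other directives."""
    patterns = []
    for line in robots_txt.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if line.lower().startswith("disallow:"):
            value = line.split(":", 1)[1].strip()
            if value:  # an empty Disallow blocks nothing
                patterns.append(value)
    return patterns


def is_blocked(url_path, patterns):
    """True if any pattern matches from the start of the path."""
    return any(rule_to_regex(p).match(url_path) for p in patterns)


# A few rules taken from the listing above (hypothetical test paths below).
SAMPLE = """\
User-agent: Googlebot
Disallow: /taxonomy/
Disallow: */feed*
Disallow: *comment-*
Disallow: /?q=admin/
"""

patterns = parse_disallows(SAMPLE)
for path in ["/node/12", "/node/12/feed", "/taxonomy/term/3", "/?q=admin/settings"]:
    print(path, "blocked" if is_blocked(path, patterns) else "allowed")
```

Running your own list of paths through `is_blocked` is a quick way to spot a rule that is broader than intended, such as a wildcard that also catches ordinary node pages.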
