WordPress SEO: using robots.txt to avoid content duplication
Google really doesn’t like duplicate content, so it is advisable to prevent the Google crawler from reaching the same content on your site through more than one URL. Since WordPress offers many ways of reaching the same content (feeds, category pages, trackback URLs and so on), you should block certain URLs and URL paths with a suitable robots.txt.
Here’s my suggestion for a WordPress robots.txt:
User-agent: Googlebot
# Disallow all directories and files within
Disallow: /cgi-bin/
Disallow: /wp-admin/
Disallow: /wp-includes/
# Disallow all files ending with these extensions
Disallow: /*.php$
Disallow: /*.js$
Disallow: /*.inc$
Disallow: /*.css$
# Disallow crawling of individual post feeds, categories and trackbacks
Disallow: */trackback/
Disallow: */feed/
Disallow: /category/*
Be extremely careful when implementing this. For example, some WordPress installations have Gallery2 embedded, which, for reasons unknown, likes to run with main.php in the URL (even with URL rewriting enabled!), so the rule that blocks .php files may also block gallery pages. Furthermore, if your blog lives in a sub-directory of your domain and you change the robots.txt for the entire domain, note that you might block essential pages in other sub-directories. I imagine this is why robots.txt isn't included in the default WordPress installation.
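One way to be careful is to check a handful of your own URLs against the rules before uploading the file. The following is a minimal Python sketch, not a full robots.txt parser: it assumes Google-style wildcards (`*` matches any run of characters, a trailing `$` anchors the end of the URL), it ignores Allow rules and rule precedence, and the sample paths and the `blocking_rule` helper are made up for illustration.

```python
import re

# Disallow patterns copied from the robots.txt suggested above.
DISALLOW = [
    "/cgi-bin/", "/wp-admin/", "/wp-includes/",
    "/*.php$", "/*.js$", "/*.inc$", "/*.css$",
    "*/trackback/", "*/feed/", "/category/*",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern using * and $ into a compiled regex."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape each literal piece, then join the pieces with "any characters".
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def blocking_rule(path):
    """Return the first Disallow pattern matching path, or None if none does."""
    for pattern in DISALLOW:
        # re.match anchors at the start of the path, like robots.txt matching.
        if pattern_to_regex(pattern).match(path):
            return pattern
    return None


if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    samples = [
        "/2008/05/my-post/",
        "/2008/05/my-post/feed/",
        "/gallery/main.php",
        "/main.php?g2_itemId=42",
        "/category/seo/",
    ]
    for path in samples:
        print(path, "->", blocking_rule(path) or "allowed")
```

In this sketch a path ending in main.php is caught by the `/*.php$` rule, while the same script called with a query string is not, because the `$` anchor applies to the whole path including the query. Whether a real crawler behaves identically is something to confirm with the search engine's own robots.txt testing tool.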
As explained by my fellow bloggers who trackbacked, you also need to take care with the agents you block: it is wiser to target specific bots by name than to use the problematic * wildcard in the "User-agent" field.
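To see what targeting a single agent means in practice, here is a small sketch using Python's standard `urllib.robotparser`. It is only an illustration under stated assumptions: the standard-library parser does plain prefix matching and does not understand the `*` and `$` wildcards, so only the directory rules are used here, and "ExampleBot" is a made-up agent name.

```python
from urllib.robotparser import RobotFileParser

# A reduced version of the rules above, using only plain prefix rules
# (the standard-library parser does not implement * or $ wildcards).
ROBOTS_TXT = """\
User-agent: Googlebot
Disallow: /wp-admin/
Disallow: /wp-includes/
"""


def can_fetch(agent, path):
    """Report whether the given agent may fetch path under ROBOTS_TXT."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    # The group names Googlebot only, so other agents are unaffected.
    print("Googlebot: ", can_fetch("Googlebot", "/wp-admin/options.php"))
    print("ExampleBot:", can_fetch("ExampleBot", "/wp-admin/options.php"))
```

Because the file contains no `User-agent: *` group, the rules restrict Googlebot and leave every other crawler unrestricted, which is the behaviour you get when you name bots individually.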
