<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Drupal SEO &#8211; using robots.txt to avoid content duplication</title>
	<atom:link href="http://teqsnacks.com/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/feed/" rel="self" type="application/rss+xml" />
	<link>http://teqsnacks.com/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/</link>
	<description>Simple CMS, Blogging, Tech, SEO and Social Media</description>
	<lastBuildDate>Fri, 20 Jan 2012 18:52:00 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Arison</title>
		<link>http://teqsnacks.com/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-37959</link>
		<dc:creator>Arison</dc:creator>
		<pubDate>Mon, 08 Nov 2010 17:05:12 +0000</pubDate>
		<guid isPermaLink="false">http://www.filination.com/tech/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-37959</guid>
		<description>The same behavior would have been found if those pages had been given the &quot;nofollow&quot; attribute - the googlebot would not have officially visited them, and although the result could appear in the Search Engine Result Page</description>
		<content:encoded><![CDATA[<p>The same behavior would have been found if those pages had been given the "nofollow" attribute - the googlebot would not have officially visited them, and although the result could appear in the Search Engine Result Page</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: RSS2MYSQL</title>
		<link>http://teqsnacks.com/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-2008</link>
		<dc:creator>RSS2MYSQL</dc:creator>
		<pubDate>Wed, 27 May 2009 00:52:31 +0000</pubDate>
		<guid isPermaLink="false">http://www.filination.com/tech/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-2008</guid>
		<description>Important to note that unless your content duplication comes off as malicious or scheme-like, there will be no real penalty. The typical penalty for dupe content is to be less favored on the page of question , when compared to the OP(Original Poster). This mean they might get indexed, and you no, or you just ignored.

Google might even look at your overall content relevancy and decide that the content fits better with your own site, and show you favor for the copied content.--- ive heard of this happening, but never experienced it myself.</description>
		<content:encoded><![CDATA[<p>Important to note that unless your content duplication comes off as malicious or scheme-like, there will be no real penalty. The typical penalty for dupe content is to be less favored on the page of question , when compared to the OP(Original Poster). This mean they might get indexed, and you no, or you just ignored.</p>
<p>Google might even look at your overall content relevancy and decide that the content fits better with your own site, and show you favor for the copied content.--- ive heard of this happening, but never experienced it myself.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steven Davies</title>
		<link>http://teqsnacks.com/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-74</link>
		<dc:creator>Steven Davies</dc:creator>
		<pubDate>Fri, 18 Apr 2008 21:34:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.filination.com/tech/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-74</guid>
		<description>If you block *all* of your Drupal paging URLs your search engine traffic might drop.
That is right Drupalzilla.
And if you want to increase it just continuing to use drupal:)</description>
		<content:encoded><![CDATA[<p>If you block *all* of your Drupal paging URLs your search engine traffic might drop.<br />
That is right Drupalzilla.<br />
And if you want to increase it just continuing to use drupal:)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Hagrin</title>
		<link>http://teqsnacks.com/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-73</link>
		<dc:creator>Hagrin</dc:creator>
		<pubDate>Fri, 01 Feb 2008 03:04:10 +0000</pubDate>
		<guid isPermaLink="false">http://www.filination.com/tech/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-73</guid>
		<description>Not only can you use robots.txt to prevent content duplication, but you can use your .htaccess file to resolve any canonical domain issues, Google Webmaster Central to pick your preferred domain and the Global Redirect module to get rid of trailing slashes.

Hope that helps!</description>
		<content:encoded><![CDATA[<p>Not only can you use robots.txt to prevent content duplication, but you can use your .htaccess file to resolve any canonical domain issues, Google Webmaster Central to pick your preferred domain and the Global Redirect module to get rid of trailing slashes.</p>
<p>Hope that helps!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Drupal SEO : A complete beginners step-by-step video tutorial &#124; fiLi&#39;s tech</title>
		<link>http://teqsnacks.com/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-72</link>
		<dc:creator>Drupal SEO : A complete beginners step-by-step video tutorial &#124; fiLi&#39;s tech</dc:creator>
		<pubDate>Fri, 21 Dec 2007 11:34:46 +0000</pubDate>
		<guid isPermaLink="false">http://www.filination.com/tech/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-72</guid>
		<description>[...] SEO : The top Drupal SEO modules and how to use them&#8220;, make sure you&#8217;re &#8220;Drupal SEO: using robots.txt to avoid content duplication&#8221; and add a few more general website SEO [...]</description>
		<content:encoded><![CDATA[<p>[...] SEO : The top Drupal SEO modules and how to use them&#8220;, make sure you&#8217;re &#8220;Drupal SEO: using robots.txt to avoid content duplication&#8221; and add a few more general website SEO [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Drupalzilla</title>
		<link>http://teqsnacks.com/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-69</link>
		<dc:creator>Drupalzilla</dc:creator>
		<pubDate>Mon, 20 Aug 2007 18:06:37 +0000</pubDate>
		<guid isPermaLink="false">http://www.filination.com/tech/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-69</guid>
		<description>Unless you have custom Views set up, spiders will not be able to find all of your content if you block paginated content. &lt;br /&gt;
When you create nodes and promote them to the front page they will paginate like this: http://example.com/node?page=2.&#160;&#160; &lt;br /&gt;
If you don&#039;t promote posts to the front page, you can access them through taxonomy terms, but those are paginated also, for example, http://example/category?page=2.&#160;&#160;&#160; &lt;br /&gt;
So if you block off all paginated pages, search engines will only be able to find the first 10 posts on your home page and the first 10 posts under any taxonomy term. &lt;br /&gt;
There are other ways for nodes to be displayed, but they are all paginated with ?page=...&#160;&#160;&#160; &lt;br /&gt;
Post #20 in category X will be blocked from search engines unless you have created a specific link to it from another page.&#160; I&#039;ve blocked dynamic pages on a Drupal site as a test (two different sites).&#160; &lt;br /&gt;
As soon as I ended the experiment, search engine referrals went up significantly.&#160; I usually create a custom front page on my Drupal sites, and block /node (which includes all front page pagination).&#160; I let crawlers get to the content through taxonomy terms pages because they are listed with other posts on a similar keyword theme.</description>
		<content:encoded><![CDATA[<p>Unless you have custom Views set up, spiders will not be able to find all of your content if you block paginated content. <br />
When you create nodes and promote them to the front page they will paginate like this: <a href="http://example.com/node?page=2.&nbsp;&#038;nbsp" rel="nofollow">http://example.com/node?page=2.&nbsp;&#038;nbsp</a>; <br />
If you don't promote posts to the front page, you can access them through taxonomy terms, but those are paginated also, for example, <a href="http://example/category?page=2.&nbsp;&nbsp;&#038;nbsp" rel="nofollow">http://example/category?page=2.&nbsp;&nbsp;&#038;nbsp</a>; <br />
So if you block off all paginated pages, search engines will only be able to find the first 10 posts on your home page and the first 10 posts under any taxonomy term. <br />
There are other ways for nodes to be displayed, but they are all paginated with ?page=...&nbsp;&nbsp;&nbsp; <br />
Post #20 in category X will be blocked from search engines unless you have created a specific link to it from another page.&nbsp; I've blocked dynamic pages on a Drupal site as a test (two different sites).&nbsp; <br />
As soon as I ended the experiment, search engine referrals went up significantly.&nbsp; I usually create a custom front page on my Drupal sites, and block /node (which includes all front page pagination).&nbsp; I let crawlers get to the content through taxonomy terms pages because they are listed with other posts on a similar keyword theme.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: fiLi</title>
		<link>http://teqsnacks.com/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-59</link>
		<dc:creator>fiLi</dc:creator>
		<pubDate>Mon, 20 Aug 2007 12:57:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.filination.com/tech/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-59</guid>
		<description>Drupalzilla - what&#039;s in paged URLs that isn&#039;t in direct nodes? what kind of unique content do the paginated sections provide?</description>
		<content:encoded><![CDATA[<p>Drupalzilla - what's in paged URLs that isn't in direct nodes? what kind of unique content do the paginated sections provide?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Drupalzilla</title>
		<link>http://teqsnacks.com/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-58</link>
		<dc:creator>Drupalzilla</dc:creator>
		<pubDate>Mon, 20 Aug 2007 12:49:03 +0000</pubDate>
		<guid isPermaLink="false">http://www.filination.com/tech/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-58</guid>
		<description>If you block *all* of your Drupal paging URLs your search engine traffic might drop.  Search engine crawlers won&#039;t be able to reach all of your content.  Better to only selectively block the specific paginated sections that are causing problems...</description>
		<content:encoded><![CDATA[<p>If you block *all* of your Drupal paging URLs your search engine traffic might drop.  Search engine crawlers won't be able to reach all of your content.  Better to only selectively block the specific paginated sections that are causing problems...</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: fiLi</title>
		<link>http://teqsnacks.com/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-61</link>
		<dc:creator>fiLi</dc:creator>
		<pubDate>Mon, 20 Aug 2007 08:00:01 +0000</pubDate>
		<guid isPermaLink="false">http://www.filination.com/tech/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-61</guid>
		<description>Dheeraj - robots.txt isn&#039;t the only way to prevent from indexing. You can also insert a noindex metatag to the pages you don&#039;t wish to index :
&lt;blockquote&gt;meta name=&quot;googlebot&quot; content=&quot;noindex,nofollow&quot; /&lt;/blockquote&gt;

For paging, I believe you can just do &quot;Disallow: */page=* . I suggest you experiment with robots.txt on the Google webmaster tools to see what the right robots.txt is for you to block the pages you don&#039;t want to index.

Good luck, let me know if that helped.</description>
		<content:encoded><![CDATA[<p>Dheeraj - robots.txt isn't the only way to prevent from indexing. You can also insert a noindex metatag to the pages you don't wish to index :</p>
<blockquote><p>meta name="googlebot" content="noindex,nofollow" /</p></blockquote>
<p>For paging, I believe you can just do "Disallow: */page=* . I suggest you experiment with robots.txt on the Google webmaster tools to see what the right robots.txt is for you to block the pages you don't want to index.</p>
<p>Good luck, let me know if that helped.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dheeraj</title>
		<link>http://teqsnacks.com/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-67</link>
		<dc:creator>Dheeraj</dc:creator>
		<pubDate>Mon, 20 Aug 2007 06:01:11 +0000</pubDate>
		<guid isPermaLink="false">http://www.filination.com/tech/2007/02/22/drupal-seo-using-robotstxt-to-avoid-content-duplication/#comment-67</guid>
		<description>How do I stop paging URL from getting indexed? I&#039;m having duplicate content error as i have taxonomy menu for taxonomy navigation. also the content is displayed on the homepage as well as the content link itself. Paging links are like [domain name]/[category name]/[article title]/?page=2  This link is also getting indexed in search engines. Can we write in robot.txt such that it should not crawl the paging links i.e. paging links are not followed. Currently we have patched the pageing to work on POST rather than the GET method.</description>
		<content:encoded><![CDATA[<p>How do I stop paging URL from getting indexed? I'm having duplicate content error as i have taxonomy menu for taxonomy navigation. also the content is displayed on the homepage as well as the content link itself. Paging links are like [domain name]/[category name]/[article title]/?page=2  This link is also getting indexed in search engines. Can we write in robot.txt such that it should not crawl the paging links i.e. paging links are not followed. Currently we have patched the pageing to work on POST rather than the GET method.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: enhanced
Object Caching 0/0 objects using disk: basic

Served from: teqsnacks.com @ 2012-02-10 06:16:49 -->
