<?xml version="1.0" encoding="utf-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: robots.txt Adventure</title>
	<atom:link href="http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure</link>
	<description>by Andrew Wooster</description>
	<pubDate>Thu, 24 Jul 2008 01:13:14 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.5.1</generator>
		<item>
		<title>By: robots.txt analysiert &#124; Webmaster</title>
		<link>http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10274</link>
		<dc:creator>robots.txt analysiert &#124; Webmaster</dc:creator>
		<pubDate>Wed, 26 Sep 2007 08:08:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10274</guid>
		<description>[...] An interesting study by Andrew Wooster. He had a home-made spider visit 4.6 million domains to collect and analyze the robots.txt file from each. It turned up not only statistics on status codes and MIME types, but also all kinds of oddities that suggest a peculiar understanding of the file. Texts of every kind, keywords, logs, lists, and even ASCII art show up in a file that is meant exclusively for bots and spiders. [...]</description>
		<content:encoded><![CDATA[<p>[...] An interesting study by Andrew Wooster. He had a home-made spider visit 4.6 million domains to collect and analyze the robots.txt file from each. It turned up not only statistics on status codes and MIME types, but also all kinds of oddities that suggest a peculiar understanding of the file. Texts of every kind, keywords, logs, lists, and even ASCII art show up in a file that is meant exclusively for bots and spiders. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jason King</title>
		<link>http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10269</link>
		<dc:creator>Jason King</dc:creator>
		<pubDate>Tue, 25 Sep 2007 01:37:32 +0000</pubDate>
		<guid isPermaLink="false">http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10269</guid>
		<description>Oh that's amusing, interesting and useful too.

I frequently do health checks on other people's websites but it hadn't occurred to me to check they've written their robots file correctly. I'll add that to my list of checks.</description>
		<content:encoded><![CDATA[<p>Oh that&#8217;s amusing, interesting and useful too.</p>
<p>I frequently do health checks on other people&#8217;s websites but it hadn&#8217;t occurred to me to check they&#8217;ve written their robots file correctly. I&#8217;ll add that to my list of checks.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Max Design - standards based web design, development and training &#187; Some links for light reading (25/9/07)</title>
		<link>http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10268</link>
		<dc:creator>Max Design - standards based web design, development and training &#187; Some links for light reading (25/9/07)</dc:creator>
		<pubDate>Mon, 24 Sep 2007 21:51:33 +0000</pubDate>
		<guid isPermaLink="false">http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10268</guid>
		<description>[...] robots.txt Adventure [...]</description>
		<content:encoded><![CDATA[<p>[...] robots.txt Adventure [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Links on a fickle monday</title>
		<link>http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10262</link>
		<dc:creator>Links on a fickle monday</dc:creator>
		<pubDate>Mon, 24 Sep 2007 18:56:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10262</guid>
		<description>[...] Interesting web surveys: robots.txt and http headers (via Simon Willison). [...]</description>
		<content:encoded><![CDATA[<p>[...] Interesting web surveys: robots.txt and http headers (via Simon Willison). [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andrew</title>
		<link>http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10260</link>
		<dc:creator>Andrew</dc:creator>
		<pubDate>Mon, 24 Sep 2007 16:51:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10260</guid>
		<description>Sorry about that Sean. I've fixed it in the article.</description>
		<content:encoded><![CDATA[<p>Sorry about that Sean. I&#8217;ve fixed it in the article.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sean Conner</title>
		<link>http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10258</link>
		<dc:creator>Sean Conner</dc:creator>
		<pubDate>Mon, 24 Sep 2007 08:54:35 +0000</pubDate>
		<guid isPermaLink="false">http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10258</guid>
		<description>Just one small quibble:  it's "c-o-n-n-E-r".</description>
		<content:encoded><![CDATA[<p>Just one small quibble:  it&#8217;s &#8220;c-o-n-n-E-r&#8221;.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nick Gully</title>
		<link>http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10254</link>
		<dc:creator>Nick Gully</dc:creator>
		<pubDate>Sun, 23 Sep 2007 18:26:32 +0000</pubDate>
		<guid isPermaLink="false">http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10254</guid>
		<description>I think you're missing some important rules for robots to follow:
# A robot may not injure a human being or through inaction allow a human being to come to harm.
# A robot must obey the orders given it by human beings, except where such orders would conflict with the First Law.
# A robot must protect its own existence, as long as such protection does not conflict with the First or Second Laws.</description>
		<content:encoded><![CDATA[<p>I think you&#8217;re missing some important rules for robots to follow:<br />
# A robot may not injure a human being or through inaction allow a human being to come to harm.<br />
# A robot must obey the orders given it by human beings, except where such orders would conflict with the First Law.<br />
# A robot must protect its own existence, as long as such protection does not conflict with the First or Second Laws.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: egorych</title>
		<link>http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10249</link>
		<dc:creator>egorych</dc:creator>
		<pubDate>Sun, 23 Sep 2007 10:45:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10249</guid>
		<description>Hey, I've translated this article into Russian (of course you've got some more links :)).
This is great. I'm surprised how many sites from Dmoz have such stupid errors. They are likely to be good sites, aren't they? It's so hard to get into dmoz now...

Good job.</description>
		<content:encoded><![CDATA[<p>Hey, I&#8217;ve translated this article into Russian (of course you&#8217;ve got some more links :)).<br />
This is great. I&#8217;m surprised how many sites from Dmoz have such stupid errors. They are likely to be good sites, aren&#8217;t they? It&#8217;s so hard to get into dmoz now&#8230;</p>
<p>Good job.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: links for 2007-09-23 &#171; Simply&#8230; A User</title>
		<link>http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10245</link>
		<dc:creator>links for 2007-09-23 &#171; Simply&#8230; A User</dc:creator>
		<pubDate>Sun, 23 Sep 2007 00:30:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10245</guid>
		<description>[...] nextthing.org &#187; robots.txt Adventure (tags: web robots.txt http search spider robots standards internet google genius analysis **) [...]</description>
		<content:encoded><![CDATA[<p>[...] nextthing.org &#187; robots.txt Adventure (tags: web robots.txt http search spider robots standards internet google genius analysis **) [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: James</title>
		<link>http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10241</link>
		<dc:creator>James</dc:creator>
		<pubDate>Sat, 22 Sep 2007 13:36:15 +0000</pubDate>
		<guid isPermaLink="false">http://www.nextthing.org/archives/2007/03/12/robotstxt-adventure#comment-10241</guid>
		<description>You missed the blog in http://www.webmasterworld.com/robots.txt</description>
		<content:encoded><![CDATA[<p>You missed the blog in <a href="http://www.webmasterworld.com/robots.txt" rel="nofollow">http://www.webmasterworld.com/robots.txt</a></p>
]]></content:encoded>
	</item>
</channel>
</rss>

