<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments for Ken&#039;s Techno Tidbits</title>
	<atom:link href="http://ken-blog.krugler.org/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://ken-blog.krugler.org</link>
	<description></description>
	<lastBuildDate>Mon, 08 Mar 2010 13:47:57 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>Comment on Google Earth overlay for California Thirteeners peak list by kkrugler</title>
		<link>http://ken-blog.krugler.org/2009/05/11/google-earth-overlay-for-california-thirteeners-peak-list/#comment-511</link>
		<dc:creator>kkrugler</dc:creator>
		<pubDate>Mon, 08 Mar 2010 13:47:57 +0000</pubDate>
		<guid isPermaLink="false">http://ken-blog.krugler.org/?p=132#comment-511</guid>
		<description>Hi Victor,

Glad you like it. You should check out Schmed&#039;s updated info on his site (embedded Google Earth, plus revised KML) - http://www.vulgarianramblers.org/ca_100.php?show_google_earth=true</description>
		<content:encoded><![CDATA[<p>Hi Victor,</p>
<p>Glad you like it. You should check out Schmed&#8217;s updated info on his site (embedded Google Earth, plus revised KML) &#8211; <a href="http://www.vulgarianramblers.org/ca_100.php?show_google_earth=true" rel="nofollow">http://www.vulgarianramblers.org/ca_100.php?show_google_earth=true</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Google Earth overlay for California Thirteeners peak list by Victor Hanson-Smith</title>
		<link>http://ken-blog.krugler.org/2009/05/11/google-earth-overlay-for-california-thirteeners-peak-list/#comment-510</link>
		<dc:creator>Victor Hanson-Smith</dc:creator>
		<pubDate>Mon, 08 Mar 2010 05:31:49 +0000</pubDate>
		<guid isPermaLink="false">http://ken-blog.krugler.org/?p=132#comment-510</guid>
		<description>Hi, thanks for creating and posting this KML file.  It&#039;s super useful!  By the way, it works fine in Google Earth version 5.1.3533.1731 on a Mac OSX 10.6.</description>
		<content:encoded><![CDATA[<p>Hi, thanks for creating and posting this KML file.  It&#8217;s super useful!  By the way, it works fine in Google Earth version 5.1.3533.1731 on a Mac OSX 10.6.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Optimal solar panel angle by davea0511</title>
		<link>http://ken-blog.krugler.org/2008/03/16/optimal-solar-panel-angle/#comment-509</link>
		<dc:creator>davea0511</dc:creator>
		<pubDate>Sat, 06 Mar 2010 13:26:09 +0000</pubDate>
		<guid isPermaLink="false">http://kkrugler.wordpress.com/?p=51#comment-509</guid>
		<description>The best value is the Xandex sunMizer MPPT modules.  You only put them on the panels that get shaded.  they&#039;re $160 each.  Looking at your picture though it seems like you have a lot of roof space not being used in bright sunlight.  I&#039;d move as many of your panels there as possible, then sell the few left over, and with that money but the Xandex SunMizers, and my guess is that you&#039;ll see a significant improvement right away without it costing you a thing.</description>
		<content:encoded><![CDATA[<p>The best value is the Xandex sunMizer MPPT modules.  You only put them on the panels that get shaded.  they&#8217;re $160 each.  Looking at your picture though it seems like you have a lot of roof space not being used in bright sunlight.  I&#8217;d move as many of your panels there as possible, then sell the few left over, and with that money but the Xandex SunMizers, and my guess is that you&#8217;ll see a significant improvement right away without it costing you a thing.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Optimal solar panel angle by Bill Sr</title>
		<link>http://ken-blog.krugler.org/2008/03/16/optimal-solar-panel-angle/#comment-507</link>
		<dc:creator>Bill Sr</dc:creator>
		<pubDate>Fri, 05 Mar 2010 04:08:11 +0000</pubDate>
		<guid isPermaLink="false">http://kkrugler.wordpress.com/?p=51#comment-507</guid>
		<description>{Blue Locktite}</description>
		<content:encoded><![CDATA[<p>{Blue Locktite}</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Converting Vimeo embedded HTML to XHTML by YouTube and Vimeo embed code will not validate &#171; OneBigMug</title>
		<link>http://ken-blog.krugler.org/2008/06/14/converting-vimeo-embedded-html-to-xhtml/#comment-503</link>
		<dc:creator>YouTube and Vimeo embed code will not validate &#171; OneBigMug</dc:creator>
		<pubDate>Tue, 16 Feb 2010 16:45:35 +0000</pubDate>
		<guid isPermaLink="false">http://kkrugler.wordpress.com/?p=58#comment-503</guid>
		<description>[...] Or use the generator found here. Details for XHTML valid Vimeo code is here. [...]</description>
		<content:encoded><![CDATA[<p>[...] Or use the generator found here. Details for XHTML valid Vimeo code is here. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Performance problems with vertical/focused web crawling by kkrugler</title>
		<link>http://ken-blog.krugler.org/2009/05/19/performance-problems-with-verticalfocused-web-crawling/#comment-502</link>
		<dc:creator>kkrugler</dc:creator>
		<pubDate>Fri, 05 Feb 2010 23:22:09 +0000</pubDate>
		<guid isPermaLink="false">http://ken-blog.krugler.org/?p=146#comment-502</guid>
		<description>Hi Otis,

I haven&#039;t been closely tracking Nutch patches, though I see that there have been improvements to how the plugins can communicate information - which was one of the major headaches previously.

As to a comparison, I think the recent post by Stefano Cherchi on the Nutch list is a good example of where Bixo is much easier to use than Nutch. I had to deal with almost exactly the same use case he describes, where you have top-level pages that have links to the real content, but you have to crawl (and not index) these top pages first.

In Bixo I created a workflow that took top level URLs in, fetched them, used a modified parser to extract the links, then fed these into a second fetch pipe, which in turn fed the results into the parser.

So I wound up with one very simple (conceptually) Cascading workflow, which was very reliable...e.g. no error-prone editing of config files in between runs.

But it would be interesting to do a side-by-side comparison of Bixo &amp; Nutch for some vertical crawl project. If you have something in mind, I&#039;d be interested in giving that a try.

-- Ken</description>
		<content:encoded><![CDATA[<p>Hi Otis,</p>
<p>I haven&#8217;t been closely tracking Nutch patches, though I see that there have been improvements to how the plugins can communicate information &#8211; which was one of the major headaches previously.</p>
<p>As to a comparison, I think the recent post by Stefano Cherchi on the Nutch list is a good example of where Bixo is much easier to use than Nutch. I had to deal with almost exactly the same use case he describes, where you have top-level pages that have links to the real content, but you have to crawl (and not index) these top pages first.</p>
<p>In Bixo I created a workflow that took top level URLs in, fetched them, used a modified parser to extract the links, then fed these into a second fetch pipe, which in turn fed the results into the parser.</p>
<p>So I wound up with one very simple (conceptually) Cascading workflow, which was very reliable&#8230;e.g. no error-prone editing of config files in between runs.</p>
<p>But it would be interesting to do a side-by-side comparison of Bixo &amp; Nutch for some vertical crawl project. If you have something in mind, I&#8217;d be interested in giving that a try.</p>
<p>&#8211; Ken</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Performance problems with vertical/focused web crawling by Otis Gospodnetic</title>
		<link>http://ken-blog.krugler.org/2009/05/19/performance-problems-with-verticalfocused-web-crawling/#comment-501</link>
		<dc:creator>Otis Gospodnetic</dc:creator>
		<pubDate>Mon, 01 Feb 2010 17:37:30 +0000</pubDate>
		<guid isPermaLink="false">http://ken-blog.krugler.org/?p=146#comment-501</guid>
		<description>Ken, at this point, with various semi-recent patches applied to Nutch to address some of these long-tail issues, would you say Bixo is still a better tool to use for vertical crawls?  If so, what are the &quot;remaining&quot; aspects of Bixo that Nutch still misses to be equally good for such crawls?</description>
		<content:encoded><![CDATA[<p>Ken, at this point, with various semi-recent patches applied to Nutch to address some of these long-tail issues, would you say Bixo is still a better tool to use for vertical crawls?  If so, what are the &#8220;remaining&#8221; aspects of Bixo that Nutch still misses to be equally good for such crawls?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Why fetching web pages doesn&#8217;t map well to map-reduce by kkrugler</title>
		<link>http://ken-blog.krugler.org/2009/12/12/why-fetching-web-pages-doesnt-map-well-to-map-reduce/#comment-500</link>
		<dc:creator>kkrugler</dc:creator>
		<pubDate>Mon, 01 Feb 2010 17:25:54 +0000</pubDate>
		<guid isPermaLink="false">http://ken-blog.krugler.org/?p=197#comment-500</guid>
		<description>Hi Otis,

My comment about parallelism being limited by the number of reducers is based on a design goal of trying to use straight MR, without the complexity of additional threading support.

Since you don&#039;t get sufficient parallelism via just the number of reducers, you have to deal with the added complexity of a multi-threaded reducer. Which is what Nutch and Bixo both do - but it&#039;s not pretty :)</description>
		<content:encoded><![CDATA[<p>Hi Otis,</p>
<p>My comment about parallelism being limited by the number of reducers is based on a design goal of trying to use straight MR, without the complexity of additional threading support.</p>
<p>Since you don&#8217;t get sufficient parallelism via just the number of reducers, you have to deal with the added complexity of a multi-threaded reducer. Which is what Nutch and Bixo both do &#8211; but it&#8217;s not pretty <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Why fetching web pages doesn&#8217;t map well to map-reduce by sematext</title>
		<link>http://ken-blog.krugler.org/2009/12/12/why-fetching-web-pages-doesnt-map-well-to-map-reduce/#comment-499</link>
		<dc:creator>sematext</dc:creator>
		<pubDate>Mon, 01 Feb 2010 17:04:08 +0000</pubDate>
		<guid isPermaLink="false">http://ken-blog.krugler.org/?p=197#comment-499</guid>
		<description>Regarding the &quot;the maximum amount of parallelization would be equal to the number of reducers, which typically is something close to the number of ccores (servers * cores/server). So on a 10 server cluster w/dual cores, you’d have 20 threads active.&quot; piece:

Do number from this example match what you&#039;ve observed with Nutch?  I don&#039;t have an example of Nutch log handy, but it does include the number of active threads, so I&#039;m wondering if in your experience it really matches the above theory(?)?</description>
		<content:encoded><![CDATA[<p>Regarding the &#8220;the maximum amount of parallelization would be equal to the number of reducers, which typically is something close to the number of ccores (servers * cores/server). So on a 10 server cluster w/dual cores, you’d have 20 threads active.&#8221; piece:</p>
<p>Do number from this example match what you&#8217;ve observed with Nutch?  I don&#8217;t have an example of Nutch log handy, but it does include the number of active threads, so I&#8217;m wondering if in your experience it really matches the above theory(?)?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on 193MPH Volkswagen Van by The Abbot of Unreason</title>
		<link>http://ken-blog.krugler.org/2010/01/05/193mph-volkswagen-van/#comment-492</link>
		<dc:creator>The Abbot of Unreason</dc:creator>
		<pubDate>Wed, 06 Jan 2010 15:59:16 +0000</pubDate>
		<guid isPermaLink="false">http://ken-blog.krugler.org/?p=228#comment-492</guid>
		<description>I think it would be great if they hadn&#039;t changed the colors, or if they had painted it dull beige with a lot of peace signs.  I&#039;d like to see that hurtle down the road.</description>
		<content:encoded><![CDATA[<p>I think it would be great if they hadn&#8217;t changed the colors, or if they had painted it dull beige with a lot of peace signs.  I&#8217;d like to see that hurtle down the road.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
