<?xml version="1.0" encoding="utf-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Another RSS bandwidth reducing technique</title>
	<atom:link href="http://www.jowilson.org/weblog/archives/2004/12/20/another-rss-bandwidth-reducing-technique/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.jowilson.org/weblog/archives/2004/12/20/another-rss-bandwidth-reducing-technique/</link>
	<description>A weblog for people who otherwise wouldn&#039;t.</description>
	<lastBuildDate>Fri, 01 Apr 2011 03:58:11 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1</generator>
	<item>
		<title>By: Doug Gbison</title>
		<link>http://www.jowilson.org/weblog/archives/2004/12/20/another-rss-bandwidth-reducing-technique/comment-page-1/#comment-1023</link>
		<dc:creator>Doug Gbison</dc:creator>
		<pubDate>Wed, 29 Dec 2004 13:44:22 +0000</pubDate>
		<guid isPermaLink="false">/?p=319#comment-1023</guid>
		<description>Yeah, I agree that serving limited content on the first-time hit is not so bad, and I would propose to programatically include an entry explaining the issue and encouraging the user to use well-behaved RSS readers. My concern with doing this related to the potential for the last-checked and eTags timing out or being cleared from cache by the the RSS client. Is this a concern at all? Does anyone have any insight into how RSS readers work in that regard? 

Thanks for bearing with me - It&#039;s great to come up with countermeasures for abuse, but I am always very cautious as to not hinder the legitamate users in the process.</description>
		<content:encoded><![CDATA[<p>Yeah, I agree that serving limited content on the first-time hit is not so bad, and I would propose to programatically include an entry explaining the issue and encouraging the user to use well-behaved RSS readers. My concern with doing this related to the potential for the last-checked and eTags timing out or being cleared from cache by the the RSS client. Is this a concern at all? Does anyone have any insight into how RSS readers work in that regard? </p>
<p>Thanks for bearing with me &#8211; It&#8217;s great to come up with countermeasures for abuse, but I am always very cautious as to not hinder the legitamate users in the process.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John Wilson</title>
		<link>http://www.jowilson.org/weblog/archives/2004/12/20/another-rss-bandwidth-reducing-technique/comment-page-1/#comment-1022</link>
		<dc:creator>John Wilson</dc:creator>
		<pubDate>Thu, 23 Dec 2004 01:20:15 +0000</pubDate>
		<guid isPermaLink="false">/?p=319#comment-1022</guid>
		<description>Yeah, I&#039;ve thought about that, and I&#039;m okay with it.  The thing is, the very first time they get a new update, all of the old summary articles will change to full feed articles.  Yes it&#039;s a bit of a pain in the ass, but I think people can deal with it.  

If an RSS aggregator wanted to be a smart ass, on the very first fetch it could pass in a If-Modified-Since header with a date like, oh, March 1, 1989 (the month and year that Tim Berners-Lee put forth the first paper outlining what would become today&#039;s web).  That would prove that the RSS reader was well behaved, as well as having a sense of humor.</description>
		<content:encoded><![CDATA[<p>Yeah, I&#8217;ve thought about that, and I&#8217;m okay with it.  The thing is, the very first time they get a new update, all of the old summary articles will change to full feed articles.  Yes it&#8217;s a bit of a pain in the ass, but I think people can deal with it.  </p>
<p>If an RSS aggregator wanted to be a smart ass, on the very first fetch it could pass in a If-Modified-Since header with a date like, oh, March 1, 1989 (the month and year that Tim Berners-Lee put forth the first paper outlining what would become today&#8217;s web).  That would prove that the RSS reader was well behaved, as well as having a sense of humor.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Robert Sayre</title>
		<link>http://www.jowilson.org/weblog/archives/2004/12/20/another-rss-bandwidth-reducing-technique/comment-page-1/#comment-1021</link>
		<dc:creator>Robert Sayre</dc:creator>
		<pubDate>Thu, 23 Dec 2004 00:48:15 +0000</pubDate>
		<guid isPermaLink="false">/?p=319#comment-1021</guid>
		<description>Ok, but this means that first-time visitors will not receive full text feeds. No matter how well behaved, an aggregator cannot send those headers on the first visit. Do you really want to penalize new readers like that? I like the idea, but maybe start doing it after a pattern of abuse.</description>
		<content:encoded><![CDATA[<p>Ok, but this means that first-time visitors will not receive full text feeds. No matter how well behaved, an aggregator cannot send those headers on the first visit. Do you really want to penalize new readers like that? I like the idea, but maybe start doing it after a pattern of abuse.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Doug Gbison</title>
		<link>http://www.jowilson.org/weblog/archives/2004/12/20/another-rss-bandwidth-reducing-technique/comment-page-1/#comment-1013</link>
		<dc:creator>Doug Gbison</dc:creator>
		<pubDate>Tue, 21 Dec 2004 20:05:18 +0000</pubDate>
		<guid isPermaLink="false">/?p=319#comment-1013</guid>
		<description>OK, so Ican do some manual testing of the headers myself. But does anyone have any insight into what determines how long client cache this sort of info?</description>
		<content:encoded><![CDATA[<p>OK, so Ican do some manual testing of the headers myself. But does anyone have any insight into what determines how long client cache this sort of info?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John Wilson</title>
		<link>http://www.jowilson.org/weblog/archives/2004/12/20/another-rss-bandwidth-reducing-technique/comment-page-1/#comment-1012</link>
		<dc:creator>John Wilson</dc:creator>
		<pubDate>Tue, 21 Dec 2004 19:31:04 +0000</pubDate>
		<guid isPermaLink="false">/?p=319#comment-1012</guid>
		<description>Okay, let me rephrase that.  This is all the headers I get from one of my RSS users: 

    Accept: image/gif, image/x-xbitmap, image/jpeg, image/pjpeg, */*
    Accept-encoding: gzip
    Host: crazybutable.com
    User-Agent: FeedOnFeeds/0.1.8 (+http://minutillo.com/steve/feedonfeeds/)

This is probably a problem with their setup, as all of my FOF 0.1.7 subscribers are sending me nice headers.</description>
		<content:encoded><![CDATA[<p>Okay, let me rephrase that.  This is all the headers I get from one of my RSS users: </p>
<p>    Accept: image/gif, image/x-xbitmap, image/jpeg, image/pjpeg, */*<br />
    Accept-encoding: gzip<br />
    Host: crazybutable.com<br />
    User-Agent: FeedOnFeeds/0.1.8 (+<a href="http://minutillo.com/steve/feedonfeeds/" rel="nofollow">http://minutillo.com/steve/feedonfeeds/</a>)</p>
<p>This is probably a problem with their setup, as all of my FOF 0.1.7 subscribers are sending me nice headers.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: kellan</title>
		<link>http://www.jowilson.org/weblog/archives/2004/12/20/another-rss-bandwidth-reducing-technique/comment-page-1/#comment-1010</link>
		<dc:creator>kellan</dc:creator>
		<pubDate>Tue, 21 Dec 2004 18:40:15 +0000</pubDate>
		<guid isPermaLink="false">/?p=319#comment-1010</guid>
		<description>All versions of FoF are well behaved assuming they&#039;ve been installed and configured correctly in part because they build on Magpie which ships with conditional GET, and gzip encoding support.</description>
		<content:encoded><![CDATA[<p>All versions of FoF are well behaved assuming they&#8217;ve been installed and configured correctly in part because they build on Magpie which ships with conditional GET, and gzip encoding support.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John Wilson</title>
		<link>http://www.jowilson.org/weblog/archives/2004/12/20/another-rss-bandwidth-reducing-technique/comment-page-1/#comment-1008</link>
		<dc:creator>John Wilson</dc:creator>
		<pubDate>Tue, 21 Dec 2004 17:19:41 +0000</pubDate>
		<guid isPermaLink="false">/?p=319#comment-1008</guid>
		<description>So you&#039;re looking for programs to test with, if I understand you correctly.  Hmm.  I curled up in front of the fire with a copy of the HTTP/1.1 spec, crafted some test cases, and blasted them at the server by hand.  Later I put in some logging code on my server to log what headers I&#039;m getting from RSS clients themselves.  

Alas, I don&#039;t have a very big subscriber base to get stats from.  As the only example of a &quot;bad&quot; feed reader I&#039;ve found, it appears that FeedOnFeeds 0.17 is well behaved, but FeedOnFeeds 0.18 is not!  For some reason FOF 0.18 isn&#039;t sending me any nice headers.  

Here&#039;s a workaround using Sharpreader.  The first time Sharpreader hits a feed, it doesn&#039;t pass any nice headers, it just grabs it.  Paste the URL of your feed in Sharpreader and hit enter, and that&#039;s the example of a &quot;bad&quot; feedreader.  Then subscribe to the feed using SR.  Add something to the feed and manually refresh it.  Bam! there&#039;s your example of a well behaved feed reader.  Delete the subscribed feed from SR and start again.  (Note that you need a feed you don&#039;t care about to test this with (because you have to keep editing and adding things to it to get Sharpreader to grab it on the second try.))

Nothing really works as well as writing your own program to manually send headers though, because then you can really test every case.</description>
		<content:encoded><![CDATA[<p>So you&#8217;re looking for programs to test with, if I understand you correctly.  Hmm.  I curled up in front of the fire with a copy of the HTTP/1.1 spec, crafted some test cases, and blasted them at the server by hand.  Later I put in some logging code on my server to log what headers I&#8217;m getting from RSS clients themselves.  </p>
<p>Alas, I don&#8217;t have a very big subscriber base to get stats from.  As the only example of a &#8220;bad&#8221; feed reader I&#8217;ve found, it appears that FeedOnFeeds 0.17 is well behaved, but FeedOnFeeds 0.18 is not!  For some reason FOF 0.18 isn&#8217;t sending me any nice headers.  </p>
<p>Here&#8217;s a workaround using Sharpreader.  The first time Sharpreader hits a feed, it doesn&#8217;t pass any nice headers, it just grabs it.  Paste the URL of your feed in Sharpreader and hit enter, and that&#8217;s the example of a &#8220;bad&#8221; feedreader.  Then subscribe to the feed using SR.  Add something to the feed and manually refresh it.  Bam! there&#8217;s your example of a well behaved feed reader.  Delete the subscribed feed from SR and start again.  (Note that you need a feed you don&#8217;t care about to test this with (because you have to keep editing and adding things to it to get Sharpreader to grab it on the second try.))</p>
<p>Nothing really works as well as writing your own program to manually send headers though, because then you can really test every case.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Doug Gbison</title>
		<link>http://www.jowilson.org/weblog/archives/2004/12/20/another-rss-bandwidth-reducing-technique/comment-page-1/#comment-1007</link>
		<dc:creator>Doug Gbison</dc:creator>
		<pubDate>Tue, 21 Dec 2004 16:31:39 +0000</pubDate>
		<guid isPermaLink="false">/?p=319#comment-1007</guid>
		<description>This is a great idea and something similar to what I was considering given the recent articles that sparked my interest is bandwidth optimization of RSS. One thing I would do in such a case is hard-code a reference to a post/article about &quot;well behaved RSS Aggregators&quot; and why a headline-only feed is being served, etc.

I do have a couple of questions/concerns though. What actually governs whether the proper headers will be sent back? I mean, some aggregators store everything in what amounts to a database, but others I assume will &quot;cache&quot; the info more like a web-cache. So what governs how long this info persists so that when the user revists (say after vacation) the If-Modified-Since and If-None-Match are again passed to retrieve the full RSS posts? Anyone know of a feature matrix of aggregators or RSS readers that support these features? Or a badly behaved RSS reader I can test with?</description>
		<content:encoded><![CDATA[<p>This is a great idea and something similar to what I was considering given the recent articles that sparked my interest is bandwidth optimization of RSS. One thing I would do in such a case is hard-code a reference to a post/article about &#8220;well behaved RSS Aggregators&#8221; and why a headline-only feed is being served, etc.</p>
<p>I do have a couple of questions/concerns though. What actually governs whether the proper headers will be sent back? I mean, some aggregators store everything in what amounts to a database, but others I assume will &#8220;cache&#8221; the info more like a web-cache. So what governs how long this info persists so that when the user revists (say after vacation) the If-Modified-Since and If-None-Match are again passed to retrieve the full RSS posts? Anyone know of a feature matrix of aggregators or RSS readers that support these features? Or a badly behaved RSS reader I can test with?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Regular Sucking Schedule</title>
		<link>http://www.jowilson.org/weblog/archives/2004/12/20/another-rss-bandwidth-reducing-technique/comment-page-1/#comment-1005</link>
		<dc:creator>Regular Sucking Schedule</dc:creator>
		<pubDate>Tue, 21 Dec 2004 04:48:52 +0000</pubDate>
		<guid isPermaLink="false">/?p=319#comment-1005</guid>
		<description>&lt;strong&gt;Give Poor Aggregators Less&lt;/strong&gt;
 John Wilson has a great idea about dealing with bad aggregators: give them less. He has reconfigured his copy of Word Press, blogging software, to provide tiny summaries for aggregators that fail to offer up the right HTTP headers that demonstrate the...</description>
		<content:encoded><![CDATA[<p><strong>Give Poor Aggregators Less</strong><br />
 John Wilson has a great idea about dealing with bad aggregators: give them less. He has reconfigured his copy of Word Press, blogging software, to provide tiny summaries for aggregators that fail to offer up the right HTTP headers that demonstrate the&#8230;</p>
]]></content:encoded>
	</item>
</channel>
</rss>

