<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.0.2" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/">
<channel>
	<title>Comments for smackman</title>
	<link>http://smackman.com</link>
	<description>Putting the smack down, one smack at a time</description>
	<pubDate>Fri, 21 Nov 2008 06:08:56 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.0.2</generator>

	<item>
		<title>Comment on An Old Idea by steve</title>
		<link>http://smackman.com/2006/06/01/an-old-idea/#comment-14</link>
		<pubDate>Thu, 01 Jun 2006 05:56:27 +0000</pubDate>
		<guid>http://smackman.com/2006/06/01/an-old-idea/#comment-14</guid>
					<description>Scott,

The point is not that the document can or cannot be parsed with a schema.  The point is that the schema allows parsers to be generated automatically rather than hand-code them for each microformat and for each programming language.

As for collisions, it's not that you couldn't write a parser that knew that an email address inside a vcard was different than an email address inside an hreview, or that hreviews might contain vcards which, in turn, might contain email addresses that should be associated with the latter.  The point is that if there is a clear grammar, then one doesn't have to worry about whether their hand parser will misread a microformat when another, unexpected microformat is nested inside it.</description>
		<content:encoded><![CDATA[<p>Scott,</p>
<p>The point is not that the document can or cannot be parsed with a schema.  The point is that the schema allows parsers to be generated automatically rather than hand-code them for each microformat and for each programming language.</p>
<p>As for collisions, it&#8217;s not that you couldn&#8217;t write a parser that knew that an email address inside a vcard was different than an email address inside an hreview, or that hreviews might contain vcards which, in turn, might contain email addresses that should be associated with the latter.  The point is that if there is a clear grammar, then one doesn&#8217;t have to worry about whether their hand parser will misread a microformat when another, unexpected microformat is nested inside it.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on An Old Idea by Scott Reynen</title>
		<link>http://smackman.com/2006/06/01/an-old-idea/#comment-13</link>
		<pubDate>Thu, 01 Jun 2006 01:32:24 +0000</pubDate>
		<guid>http://smackman.com/2006/06/01/an-old-idea/#comment-13</guid>
					<description>It's not clear to me what problem this would solve. It doesn't appear to solve the problem of collision, as knowing that vcard contains email doesn't tell you that email can also be part of hreview.

Can you give a specific example of a document that could be parsed automatically with a schema, and couldn't be parsed automatically without?</description>
		<content:encoded><![CDATA[<p>It&#8217;s not clear to me what problem this would solve. It doesn&#8217;t appear to solve the problem of collision, as knowing that vcard contains email doesn&#8217;t tell you that email can also be part of hreview.</p>
<p>Can you give a specific example of a document that could be parsed automatically with a schema, and couldn&#8217;t be parsed automatically without?
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on An Old Idea by hans.gerwitz&#187; Blog Archive &#187;</title>
		<link>http://smackman.com/2006/06/01/an-old-idea/#comment-12</link>
		<pubDate>Thu, 01 Jun 2006 00:53:16 +0000</pubDate>
		<guid>http://smackman.com/2006/06/01/an-old-idea/#comment-12</guid>
					<description>[...] Microformat schema are sorely needed. Parsing for microformats presently feels&amp;#8230;clunky. [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;] Microformat schema are sorely needed. Parsing for microformats presently feels&#8230;clunky. [&#8230;]
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on Fried pizza, really? by Bahar</title>
		<link>http://smackman.com/2006/05/20/fried-pizza-really/#comment-9</link>
		<pubDate>Sun, 21 May 2006 21:20:50 +0000</pubDate>
		<guid>http://smackman.com/2006/05/20/fried-pizza-really/#comment-9</guid>
					<description>Fried pizza!  Sounds brilliant!</description>
		<content:encoded><![CDATA[<p>Fried pizza!  Sounds brilliant!
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on Fried pizza, really? by Roo</title>
		<link>http://smackman.com/2006/05/20/fried-pizza-really/#comment-8</link>
		<pubDate>Sun, 21 May 2006 13:07:13 +0000</pubDate>
		<guid>http://smackman.com/2006/05/20/fried-pizza-really/#comment-8</guid>
					<description>I love your S5 stylesheet. Can I nick it for a(n internal IBM) presentation I'm doing next week?</description>
		<content:encoded><![CDATA[<p>I love your S5 stylesheet. Can I nick it for a(n internal IBM) presentation I&#8217;m doing next week?
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on hCalendar and timezones by Peter Krantz</title>
		<link>http://smackman.com/2006/05/10/hcalendar-and-timezones/#comment-7</link>
		<pubDate>Wed, 17 May 2006 14:57:33 +0000</pubDate>
		<guid>http://smackman.com/2006/05/10/hcalendar-and-timezones/#comment-7</guid>
					<description>One idea could be to provide parsers with the format used in the element value. Datetime constructs are typically standardized in writing anyway (you may even have a national standard).

So, instead of trying to squeeze in data in an attribute which isn't meant to hold it (e.g. title) one could do like this:

[span class=&quot;dtstart MMM-dd-yyyy&quot;]Dec 24 2007[/span]

Of course this creates other problems (e.g. how to represent other characters while maintaining valid values for class names). Maybe some more research should be done to see if this works.

But, we follow DRY and do not abuse the title element.</description>
		<content:encoded><![CDATA[<p>One idea could be to provide parsers with the format used in the element value. Datetime constructs are typically standardized in writing anyway (you may even have a national standard).</p>
<p>So, instead of trying to squeeze in data in an attribute which isn&#8217;t meant to hold it (e.g. title) one could do like this:</p>
<p>[span class=&#8221;dtstart MMM-dd-yyyy&#8221;]Dec 24 2007[/span]</p>
<p>Of course this creates other problems (e.g. how to represent other characters while maintaining valid values for class names). Maybe some more research should be done to see if this works.</p>
<p>But, we follow DRY and do not abuse the title element.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on application/atom+json by Dennis</title>
		<link>http://smackman.com/2006/05/08/applicationatomjson/#comment-6</link>
		<pubDate>Fri, 12 May 2006 04:48:38 +0000</pubDate>
		<guid>http://smackman.com/2006/05/08/applicationatomjson/#comment-6</guid>
					<description>Wordpress stripped out some of my Atom code.

&quot;Gibberish
Code&quot;

Should be:
&amp;#60;dc:subject&amp;#62;Gibberish&amp;#60;/dc:subject&amp;#62;
&amp;#60;dc:subject&amp;#62;Code&amp;#60;/dc:subject&amp;#62;

Hopefully this will work.</description>
		<content:encoded><![CDATA[<p>Wordpress stripped out some of my Atom code.</p>
<p>&#8220;Gibberish<br />
Code&#8221;</p>
<p>Should be:<br />
&lt;dc:subject&gt;Gibberish&lt;/dc:subject&gt;<br />
&lt;dc:subject&gt;Code&lt;/dc:subject&gt;</p>
<p>Hopefully this will work.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on application/atom+json by Dennis</title>
		<link>http://smackman.com/2006/05/08/applicationatomjson/#comment-5</link>
		<pubDate>Fri, 12 May 2006 04:43:09 +0000</pubDate>
		<guid>http://smackman.com/2006/05/08/applicationatomjson/#comment-5</guid>
					<description>I just spent couple of minutes and looked in the MagpieRSS source code.  I see what you are talking about.

This chunk of code in rss_parse.inc seems to explain the problem.

// if inside an Atom content construct (e.g. content or summary) field treat tags as text
elseif ($this-&amp;#62;feed_type == ATOM and $this-&amp;#62;incontent ) 
        {
            // if tags are inlined, then flatten
            $attrs_str = join(' ', 
                    array_map('map_attrs', 
                    array_keys($attrs), 
                    array_values($attrs) ) );
            
            $this-&amp;#62;append_content( &quot;&quot;  );
                    
            array_unshift( $this-&amp;#62;stack, $el );
}

The PHP code takes the below Atom data:
Gibberish
Code 

And turns it in to (output as JSON):
&quot;dc&quot;:{&quot;subject&quot;:&quot;GibberishCode&quot;}

Which I agree is not a good solution, and would go as far as saying that it is just wrong.  Flattening the data structure destroys a lot of the stored information (which does not have to happen).

I can see at least one solution off the top of my head, but I do not know the ramifications of the modifications without studying the library a good bit more.</description>
		<content:encoded><![CDATA[<p>I just spent couple of minutes and looked in the MagpieRSS source code.  I see what you are talking about.</p>
<p>This chunk of code in rss_parse.inc seems to explain the problem.</p>
<p>// if inside an Atom content construct (e.g. content or summary) field treat tags as text<br />
elseif ($this-&gt;feed_type == ATOM and $this-&gt;incontent )<br />
        {<br />
            // if tags are inlined, then flatten<br />
            $attrs_str = join(&#8217; &#8216;,<br />
                    array_map(&#8217;map_attrs&#8217;,<br />
                    array_keys($attrs),<br />
                    array_values($attrs) ) );</p>
<p>            $this-&gt;append_content( &#8220;&#8221;  );</p>
<p>            array_unshift( $this-&gt;stack, $el );<br />
}</p>
<p>The PHP code takes the below Atom data:<br />
Gibberish<br />
Code </p>
<p>And turns it in to (output as JSON):<br />
&#8220;dc&#8221;:{&#8221;subject&#8221;:&#8221;GibberishCode&#8221;}</p>
<p>Which I agree is not a good solution, and would go as far as saying that it is just wrong.  Flattening the data structure destroys a lot of the stored information (which does not have to happen).</p>
<p>I can see at least one solution off the top of my head, but I do not know the ramifications of the modifications without studying the library a good bit more.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on hCalendar and timezones by karl</title>
		<link>http://smackman.com/2006/05/10/hcalendar-and-timezones/#comment-4</link>
		<pubDate>Fri, 12 May 2006 02:32:19 +0000</pubDate>
		<guid>http://smackman.com/2006/05/10/hcalendar-and-timezones/#comment-4</guid>
					<description>Even better if you do that, really simplify it. Right now there are two issues about this.

[abbr class=&quot;dtstart&quot; title=&quot;2006-05-01T12:15:03.0Z&quot;]5:15am[/abbr]

title is for accessibility. It helps to have a better understanding of the content. In the microformat use it's becoming the opposite. In a sense, this would be more logical in terms of accessibility

[abbr class=&quot;dtstart&quot; title=&quot;5:15am&quot;]2006-05-01T12:15:03.0Z[/abbr] 

but then it would go against the microformat principles of making it readable.

The second issue is that it doesn't respect the DRY principle. There is a redundancy of information and then it's prone to errors when we update it. For example, someone with a wysiwyg tool.

I would rather prefer something like that:

[span class=&quot;dtstart tz-Z time&quot;]5:15am[/span]

And then the rest of javascript needed to make it right in display and processing.</description>
		<content:encoded><![CDATA[<p>Even better if you do that, really simplify it. Right now there are two issues about this.</p>
<p>[abbr class=&#8221;dtstart&#8221; title=&#8221;2006-05-01T12:15:03.0Z&#8221;]5:15am[/abbr]</p>
<p>title is for accessibility. It helps to have a better understanding of the content. In the microformat use it&#8217;s becoming the opposite. In a sense, this would be more logical in terms of accessibility</p>
<p>[abbr class=&#8221;dtstart&#8221; title=&#8221;5:15am&#8221;]2006-05-01T12:15:03.0Z[/abbr] </p>
<p>but then it would go against the microformat principles of making it readable.</p>
<p>The second issue is that it doesn&#8217;t respect the DRY principle. There is a redundancy of information and then it&#8217;s prone to errors when we update it. For example, someone with a wysiwyg tool.</p>
<p>I would rather prefer something like that:</p>
<p>[span class=&#8221;dtstart tz-Z time&#8221;]5:15am[/span]</p>
<p>And then the rest of javascript needed to make it right in display and processing.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on application/atom+json by steve</title>
		<link>http://smackman.com/2006/05/08/applicationatomjson/#comment-3</link>
		<pubDate>Tue, 09 May 2006 14:31:37 +0000</pubDate>
		<guid>http://smackman.com/2006/05/08/applicationatomjson/#comment-3</guid>
					<description>Thanks Dennis -- Actually, it was writing the same PHP script (with Magpie and JSON, but not quite as clean as yours) that lead me to write my post.  The problem with this approach is that the schema is reflective of Magpie's internal data structures which I think are lossy (they don't preserve everything from the atom feed, but convert it to a lowest common denominator of atom and rss) and are non standard.  I should be able to get an atom+json feed from most sites that already support atom+xml and rely on its structure.

James Snell (also at IBM) has agreed to work on this with me... hopefully we'll have an IETF draft soon.</description>
		<content:encoded><![CDATA[<p>Thanks Dennis &#8212; Actually, it was writing the same PHP script (with Magpie and JSON, but not quite as clean as yours) that lead me to write my post.  The problem with this approach is that the schema is reflective of Magpie&#8217;s internal data structures which I think are lossy (they don&#8217;t preserve everything from the atom feed, but convert it to a lowest common denominator of atom and rss) and are non standard.  I should be able to get an atom+json feed from most sites that already support atom+xml and rely on its structure.</p>
<p>James Snell (also at IBM) has agreed to work on this with me&#8230; hopefully we&#8217;ll have an IETF draft soon.
</p>
]]></content:encoded>
				</item>
</channel>
</rss>
