Submitted by : 127.0.0.1 at: 2005-04-12T09:50:42+00:00 (12 years ago)
Name :
Category : Severity : Status :
Optional subject :  
Optional comment :

Getting this issue on the Ubuntu new wiki page rss feed, caused by the title of one of the pages:

XML Parsing Error: not well-formed
Location: http://ubuntulinux.org/wiki/pages_rss
Line Number 38, Column 40:      <title>Web Browsing slow ( IVP6 <-> IVP4 )</title>

... --d2m, Tue, 12 Apr 2005 10:51:34 -0700 reply

you need to escape the '<,>,&'  or put the element value in a CDATA section
like <title><![CDATA[Web Browsing slow ( IVP6 <-> IVP4 )]]></title>
to hide the special characters from the parser.

using html_quote now, patch in darcs --betabug, Fri, 23 Feb 2007 14:46:01 +0000 reply

Status: open => closed

feeds still validate too

Needs amendment --betabug, Sun, 25 Feb 2007 11:22:31 +0000 reply

Status: closed => open

Feeds that contain HTML entities (and really should contain those entities, not the characters they render to), we need to encode the characters '&' and '<'. See: http://feedvalidator.org/docs/warning/ContainsHTML.html

I will have to check if the currently used function does this exactly like we need it for the titles. So I'm reopening for the moment. (The feeds currently validate, unless one of the page titles contains things like '&amp;'.

another patch in darcs, as good as it gets --betabug, Mon, 26 Feb 2007 15:52:19 +0000 reply

Status: open => closed

We're now properly encoding &, < and > as we're told in http://www.rssboard.org/rss-profile#data-types-characterdata - feedvalidator still issues warnings for some of the more funny things one can do in the content with these characters. The feed validates though.