[prev in list] [next in list] [prev in thread] [next in thread] 

List:       xml-dev
Subject:    RE: [xml-dev] What to escape when serializing XML
From:       "Paul Spencer" <xml-dev-list () boynings ! co ! uk>
Date:       2007-01-03 12:01:12
Message-ID: NBBBIBMKFOFCNEBAKDPLMEFBLKAA.xml-dev-list () boynings ! co ! uk
[Download RAW message or body]

Don't forget that multiple sequential whitespace characters in attribute
values will be normalised. This affects your third bullet. From the REC
section 3.3.3:

If the attribute type is not CDATA, then the XML processor MUST further
process the normalized attribute value by discarding any leading and
trailing space (#x20) characters, and by replacing sequences of space (#x20)
characters by a single space (#x20) character.

Paul Spencer

> -----Original Message-----
> From: Frans Englich [mailto:frans.englich@telia.com]
> Sent: 03 January 2007 10:18
> To: xml-dev@lists.xml.org
> Subject: Re: [xml-dev] What to escape when serializing XML
>
>
> On Wednesday 03 January 2007 10:00, Henri Sivonen wrote:
> > On Jan 2, 2007, at 17:11, Pete Cordell wrote:
> > > In terms of end-of-line encoding, the approach seems to be to
> > > output what is convenient (CR, LF, or CRLF) and have the receiving
> > > application sort out the situation.
>
> So let me summarize.
>
> This needs to be escaped when serializing XML 1.0 content without
> taking into
> account XML 1.1 compatibility but with the purpose of being able
> to roundtrip
> the content being serialized:
>
> * Required characters like '<' and '&', etc.
> * Characters unable to be represented in the given encoding
> * Whitespace except 0x20 in attributes since parsers do Attribute Value
> Normalization
> * End of line characters since the parser normalizes those as well(2.11
> End-of-Line Handling)
>
> Is that all?
>
> XSLT 2.0 and XQuery 1.0 Serialization hints there is more. It says
> "Specifically, CR, NEL and LINE SE ...". Note the use of the word
> "specifically". And what is the reason to that it requires "#x7F
> through #x9F
> in text nodes and attribute nodes MUST be output as character references"?
>
> It seems the XML 1.0 specification has the perspective of an XML
> consumer, not
> producer.
>
>
> Cheers,
>
> 		Frans
>
> _______________________________________________________________________
>
> XML-DEV is a publicly archived, unmoderated list hosted by OASIS
> to support XML implementation and development. To minimize
> spam in the archives, you must subscribe before posting.
>
> [Un]Subscribe/change address: http://www.oasis-open.org/mlmanage/
> Or unsubscribe: xml-dev-unsubscribe@lists.xml.org
> subscribe: xml-dev-subscribe@lists.xml.org
> List archive: http://lists.xml.org/archives/xml-dev/
> List Guidelines: http://www.oasis-open.org/maillists/guidelines.php
>


_______________________________________________________________________

XML-DEV is a publicly archived, unmoderated list hosted by OASIS
to support XML implementation and development. To minimize
spam in the archives, you must subscribe before posting.

[Un]Subscribe/change address: http://www.oasis-open.org/mlmanage/
Or unsubscribe: xml-dev-unsubscribe@lists.xml.org
subscribe: xml-dev-subscribe@lists.xml.org
List archive: http://lists.xml.org/archives/xml-dev/
List Guidelines: http://www.oasis-open.org/maillists/guidelines.php


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic