[prev in list] [next in list] [prev in thread] [next in thread] 

List:       xml4lib
Subject:    RE: [XML4Lib] Question on trailing whitespace in MARCXML
From:       "Houghton,Andrew" <houghtoa () oclc ! org>
Date:       2008-07-03 13:26:13
Message-ID: 6548F17059905B48B2A6F28CE3692BAAC89F48 () OAEXCH4SERVER ! oa ! oclc ! org
[Download RAW message or body]

[Attachment #2 (unknown)]

> From: xml4lib-bounces@webjunction.org [mailto:xml4lib-
> bounces@webjunction.org] On Behalf Of Mathias Kratzer
> Sent: Thursday, July 03, 2008 5:30 AM
> To: xml4lib
> Subject: [XML4Lib] Question on trailing whitespace in MARCXML
> controlfieldelements
> 
> Dear all,
> 
> is it valid to omit trailing whitespace in MARCXML controlfield
> elements?
> 
> Example:
> 
> <controlfield tag="008">920219s1993    caua   j      000 0
> eng</controlfield>
> 
> instead of
> 
> <controlfield tag="008">920219s1993    caua   j      000 0 eng
> </controlfield>
> 
> 
> According to the MARCXML schema leader and controlfields are subject to
> "whitespace preservation", and for interior whitespace it is clear why
> this is vital. However, any application could easily fill any missing
> positions with whitespaces by default. IMHO it is still _not_ valid to
> omit trailing whitespace - so my real problem is that I'm not able to
> find any sort of document that states "yes, doing so will let your
> MARCXML become invalid".
> 
> I am completely aware of the fact that the LoC (as maintaining agency
> of the MARCXML standard) is the appropriate address for my question but
> the "Contact Us" link on http://www.loc.gov/standards/marcxml/// only
> led me to the general "Ask a Librarian" page. So I thought this mailing
> list is probably the more direct way to ask the experts :-)

The MARC-XML schema use whitespace preservation because whitespace is a content \
issue.  You could omit the whitespace and your MARC-XML would still validate.  \
However, the MARC 21 standards, e.g., authority, bibliographic, holdings, etc. \
specify what the content should be.  In those standards the 008 is exactly 40 \
characters long.  Which means that those trailing spaces are significant to the \
interpretation of the content that is serialized into any format whether it be ISO \
2709 or a MARC-XML instance document.


Andy.



_______________________________________________
XML4Lib mailing list
XML4Lib@webjunction.org
http://lists.webjunction.org/mailman/listinfo/xml4lib


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic