[prev in list] [next in list] [prev in thread] [next in thread] 

List:       xml-dev
Subject:    [xml-dev] Neat article by Jon Hanna: Entry-Level Unicode for XML
From:       "Costello, Roger L." <costello () mitre ! org>
Date:       2018-10-23 13:35:48
Message-ID: BN7PR09MB254601F0AE0A9EFF8785CA25C8F50 () BN7PR09MB2546 ! namprd09 ! prod ! outlook ! com
[Download RAW message or body]

A few excerpts:



The Unicode Standard is one of the most readable books ever to have the wor=
d "Standard" in its title and an online version is freely available (https:=
//www.unicode.org/versions/Unicode4.0.0/ch01.pdf). Really you should go and=
 read that instead, but people often don't when I say that. So here's somet=
hing shorter that'll help you out a bit and hopefully also make you realize=
 that this stuff is interesting in itself and you'll be convinced the book'=
s worth taking a look at.



When I say "XML application" in this document I'm referring to a type of XM=
L document combined with a specification as to how it should be used, as su=
ch XHTML, RDF/XML and SVG are XML applications. I am not referring to the s=
oftware that processes or produces such documents, although they are also r=
eferred to as applications in different contexts.



Just what is a character? Is 'a' the shape you can see if you are reading t=
his on screen or page, or is it the sound you can hear if you are using a s=
creen reader, or is it the pattern of bumps you can feel if you are using a=
 Braille reader? Is the 'a' in "nap" the same as the 'a' in "nape"? Is 'a' =
the dot-dash of Morse Code, or the 0x61 of a byte in a computer's memory?



None of these descriptions are adequate descriptions of the letter 'a'. Yet=
 young children manage to handle the concept of 'a' well enough to set them=
 on the path to learning to read and write. If a two-year-old can manage it=
 then it can't be that hard!


Clearly 'a' is an abstract concept. Talking about it as an abstract concept=
 is difficult, as are most abstract concepts, but we can recognize the rela=
tionship between this concept and the shapes, sounds and other communicatio=
n methods.



https://www.hackcraft.net/xmlUnicode/


[Attachment #3 (text/html)]

<html xmlns:v="urn:schemas-microsoft-com:vml" \
xmlns:o="urn:schemas-microsoft-com:office:office" \
xmlns:w="urn:schemas-microsoft-com:office:word" \
xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" \
xmlns="http://www.w3.org/TR/REC-html40"> <head>
<meta http-equiv="Content-Type" content="text/html; charset=us-ascii">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
	{font-family:"Cambria Math";
	panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
	{font-family:Calibri;
	panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
	{margin:0in;
	margin-bottom:.0001pt;
	font-size:11.0pt;
	font-family:"Calibri",sans-serif;}
a:link, span.MsoHyperlink
	{mso-style-priority:99;
	color:#0563C1;
	text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
	{mso-style-priority:99;
	color:#954F72;
	text-decoration:underline;}
p.MsoPlainText, li.MsoPlainText, div.MsoPlainText
	{mso-style-priority:99;
	mso-style-link:"Plain Text Char";
	margin:0in;
	margin-bottom:.0001pt;
	font-size:11.0pt;
	font-family:"Calibri",sans-serif;}
span.EmailStyle17
	{mso-style-type:personal-compose;
	font-family:"Calibri",sans-serif;
	color:windowtext;}
span.PlainTextChar
	{mso-style-name:"Plain Text Char";
	mso-style-priority:99;
	mso-style-link:"Plain Text";
	font-family:"Calibri",sans-serif;}
.MsoChpDefault
	{mso-style-type:export-only;
	font-family:"Calibri",sans-serif;}
@page WordSection1
	{size:8.5in 11.0in;
	margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
	{page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang="EN-US" link="#0563C1" vlink="#954F72">
<div class="WordSection1">
<p class="MsoPlainText">A few excerpts:<o:p></o:p></p>
<p class="MsoPlainText"><o:p>&nbsp;</o:p></p>
<p class="MsoPlainText">The Unicode Standard is one of the most readable books ever \
to have the word &quot;Standard&quot; in its title and an online version is freely \
available (<a href="https://www.unicode.org/versions/Unicode4.0.0/ch01.pdf">https://www.unicode.org/versions/Unicode4.0.0/ch01.pdf</a>).
  Really you should go and read that instead, but people often don&#8217;t when I say \
that. So here&#8217;s something shorter that&#8217;ll help you out a bit and \
hopefully also make you realize that this stuff is interesting in itself and \
you&#8217;ll be convinced the book&#8217;s worth  taking a look at.<o:p></o:p></p>
<p class="MsoPlainText"><o:p>&nbsp;</o:p></p>
<p class="MsoPlainText">When I say &quot;XML application&quot; in this document \
I&#8217;m referring to a type of XML document combined with a specification as to how \
it should be used, as such XHTML, RDF/XML and SVG are XML applications. I am not \
referring to the software  that processes or produces such documents, although they \
are also referred to as applications in different contexts.<o:p></o:p></p> <p \
class="MsoPlainText"><o:p>&nbsp;</o:p></p> <p class="MsoPlainText">Just what is a \
character? Is 'a' the shape you can see if you are reading this on screen or page, or \
is it the sound you can hear if you are using a screen reader, or is it the pattern \
of bumps you can feel if you are using a Braille  reader? Is the 'a' in \
&quot;nap&quot; the same as the 'a' in &quot;nape&quot;? Is 'a' the dot-dash of Morse \
Code, or the 0x61 of a byte in a computer&#8217;s memory?<o:p></o:p></p> <p \
class="MsoPlainText"><o:p>&nbsp;</o:p></p> <p class="MsoPlainText">None of these \
descriptions are adequate descriptions of the letter 'a'. Yet young children manage \
to handle the concept of 'a' well enough to set them on the path to learning to read \
and write. If a two-year-old can manage it then it  can&#8217;t be that \
hard!<o:p></o:p></p> <p class="MsoPlainText"><o:p>&nbsp;</o:p></p>
<p class="MsoNormal">Clearly 'a' is an abstract concept. Talking about it as an \
abstract concept is difficult, as are most abstract concepts, but we can recognize \
the relationship between this concept and the shapes, sounds and other communication \
methods.<o:p></o:p></p> <p class="MsoPlainText"><o:p>&nbsp;</o:p></p>
<p class="MsoPlainText"><a \
href="https://www.hackcraft.net/xmlUnicode/">https://www.hackcraft.net/xmlUnicode/</a>
 <o:p></o:p></p>
<p class="MsoNormal"><o:p>&nbsp;</o:p></p>
</div>
</body>
</html>



[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic