
List:       konq-bugs
Subject:    Bug#45970: Some unicode glyphs replaced with incorrect glyphs
From:       ian.goldby () ntlworld ! com
Date:       2002-07-30 22:17:13

Package: khtml
Version: 4.0 (using KDE 3.0.2 )
Severity: normal
Installed from:    Gentoo
Compiler:          gcc version 2.95.3 20010315 (release)
OS:                Linux (i686) release 2.4.18-xfs
OS/Compiler notes: 

The test case below renders incorrect glyphs for the following Unicode entities
in all fonts that I have tried. (Mozilla renders these entities correctly.)

shy, middot, ndash, mdash, lsquo, rsquo, ldquo, rdquo, thetasym, upsih, piv, and
bull.

(The character encoding is specified as UTF-8, but it makes no difference to the
result.)

E.g. lsquo and rsquo each render as a single straight quote, middot and bull
render as an asterisk, and thetasym, upsih and piv render as question marks.

If, however, the file extension is changed from .html to .xml, then all apart
from shy, thetasym, upsih and piv render correctly when addressed by number, but
are omitted completely when addressed by name. In fact, no entities are rendered
when addressed by name if the file extension is .xml.
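
One possible workaround for the .xml case, sketched here in Python and not
tested against khtml, is to rewrite named entities as numeric character
references before saving the file. The helper name to_numeric_refs is
hypothetical; html.entities.name2codepoint is Python's standard table of HTML 4
entity names. The five entities predefined by XML are left untouched.

```python
import re
from html.entities import name2codepoint

# The five entities predefined by XML itself; these need no DTD.
XML_PREDEFINED = {"amp", "lt", "gt", "quot", "apos"}

def to_numeric_refs(markup):
    """Replace named HTML entities with decimal character references."""
    def replace(match):
        name = match.group(1)
        # Leave XML's own entities and unknown names exactly as written.
        if name in XML_PREDEFINED or name not in name2codepoint:
            return match.group(0)
        return "&#%d;" % name2codepoint[name]
    return re.sub(r"&([A-Za-z][A-Za-z0-9]*);", replace, markup)

row = "<td>lsquo</td><td>&lsquo;</td><td>&amp;#8216;</td>"
converted = to_numeric_refs(row)
```

This only sidesteps the missing named entities; it does nothing for shy,
thetasym, upsih and piv, which are wrong even when addressed by number.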

Test case (to be saved as test.html and test.xml):

<?xml version="1.0" ?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "DTD/xhtml1-strict.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<title>Strict xhtml test document</title>
</head>

<body>
<table>
<tbody>
<tr><td>shy</td><td>&shy;</td><td>&amp;#173;</td><td>&#173;</td><td>U+00AD</td><td>soft hyphen = discretionary hyphen</td><td>ISOnum</td></tr>
<tr><td>middot</td><td>&middot;</td><td>&amp;#183;</td><td>&#183;</td><td>U+00B7</td><td>middle dot = Georgian comma = Greek middle dot</td><td>ISOnum</td></tr>
<tr><td>ndash</td><td>&ndash;</td><td>&amp;#8211;</td><td>&#8211;</td><td>U+2013</td><td>en dash</td><td>ISOpub</td></tr>
<tr><td>mdash</td><td>&mdash;</td><td>&amp;#8212;</td><td>&#8212;</td><td>U+2014</td><td>em dash</td><td>ISOpub</td></tr>
<tr><td>lsquo</td><td>&lsquo;</td><td>&amp;#8216;</td><td>&#8216;</td><td>U+2018</td><td>left single quotation mark</td><td>ISOnum</td></tr>
<tr><td>rsquo</td><td>&rsquo;</td><td>&amp;#8217;</td><td>&#8217;</td><td>U+2019</td><td>right single quotation mark</td><td>ISOnum</td></tr>
<tr><td>ldquo</td><td>&ldquo;</td><td>&amp;#8220;</td><td>&#8220;</td><td>U+201C</td><td>left double quotation mark</td><td>ISOnum</td></tr>
<tr><td>rdquo</td><td>&rdquo;</td><td>&amp;#8221;</td><td>&#8221;</td><td>U+201D</td><td>right double quotation mark</td><td>ISOnum</td></tr>
<tr><td>thetasym</td><td>&thetasym;</td><td>&amp;#977;</td><td>&#977;</td><td>U+03D1</td><td>greek small letter theta symbol</td><td>NEW</td></tr>
<tr><td>upsih</td><td>&upsih;</td><td>&amp;#978;</td><td>&#978;</td><td>U+03D2</td><td>greek upsilon with hook symbol</td><td>NEW</td></tr>
<tr><td>piv</td><td>&piv;</td><td>&amp;#982;</td><td>&#982;</td><td>U+03D6</td><td>greek pi symbol</td><td>ISOgrk3</td></tr>
<tr><td>bull</td><td>&bull;</td><td>&amp;#8226;</td><td>&#8226;</td><td>U+2022</td><td>bullet = black small circle</td><td>ISOpub</td></tr>
</tbody>
</table>

</body>
</html>
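
The name, number and code point columns in the table above can be cross-checked
independently of any browser. The following is a minimal sketch in Python; the
EXPECTED dictionary and the check_entities helper are names made up for this
illustration, and the values are copied from the table. It compares each entry
against html.entities.name2codepoint from the Python standard library.

```python
from html.entities import name2codepoint

# Entity names and code points as listed in the test case table.
EXPECTED = {
    "shy": 0x00AD,
    "middot": 0x00B7,
    "ndash": 0x2013,
    "mdash": 0x2014,
    "lsquo": 0x2018,
    "rsquo": 0x2019,
    "ldquo": 0x201C,
    "rdquo": 0x201D,
    "thetasym": 0x03D1,
    "upsih": 0x03D2,
    "piv": 0x03D6,
    "bull": 0x2022,
}

def check_entities(expected):
    """Return the names whose standard code point differs from the table."""
    return [name for name, codepoint in expected.items()
            if name2codepoint.get(name) != codepoint]

mismatches = check_entities(EXPECTED)
print(mismatches or "all entity names match their code points")
```

An empty result means the test case itself is consistent, so any wrong glyph
comes from the renderer or the font rather than from the markup.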


(Submitted via bugs.kde.org)
(Called from KBugReport dialog)


(Complete bug history is available at http://bugs.kde.org/db/45/45970.html)
_______________________________________________
Konq-bugs mailing list
Konq-bugs@mail.kde.org
http://mail.kde.org/mailman/listinfo/konq-bugs

