[prev in list] [next in list] [prev in thread] [next in thread]
List: konq-bugs
Subject: Bug#45970: Some unicode glyphs replaced with incorrect glyphs
From: ian.goldby () ntlworld ! com
Date: 2002-07-30 22:17:13
[Download RAW message or body]
Package: khtml
Version: 4.0 (using KDE 3.0.2 )
Severity: normal
Installed from: Gentoo
Compiler: gcc version 2.95.3 20010315 (release)
OS: Linux (i686) release 2.4.18-xfs
OS/Compiler notes:
The following test case renders incorrect glyphs the following unicode entities in \
all fonts that I have tried. (Mozilla does render the correct entities.)
shi, middot, ndash, mdash, lsquo, rsquo, ldquo, rdquo, thetasym, upsih, piv, and \
bull.
(The character encoding is specified as UTF-8, but it makes no difference to the \
result.)
E.g. lsquo and rsquo render as a single straight quote, middot and bull renders as an \
asterisk, thetasym, upsih and piv render as question marks.
If however the file extension is changed from .html to .xml, then all apart from shy, \
thetasym, upsih and piv render correctly when addressed by their number, but are \
missed completely when addressed by name. In fact, no entities are rendered when \
addressed by name if the file extension is .xml
Test case (to be saved as test.html and test.xml):
<?xml version="1.0" ?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "DTD/xhtml1-strict.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<title>Strict xhtml test document</title>
</head>
<body>
<table>
<tbody>
<tr><td>shy</td><td>­</td><td>&#173;</td><td>­</td><td>U+00AD</td><td>soft \
hyphen = discretionary hyphen</td><td>ISOnum</td></tr> \
<tr><td>middot</td><td>·</td><td>&#183;</td><td>·</td><td>U+00B7</td><td>middle \
dot = Georgian comma = Greek middle dot</td><td>ISOnum</td></tr> \
<tr><td>ndash</td><td>–</td><td>&#8211;</td><td>–</td><td>U+2013</td><td>en \
dash</td><td>ISOpub</td></tr> \
<tr><td>mdash</td><td>—</td><td>&#8212;</td><td>—</td><td>U+2014</td><td>em \
dash</td><td>ISOpub</td></tr> \
<tr><td>lsquo</td><td>‘</td><td>&#8216;</td><td>‘</td><td>U+2018</td><td>left \
single quotation mark</td><td>ISOnum</td></tr> \
<tr><td>rsquo</td><td>’</td><td>&#8217;</td><td>’</td><td>U+2019</td><td>right \
single quotation mark</td><td>ISOnum</td></tr> \
<tr><td>ldquo</td><td>“</td><td>&#8220;</td><td>“</td><td>U+201C</td><td>left \
double quotation mark</td><td>ISOnum</td></tr> \
<tr><td>rdquo</td><td>”</td><td>&#8221;</td><td>”</td><td>U+201D</td><td>right \
double quotation mark</td><td>ISOnum</td></tr> \
<tr><td>thetasym</td><td>ϑ</td><td>&#977;</td><td>ϑ</td><td>U+03D1</td><td>greek \
small letter theta symbol</td><td>NEW</td></tr> \
<tr><td>upsih</td><td>ϒ</td><td>&#978;</td><td>ϒ</td><td>U+03D2</td><td>greek \
upsilon with hook symbol</td><td>NEW</td></tr> \
<tr><td>piv</td><td>ϖ</td><td>&#982;</td><td>ϖ</td><td>U+03D6</td><td>greek \
pi symbol</td><td>ISOgrk3</td></tr> \
<tr><td>bull</td><td>•</td><td>&#8226;</td><td>•</td><td>U+2022</td><td>bullet \
= black small circle</td><td>ISOpub</td></tr> </tbody>
</table>
</body>
</html>
(Submitted via bugs.kde.org)
(Called from KBugReport dialog)
(Complete bug history is available at http://bugs.kde.org/db/45/45970.html)
_______________________________________________
Konq-bugs mailing list
Konq-bugs@mail.kde.org
http://mail.kde.org/mailman/listinfo/konq-bugs
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic