From koffice-devel Mon Aug 26 15:13:09 2002 From: Vadim Plessky Date: Mon, 26 Aug 2002 15:13:09 +0000 To: koffice-devel Subject: Re: unicode char in KWord X-MARC-Message: https://marc.info/?l=koffice-devel&m=103037574520600 On Saturday 24 August 2002 2:18 pm, Ariya Hidayat wrote: | I need help. I use KWord to import a test WordPerfect document, which have | Unicode quotes characters, U201F and U201E respectively. Using Helvetica, | Arial, or Courier New, these 2 characters don't appear. They show up only | in Unifont. | I think I found a bug in KWord filter, and it's also related to font encodings. Please get this Word document first: http://freetype.newmail.ru/misc/serbian.doc which contains a page in Serbian language. // In fact, I was testing excatly non-Russian Cyrilic languages today, and hitted this bug. Import it into KWord, and you will get this: http://freetype.newmail.ru/misc/Serbian.kwd which is *not ok*. First paragrpah should be: "Српска усмена књижевност настала је, уз наслеђе које је морала донети из старе словенске постојбине, на размеђи источне и западне цивилизације, у сусрету са затеченом традицијом на новом тлу, у непосредном додиру с наслеђем антике, а касније, у својеврсној одбрани од оријенталних утицаја и у прожимању с њима." (you can read/view this text if your MUA support Unicode, and Unicode fonts are installed) Note specific Serbian characters "њ", "ђ", "ј" (which is different/not present in Russian) - and "њ", "ђ" are missing after import of Word .doc into KWord :-( Than I just pasted text from the web page to KWord it can be found at web page http://www.rastko.org.yu/isk/isk_11_c.html, project Rastko) Here is *correct* KWord file http://freetype.newmail.ru/misc/Serbian-pasted.kwd And same text exported to HTML (works ok): http://freetype.newmail.ru/misc/Serbian-cp1251.html http://freetype.newmail.ru/misc/Serbian-UTF8.html Can someone check what's happening wrong in KWord's Word import filter, and fix it for the Release? | My question: does Qt automatically handling things like this ? I mean, I | load document with many Cyrillic (and Hebrew) characters and it's just | fine (using either Arial or Helvetica). Why do simple quote characters pose | a problem ? Am I missing something ? | | _______________________________________________ | koffice-devel mailing list | koffice-devel@mail.kde.org | http://mail.kde.org/mailman/listinfo/koffice-devel -- Vadim Plessky http://kde2.newmail.ru (English) 33 Window Decorations and 6 Widget Styles for KDE http://kde2.newmail.ru/kde_themes.html KDE mini-Themes http://kde2.newmail.ru/themes/ _______________________________________________ koffice-devel mailing list koffice-devel@mail.kde.org http://mail.kde.org/mailman/listinfo/koffice-devel