[prev in list] [next in list] [prev in thread] [next in thread] 

List:       sqwebmail
Subject:    Re: [sqwebmail] Re: content encoding etc. (rewritten)
From:       Reinke Bonte <rbonte () choicenet ! ne ! jp>
Date:       2004-05-19 6:12:22
Message-ID: 20040519151222.368688db.rbonte () choicenet ! ne ! jp
[Download RAW message or body]

> > East Asian languages don't use spaces so you can wrap after almost
> > any character (there are defined exceptions, which are different per
> > language).
> 
> I wonder if there's a convenient table on unicode.org that identifies
> the characters involved.
> 

This document is pretty exhaustive on that topic (and include c sample
code for line wrapping): 

http://www.unicode.org/reports/tr14/index.html

As an example:
OpenOffice suggest these characters as non-line-starters in Japanese

!%),.:;?]}$B!q!k!G!I"s!l!m!n!"!#!9!S!U!W!Y![!M$!$#$%$'$)$C$c$e$g$n!+!,!5!6%!(B
$B%#%%%'%)%C%c%e%g%n%u%v!&!<!3!4!*!s!K!$!%!'!(!)!O!Q_________________________________(B
_______________$B!q(B

And these characters as non-line-enders

$([(J\{$B!r(J\$B!F!H!R!T!V!X!Z!L!p!J!N!P___!r!o(B


Reinke
[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic