[prev in list] [next in list] [prev in thread] [next in thread]
List: openjdk-openjfx-dev
Subject: Re: RFR: JDK-8314215 Trailing Spaces before Line Breaks Affect the Center Alignment of Text
From: John Hendrikx <jhendrikx () openjdk ! org>
Date: 2023-10-30 19:59:43
Message-ID: E1DDRiRpkdd0zPyUzVTEU-JEXwCAXwVUsJL7wSHD8vI=.dbc8f183-55aa-4a6f-81d7-07838ad23321 () github ! com
[Download RAW message or body]
On Mon, 30 Oct 2023 19:00:08 GMT, Andy Goryachev <angorya@openjdk.org> wrote:
> > There are a number of tickets open related to text rendering:
> >
> > https://bugs.openjdk.org/browse/JDK-8314215
> >
> > https://bugs.openjdk.org/browse/JDK-8145496
> >
> > https://bugs.openjdk.org/browse/JDK-8129014
> >
> > They have in common that wrapped text is taking the trailing spaces on each \
> > wrapped line into account when calculating where to wrap. This looks okay for \
> > text that is left aligned (as the spaces will be trailing the lines and generally \
> > aren't a problem, but looks weird with CENTER and RIGHT alignments. Even with \
> > LEFT alignment there are artifacts of this behavior, where a line like `AAA BBB \
> > CCC` (note the **double** spaces) gets split up into `AAA `, `BBB ` and `CCC`, \
> > but if space reduces further, it will wrap **too** early because the space is \
> > taken into account (ie. `AAA` may still have fit just fine, but `AAA ` doesn't, \
> > so the engine wraps it to `AA` + `A ` or something).
> > The fix for this is two fold; first the individual lines of text should not \
> > include any trailing spaces into their widths; second, the code that is taking \
> > the trailing space into account when wrapping should ignore all trailing spaces \
> > (currently it is ignoring all but one trailing space). With these two fixes, the \
> > layout in LEFT/CENTER/RIGHT alignments all look great, and there is no more early \
> > wrapping due to a space being taking into account while the actual text still \
> > would have fit (this is annoying in tight layouts, where a line can be wrapped \
> > early even though it looks like it would have fit).
> > If it were that simple, we'd be done, but there may be another issue here that \
> > needs solving: wrapped aligned TextArea's.
> > TextArea don't directly support text alignment (via a setTextAlignment method \
> > like Label) but you can change it via CSS.
> > For Left alignment + wrapping, TextArea will ignore any spaces typed before a \
> > line that was wrapped. In other words, you can type spaces as much as you want, \
> > and they won't show up and the cursor won't move. The spaces are all getting \
> > appended to the previous line. When you cursor through these spaces, the cursor \
> > can be rendered out of the control's bounds. To illustrate, if you have the text \
> > `AAA BBB CCC`, and the text gets wrapped to `AAA`, `BBB`, `CCC`, \
> > typing spaces before `BBB` will not show up. If you cursor back, the cursor may \
> > be outside the control bounds because so many spaces are trailing `AAA`.
> > The above behavior has NOT changed, is pretty standard for wrapped text \
> > controls,...
>
> From https://www.unicode.org/reports/tr14-4/
>
>
> 5.6 Break opportunity after characters (A)
> Breaking Spaces
> SPACE (SP) � U+0020
>
> The space characters are explicit break opportunities, but spaces at the end of a \
> line are not measured for fit. If there is a sequence of space characters, and \
> breaking after any of the space characters would result in the same visible line, \
> the line breaking position after the last space character in the sequence is the \
> locally most optimal one. In other words, since the last character measured for fit \
> is BEFORE the space character, any number of space characters are kept together \
> invisibly on the previous line and the first non-space character starts the next \
> line.
> It is sometimes convenient to use SP, but not the other breaking spaces to override \
> context based behavior of other characters under the "anywhere, except where \
> prohibited" style of line breaking (context analysis style 2).
> EN QUAD � U+2000
> EM QUAD � U+2001
> EN SPACE � U+2002
> EM SPACE � U+2003
> THREE-PER-EM SPACE � U+2004
> FOUR-PER-EM SPACE � U+2005
> SIX-PER-EM SPACE � U+2006
> PUNCTUATION SPACE � U+2008
> THIN SPACE � U+2009
> HAIR SPACE � U+200A
>
> The preceding list of characters all have a specific width, but behave otherwise as \
> breaking spaces .
> ZERO WIDTH SPACE (ZWSP) � U+200B
>
> This character does not have width. It is used in a style 2 context analysis to \
> provide additional (invisible) break opportunities.
> IDEOGRAPHIC SPACE � U+3000
>
> This character has the width of an ideograph but like ZWSP is fully subject to the \
> style 2 context analysis.
>
> A quick check with (the latest?) MS Word 2208 on Windows shows that, at least with \
> EN QUAD U+2000 it is treated as a regular character (i.e. it is always "displayed" \
> even if right aligned).
@andy-goryachev-oracle
> The space characters are explicit break opportunities, but spaces at the end of a \
> line are not measured for fit. If there is a sequence of space characters, and \
> breaking after any of the space characters would result in the same visible line, \
> the line breaking position after the last space character in the sequence is the \
> locally most optimal one. In other words, since the last character measured for fit \
> is BEFORE the space character, any number of space characters are kept together \
> invisibly on the previous line and the first non-space character starts the next \
> line.
That's certainly a good description of how the breaking should be handled.
-------------
PR Comment: https://git.openjdk.org/jfx/pull/1236#issuecomment-1785940681
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic