[prev in list] [next in list] [prev in thread] [next in thread]
List: apache-modperl
Subject: Re: Weird behaviour with strings and accents
From: Marius Feraru <altblue () n0i ! net>
Date: 2006-12-29 15:09:09
Message-ID: 45952F95.80605 () n0i ! net
[Download RAW message or body]
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
tomas@tuxteam.de wrote:
> On Thu, Dec 28, 2006 at 07:18:29PM +0100, Cyril SCETBON wrote:
>> tomas@tuxteam.de wrote:
>>> On Wed, Dec 27, 2006 at 04:56:50PM +0100, Cyril SCETBON wrote:
>> [no utf8]
>
>> $VAR1 = [
>> 'à présent protégé'
>> ];
>> $VAR1 = [
>> "\x{c3}\x{a0} pr\x{c3}\x{a9}sent
>> prot\x{c3}\x{a9}g\x{c3}\x{a9}login774"
>> ];
>> It's really weird, isn't it ???
> No, it isn't. So your source is actually in iso-8859-1. Just the second
> half looks weird to me (what is changing the perceived encoding of the
> string after appending something seemingly harmless?
Not really, as Cyril didn't provide us with details about his
environment, so my best guess is that "$login" is already utf8 decoded,
that's why the "automatic decoding" happens at concatenating those strings.
- --
Marius Feraru
-----BEGIN PGP SIGNATURE-----
iD8DBQFFlS+VtZHp/AYZiNkRAo3rAJ9gdao+NjCGZVc55atRDvvRgOv+iwCgtl3X
F9TezyuFsOhak2bw0oXTK+s=
=bcsM
-----END PGP SIGNATURE-----
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic