
List:       apache-modperl
Subject:    Re: Weird behaviour with strings and accents
From:       Marius Feraru <altblue () n0i ! net>
Date:       2006-12-29 15:09:09
Message-ID: 45952F95.80605 () n0i ! net

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

tomas@tuxteam.de wrote:
> On Thu, Dec 28, 2006 at 07:18:29PM +0100, Cyril SCETBON wrote:
>> tomas@tuxteam.de wrote:
>>> On Wed, Dec 27, 2006 at 04:56:50PM +0100, Cyril SCETBON wrote:
>> [no utf8]
> 
>> $VAR1 = [
>>           'à présent protégé'
>>         ];
>> $VAR1 = [
>>           "\x{c3}\x{a0} pr\x{c3}\x{a9}sent prot\x{c3}\x{a9}g\x{c3}\x{a9}login774"
>>         ];
>> It's really weird, isn't it ???
> No, it isn't. So your source is actually in iso-8859-1. Just the second
> half looks weird to me (what is changing the perceived encoding of the
> string after appending something seemingly harmless?).

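Not really. Cyril didn't give us any details about his environment, so
my best guess is that "$login" is already UTF-8 decoded (i.e. it carries
perl's utf8 flag). That would explain the "automatic decoding": when the
two strings are concatenated, perl upgrades the undecoded byte string as
if it were iso-8859-1, so each byte of "à" and "é" becomes a character
of its own.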
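
To illustrate that guess, here is a minimal sketch, not Cyril's actual
code. It assumes the literal is an undecoded byte string (UTF-8 source,
no "use utf8") and simulates an already-decoded "$login" with
utf8::upgrade(); the value 'login774' is taken from the dump above,
everything else is hypothetical.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Assumption: UTF-8 source without "use utf8", so this literal is a
# plain byte string (11 bytes) holding the UTF-8 encoding of "à présent".
my $bytes = "\xc3\xa0 pr\xc3\xa9sent";

# Hypothetical stand-in for $login: a string that already carries the
# utf8 flag, as if some upstream layer had decoded it.
my $login = 'login774';
utf8::upgrade($login);

# Concatenation upgrades $bytes as if it were iso-8859-1, so each byte
# of the UTF-8 sequences becomes a separate character.
my $joined = $bytes . $login;

# Decoding the byte string first keeps "à" and "é" as single characters.
my $fixed = decode('UTF-8', $bytes) . $login;

printf "joined: %d chars, first char U+%04X\n",
    length($joined), ord(substr($joined, 0, 1));
printf "fixed:  %d chars, first char U+%04X\n",
    length($fixed), ord(substr($fixed, 0, 1));
```

If this is what happens in Cyril's setup, decoding the byte string (or
using "use utf8" for literals in a UTF-8 source file) before the
concatenation should make the \x{c3}\x{a0} pairs go away.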

- --
Marius Feraru
-----BEGIN PGP SIGNATURE-----

iD8DBQFFlS+VtZHp/AYZiNkRAo3rAJ9gdao+NjCGZVc55atRDvvRgOv+iwCgtl3X
F9TezyuFsOhak2bw0oXTK+s=
=bcsM
-----END PGP SIGNATURE-----