
List:       aspell-user
Subject:    Re: [Aspell-user] Affixes leftover from expanded wordlist dumps
From:       Kevin Atkinson <kevina () gnu ! org>
Date:       2008-06-06 22:21:15
Message-ID: 20080606161458.V78361 () bas ! flux ! utah ! edu

On Fri, 6 Jun 2008, Isaac Colley wrote:

> Hello,
>
> I am building a dictionary based language detection program using the
> dumps of aspell dictionaries.
>
> I need to expand wordlists completely, however some languages, such as
> Russian, after expansion will leave behind affixes (I think) after a
> '?'.  For example:
>
> aspell dump master ru | aspell -l ru expand

I think it might be an encoding problem.  Try setting your locale to C by
setting the LANG environment variable, then check that the locale actually
changed.  For example, using bash:

   $ export LANG=C
   $ locale
   LANG=C
   LC_CTYPE="C"
   LC_COLLATE="C"
   LC_TIME="C"
   LC_NUMERIC="C"
   LC_MONETARY="C"
   LC_MESSAGES="C"
   LC_ALL=

You might also need to set LC_ALL.
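
As a rough sketch of that last step (assuming bash and a system that
provides the standard locale utility): when LC_ALL is set it overrides LANG
and every individual LC_* category, so exporting it is the more forceful
option if some other LC_* variable is already set in your environment.  The
variable name ctype below is only for illustration.

```shell
# LC_ALL, when set, takes precedence over LANG and all LC_* categories.
export LANG=C
export LC_ALL=C

# Capture the effective character-type locale to confirm the change took.
ctype=$(locale | grep '^LC_CTYPE=')
echo "$ctype"
```

If LC_CTYPE is reported as C, re-run the original pipeline from the same
shell (aspell dump master ru | aspell -l ru expand) and see whether the
leftover text after the '?' is gone.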



