From: | Kevin Atkinson |
Subject: | Re: [Aspell-user] Affixes leftover from expanded wordlist dumps |
Date: | Fri, 6 Jun 2008 16:21:15 -0600 (MDT) |
On Fri, 6 Jun 2008, Isaac Colley wrote:
Hello, I am building a dictionary-based language detection program using the dumps of aspell dictionaries. I need to expand the wordlists completely; however, for some languages, such as Russian, expansion leaves behind affixes (I think) after a '?'. For example:
aspell dump master ru | aspell -l ru expand
I think it might be an encoding problem. Try setting your locale to C by setting the LANG environment variable, then check that the locale actually changed. For example, using bash:
$ export LANG=C
$ locale
LANG=C
LC_CTYPE="C"
LC_COLLATE="C"
LC_TIME="C"
LC_NUMERIC="C"
LC_MONETARY="C"
LC_MESSAGES="C"
LC_ALL=

You might also need to set LC_ALL.
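If you would rather not change the locale of your whole shell session, the same idea can be applied to one command at a time. The following is only a sketch: in_c_locale is a made-up helper name, not part of aspell, and whether the C locale actually removes the leftover affixes is still just the guess made above. Because LC_ALL takes precedence over LANG and the individual LC_* variables, the helper sets both.

```shell
# Hypothetical helper (not part of aspell): run one command with the
# C locale forced for that command only, leaving the calling shell as-is.
in_c_locale() {
    LC_ALL=C LANG=C "$@"
}

# Show what a child process sees when started through the helper.
in_c_locale sh -c 'echo "LANG=$LANG LC_ALL=$LC_ALL"'
```

With the helper defined, each stage of the pipeline from the original question could be started as "in_c_locale aspell dump master ru | in_c_locale aspell -l ru expand".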