[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: International support
From: |
Kevin Atkinson |
Subject: |
Re: International support |
Date: |
Sun, 10 Jan 1999 01:24:29 +0000 |
Samphan Raruenrom wrote:
> Very interesting!
> In Thai, we don't put spaces between words at all so
> the same situation happends naturally.
> Typical Thai word-segmentation algorithm (which usually
> do spelling check also) use maximal-match backtracking
> algorithm with trie word list(s).
> My implementation is at http://www.thai.net/libinthai/
> IBM Classes for Unicode implementation is at
> http://www.ibm.com/java/education/boundaries/boundaries.html
Ok so how do you detect bonduries of unknown or misspelled words.
--
Kevin Atkinson
address@hidden
http://metalab.unc.edu/kevina/