[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[bug-libunistring] UAX #29 changes

From: Daiki Ueno
Subject: [bug-libunistring] UAX #29 changes
Date: Wed, 19 Oct 2016 16:11:17 +0200
User-agent: Gnus/5.13 (Gnus v5.13) Emacs/25.1.50 (gnu/linux)


I have been trying to update libunistring to Unicode 9.0.0.  Initially I
planned it for the end of this month, but now I'm almost giving up,
because of the recent additions to the UAX #29 algorithms:

- The 3 rules added to the Grapheme Cluster Boundary Rules, namely
  (GB10, GB12, GB13), involve 3 consequent characters, while the current
  API uc_is_grapheme_break() only takes 2 characters

- The similar rules are also added to the Word Boundary Rules.  Though
  it wouldn't be a problem as uniwbrk.h doesn't expose such API, the
  implementation of WB15 and WB16 could be complicated because it
  requires lookahead of a next character

I have pushed the current version to my github repository:

but I will be unlikely to have time to complete it any time soon.
Sorry.  If anyone could finish it or rewrite it, it would be

By the way, I would like to change the "Maintainer:" field of the Gnulib
modules to "all", if nobody opposes.

Daiki Ueno

reply via email to

[Prev in Thread] Current Thread [Next in Thread]