[PATCH] Wide characters
From: Mike Gran
Subject: [PATCH] Wide characters
Date: Sat, 21 Feb 2009 03:16:39 -0800 (PST)
Hi-
I've been playing with this wide char stuff, and I have a patch that
would move the internal representation of characters to UCS-4.
On its own the patch is not useful yet: with it applied, strings are
still stored internally as 8-bit chars, so there is no way to use
the wide characters in strings.
It is all pretty simple. Since the internal representation of chars
becomes UCS-4, I used scm_t_uint32 as the char type, and I removed the
code that supported EBCDIC-encoded characters. I changed the tables
of character names to deal with more names and discontiguous control
characters. And, as a temporary kludge, I made a macro
SCM_MAKE_8BIT_CHAR to cast the 8-bit characters used in strings to a
scm_t_uint32. Also, I used functions from the Gnulib unicase and
unictype modules for character properties, including a couple that
Bruno Haible of Gnulib was kind enough to create for me.
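For illustration, here is a minimal sketch of how a cast macro like
SCM_MAKE_8BIT_CHAR could work. It is not the definition from the patch:
uint32_t stands in for scm_t_uint32, widen_8bit_string is a hypothetical
helper, and the string bytes are assumed to be Latin-1, so each byte value
maps directly to the code point of the same number.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Stand-in for Guile's scm_t_uint32 (assumption: a plain 32-bit unsigned type). */
typedef uint32_t scm_t_uint32;

/* Hypothetical definition; the macro in the patch may differ.
   Casting through unsigned char first keeps bytes >= 0x80 from
   sign-extending on platforms where plain char is signed. */
#define SCM_MAKE_8BIT_CHAR(c) ((scm_t_uint32) (unsigned char) (c))

/* Hypothetical helper: widen up to cap bytes of a NUL-terminated
   8-bit string into 32-bit character values. Returns the number of
   characters written. */
size_t
widen_8bit_string (const char *src, scm_t_uint32 *dst, size_t cap)
{
  size_t n = 0;

  assert (src != NULL && dst != NULL);
  while (n < cap && src[n] != '\0')
    {
      dst[n] = SCM_MAKE_8BIT_CHAR (src[n]);
      n++;
    }
  return n;
}
```

Under these assumptions the byte 0xE9 widens to 0x000000E9 rather than
0xFFFFFFE9, which is what a direct cast from a signed char would give.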
Thanks,
Mike
The gnulib invocation for this was:

  gnulib-tool --import --dir=. --lib=libgnu --source-base=lib \
    --m4-base=m4 --doc-base=doc --tests-base=tests --aux-dir=build-aux \
    --lgpl --libtool --macro-prefix=gl --no-vc-files \
    alloca-opt autobuild count-one-bits extensions full-read full-write \
    strcase strftime unicase/tolower unicase/toupper \
    unictype/property-alphabetic unictype/property-lowercase \
    unictype/property-numeric unictype/property-uppercase \
    unictype/property-white-space

Attachment: patch.txt
Description: Text document