|
From: | Paul Eggert |
Subject: | better i18n for join, uniq, etc. |
Date: | Mon, 30 Oct 2023 01:48:44 -0700 |
User-agent: | Mozilla Thunderbird |
The idea is to continue this process of using mcel for the other programs where vanilla Coreutils doesn't conform to POSIX in multi-byte locales.
The key patch is 0009. Patch 0010 brings in the Fedora tests for join and uniq in multi-byte locales; these tests pass for me.
Some work is still needed for ignoring case in join and uniq. As I understand it, the Fedora patches don't support 'uniq --ignore-case' in multi-byte locales. They do support 'join --ignore-case', though they ignore it in the simple-minded way that GNU diff does (except diff lowercases first whereas Fedora join uppercases first; although neither approach is perfect isn't lowercasing better?).
Comments welcome. If the idea isn't a good one we can back out the patches. But I hope this can move forward.
0001-maint-prefer-c_isxdigit-when-that-is-the-intent.patch
Description: Text Data
0002-digest-omit-unnecessary-b2sum-includes.patch
Description: Text Data
0003-maint-move-field_sep-into-separate-module.patch
Description: Text Data
0004-maint-include-ctype.h-selectively.patch
Description: Text Data
0005-maint-port-to-oddball-tolower.patch
Description: Text Data
0006-dircolors-assume-C-locale-spaces.patch
Description: Text Data
0007-stdbuf-port-to-oddball-toupper.patch
Description: Text Data
0008-test-allow-non-blank-white-space-in-numbers.patch
Description: Text Data
0009-join-uniq-support-multi-byte-separators.patch
Description: Text Data
0010-maint-copy-join-uniq-tests-from-Fedora.patch
Description: Text Data
0011-maint-pacify-make-syntax-check.patch
Description: Text Data
[Prev in Thread] | Current Thread | [Next in Thread] |