[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: tr doesn't work for multibyte encodings
From: |
Jim Meyering |
Subject: |
Re: tr doesn't work for multibyte encodings |
Date: |
Wed, 15 Dec 2004 17:17:32 +0100 |
Tim Waugh <address@hidden> wrote:
> In 5.2.1, tr doesn't work for multibyte encodings such as UTF-8. Are
> there any plans to fix this?
It's most certainly on the list, (hmm.. it's been on the list in my head
for years, but wasn't listed in the TODO file -- I've just added it there)
The same applies to many of the other word/line/character-based programs,
like wc, fmt, cut, sort, etc. Here's what I've added to TODO:
Adapt tools like wc, tr, fmt, etc. (most of the textutils) to be
multibyte aware. The problem is that I want to avoid duplicating
significant blocks of logic, yet I also want to incur only minimal
(preferably `no') cost when operating in single-byte mode.