[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
bug#7372: fmt and multi-byte encodings
From: |
Pádraig Brady |
Subject: |
bug#7372: fmt and multi-byte encodings |
Date: |
Thu, 11 Nov 2010 16:01:02 +0000 |
User-agent: |
Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1.8) Gecko/20100227 Thunderbird/3.0.3 |
On 11/11/10 09:32, Ineiev wrote:
> Hello;
>
> Today I fed a text in Russian in UTF-8 to fmt
> and discovered that the utility counts the line width
> in bytes rather than in characters (the lines written in
> Cyrillics were roughly twice as short as the lines
> written in Latin script), which was not what I wanted.
> I checked fmt from coreutils-8.6.
>
> As a workaround, I could iconv the text into a single-byte
> encoding like KOI8-R, but I would limit the character
> set then.
>
> I've never used fmt before personally, so actually I'm not
> sure whether it was a bug or I did something wrong.
>
> Any hints?
We're starting to apply multi-byte support,
so hopefully this will be fixed soon.
$ echo "1 2 æ 4 5 6" | fmt -w6
1 2
æ 4
5 6
That is with the official fedora
version of `fmt`
cheers,
Pádraig