groff
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Groff] Re: Question concerning Czech man pages in UTF-8


From: Mike Fabian
Subject: [Groff] Re: Question concerning Czech man pages in UTF-8
Date: Thu, 12 Sep 2002 22:46:27 +0200
User-agent: Gnus/5.090004 (Oort Gnus v0.04) XEmacs/21.4 (Honest Recruiter, i386-suse-linux)

Werner LEMBERG <address@hidden> writes:

>> Running the following commandline 
>> 
>>     zcat /usr/share/man/cs/man1/man.1.gz | groff -mandoc -Tutf8 - | less
>> 
>> in an UTF-8 locale in an UTF-8 capable terminal results in the display
>> of wrong characters in a Czech man page:
>> 
>>     http://www.suse.de/~mfabian/screenshots/czech-man-page-utf-8.png
>> 
>> But the following commandline 
>> 
>>    zcat /usr/share/man/cs/man1/man.1.gz | (LC_ALL=cs_CZ.ISO-8859-2 groff 
>> -mandoc -Tlatin1 - ) | iconv -f iso-8859-2 -t utf-8 | less
>> 
>> appears to work fine:
>> 
>>     
>> http://www.suse.de/~mfabian/screenshots/czech-man-page-utf-8-with-help-of-iconv.png
>> 
>> Why is it necessary to force groff to run in cs_CZ locale and use
>> -Tlatin1 (or -Tascii8) and convert the result to UTF-8 instead of
>> using -Tutf8 directly?
>
> Currently, groff only supports UTF8 for output, not for input.  This
> will change as soon as groff's conversion to Unicode has been
> implemented.

OK, thank you. As a temporary workaround until groff supports UTF-8
for input, I changed /usr/bin/nroff a little bit.  In case of
ja_JP.UTF-8 locale or cs_CZ.UTF-8 locale I call groff in the
respective non-UTF-8 locale and convert the result to UTF-8 with
iconv.

The /usr/bin/nroff I'm currently using is attached.

Attachment: nroff
Description: Binary data

-- 
Mike Fabian   <address@hidden>   http://www.suse.de/~mfabian
睡眠不足はいい仕事の敵だ。

reply via email to

[Prev in Thread] Current Thread [Next in Thread]