bug-gnu-utils
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Gawk and non-ASCII characters


From: Eli Zaretskii
Subject: Re: Gawk and non-ASCII characters
Date: Sat, 16 Oct 2010 16:30:47 +0200

> Date: Sat, 16 Oct 2010 09:58:37 -0400
> From: Charles Kozierok <address@hidden>
> Cc: address@hidden
> 
> It says the codepage is "437".

What Windows version is that?

> I'm just not sure how to make the program properly deal with these
> characters. The only way I can even look for them is to generate a
> non-ASCII character using "sprintf("%c", 194)". Is that pretty much
> it, or is there a more elegant way of handling such characters? I'm
> trying to look at this in a bigger picture sense as well.

Processing individual characters in Awk is very painful.  Why do you
doing it in Gawk?  I think `tr' (from Coreutils) is much better.  This
works for me on Windows:

   tr "\200-\377" "X" < file.dat

This is just a simple replacement by an upper-case X, but you can
easily tweak that for your needs.

A Windows port of Coreutils is available from the GnuWin32 project's
site.



reply via email to

[Prev in Thread] Current Thread [Next in Thread]