[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [bug-gawk] feature request: iconv/recode dynamic extension

From: arnold
Subject: Re: [bug-gawk] feature request: iconv/recode dynamic extension
Date: Sat, 22 Dec 2018 12:11:59 -0700
User-agent: Heirloom mailx 12.5 7/5/10


You've already had some good responses; it looks like I don't need
to really do anything. An extension to provide access to iconv would
be useful, but I don't have the cycles for that (unless you want to
discuss my consulting rates).

For searching your data, instead of looping, you may want to rework
things to take advantage of gawk's associative arrays and the ability
to see if some value exists as a subscript in an array.  I strongly
recommend working your way through the gawk manual, particularly Part I
thereof, to learn more.

Best of luck,


Franta Hanzl??k <address@hidden> wrote:

> Hello,
> not sure when it is good idea, but I think this may be usefull for
> others also: I'm just doing some word processing in gawk, and it's
> part is two string comparison. These strings are plaintext ASCII
> strings obtained by removing diacritics from the original Latin-1
> and Latin-2 strings - thus I need conversion as
>  "??????????????????????????" -> "aaeeooscyiuuu".
> For now I solve this by calling external conversion program - as
> iconv -f UTF-8 -t US-ASCII//TRANSLIT <<< "??????????????????????????????"
>    or
> recode -f u8..flat <<< "??????????????????????????????"
> but for thousands strings it is too slow (and resource expensive).
> There is perhaps lot of similar text conversions cases, where gawk
> dynamic extension for this needs wil be very useful.
> Eventually, when this idea isn't totally bad, I can try to program
> it, but I have no programming skills - thus can You please give me
> some advice on how to do this?
> -- 
> Thanks in advance, Franta Hanzlik

reply via email to

[Prev in Thread] Current Thread [Next in Thread]