bug-gawk
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [bug-gawk] gawk stops reading input at SUB character


From: arnold
Subject: Re: [bug-gawk] gawk stops reading input at SUB character
Date: Tue, 12 Sep 2017 12:18:32 -0600
User-agent: Heirloom mailx 12.4 7/29/08

OK, so that explains the issue.

Should binary mode be the default?  There are probably pros and cons ..

Thanks,

Arnold

Eli Zaretskii <address@hidden> wrote:

> > From: Paavo Tamminen <address@hidden>
> > Date: Tue, 12 Sep 2017 17:58:51 +0300
> > 
> > However, I ran into problem as gawk stops reading the input file if there 
> > is a <SUB> character in the file. The
> > character <SUB> is a control character 'substitute', x1A in hex.
> > 
> > input file (test.txt:) has three lines with <SUB> at line two:
> > line 1 aA
> > line 2 b<SUB>B
> > line 3 cC
> > 
> > On windows cmd-promt the following shows output only to the up to character 
> > b. So <SUB> seems to be
> > treated as an end of file.
> > 
> > gawk.exe "{print $0}" test.txt
> > line 1 aA
> > line 2 b
> > 
> > gawk.exe --version
> > GNU Awk 4.1.4, API: 1.1 (GNU MPFR 3.1.0-p8, GNU MP 5.0.2)
> > 
> > My gawk (gawk-4.1.4-w32-bin.zip) is loaded from 
> > https://sourceforge.net/projects/ezwinports/
>
> This is expected: the default text-mode reads stop at the first ^Z
> character.  This is a Windows "feature" You should use binary-mode
> reads instead:
>
>   gawk -v BINMODE=1 "{print $0}" test.txt



reply via email to

[Prev in Thread] Current Thread [Next in Thread]