bug-bash
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: RFE: Please allow unicode ID chars in identifiers


From: Chet Ramey
Subject: Re: RFE: Please allow unicode ID chars in identifiers
Date: Tue, 13 Jun 2017 14:13:55 -0400
User-agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.12; rv:52.0) Gecko/20100101 Thunderbird/52.1.1

On 6/2/17 12:52 AM, dualbus wrote:

> - There are some questions that must be answered first:
> 
>   * How do you how to decode multibyte character sequences into Unicode? 
>     Should UTF-8 be assumed?

It has to be the current locale.

>   * Will the parsing of a script depend upon the user locale?

Only in the sense that identifiers will depend on the current locale.

>   * Should this special parsing code be disabled if POSIX mode is
>     enabled?

Yes. Posix requires that variables be names, as defined below.  However,
it should be possible to enable it while in Posix mode as an extension.

>   * Right now `name' or `identifier' is defined as:
> 
>       name: A word consisting only of alphanumeric characters and
>       underscores, and beginning with an alphabetic character or an
>       underscore. Also referred to as an identifier.
> 
>     How will the definition look like with Unicode identifiers?

Add 'from the current locale's character set'.

-- 
``The lyf so short, the craft so long to lerne.'' - Chaucer
                 ``Ars longa, vita brevis'' - Hippocrates
Chet Ramey, UTech, CWRU    chet@case.edu    http://cnswww.cns.cwru.edu/~chet/



reply via email to

[Prev in Thread] Current Thread [Next in Thread]