Re: Encoding of LilyPond console output

lilypond-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Encoding of LilyPond console output

From:	David Kastrup
Subject:	Re: Encoding of LilyPond console output
Date:	Mon, 02 Jan 2012 20:35:39 +0100
User-agent:	Gnus/5.13 (Gnus v5.13) Emacs/24.0.92 (gnu/linux)

David Kastrup <address@hidden> writes:

> I am indeed top-posting.

<URL:http://code.google.com/p/lilypond/issues/detail?id=2173>

> Wilbert Berendsen <address@hidden> writes:
>
>> Everything boils down to the question in which encoding the 8bit
>> console output of LilyPond is presented. I was always assuming that
>> this was UTF-8, which worked correctly on Linux. I.e. both
>> filenameswith accented letters and translated messages (such as
>> French, with many accented letters) always showed up correctly in the
>> LilyPond console output. Which is read by Frescobaldi as an 8bit
>> bytestream and decoded into unicode strings using the UTF-8 encoding.
>>
>> Then, on Windows, I discovered that filenames do not use UTF-8
>> encoding, but rather 'mbcs' or something like that (what is returned
>> by sys.getfilesystemencoding() in Python).
>>
>> So I changed Frescobaldi to use that encoding when reading LilyPond
>> console output, but then we discovered that translated messages (such
>> as the French ones) with accented characters do show in a garbled
>> encoding (clearly showing something like UTF-8 displayed as Latin1).
>>
>> So again I changed Frescobaldi, and now it reads the console output
>> byte stream and parses that for file references (such as:
>> file.ly:12:3: error: blabla) and decodes those filenames using the
>> filesystem encoding, and the rest using UTF-8.
>>
>> This seems to work well: file references are correctly parsed and
>> messages are readable still.
>
> It will likely cease working at some point of time.  Lilypond is
> supposed to be UTF-8.
>
>> So everything thrown together: is my analysis of the mixed output
>> encodings LilyPond uses on stdout and stderr correct?
>
> Probably.
>
>> And in line with this: can LilyPond be made more aware of this, and
>> use the same encoding for all output (correctly encoding filenames)?
>
> It will have to at one point of time.  I am currently preparing a patch
> where it will reject all non-UTF-8 input, and of course this includes
> filenames in \include and similar statements.  I have no idea how to
> deal with the command line on Windows.

-- 
David Kastrup

[Prev in Thread]

Current Thread

[Next in Thread]

Re: Encoding of LilyPond console output, David Kastrup <=

Prev by Date: Doc: Learning: Use voices in the intended order. (issue 5507050)
Next by Date: Re: PATCH: Countdown to 20120103
Previous by thread: Doc: Learning: Use voices in the intended order. (issue 5507050)
Next by thread: critical issues
Index(es):
- Date
- Thread