[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Encoding for a file containing filenames?

From: Jan Djärv
Subject: Re: Encoding for a file containing filenames?
Date: Fri, 09 Nov 2007 12:05:35 +0100
User-agent: Thunderbird (X11/20070728)

Eli Zaretskii skrev:
From: Stefan Monnier <address@hidden>
Cc: Eli Zaretskii <address@hidden>,  Juanma Barranquero <address@hidden>,  
Date: Thu, 08 Nov 2007 23:01:31 -0500

It's actually a bit worse: it shouldn't just be encodable with utf-8,
but it should also be the case that encoding to utf-8 and back should
return the exact same string (since these are filenames and will be
compared with simple byte-comparison in the kernel).

What kernel are we talking about here?  The Windows filesystem, for
example, does not compare bytes, but rather 16-bit words (UTF-16).
And Linux filesystems use UTF-8 for file names anyway, right?

Linux filesystems (and others) don't interpret the file names. They are just sequences of bytes. It is the user space tools like ls, emacs and others that put meaning to these bytes.

        Jan D.

reply via email to

[Prev in Thread] Current Thread [Next in Thread]