[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#3526: 23.0.94; `list-character-sets' display

From: Kenichi Handa
Subject: bug#3526: 23.0.94; `list-character-sets' display
Date: Wed, 17 Jun 2009 15:42:57 +0900

In article <address@hidden>, "Drew Adams" <address@hidden> writes:
> > I updated the Info node (emacs)Charsets for the new way of
> > charsets handling and added @findex for list-character-sets
> > (in doc/emacs/mule.texi), and modified list-character-sets-1
> > to do above (in lisp/international/mule-diag.el).  Please
> > check the English wording of the new text and fix/improve
> > it.

> Thanks for the fix.

> If you can send me the new text or point me to a URL that has it, I'll take a
> look at the English. If not, someone else can perhaps help with this. Thx.

This is the one-line help in *Character Set List* buffer,
Character sets for defining another charset or obsolete now

and this is the "Charsets" node of mule.texi.

@node Charsets
@section Charsets
@cindex charsets

  Emacs defines most of popular character sets (e.g. ascii,
iso-8859-1, cp1250, big5, unicode) as @dfn{charsets} and a few of its
own charsets (e.g. emacs, unicode-bmp, eight-bit).  All supported
characters belong to one or more charsets.  Usually you don't have to
take care of ``charset'', but knowing about it may help understanding
the behavior of Emacs in some cases.

  One example is a font selection.  In each language environment,
charsets have different priorities.  Emacs, at first, tries to use a
font that matches with charsets of higher priority.  For instance, in
Japanese language environment, the charset @code{japanese-jisx0208}
has the highest priority (@xref{describe-language-environment}).  So,
Emacs tries to use a font whose @code{registry} property is
``JISX0208.1983-0'' for characters belonging to that charset.

  Another example is a use of @code{charset} text property.  When
Emacs reads a file encoded in a coding systems that uses escape
sequences to switch charsets (e.g. iso-2022-int-1), the buffer text
keep the information of the original charset by @code{charset} text
property.  By using this information, Emacs can write the file with
the same byte sequence as the original.

@findex list-charset-chars
@cindex characters in a certain charset
@findex describe-character-set
  There are two commands for obtaining information about Emacs
charsets.  The command @kbd{M-x list-charset-chars} prompts for a
charset name, and displays all the characters in that character set.
The command @kbd{M-x describe-character-set} prompts for a charset
name and displays information about that charset, including its
internal representation within Emacs.

@findex list-character-sets
  To display a list of all the supported charsets, type @kbd{M-x
list-character-sets}.  The list gives the names of charsets and
additional information to identity each charset (see ISO/IEC's this
page <http://www.itscj.ipsj.or.jp/ISO-IR/> for the detail).  In the
list, charsets are categorized into two; the normal charsets are
listed first, and the supplementary charsets are listed last.  A
charset in the latter category is used for defining another charset
(as a parent or a subset), or was used only in Emacs of the older

  To find out which charset a character in the buffer belongs to,
put point before it and type @kbd{C-u C-x =}.

Kenichi Handa

reply via email to

[Prev in Thread] Current Thread [Next in Thread]