iconv bug in 2.2.3
From: Dana Lacoste
Subject: iconv bug in 2.2.3
Date: Tue, 6 Nov 2001 06:06:46 -0800
This isn't a really serious bug, but I'm not sure how
to get it fixed. I'm using iconv to convert a character
from IBM 850 to 'unicodelittle' (note that it doesn't
appear to be a problem with code page 1250, just 850).

The character is the copyright symbol. According to
the two maps below, the conversion should be
straightforward:
http://www.unicode.org/charts/PDF/U0080.pdf
http://utopia.knoware.nl/users/eprebel/Communication/CharacterSets/CodePage850.html
When I do it however (methodology below) I get the Registered
Trademark symbol instead of the copyright symbol.
Any suggestions? I don't want to have to patch the character
mapping if I can get away with it :)
Dana Lacoste Linux Developer
Peregrine Systems Ottawa, Canada
Methodology:

1 - Create a test file:
    cd /tmp
    vi test1
    © - copyright symbol
    ® - registered trademark

2 - Run the iconv program to convert the symbols:
    iconv -f ibm850 -t unicodelittle < test1 > output1

3 - Look at the results:
    cat output1
    ® - copyright symbol
    « - registered trademark

3a- Insert Shocked Response:
    WHAT?

4 - Try again with a different/compatible code page to see what happens:
    iconv -f cp1250 -t unicodelittle < test1 > output2
    (IF I remember correctly [and web searches seem to back me up] CP1250
    and IBM850 are the same)

5 - Look at the results:
    cat output2
    © - copyright symbol
    ® - registered trademark

6 - Draw conclusions
It looks like there's something wrong with IBM850.
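
The results in steps 3 and 5 can be checked byte by byte. The following is a minimal Python sketch, not part of the original report. It assumes the test file was saved by vi as ISO-8859-1 (the post does not say which encoding the editor used), so that the copyright symbol is byte 0xA9 and the registered trademark symbol is byte 0xAE. Python's built-in cp850 and cp1250 codecs stand in for iconv's ibm850 and cp1250 tables, and the helper name decode_as is hypothetical.

```python
# Assumption: test1 was written as ISO-8859-1, so the copyright sign
# (U+00A9) and registered sign (U+00AE) are the single bytes 0xA9, 0xAE.
RAW = "\u00a9\u00ae".encode("latin-1")  # b'\xa9\xae'


def decode_as(raw: bytes, codepage: str) -> str:
    """Interpret raw bytes using the given code page (hypothetical helper)."""
    return raw.decode(codepage)


if __name__ == "__main__":
    # Show how the same two bytes read under each code page.
    for codepage in ("cp850", "cp1250"):
        print(codepage, decode_as(RAW, codepage))
    # The byte that code page 850 itself assigns to the copyright sign.
    print("copyright sign in cp850: 0x" + "\u00a9".encode("cp850").hex())
```

Under that assumption, reading the bytes as code page 850 gives ® and «, which matches output1, while code page 1250 gives © and ®, which matches output2. Code page 850 places © at byte 0xB8, so the two code pages differ for these characters, and the result may reflect the encoding of the test file rather than a fault in the IBM850 table.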