I'm is really I'm

From: Jason Rumney
Subject: Re: I'm is really I'm
Date: Wed, 07 Jul 2010 12:19:38 +0800
On 07/07/2010 09:57, Lennart Borgman wrote:
> I hoped there were some easy cases where some characters commonly used
> for typographic reasons could be replaced by more "wellknown"

There are some filters in Gnus to handle this type of problem, but the problem you saw is different. PDF allows fonts to be embedded in the document, and when this happens the mapping from character codes to glyphs gets optimised for that document, so it follows no common standard. I've seen this before when attempting to copy and paste from a Japanese PDF document: there was no way of getting useful information out short of using OCR on a bitmap of the PDF reader's display.
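
To illustrate the difference, here is a minimal sketch of the "easy case" kind of replacement, written in Python purely for illustration. The table and the name `asciify` are my own assumptions, not what Gnus actually uses. It only works when the text really contains the standard Unicode typographic characters; codes produced by an embedded PDF font with its own private mapping pass through unchanged.

```python
# Hypothetical table of common typographic characters and plain ASCII
# stand-ins. This is an illustration, not the table any Gnus filter uses.
TYPOGRAPHIC_TO_ASCII = {
    "\u2018": "'",    # left single quotation mark
    "\u2019": "'",    # right single quotation mark (typographic apostrophe)
    "\u201c": '"',    # left double quotation mark
    "\u201d": '"',    # right double quotation mark
    "\u2013": "-",    # en dash
    "\u2014": "--",   # em dash
    "\u2026": "...",  # horizontal ellipsis
    "\u00a0": " ",    # no-break space
}

# str.maketrans accepts a dict of single-character keys to replacement strings.
_TABLE = str.maketrans(TYPOGRAPHIC_TO_ASCII)


def asciify(text: str) -> str:
    """Replace known typographic characters; leave everything else alone."""
    return text.translate(_TABLE)


if __name__ == "__main__":
    print(asciify("I\u2019m"))
    # A private-use code point, as a custom font mapping might produce,
    # is not in the table and is returned as-is.
    print(repr(asciify("\uf0a7")))
```

A fixed table like this cannot help in the embedded-font case, because there the codes in the document are not the standard ones to begin with.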

