## Re: Encoding help

 From: B. T. Raven
Subject: Re: Encoding help
Date: Wed, 03 Jun 2009 12:35:11 -0500

Thanks for the heads up on these functions, Eli. I did try the C-x RET c utf-8 ploy, but that just repeats my default settings. I see most characters legibly with C-x RET c iso-8859-1, but there are still a few escape sequences sprinkled around. The most common are the pretty quotes that LaTeX substitutes for the ASCII single or double quote. What were vowels with macrons in the PDF are now bare vowels, so they must have been compiled into the PDF as uncomposed sequences (a base letter plus a separate macron) rather than as monolithic precomposed glyphs.
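
For what it's worth, here is a small Python sketch of what might be going on. It rests on an assumption that nothing in this thread confirms: that the text file is really Windows-1252, where bytes 0x91 to 0x94 are the curly quotes. Read as iso-8859-1, those same bytes are C1 control characters, which would show up as escape sequences. The sample bytes are made up for illustration. The second half shows how a macron vowel splits into a base letter plus a combining mark, and how losing the mark leaves a bare vowel.

```python
import unicodedata

# Hypothetical sample bytes, assuming a Windows-1252 encoded file.
raw = b"\x93quoted\x94 and \x91single\x92"

# In ISO-8859-1, bytes 0x80-0x9F map to C1 control characters.
as_latin1 = raw.decode("iso-8859-1")
# In Windows-1252, 0x91-0x94 are the typographic quotes.
as_cp1252 = raw.decode("cp1252")

print(repr(as_latin1))
print(as_cp1252)

# A macron vowel is either one precomposed code point
# or a base letter followed by U+0304 COMBINING MACRON.
precomposed = "\u0101"  # LATIN SMALL LETTER A WITH MACRON
decomposed = unicodedata.normalize("NFD", precomposed)

# Dropping the combining marks leaves only the bare vowel.
stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))

print([unicodedata.name(ch) for ch in decomposed])
print(stripped)
```

If that guess is right, revisiting the file with C-x RET r windows-1252 might show the quotes properly, but I have not verified that against the actual file.
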
> It might be a good idea to submit a bug report with "M-x
> report-emacs-bug RET", then.  Please tell there what program(s) and
> command line options you used to produce the text file from PDF, and
> someone will look into this and see whether Emacs could do any
> better.

Okay. I'll do that if I'm convinced that Emacs is at fault. Now I'm not so sure. Adobe Reader 8.0 offers only "(Accessible) .txt" as an option for Save as Text. In the meantime I made a similar PDF with AUCTeX, and the .txt file produced by Adobe Reader is even more fragmented than the first one. I guess this is not surprising after the original .tex file goes through \usepackage[utf8x]{inputenc} and \usepackage{babel}.
