Re: [help-texinfo] [Q] converting pdf to texinfo
From: joakim
Subject: Re: [help-texinfo] [Q] converting pdf to texinfo
Date: Mon, 07 Aug 2006 20:32:19 +0200
User-agent: Gnus/5.110006 (No Gnus v0.6) Emacs/22.0.51 (gnu/linux)
address@hidden (Karl Berry) writes:
> Hi Joakim,
>
> http://www.ecma-international.org/publications/files/ECMA-ST/Ecma-262.pdf
>
> My basic idea is that maybe pdftotext can be used, and then some
> script that converts it to texinfo.
>
> Yes, although since pdftotext loses the original markup, whatever it is,
> it would take a fair amount of work to restore it all -- it seems you'd
> have to do it by hand, looking at their pdf and inserting the Texinfo
> commands (@code, @xref, etc.) in the new source. I see lots of
> cross-references, special terms (bold, italic, typewriter), etc. All of
> that will require attention.
>
> Perhaps it would be worth writing to them, if you can find a contact
> address, and ask for the original source. Even if it's Microsoft Word
> (which is kind of what it looks like, given the terrible word/letter
> spacing), it would be possible to convert it retaining much of the
> information.
Ok, I will write to them and see what's possible.
Failing that, I will see if it's possible to find some other pdf
parser that supplies more information. Maybe www.pdfbox.com.
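For the pdftotext route, a rough sketch of the conversion script might
look like the following. It is only an illustration under some
assumptions: the text has already been extracted (for example with
pdftotext), section headings appear on their own line as a dotted
number followed by a title, and the names HEADING, LEVELS,
escape_texinfo and text_to_texinfo are made up for this example. It
escapes the characters that are special in Texinfo and maps numbered
headings to sectioning commands; it does not restore @code, @xref or
any other markup, which would still need attention by hand.

```python
import re

# Assumption: headings in the extracted text look like
# "15.4.4.2 Array.prototype.toString ( )", i.e. a dotted section
# number, whitespace, then the title, alone on one line.
HEADING = re.compile(r"^(\d+(?:\.\d+)*)\s+(\S.*)$")

# Texinfo sectioning commands by nesting depth; anything deeper than
# the last entry reuses @subsubsection.
LEVELS = ["@chapter", "@section", "@subsection", "@subsubsection"]


def escape_texinfo(text):
    """Escape the characters that are special in Texinfo: @, { and }."""
    return re.sub(r"([@{}])", r"@\1", text)


def text_to_texinfo(text):
    """Turn plain extracted text into a (very rough) Texinfo body."""
    out = []
    for line in text.splitlines():
        line = line.strip()
        match = HEADING.match(line)
        if match:
            # Depth is the number of dots in the section number.
            depth = match.group(1).count(".")
            command = LEVELS[min(depth, len(LEVELS) - 1)]
            out.append("")
            out.append(f"{command} {escape_texinfo(match.group(2))}")
            out.append("")
        else:
            out.append(escape_texinfo(line))
    return "\n".join(out).strip() + "\n"
```

Any ordinary line that happens to start with a number followed by a
space would be mistaken for a heading, so the output would need
checking against the pdf.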
Thanks for the help!
> The main point of the exercise would be to use "C-h S" to jump to
> documentation for javascript intrinsics.
>
> A nice goal for JavaScript hackers.
>
> The Ecma site seems to say that the specification is public domain
> here: http://www.ecma-international.org/publications/index.html (free
> of charge and copyright)
>
> I agree. Seems ok to me.
>
> Hope this helps,
> Karl
--
Joakim Verona
http://www.verona.se