Re: [help-texinfo] [Q] converting pdf to texinfo


From: joakim
Subject: Re: [help-texinfo] [Q] converting pdf to texinfo
Date: Mon, 07 Aug 2006 20:32:19 +0200
User-agent: Gnus/5.110006 (No Gnus v0.6) Emacs/22.0.51 (gnu/linux)

address@hidden (Karl Berry) writes:

> Hi Joakim,
>
>     http://www.ecma-international.org/publications/files/ECMA-ST/Ecma-262.pdf
>
>     My basic idea is that maybe pdftotext can be used, and then some
>     script that converts it to texinfo.
>
> Yes, although since pdftotext loses the original markup, whatever it is,
> it would take a fair amount of work to restore it all -- it seems you'd
> have to do it by hand, looking at their pdf and inserting the Texinfo
> commands (@code, @xref, etc.) in the new source.  I see lots of
> cross-references, special terms (bold, italic, typewriter), etc.  All of
> that will require attention.
>
> Perhaps it would be worth writing to them, if you can find a contact
> address, and ask for the original source.  Even if it's Microsoft Word
> (which is kind of what it looks like, given the terrible word/letter
> spacing), it would be possible to convert it retaining much of the
> information.

OK, I will write to them and see what's possible.

Failing that, I will see if it's possible to find some other PDF
parser that supplies more information. Maybe www.pdfbox.com.
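
For illustration only, here is a rough sketch of what the "pdftotext plus
a script" step could look like. It is not anything that exists in this
thread: the function names and the heading pattern (a line such as
"15.4 Array Objects") are assumptions about what the plain-text output
looks like. It only escapes Texinfo's special characters and turns
numbered headings into sectioning commands with an @findex entry; it
does not generate @node lines or menus, and it cannot recover the
@code/@xref markup Karl mentions, which would still need manual work.

```python
import re

# Assumed heading shape in the pdftotext output: "15.4.4 Some Title".
# Any body line that happens to start with a number will also match,
# so the result needs to be reviewed by hand.
HEADING = re.compile(r"^(\d+(?:\.\d+)*)\s+(\S.*)$")

# Sectioning command chosen by the number of dots in the heading number.
LEVELS = ["@chapter", "@section", "@subsection", "@subsubsection"]


def escape_texinfo(text):
    """Prefix '@', '{' and '}' with '@', as Texinfo requires."""
    return re.sub(r"([@{}])", r"@\1", text)


def text_to_texinfo(text):
    """Convert plain text to a very rough Texinfo body (hypothetical helper)."""
    out = []
    for line in text.splitlines():
        m = HEADING.match(line.strip())
        if m:
            depth = m.group(1).count(".")
            cmd = LEVELS[min(depth, len(LEVELS) - 1)]
            title = escape_texinfo(m.group(2).strip())
            out.append(f"{cmd} {title}")
            # Index entry so the heading can be found through the function index.
            out.append(f"@findex {title}")
        else:
            out.append(escape_texinfo(line))
    return "\n".join(out)
```

The input would be the text produced by running pdftotext on the PDF,
read into a string and passed to text_to_texinfo().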

Thanks for the help!


>     The main point of the exercise would be to use "C-h S" to jump to
>     documentation for JavaScript intrinsics.
>
> A nice goal for JavaScript hackers.
>
>     The Ecma site seems to say that the specification is public domain
>     here: http://www.ecma-international.org/publications/index.html
>     (free of charge and copyright)
>
> I agree.  Seems ok to me.
>
> Hope this helps,
> Karl

-- 
Joakim Verona
http://www.verona.se
