Re: Academic workflow with old PDFs

From: Eduardo Ochs
Subject: Re: Academic workflow with old PDFs
Date: Sat, 20 Aug 2022 18:32:09 -0300

On Sat, 20 Aug 2022 at 18:03, Alessandro Bertulli wrote:
> Here my bad, I should have asked a narrower question. Looking back, I'd
> say you're right, my first question was if it was possible (in the
> community's opinion) to study old, scanned, poorly indexed PDFs with
> pdf-tools, and if not, what other tools do you use. I should have been
> more focused, I apologize.

Hi Alessandro,

my favorite tool for indexing PDFs, which I use even for PDFs that
contain only photos of whiteboards and are totally un-OCR-able, is
the eev module that is explained in this tutorial,

and in the video whose index is here:

Look for the lines in the index that look like these ones,

  (find-eev2020video "4:52" "`find-pdf-page' calls an external program")
  (find-eev2020video "5:26" "`find-pdf-text' converts the PDF to text and")
  (find-eev2020video "10:45" "`code-pdf-page' creates a short hyperlink function for a PDF")
  (find-eev2020video "11:38" "let's try...")
  (find-eev2020video "11:55" "`find-fongspivatext'")
  (find-eev2020video "12:25" "This block is a kind of an index for that book")
  (find-eev2020video "12:54" "This block is a kind of an index for that video")

and click on the links with the timemarks...
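
For a rough idea of what those entries refer to, here is a minimal
sketch of such an index block. It assumes that eev is loaded; the file
path is a placeholder, and the short name "fongspivak", the page
numbers and the search string are illustrative assumptions, not
necessarily the ones used in the video:

  ;; Placeholder path: change it to a PDF on your own disk.
  (code-pdf-page "fongspivak" "~/books/fongspivak.pdf")
  (code-pdf-text "fongspivak" "~/books/fongspivak.pdf")
  ;; The two sexps above define the short hyperlink functions
  ;; `find-fongspivakpage' and `find-fongspivaktext'.
  ;; The (+ 8 1) is just page arithmetic done by hand: an assumed
  ;; offset of 8 between PDF pages and printed pages, plus page 1.
  (find-fongspivakpage (+ 8 1) "1 Generative effects")
  (find-fongspivaktext (+ 8 1) "1 Generative effects")

The "page" link opens the PDF at that page with an external viewer,
and the "text" link converts the PDF to text and searches for the
string, so it is only useful when the PDF has a text layer. For a
PDF made of photos only the "page" links are the ones that work.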

If that looks like something that you would like to try then send me
an e-mail and let's see if we can arrange to chat by IRC or by some
other means!

    Eduardo Ochs

