emacs-orgmode
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Orgmode] Searching inside of attachments (pdf, odt)?


From: Samuel Wales
Subject: Re: [Orgmode] Searching inside of attachments (pdf, odt)?
Date: Mon, 12 Oct 2009 15:59:30 -0700

Hi Karl,

I have been thinking about this recently also, but in a
different direction.  I agree that searching inside
attachments is important.

On Mon, Oct 12, 2009 at 06:40, Karl Maihofer <address@hidden> wrote:
> does anyone use something like Lucene[*] with orgmode to search inside
> attachments like pdf- and odt-files? At the moment I use org for

My idea is to use ordinary agenda search like this:

  1) agenda search displays the headline that has the
     attachment.
  2) org uses an alist to determine the correct textifier
     according to extension.  e.g. '((".pdf" . "pdf2text")).
  3) agenda searches normally (as if the contents of the
     attachment were body text).

Note that we are searching only attachments that the agenda
would search.  Thus, "<" in the agenda will work
properly.[1]

Also, note that archived tasks always still point to
attachments.  With the above solution, if you search agenda
files, the results won't be polluted with archived
attachments.  If you use an external solution, you would
have to find a way to exclude the archived attachments.

IR software could still be integrated.  At the very least,
you might choose Lucene as a back-end textifier for all
extensions.

It's not as fancy as integrating IR with all of the IR
features, but it might be a simple solution.


Samuel


[1] This raises another, much more general idea.  Is
there a feature to restrict agenda commands (including
search) to the currently displayed (or even marked) agenda
results?  i.e. you run an agenda search, filter however you
like, then search within the results (or run any custom
agenda command) within those results.  It
would allow fast switching among multiple user-defined
sorting strategies (kind of like filtering with "/"),
which is something I've wanted.  But I just thought of it now, and
don't know if it's a good idea.




reply via email to

[Prev in Thread] Current Thread [Next in Thread]