[bug #55107] PDFPIC: .psbb: support extraction of MediaBox from pdf files

From: Keith Marshall
Subject: [bug #55107] PDFPIC: .psbb: support extraction of MediaBox from pdf files
Date: Tue, 12 Oct 2021 14:39:22 -0400 (EDT)
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:93.0) Gecko/20100101 Firefox/93.0

Follow-up Comment #1, bug #55107 (project groff):

In this mailing-list message
<https://lists.nongnu.org/archive/html/groff/2021-09/msg00064.html>[1], Deri
<https://savannah.gnu.org/users/deri> offered two PDF files, namely
<https://lists.nongnu.org/archive/html/groff/2021-09/pdf7tyGN4NLTE.pdf>[2] and
<https://lists.nongnu.org/archive/html/groff/2021-09/pdfBjudbNbwI2.pdf>[3],
from which the original prototype code, as referenced on this ticket, is
unable to extract any valid MediaBox.

In this follow-up message
<https://lists.nongnu.org/archive/html/groff/2021-10/msg00043.html>[5], I
explained that the failure to extract the MediaBox from Picture.pdf was caused
by an omission from the groff-psbb lexer's pattern matching rules for the PDF
dictionary scanning state, resulting in mishandling of nested dictionaries;
this is readily resolved by the attached patch (file #52093)[6].
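
To illustrate why nested dictionaries matter here, the following is a minimal
Python sketch; it is not the groff-psbb lexer, nor the attached patch. The
names TOKEN and find_mediabox are hypothetical, and the scanner deliberately
ignores strings, comments, streams and nested arrays. It counts "<<" and ">>"
so that a ">>" closing an inner dictionary does not end the scan of the
enclosing page dictionary before /MediaBox has been seen.

```python
import re

# Hypothetical, simplified token set: dictionary delimiters, PDF names,
# and flat arrays.  Numbers and indirect references are simply skipped.
TOKEN = re.compile(rb"<<|>>|/[^\s/<>\[\]()]+|\[[^\]]*\]")

def find_mediabox(obj_bytes):
    """Return the /MediaBox of the outermost dictionary, or None."""
    depth = 0             # current dictionary nesting level
    expect_box = False    # True right after /MediaBox at depth 1
    for match in TOKEN.finditer(obj_bytes):
        tok = match.group()
        if tok == b"<<":
            depth += 1
            expect_box = False
        elif tok == b">>":
            depth -= 1
            if depth == 0:
                return None       # outermost dictionary closed, no box found
        elif depth == 1:
            if tok == b"/MediaBox":
                expect_box = True
            elif expect_box and tok.startswith(b"["):
                # Strip the brackets and convert the four numbers.
                return [float(n) for n in tok[1:-1].decode("ascii").split()]
            else:
                expect_box = False
    return None
```

A scanner which instead left its dictionary state at the first ">>" would
stop inside /Resources in an object such as
"<< /Type /Page /Resources << /Font << /F1 5 0 R >> >> /MediaBox [0 0 595 842] >>",
and never reach the /MediaBox entry.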

OTOH, croptest.pdf uses new PDF (post PDF-1.5) features, and has neither a
trailer dictionary nor a free-standing cross-reference table (both of which
are _required_ by the current groff-psbb prototype implementation); to support
these new PDF features, substantial additions to the current implementation
will be required.
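
As a rough way to tell the two file layouts apart, here is a small Python
sketch. It assumes that the feature in question is the cross-reference
stream, which PDF 1.5 introduced as an alternative to the classic "xref"
table and "trailer" dictionary; the message above does not name the feature
explicitly. The function name xref_style is hypothetical, and the check is a
heuristic rather than a PDF parser.

```python
import re

def xref_style(pdf_bytes):
    """Classify the last cross-reference section: 'table', 'stream' or 'unknown'."""
    # The final "startxref" keyword is followed by the byte offset of the
    # most recent cross-reference section.
    last = None
    for last in re.finditer(rb"startxref\s+(\d+)", pdf_bytes):
        pass
    if last is None:
        return "unknown"
    offset = int(last.group(1))
    section = pdf_bytes[offset:offset + 1024]
    if section.startswith(b"xref"):
        # Classic layout: an xref table, followed by a trailer dictionary.
        return "table"
    if re.match(rb"\d+\s+\d+\s+obj", section) and b"/XRef" in section:
        # The offset points at an indirect object which is itself a
        # cross-reference stream; there is no "trailer" keyword.
        return "stream"
    return "unknown"
```

A file reported as "stream" by this check keeps the entries which would
otherwise appear in the trailer dictionary within the stream's own
dictionary, so a reader which searches for the "trailer" keyword finds
nothing.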

[1]: https://lists.nongnu.org/archive/html/groff/2021-09/msg00064.html
[2]: https://lists.nongnu.org/archive/html/groff/2021-09/pdf7tyGN4NLTE.pdf
[3]: https://lists.nongnu.org/archive/html/groff/2021-09/pdfBjudbNbwI2.pdf
[5]: https://lists.nongnu.org/archive/html/groff/2021-10/msg00043.html
[6]: file #52093 (attached patch)

