'Re: [poppler] Hi All, I have a question about libpoppler and need your helps,'

[prev in list] [next in list] [prev in thread] [next in thread] 

List:       freedesktop-poppler
Subject:    Re: [poppler] Hi All, I have a question about libpoppler and need your helps,
From:       Brad Hards <bradh () frogmouth ! net>
Date:       2012-02-21 9:09:01
Message-ID: 201202212009.01882.bradh () frogmouth ! net
[Download RAW message or body]

On Tuesday 21 February 2012 12:20:36 Zhenbang Xi wrote:
> *I am developing  a program using libpoppler to convert PDF to plain text.*
> *And I want to distinguish the page header and page footer from a page,in
> other words,I want to output them  separately(including the main content).*
> *How can I do this? Is there any structure or class that hold them in
> memory?*
There is no way to identify this reliably - PDF (and hence poppler) doesn't 
have any feature to interpret the intent of certain characters. It might be 
possible to come up with a good heuristic for some documents, based on page 
location.

Brad
_______________________________________________
poppler mailing list
poppler@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/poppler
[prev in list] [next in list] [prev in thread] [next in thread]