List: web4lib
Subject: Re: [WEB4LIB] Extracting words out of a .pdf
From: Judy Daniluk <jdaniluk777 () GMAIL ! COM>
Date: 2023-05-28 22:42:10
Message-ID: CANqOSb1gRB0u_TEMicSfC0QmnbRqCtpCLmqe9S_ge+8P9uYPFg () mail ! gmail ! com
It depends on how the PDF was created. Some scanning software does OCR,
which recognizes the text and makes a PDF containing text that you can
select and copy. Other scanning software just creates images where the
text is not recognizable.
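
If you want a quick way to tell which kind of PDF you have, here is a rough
sketch in Python. It is only a heuristic and rests on an assumption: text you
can select is drawn with fonts, so a file that never mentions /Font is
probably image-only. The function names (looks_searchable, check_file) are
made up for illustration, and the check can be fooled, for example by newer
PDFs that keep their objects in compressed streams.

```python
from pathlib import Path


def looks_searchable(pdf_bytes: bytes) -> bool:
    """Rough guess at whether a PDF has a text layer.

    Assumption: selectable text is drawn with fonts, so a PDF whose raw
    bytes never mention /Font is probably a stack of scanned images.
    This can give a wrong "no" for PDFs that store their objects in
    compressed streams, so treat the answer as a hint, not a verdict.
    """
    return b"/Font" in pdf_bytes


def check_file(path: str) -> bool:
    """Apply the same heuristic to a file on disk (hypothetical helper)."""
    return looks_searchable(Path(path).read_bytes())
```

If it reports False, the file most likely needs to go through OCR before you
can copy words out of it.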
I have had good luck with the free Adobe Scan mobile app.
Judy Daniluk
jdaniluk777@gmail.com
On Sun, May 28, 2023 at 5:35 PM Robert Sullivan <robert.g.sullivan@gmail.com>
wrote:
> On Sun, May 28, 2023 at 4:13 PM charles meyer <reachmeplace@gmail.com>
> wrote:
> >
> > I've read through Google results, many of which suggest that in Acrobat
> > Reader (fee) I should be able to highlight words in a PDF and copy and
> > paste them into a Word document as plain text.
> >
> > My Acrobat Reader won't do that.
>
> Hi Charles,
>
> I have been able to do this - but I am generally working with newspaper
> PDFs which were created to be searchable. I suspect yours were not, and
> that is your trouble.
>
> I have OmniPage and it will extract the text from PDFs (it's essentially
> scanning them). I'd be happy to help you with this if you'd like to send
> me the files.
>
> > I found a good Web site for learning GIMP skills with screenshots - for
> > the unintimidated - https://thegimptutorials.com/
>
> I always enjoy your questions, wish I had more answers.
>
> --
> Bob Sullivan
> Schenectady County (NY) Public Library
>