Watch, Follow, &
Connect with Us

For forums, blogs and more please visit our
Developer Tools Community.


Welcome, Guest
Guest Settings
Help

Thread: Need to extract the text in pdf files



Permlink Replies: 4 - Last Post: Jun 21, 2016 11:55 PM Last Post By: Robert Triest Threads: [ Previous | Next ]
Thomas Lee

Posts: 7
Registered: 8/3/01
Need to extract the text in pdf files
Click to report abuse...   Click to reply to this thread Reply
  Posted: Jun 10, 2016 11:16 AM
I am using Delphi XE. I have an app I wrote for a client two years ago and at that time I used an external command line app to extract the text but it doesn't work so well with pdf files they receive from a new client of theirs. What is the best way to accomplish this?

Thanks,
TD
Linden ROTH

Posts: 467
Registered: 11/3/11
Re: Need to extract the text in pdf files
Click to report abuse...   Click to reply to this thread Reply
  Posted: Jun 10, 2016 3:46 PM   in response to: Thomas Lee in response to: Thomas Lee
Thomas Lee wrote:
I am using Delphi XE. I have an app I wrote for a client two years ago and at that time I used an external command line app to extract the text but it doesn't work so well with pdf files they receive from a new client of theirs. What is the best way to accomplish this?

Thanks,
TD

Is the text actually in there ... could be pure graphic !??!??!

OR

PDF security settings
--
Linden
"Mango" was Cool but "Wasabi" was Hotter but remember it's all in the "source"
Thomas Lee

Posts: 7
Registered: 8/3/01
Re: Need to extract the text in pdf files
Click to report abuse...   Click to reply to this thread Reply
  Posted: Jun 13, 2016 7:13 AM   in response to: Linden ROTH in response to: Linden ROTH
Is the text actually in there ... could be pure graphic !??!??!

OR

PDF security settings
--
Linden

Yes the text is there. The problem I think is the txt is in two columns in the pdf.

Thanks for trying to help!
TD
Robert Triest

Posts: 687
Registered: 3/24/05
Re: Need to extract the text in pdf files
Click to report abuse...   Click to reply to this thread Reply
  Posted: Jun 21, 2016 11:55 PM   in response to: Thomas Lee in response to: Thomas Lee
OR pdf is compressed..
Erik Salaj

Posts: 144
Registered: 12/23/11
Re: Need to extract the text in pdf files
Click to report abuse...   Click to reply to this thread Reply
  Posted: Jun 21, 2016 4:22 PM   in response to: Thomas Lee in response to: Thomas Lee
I am using Delphi XE. I have an app I wrote for a client two years
ago and at that time I used an external command line app to extract
the text but it doesn't work so well with pdf files they receive from
a new client of theirs. What is the best way to accomplish this?

Try PDFium Component Suite:

http://winsoft.sk/pdfium.htm

Extact text from PDF demo example:

http://winsoft.sk/download/pdfiumtext.zip

Erik Salaj, WINSOFT
Legend
Helpful Answer (5 pts)
Correct Answer (10 pts)

Server Response from: ETNAJIVE02