[PLUG] Converting PDF Document to Text
Rich Shepard
rshepard at appl-ecosys.com
Thu Jun 4 16:59:05 UTC 2009
Virtually all PDF documents are easily converted to plain text either
using pdftotext, pdftops->ps2ascii, or blocking text displayed in xpdf or
acroread and copying into a text fine. I have one of the rare exceptions
here and wonder what magic I've missed trying.
The file was created by a Xerox copier (scanned paper document and saved
as a .pdf). The pdfinfo tells me the creator was a Xerox WorkCentre 7345,
and copying/printing is allowed. I tried printing to a .ps file, but that
won't translate to plain text, either.
I would prefer to not re-key a list several pages long if there's a way to
translate this document to something I can use.
Any and all suggestions solicited.
Rich
--
Richard B. Shepard, Ph.D. | Integrity Credibility
Applied Ecosystem Services, Inc. | Innovation
<http://www.appl-ecosys.com> Voice: 503-667-4517 Fax: 503-667-8863
More information about the PLUG
mailing list