Command-line gems: pdftotext

730 days ago

Say what you like about PDF files, but they’re here to stay. A visit to virtually any government site will prove this. Want a tax form? Want to renew your driving license? You’ll be downloading a PDF file. They’ve become the lingua franca, if you like, of document transfer.

From an open source perspective, PDF has good and bad points. It’s good in that it’s an open standard that’s freely implementable, but bad because the format remains, in practice, under the control of Adobe.

What’s little realized among Linux users is the sheer quantity of PDF tools available at the command line. Need to convert a PDF to an HTML, text or PostScript file? It’s easy when you know how.
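
As a rough illustration of those three conversions, here is a minimal sketch that assumes the poppler-utils package (which ships pdftotext, pdftohtml and pdftops) is installed. The helper names converter_for and convert_pdf are made up for this example and are not part of any package.

```shell
#!/bin/sh
# Hypothetical helper: map a target format to the poppler-utils
# tool that produces it. Assumes poppler-utils is the PDF toolkit in use.
converter_for() {
  case "$1" in
    text) echo pdftotext ;;
    html) echo pdftohtml ;;
    ps)   echo pdftops ;;
    *)    echo "unknown format: $1" >&2; return 1 ;;
  esac
}

# Hypothetical wrapper: convert_pdf FORMAT INPUT.pdf OUTPUT
convert_pdf() {
  tool=$(converter_for "$1") || return 1
  # Bail out with a message if the chosen tool is not on the PATH.
  command -v "$tool" >/dev/null 2>&1 || {
    echo "$tool is not installed" >&2
    return 1
  }
  "$tool" "$2" "$3"
}
```

With the functions loaded, something like `convert_pdf text form.pdf form.txt` should leave the extracted text in form.txt. Each tool also works on its own, for example `pdftotext form.pdf form.txt`.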

