Skip to content

Latest commit

 

History

History
23 lines (12 loc) · 940 Bytes

readme.md

File metadata and controls

23 lines (12 loc) · 940 Bytes

pdftotext-go

OpenSSF Scorecard

Extract texts with their corresponding page numbers from PDF files. Wraps the command line tool pdftotext (poppler-utils).

Usage

  1. poppler-utils (version >=22.05.0) must be installed and available in the path.
  2. go get "github.com/heussd/pdftotext-go"
  3. See tests for code examples.

Why poppler version >=22.05.0

Version 22.05.0 of poppler introduced a new parameter -tsv, which extracts PDF content with meta data as TSV. This functionality is essential for the operation of this library.

Thanks to