You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
# Note: Set language with '-l' to 3-character ISO 639-2 language codes.
# Note: Set 'tessedit_pageseg_mode' to 1 (Automatic page segmentation with Orientation and script detection [OSD]) via CLI argument '-psm' instead of config variable.
# Output in hOCR format including layout information (in form of HTML) instead of just text file.
tessedit_create_hocr 1
# Don't recognize common latin ligatures to improve searchability (hoping this option will be renamed to `tessedit_char_denylist`).
tessedit_char_blacklist fffiflffiffl
# Write .pdf output file (only available from tesseract 3.03+)