14 posts tagged with "OCR"

OCR Quality Improved

Minimum technical requirements

Java version: 11
webPDF version: 8 (revision 2159)

Text recognition

Revision 2159 improves OCR text recognition in webPDF: documents can now be optimized before recognition, which improves the quality of the results.

Minimum technical requirements

Java version: 7
webPDF version: 7
wsclient version: 1

Using OCR Text Recognition with the wsclient Library

How can webPDF webservices be used in practice with the wsclient library? This article shows a concrete coding example focused on the OCR webservice.

How-to: Using the OCR webservice of webPDF 7

September 11, 2018

Minimum technical requirements

Java version: 7
webPDF version: 7
wsclient version: 1

This example explains how to use the OCR webservice of webPDF. OCR in webPDF is based on Tesseract. By default, German, English, French, Spanish, and Italian are supported. Additional languages can be installed in the Tesseract folder (see the webPDF manual for details).
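Before sending a document to the OCR webservice, it can help to check whether the requested language is one of the five defaults. The following is a minimal sketch under stated assumptions, not part of webPDF or the wsclient library: the class `OcrLanguageCheck` and its method are hypothetical, and the three-letter codes are the usual Tesseract language codes, which your installation may or may not expose in this form.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper for illustration only; not a webPDF or wsclient class.
final class OcrLanguageCheck {

    // Assumption: Tesseract-style codes for German, English, French,
    // Spanish, and Italian, the default languages named in the text.
    private static final Set<String> DEFAULT_LANGUAGES =
            Collections.unmodifiableSet(new HashSet<>(
                    Arrays.asList("deu", "eng", "fra", "spa", "ita")));

    private OcrLanguageCheck() {
        // Static utility class, no instances.
    }

    // Returns true if the code names one of the default languages.
    // Input is trimmed and lower-cased first; null yields false.
    static boolean isDefaultLanguage(String code) {
        if (code == null) {
            return false;
        }
        String normalized = code.trim().toLowerCase(Locale.ROOT);
        return DEFAULT_LANGUAGES.contains(normalized);
    }
}
```

A caller could use `OcrLanguageCheck.isDefaultLanguage("deu")` to decide whether an additional language pack has to be installed in the Tesseract folder first. The check only covers the defaults; languages added later would need to be added to the set.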

Languages that use a multibyte character set, such as Arabic and several Far Eastern languages, are currently not supported. OCR is mainly useful for documents whose text is visible on the page but not embedded as searchable text. To extract text that is already embedded in a PDF document, webPDF provides an option in the Toolbox webservice.
