OCR Optical character recognition with webPDF

Blue glowing cables

The webPDF portal and its OCR webservice allow you to run optical character recognition on graphics and scanned files and convert them into editable formats.

Optical character recognition (OCR)

OCR is used whenever optical text recognition must be applied to image-based files and the result should then be stored as PDF, text, or XML. In practical terms: OCR transforms images of text into searchable PDF documents.

This is especially useful when scanned incoming mail must be searchable for specific terms and then automatically integrated and categorized within existing work processes.

OCR (Optical Character Recognition), also called text recognition, is a technology that converts scanned paper documents, PDF files, and digital images into editable and searchable files.

How does OCR work in the webPDF portal?

Select the output format (PDF, text, or XML) and the document language. Then run text recognition.

OCR in webPDF portal

Always set the source language correctly; otherwise special characters may not be recognized accurately.

More information on this topic