What is OCR (Text Recognition)?
OCR stands for Optical Character Recognition. This technology converts text in images into editable, searchable text. Instead of reading letters like a human, the software recognizes patterns and shapes, matching them to known characters.
Formidex uses Tesseract.js, one of the most powerful open-source OCR engines. Text recognition runs entirely in your browser. Language data is downloaded on first use and stored locally.
How does text recognition work?
The OCR engine first breaks the image into individual lines and words. Then it analyzes each character based on learned patterns. By incorporating dictionaries and language models, even hard-to-read characters are correctly identified.
Recognition quality depends heavily on the input image. Printed text on white background is recognized very reliably. Handwriting, distorted text, or low-contrast images are significantly more challenging.
Tips for best results
Ensure good resolution. Text should be at least 12 pixels tall for reliable recognition. Images at 300 DPI or higher yield the best results.
Select the correct language. The OCR engine uses language-specific dictionaries that significantly improve recognition. For mixed texts, you can activate multiple languages simultaneously.
Make sure text is straight. Heavily rotated or distorted text is recognized poorly. If needed, straighten the image first using the rotate tool.
Common use cases
Digitize documents: photograph invoices, contracts, letters, or notes and extract the text. This saves you from retyping and lets you process the text directly.
Extract from screenshots: copy text from screenshots, error messages, or app interfaces without typing everything manually.
Capture business cards: automatically read contact details from business cards instead of typing them in manually. Especially handy after conferences and trade shows.