Tesseract is an OCR (optical character recognition) engine
Tesseract is an ocr (optical character recognition) engine. It is open-source and available on most Unix variants. It supports many scripts, including Latin, Greek, Cyrillic, Hindi, Tamil, Chinese, Japanese, Korean, Thai, Arabic and Hebrew.