armenian-ocr-toolkit
Serge-Ordanyan/armenian-ocr-toolkit
Summary
A Python toolkit for performing OCR on Armenian historical documents. It uses pytesseract with multiple preprocessing methods (including morphological repair for broken fonts) and combines Armenian language models (hye+arm) to improve accuracy on scanned, old, or difficult texts. Outputs results from six parallel methods for comparison.