armenian-ocr-toolkit

Serge-Ordanyan/armenian-ocr-toolkit

Python | Stars: 0 | Forks: 0 | Tools

Summary

A Python toolkit for OCR on historical Armenian documents. It runs pytesseract over several preprocessing methods, including morphological repair for broken fonts, and combines two Armenian language models (hye+arm) to improve accuracy on scanned, old, or otherwise difficult texts. It outputs the results of six parallel methods so they can be compared.
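To make the idea of morphological repair concrete, here is a minimal sketch and not the toolkit's actual code. It applies a 3x3 morphological closing (dilation followed by erosion) to a toy binary bitmap in pure Python, which bridges a one-pixel break in a stroke. The names `dilate`, `erode`, `close_gaps`, `ocr_with_languages`, and `BROKEN_STROKE` are hypothetical, as is the choice of a 3x3 neighbourhood. The OCR helper assumes that Pillow, pytesseract, the Tesseract binary, and both `hye` and `arm` traineddata files are installed.

```python
from typing import List

Grid = List[List[int]]  # 1 = ink, 0 = background


def _neighbours(img: Grid, r: int, c: int) -> List[int]:
    """Values in the 3x3 window around (r, c), clipped to the image bounds."""
    rows, cols = len(img), len(img[0])
    return [
        img[rr][cc]
        for rr in range(max(0, r - 1), min(rows, r + 2))
        for cc in range(max(0, c - 1), min(cols, c + 2))
    ]


def dilate(img: Grid) -> Grid:
    """A pixel becomes ink if any pixel in its 3x3 window is ink."""
    return [[max(_neighbours(img, r, c)) for c in range(len(img[0]))]
            for r in range(len(img))]


def erode(img: Grid) -> Grid:
    """A pixel stays ink only if every pixel in its 3x3 window is ink."""
    return [[min(_neighbours(img, r, c)) for c in range(len(img[0]))]
            for r in range(len(img))]


def close_gaps(img: Grid) -> Grid:
    """Morphological closing: dilation followed by erosion."""
    return erode(dilate(img))


def ocr_with_languages(image_path: str, lang: str = "hye+arm") -> str:
    """Run Tesseract with combined language models (assumed to be installed)."""
    # Imported lazily so the morphology demo runs without the OCR dependencies.
    from PIL import Image
    import pytesseract

    return pytesseract.image_to_string(Image.open(image_path), lang=lang)


# A horizontal stroke with a one-pixel break at column 4.
BROKEN_STROKE: Grid = [
    [0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 1, 1, 0, 1, 1, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0],
]

repaired = close_gaps(BROKEN_STROKE)
print(repaired[2])  # -> [0, 0, 1, 1, 1, 1, 1, 0, 0]
```

On real scans the same operation is usually done with an image library; for example, OpenCV exposes it as `cv2.morphologyEx(image, cv2.MORPH_CLOSE, kernel)`. Running `ocr_with_languages` on each preprocessed variant and keeping all outputs is one way to compare methods side by side.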
