Armenian-Surname-Normalizer
VladTepes33/Armenian-Surname-Normalizer
Phonetic normalization of Armenian surnames (Latin script) in PHP — trigram indexing + proximity scoring for fuzzy search over diaspora archives.
Summary
A PHP library for phonetically normalizing Armenian surnames written in the Latin alphabet. It transforms various historical transliterations (French, German, Russian, etc.) into a single canonical form to enable fuzzy search across diaspora archives. The process involves a 10-step pipeline handling digraphs, prefixes, suffixes, and specific Armenian phonetic rules.