Armenian-Surname-Normalizer

VladTepes33/Armenian-Surname-Normalizer

Phonetic normalization of Armenian surnames (Latin script) in PHP, with trigram indexing and proximity scoring for fuzzy search over diaspora archives.


Summary

A PHP library for phonetically normalizing Armenian surnames written in the Latin alphabet. It maps the various historical transliterations (French, German, Russian, etc.) to a single canonical form, so that spelling variants of the same name match during fuzzy search across diaspora archives. Normalization runs as a 10-step pipeline that handles digraphs, prefixes, suffixes, and Armenian-specific phonetic rules.
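To make the idea concrete, here is a minimal sketch of how normalization followed by trigram comparison might work. It is not the library's code: the function names (`normalizeSurname`, `trigrams`, `similarity`), the rule table, and the three steps shown are all assumptions chosen for illustration, and Jaccard overlap stands in for whatever proximity score the library actually uses.

```php
<?php
declare(strict_types=1);

// Hypothetical rule table (an assumption, not the library's real rules):
// a few digraph rewrites covering French-, German-, and Russian-style spellings.
const DIGRAPH_RULES = [
    'tsch' => 'c',
    'tch'  => 'c',
    'sch'  => 's',
    'ch'   => 'c',
    'sh'   => 's',
    'kh'   => 'x',
    'dj'   => 'j',
    'ou'   => 'u',
];

// Illustrative 3-step normalization; the real pipeline is described as having 10 steps.
function normalizeSurname(string $name): string
{
    // Step 1: lowercase and keep ASCII letters only (drops hyphens, spaces, accents).
    $s = preg_replace('/[^a-z]/', '', strtolower(trim($name))) ?? '';
    // Step 2: unify common patronymic suffix spellings to "yan".
    $s = preg_replace('/(ian|yan|ean)$/', 'yan', $s) ?? $s;
    // Step 3: collapse digraphs; strtr() with an array tries the longest key first.
    return strtr($s, DIGRAPH_RULES);
}

// Distinct character trigrams of a string, padded so word boundaries count.
function trigrams(string $s): array
{
    $padded = '  ' . $s . ' ';
    $set = [];
    for ($i = 0; $i <= strlen($padded) - 3; $i++) {
        $set[substr($padded, $i, 3)] = true;
    }
    return array_keys($set);
}

// Jaccard overlap of the trigram sets of two normalized surnames (0.0 to 1.0).
function similarity(string $a, string $b): float
{
    $ta = trigrams(normalizeSurname($a));
    $tb = trigrams(normalizeSurname($b));
    $shared = count(array_intersect($ta, $tb));
    $union = count($ta) + count($tb) - $shared;
    return $union === 0 ? 0.0 : $shared / $union;
}

// Two transliterations of the same surname, and one unrelated surname.
var_dump(normalizeSurname('Khatchadourian') === normalizeSurname('Khachaduryan'));
var_dump(similarity('Khatchadourian', 'Petrosyan'));
```

Under these assumed rules, both spellings of the first surname reduce to the same canonical string, while the unrelated surname shares only the suffix trigrams and scores low. In a real index the trigrams of each canonical form would be stored once, so that a query only needs to be normalized and compared against stored sets.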
