Armenian-Surname-Normalizer

VladTepes33/Armenian-Surname-Normalizer

Phonetic normalization of Armenian surnames (Latin script) in PHP, with trigram indexing and proximity scoring for fuzzy search over diaspora archives.

PHP Stars: 0 Forks: 0 Language/NLP

Summary

A PHP library for phonetic normalization of Armenian surnames transcribed in Latin script, designed to handle the inconsistent transliterations found in diaspora archives. It applies a 10-step normalization process (lowercasing, accent removal, nasal and phonetic digraph conversion, prefix and suffix stripping, etc.) to reduce each surname to a canonical form, so that variant spellings of the same name can be matched by fuzzy search across historical records.
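To make the idea concrete, here is a minimal sketch of how a normalization pass and a trigram comparison might fit together. It is not the library's code: the function names (`normalizeSurname`, `trigrams`, `trigramSimilarity`), the specific rules (the accent map, the `Der`/`Ter` prefix, the `tch`/`dj`/`ou` digraphs, the `-yan`/`-ean` to `-ian` suffix rule), and the use of Jaccard overlap as the proximity score are all assumptions chosen for illustration, and they cover only a few of the ten steps.

```php
<?php
declare(strict_types=1);

// Hypothetical sketch: every rule below is an illustrative assumption,
// not the library's actual 10-step rule set.
function normalizeSurname(string $name): string
{
    // Accent removal via a small explicit map, then ASCII lowercasing.
    $accents = ['é' => 'e', 'è' => 'e', 'É' => 'e', 'ö' => 'o', 'ü' => 'u', 'ç' => 'c'];
    $s = strtolower(strtr(trim($name), $accents));

    // Prefix stripping (assumed): drop a leading "Der"/"Ter" particle.
    $s = preg_replace('/^(der|ter)[\s-]+/', '', $s) ?? $s;

    // Keep ASCII letters only (drops spaces, hyphens, leftover bytes).
    $s = preg_replace('/[^a-z]/', '', $s) ?? $s;

    // Digraph conversion (assumed): collapse common transliteration variants.
    $s = strtr($s, ['tch' => 'ch', 'dj' => 'j', 'ou' => 'u']);

    // Suffix unification (assumed): -yan / -ean / -ian all become -ian.
    return preg_replace('/(ian|yan|ean)$/', 'ian', $s) ?? $s;
}

// Unique character trigrams of an already-normalized string.
function trigrams(string $s): array
{
    $set = [];
    for ($i = 0, $n = strlen($s) - 2; $i < $n; $i++) {
        $set[substr($s, $i, 3)] = true;
    }
    return array_keys($set);
}

// Jaccard overlap of the trigram sets, used here as a stand-in proximity score.
function trigramSimilarity(string $a, string $b): float
{
    $ta = trigrams(normalizeSurname($a));
    $tb = trigrams(normalizeSurname($b));
    $shared = count(array_intersect($ta, $tb));
    $union = count($ta) + count($tb) - $shared;
    return $union === 0 ? 0.0 : $shared / $union;
}
```

Under these assumed rules, `Hovhannisyan` and `Hovhannisian` normalize to the same string, while `Khatchadourian` and `Khachaturyan` still differ after normalization (`d` versus `t`) and are only brought together by the trigram score. That split is the reason for pairing a canonical form with a fuzzy comparison rather than relying on either alone.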
