ReRooted-ArmenianCorpus
jhdeov/ReRooted-ArmenianCorpus
ReRooted: Speech corpus of Syrian Armenian refugee testimonials
Summary
ReRooted-ArmenianCorpus is a work-in-progress speech corpus project that processes and cleans transcribed audio testimonials from Syrian Armenian refugees. The repository contains manually cleaned TextGrid alignment files, metadata linking to audio files stored externally, and tools for converting and refining transcripts from the original ReRooted Archive. The goal is to create a linguistically valuable resource for Armenian language NLP and research.