ReRooted-ArmenianCorpus

jhdeov/ReRooted-ArmenianCorpus

ReRooted: Speech corpus of Syrian Armenian refugee testimonials

Stars: 3 Forks: 1 License: GPL-3.0 Data

Summary

ReRooted-ArmenianCorpus is a work-in-progress speech corpus project that processes and cleans transcribed audio testimonials from Syrian Armenian refugees. The repository contains manually cleaned TextGrid alignment files, metadata linking to audio files stored externally, and tools for converting and refining transcripts from the original ReRooted Archive. The goal is to create a linguistically valuable resource for Armenian language NLP and research.

Similar Projects