Armenian Language Digital Resources

These projects collectively provide annotated datasets, corpora, and tools for computational and linguistic research on the Armenian language across its historical and modern dialects.

13 projects
proiel-treebank
proiel/proiel-treebank

The PROIEL Treebank is a linguistic dataset containing dependency treebank annotations for texts in ancient Indo-Euro...

Stars: 40
UD_Armenian-ArmTDP
UniversalDependencies/UD_Armenian-ArmTDP

A Universal Dependencies treebank for Eastern Armenian, providing manually annotated morphological and syntactic data...

Stars: 12
armenian-intonation
jhdeov/armenian-intonation

A curated speech corpus of Armenian question-answer dialogues designed for intonation and prosody studies. It contain...

Stars: 4
ReRooted-ArmenianCorpus
jhdeov/ReRooted-ArmenianCorpus

ReRooted-ArmenianCorpus is a work-in-progress speech corpus project that processes and cleans transcribed audio testi...

Stars: 3
UD_Western_Armenian-ArmTDP
UniversalDependencies/UD_Western_Armenian-ArmTDP

A Universal Dependencies (UD) treebank for Western Armenian, containing manually annotated morphological and syntacti...

Stars: 3
ArmTDP-NER
myavrum/ArmTDP-NER

ArmTDP-NER is a manually annotated gold-standard named entity recognition (NER) corpus for Modern Eastern Armenian, c...

Stars: 3
armenian_datasets
ArmVectores/armenian_datasets

A curated list of Armenian language datasets, corpora, models, and digital resources for NLP and computational lingui...

Stars: 2
AI2001_Category-Linguistics-SC-Armenian
seanpm2001/AI2001_Category-Linguistics-SC-Armenian

This repository is part of the AI2001 project, specifically for Armenian language linguistic datasets. The README sta...

R Stars: 2
UD_Classical_Armenian-CAVaL
UniversalDependencies/UD_Classical_Armenian-CAVaL

A Universal Dependencies (UD) treebank for Classical Armenian, containing annotated texts from the Gospels and Movses...

Stars: 1
UD_Armenian-BSUT
UniversalDependencies/UD_Armenian-BSUT

A Universal Dependencies treebank for Eastern Armenian, manually annotated from the ArmTDP v2.0 corpus. It includes e...

Stars: 1
armenian-stylo
CVidalG/armenian-stylo

A dataset repository for a stylometric study on Classical Armenian texts, specifically for authorship attribution of ...

Stars: 1
armenian
TITUS-2-0/armenian

This repository is part of the TITUS-2-0 project, which hosts digital editions of historical texts in various languag...

Stars: 0
UD_Middle_Armenian-ArmTDP
UniversalDependencies/UD_Middle_Armenian-ArmTDP

A Universal Dependencies treebank for Middle Armenian, containing manually annotated grammatical examples for linguis...

Stars: 0