Armenian Language Computational Resources

This cluster encompasses a diverse collection of datasets, corpora, and treebanks for the Armenian language, spanning its historical, modern, spoken, and written forms to support computational linguistics, NLP, and digital humanities research.

13 projects

treebank universal-dependencies corpus-linguistics dependency-parsing armenian-nlp

proiel-treebank

proiel/proiel-treebank

The PROIEL Treebank is a linguistic dataset containing dependency treebank annotations for texts in ancient Indo-Euro...

Stars: 40

UD_Armenian-ArmTDP

UniversalDependencies/UD_Armenian-ArmTDP

A Universal Dependencies treebank for Eastern Armenian, providing manually annotated morphological and syntactic data...

Stars: 12

armenian-intonation

jhdeov/armenian-intonation

A curated speech corpus of Armenian question-answer dialogues designed for intonation and prosody studies. It contain...

Stars: 4

ReRooted-ArmenianCorpus

jhdeov/ReRooted-ArmenianCorpus

This repository contains cleaned TextGrid transcript files for the ReRooted Archive, a corpus of Syrian Armenian refu...

Stars: 3

UD_Western_Armenian-ArmTDP

UniversalDependencies/UD_Western_Armenian-ArmTDP

A Universal Dependencies (UD) treebank for Western Armenian, containing manually annotated morphological and syntacti...

Stars: 3

ArmTDP-NER

myavrum/ArmTDP-NER

ArmTDP-NER is a manually annotated gold-standard named entity recognition (NER) corpus for Modern Eastern Armenian, c...

Stars: 3

armenian_datasets

ArmVectores/armenian_datasets

A curated list of Armenian language datasets, corpora, models, and digital resources for NLP and computational lingui...

Stars: 2

AI2001_Category-Linguistics-SC-Armenian

seanpm2001/AI2001_Category-Linguistics-SC-Armenian

This repository is part of the AI2001 project, specifically for Armenian language linguistic datasets. The README sta...

R Stars: 2

UD_Classical_Armenian-CAVaL

UniversalDependencies/UD_Classical_Armenian-CAVaL

A Universal Dependencies treebank for Classical Armenian, containing annotated texts from the Gospels and Movses Khor...

Stars: 1

UD_Armenian-BSUT

UniversalDependencies/UD_Armenian-BSUT

A Universal Dependencies treebank for Eastern Armenian, manually annotated from the ArmTDP v2.0 corpus. It includes e...

Stars: 1

armenian-stylo

CVidalG/armenian-stylo

A dataset repository for a stylometric study on Classical Armenian texts, specifically for authorship attribution of ...

Stars: 1

armenian

TITUS-2-0/armenian

This repository is part of the TITUS-2-0 project, which hosts digital editions of historical texts in various languag...

Stars: 0

UD_Middle_Armenian-ArmTDP

UniversalDependencies/UD_Middle_Armenian-ArmTDP

A Universal Dependencies treebank for Middle Armenian, manually annotated with morphological and syntactic data, deri...

Stars: 0