Armenian Language NLP Toolkit

These projects provide tools and pipelines for processing, analyzing, and modeling Armenian text and speech, covering the Eastern, Western, and Classical varieties of the language.

28 projects
hyspell
martakert/hyspell

A Hunspell dictionary for the Armenian language, providing spellchecking support for the hy_AM locale. Includes files...

Stars: 6
uniparser-grammar-eastern-armenian
timarkh/uniparser-grammar-eastern-armenian

A rule-based morphological analyzer for Modern Eastern Armenian built with uniparser-morph. It performs lemmatization...

Python Stars: 5
armspeech
Varuzhan97/armspeech

ArmSpeech is an offline Armenian speech recognition library and CLI tool built on Coqui STT, trained on a 15.7-hour A...

Python Stars: 5
uniparser-grammar-classical-armenian
timarkh/uniparser-grammar-classical-armenian

A rule-based morphological analyzer for Classical Armenian (Grabar) built on the uniparser-morph framework. It provid...

Python Stars: 2
hyw_augment
veganyogurt4u/hyw_augment

A Python project that creates a morphological augmentation layer for Western Armenian language processing. It integra...

Python Stars: 1
xcl_nlp
caval-project/xcl_nlp

This repository contains NLP parsing models (likely dependency parsers and/or part-of-speech taggers) specifically tr...

Stars: 1
Armenian-Dialect-Detector
takavor/Armenian-Dialect-Detector

A small NLP project that trains classifiers to distinguish between Eastern and Western Armenian dialects using Wikipe...

Jupyter Notebook Stars: 1
vortan_testing
DataPoint-Armenia/vortan_testing

A testing framework for evaluating Armenian spellcheckers on precision, accuracy, and speed. It includes tools to run...

Python Stars: 1
armenian-corpus-core
RVogel101/armenian-corpus-core

A Python package for collecting, processing, and normalizing a Western Armenian language corpus. It provides a full E...

Python Stars: 0
barbar-ai
SpartakMezlumyan/barbar-ai

A web application built with Next.js that translates Modern Literary Armenian (Grakan) to Classical Armenian (Grabar)...

TypeScript Stars: 0
armenian-news-clustering
1Shadowscale1/armenian-news-clustering

A Python package for clustering and deduplicating Armenian news articles using NLP and machine learning techniques. I...

Python Stars: 0
YSU_NLP_RAG
lilitbeglaryan/YSU_NLP_RAG

A modular Retrieval-Augmented Generation (RAG) system designed for Question Answering on Armenian Labor Law documents...

Jupyter Notebook Stars: 0
armenian-rhyme-app
NazaniCode/armenian-rhyme-app

A web application for finding Armenian rhymes using IPA phoneme similarity analysis with feature-aware algorithms. It...

Python Stars: 0
loanword-detection-in-armenian
saro2808/loanword-detection-in-armenian

A research project exploring machine learning approaches for detecting loanwords in Armenian and predicting their lan...

Jupyter Notebook Stars: 0
ArmenianTransliterate
shantk54/ArmenianTransliterate

A Java-based tool for transliterating Western Armenian text to English. It reads .txt files from a resources director...

Java Stars: 0
armenian-text-processor
karapetyanrobert/armenian-text-processor

A Python tool for Armenian speech-to-text conversion and task extraction from transcribed text. It records Armenian s...

Python Stars: 0
uniparser-grammar-don-armenian
robibistspark/uniparser-grammar-don-armenian

A rule-based morphological analyzer for Modern Eastern Armenian built with uniparser-morph. It performs full morpholo...

Python Stars: 0
Armenian-Character-Network
72Maksim03/Armenian-Character-Network

A Python project that extracts and visualizes character co-occurrence networks from Armenian literary texts. It uses ...

Python Stars: 0
armenian-ner-network
daviddallakyan2005/armenian-ner-network

A pipeline for Armenian Named Entity Recognition (NER) and network analysis. Downloads Armenian text data, preprocess...

Python Stars: 0
word2vec_arm
QnarikP/word2vec_arm

A complete pipeline for training Word2Vec embeddings on Armenian text, including data preprocessing, model training w...

Jupyter Notebook Stars: 0
armenian_text_embedding_demo
BagratMinasyan/armenian_text_embedding_demo

A demonstration repository for a pre-trained Armenian text embedding model, showcasing applications in text classific...

Jupyter Notebook Stars: 0
sentiment-analysis-armenian
narekatsy/sentiment-analysis-armenian

A Python tool for sentiment analysis on Armenian literary texts from the Eastern Armenian National Corpus (EANC). It ...

Python Stars: 0
armenian-transcription
vahetildian/armenian-transcription

A small Python script for transcribing Eastern Armenian text into a custom French-inspired phonetic transcription sys...

Python Stars: 0
Armenian-tokenizer
nairabarseghyan/Armenian-tokenizer

A student project implementing multiple tokenization methods (BPE, WordPiece, SentencePiece, tiktoken) for Armenian l...

Jupyter Notebook Stars: 0
Exploring-the-Linguistic-Efficiency-of-Large-Language-Models-in-Armenian-Discourse
Anahit-N/Exploring-the-Linguistic-Efficiency-of-Large-Language-Models-in-Armenian-Discourse

A capstone project evaluating GPT-3.5-Turbo's performance on Armenian language tasks, including extractive QA, multip...

Jupyter Notebook Stars: 0
ArmenianLatinToArmenian
haykeminyan/ArmenianLatinToArmenian

A Python library for transliterating Armenian text written in Latin script (Romanized Armenian) back to the Armenian ...

Python Stars: 0
jybuben
ohmin839/jybuben

A Java-based toolkit for processing Armenian text, featuring multiple components for conversion, data collection, and...

Java Stars: 0
goybuben
ohmin839/goybuben

A Go-based toolkit for editing and processing Armenian text, consisting of three components: an API, a converter, and...

Go Stars: 0