Armenian Language NLP Tools

These projects are all software tools focused on processing, analyzing, or transforming text in the Armenian language using natural language processing and computational methods.

28 projects
hyspell
martakert/hyspell

A Hunspell dictionary for the Armenian language, providing spellchecking support for the hy_AM locale. Includes files...

Stars: 6
uniparser-grammar-eastern-armenian
timarkh/uniparser-grammar-eastern-armenian

A rule-based morphological analyzer for Modern Eastern Armenian built with uniparser-morph. It performs lemmatization...

Python Stars: 5
armspeech
Varuzhan97/armspeech

ArmSpeech is an offline Armenian speech recognition library and CLI tool built on Coqui STT, trained on a 15.7-hour A...

Python Stars: 5
uniparser-grammar-classical-armenian
timarkh/uniparser-grammar-classical-armenian

A rule-based morphological analyzer for Classical Armenian (Grabar) built on the uniparser-morph framework. It provid...

Python Stars: 2
hyw_augment
veganyogurt4u/hyw_augment

A Western Armenian NLP augmentation layer designed to improve LLM output quality by providing structured linguistic d...

Python Stars: 1
xcl_nlp
caval-project/xcl_nlp

This repository contains NLP parsing models (likely dependency parsers and/or part-of-speech taggers) specifically tr...

Stars: 1
Armenian-Dialect-Detector
takavor/Armenian-Dialect-Detector

A small NLP project that trains classifiers to distinguish between Eastern and Western Armenian dialects using Wikipe...

Jupyter Notebook Stars: 1
vortan_testing
DataPoint-Armenia/vortan_testing

A testing framework for evaluating Armenian spellcheckers on precision, accuracy, and speed. It includes tools to run...

Python Stars: 1
armenian-corpus-core
RVogel101/armenian-corpus-core

A Python package for collecting, processing, and normalizing a Western Armenian language corpus. It provides a full E...

Python Stars: 0
barbar-ai
SpartakMezlumyan/barbar-ai

A web application built with Next.js that translates Modern Literary Armenian (Grakan) to Classical Armenian (Grabar)...

TypeScript Stars: 0
armenian-news-clustering
1Shadowscale1/armenian-news-clustering

A Python package for clustering and deduplicating Armenian news articles using NLP and machine learning techniques. I...

Python Stars: 0
YSU_NLP_RAG
lilitbeglaryan/YSU_NLP_RAG

A modular Retrieval-Augmented Generation (RAG) system designed for Question Answering on Armenian Labor Law documents...

Jupyter Notebook Stars: 0
armenian-rhyme-app
NazaniCode/armenian-rhyme-app

A web application for finding Armenian rhymes using IPA phoneme similarity analysis with feature-aware algorithms. It...

Python Stars: 0
loanword-detection-in-armenian
saro2808/loanword-detection-in-armenian

A research project exploring machine learning approaches for detecting loanwords in Armenian and predicting their lan...

Jupyter Notebook Stars: 0
ArmenianTransliterate
shantk54/ArmenianTransliterate

A Java-based tool for transliterating Western Armenian text to English. It reads .txt files from a resources director...

Java Stars: 0
armenian-text-processor
karapetyanrobert/armenian-text-processor

A Python tool for Armenian speech-to-text conversion and task extraction from transcribed text. It records Armenian s...

Python Stars: 0
uniparser-grammar-don-armenian
robibistspark/uniparser-grammar-don-armenian

A rule-based morphological analyzer for Modern Eastern Armenian built with uniparser-morph. It performs full morpholo...

Python Stars: 0
Armenian-Character-Network
72Maksim03/Armenian-Character-Network

A Python project that extracts and visualizes character co-occurrence networks from Armenian literary texts. It uses ...

Python Stars: 0
armenian-ner-network
daviddallakyan2005/armenian-ner-network

A pipeline for Armenian Named Entity Recognition (NER) and network analysis. Downloads Armenian text data, preprocess...

Python Stars: 0
word2vec_arm
QnarikP/word2vec_arm

A complete pipeline for training Word2Vec embeddings on Armenian text, including data preprocessing, model training w...

Jupyter Notebook Stars: 0
armenian_text_embedding_demo
BagratMinasyan/armenian_text_embedding_demo

A demonstration repository for a pre-trained Armenian text embedding model, showcasing applications in text classific...

Jupyter Notebook Stars: 0
sentiment-analysis-armenian
narekatsy/sentiment-analysis-armenian

A Python tool for sentiment analysis on Armenian literary texts from the Eastern Armenian National Corpus (EANC). It ...

Python Stars: 0
armenian-transcription
vahetildian/armenian-transcription

A small Python script for transcribing Eastern Armenian text into a custom French-inspired phonetic transcription sys...

Python Stars: 0
Armenian-tokenizer
nairabarseghyan/Armenian-tokenizer

A student project implementing multiple tokenization methods (BPE, WordPiece, SentencePiece, tiktoken) for Armenian l...

Jupyter Notebook Stars: 0
Exploring-the-Linguistic-Efficiency-of-Large-Language-Models-in-Armenian-Discourse
Anahit-N/Exploring-the-Linguistic-Efficiency-of-Large-Language-Models-in-Armenian-Discourse

A capstone project evaluating GPT-3.5-Turbo's performance on Armenian language tasks, including extractive QA, multip...

Jupyter Notebook Stars: 0
ArmenianLatinToArmenian
haykeminyan/ArmenianLatinToArmenian

A Python library for transliterating Armenian text written in Latin script (Romanized Armenian) back to the Armenian ...

Python Stars: 0
jybuben
ohmin839/jybuben

A Java-based toolkit for processing Armenian text, featuring multiple components for conversion, data collection, and...

Java Stars: 0
goybuben
ohmin839/goybuben

A Go-based toolkit for editing and processing Armenian text, consisting of three components: an API, a converter, and...

Go Stars: 0