This cluster consists of projects focused on natural language processing, corpus linguistics, and computational analysis specifically for the Armenian language.
A Hunspell dictionary for the Armenian language, providing spellchecking support for the hy_AM locale. Includes files...
A rule-based morphological analyzer for Modern Eastern Armenian built with uniparser-morph. It performs lemmatization...
ArmSpeech is an offline Armenian speech recognition library and CLI tool built on Coqui STT, trained on a 15.7-hour A...
A rule-based morphological analyzer for Classical Armenian (Grabar) built on the uniparser-morph framework. It provid...
A Western Armenian NLP augmentation layer designed to improve LLM output quality by providing structured linguistic d...
A web application for finding Armenian rhymes using IPA phoneme similarity analysis with feature-aware algorithms. It...
A small NLP project that trains classifiers to distinguish between Eastern and Western Armenian dialects using Wikipe...
A testing framework for evaluating Armenian spellcheckers on precision, accuracy, and speed. It includes tools to run...
A Python package for collecting, processing, and normalizing a Western Armenian language corpus. It provides a full E...
A web application built with Next.js that translates Modern Literary Armenian (Grakan) to Classical Armenian (Grabar)...
A Python package for clustering and deduplicating Armenian news articles using NLP and machine learning techniques. I...
A modular Retrieval-Augmented Generation (RAG) system designed for Question Answering on Armenian Labor Law documents...
A Java-based tool for transliterating Western Armenian text to English. It reads .txt files from a resources director...
A Python tool for Armenian speech-to-text conversion and task extraction from transcribed text. It records Armenian s...
A rule-based morphological analyzer for Modern Eastern Armenian built with uniparser-morph. It performs full morpholo...
A Python project that extracts and visualizes character co-occurrence networks from Armenian literary texts. It uses ...
A pipeline for Armenian Named Entity Recognition (NER) and network analysis. Downloads Armenian text data, preprocess...
A demonstration repository for a pre-trained Armenian text embedding model, showcasing applications in text classific...
A Python tool for sentiment analysis on Armenian literary texts from the Eastern Armenian National Corpus (EANC). It ...
A small Python script for transcribing Eastern Armenian text into a custom French-inspired phonetic transcription sys...
A student project implementing multiple tokenization methods (BPE, WordPiece, SentencePiece, tiktoken) for Armenian l...
A Python library for transliterating Armenian text written in Latin script (Romanized Armenian) back to the Armenian ...
A Java-based toolkit for processing Armenian text, featuring multiple components for conversion, data collection, and...
A Go-based toolkit for editing and processing Armenian text, consisting of three components: an API, a converter, and...
A professional-grade English-to-Eastern Armenian literary translation tool that uses LLMs to emulate a human translat...
This repository hosts an interactive web-based etymological dictionary for Western Armenian, featuring over 18,938 en...
A JavaScript library for transliterating Classical Armenian (Grabar) text into English using Western Armenian liturgi...
Armenian Contexto is a semantic word guessing game implementation that uses Armenian FastText embeddings and cosine s...
Apertium monolingual language package for Armenian (Western/Eastern dialects) providing morphological analysis, gener...