Explore 37 Armenian open-source projects in the Language/NLP category.
A professional-grade English-to-Eastern Armenian literary translation tool that uses LLMs to emulate a human translat...
This repository hosts an interactive web-based etymological dictionary for Western Armenian, featuring over 18,938 en...
A rule-based morphological analyzer for Modern Eastern Armenian built with uniparser-morph. It performs lemmatization...
A rule-based morphological analyzer for Classical Armenian (Grabar) built on the uniparser-morph framework. It provid...
A Western Armenian NLP augmentation layer designed to improve LLM output quality by providing structured linguistic d...
This repository contains NLP parsing models (likely dependency parsers and/or part-of-speech taggers) specifically tr...
A web application for finding Armenian rhymes using IPA phoneme similarity analysis with feature-aware algorithms. It...
A Python package for collecting, processing, and normalizing a Western Armenian language corpus. It provides a full E...
A rule-based morphological analyzer for Modern Eastern Armenian built with uniparser-morph. It performs full morpholo...
A JavaScript library for transliterating Classical Armenian (Grabar) text into English using Western Armenian liturgi...
ArmSpeech is an offline Armenian speech recognition library and CLI tool built on Coqui STT, trained on a 15.7-hour A...
A Python package for clustering and deduplicating Armenian news articles using NLP and machine learning techniques. I...
A modular Retrieval-Augmented Generation (RAG) system designed for Question Answering on Armenian Labor Law documents...
A pipeline for Armenian Named Entity Recognition (NER) and network analysis. Downloads Armenian text data, preprocess...
A student project implementing multiple tokenization methods (BPE, WordPiece, SentencePiece, tiktoken) for Armenian l...
A Hunspell dictionary for the Armenian language, providing spellchecking support for the hy_AM locale. Includes files...
A repository analyzing phonetic patterns in Classical Armenian texts, focusing on frequencies of vowels, consonants, ...
Soorj is an experimental Armenian-script programming language implemented in Python. It provides Armenian keywords fo...
A small NLP project that trains classifiers to distinguish between Eastern and Western Armenian dialects using Wikipe...
A testing framework for evaluating Armenian spellcheckers on precision, accuracy, and speed. It includes tools to run...
A web application built with Next.js that translates Modern Literary Armenian (Grakan) to Classical Armenian (Grabar)...
A research project exploring machine learning approaches for detecting loanwords in Armenian and predicting their lan...
A Python project that extracts and visualizes character co-occurrence networks from Armenian literary texts. It uses ...
A complete pipeline for training Word2Vec embeddings on Armenian text, including data preprocessing, model training w...
A demonstration repository for a pre-trained Armenian text embedding model, showcasing applications in text classific...
A Python tool for sentiment analysis on Armenian literary texts from the Eastern Armenian National Corpus (EANC). It ...
Hayasa is a C-based programming language designed with Armenian keywords and symbols compatible with Armenian keyboar...
A capstone project evaluating GPT-3.5-Turbo's performance on Armenian language tasks, including extractive QA, multip...
A Python library for transliterating Armenian text written in Latin script (Romanized Armenian) back to the Armenian ...
A Java-based toolkit for processing Armenian text, featuring multiple components for conversion, data collection, and...
A Go-based toolkit for editing and processing Armenian text, consisting of three components: an API, a converter, and...
A Python tool for Armenian speech-to-text conversion and task extraction from transcribed text. It records Armenian s...
A personal project to create a programming language based on the grammatical structure of the Armenian language. It's...
A Java-based tool for transliterating Western Armenian text to English. It reads .txt files from a resources director...
genarm is an experimental Common Lisp library for generating Armenian text using generative grammars. The project is ...
A small Python script for transcribing Eastern Armenian text into a custom French-inspired phonetic transcription sys...
A Scheme-based project that modifies the syntax of the Scheme programming language to allow the use of Armenian keywo...