Browse 39 Armenian open source projects tagged with armenian-nlp.
ArmBench-LLM is a specialized evaluation framework for benchmarking large language models on Armenian language tasks....
ArmTDP-NER is a manually annotated gold-standard named entity recognition (NER) corpus for Modern Eastern Armenian, c...
A specialized evaluation benchmark for testing text embedding models on Armenian language tasks including semantic te...
A Python project that creates a morphological augmentation layer for Western Armenian language processing. It integra...
This repository presents a benchmark for evaluating multilingual sentence embedding models on a real-world task: cros...
A voice assistant for Armenian banks that uses LiveKit for real-time audio, scrapes bank websites for data, employs R...
A project to create a bilingual Western Armenian-English large language model using QLoRA fine-tuning on Qwen 2.5 1.5...
A Python package for collecting, processing, and normalizing a Western Armenian language corpus. It provides a full E...
This repository contains a political science research project comparing machine learning methods for classifying gove...
A Retrieval-Augmented Generation (RAG) system designed to answer questions about Armenian Labor Law in Armenian. It i...
A web application for finding Armenian rhymes using IPA phoneme similarity analysis with feature-aware algorithms. It...
A speech-to-text model for Armenian language using Wav2Vec2-BERT, fine-tuned on the Armenian Common Voice dataset. Th...
A project exploring Armenian language autocomplete using multiple NLP approaches including Word2Vec, LSTM, BERT trans...
An AI-powered system for analyzing scanned Armenian newspapers using computer vision (YOLO for section detection, Ope...
A Python package for clustering and deduplicating Armenian news articles using NLP and machine learning techniques. I...
A pipeline for Armenian Named Entity Recognition (NER) and network analysis. Downloads Armenian text data, preprocess...
A student project implementing multiple tokenization methods (BPE, WordPiece, SentencePiece, tiktoken) for Armenian l...
A curated list of Armenian language datasets, corpora, models, and digital resources for NLP and computational lingui...
A Telegram bot that classifies Armenian news articles into four categories (Political, Social, Educational, Health) u...
A Western Armenian language model project using transformer architecture, implemented in Jupyter notebooks. Includes ...
A small NLP project that trains classifiers to distinguish between Eastern and Western Armenian dialects using Wikipe...
A Python-based OpenClaw skill that monitors Armenian Telegram channels for power/water outage announcements. It scrap...
A web application built with Next.js that translates Modern Literary Armenian (Grakan) to Classical Armenian (Grabar)...
AzgIntel is an AI-powered NLP pipeline for Armenian text classification using a BERT-based multilingual model. It pro...
A machine learning project for detecting AI-generated text in Armenian. It includes a custom dataset, fine-tuned tran...
A research project exploring machine learning approaches for detecting loanwords in Armenian and predicting their lan...
A dataset of 30,000 Armenian news articles scraped from websites, categorized into six topics (Army, Political, Econo...
A translated version of the DailyDialog dataset into Eastern Armenian, formatted as sequential sentence pairs (input/...
A dataset containing 100,000 Armenian sentences, formatted as a CSV or text file, intended for training and evaluatin...
An OCR project for Armenian handwritten text using the Mashtots dataset, implemented in TensorFlow/Keras and trained ...
A Python project that extracts and visualizes character co-occurrence networks from Armenian literary texts. It uses ...
A complete pipeline for training Word2Vec embeddings on Armenian text, including data preprocessing, model training w...
A demonstration repository for a pre-trained Armenian text embedding model, showcasing applications in text classific...
A Python tool for sentiment analysis on Armenian literary texts from the Eastern Armenian National Corpus (EANC). It ...
A capstone project evaluating GPT-3.5-Turbo's performance on Armenian language tasks, including extractive QA, multip...
A Python library for transliterating Armenian text written in Latin script (Romanized Armenian) back to the Armenian ...
A thesis project analyzing Armenian political discourse on Twitter using NLP techniques including sentiment analysis,...
A curated list of 316 Armenian stopwords for NLP text preprocessing, provided as a JSON file with usage examples for ...
A Python tool for Armenian speech-to-text conversion and task extraction from transcribed text. It records Armenian s...