Browse 50 Armenian open source projects tagged with machine-learning.
Silero Models is a popular open-source repository providing pre-trained text-to-speech (TTS) and speech-to-text model...
A comprehensive portfolio repository documenting a 10-month Machine Learning Engineer bootcamp program from the Armen...
A robust Speech-to-Text (STT) service with multi-provider API integration (Google Gemini, ElevenLabs, Whisper, HiSpee...
A full-stack Armenian speech recognition system with React frontend, Spring Boot backend, and Azure deployment. Inclu...
A Master's thesis project implementing a full Retrieval-Augmented Generation (RAG) pipeline for querying Armenian ban...
An end-to-end pipeline for digitizing Classical Armenian (Grabar) texts from scanned PDFs into a searchable, translat...
ArmBench-TextEmbed is a specialized Python benchmark for evaluating the performance of text embedding models on the A...
An automated pipeline for dubbing English YouTube videos into Armenian, featuring transcription, AI-powered translati...
This repository presents a benchmark for evaluating multilingual sentence embedding models on a real-world task: cros...
A speech-to-text model for Armenian language using Wav2Vec2-BERT, fine-tuned on the Armenian Common Voice dataset. Th...
A student-friendly implementation of a GPT language model specifically for Armenian, trained on a 63 GB corpus and fi...
An end-to-end pipeline for Armenian-to-English speech and text conversion, featuring Armenian automatic speech recogn...
A machine learning project for predicting house prices in Armenia using a RandomForest model. Includes data scraping/...
A Python package for clustering and deduplicating Armenian news articles using NLP and machine learning techniques. I...
An end-to-end machine learning project analyzing Armenian kindergarten data across 9 cities. It includes web scraping...
A student project implementing multiple tokenization methods (BPE, WordPiece, SentencePiece, tiktoken) for Armenian l...
A complete data science pipeline project analyzing Yerevan's real estate market. It includes web scraping to collect ...
This project is a data science pipeline analyzing Yerevan's real estate market. It includes web scraping of Armenian ...
An AI-powered video dubbing tool specifically for Armenian language, combining speech recognition, translation, voice...
A web scraping and machine learning project that collects real estate listings from Yerevan using Selenium, processes...
ArmenianWhisper is an open-source summer project focused on creating an AI-powered pipeline for Armenian speech recog...
A web application for classifying images of Armenian dishes using a machine learning model. The project includes a Re...
A Jupyter Notebook project implementing a Convolutional Neural Network (CNN) to classify Armenian handwritten letters...
A web application built with Next.js that translates Modern Literary Armenian (Grakan) to Classical Armenian (Grabar)...
AzgIntel is an AI-powered NLP pipeline for Armenian text classification using a BERT-based multilingual model. It pro...
A real-time Armenian Sign Language translator using MediaPipe for hand landmark extraction and an LSTM neural network...
A machine learning project for detecting AI-generated text in Armenian. It includes a custom dataset, fine-tuned tran...
A research project exploring machine learning approaches for detecting loanwords in Armenian and predicting their lan...
A CNN-based project for recognizing Armenian alphabet letters from grayscale images using the Mashtots Dataset v2. Th...
A dataset of 30,000 Armenian news articles scraped from websites, categorized into six topics (Army, Political, Econo...
A dataset containing 100,000 Armenian sentences, formatted as a CSV or text file, intended for training and evaluatin...
MLTama is a project implementing Armenian draughts (a board game) with reinforcement learning agents. It includes a w...
This repository contains a Jupyter Notebook project analyzing a dataset of tech salaries in Armenia. It applies multi...
A complete pipeline for training Word2Vec embeddings on Armenian text, including data preprocessing, model training w...
A demonstration repository for a pre-trained Armenian text embedding model, showcasing applications in text classific...
This repository contains Jupyter notebooks and Python tools for downloading and analyzing ERA5 climate reanalysis dat...
A machine learning project analyzing Armenian credit registry data to predict loan defaults using logistic regression...
A workshop repository for a TUMO 2024 event teaching Optical Music Recognition (OMR) and AI-based music generation, w...
ArmSpeechTT is a fine-tuned Whisper model for Armenian speech-to-text, trained on Common Voice data with a webcam dem...
A thesis project analyzing Armenia's real estate market using data visualization, price prediction models, and featur...
A project that trains a LeNet-5 convolutional neural network on Armenian script characters. It includes data preparat...
A thesis project analyzing Armenian political discourse on Twitter using NLP techniques including sentiment analysis,...
A submission to the "Parameter Golf Armenia" competition, focusing on hyperparameter tuning for a GPT language model ...
A repository containing NLP tasks and notebooks for an ITA NLP 2024 course, set up for use with Jupyter Lab and the p...
A machine learning project for classifying Armenian coins (10, 20, 100 dram) using classic ML models like SVM, Random...
A group project from Armenia Code Academy focused on water quality classification using machine learning. The project...
Master's thesis project comparing LSTM and Transformer models for generating Armenian folk music in MIDI format. Incl...
A machine learning project for recognizing handwritten Armenian alphabet characters using the Mashtots Dataset v2. It...
A Jupyter Notebook project for predicting car prices in the Armenian market using linear regression and hyperparamete...
A book recommendation system project using Armenian user data, implementing both collaborative filtering and content-...