ArmBench-LLM

Metricam/ArmBench-LLM

An evaluation framework for benchmarking large language models (LLMs) on Armenian-language tasks.

Language: Python · Stars: 6 · Forks: 0 · Topic: ML/AI

Summary

ArmBench-LLM is a specialized evaluation framework for benchmarking large language models on Armenian language tasks. It supports multiple Armenian-specific datasets (language, literature, history) and the Armenian version of MMLU-Pro, offering both vLLM-optimized and Hugging Face inference. The project includes configuration management, result generation, and submission to a public leaderboard.
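The core of any such benchmark harness is scoring model outputs against gold answers. As an illustration only, a minimal scorer for MMLU-Pro-style multiple-choice items might look like the sketch below; the function and variable names are hypothetical and do not reflect ArmBench-LLM's actual API.

```python
def score_multiple_choice(predictions, gold):
    """Return accuracy over MMLU-Pro-style letter answers (e.g. A-J).

    Comparison is case-insensitive and ignores surrounding whitespace,
    so a model output of " c " matches a gold answer of "C".
    """
    if len(predictions) != len(gold):
        raise ValueError("prediction/gold length mismatch")
    correct = sum(
        p.strip().upper() == g.strip().upper()
        for p, g in zip(predictions, gold)
    )
    return correct / len(gold)

# Example: three of four illustrative answers match the gold labels.
accuracy = score_multiple_choice(["A", "c", "B", "D"], ["A", "C", "B", "A"])
print(accuracy)  # 0.75
```

In a real run, `predictions` would come from the vLLM or Hugging Face inference backend and `gold` from one of the Armenian datasets; the project's own result-generation step would then write these scores out for leaderboard submission.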