natural-language-processing

Browse 5 Armenian open-source projects tagged with natural-language-processing.

armenian-news-clustering
1Shadowscale1/armenian-news-clustering

A Python package for clustering and deduplicating Armenian news articles using NLP and machine learning techniques. I...

Python Stars: 0

Armenian-tokenizer
nairabarseghyan/Armenian-tokenizer

A student project implementing multiple tokenization methods (BPE, WordPiece, SentencePiece, tiktoken) for Armenian l...

Jupyter Notebook Stars: 0

Armenian-News-Dataset
erantonyan24/Armenian-News-Dataset

A dataset of 30,000 Armenian news articles scraped from websites, categorized into six topics (Army, Political, Econo...

Stars: 0

arm_sentences_100-000
Evrikia/arm_sentences_100-000

A dataset containing 100,000 Armenian sentences, formatted as a CSV or text file, intended for training and evaluatin...

Stars: 0

-stopwords-hy-
Albert-Ananyan/-stopwords-hy-

A curated list of 316 Armenian stopwords for NLP text preprocessing, provided as a JSON file with usage examples for ...

Stars: 0