Browse 5 Armenian open source projects tagged with natural-language-processing.
A Python package for clustering and deduplicating Armenian news articles using NLP and machine learning techniques. I...
A student project implementing multiple tokenization methods (BPE, WordPiece, SentencePiece, tiktoken) for Armenian l...
A dataset of 30,000 Armenian news articles scraped from websites, categorized into six topics (Army, Political, Econo...
A dataset containing 100,000 Armenian sentences, formatted as a CSV or text file, intended for training and evaluatin...
A curated list of 316 Armenian stopwords for NLP text preprocessing, provided as a JSON file with usage examples for ...