Explore 91 Armenian open source projects in the Data category.
A Universal Dependencies treebank for Eastern Armenian, providing manually annotated morphological and syntactic data...
A curated speech corpus of Armenian question-answer dialogues designed for intonation and prosody studies. It contain...
The PROIEL Treebank is a linguistic dataset containing dependency treebank annotations for texts in ancient Indo-Euro...
A Universal Dependencies (UD) treebank for Western Armenian, containing manually annotated morphological and syntacti...
A Universal Dependencies treebank for Eastern Armenian, manually annotated from the ArmTDP v2.0 corpus. It includes e...
A public repository containing an annotated version of the Kouyoumdjian 1970 Armenian-English dictionary. It provides...
A ground-truth dataset for Handwritten Text Recognition (HTR) of Armenian manuscripts from the Dulaurier collection a...
The Index of Digitized Armenian Manuscripts is a curated dataset and website that catalogs Armenian manuscripts avail...
ArmTDP-NER is a manually annotated gold-standard named entity recognition (NER) corpus for Modern Eastern Armenian, c...
A Python toolset for parsing, validating, and processing official Armenian government budget documents (laws, spendin...
A Universal Dependencies (UD) treebank for Classical Armenian, containing annotated texts from the Gospels and Movses...
A dataset repository for a stylometric study on Classical Armenian texts, specifically for authorship attribution of ...
This repository is part of the TITUS-2-0 project, which hosts digital editions of historical texts in various languag...
A statistical analysis project examining gender wage disparities in Armenia's labor market using R for data cleaning,...
This repository contains raw bibliographic records extracted from the 8-volume printed catalog of Armenian manuscript...
A Universal Dependencies treebank for Middle Armenian, containing manually annotated grammatical examples for linguis...
A fieldwork data archive for the Iranian Armenian dialect, containing audio recordings, transcriptions, and linguisti...
A data science project analyzing kindergarten infrastructure across 10 major Armenian cities. It involves web scrapin...
A data cleaning and analysis project focused on a hypothetical London dataset. The project demonstrates a full pipeli...
A data pipeline for preprocessing and modeling rental property data in Armenia, focusing on cleaning unstructured dat...
A Python project for scraping, processing, and analyzing articles from Armenian news sites to study patterns in Engli...
A project for creating Armenian OCR datasets by scraping Armenian Wiktionary, processing words into lowercase/upperca...
A bioinformatics project analyzing mitochondrial DNA (mtDNA) from Armenian and neighboring populations. It includes a...
A prototype interactive dashboard analyzing car accident trends, high-risk areas, and cost distributions in Armenia u...
A repository containing Armenian dictionary data files and a Makefile to compile them into the StarDict format. It ag...
ReRooted-ArmenianCorpus is a work-in-progress speech corpus project that processes and cleans transcribed audio testi...
A curated multilingual dataset of Armenian and Armenia-related keywords, names, and geographic terms designed for fil...
A Python web scraping tool and dataset for Armenian patents, parsing data from the Armenian Intellectual Property Off...
A curated list of Armenian language datasets, corpora, models, and digital resources for NLP and computational lingui...
A digitized version of the classic Bararan English-Armenian dictionary containing 27,001 entries. The data was conver...
This is the data repository for Armenia's implementation of the Open SDG (Sustainable Development Goals) platform. It...
A dataset of Armenian khachkar (cross-stone) information scraped and parsed from the Armenica.org website. Contains s...
A Python web scraper that extracts structured data about Armenian khachkars (cross-stones) from the armenica.org webs...
A Jupyter Notebook repository analyzing EU export-import data discrepancies under sanctions against Russia. It invest...
A web scraper and structured metadata dataset for the Nor dar (Նոր դար) Armenian periodicals collection (1884–1887) f...
A Python scraper that extracts structured metadata from the British Library's EAP180/3/10 collection of Armenian peri...
A metadata dataset of Armenian duduk music records collected from the Trove platform of the National Library of Austr...
A metadata dataset of Armenian-related music records extracted from the Trove platform of the National Library of Aus...
A dataset and Python scraper for collecting metadata records of Armenia-related maps from the National Library of Aus...
A Python web scraper that extracts metadata from the Pan-Armenian Digital Library's "Collection 9," which contains di...
A Python web scraper that extracts structured metadata from the Columbia Armenian Oral History Archive (1968–1977) co...
A Python scraper for extracting metadata from the Pan-Armenian Digital Library (ARAR) collection of Armenian and Arme...
An interactive data visualization project built with React/TypeScript/Vite that explores Armenia's IT labor market tr...
A Python scraper targeting the Library of Congress "Greek and Armenian Patriarchates of Jerusalem" collection to extr...
A metadata scraper for the Project SAVE Armenian Photograph Archives online catalog. It extracts publicly accessible ...
A dataset of metadata for Australian newspaper articles from 1915-1923 covering the Armenian Genocide, extracted from...
An interactive data story project analyzing Armenia's IT labor market from 2022 to 2025, focusing on employment growt...
This repository contains a data visualization project analyzing migration patterns and employment conditions in Armen...
A data analysis mini-project exploring the correlation between access to basic handwashing facilities and child morta...
A Python project analyzing user engagement across 24 Armenian news YouTube channels. It scrapes data (views, ratings,...
A dataset of 30,000 Armenian news articles scraped from websites, categorized into six topics (Army, Political, Econo...
A translated version of the DailyDialog dataset into Eastern Armenian, formatted as sequential sentence pairs (input/...
A dataset containing 100,000 Armenian sentences, formatted as a CSV or text file, intended for training and evaluatin...
A data visualization project analyzing female workforce participation in Armenia using R and R Shiny. It includes int...
This repository contains an R-based research project analyzing Armenia's international economic partnerships from 202...
An R-based data analysis project exploring Armenia's demographic trends (birth, death, migration) through data cleani...
A repository containing curated Armenian genetic data (G25 coordinates) for population genetics analysis. It aggregat...
A data analysis project examining car market trends in Armenia from 2015-2024, using R to analyze import/export data ...
An R Markdown project analyzing and visualizing healthcare outcomes (life expectancy, mortality, spending, risk facto...
A university capstone project analyzing Armenia's automobile and battery market using Jupyter Notebooks. It examines ...
A GitHub repository containing a front-end web application to visualize historical petrol/gas prices in Armenia. The ...
A repository containing datasets of Armenian-related art exhibits from two Russian museum sources: the "Artefact" pro...
A project analyzing 17 years (2006-2022) of Armenian government budget expenditure data using SQL for data cleaning a...
Analysis of Armenian population perceptions using 2019 Caucasus Barometer survey data. The project explores attitudes...
A case study analyzing Armenia's labor market dynamics using job posting data scraped from an online portal. The proj...
A Python script that downloads historical exchange rate data from the Central Bank of Armenia via a SOAP API and save...
A Python web scraper and structured metadata dataset for the Ardzaganq Armenian periodicals collection (1882–1891) fr...
A Python web scraper that extracts metadata from the Pan-Armenian Digital Library (ARAR), specifically targeting Coll...
A Python scraper and dataset for the Library of Congress Armenian Rarities collection. It extracts structured metadat...
A Python script that extracts metadata records related to Armenian cultural heritage from the Hispana (Spain's nation...
A Python scraper that extracts metadata about Armenian elements listed in the UNESCO Intangible Cultural Heritage dat...
A Jupyter Notebook project for web scraping Armenian fairy tales by Hovhannes Tumanyan and their Russian translations...
A curated list of 316 Armenian stopwords for NLP text preprocessing, provided as a JSON file with usage examples for ...
This repository provides spatial data (geographic boundaries) for protected areas in Armenia, such as national parks ...
A repository claiming to contain "all words from every language that exists in the universe," but actually provides c...
This is a data mining project analyzing Yerevan's rental market using Python and Jupyter Notebook. It cleans and anal...
A Python project for scraping, parsing, and visualizing rental apartment listing data from list.am for Yerevan, Armen...
A repository that is part of the Seanpm2001 WorldDB project, specifically containing the Earth/Armenia database set. ...
A repository containing lists of Armenian words, including a romanization guide and word lists in various formats (CS...
This repository is part of the Seanpm2001 WorldDB project, specifically containing documentation for the Earth/Armeni...
A curated collection of links to official documents, reports, and data related to transport policy, planning, and dev...
A repository containing Armenian (Western) vocabulary data for the Vocably language-learning app. The data is primari...
A repository containing public domain works by the Armenian author Melik S. Davit-Bek, specifically focusing on the h...
An R-based data analysis project comparing health outcomes across Armenia, Georgia, and Azerbaijan. It explores how s...
A dashboard for analyzing political advertising targeting in Armenia, likely part of the "Who Targets Me" (WTM) proje...
A repository serving as a curated index of transport-related documents, plans, and data sources for Armenia. It is st...
This repository is a structured data file for Armenia within a larger geographic knowledge base (SpocWiki). It contai...
This repository contains a descriptive analysis of Armenia's electricity production from fossil fuels versus renewabl...
A repository containing Armenian vocabulary data (words and translations) automatically generated and updated for the...
A repository containing a claimed database of Armenian phone numbers, presented as an educational resource for securi...
This repository is part of the AI2001 project, specifically for Armenian language linguistic datasets. The README sta...