Explore 107 Armenian open source projects in the Data category.
A Universal Dependencies treebank for Eastern Armenian, providing manually annotated morphological and syntactic data...
A curated speech corpus of Armenian question-answer dialogues designed for intonation and prosody studies. It contain...
The PROIEL Treebank is a linguistic dataset containing dependency treebank annotations for texts in ancient Indo-Euro...
A Universal Dependencies (UD) treebank for Western Armenian, containing manually annotated morphological and syntacti...
A Universal Dependencies treebank for Classical Armenian, containing annotated texts from the Gospels and Movses Khor...
A Universal Dependencies treebank for Eastern Armenian, manually annotated from the ArmTDP v2.0 corpus. It includes e...
A public repository containing an annotated version of the Kouyoumdjian 1970 Armenian-English dictionary. It provides...
A ground-truth dataset for Handwritten Text Recognition (HTR) of Armenian manuscripts from the Dulaurier collection a...
The Index of Digitized Armenian Manuscripts is a curated dataset and website that catalogs Armenian manuscripts avail...
ArmTDP-NER is a manually annotated gold-standard named entity recognition (NER) corpus for Modern Eastern Armenian, c...
A Python toolset for parsing, validating, and processing official Armenian state budget documents (budget laws, spend...
A dataset repository for a stylometric study on Classical Armenian texts, specifically for authorship attribution of ...
This repository is part of the TITUS-2-0 project, which hosts digital editions of historical texts in various languag...
A Universal Dependencies treebank for Middle Armenian, manually annotated with morphological and syntactic data, deri...
A statistical analysis project examining gender wage disparities in Armenia's labor market using R for data cleaning,...
This repository contains raw bibliographic records extracted from the 8-volume printed catalog of Armenian manuscript...
This repository provides an open dataset of Armenia's administrative divisions, including 11 regions, 71 communities,...
A fieldwork data archive for the Iranian Armenian dialect, containing audio recordings, transcriptions, and linguisti...
A data science project analyzing kindergarten infrastructure across 10 major Armenian cities. It involves web scrapin...
A data cleaning and analysis project focused on a hypothetical London dataset. The project demonstrates a full pipeli...
A data pipeline for preprocessing and modeling rental property data in Armenia, focusing on cleaning unstructured dat...
A Python project for scraping, processing, and analyzing articles from Armenian news sites to study patterns in Engli...
A project for creating Armenian OCR datasets by scraping Armenian Wiktionary, processing words into lowercase/upperca...
A bioinformatics project analyzing mitochondrial DNA (mtDNA) from Armenian and neighboring populations. It includes a...
A prototype interactive dashboard analyzing car accident trends, high-risk areas, and cost distributions in Armenia u...
A repository containing Armenian dictionary data files and a Makefile to compile them into the StarDict format. It ag...
ReRooted-ArmenianCorpus is a repository focused on cleaning and preparing a speech corpus from the ReRooted Archive, ...
A curated multilingual dataset of Armenian and Armenia-related keywords, names, and geographic terms designed for fil...
A Python web scraping tool and dataset for Armenian patents, parsing data from the Armenian Intellectual Property Off...
A curated list of Armenian language datasets, corpora, models, and digital resources for NLP and computational lingui...
A digitized version of the classic Bararan English-Armenian dictionary containing 27,001 entries. The data was conver...
This is the data repository for Armenia's implementation of the Open SDG (Sustainable Development Goals) platform. It...
A metadata scraper for the Project SAVE Armenian Photograph Archives online catalog. It extracts publicly accessible ...
A dataset of Armenian khachkar (cross-stone) information scraped and parsed from the Armenica.org website. Contains s...
A Python web scraper that extracts structured data about Armenian khachkars (cross-stones) from the armenica.org webs...
A Jupyter Notebook repository analyzing EU export-import data discrepancies under sanctions against Russia. It invest...
A web scraper and structured metadata dataset for the Nor dar (Նոր դար) Armenian periodicals collection (1884–1887) f...
A Python scraper that extracts structured metadata from the British Library's EAP180/3/10 collection of Armenian peri...
A metadata dataset of Armenian duduk music records collected from the Trove platform of the National Library of Austr...
A metadata dataset of Armenian-related music records extracted from the Trove platform of the National Library of Aus...
A dataset and Python scraper for collecting metadata records of Armenia-related maps from the National Library of Aus...
A Python web scraper that extracts metadata from the Pan-Armenian Digital Library's "Collection 9," which contains di...
A Python web scraper that extracts structured metadata from the Columbia Armenian Oral History Archive (1968–1977) co...
A Python scraper for extracting metadata from the Pan-Armenian Digital Library (ARAR) collection of Armenian and Arme...
An interactive data visualization project built with React/TypeScript/Vite that explores Armenia's IT labor market tr...
A Python scraper targeting the Library of Congress "Greek and Armenian Patriarchates of Jerusalem" collection to extr...
A dataset of metadata for Australian newspaper articles from 1915-1923 covering the Armenian Genocide, extracted from...
An interactive data story project analyzing Armenia's IT labor market from 2022 to 2025, focusing on employment growt...
This repository contains a data visualization project analyzing migration patterns and employment conditions in Armen...
A data analysis mini-project exploring the correlation between access to basic handwashing facilities and child morta...
A Python project analyzing user engagement across 24 Armenian news YouTube channels. It scrapes data (views, ratings,...
A dataset of 30,000 Armenian news articles scraped from websites, categorized into six topics (Army, Political, Econo...
A translated version of the DailyDialog dataset into Eastern Armenian, formatted as sequential sentence pairs (input/...
A dataset containing 100,000 Armenian sentences, formatted as a CSV or text file, intended for training and evaluatin...
A data visualization project analyzing female workforce participation in Armenia using R and R Shiny. It includes int...
This repository contains an R-based research project analyzing Armenia's international economic partnerships from 202...
An R-based data analysis project exploring Armenia's demographic trends (birth, death, migration) through data cleani...
A repository containing curated Armenian genetic data (G25 coordinates) for population genetics analysis. It aggregat...
A data analysis project examining car market trends in Armenia from 2015-2024, using R to analyze import/export data ...
An R Markdown project analyzing and visualizing healthcare outcomes (life expectancy, mortality, spending, risk facto...
A university capstone project analyzing Armenia's automobile and battery market using Jupyter Notebooks. It examines ...
A GitHub repository containing a front-end web application to visualize historical petrol/gas prices in Armenia. The ...
A repository containing datasets of Armenian-related art exhibits from two Russian museum sources: the "Artefact" pro...
A project analyzing 17 years (2006-2022) of Armenian government budget expenditure data using SQL for data cleaning a...
Analysis of Armenian population perceptions using 2019 Caucasus Barometer survey data. The project explores attitudes...
A case study analyzing Armenia's labor market dynamics using job posting data scraped from an online portal. The proj...
A Python web scraper and structured metadata dataset for the Armenian Rare Books collection (EAP180/1/1, 1512–1800) f...
A focused web scraping project that extracts and structures metadata from the Haraj Armenian periodicals collection (...
A Python web scraper and structured metadata dataset for the Murch Armenian periodicals collection (1889–1898) from t...
This repository contains a Python scraper and cleaned metadata dataset for the Armenian books collection (EAP180/1/2)...
A Python web scraper and cleaned metadata dataset for the British Library's EAP180/1/3 collection of Armenian books f...
This repository contains a dataset of metadata for Armenian-language books from the ARAM digital library in France. I...
A Python scraper that extracts structured metadata for Armenian traditional songs from the WebARAM cultural heritage ...
This repository contains a data visualization project analyzing the relationship between employment conditions and mi...
Este repositorio contiene un análisis estadístico detallado del panorama de los micronegocios en Colombia basado en l...
This repository contains data analysis scripts and visualizations exploring healthcare resource disparities across Ar...
A Python script that downloads historical exchange rate data from the Central Bank of Armenia via a SOAP API and save...
A repository providing structured API access to Islamic religious texts (Quran, Hadith) and resources in five Europea...
A Python web scraper and structured metadata dataset for the Ardzaganq Armenian periodicals collection (1882–1891) fr...
A Python web scraper that extracts metadata from the Pan-Armenian Digital Library (ARAR), specifically targeting Coll...
A Python scraper and dataset for the Library of Congress Armenian Rarities collection. It extracts structured metadat...
A Python script that extracts metadata records related to Armenian cultural heritage from the Hispana (Spain's nation...
A Python scraper that extracts metadata about Armenian elements listed in the UNESCO Intangible Cultural Heritage dat...
A Jupyter Notebook project for web scraping Armenian fairy tales by Hovhannes Tumanyan and their Russian translations...
A curated list of 316 Armenian stopwords for NLP text preprocessing, provided as a JSON file with usage examples for ...
This repository provides spatial data (geographic boundaries) for protected areas in Armenia, such as national parks ...
A Python web scraper and structured metadata dataset for the Armenian Books (1901–1920) collection from the British L...
A Python web scraper and structured metadata dataset for the "Ararat" Armenian periodicals collection (1868-1886) fro...
A Python scraper and structured metadata dataset for the "Murch" Armenian periodicals collection (EAP180/2/1, 1889–18...
A Python scraper and data cleaning tool that extracts and structures historical records of Armenian refugees at Camp ...
A repository claiming to contain "all words from every language" but actually provides comma-separated word lists for...
This is a data mining project analyzing Yerevan's rental market using Python and Jupyter Notebook. It cleans and anal...
A Python project for scraping, parsing, and visualizing rental apartment listing data from list.am for Yerevan, Armen...
A repository that is part of the Seanpm2001 WorldDB project, specifically containing the Earth/Armenia database set. ...
A repository containing lists of Armenian words, including a romanization guide and word lists in various formats (CS...
This repository is part of the Seanpm2001 WorldDB project, specifically containing documentation for the Earth/Armeni...
A curated collection of links to official documents, reports, and data related to transport policy, planning, and dev...
A repository containing Armenian (Western) vocabulary data for the Vocably language-learning app. The data is primari...
A repository containing public domain works by the Armenian author Melik S. Davit-Bek, specifically focusing on the h...
An R-based data analysis project comparing health outcomes across Armenia, Georgia, and Azerbaijan. It explores how s...
A dashboard for analyzing political advertising targeting in Armenia, likely part of the "Who Targets Me" (WTM) proje...
A repository serving as a curated index of transport-related documents, plans, and data sources for Armenia. It is st...
This repository appears to be a single data file for Armenia within a larger geographic knowledge base (SpocWiki). It...
This repository contains a descriptive analysis of Armenia's electricity production from fossil fuels versus renewabl...
A repository containing Armenian vocabulary data (words and translations) automatically generated and updated for the...
A repository containing a claimed database of Armenian phone numbers, presented as an educational resource for securi...
This repository is part of the AI2001 project, specifically for Armenian language linguistic datasets. The README sta...