armenian

TITUS-2-0/armenian

TITUS datasets of Armenian languages

Stars: 0 Forks: 0 Data

Summary

This repository is part of the TITUS-2-0 project, which hosts digital editions of historical texts in various languages. This specific repo contains datasets for Armenian languages, primarily Classical Armenian (xcl-Armn). It includes one released dataset (koriw) and several others marked as in progress. The data is encoded in TEI XML, validated via GitHub Actions, and linked to detailed metadata and documentation.

Similar Projects