armenian_tales

fenlich/armenian_tales

Помощь для проекта по восточноармянско-русскому параллельному корпусу. Веб-скрейпинг сказок Ованнеса Туманяна.

Jupyter Notebook Stars: 0 Forks: 0 Data

Summary

A Jupyter Notebook project for web scraping Armenian fairy tales by Hovhannes Tumanyan and their Russian translations to build an Eastern Armenian-Russian parallel corpus. The work involved solving scraping issues like inconsistent HTML structure and differing URL titles across languages, resulting in a CSV dataset.

Similar Projects