armenian_tales
fenlich/armenian_tales
Помощь для проекта по восточноармянско-русскому параллельному корпусу. Веб-скрейпинг сказок Ованнеса Туманяна.
Summary
A Jupyter Notebook project for web scraping Armenian fairy tales by Hovhannes Tumanyan and their Russian translations to build an Eastern Armenian-Russian parallel corpus. The work involved solving scraping issues like inconsistent HTML structure and differing URL titles across languages, resulting in a CSV dataset.