Polish-Russian Parallel Corpus

Polish-Russian Parallel Corpus Project

Project Factsheet

English Name:

Polish-Russian Parallel Corpus

Polish Name:

Korpus równoległy polsko-rosyjski

Project Type:

Grant N N104056638 of the Ministry of Science and Higher Education (later of the National Science Centre)


Project Duration:

10 May 2010 ‒ 10 May 2012 (to be extended to 10 May 2013)

Project Web Page:

http://www.pol-ros.polon.uw.edu.pl/ (in Polish)

Principal Investigator:

Marek Łaziński

Polish partners involved

Project summary

The project aims to collect 50 million words: half from Polish originals translated into Russian and half from Russian originals translated into Polish. The core of the resources consists of nineteenth-century literary classics and of contemporary works that are most popular in the neighbouring country. The corpus also contains press texts and their translations, as well as legal texts. The texts are annotated according to the DTDs of the National Corpus of Polish and the Russian National Corpus. Morphosyntactic search is possible, although the standards of the two national corpora differ in some grammatical classes and categories.