PressMint project
Project factsheet
English name: |
Interoperable corpora of historical newspapers |
Polish name: |
Interoperacyjne korpusy gazet historycznych |
Project type: |
A CLARIN ERIC Flasghip project |
Duration: |
1 June 2025 ‒ 31 May 2027 |
Project Web page: |
https://www.clarin.eu/content/pressmint-interoperable-corpora-historical-newspapers/ |
Polish partners in the consortium
University of Wrocław (PI: Adam Pawłowski)
Linguistic Engineering Group, Institute of Computer Science, Polish Academy of Sciences (PI: Maciej Ogrodniczuk)
Project description
PressMint is a CLARIN flagship project that aims to compile a multilingual, comparable, annotated, translated and interoperable set of corpora of European historical newspapers from around the start of the 20th century. The PressMint corpora will be openly available, both for download in a variety of instances and formats, as well as via several online corpus analysis tools. The project will proactively disseminate and foster the use of the corpus collection.
While historical newspapers are of interest to a diverse group of researchers from the social sciences and humanities - historians, historical linguists, social scientists, ethnologists, anthropologists, media and communication scholars, and cultural studies amongst others - contemporary digital resources, tools and methods are still underutilised in these fields. Existing corpora are not interoperable, which precludes methods for their comparison, as well as any translingual and transnational research, an especially important consideration, as statehood and nationhood are highly dynamic in Europe in the period to be covered by the project corpora. The PressMint project aims to improve this situation by providing a valuable service to the academic community on a truly pan-European, multilingual and multidisciplinary level.