Linguistic Engineering Projects
This page contains a list of externally financed projects relevant to language resources and technologies carried out by Polish research institutions.
Ending in…
2026
UniDive (Universality, diversity and idiosyncrasy in language technology)
2023
CLARIN-PL-Biz (Common Language Resources and Technology Infrastructure)
DARIAH.Lab (Digital Research Infrastructure for the Arts and Humanities DARIAH-PL)
2022
HOMADOS (Hampering Misinformation by Assessing Credibility of Online Sources)
CURLICAT (Curated Multilingual Language Resources for CEF AT)
2021
CLARIN-PL (Polish chapter of Common Language Resources and Technology Infrastructure)
CORMETAN (Cognitive and sociocultural analysis of metaphoric expressions in Polish texts)
Kwantyfikatory w języku: użycie i znaczenie (Quantifiers in Language: Use and Meaning)
2020
MARCELL (Multilingual Resources for CEF.AT in the legal domain)
2019
Chronofleks (A diachronic formal model of Polish inflection and its implementation)
CoDeS (Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts)
ELRC (European Language Resource Coordination)
Parthenos (Pooling Activities, Resources and Tools for Heritage E-research Networking, Optimization and Synergies)
Scwad (Compositional distributional modelling of Polish language semantics)
SYNAMET (Microcorpus of Synaesthetic Metaphors. Towards a Formal Description and Efficient Methods of Analysis of Metaphors in Discourse)
2018
COTHEC (Computable theory of coreference)
CLARIN-PL (Common Language Resources and Technology Infrastructure)
KORBA (Electronic corpus of 17th and 18th century Polish texts)
TextLink (Structuring Discourse in Multilingual Europe)
2017
PARSEME (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing)
2016
CLARIN-PL (Common Language Resources and Technology Infrastructure)
OPTA (Identifying OPinion TArgets and opinion expressions in Polish texts)
2015
Lingwistyczny Warsztat (Linguistic workshop for speech analysis and recognition)
SpeechSamples (Cross-language comparative analysis of phonemes)
TELEDS (Human-computer dialog system based on telephone connection)
Readability (Measuring the degree of readability of nonliterary Polish texts)
2014
NEKST (An adaptive system to support problem-solving on the basis of document collections in the Internet)
CORE (Computer-based methods for coreference resolution in Polish texts)
TrendMiner (Large-scale, Cross-lingual Trend Mining and Summarisation of Real-time Media Streams)
2013
ATLAS (Applied Technology for Language-Aided CMS)
CESAR (CEntral and South-east europeAn Resources), part of META-NET
Automatic detection and correction of annotation errors in Polish language corpora
psi-toolkit (Publicly available tools for automatic processing of Polish language)
SYNAT (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society)
2012
IMPACT (Improving Access to Text)
2011
CLARIN (Common Language Resources and Technology Infrastructure)
Construction of a treebank for Polish using automatic syntactic analysis
FLaReNet (Fostering Language Resources Network)
NKJP (National Corpus of Polish)
2010
MONDILEX (Conceptual Modelling of Networking of Centres for High-Quality Research in Slavic Lexicography and Their Digital Resources)
2009
Automatic detection of semantic dependencies within verb argument structures in large treebanks
LUNA (spoken Language UNderstanding in multilinguAl communication systems)
RAMKI (Polish Framenet)
2008
Automatic extraction of linguistic knowledge from a large corpus of Polish
LT4eL (Language Technology for eLearning)
plWordNet (Automatic methods of constructing a semantic network of Polish lexemes for natural language processing)
2007
2004
The IPI PAN Corpus of Polish
2000
1999
1998
HPSG Grammar of Polish (theory and implementation)
1997