Linguistic Engineering Projects
This page contains a list of externally financed projects relevant to language resources and technologies carried out by Polish research institutions.
Ending in…
2015
Lingwistyczny Warsztat (Linguistic workshop for speech analysis and recognition)
SpeechSamples (Cross-language comparative analysis of phonemes)
TELEDS (Human-computer dialog system based on telephone connection)
2014
NEKST (An adaptive system to support problem-solving on the basis of document collections in the Internet)
CORE (Computer-based methods for coreference resolution in Polish texts)
Readability (Measuring the degree of readability of nonliterary Polish texts)
2013
Automatic detection and correction of annotation errors in Polish language corpora
Polish-Russian Parallel Corpus
psi-toolkit (Publicly available tools for automatic processing of Polish language)
SYNAT (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society)
Finished in…
2013
ATLAS (Applied Technology for Language-Aided CMS)
CESAR (CEntral and South-east europeAn Resources), part of META-NET
2012
IMPACT (Improving Access to Text)
Text digitalization tools for philological research
plWordNet 2.0
2011
CLARIN (Common Language Resources and Technology Infrastructure)
Construction of a treebank for Polish using automatic syntactic analysis
FLaReNet (Fostering Language Resources Network)
NKJP (National Corpus of Polish)
Speech Technology Systems for Public Security Systems
2010
MONDILEX (Conceptual Modelling of Networking of Centres for High-Quality Research in Slavic Lexicography and Their Digital Resources)
2009
Automatic detection of semantic dependencies within verb argument structures in large treebanks
LUNA (spoken Language UNderstanding in multilinguAl communication systems)
RAMKI (Polish Framenet)
2008
Automatic extraction of linguistic knowledge from a large corpus of Polish
LT4eL (Language Technology for eLearning)
plWordNet (Automatic methods of constructing a semantic network of Polish lexemes for natural language processing)
2007
Information Extraction from Polish free text
2004
The IPI PAN Corpus of Polish
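2000
Test-Suite of Polish Utterances
1999
Test Suites for Validation and Evaluation of Polish Language Parsers
1998
GRAMLEX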
HPSG Grammar of Polish (theory and implementation)
1997