Locked History Actions

Diff for "Linguistic Engineering Group"

Differences between revisions 3 and 32 (spanning 29 versions)
Revision 3 as of 2011-03-07 13:41:55
Size: 9597
Comment:
Revision 32 as of 2012-01-19 15:14:24
Size: 37
Editor: MichalLenart
Comment:
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
= The Linguistic Engineering Group =

The Linguistic Engineering (LE) Group is part of the [[http://www.ipipan.waw.pl/en/dept/dept-ai.html|Department of Artificial Intelligence]] at the [[http://www.ipipan.waw.pl/en/|Institute of Computer Science]], [[http://www.english.pan.pl/|Polish Academy of Sciences]] (ICS PAS).

== People ==

|| Anna Andrzejczuk, MSc || [[mailto:anna.andrzejczuk@ipipan.waw.pl|anna.andrzejczuk@ipipan.waw.pl]] ||
|| Leonard Bolc, PhD (Professor Emeritus) || [[mailto:leonard.bolc@ipipan.waw.pl|leonard.bolc@ipipan.waw.pl]] ||
|| Łukasz Degórski, MSc || [[mailto:ldegorski@bach.ipipan.waw.pl|ldegorski@bach.ipipan.waw.pl]] ||
|| [[http://www.ipipan.waw.pl/~hajnicz/|Elżbieta Hajnicz]], PhD || [[mailto:elzbieta.hajnicz@ipipan.waw.pl|elzbieta.hajnicz@ipipan.waw.pl]] ||
|| Łukasz Kobyliński, MSc || [[mailto:lkobylinski@ipipan.waw.pl|lkobylinski@ipipan.waw.pl]] ||
|| [[http://www.ipipan.waw.pl/~aniak/|Anna Kupść]], PhD (on leave) || [[mailto:anna.kupsc@ipipan.waw.pl|anna.kupsc@ipipan.waw.pl]] ||
|| Małgorzata Marciniak, PhD || [[mailto:malgorzata.marciniak@ipipan.waw.pl|malgorzata.marciniak@ipipan.waw.pl]] ||
|| [[http://marcinmilkowski.pl/|Marcin Miłkowski]], PhD (part time) || [[mailto:marcin.milkowski@ifispan.waw.pl|marcin.milkowski@ifispan.waw.pl]] ||
|| [[http://www.ipipan.waw.pl/~agn/|Agnieszka Mykowiecka]], PhD || [[mailto:agnieszka.mykowiecka@ipipan.waw.pl|agnieszka.mykowiecka@ipipan.waw.pl]] ||
|| Maciej Ogrodniczuk, PhD || [[mailto:maciej.ogrodniczuk@ipipan.waw.pl|maciej.ogrodniczuk@ipipan.waw.pl]] ||
|| Jakub Piskorski, PhD (Associate) || [[mailto:jakub.piskorski@ipipan.waw.pl|jakub.piskorski@ipipan.waw.pl]] ||
|| [[../../../../~adamp/|Adam Przepiórkowski]], PhD, Head of the Group || [[mailto:adam.przepiorkowski@ipipan.waw.pl|adam.przepiorkowski@ipipan.waw.pl]] ||
|| Piotr Rychlik, PhD || [[mailto:rychlik@ipipan.waw.pl|rychlik@ipipan.waw.pl]] ||
|| [[http://www.cs.albany.edu/~tomek/|Tomek Strzałkowski]], PhD, Foreign Associate || [[mailto:tomek@cs.albany.edu|tomek@cs.albany.edu]] ||
|| Łukasz Szałkiewicz, MSc || [[mailto:lukasz.szalkiewicz@ipipan.waw.pl|lukasz.szalkiewicz@ipipan.waw.pl]] ||
|| [[http://www.site.uottawa.ca/~szpak/|Stan Szpakowicz]], PhD, Foreign Associate || [[mailto:szpak@site.uottawa.ca|szpak@site.uottawa.ca]] ||
|| Aleksander Wawer, MSc || [[mailto:aleksander.wawer@ipipan.waw.pl|aleksander.wawer@ipipan.waw.pl]] ||
|| [[http://www.ipipan.waw.pl/~wolinski/|Marcin Woliński]], PhD || [[mailto:marcin.wolinski@ipipan.waw.pl|marcin.wolinski@ipipan.waw.pl]] ||
|| Alina Wróblewska, MSc || [[mailto:alina.wroblewska@ipipan.waw.pl|alina.wroblewska@ipipan.waw.pl]] ||

== Research ==

=== The main research areas of the Group ===

 * (Polish) corpus linguistics; cf. the [[http://korpus.pl/en/|IPI PAN Corpus of Polish]] and the [[http://nkjp.pl/|National Corpus of Polish]],
 * syntactic and semantic parsing of Polish; cf. [[http://nlp.ipipan.waw.pl/Spejd/|Spejd]] and [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]],
 * extraction of linguistic knowledge from corpora,
 * information extraction,
 * sentiment analysis,
 * morphosyntactic system of Polish,
 * generative linguistic formalisms, esp., HPSG and LFG.

The Group is a member of [[http://www.clarin.eu/|CLARIN]], [[http://www.flarenet.eu/|FLaReNet]] and [[http://www.meta-net.eu/|META-NET]].

=== Current externally funded projects ===

 * [[CESAR]] (CEntral and South-east europeAn Resources),
 * [[SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society),
 * [[NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet),
 * [[ATLAS]] (Applied Technology for Language-Aided CMS),
 * [[Construction of a treebank for Polish using automatic syntactic analysis]],
 * [[CLARIN]] (Common Language Resources and Technology Infrastructure),
 * [[NKJP]] (National Corpus of Polish).

=== Some of our past projects ===

 * ''Automatic detection of semantic dependencies within verb argument structures in large treebanks'' ‒ a national [[http://www.eng.nauka.gov.pl/meinen/|Ministry of Science and Higher Education]] habilitation grant (number N N516 0165 33), 2 November 2007 ‒ 1 November 2009. Polish title: ''Automatyczne wykrywanie zależności semantycznych w strukturze argumentowej czasowników w dużych korpusach tekstów anotowanych syntaktycznie''. PI: Elżbieta Hajnicz.
 * ''[[http://www.ist-luna.eu/|LUNA]] (spoken Language UNderstanding in multilinguAl communication systems)'' ‒ a European ( [[http://www.cordis.lu/ist/|IST]]) Specific Targeted Research Project (contract number 033549), 4 September 2006 ‒ 3 September 2009. Polish PI: Agnieszka Mykowiecka.
 * ''Spoken language understanding in multilingual communication systems'' ‒ a [[http://www.eng.nauka.gov.pl/meinen/|Ministry of Science and Higher Education]] support for the Polish participation in the [[http://www.ist-luna.eu/|LUNA]] project, 1 March 2008 ‒ 1 September 2009. Polish title: ''Rozumienie mowy w wielojęzycznych systemach komunikacji''. PI: Małgorzata Marciniak.
 * ''[[http://www.lt4el.eu/|LT4eL]] (Language Technology for eLearning)'' ‒ a European ( [[http://www.cordis.lu/ist/|IST]]) Specific Targeted Research Project (contract number 027391), 1 December 2005 ‒ 31 May 2008. Polish PI: Adam Przepiórkowski.
 * ''[[http://nlp.ipipan.waw.pl/PPJP/|Automatic extraction of linguistic knowledge from a large corpus of Polish]]'' ‒ a national [[http://www.eng.nauka.gov.pl/meinen/|Ministry of Science and Higher Education]] research grant (number 3T11C00328), 9 March 2005 ‒ 8 March 2008. Polish title: ''Automatyczna ekstrakcja wiedzy lingwistycznej z dużego korpusu języka polskiego''. PI: Adam Przepiórkowski. The first publicly available tagger of Polish, [[http://nlp.pwr.wroc.pl/takipi/|TaKIPI]] has originally been developed within this project.
 * ''Information Extraction from Polish free text'' ‒ a national [[http://www.eng.nauka.gov.pl/meinen/|Ministry of Science and Higher Education]] research grant (number 3T11C00727), 20 October 2004 ‒ 19 October 2007. Polish title: ''Opracowanie narzędzi do ekstrakcji informacji z tekstów w języku polskim''. PI: Agnieszka Mykowiecka.
 * ''[[http://korpus.pl/|The IPI PAN Corpus]] of Polish'' ‒ a national [[http://www.kbn.gov.pl/|KBN]] grant (7T11C04320), 1 April 2001 ‒ 31 March 2004. Polish title: ''Anotowany korpus pisanego języka polskiego z dostępem przez internet (z uwzględnieniem zastosowań w inżynierii lingwistycznej)''. PI: Adam Przepiórkowski.
 * ''A [[../../../CRIT2/|Treebank / Test-Suite of Polish Utterances]]'' ‒ a EU [[http://www.ipipan.waw.pl/en/research/grants-completed.html#euro|CRIT-2]] subproject (ICS-MM), 15 October 1997 ‒ 14 October 2000. Coordinator: Leonard Bolc.
 * ''An [[../../../HPSG/hpsg.html|HPSG Grammar of Polish]] (theory and [[../../../HPSG/PolishInHPSG.pl|implementation]])'' ‒ a national [[http://www.kbn.gov.pl/|KBN]] grant (8T11C01110), 1 January 1996 ‒ 31 December 1998. Polish title: ''Zastosowanie metod inżynierii lingwistycznej do automatycznej analizy i syntezy tekstów języka polskiego''. PI: Leonard Bolc.


== Publicly available tools and resources ==

Here are some of the tools and resources created within our projects.

Tools (all open source, under [[http://www.gnu.org/copyleft/gpl.html|GPL]]):

 * [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]] – a DCG parser,
 * [[http://nlp.ipipan.waw.pl/Spejd/|Spejd]] – a shallow parsing and disambiguation system,
 * [[http://nlp.pwr.wroc.pl/takipi/|TaKIPI]] – a morphosyntactic tagger for Polish,
 * [[http://code.google.com/p/pantera-tagger/|PANTERA]] – a morphosyntactic tagger for Polish,
 * [[http://poliqarp.sourceforge.net/|Poliqarp]] – a corpus indexing and search engine,
 * [[http://sourceforge.net/projects/dendrarium/|Dendrarium]] – a treebank development system (under development),
 * [[http://nlp.ipipan.waw.pl/Anotatornia/|Anotatornia]] – a system for multi-level manual annotation of corpora (forthcoming),
 * [[http://nlp.ipipan.waw.pl/WSDDE/|WSDDE]] – a system for designing and performing Word Sense Disambiguation experiments (forthcoming),
 * [[http://nlp.ipipan.waw.pl/PPJP/|etc.]]


Resources:

 * [[http://korpus.pl/|IPI PAN Corpus of Polish]],
 * [[http://nkjp.pl/index.php?page=0&lang=1|National Corpus of Polish]] (under development).


== Other activities ==


Links to some other activities of the Group:

 * [[../../../../../../seminar-e.html|NLP Seminar at IPI PAN]];
 * [[http://iis.ipipan.waw.pl/|Intelligent Information Systems]] series of conferences.
#refresh 0 http://zil.ipipan.waw.pl