Language Tools and Resources for Polish
Written corpora and corpus-related tools
National Corpus of Polish (under development),
PICLE corpus (the Polish sub-corpus of the International Corpus of Learner English, ICLE),
Poliqarp – a corpus indexing and search engine,
Anotatornia – a system for multi-level manual annotation of corpora.
Parallel corpora
OPUS – an open-source parallel corpus (European Parliament, EMEA, KDE, movie subtitles).
Morphological tools and resources
Morfeusz SGJP – morphological analyser (Z. Saloni, W. Gruszczyński, M. Woliński, R. Wołosz),
Morfologik – morphological analyser (M. Miłkowski),
http://sgalus.republika.pl/indexe.html – lexical analyser and a Polish proof-reader (S. Galus),
Neurosoft Gram (demo of a morphological analyser),
Finite state utilities (J. Daciuk),
Stemming engine for Polish (D. Weiss),
Stempel, another stemmer (A. Białecki).
Taggers
Parsers, grammars, treebanks
Świgra – a DCG parser,
Spejd – a shallow parsing and disambiguation system,
Dendrarium – a treebank development system (under development).
Machine translation demonstrations
Translatica (EN-PL-EN),
InterTran (multilingual),
LingvoBit (EN-PL-EN),
Systran (EN-PL, PL-FR and some more).
Other
plWordNet, Polish WordNet (M. Piasecki),
Kolokacje, a Web crawler and collocation finder (A. Buczyński),
WSDDE – a system for designing and performing Word Sense Disambiguation experiments (forthcoming).