Diff for "LRT"

Differences between revisions 55 and 61 (spanning 6 versions)
Revision 55 as of 2011-04-20 18:56:38
Size: 8555
Revision 61 as of 2011-04-26 21:23:11
Size: 9076
Line 26: Line 26:
 * [[http://psi.amu.edu.pl/en/index.php?title=Parallel_Corpora|PSI collection of parallel corpora]], a growing collection of parallel corpora pairing Polish with other European languages.
Line 57: Line 58:
 * [[ftp://ftp.mimuw.edu.pl/pub/People/polszczyzna/Szpakowicz/|Formalny opis składniowy zdań polskich]] (S. Szpakowicz).
 * [[ftp://ftp.mimuw.edu.pl/pub/People/polszczyzna/Szpakowicz/|Formalny opis składniowy zdań polskich]] (S. Szpakowicz),
 * [[http://thetos.aei.polsl.pl/las/|Serwer LAS / Linguistic Analysis Server]].
Line 86: Line 88:
 * [[http://www.translatica.pl/|Translatica]] (EN-PL-EN),
 * [[http://www.translatica.pl/|Translatica]] (EN-PL-EN, DE-PL-DE, RU-PL-RU), see also [[http://poleng.pl|Poleng website]] with an experimental FR-PL-FR version
Line 95: Line 97:
Line 101: Line 104:
 * [[http://psi.amu.edu.pl/en/index.php?title=SyMGIZA%2B%2B|SyMGIZA++]], an extension of Giza++ that computes symmetric word alignment models.

Language Tools and Resources for Polish

This page contains a list of publicly available language tools and resources.

Parallel corpora

Translation memories

  • MyMemory, a freely available multilingual TM,

  • TAUS Data, a multilingual TM from the members of TAUS Data Association.

Morphological tools and resources

Taggers

  • TaKIPI, a morphosyntactic tagger for Polish,

  • PANTERA, a morphosyntactic tagger for Polish.

Parsers, grammars, treebanks

Machine-readable dictionaries

Human-readable dictionaries

Speech analysis and synthesis tools

Machine translation demonstrations

Other

  • Kolokacje, a Web crawler and collocation finder (A. Buczyński),

  • WSDDE, a system for designing and performing Word Sense Disambiguation experiments (R. Młodzki et al.),

  • Frazeo, a search engine and clusterer of news in Polish (P. Pęzik),

  • Segment, a rule-based sentence tokenizer supporting the SRX standard (J. Lipski; the Polish rules are available in the LanguageTool project),

  • Lakon, a system for news summarization (master's thesis by A. Dudczak),

  • SyMGIZA++, an extension of Giza++ that computes symmetric word alignment models.