Diff for "LRT"

Differences between revisions 45 and 49 (spanning 4 versions)
Revision 45 as of 2011-04-18 14:57:51
Size: 7257
Comment:
Revision 49 as of 2011-04-18 22:30:22
Size: 7227
Comment: add TM
Line 12: Line 12:
 * [[http://ifa.amu.edu.pl/~ifaconc/blog/?page_id=60|PICLE corpus]] (the Polish sub-corpus of the [[http://www.fltr.ucl.ac.be/fltr/germ/etan/cecl/Cecl-Projects/Icle/icle.htm|International Corpus of Learner English]] (ICLE)),
 * [[http://ifa.amu.edu.pl/~ifaconc/blog/?page_id=60|PICLE corpus]], the Polish sub-corpus of the [[http://www.fltr.ucl.ac.be/fltr/germ/etan/cecl/Cecl-Projects/Icle/icle.htm|International Corpus of Learner English]] (ICLE),
Line 25: Line 25:

== Translation memories ==

 * [[http://mymemory.translated.net/|MyMemory]], freely available multilingual TM,
 * [[http://www.tausdata.org/|TAUS Data]], a multilingual TM from the members of TAUS Data Association.
Line 41: Line 46:
 * [[http://code.google.com/p/pantera-tagger/|PANTERA]], a morphosyntactic tagger for Polish,
 * a prototype [[http://nlp.ipipan.waw.pl/~adamp/msc/mastalerz.radomir/CD.tgz|implementation]] of Maximum Entropy tagging created within Radomir Mastalerz's [[http://nlp.ipipan.waw.pl/~adamp/msc/mastalerz.radomir/1000-MGR-INF-97543.pdf.gz|MSc]].
 * [[http://code.google.com/p/pantera-tagger/|PANTERA]], a morphosyntactic tagger for Polish.
Line 82: Line 86:
 * [[http://nlp.ipipan.waw.pl/WSDDE/|WSDDE]] – a system for designing and performing Word Sense Disambiguation experiments (R. Młodzki ''et al.''),
 * [[http://nlp.ipipan.waw.pl/WSDDE/|WSDDE]], a system for designing and performing Word Sense Disambiguation experiments (R. Młodzki ''et al.''),

Language Tools and Resources for Polish

This page contains a list of publicly available language tools and resources.

Parallel corpora

Translation memories

  • MyMemory, a freely available multilingual TM,

  • TAUS Data, a multilingual TM from members of the TAUS Data Association.

Morphological tools and resources

Taggers

  • TaKIPI, a morphosyntactic tagger for Polish,

  • PANTERA, a morphosyntactic tagger for Polish.

Parsers, grammars, treebanks

Machine-readable dictionaries

Human-readable dictionaries

Speech analysis and synthesis tools

  • Skrybot, a commercial speech recognition system (L. Pawlaczyk, P. Bosky),

  • Ivona, a commercial text-to-speech system (Expressivo).

Machine translation demonstrations

Other

  • Kolokacje, a Web crawler and collocation finder (A. Buczyński),

  • WSDDE, a system for designing and performing Word Sense Disambiguation experiments (R. Młodzki et al.),

  • Frazeo, a search engine and clusterer of news in Polish (P. Pęzik),

  • Segment, a rule-based sentence tokenizer supporting the SRX standard (J. Lipski; the Polish rules are available in the LanguageTool project),

  • Lakon, a system for news summarization (master's thesis by A. Dudczak).