Locked History Actions

Diff for "LRT"

Differences between revisions 110 and 111
Revision 110 as of 2011-09-26 10:23:32
Size: 11874
Comment:
Revision 111 as of 2011-09-29 00:25:48
Size: 11964
Comment:
Deletions are marked like this. Additions are marked like this.
Line 83: Line 83:
 * [[http://futrega.org/etc/nazwiska.zip|List of all Polish surnames]], licence unknown, see [[http://futrega.org/etc/nazwiska.html|further information on this resource]].  * [[http://futrega.org/etc/nazwiska.zip|List of all Polish surnames]], licence unknown, see [[http://futrega.org/etc/nazwiska.html|further information on this resource]],
 * [[http://clip.ipipan.waw.pl/Gazetteer|The Polish Gazetteer]] (A. Savary, J. Piskorski).

Language Tools and Resources for Polish

This page contains a list of publicly available language tools and resources.

Parallel corpora

Spoken corpora

Translation memories

  • MyMemory, freely available multilingual TM,

  • TAUS Data, a multilingual TM from the members of TAUS Data Association.

Morphological tools and resources

Taggers

  • TaKIPI, a morphosyntactic tagger for Polish,

  • PANTERA, a morphosyntactic tagger for Polish.

Parsers, grammars, treebanks

Machine-readable dictionaries

Human-readable dictionaries

Speech analysis and synthesis tools

  • Skrybot, commercial speech recognition system (L. Pawlaczyk, P. Bosky),

  • Ivona, commercial text-to-speech system (Expressivo),

  • Acapela, text to speech demo,

  • Synteza mowy polskiej, automatic speech recognition and speech synthesis demos, with background information (K. Szklanny),

  • System syntezy mowy ciągłej (G. Demenko, S. Grocholewski),

  • Polish MBROLA database (K. Szklanny, K. Marasek),

  • SynTalk, commercial speech synthesis system (NeuroSoft),

  • PrimeSpeech, commercial speech recognition systems,

  • OrtFon, phonetic transcriber (AGH DSP),

  • ASR, automatic speech recognition system for Polish (AGH DSP),

  • Anotator, speech corpora anotator dedicated for Polish and focused on connecting existing resources (AGH DSP).

Machine translation demonstrations

Other

  • Kolokacje, a Web crawler and collocation finder (A. Buczyński),

  • WSDDE, a system for designing and performing Word Sense Disambiguation experiments (R. Młodzki et al.),

  • Frazeo, a search engine and clusterer of news in Polish (P. Pęzik),

  • Segment, a rule-based sentence tokenizer supporting SRX standard (J. Lipski; the Polish rules are available in LanguageTool project),

  • Translatica SRX sentence segmentation rules for Polish (LGPL)

  • Lakon, a system for news summarization (master's thesis by A. Dudczak),

  • SyMGIZA++, an extension of Giza++ that computes symmetric word alignment models,

  • Multiservice, a sample interface for running NLP Web services for Polish,

  • Hipisek, an experimental question answering system (M. Walas).