Diff for "LRT"

Differences between revisions 283 and 284
Revision 283 as of 2015-04-15 08:55:47
Size: 23187
Editor: MateuszKopec
Comment:
Revision 284 as of 2015-04-15 09:23:37
Size: 23367
Editor: MateuszKopec
Comment:
Line 135: Line 135:
   * Spejd [[http://clip.ipipan.waw.pl/SpejdLemmatizingGrammar|grammar of Polish with lemmatisation of Polish nominal syntactic groups]],
Line 188: Line 189:
 * [[http://www.gzegzolka.com/poliszynel/|Poliszynel]] (P. Sawicki)
 * [[http://www.spolszcz.pl/|spolszcz.pl]] (P. Sawicki)
 * [[http://www.polszczyzna.info/polonizator|Polonizator]] (TiP)
 * [[http://slowniki.zoni.pl/?s=ogonki|Polonizer]] 
 * [[http://galaxy.eti.pg.gda.pl/katedry/kiw/pracownicy/Jan.Daciuk/personal/man/fsa_accent.1.html|fsa_accent]] (J. Daciuk)
 * [[http://wm.ite.pl/proj/pliterki/index.html|pliterki]] (W. Muła)
 * [[http://logipam.org/charlifter/en.php|Logipam]]
 * [[http://www.gzegzolka.com/poliszynel/|Poliszynel]] (P. Sawicki),
 * [[http://www.spolszcz.pl/|spolszcz.pl]] (P. Sawicki),
 * [[http://www.polszczyzna.info/polonizator|Polonizator]] (TiP),
 * [[http://slowniki.zoni.pl/?s=ogonki|Polonizer]],
 * [[http://galaxy.eti.pg.gda.pl/katedry/kiw/pracownicy/Jan.Daciuk/personal/man/fsa_accent.1.html|fsa_accent]] (J. Daciuk),
 * [[http://wm.ite.pl/proj/pliterki/index.html|pliterki]] (W. Muła),
 * [[http://logipam.org/charlifter/en.php|Logipam]].

== Named Entity Recognition ==
 * [[http://zil.ipipan.waw.pl/Nerf|Nerf]], a tool for named entity recognition, available on GNU GPL v.3,
 * [[http://nlp.pwr.wroc.pl/en/tools-and-resources/liner2|Liner2]], named entity recognizer released on GNU GPL with models to recognize 5 and 56 categories of proper names (M. Marcińczuk and M. Janicki).
Line 197: Line 202:
 * [[https://play.google.com/store/apps/details?id=com.pwr.plwordnet|Mobile plWordNet]], free mobile application for plWordNet browsing (J. Kocoń)
 * [[https://play.google.com/store/apps/details?id=com.pwr.plwordnet|Mobile plWordNet]], free mobile application for plWordNet browsing (J. Kocoń),
Line 208: Line 213:
 * [[http://zil.ipipan.waw.pl/Nerf|Nerf]], a tool for named entity recognition, available on GNU GPL v.3,
 * [[http://nlp.pwr.wroc.pl/en/tools-and-resources/liner2|Liner2]], named entity recognizer released on GNU GPL with models to recognize 5 and 56 categories of proper names (M. Marcińczuk and M. Janicki),

Language Tools and Resources for Polish

This page contains a list of publicly available language tools and resources.

Spoken corpora

Parallel corpora and translation memories

Machine-readable dictionaries

Human-readable dictionaries

Morphological tools and resources

Taggers

Parsers, grammars, treebanks

Sentiment analysis

Coreference

Speech analysis and synthesis tools

Machine translation demonstrations

Summarizers

Diacritization

Named Entity Recognition

  • Nerf, a tool for named entity recognition, available under the GNU GPL v.3,

  • Liner2, a named entity recognizer released under the GNU GPL, with models that recognize 5 and 56 categories of proper names (M. Marcińczuk and M. Janicki).

Other

  • Mobile plWordNet, free mobile application for plWordNet browsing (J. Kocoń),

  • Kolokacje, a Web crawler and collocation finder (A. Buczyński),

  • WSDDE, a system for designing and performing Word Sense Disambiguation experiments (R. Młodzki et al.),

  • Frazeo, a search engine and clusterer of news in Polish (P. Pęzik),

  • Segment, a rule-based sentence tokenizer supporting the SRX standard (J. Lipski; the Polish rules are available in the LanguageTool project; see here for short instructions on how to use the tool),

  • Toki, a tokenizer supporting the SRX standard, available as a C++ library and toolkit (T. Śniatowski and A. Radziszewski),

  • Translatica SRX sentence segmentation rules for Polish (LGPL),

  • SyMGIZA++, an extension of Giza++ that computes symmetric word alignment models,

  • Multiservice, a sample interface for running NLP Web services for Polish (see also usage and format),

  • Hipisek, an experimental question answering system (M. Walas),

  • Narzędzia dygitalizacji tekstów (text digitization tools), Poliqarp for DjVu and other programs,

  • PSI-Toolkit, a chain of publicly available tools for automatic processing of Polish,

  • Fextor, a feature extraction framework,

  • LexCSD, a system for semi-automatic sense disambiguation,

  • SuperMatrix, a general tool for lexical semantic knowledge acquisition,

  • WordnetLoom, a wordnet editor application,

  • Toposław, a tool for creating electronic inflectional dictionaries of multi-word units,

  • CorpCor, a web-based tool for correcting morphosyntactic annotation in TEI XML encoded corpora (e.g. NKJP).