
Diff for "LRT"

Differences between revisions 228 and 230 (spanning 2 versions)
Revision 228 as of 2013-06-08 20:01:15
Size: 20672
Comment:
Revision 230 as of 2013-06-08 20:08:16
Size: 20538
Comment: Removed an outdated link: * [[http://www.ispan.waw.pl/zakjez/pracjcz/slowniki/slowniki.html|Słowniki składniowe języka czeskiego i polskiego]] (Z. Greń),
Line 64: Line 64:
== Machine-readable dictionaries ==
 * [[http://plwordnet.pwr.wroc.pl/wordnet|plWordNet, Polish WordNet]] (M. Piasecki),
 * [[http://synonimy.ux.pl/|Polish OpenThesaurus]], a crowdsourced Polish dictionary of synonyms (M. Miłkowski),
 * [[http://www.sjp.pl/|Słownik języka polskiego (d. alternatywny)]], Polish ispell dictionaries, along with some definitions and online form display.
 * [[http://nl.ijs.si/ME/V4/doc/index.html#sec-lex|Sample morphosyntactic Polish lexicon]], the MULTEXT-East morphosyntactic lexicons,
 * [[http://www.dsp.agh.edu.pl/doku.php?id=pl:resources:ngram|N-gram model of Polish]] (AGH DSP),
 * [[Nowy_slownik_angielsko-polski|Nowy słownik angielsko-polski]] (T. Piotrowski, Z. Saloni),
 * [[http://zil.ipipan.waw.pl/OpenCYCPL|Polish OpenCYC]] (A. Pohl),
 * [[http://www.slowniki.org.pl/pol.html|Polish machine-generated dictionaries]], available under a Creative Commons licence (J. Kazojć),
 * [[http://futrega.org/etc/nazwiska.zip|List of all Polish surnames]], licence unknown, see [[http://futrega.org/etc/nazwiska.html|further information on this resource]],
 * [[http://clip.ipipan.waw.pl/Gazetteer|Gazetteer for Polish Named Entities]] (A. Savary, M. Lenart, J. Piskorski),
 * [[http://zil.ipipan.waw.pl/PNET|Triggers for Polish Named Entities]] (M. Baron, L. Manicki, A. Savary),
 * [[http://nlp.pwr.wroc.pl/en/tools-and-resources/nelexicon|NELexicon]], containing more than 1.4 million proper names (M. Marcińczuk, A. Musiał, M. Janicki),
 * [[http://clip.ipipan.waw.pl/Walenty|Walenty]], the Polish Valence Dictionary (F. Skwarski, M. Świdziński, W. Kieraś, E. Hajnicz, A. Patejuk, A. Przepiórkowski, M. Woliński),
 * [[http://zil.ipipan.waw.pl/SGDPV|Syntactic-generative dictionary of Polish verbs]] (K. Polański),
 * [[http://zil.ipipan.waw.pl/SAWA|SAWA]], the Grammatical Lexicon of Warsaw Urban Proper Names (M. Marciniak, C. Heliasz, J. Rabiega-Wiśniewska, P. Sikora, M. Woliński, A. Savary),
 * [[http://zil.ipipan.waw.pl/SEJF|SEJF]], the Grammatical Lexicon of Polish Phraseology (M. Czerepowicka, A. Savary),
 * [[http://zil.ipipan.waw.pl/SEJFEK|SEJFEK]], the Grammatical Lexicon of Polish Economical Phraseology (F. Makowiecki, A. Savary),
 * [[http://zil.ipipan.waw.pl/WikiTopoPl|WikiTopoPl]], a multilingual lexicon of 155,000 Polish geographical proper names extracted from Wikipedia and their equivalents in Bulgarian, Croatian, English, German, modern Greek, Hungarian, Romanian, Serbian and Slovak (L. Manicki).
 * [[http://zil.ipipan.waw.pl/Prolexbase|Prolexbase 2.0]], a multilingual relational dictionary of proper names in Polish, English and French (M. Baron, B. Bouchou-Markhoff, L. Manicki, D. Maurel, A. Savary, M. Tran).

== Human-readable dictionaries ==
 * [[http://www.wsjp.pl/|Wielki Słownik Języka Polskiego]],
 * [[http://doroszewski.pwn.pl|Słownik języka polskiego PAN pod red. W. Doroszewskiego]],
 * [[http://pl.wiktionary.org|Wikisłownik]],
 * [[http://www.slownik-online.pl/index.php|Słownik wyrazów obcych i zwrotów obcojęzycznych Władysława Kopalińskiego]],
 * [[http://leksykony.interia.pl/synonim|Słownik synonimów i antonimów Piotra Żmigrodzkiego]],
 * [[http://poliqarp.wbl.klf.uw.edu.pl/slownik-warszawski/| Poliqarp for DjVu search engine]] for J. Karłowicz, A. Kryński, W. Niedźwiedzki. Dictionary of Polish. Warsaw 1900–1927,
 * [[http://poliqarp.wbl.klf.uw.edu.pl/slownik-polszczyzny-xvi-wieku/| Poliqarp for DjVu search engine]] for S. Bąk, M. R. Mayenowa, F. Pepłowski (eds.). Dictionary of 16th-century Polish. Wrocław–Warszawa, 1966-???? (work in progress),
 * [[http://poliqarp.wbl.klf.uw.edu.pl/slownik-lindego/| Poliqarp for DjVu search engine]] for M. Samuel Bogumił Linde. Dictionary of Polish (2nd edition). Lwów 1854-1861,
 * [[http://poliqarp.wbl.klf.uw.edu.pl/slownik-geograficzny/| Poliqarp for DjVu search engine]] for B. Chlebowski, F. Sulimierski, W. Walewski (eds.), The Geographical Dictionary of the Polish Kingdom and other Slavic Countries, Warszawa 1880-1902,
 * [[http://eswil.ijp-pan.krakow.pl/|Edycja elektroniczna Słownika wileńskiego]],
 * [[http://pelcra.pl/hask|PELCRA HASK Collocation Dictionaries]].
Line 109: Line 143:

== Machine-readable dictionaries ==
 * [[http://plwordnet.pwr.wroc.pl/wordnet|plWordNet, Polish WordNet]] (M. Piasecki),
 * [[http://synonimy.ux.pl/|Polish OpenThesaurus]], a crowdsourced Polish dictionary of synonyms (M. Miłkowski),
 * [[http://www.sjp.pl/|Słownik języka polskiego (d. alternatywny)]], Polish ispell dictionaries, along with some definitions and online form display.
 * [[http://nl.ijs.si/ME/V4/doc/index.html#sec-lex|Sample morphosyntactic Polish lexicon]], the MULTEXT-East morphosyntactic lexicons,
 * [[http://www.ispan.waw.pl/zakjez/pracjcz/slowniki/slowniki.html|Słowniki składniowe języka czeskiego i polskiego]] (Z. Greń),
 * [[http://www.dsp.agh.edu.pl/doku.php?id=pl:resources:ngram|N-gram model of Polish]] (AGH DSP),
 * [[Nowy_slownik_angielsko-polski|Nowy słownik angielsko-polski]] (T. Piotrowski, Z. Saloni),
 * [[http://zil.ipipan.waw.pl/OpenCYCPL|Polish OpenCYC]] (A. Pohl),
 * [[http://www.slowniki.org.pl/pol.html|Polish machine-generated dictionaries]], available under a Creative Commons licence (J. Kazojć),
 * [[http://futrega.org/etc/nazwiska.zip|List of all Polish surnames]], licence unknown, see [[http://futrega.org/etc/nazwiska.html|further information on this resource]],
 * [[http://clip.ipipan.waw.pl/Gazetteer|Gazetteer for Polish Named Entities]] (A. Savary, M. Lenart, J. Piskorski),
 * [[http://zil.ipipan.waw.pl/PNET|Triggers for Polish Named Entities]] (M. Baron, L. Manicki, A. Savary),
 * [[http://nlp.pwr.wroc.pl/en/tools-and-resources/nelexicon|NELexicon]], containing more than 1.4 million proper names (M. Marcińczuk, A. Musiał, M. Janicki),
 * [[http://clip.ipipan.waw.pl/Walenty|Walenty]], the Polish Valence Dictionary (F. Skwarski, M. Świdziński, W. Kieraś, E. Hajnicz, A. Patejuk, A. Przepiórkowski, M. Woliński),
 * [[http://zil.ipipan.waw.pl/SGDPV|Syntactic-generative dictionary of Polish verbs]] (K. Polański),
 * [[http://zil.ipipan.waw.pl/SAWA|SAWA]], the Grammatical Lexicon of Warsaw Urban Proper Names (M. Marciniak, C. Heliasz, J. Rabiega-Wiśniewska, P. Sikora, M. Woliński, A. Savary),
 * [[http://zil.ipipan.waw.pl/SEJF|SEJF]], the Grammatical Lexicon of Polish Phraseology (M. Czerepowicka, A. Savary),
 * [[http://zil.ipipan.waw.pl/SEJFEK|SEJFEK]], the Grammatical Lexicon of Polish Economical Phraseology (F. Makowiecki, A. Savary),
 * [[http://zil.ipipan.waw.pl/WikiTopoPl|WikiTopoPl]], a multilingual lexicon of 155,000 Polish geographical proper names extracted from Wikipedia and their equivalents in Bulgarian, Croatian, English, German, modern Greek, Hungarian, Romanian, Serbian and Slovak (L. Manicki).
 * [[http://zil.ipipan.waw.pl/Prolexbase|Prolexbase 2.0]], a multilingual relational dictionary of proper names in Polish, English and French (M. Baron, B. Bouchou-Markhoff, L. Manicki, D. Maurel, A. Savary, M. Tran).

== Human-readable dictionaries ==
 * [[http://www.wsjp.pl/|Wielki Słownik Języka Polskiego]],
 * [[http://doroszewski.pwn.pl|Słownik języka polskiego PAN pod red. W. Doroszewskiego]],
 * [[http://pl.wiktionary.org|Wikisłownik]],
 * [[http://www.slownik-online.pl/index.php|Słownik wyrazów obcych i zwrotów obcojęzycznych Władysława Kopalińskiego]],
 * [[http://leksykony.interia.pl/synonim|Słownik synonimów i antonimów Piotra Żmigrodzkiego]],
 * [[http://poliqarp.wbl.klf.uw.edu.pl/slownik-warszawski/| Poliqarp for DjVu search engine]] for J. Karłowicz, A. Kryński, W. Niedźwiedzki. Dictionary of Polish. Warsaw 1900–1927,
 * [[http://poliqarp.wbl.klf.uw.edu.pl/slownik-polszczyzny-xvi-wieku/| Poliqarp for DjVu search engine]] for S. Bąk, M. R. Mayenowa, F. Pepłowski (eds.). Dictionary of 16th-century Polish. Wrocław–Warszawa, 1966-???? (work in progress),
 * [[http://poliqarp.wbl.klf.uw.edu.pl/slownik-lindego/| Poliqarp for DjVu search engine]] for M. Samuel Bogumił Linde. Dictionary of Polish (2nd edition). Lwów 1854-1861,
 * [[http://poliqarp.wbl.klf.uw.edu.pl/slownik-geograficzny/| Poliqarp for DjVu search engine]] for B. Chlebowski, F. Sulimierski, W. Walewski (eds.), The Geographical Dictionary of the Polish Kingdom and other Slavic Countries, Warszawa 1880-1902,
 * [[http://eswil.ijp-pan.krakow.pl/|Edycja elektroniczna Słownika wileńskiego]],
 * [[http://pelcra.pl/hask|PELCRA HASK Collocation Dictionaries]].

Language Tools and Resources for Polish

This page contains a list of publicly available language tools and resources.

Spoken corpora

Parallel corpora and translation memories

Machine-readable dictionaries

Human-readable dictionaries

Morphological tools and resources

Taggers

  • TaKIPI, a morphosyntactic tagger for Polish (Decision Trees),

  • PANTERA, a morphosyntactic tagger for Polish (Transformation-Based Learning),

  • WMBT, a morphosyntactic tagger for Polish (Memory-Based Learning),

  • TaCo, a statistical morphosyntactic tagset converter for positional tagsets (e.g. Polish),

  • WCRFT, a morphosyntactic tagger for Polish (Conditional Random Fields),

  • Concraft, a morphosyntactic disambiguation tool for Polish (Constrained Conditional Random Fields),

  • NKJP model for TnT Tagger, a trained model usable on Morfeusz-segmented text with TnT Tagger.

Parsers, grammars, treebanks

Speech analysis and synthesis tools

Machine translation demonstrations

Other

  • Kolokacje, a Web crawler and collocation finder (A. Buczyński),

  • WSDDE, a system for designing and performing Word Sense Disambiguation experiments (R. Młodzki et al.),

  • Frazeo, a search engine and clusterer of news in Polish (P. Pęzik),

  • Segment, a rule-based sentence tokenizer supporting the SRX standard (J. Lipski; the Polish rules are available in the LanguageTool project, see here for short instructions on how to use the tool),

  • Toki, a tokenizer supporting the SRX standard, available as a C++ library and toolkit (T. Śniatowski and A. Radziszewski)

  • Translatica SRX sentence segmentation rules for Polish (LGPL)

  • Lakon, a system for news summarization (master's thesis by A. Dudczak),

  • SyMGIZA++, an extension of Giza++ that computes symmetric word alignment models,

  • Multiservice, a sample interface for running NLP Web services for Polish,

  • Hipisek, an experimental question answering system (M. Walas),

  • Text digitization tools: Poliqarp for DjVu and other programs,

  • Nerf, a tool for named entity recognition, available under the GNU GPL v3.

  • Liner2, a named entity recognizer released under the GNU GPL, with models that recognize 5 and 56 categories of proper names (M. Marcińczuk and M. Janicki).

  • PSI-Toolkit, a chain of publicly available tools for automatic processing of Polish.

  • Fextor, a feature extraction framework.

  • LexCSD, a system for semi-automatic sense disambiguation.

  • SuperMatrix, a general tool for lexical semantic knowledge acquisition

  • WordnetLoom, a wordnet editor application.

  • Toposław, a tool for creating electronic inflectional dictionaries of multi-word units.

  • CorpCor, a web-based tool for correcting morphosyntactic annotation in TEI XML encoded corpora (e.g. NCP).

  • Polish Coreference Tools, a suite of Polish coreference resolution tools, created as part of the CORE project.