
Diff for "psi-toolkit"

Differences between revisions 2 and 5 (spanning 3 versions)
Revision 2 as of 2011-09-23 12:20:51
Size: 724
Comment:
Revision 5 as of 2011-09-23 13:31:19
Size: 1342
Comment:
Line 1: heading changed from "= Treebank project =" to "= psi-toolkit ="

psi-toolkit

Project factsheet

English name:

Publicly available tools for automatic processing of Polish language

Polish name:

Narzędzia do automatycznego przetwarzania języka polskiego udostępnione publicznie

Project type:

A national Ministry of Science and Higher Education research grant (number N N516 480540)

Duration:

2011 ‒ 2013

Principal investigator:

Krzysztof Jassem

Institution:

Information Systems Laboratory, Faculty of Mathematics and Computer Science, Adam Mickiewicz University

Project description

The aim of the project is to develop a tool chain (called psi-toolkit) for the automatic processing of Polish (and, to a lesser extent, other languages: English, German, French, Spanish and Russian), with a focus on machine translation. The tool chain will include:

  • segmentation/tokenization/lemmatization,
  • shallow parsing,
  • deep parsing,
  • rule-based machine translation,
  • statistical machine translation,
  • automatic generation of inflected forms from lemma sequences,
  • automatic post-editing.

All tools will be publicly available under the LGPL licence.