plWordNet 2.0

Project factsheet

English title:

Polish title:

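Półautomatyczna konstrukcja zasobów leksykalnych przez rozpoznawanie relacji semantycznych na podstawie danych morfo-syntaktycznych i semantycznych w korpusach tekstu (literal translation: Semi-automatic construction of lexical resources by recognising semantic relations on the basis of morpho-syntactic and semantic data in text corpora)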

Project type:

A national Ministry of Science and Higher Education research grant (number N N516 068637)

Duration:

1 October 2009 ‒ 30 September 2012

Project Web page:

http://nlp.pwr.wroc.pl/projekty/slowosiec2

Principal investigator:

Maciej Piasecki

Institution:

Institute of Applied Informatics, Wrocław University of Technology

Project description

This project is a continuation of the former project (3 T11C 018 29) on the construction of plWordNet 1.0, the first publicly available wordnet for Polish. The main goal of this project is to extend and improve the distributional semantics methods and pattern-based methods developed in the former project, and to build a complex, semi-automatic system that supports linguists working on the construction of plWordNet.

The second goal is to extend plWordNet 1.0 to 70,000-80,000 lexical units (each a pair of lemma and sense number) and 45,000-55,000 synsets.

The main objective of both projects was to construct a Polish WordNet as economically as possible.

Polish WordNet is a network of lexical-semantic relations, an electronic thesaurus with a structure modelled on that of the Princeton WordNet and the wordnets constructed in the EuroWordNet project. Polish WordNet describes the meaning of a lexical unit of one or more words by placing this unit in a network of links that represent relations such as synonymy, hypernymy, meronymy, etc.
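The structure described above can be illustrated with a small Python sketch. This is not the plWordNet data model or API: the class names `LexicalUnit` and `WordNetGraph`, the synset identifiers, and the sense numbers are all hypothetical, chosen only to show lexical units as (lemma, sense number) pairs, grouped into synsets and linked by named relations.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass(frozen=True)
class LexicalUnit:
    """A lexical unit: a lemma paired with a sense number."""
    lemma: str
    sense: int


@dataclass
class WordNetGraph:
    """Toy wordnet (hypothetical model): synsets linked by named relations."""
    # synset id -> set of LexicalUnit
    synsets: dict = field(default_factory=dict)
    # (relation name, source synset id) -> set of target synset ids
    relations: dict = field(default_factory=lambda: defaultdict(set))

    def add_synset(self, synset_id, units):
        self.synsets[synset_id] = set(units)

    def relate(self, relation, source_id, target_id):
        self.relations[(relation, source_id)].add(target_id)

    def synonyms(self, unit):
        """Return the other units that share a synset with `unit`."""
        found = set()
        for units in self.synsets.values():
            if unit in units:
                found |= units - {unit}
        return found

    def hypernym_chain(self, synset_id):
        """Follow hypernymy links upwards, taking the first target at each step."""
        chain, seen, current = [], {synset_id}, synset_id
        while True:
            targets = sorted(self.relations.get(("hypernymy", current), ()))
            if not targets or targets[0] in seen:
                return chain
            current = targets[0]
            seen.add(current)
            chain.append(current)


# Illustrative data only; sense numbers are made up, not taken from plWordNet.
wn = WordNetGraph()
wn.add_synset("car", [LexicalUnit("samochód", 1), LexicalUnit("auto", 1)])
wn.add_synset("vehicle", [LexicalUnit("pojazd", 1)])
wn.add_synset("wheel", [LexicalUnit("koło", 1)])
wn.relate("hypernymy", "car", "vehicle")  # "vehicle" is a hypernym of "car"
wn.relate("meronymy", "car", "wheel")     # "wheel" is a part of "car"
```

In this sketch synonymy is represented by membership in the same synset, while hypernymy and meronymy are stored as labelled links between synsets.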

To reduce the cost of the project, Polish WordNet was built semi-automatically. Lexical relations were automatically recognized in large corpora of Polish (e.g., the IPI PAN Corpus) and suggested to linguists/lexicographers via a graphical interface.
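As a rough illustration of the "recognize automatically, then suggest to a linguist" idea, the sketch below ranks candidate related words by the cosine similarity of their context-count vectors. It is a generic distributional-similarity example, not the method or system used in the project; the function names, the window size, and the five-sentence toy corpus are assumptions made for this example.

```python
import math
from collections import Counter


def context_vectors(sentences, window=2):
    """Count, for each word, the words occurring within `window` positions of it."""
    vectors = {}
    for tokens in sentences:
        for i, word in enumerate(tokens):
            left = tokens[max(0, i - window):i]
            right = tokens[i + 1:i + 1 + window]
            vectors.setdefault(word, Counter()).update(left + right)
    return vectors


def cosine(a, b):
    """Cosine similarity of two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def suggest(word, vectors, top_n=3):
    """Return the `top_n` words most similar to `word`, for manual review."""
    scores = [(other, cosine(vectors[word], vec))
              for other, vec in vectors.items() if other != word]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)[:top_n]


# Toy corpus (hypothetical); a real system would use a large tagged corpus.
sentences = [
    "pies szczeka na podwórku".split(),
    "kot miauczy na podwórku".split(),
    "pies je karmę".split(),
    "kot je karmę".split(),
    "samochód stoi w garażu".split(),
]
vectors = context_vectors(sentences)
candidates = suggest("pies", vectors)
```

On this toy corpus, "kot" is the top candidate for "pies" because the two words share most of their contexts. In a semi-automatic workflow such a ranked list would only be a proposal; the linguist decides which relation, if any, actually holds.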