Locked History Actions

Diff for "plWordNet2"

Differences between revisions 1 and 2
Revision 1 as of 2012-05-14 12:34:14
Size: 1513
Comment:
Revision 2 as of 2012-05-20 23:53:24
Size: 2065
Comment:
Deletions are marked like this. Additions are marked like this.
Line 15: Line 15:
The main objective of the project was to construct a Polish !WordNet as economically as possible. This project is continuation of the former project (3 T11C 018 29) on the construction of plWordNet 1.0 -- the first publicly available wordnet for Polish.
The main goal of this project is to extend and improve Distributional Semantics methods and pattern-based methods developed in the former project and to build a complex, semi-automatic system supporting linguists working on plWordNet construction.

The second goal is to extend plWordNet 1.0 to the size of 70000-80000 lexical units (pairs: lemma, sense number) and 45000-55000 synsets.

The main objective of both projects was to construct a Polish !WordNet as economically as possible.

plWordNet 2.0

Project factsheet

English title:

Polish title:

Półautomatyczna konstrukcja zasobów leksykalnych przez rozpoznawanie relacji semantycznych na podstawie danych morfo-syntaktycznych i semantycznych w korpusach tekstu

Project type:

A national Ministry of Science and Higher Education research grant (number N N516 068637)

Duration:

1 October 2009 ‒ 30 September 2012

Project Web page:

http://nlp.pwr.wroc.pl/projekty/slowosiec2

Principal investigator:

Maciej Piasecki

Institution:

Institute of Applied Informatics, Wrocław University of Technology

Project description

This project is continuation of the former project (3 T11C 018 29) on the construction of plWordNet 1.0 -- the first publicly available wordnet for Polish. The main goal of this project is to extend and improve Distributional Semantics methods and pattern-based methods developed in the former project and to build a complex, semi-automatic system supporting linguists working on plWordNet construction.

The second goal is to extend plWordNet 1.0 to the size of 70000-80000 lexical units (pairs: lemma, sense number) and 45000-55000 synsets.

The main objective of both projects was to construct a Polish WordNet as economically as possible.

Polish WordNet is a network of lexical-semantic relations, an electronic thesaurus with a structure modelled on that of the Princeton WordNet and those constructed in the EuroWordNet project. Polish WordNet describes the meaning of a lexical unit of one or more words by placing this unit in a network of links which represent such relations as synonymy, hypernyny, meronymy etc.

To reduce the cost of the project, Polish WordNet was built semi-automatically. Lexical relations were automatically recognized in large corpora of Polish, e.g., IPI PAN Corpus) and suggested to linguists/lexicographers via a graphical interface.