Size: 104
Comment:
|
Size: 2594
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 3: | Line 3: |
Here we will document accuracy of various linguistic engineering systems for Polish. | This page documents performance of various NLP systems for Polish. == Morphological analysis == == POS tagging == == Shallow parsing == == Dependency parsing == == Deep parsing == == Word sense disambiguation == == Named entity recognition == == Sentiment analysis == == Mention detection == === Test set === Precision, recall and F-measure is calculated on [[http://clip.ipipan.waw.pl/PCC|Polish Coreference Corpus]] data with two alternative mention detection scores: * EXACT: score of exact boundary matches (an automatic and a manual mention match if they have exactly the same boundaries; in other words, they consist of the same tokens) * HEAD: score of head matches (we reduce the automatic and the manual mentions to their single head tokens and compare them). === Results === ||<|2> '''System name''' ||<|2> '''Short description''' ||<|2> '''Main publication''' ||<|2> '''License''' |||||| '''EXACT''' |||||| '''HEAD''' || ||'''P''' || '''R''' || '''F''' ||'''P''' || '''R''' || '''F''' || || [[http://zil.ipipan.waw.pl/MentionDetector|Mention Detector]] || Collects mention candidates from available sources – morphosyntactical, shallow parsing, named entity and/or zero anaphora detection tools || Ogrodniczuk M., Głowińska K., Kopeć M., Savary A., Zawisławska M. 'Coreference in Polish: Annotation, Resolution and Evaluation', chapter 10.6. Walter De Gruyter, 2015. || CC BY 3 || 66.79% || 67.21% || 67.00% || 88.29% || 89.41% || 88.85% || == Coreference resolution == === Test set === [[http://clip.ipipan.waw.pl/PCC|Polish Coreference Corpus]] === Results === || '''System name''' || '''Short description''' || '''Main publication''' || '''License''' || '''P''' || '''R''' || '''F''' || || [[http://zil.ipipan.waw.pl/Ruler|Ruler]] || Rule-based || Ogrodniczuk M., Kopeć M. 'End-to-end coreference resolution baseline system for Polish'. In Z. Vetulani (ed.), Proceedings of the 5th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pp. 167–171, Poznań, Poland, 2011. || CC BY 3 || || || || || [[http://zil.ipipan.waw.pl/Bartek|Bartek]] || Statistical || Kopeć M., Ogrodniczuk M. 'Creating a Coreference Resolution System for Polish'. In Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012, pp. 192–195, ELRA. || CC BY 3 || || || || == Summarization == |
Benchmarks
This page documents performance of various NLP systems for Polish.
Morphological analysis
POS tagging
Shallow parsing
Dependency parsing
Deep parsing
Word sense disambiguation
Named entity recognition
Sentiment analysis
Mention detection
Test set
Precision, recall and F-measure is calculated on Polish Coreference Corpus data with two alternative mention detection scores:
- EXACT: score of exact boundary matches (an automatic and a manual mention match if they have exactly the same boundaries; in other words, they consist of the same tokens)
- HEAD: score of head matches (we reduce the automatic and the manual mentions to their single head tokens and compare them).
Results
System name |
Short description |
Main publication |
License |
EXACT |
HEAD |
||||
P |
R |
F |
P |
R |
F |
||||
Collects mention candidates from available sources – morphosyntactical, shallow parsing, named entity and/or zero anaphora detection tools |
Ogrodniczuk M., Głowińska K., Kopeć M., Savary A., Zawisławska M. 'Coreference in Polish: Annotation, Resolution and Evaluation', chapter 10.6. Walter De Gruyter, 2015. |
CC BY 3 |
66.79% |
67.21% |
67.00% |
88.29% |
89.41% |
88.85% |
Coreference resolution
Test set
Results
System name |
Short description |
Main publication |
License |
P |
R |
F |
Rule-based |
Ogrodniczuk M., Kopeć M. 'End-to-end coreference resolution baseline system for Polish'. In Z. Vetulani (ed.), Proceedings of the 5th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pp. 167–171, Poznań, Poland, 2011. |
CC BY 3 |
|
|
|
|
Statistical |
Kopeć M., Ogrodniczuk M. 'Creating a Coreference Resolution System for Polish'. In Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012, pp. 192–195, ELRA. |
CC BY 3 |
|
|
|