Differences between revisions 16 and 17

Benchmarks

This page documents performance of various NLP systems for Polish.

Morphological analysis

: Morfeusz, Concraft/WCRFT, Spejd, Dependency Parser, TIMEX/Nerf,

POS tagging

Shallow parsing

Dependency parsing

Deep parsing

Word sense disambiguation

Named entity recognition

Sentiment analysis

Mention detection

Precision, recall and F-measure are calculated on Polish Coreference Corpus data with two alternative mention detection scores:

EXACT: score of exact boundary matches (an automatic and a manual mention match if they have exactly the same boundaries; in other words, they consist of the same tokens)
HEAD: score of head matches (we reduce the automatic and the manual mentions to their single head tokens and compare them).

System name	Short description	Main publication	License	EXACT			HEAD
System name	Short description	Main publication	License	P	R	F	P	R	F
Mention Detector	Collects mention candidates from available sources – morphosyntactical, shallow parsing, named entity and/or zero anaphora detection tools	Ogrodniczuk M., Głowińska K., Kopeć M., Savary A., Zawisławska M. Coreference in Polish: Annotation, Resolution and Evaluation, chapter 10.6. Walter De Gruyter, 2015.	CC BY 3	66.79%	67.21%	67.00%	88.29%	89.41%	88.85%

Coreference resolution

As there is still no consensus about the single best coreference resolution metrics, CoNLL measure is used (average of MUC, B3 and CEAFE F-measure values). For end-to-end systems CoNLL-2011 shared task-based approach is used, so two result calculation strategies are presented:

INTERSECT: consider only correct system mentions (i.e. the intersection between gold and system mentions)
TRANSFORM: unify system and gold mention sets using the following procedure for twinless mentions (without a corresponding mention in the second set):
1. insert twinless gold mentions into system mention set as singletons
2. remove twinless singleton system mentions
3. insert twinless non-singletion system mentions into gold set as singletons.

The results are produced on Polish Coreference Corpus data.

System name	Short description	Main publication	License	Gold	EXACT INTERSECT	EXACT TRANSFORM	HEAD INTERSECT	HEAD TRANSFORM
Ruler	Rule-based	Ogrodniczuk M., Kopeć M. 'End-to-end coreference resolution baseline system for Polish'. In Z. Vetulani (ed.), Proceedings of the 5th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pp. 167–171, Poznań, Poland, 2011.	CC BY 3
Bartek	Statistical	Kopeć M., Ogrodniczuk M. 'Creating a Coreference Resolution System for Polish'. In Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012, pp. 192–195, ELRA.	CC BY 3

-  ⇤ ← Revision 16 as of 2016-10-20 09:15:06 → 
  Size: 2537
  Editor: MaciejOgrodniczuk
  Comment:
+   ← Revision 17 as of 2016-10-20 22:57:50 → ⇥
  Size: 3492
  Editor: MaciejOgrodniczuk
  Comment:
-Deletions are marked like this.
+Additions are marked like this.
 Line 6:
+ : Morfeusz, Concraft/WCRFT, Spejd, Dependency Parser, TIMEX/Nerf,
-Line 39:
+Line 44:
-[[http://clip.ipipan.waw.pl/PCC|Polish Coreference Corpus]]
+As there is still no consensus about the single best coreference resolution metrics, CoNLL measure is used (average of MUC, B3 and CEAFE F-measure values). For end-to-end systems CoNLL-2011 shared task-based approach is used, so two result calculation strategies are presented:
 * INTERSECT: consider only correct system mentions (i.e. the intersection between gold and system mentions)
 * TRANSFORM: unify system and gold mention sets using the following procedure for twinless mentions (without a corresponding mention in the second set):
  1. insert twinless gold mentions into system mention set as singletons
  1. remove twinless singleton system mentions
  1. insert twinless non-singletion system mentions into gold set as singletons.
-Line 41:
+Line 51:
-|| '''System name'''                          || '''Short description''' || '''Main publication''' || '''License''' || '''P''' || '''R''' || '''F''' ||
|| [[http://zil.ipipan.waw.pl/Ruler|Ruler]]   || Rule-based              || Ogrodniczuk M., Kopeć M. 'End-to-end coreference resolution baseline system for Polish'. In Z. Vetulani (ed.), Proceedings of the 5th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pp. 167–171, Poznań, Poland, 2011.  || CC BY 3 ||         ||         ||         ||
|| [[http://zil.ipipan.waw.pl/Bartek|Bartek]] || Statistical             || Kopeć M., Ogrodniczuk M. 'Creating a Coreference Resolution System for Polish'. In Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012, pp. 192–195, ELRA. || CC BY 3 ||         ||         ||         ||
+The results are produced on [[http://clip.ipipan.waw.pl/PCC|Polish Coreference Corpus]] data.

|| '''System name'''                          || '''Short description''' || '''Main publication''' || '''License''' || '''Gold''' || '''EXACT INTERSECT''' || '''EXACT TRANSFORM''' || '''HEAD INTERSECT''' || '''HEAD TRANSFORM''' ||
|| [[http://zil.ipipan.waw.pl/Ruler|Ruler]]   || Rule-based              || Ogrodniczuk M., Kopeć M. 'End-to-end coreference resolution baseline system for Polish'. In Z. Vetulani (ed.), Proceedings of the 5th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pp. 167–171, Poznań, Poland, 2011.  || CC BY 3 ||         ||         ||         ||  ||  ||
|| [[http://zil.ipipan.waw.pl/Bartek|Bartek]] || Statistical             || Kopeć M., Ogrodniczuk M. 'Creating a Coreference Resolution System for Polish'. In Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012, pp. 192–195, ELRA. || CC BY 3 ||         ||         ||         ||  ||  ||

Diff for "benchmarks"

Menu

Wiki