= Lemmatisation of Polish nominal syntactic groups (Spejd grammar) =

This is a variant of the [[http://clip.ipipan.waw.pl/LRT?action=AttachFile&do=view&target=gramatyka_Spejd_NKJP_1.0.zip|NKJP grammar]] for [[http://zil.ipipan.waw.pl/Spejd|Spejd]]. The grammar was created by Łukasz Degórski [1]; Piotr Przybyła added some corrections and improvements while adapting it for entity recognition in the Polish question answering system RAFAEL [2]. Like the NKJP grammar, it is available under the GNU GPL v3.

'''Download grammar rules''': [[attachment:rules.sr|rules.sr]]

With this grammar, Spejd can output the lemmas (base forms) of most types of nominal syntactic groups. For example, a group description may look like this (note the "base" attribute):

{{{
zapisu dźwięków zapis dźwięków
}}}

Because Spejd cannot generate new word forms, some lemmas have to be expressed in a special way. For example:

{{{
pochodzenia indyjskiego pochodzenie ADJ(indyjski,n,pos)
}}}

Here ADJ(indyjski,n,pos) stands for the adjective "indyjski" in the neuter gender and positive grade, i.e. "indyjskie". Replacing the expression with that form yields the full lemma, "pochodzenie indyjskie".

The following special expressions are used:

 * ADJ(lemma, gender, grade) for adjectives,
 * GER(lemma, negation) for gerunds,
 * PPAS(lemma, gender, negation) for passive participles,
 * PACT(lemma, gender, negation) for active participles.

== References ==

[1] Degórski, Ł. (2012). Towards the Lemmatisation of Polish Nominal Syntactic Groups Using a Shallow Grammar. Proceedings of the International Joint Conference on Security and Intelligent Information Systems (S&IIS 2011).

[2] Przybyła, P. (2015). Gathering Knowledge for Question Answering Beyond Named Entities. Proceedings of the 20th International Conference on Application of Natural Language to Information Systems (NLDB 2015).
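
== Example: resolving special expressions (sketch) ==

The special expressions described above (ADJ, GER, PPAS, PACT) have to be replaced with real word forms in a post-processing step outside Spejd. The Python sketch below shows one way this could be done. It is not part of the grammar or of Spejd. The names `resolve_base`, `toy_inflect` and `TOY_FORMS` are hypothetical, and the lookup table is only a stand-in for a real morphological generator, which this page does not specify. The sketch also assumes that expressions are not nested and that their arguments are separated by commas, as in the example on this page.

```python
import re
from typing import Callable, Tuple

# Matches the four special expressions: ADJ(...), GER(...), PPAS(...), PACT(...).
# Assumption: expressions are never nested, so the arguments contain no parentheses.
SPECIAL = re.compile(r"\b(ADJ|GER|PPAS|PACT)\(([^()]*)\)")

# An inflector takes the expression kind and its arguments and returns a word form.
Inflector = Callable[[str, Tuple[str, ...]], str]


def resolve_base(base: str, inflect: Inflector) -> str:
    """Replace every special expression in a 'base' value with an inflected form."""

    def substitute(match: re.Match) -> str:
        kind = match.group(1)
        args = tuple(arg.strip() for arg in match.group(2).split(","))
        return inflect(kind, args)

    return SPECIAL.sub(substitute, base)


# Hypothetical toy data standing in for a real morphological generator.
TOY_FORMS = {
    ("ADJ", ("indyjski", "n", "pos")): "indyjskie",
}


def toy_inflect(kind: str, args: Tuple[str, ...]) -> str:
    # Unknown forms are left as the original expression instead of being guessed.
    return TOY_FORMS.get((kind, args), f"{kind}({','.join(args)})")


print(resolve_base("pochodzenie ADJ(indyjski,n,pos)", toy_inflect))
# prints: pochodzenie indyjskie
```

Passing the inflector in as a function keeps the parsing of the expressions separate from the generation of word forms, so the toy table can be swapped for any generator that accepts the same arguments. Lemmas without special expressions, such as "zapis dźwięków", pass through unchanged.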