<?xml version="1.0" encoding="utf-8"?><!DOCTYPE article  PUBLIC '-//OASIS//DTD DocBook XML V4.4//EN'  'http://www.docbook.org/xml/4.4/docbookx.dtd'><article><articleinfo><title>Converters4NKJP</title><revhistory><revision><revnumber>10</revnumber><date>2011-09-28 12:08:46</date><authorinitials>LukaszDegorski</authorinitials></revision><revision><revnumber>9</revnumber><date>2011-09-28 11:46:29</date><authorinitials>LukaszDegorski</authorinitials></revision><revision><revnumber>8</revnumber><date>2011-09-28 11:45:59</date><authorinitials>LukaszDegorski</authorinitials></revision><revision><revnumber>7</revnumber><date>2011-09-28 11:43:40</date><authorinitials>LukaszDegorski</authorinitials></revision><revision><revnumber>6</revnumber><date>2011-09-28 11:36:19</date><authorinitials>LukaszDegorski</authorinitials></revision><revision><revnumber>5</revnumber><date>2011-09-28 11:14:07</date><authorinitials>LukaszDegorski</authorinitials></revision><revision><revnumber>4</revnumber><date>2011-09-26 12:25:22</date><authorinitials>AdamPrzepiorkowski</authorinitials></revision><revision><revnumber>3</revnumber><date>2011-09-26 12:15:32</date><authorinitials>AdamPrzepiorkowski</authorinitials></revision><revision><revnumber>2</revnumber><date>2011-09-26 12:11:27</date><authorinitials>AdamPrzepiorkowski</authorinitials></revision><revision><revnumber>1</revnumber><date>2011-09-26 12:10:07</date><authorinitials>AdamPrzepiorkowski</authorinitials></revision></revhistory></articleinfo><section><title>Converters for NKJP formats</title><para>This page (under construction as of the end of September 2011) collects converters from and to the <ulink url="http://nlp.ipipan.waw.pl/TEI4NKJP/">TEI4NKJP</ulink> XML format, as used in the <ulink url="http://nkjp.pl/">National Corpus of Polish</ulink>. </para><section><title>Converters from the output of Anotatornia to TEI NKJP</title><para>The format evolved during the project and the final TEI4NKJP is a little bit different than the Anotatornia (see <ulink url="http://clip.ipipan.waw.pl/Converters4NKJP/Anotatornia#">http://nlp.ipipan.waw.pl/Anotatornia/</ulink>) output. To upgrade, use the following scripts: </para><itemizedlist><listitem><para><ulink url="http://clip.ipipan.waw.pl/Converters4NKJP/Converters4NKJP?action=AttachFile&amp;do=get&amp;target=modify-tei-morphosyntax.pl">modify-tei-morphosyntax.pl</ulink> </para></listitem><listitem><para><ulink url="http://clip.ipipan.waw.pl/Converters4NKJP/Converters4NKJP?action=AttachFile&amp;do=get&amp;target=modify-tei-segmentation.pl">modify-tei-segmentation.pl</ulink> </para></listitem><listitem><para><ulink url="http://clip.ipipan.waw.pl/Converters4NKJP/Converters4NKJP?action=AttachFile&amp;do=get&amp;target=modify-tei-senses.pl">modify-tei-senses.pl</ulink> </para></listitem></itemizedlist><para>The scripts were meant to be simple. Fatal error reporting in modify-tei-morphosyntax.pl is straightforward: a line is printed to the output file, rendering the XML not well-formed. In all cases, the resulting files should be <ulink url="http://nlp.ipipan.waw.pl/TEI4NKJP/">validated</ulink>. </para></section></section></article>