Package edu.northwestern.at.morphadorner.tools.adornedtosimpleteip5

AdornedToSimpleTEIP5 converts a base-level MorphAdorner file to a more TEI P5-like format.

See: Description

Package edu.northwestern.at.morphadorner.tools.adornedtosimpleteip5 Description

AdornedToSimpleTEIP5 converts a base-level MorphAdorner file to a more TEI P5-like format.

Usage:

java edu.northwestern.at.morphadorner.tools.adornedtosimpleteip5.adornedtosimpleteip5 outputdirectory adorned1.xml adorned2.xml ...

where outputdirectory is the output directory containing the resultant simplified adorned XML files and adorned*.xml are the input adorned XML files.

When adding adornments to a TEI XML file, MorphAdorner adds attributes with the morphological adornments to each <w> and <pc> element. These attributes are not part of the standard TEI P5 specification.

AdornedToSimpleTEIP5 removes all of the non-standard attributes except the reg= attribute. The part of speech pos= attribute is mapped to the ana=attribute, and a "#" is prefixed to the part of speech tag. The lemma lem= attribute is mapped to the lemma= attribute. These are standard TEI P5 attributes.

The non-standard reg= attribute which specifies the standard spelling is left as is.

The MorphAdorner release materials include a modified TEI P5 RNG scheme file which adds the non-standard word-level attributes so that adorned files can be verified using standard XML processing software.