Class | Description |
---|---|
AdornedToSimpleTEIP5 | Convert MorphAdorned file to simple TEI P5 format. |
AdornedToSimpleTEIP5 converts a base-level MorphAdorner file to a more TEI P5-like format.
Usage:
java edu.northwestern.at.morphadorner.tools.adornedtosimpleteip5.AdornedToSimpleTEIP5 outputdirectory adorned1.xml adorned2.xml ...
where outputdirectory is the directory that receives the simplified adorned XML files, and adorned*.xml are the input adorned XML files.
When adding adornments to a TEI XML file, MorphAdorner adds attributes with the morphological adornments to each <w> and <pc> element. These attributes are not part of the standard TEI P5 specification.
AdornedToSimpleTEIP5 removes all of the non-standard attributes except the reg= attribute. The part-of-speech attribute pos= is mapped to the ana= attribute, with a "#" prefixed to the part-of-speech tag. The lemma attribute lem= is mapped to the lemma= attribute. Both ana= and lemma= are standard TEI P5 attributes.
The non-standard reg= attribute, which specifies the standard spelling, is left as is.
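The attribute mapping described above can be illustrated with a minimal sketch. It is not the tool's actual implementation: it works on a plain map of attribute names to values rather than on an XML document, and the class name `AttributeMappingSketch`, the method `simplify`, and the set of dropped attribute names (`spe`, `tok`, `eos`) are assumptions chosen only for illustration.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;

// Illustrative sketch only; not part of MorphAdorner.
final class AttributeMappingSketch {

    // Hypothetical list of non-standard attributes to drop.
    // The real tool's list may differ.
    static final Set<String> DROPPED = Set.of("spe", "tok", "eos");

    // Returns a new attribute map with pos= mapped to ana= (prefixed
    // with "#"), lem= mapped to lemma=, reg= and any other attribute
    // not in DROPPED kept unchanged, and DROPPED attributes removed.
    static Map<String, String> simplify(Map<String, String> attrs) {
        Map<String, String> out = new LinkedHashMap<>();
        for (Map.Entry<String, String> e : attrs.entrySet()) {
            String name = e.getKey();
            String value = e.getValue();
            if (name.equals("pos")) {
                out.put("ana", "#" + value);
            } else if (name.equals("lem")) {
                out.put("lemma", value);
            } else if (!DROPPED.contains(name)) {
                out.put(name, value);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Made-up attribute values for a single <w> element.
        Map<String, String> w = new LinkedHashMap<>();
        w.put("lem", "love");
        w.put("pos", "vvz");
        w.put("reg", "loves");
        w.put("spe", "loues");
        System.out.println(simplify(w));
    }
}
```

Under these assumptions, a word carrying `lem="love" pos="vvz" reg="loves"` would come out with `lemma="love" ana="#vvz" reg="loves"`.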
The MorphAdorner release materials include a modified TEI P5 RNG schema file which adds the non-standard word-level attributes so that adorned files can be validated using standard XML processing software.