NU
IT
Northwestern University Information Technology |
MorphAdorner V2.0 | Site Map |
ExtractTEIText applies an XSL transformation to an input TEI XML file to extract the text from the body of the file.
Usage:
extractteitext input.xml output.xml
where
input.xml | The input TEI XML file. |
output.txt | The output file containing the text extracted from the input TEI file. |
The XSLT transformation used to extract the text is defined in the tei2text.xsl file in the xslt directory of the MorphAdorner release. This transformation works well for unadorned TEI files, not so well for adorned files. You can use the Unadorn utility to unadorn an adorned file before extracting the text.
Home | |
Welcome | |
Announcements and News | |
Announcements and news about changes to MorphAdorner | |
Documentation | |
Documentation for using MorphAdorner | |
Download MorphAdorner | |
Downloading and installing the MorphAdorner client and server software | |
Glossary | |
Glossary of MorphAdorner terms | |
Helpful References | |
Natural language processing references | |
Licenses | |
Licenses for MorphAdorner and Associated Software | |
Server | |
Online examples of MorphAdorner Server facilities. | |
Talks | |
Slides from talks about MorphAdorner. | |
Tech Talk | |
Technical information for programmers using MorphAdorner |
Academic Technologies and Research Services,
NU Library 2East, 1970 Campus Drive Evanston, IL 60208. |
Contact Us.
|