Northwestern University Information Technology | MorphAdorner V2.0
FindTeiTextLanguage determines the language(s) in which a TEI text is written.
Usage:
findteitextlanguage output.tab input1.xml input2.xml ...
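where output.tab is the output file to write and input1.xml, input2.xml, ... are the TEI XML files to examine.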
The output file is a tab-delimited UTF-8 text file. For each input text it reports up to three recognized languages, each with a recognizer score.
For texts with fewer than three recognizable languages, the missing language names are left blank and their scores are set to zero.
Language recognizer scores range from 0.0 (not a match) to 1.0 (perfect match). Non-negligible scores for the second and third languages indicate potential problems for processing, unless the words in the secondary language are marked up in the TEI document.
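As a rough illustration of how the output might be screened for mixed-language texts, here is a minimal Python sketch. It rests on several assumptions that are not confirmed by this page: that each record holds the text name followed by three (language, score) pairs in that order, that there is no header row, and that 0.1 is a sensible cutoff for "non-negligible". The function names and the sample records are hypothetical and are not actual FindTeiTextLanguage output.

```python
import csv
import io

# ASSUMED column layout (not confirmed by the documentation):
# text name, then (language, score) for the first, second and third languages.
THRESHOLD = 0.1  # hypothetical cutoff for a "non-negligible" score


def read_rows(stream):
    """Read tab-delimited records from an open text stream."""
    return list(csv.reader(stream, delimiter="\t", quoting=csv.QUOTE_NONE))


def flag_mixed_language(rows, threshold=THRESHOLD):
    """Return (name, [(language, score), ...]) for texts whose second or
    third language scores at or above the threshold."""
    flagged = []
    for row in rows:
        if len(row) < 7:
            continue  # skip short or malformed records
        name = row[0]
        secondary = []
        for i in (3, 5):  # assumed positions of the 2nd and 3rd language names
            language = row[i]
            score = float(row[i + 1] or 0)
            # A blank name with a score of zero means no such language was found.
            if language and score >= threshold:
                secondary.append((language, score))
        if secondary:
            flagged.append((name, secondary))
    return flagged


# Made-up sample records, for illustration only.
sample = io.StringIO(
    "a.xml\tenglish\t0.95\t\t0\t\t0\n"
    "b.xml\tenglish\t0.70\tlatin\t0.25\tfrench\t0.02\n"
)
for name, secondary in flag_mixed_language(read_rows(sample)):
    print(name, secondary)
```

With the sample records above, only b.xml is reported, because its second language scores above the assumed cutoff. For a real output file, the records could be read with open("output.tab", encoding="utf-8", newline="") in place of the in-memory sample, after adjusting the column positions to the actual field order.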
Academic Technologies and Research Services,
NU Library 2East, 1970 Campus Drive, Evanston, IL 60208.