NU
IT
Northwestern University Information Technology |
MorphAdorner V2.0 | Site Map |
AdornedToSketch converts one or more adorned files to the verticalized input required by the Sketch or NoSketch corpus search engines.
Usage:
adornedtosketch sketchinput.txt corpusname adorned1.xml adorned2.xml ...
where
Known flaw: AdornedToSketch does not generate the "glue" elements which bind punctuation marks to word tokens. Searching the corpus still works fine in the Sketch or NoSketch engine, but the punctuation marks are displayed detached from any token to which they would normally be attached.
The Sketch engine, and its simpler sibling the NoSketch engine, are corpus query systems based upon the thesis work of Pavel Rychl�. The engines are products of Lexical Computing Ltd., headed by computational linguist Adam Kilgarriff.
Home | |
Welcome | |
Announcements and News | |
Announcements and news about changes to MorphAdorner | |
Documentation | |
Documentation for using MorphAdorner | |
Download MorphAdorner | |
Downloading and installing the MorphAdorner client and server software | |
Glossary | |
Glossary of MorphAdorner terms | |
Helpful References | |
Natural language processing references | |
Licenses | |
Licenses for MorphAdorner and Associated Software | |
Server | |
Online examples of MorphAdorner Server facilities. | |
Talks | |
Slides from talks about MorphAdorner. | |
Tech Talk | |
Technical information for programmers using MorphAdorner |
Academic Technologies and Research Services,
NU Library 2East, 1970 Campus Drive Evanston, IL 60208. |
Contact Us.
|