Package edu.northwestern.at.morphadorner.tools.adornedtosketch

AdornedToSketch converts one or more adorned files to the verticalized input required by the Sketch or NoSketch corpus search engines.

See: Description

Package edu.northwestern.at.morphadorner.tools.adornedtosketch Description

AdornedToSketch converts one or more adorned files to the verticalized input required by the Sketch or NoSketch corpus search engines.

Usage:

adornedtosketch sketchinput.txt corpusname adorned1.xml adorned2.xml ...

where

Known flaw: AdornedToSketch does not generate the "glue" elements which bind punctuation marks to word tokens. Searching the corpus still works fine in the Sketch or NoSketch engine, but the punctuation marks are displayed detached from any token to which they would normally be attached.