See: Description
Class | Description |
---|---|
AddUnclear |
AddUnclear adds type="unclear" attribute to tokens containing character gaps.
|
CountDividedWords |
Count words containing divider characters.
|
ExtractSoftHyphens |
Filter hyphenated words.
|
FindSoftHyphens |
Determine which words containing soft hyphens should actually be hyphenated.
|
FixWordBreaks |
Fix word breaks.
|
FixWordBreaks.WProcessor |
Process an adorned word.
|
RemoveCruft |
Remove long s, brace-enclosed entities, superscripts, etc.
|
SuperFixer |
SuperFixer marks "^" characters with special tags.
|
The tcp package contains utilities aimed at processing Text Creation Partnership texts.