See: Description
| Class | Description |
|---|---|
| AddUnclear |
AddUnclear adds type="unclear" attribute to tokens containing character gaps.
|
| CountDividedWords |
Count words containing divider characters.
|
| ExtractSoftHyphens |
Filter hyphenated words.
|
| FindSoftHyphens |
Determine which words containing soft hyphens should actually be hyphenated.
|
| FixWordBreaks |
Fix word breaks.
|
| FixWordBreaks.WProcessor |
Process an adorned word.
|
| RemoveCruft |
Remove long s, brace-enclosed entities, superscripts, etc.
|
| SuperFixer |
SuperFixer marks "^" characters with special tags.
|
The tcp package contains utilities aimed at processing Text Creation Partnership texts.