public class MergeWordLists
extends java.lang.Object
Usage:
java edu.northwestern.at.morphadorner.tools.mergewordlists.MergeWordLists output.txt input.txt input2.txt ...
output.txt -- output merged word list file.
input*.txt -- input text files containing word lists to be merged.
The output file is a utf-8 text file containing the merged word list from the input files. Only one copy of a word is output if it appears multiple times. The merged words appear in ascending alphanumeric order in the output file.
| Modifier and Type | Field and Description |
|---|---|
protected static java.util.Set<java.lang.String> |
mergedWordSet
Merged word list.
|
| Modifier | Constructor and Description |
|---|---|
protected |
MergeWordLists()
Allow overrides but not instantiation.
|
| Modifier and Type | Method and Description |
|---|---|
protected static void |
loadAndMergeWords(java.lang.String inputFileName)
Merge word lists from a file.
|
static void |
main(java.lang.String[] args)
Main program.
|
protected static void |
saveMergedWords(java.lang.String outputFileName)
Save the merged word lists.
|
protected static java.util.Set<java.lang.String> mergedWordSet
public static void main(java.lang.String[] args)
protected static void loadAndMergeWords(java.lang.String inputFileName)
throws java.lang.Exception
java.lang.Exceptionprotected static void saveMergedWords(java.lang.String outputFileName)
throws java.lang.Exception
java.lang.Exception