Package adams.flow.transformer.tokenizer

Class Summary

Class                 Description
AbstractTokenizer     Ancestor for classes that split strings into words ("tokenize").
StanfordPTBTokenizer  Uses Stanford's PTBTokenizer.
                      For more details on the options see:
                      http://nlp.stanford.edu/software/tokenizer.shtml
Twokenize             Uses TweetNLP's Twokenize.
                      For more details on the tokenizer see:
                      https://github.com/brendano/ark-tweet-nlp/blob/master/src/cmu/arktweetnlp/Twokenize.java
WekaTokenizer         Uses the specified Weka tokenizer.
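
To make the term "tokenize" concrete, the following is a minimal sketch of splitting a string into words using only the Java standard library. The class name WhitespaceTokenizerSketch and its tokenize method are hypothetical and exist only for illustration; they are not part of this package, and the classes listed above may expose a different interface and apply more elaborate rules.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of adams.flow.transformer.tokenizer.
class WhitespaceTokenizerSketch {

    // Splits the input on runs of whitespace and returns the non-empty tokens in order.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        if (text == null) {
            return tokens;
        }
        for (String part : text.trim().split("\\s+")) {
            if (!part.isEmpty()) {
                tokens.add(part);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Prints: [Uses, the, specified, Weka, tokenizer.]
        System.out.println(tokenize("Uses the specified Weka tokenizer."));
    }
}
```

Note that this naive split leaves punctuation attached to the neighboring word ("tokenizer."), so it should be read as an assumption-laden baseline rather than a description of how any tokenizer in this package behaves.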