Package weka.core.tokenizers.cleaners
Class Summary

Class                     Description
NormalizeTwitterHashtag   Replaces all hashtags with the same dummy hashtag.
NormalizeTwitterUsers     Replaces all users with the same dummy user.
NormalizeURLs             Replaces all URLs with the same dummy URL.
RegExp                    Cleans tokens based on regular expressions, i.e., if a token matches the regular expression, it is replaced with the specified expression.
RemoveEmoticons           Removes emoticons, based on a provided list of emoticon strings.
WordCluster               Replaces words with clusters.
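
The behaviour described for these cleaners can be illustrated with a small standalone sketch. It does not use the classes of this package: the class name `CleanerSketch`, its methods, the regular expressions, and the dummy strings (`#hashtag`, `@user`, `http://url`) are all assumptions made for illustration, and the actual classes may match and replace tokens differently. The sketch treats the three normalizers as special cases of the "replace the token if it matches" rule stated for RegExp.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Illustrative only: a hypothetical re-creation of the cleaning rules,
// not the API of weka.core.tokenizers.cleaners.
public class CleanerSketch {

    // RegExp-style rule: if the whole token matches the regular expression,
    // return the replacement; otherwise return the token unchanged.
    public static String regExp(String token, String regex, String replacement) {
        return token.matches(regex) ? replacement : token;
    }

    // Cleans a single token; an empty string means "token removed".
    public static String clean(String token, Set<String> emoticons,
                               Map<String, String> clusters) {
        if (emoticons.contains(token)) {
            return ""; // RemoveEmoticons-style: drop tokens found in the list
        }
        // Normalizer-style rules; the patterns and dummy strings are assumed.
        String cleaned = regExp(token, "#\\w+", "#hashtag");
        cleaned = regExp(cleaned, "@\\w+", "@user");
        cleaned = regExp(cleaned, "https?://\\S+", "http://url");
        // WordCluster-style: map a word to its cluster label if one is known.
        return clusters.getOrDefault(cleaned, cleaned);
    }

    // Applies clean() to every token and skips the removed ones.
    public static List<String> cleanAll(List<String> tokens, Set<String> emoticons,
                                        Map<String, String> clusters) {
        List<String> result = new ArrayList<>();
        for (String token : tokens) {
            String cleaned = clean(token, emoticons, clusters);
            if (!cleaned.isEmpty()) {
                result.add(cleaned);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> tokens =
            List.of("@alice", "loves", "#weka", ":)", "http://example.org/x");
        // Prints [@user, cluster_42, #hashtag, http://url]
        System.out.println(
            cleanAll(tokens, Set.of(":)"), Map.of("loves", "cluster_42")));
    }
}
```

In this sketch the rules are applied in a fixed order inside one method to keep the example short; whether and how the real cleaners are chained is not stated in this summary.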