Class NormalizeURLs

  • All Implemented Interfaces:
    Serializable, weka.core.OptionHandler, weka.core.tokenizers.cleaners.TokenCleaner

    public class NormalizeURLs
    extends weka.core.tokenizers.cleaners.AbstractTokenCleaner
    Replaces all urls with the same dummy url.
    Version:
    $Revision$
    Author:
    FracPete (fracpete at waikato dot ac dot nz)
    See Also:
    Serialized Form
    • Constructor Detail

      • NormalizeURLs

        public NormalizeURLs()
    • Method Detail

      • globalInfo

        public String globalInfo()
        Returns a string describing the cleaner.
        Specified by:
        globalInfo in class weka.core.tokenizers.cleaners.AbstractTokenCleaner
        Returns:
        a description suitable for displaying in the explorer/experimenter gui
      • reset

        protected void reset()
        Resets the cleaner.
        Overrides:
        reset in class weka.core.tokenizers.cleaners.AbstractTokenCleaner
      • clean

        public String clean​(String token)
        Determines whether a token is clean or not.
        Specified by:
        clean in interface weka.core.tokenizers.cleaners.TokenCleaner
        Specified by:
        clean in class weka.core.tokenizers.cleaners.AbstractTokenCleaner
        Parameters:
        token - the token to check
        Returns:
        the clean token or null to ignore