Class ClusterStatistics

  • All Implemented Interfaces:
    adams.core.AdditionalInformationHandler, adams.core.Destroyable, adams.core.GlobalInfoSupporter, adams.core.logging.LoggingLevelHandler, adams.core.logging.LoggingSupporter, adams.core.option.OptionHandler, adams.core.SizeOfHandler, Serializable

    public class ClusterStatistics
    extends AbstractClusterMembershipPostProcessor
    Computes cluster statistics (min/max/mean/stdev) for the provided dataset. An additional attribute is added to the dataset structure that indicates the cluster index and the statistic.
    Only numeric attributes are considered when computing the statistics.
    Stored in container under: Clustered dataset

    -logging-level <OFF|SEVERE|WARNING|INFO|CONFIG|FINE|FINER|FINEST> (property: loggingLevel)
        The logging level for outputting errors and debugging output.
        default: WARNING
     
    Version:
    $Revision: 7171 $
    Author:
    fracpete (fracpete at waikato dot ac dot nz)
    See Also:
    Serialized Form
    • Field Summary

      • Fields inherited from class adams.core.option.AbstractOptionHandler

        m_OptionManager
      • Fields inherited from class adams.core.logging.LoggingObject

        m_Logger, m_LoggingIsEnabled, m_LoggingLevel
    • Method Summary

      All Methods Instance Methods Concrete Methods 
      Modifier and Type Method Description
      protected weka.core.Instances calculateStatistics​(weka.core.Instances data, weka.clusterers.Clusterer clusterer, weka.core.Instances outputFormat)
      Calculates the statistics.
      protected weka.core.Instances createOutputFormat​(weka.core.Instances data)
      Generates the output format (additional attribute for cluster index).
      protected weka.core.DenseInstance createRow​(int index, String statistic, weka.core.Instances data)
      Creates a new stats row.
      String globalInfo()
      Returns a string describing the object.
      protected weka.core.Instances processDatasetWithClusterer​(weka.core.Instances data, weka.clusterers.Clusterer clusterer)
      Performs some form of processing on the full dataset.
      • Methods inherited from class adams.core.option.AbstractOptionHandler

        cleanUpOptions, defineOptions, destroy, finishInit, getDefaultLoggingLevel, getOptionManager, initialize, loggingLevelTipText, newOptionManager, reset, setLoggingLevel, toCommandLine, toString
      • Methods inherited from class adams.core.logging.LoggingObject

        configureLogger, getLogger, getLoggingLevel, initializeLogging, isLoggingEnabled, sizeOf
      • Methods inherited from interface adams.core.logging.LoggingLevelHandler

        getLoggingLevel
    • Constructor Detail

      • ClusterStatistics

        public ClusterStatistics()
    • Method Detail

      • globalInfo

        public String globalInfo()
        Returns a string describing the object.
        Specified by:
        globalInfo in interface adams.core.GlobalInfoSupporter
        Specified by:
        globalInfo in class adams.core.option.AbstractOptionHandler
        Returns:
        a description suitable for displaying in the gui
      • createOutputFormat

        protected weka.core.Instances createOutputFormat​(weka.core.Instances data)
        Generates the output format (additional attribute for cluster index).
        Parameters:
        data - the original input data
        Returns:
        the header of the output format
      • createRow

        protected weka.core.DenseInstance createRow​(int index,
                                                    String statistic,
                                                    weka.core.Instances data)
        Creates a new stats row.
        Parameters:
        index - the cluster index
        statistic - the statistic's name
        data - the data format
        Returns:
        the generated row
      • calculateStatistics

        protected weka.core.Instances calculateStatistics​(weka.core.Instances data,
                                                          weka.clusterers.Clusterer clusterer,
                                                          weka.core.Instances outputFormat)
        Calculates the statistics.
        Parameters:
        data - the input data used for training the clusterer
        clusterer - the built clusterer
        outputFormat - the format to use for the output
        Returns:
        the generated output