Class GroupedRandomSplitGenerator

    • Field Detail

      • m_Percentage

        protected double m_Percentage
        the percentage.
      • m_PreserveOrder

        protected boolean m_PreserveOrder
        whether to preserve the order.
      • m_Generated

        protected boolean m_Generated
        whether the split was generated.
      • m_RegExp

        protected BaseRegExp m_RegExp
        the regular expression for the nominal/string attribute.
      • m_Group

        protected String m_Group
        the group expression.
      • m_Generator

        protected RandomSplitGenerator m_Generator
        the underlying scheme for generating the split.
    • Constructor Detail

      • GroupedRandomSplitGenerator

        public GroupedRandomSplitGenerator()
        Initializes the generator.
      • GroupedRandomSplitGenerator

        public GroupedRandomSplitGenerator​(weka.core.Instances data,
                                           long seed,
                                           double percentage,
                                           boolean preserveOrder,
                                           WekaAttributeIndex index,
                                           BaseRegExp regExp,
                                           String group)
        Initializes the generator. Does not preserve the order.
        Parameters:
        data - the dataset to split
        seed - the seed value to use for randomization
        percentage - the percentage of the training set (0-1)
        preserveOrder - whether to preserve the order
        index - the attribute index
        regExp - the regular expression to apply to the attribute values
        group - the regexp group to use as group
    • Method Detail

      • percentageTipText

        public String percentageTipText()
        Returns the tip text for this property.
        Returns:
        tip text for this property suitable for displaying in the GUI or for listing the options.
      • preserveOrderTipText

        public String preserveOrderTipText()
        Returns the tip text for this property.
        Returns:
        tip text for this property suitable for displaying in the GUI or for listing the options.
      • setIndex

        public void setIndex​(WekaAttributeIndex value)
        Sets the attribute index to use for grouping.
        Parameters:
        value - the index
      • getIndex

        public WekaAttributeIndex getIndex()
        Returns the attribute index to use for grouping.
        Returns:
        the index
      • indexTipText

        public String indexTipText()
        Returns the tip text for this property.
        Returns:
        tip text for this property suitable for displaying in the GUI or for listing the options.
      • setRegExp

        public void setRegExp​(BaseRegExp value)
        Sets the regular expression for identifying the group (eg '^(.*)-([0-9]+)-(.*)$').
        Parameters:
        value - the expression
      • getRegExp

        public BaseRegExp getRegExp()
        Returns the regular expression for identifying the group (eg '^(.*)-([0-9]+)-(.*)$').
        Returns:
        the expression
      • regExpTipText

        public String regExpTipText()
        Returns the tip text for this property.
        Returns:
        tip text for this property suitable for displaying in the GUI or for listing the options.
      • setGroup

        public void setGroup​(String value)
        Sets the replacement string to use as group (eg '$2').
        Parameters:
        value - the group
      • getGroup

        public String getGroup()
        Returns the replacement string to use as group (eg '$2').
        Returns:
        the group
      • groupTipText

        public String groupTipText()
        Returns the tip text for this property.
        Returns:
        tip text for this property suitable for displaying in the GUI or for listing the options.
      • canRandomize

        protected boolean canRandomize()
        Returns whether randomization is enabled.
        Specified by:
        canRandomize in class AbstractSplitGenerator
        Returns:
        true if to randomize
      • checkNext

        protected boolean checkNext()
        Returns true if the iteration has more elements. (In other words, returns true if next would return an element rather than throwing an exception.)
        Specified by:
        checkNext in class AbstractSplitGenerator
        Returns:
        true if the iterator has more elements.