Package adams.ml.evaluation
Class DefaultCrossValidationFoldGenerator
- java.lang.Object
-
- adams.core.logging.LoggingObject
-
- adams.core.logging.CustomLoggingLevelObject
-
- adams.core.option.AbstractOptionHandler
-
- adams.ml.evaluation.AbstractSplitGenerator
-
- adams.ml.evaluation.DefaultCrossValidationFoldGenerator
-
- All Implemented Interfaces:
Destroyable,GlobalInfoSupporter,LoggingLevelHandler,LoggingSupporter,OptionHandler,Randomizable,SizeOfHandler,CrossValidationFoldGenerator<Dataset,TrainTestSetContainer>,PerFoldCrossValidationFoldGenerator<Dataset,TrainTestSetContainer>,SplitGenerator<Dataset,TrainTestSetContainer>,Serializable,Iterator<TrainTestSetContainer>
public class DefaultCrossValidationFoldGenerator extends AbstractSplitGenerator implements PerFoldCrossValidationFoldGenerator<Dataset,TrainTestSetContainer>
Helper class for generating cross-validation folds.
The template for the relation name accepts the following placeholders:- Author:
- fracpete (fracpete at waikato dot ac dot nz)
- See Also:
- Serialized Form
-
-
Field Summary
Fields Modifier and Type Field Description protected intm_ActualNumFoldsthe actual number of folds.protected intm_CurrentFoldthe current fold.protected List<FoldPair<Binnable<DataRow>>>m_FoldPairsthe temporary pairs.protected CrossValidationGeneratorm_Generatorthe underlying scheme for generating the folds.protected intm_NumFoldsthe number of folds.protected booleanm_Randomizewhether to randomize the data.protected booleanm_Stratifywhether to stratify the data (in case of nominal class).-
Fields inherited from class adams.ml.evaluation.AbstractSplitGenerator
m_Data, m_Initialized, m_OriginalIndices, m_Seed, m_UseViews
-
Fields inherited from class adams.core.option.AbstractOptionHandler
m_OptionManager
-
Fields inherited from class adams.core.logging.LoggingObject
m_Logger, m_LoggingIsEnabled, m_LoggingLevel
-
-
Constructor Summary
Constructors Constructor Description DefaultCrossValidationFoldGenerator()Initializes the generator.DefaultCrossValidationFoldGenerator(Dataset data, int numFolds, long seed, boolean stratify)Initializes the generator.DefaultCrossValidationFoldGenerator(Dataset data, int numFolds, long seed, boolean randomize, boolean stratify)Initializes the generator.
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description protected booleancanRandomize()Returns whether randomization is enabled.protected booleancheckNext()Returns true if the iteration has more elements.protected TrainTestSetContainercreateNext()Returns the next element in the iteration.int[]crossValidationIndices()Returns the cross-validation indices.int[][]crossValidationIndicesPerFold()Returns the cross-validation indices per fold.voiddefineOptions()Adds options to the internal list of options.protected voiddoInitializeIterator()Initializes the iterator, randomizes the data if required.intgetActualNumFolds()Returns the actual number of folds used (eg when using LOO).intgetNumFolds()Returns the number of folds.booleangetRandomize()Returns whether to randomize the data.booleangetStratify()Returns whether to stratify the data (in case of nominal class).StringglobalInfo()Returns a string describing the object.StringnumFoldsTipText()Returns the tip text for this property.StringrandomizeTipText()Returns the tip text for this property.protected voidreset()Resets the generator.voidsetData(Dataset value)Sets the original data.voidsetNumFolds(int value)Sets the number of folds to use.voidsetRandomize(boolean value)Sets whether to randomize the data.voidsetStratify(boolean value)Sets whether to stratify the data (nominal class).StringstratifyTipText()Returns the tip text for this property.StringtoString()Returns a short description of the generator.-
Methods inherited from class adams.ml.evaluation.AbstractSplitGenerator
getData, getSeed, getUseViews, hasNext, initialize, initializeIterator, next, randomize, remove, seedTipText, setSeed, setUseViews, useViewsTipText
-
Methods inherited from class adams.core.option.AbstractOptionHandler
cleanUpOptions, destroy, finishInit, getDefaultLoggingLevel, getOptionManager, loggingLevelTipText, newOptionManager, setLoggingLevel, toCommandLine
-
Methods inherited from class adams.core.logging.LoggingObject
configureLogger, getLogger, getLoggingLevel, initializeLogging, isLoggingEnabled, sizeOf
-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
-
Methods inherited from interface adams.core.Destroyable
destroy
-
Methods inherited from interface java.util.Iterator
forEachRemaining
-
Methods inherited from interface adams.core.logging.LoggingLevelHandler
getLoggingLevel
-
Methods inherited from interface adams.core.option.OptionHandler
cleanUpOptions, getOptionManager, toCommandLine
-
Methods inherited from interface adams.core.Randomizable
getSeed, seedTipText, setSeed
-
Methods inherited from interface adams.data.splitgenerator.SplitGenerator
getData, hasNext, initializeIterator, next, remove
-
-
-
-
Field Detail
-
m_NumFolds
protected int m_NumFolds
the number of folds.
-
m_ActualNumFolds
protected int m_ActualNumFolds
the actual number of folds.
-
m_Stratify
protected boolean m_Stratify
whether to stratify the data (in case of nominal class).
-
m_CurrentFold
protected transient int m_CurrentFold
the current fold.
-
m_Randomize
protected boolean m_Randomize
whether to randomize the data.
-
m_Generator
protected transient CrossValidationGenerator m_Generator
the underlying scheme for generating the folds.
-
-
Constructor Detail
-
DefaultCrossValidationFoldGenerator
public DefaultCrossValidationFoldGenerator()
Initializes the generator.
-
DefaultCrossValidationFoldGenerator
public DefaultCrossValidationFoldGenerator(Dataset data, int numFolds, long seed, boolean stratify)
Initializes the generator.- Parameters:
data- the full datasetnumFolds- the number of folds, leave-one-out if less than 2seed- the seed for randomizationstratify- whether to perform stratified CV
-
DefaultCrossValidationFoldGenerator
public DefaultCrossValidationFoldGenerator(Dataset data, int numFolds, long seed, boolean randomize, boolean stratify)
Initializes the generator.- Parameters:
data- the full datasetnumFolds- the number of folds, leave-one-out if less than 2seed- the seed valuerandomize- whether to randomize the datastratify- whether to perform stratified CV
-
-
Method Detail
-
globalInfo
public String globalInfo()
Returns a string describing the object.- Specified by:
globalInfoin interfaceGlobalInfoSupporter- Specified by:
globalInfoin classAbstractOptionHandler- Returns:
- a description suitable for displaying in the gui
-
defineOptions
public void defineOptions()
Adds options to the internal list of options.- Specified by:
defineOptionsin interfaceOptionHandler- Overrides:
defineOptionsin classAbstractSplitGenerator
-
reset
protected void reset()
Resets the generator.- Overrides:
resetin classAbstractSplitGenerator
-
setData
public void setData(Dataset value)
Sets the original data.- Specified by:
setDatain interfaceSplitGenerator<Dataset,TrainTestSetContainer>- Overrides:
setDatain classAbstractSplitGenerator- Parameters:
value- the data
-
setNumFolds
public void setNumFolds(int value)
Sets the number of folds to use.- Specified by:
setNumFoldsin interfaceCrossValidationFoldGenerator<Dataset,TrainTestSetContainer>- Parameters:
value- the number of folds, less than 2 for LOO
-
getNumFolds
public int getNumFolds()
Returns the number of folds.- Specified by:
getNumFoldsin interfaceCrossValidationFoldGenerator<Dataset,TrainTestSetContainer>- Returns:
- the number of folds
-
numFoldsTipText
public String numFoldsTipText()
Returns the tip text for this property.- Returns:
- tip text for this property suitable for displaying in the GUI or for listing the options.
-
getActualNumFolds
public int getActualNumFolds()
Returns the actual number of folds used (eg when using LOO).- Specified by:
getActualNumFoldsin interfaceCrossValidationFoldGenerator<Dataset,TrainTestSetContainer>- Returns:
- the actual number of folds, -1 if not yet calculated
- See Also:
AbstractSplitGenerator.initializeIterator()
-
setRandomize
public void setRandomize(boolean value)
Sets whether to randomize the data.- Specified by:
setRandomizein interfaceCrossValidationFoldGenerator<Dataset,TrainTestSetContainer>- Parameters:
value- true if to randomize the data
-
getRandomize
public boolean getRandomize()
Returns whether to randomize the data.- Specified by:
getRandomizein interfaceCrossValidationFoldGenerator<Dataset,TrainTestSetContainer>- Returns:
- true if to randomize the data
-
randomizeTipText
public String randomizeTipText()
Returns the tip text for this property.- Returns:
- tip text for this property suitable for displaying in the GUI or for listing the options.
-
setStratify
public void setStratify(boolean value)
Sets whether to stratify the data (nominal class).- Parameters:
value- whether to stratify the data (nominal class)
-
getStratify
public boolean getStratify()
Returns whether to stratify the data (in case of nominal class).- Returns:
- true if to stratify
-
stratifyTipText
public String stratifyTipText()
Returns the tip text for this property.- Returns:
- tip text for this property suitable for displaying in the GUI or for listing the options.
-
canRandomize
protected boolean canRandomize()
Returns whether randomization is enabled.- Specified by:
canRandomizein classAbstractSplitGenerator- Returns:
- true if to randomize
-
checkNext
protected boolean checkNext()
Returns true if the iteration has more elements. (In other words, returns true if next would return an element rather than throwing an exception.)- Specified by:
checkNextin classAbstractSplitGenerator- Returns:
- true if the iterator has more elements.
-
doInitializeIterator
protected void doInitializeIterator()
Initializes the iterator, randomizes the data if required.- Specified by:
doInitializeIteratorin classAbstractSplitGenerator- See Also:
AbstractSplitGenerator.canRandomize()
-
createNext
protected TrainTestSetContainer createNext()
Returns the next element in the iteration.- Specified by:
createNextin classAbstractSplitGenerator- Returns:
- the next element in the iteration.
- Throws:
NoSuchElementException- iteration has no more elements.
-
crossValidationIndices
public int[] crossValidationIndices()
Returns the cross-validation indices.- Specified by:
crossValidationIndicesin interfaceCrossValidationFoldGenerator<Dataset,TrainTestSetContainer>- Returns:
- the indices
-
crossValidationIndicesPerFold
public int[][] crossValidationIndicesPerFold()
Returns the cross-validation indices per fold.- Specified by:
crossValidationIndicesPerFoldin interfacePerFoldCrossValidationFoldGenerator<Dataset,TrainTestSetContainer>- Returns:
- the indices
-
toString
public String toString()
Returns a short description of the generator.- Specified by:
toStringin interfaceSplitGenerator<Dataset,TrainTestSetContainer>- Overrides:
toStringin classAbstractSplitGenerator- Returns:
- a short description
-
-