Package adams.flow.transformer
Class WekaCrossValidationSplit
- java.lang.Object
-
- adams.core.logging.LoggingObject
-
- adams.core.logging.CustomLoggingLevelObject
-
- adams.core.option.AbstractOptionHandler
-
- adams.flow.core.AbstractActor
-
- adams.flow.transformer.AbstractTransformer
-
- adams.flow.transformer.WekaCrossValidationSplit
-
- All Implemented Interfaces:
adams.core.AdditionalInformationHandler,adams.core.CleanUpHandler,adams.core.Destroyable,adams.core.GlobalInfoSupporter,adams.core.logging.LoggingLevelHandler,adams.core.logging.LoggingSupporter,adams.core.option.OptionHandler,adams.core.QuickInfoSupporter,adams.core.Randomizable,adams.core.ShallowCopySupporter<adams.flow.core.Actor>,adams.core.SizeOfHandler,adams.core.Stoppable,adams.core.StoppableWithFeedback,adams.core.VariablesInspectionHandler,InstancesViewCreator,adams.event.VariableChangeListener,adams.flow.core.Actor,adams.flow.core.ErrorHandler,adams.flow.core.InputConsumer,adams.flow.core.OutputProducer,Serializable,Comparable
public class WekaCrossValidationSplit extends adams.flow.transformer.AbstractTransformer implements adams.core.Randomizable, InstancesViewCreator
Generates train/test pairs like during a cross-validation run. It is possible to generate pairs for leave-one-out cross-validation (LOOCV) as well.
It is essential that a class attribute is set. In case of a nominal class attribute, the data gets stratified automatically.
Each of the pairs gets forwarded as a container. The training set can be accessed in the container with 'Train' and the test set with 'Test'.
Input/output:
- accepts:
weka.core.Instances
- generates:
adams.flow.container.WekaTrainTestSetContainer
Container information:
- adams.flow.container.WekaTrainTestSetContainer: Train, Test, Seed, FoldNumber, FoldCount, Train original indices, Test original indices
-logging-level <OFF|SEVERE|WARNING|INFO|CONFIG|FINE|FINER|FINEST> (property: loggingLevel) The logging level for outputting errors and debugging output. default: WARNING
-name <java.lang.String> (property: name) The name of the actor. default: WekaCrossValidationSplit
-annotation <adams.core.base.BaseAnnotation> (property: annotations) The annotations to attach to this actor. default:
-skip <boolean> (property: skip) If set to true, transformation is skipped and the input token is just forwarded as it is. default: false
-stop-flow-on-error <boolean> (property: stopFlowOnError) If set to true, the flow execution at this level gets stopped in case this actor encounters an error; the error gets propagated; useful for critical actors. default: false
-silent <boolean> (property: silent) If enabled, then no errors are output in the console; Note: the enclosing actor handler must have this enabled as well. default: false
-seed <long> (property: seed) The seed value for the randomization. default: 1
-folds <int> (property: folds) The folds to use; using '-1' will generate folds for leave-one-out cross-validation (LOOCV). default: 10
-relation <java.lang.String> (property: relationName) The placeholders for creating the relation name: @ = original relation name, $T = type (train/test), $N = current fold number. default: @
-create-view <boolean> (property: createView) If enabled, views of the dataset are created instead of actual copies. default: false
-generator <weka.classifiers.CrossValidationFoldGenerator> (property: generator) The scheme to use for generating the folds; the actor options take precedence over the scheme's ones. default: weka.classifiers.DefaultCrossValidationFoldGenerator
- Author:
- fracpete (fracpete at waikato dot ac dot nz)
- See Also:
- Serialized Form
-
-
Field Summary
Fields Modifier and Type Field Description static StringBACKUP_GENERATORthe key for storing the current fold in the backup.protected CrossValidationFoldGeneratorm_ActualGeneratorthe actual fold generator.protected booleanm_CreateViewwhether to create a view only.protected intm_Foldsthe number of folds to generate.protected CrossValidationFoldGeneratorm_Generatorthe fold generator.protected Stringm_RelationNamethe format of the relation names of the generated datasets.protected longm_Seedthe seed value.-
Fields inherited from class adams.flow.transformer.AbstractTransformer
BACKUP_INPUT, BACKUP_OUTPUT, m_InputToken, m_OutputToken
-
Fields inherited from class adams.flow.core.AbstractActor
m_Annotations, m_BackupState, m_DetectedObjectVariables, m_DetectedVariables, m_ErrorHandler, m_Executed, m_Executing, m_ExecutionListeningSupporter, m_FullName, m_LoggingPrefix, m_Name, m_Parent, m_ScopeHandler, m_Self, m_Silent, m_Skip, m_StopFlowOnError, m_StopMessage, m_Stopped, m_StorageHandler, m_VariablesUpdated
-
-
Constructor Summary
Constructors Constructor Description WekaCrossValidationSplit()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description Class[]accepts()Returns the class that the consumer accepts.protected Hashtable<String,Object>backupState()Backs up the current state of the actor before update the variables.StringcreateViewTipText()Returns the tip text for this property.voiddefineOptions()Adds options to the internal list of options.protected StringdoExecute()Executes the flow item.StringfoldsTipText()Returns the tip text for this property.Class[]generates()Returns the class of objects that it generates.StringgeneratorTipText()Returns the tip text for this property.booleangetCreateView()Returns whether to create only a view.intgetFolds()Returns the number of folds to generate.CrossValidationFoldGeneratorgetGenerator()Returns the scheme for generating the folds.StringgetQuickInfo()Returns a quick info about the actor, which will be displayed in the GUI.StringgetRelationName()Returns the pattern used for renaming the relation.longgetSeed()Returns the seed value.StringglobalInfo()Returns a string describing the object.booleanhasPendingOutput()Checks whether there is pending output to be collected after executing the flow item.adams.flow.core.Tokenoutput()Returns the generated token.protected voidpruneBackup()Removes entries from the backup.StringrelationNameTipText()Returns the tip text for this property.protected voidreset()Resets the scheme.protected voidrestoreState(Hashtable<String,Object> state)Restores the state of the actor before the variables got updated.StringseedTipText()Returns the tip text for this property.voidsetCreateView(boolean value)Sets whether to create a view only.voidsetFolds(int value)Sets the number of folds to use.voidsetGenerator(CrossValidationFoldGenerator value)Sets the scheme for generating the folds.voidsetRelationName(String value)Sets the pattern to use for renaming the relation.voidsetSeed(long value)Sets the seed value.voidstopExecution()Stops the execution.voidwrapUp()Cleans up after the execution has finished.-
Methods inherited from class adams.flow.transformer.AbstractTransformer
currentInput, execute, hasInput, input, postExecute
-
Methods inherited from class adams.flow.core.AbstractActor
annotationsTipText, canInspectOptions, canPerformSetUpCheck, cleanUp, compareTo, configureLogger, destroy, equals, finalUpdateVariables, findVariables, findVariables, forceVariables, forCommandLine, forName, forName, getAdditionalInformation, getAnnotations, getDefaultName, getDetectedVariables, getErrorHandler, getFlowActors, getFlowExecutionListeningSupporter, getFullName, getName, getNextSibling, getParent, getParentComponent, getPreviousSibling, getRoot, getScopeHandler, getSilent, getSkip, getStopFlowOnError, getStopMessage, getStorageHandler, getVariables, handleError, handleException, hasErrorHandler, hasStopMessage, index, initialize, isBackedUp, isExecuted, isExecuting, isFinished, isHeadless, isStopped, nameTipText, performSetUpChecks, performVariableChecks, preExecute, pruneBackup, setAnnotations, setErrorHandler, setName, setParent, setSilent, setSkip, setStopFlowOnError, setUp, setVariables, shallowCopy, shallowCopy, silentTipText, sizeOf, skipTipText, stopExecution, stopFlowOnErrorTipText, updateDetectedVariables, updatePrefix, updateVariables, variableChanged
-
Methods inherited from class adams.core.option.AbstractOptionHandler
cleanUpOptions, finishInit, getDefaultLoggingLevel, getOptionManager, loggingLevelTipText, newOptionManager, setLoggingLevel, toCommandLine, toString
-
Methods inherited from class adams.core.logging.LoggingObject
getLogger, getLoggingLevel, initializeLogging, isLoggingEnabled
-
Methods inherited from class java.lang.Object
clone, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
-
Methods inherited from interface adams.flow.core.Actor
cleanUp, compareTo, destroy, equals, findVariables, getAnnotations, getDefaultName, getDetectedVariables, getErrorHandler, getFlowExecutionListeningSupporter, getFullName, getName, getNextSibling, getParent, getParentComponent, getPreviousSibling, getRoot, getScopeHandler, getSilent, getSkip, getStopFlowOnError, getStopMessage, getStorageHandler, getVariables, handleError, hasErrorHandler, hasStopMessage, index, isExecuted, isFinished, isHeadless, isStopped, setAnnotations, setErrorHandler, setName, setParent, setSilent, setSkip, setStopFlowOnError, setUp, setVariables, shallowCopy, shallowCopy, sizeOf, stopExecution, toCommandLine, variableChanged
-
-
-
-
Field Detail
-
BACKUP_GENERATOR
public static final String BACKUP_GENERATOR
the key for storing the current fold in the backup.- See Also:
- Constant Field Values
-
m_Seed
protected long m_Seed
the seed value.
-
m_Folds
protected int m_Folds
the number of folds to generate.
-
m_RelationName
protected String m_RelationName
the format of the relation names of the generated datasets.
-
m_CreateView
protected boolean m_CreateView
whether to create a view only.
-
m_Generator
protected CrossValidationFoldGenerator m_Generator
the fold generator.
-
m_ActualGenerator
protected transient CrossValidationFoldGenerator m_ActualGenerator
the actual fold generator.
-
-
Method Detail
-
globalInfo
public String globalInfo()
Returns a string describing the object.- Specified by:
globalInfoin interfaceadams.core.GlobalInfoSupporter- Specified by:
globalInfoin classadams.core.option.AbstractOptionHandler- Returns:
- a description suitable for displaying in the gui
-
defineOptions
public void defineOptions()
Adds options to the internal list of options.- Specified by:
defineOptionsin interfaceadams.core.option.OptionHandler- Overrides:
defineOptionsin classadams.flow.core.AbstractActor
-
getQuickInfo
public String getQuickInfo()
Returns a quick info about the actor, which will be displayed in the GUI.- Specified by:
getQuickInfoin interfaceadams.flow.core.Actor- Specified by:
getQuickInfoin interfaceadams.core.QuickInfoSupporter- Overrides:
getQuickInfoin classadams.flow.core.AbstractActor- Returns:
- null if no info available, otherwise short string
-
accepts
public Class[] accepts()
Returns the class that the consumer accepts.- Specified by:
acceptsin interfaceadams.flow.core.InputConsumer- Returns:
- weka.core.Instances.class
-
generates
public Class[] generates()
Returns the class of objects that it generates.- Specified by:
generatesin interfaceadams.flow.core.OutputProducer- Returns:
- adams.flow.container.WekaTrainTestSetContainer.class
-
setSeed
public void setSeed(long value)
Sets the seed value.- Specified by:
setSeedin interfaceadams.core.Randomizable- Parameters:
value- the seed
-
getSeed
public long getSeed()
Returns the seed value.- Specified by:
getSeedin interfaceadams.core.Randomizable- Returns:
- the seed
-
seedTipText
public String seedTipText()
Returns the tip text for this property.- Specified by:
seedTipTextin interfaceadams.core.Randomizable- Returns:
- tip text for this property suitable for displaying in the GUI or for listing the options.
-
setFolds
public void setFolds(int value)
Sets the number of folds to use.- Parameters:
value- the folds, use -1 for LOOCV
-
getFolds
public int getFolds()
Returns the number of folds to generate.- Returns:
- the folds, 1 in case of LOOCV
-
foldsTipText
public String foldsTipText()
Returns the tip text for this property.- Returns:
- tip text for this property suitable for displaying in the GUI or for listing the options.
-
setRelationName
public void setRelationName(String value)
Sets the pattern to use for renaming the relation.- Parameters:
value- the pattern
-
getRelationName
public String getRelationName()
Returns the pattern used for renaming the relation.- Returns:
- the pattern
-
relationNameTipText
public String relationNameTipText()
Returns the tip text for this property.- Returns:
- tip text for this property suitable for displaying in the GUI or for listing the options.
-
setCreateView
public void setCreateView(boolean value)
Sets whether to create a view only.- Specified by:
setCreateViewin interfaceInstancesViewCreator- Parameters:
value- true if to create a view only
-
getCreateView
public boolean getCreateView()
Returns whether to create only a view.- Specified by:
getCreateViewin interfaceInstancesViewCreator- Returns:
- true if to create view only
-
createViewTipText
public String createViewTipText()
Returns the tip text for this property.- Specified by:
createViewTipTextin interfaceInstancesViewCreator- Returns:
- tip text for this property suitable for displaying in the GUI or for listing the options.
-
setGenerator
public void setGenerator(CrossValidationFoldGenerator value)
Sets the scheme for generating the folds.- Parameters:
value- the generator
-
getGenerator
public CrossValidationFoldGenerator getGenerator()
Returns the scheme for generating the folds.- Returns:
- the generator
-
generatorTipText
public String generatorTipText()
Returns the tip text for this property.- Returns:
- tip text for this property suitable for displaying in the GUI or for listing the options.
-
pruneBackup
protected void pruneBackup()
Removes entries from the backup.- Overrides:
pruneBackupin classadams.flow.core.AbstractActor
-
backupState
protected Hashtable<String,Object> backupState()
Backs up the current state of the actor before update the variables.- Overrides:
backupStatein classadams.flow.transformer.AbstractTransformer- Returns:
- the backup
-
restoreState
protected void restoreState(Hashtable<String,Object> state)
Restores the state of the actor before the variables got updated.- Overrides:
restoreStatein classadams.flow.transformer.AbstractTransformer- Parameters:
state- the backup of the state to restore from
-
reset
protected void reset()
Resets the scheme.- Overrides:
resetin classadams.flow.core.AbstractActor
-
doExecute
protected String doExecute()
Executes the flow item.- Specified by:
doExecutein classadams.flow.core.AbstractActor- Returns:
- null if everything is fine, otherwise error message
-
hasPendingOutput
public boolean hasPendingOutput()
Checks whether there is pending output to be collected after executing the flow item.- Specified by:
hasPendingOutputin interfaceadams.flow.core.OutputProducer- Overrides:
hasPendingOutputin classadams.flow.transformer.AbstractTransformer- Returns:
- true if there is pending output
-
output
public adams.flow.core.Token output()
Returns the generated token.- Specified by:
outputin interfaceadams.flow.core.OutputProducer- Overrides:
outputin classadams.flow.transformer.AbstractTransformer- Returns:
- the generated token
-
wrapUp
public void wrapUp()
Cleans up after the execution has finished.- Specified by:
wrapUpin interfaceadams.flow.core.Actor- Overrides:
wrapUpin classadams.flow.transformer.AbstractTransformer
-
stopExecution
public void stopExecution()
Stops the execution. No message set.- Specified by:
stopExecutionin interfaceadams.flow.core.Actor- Specified by:
stopExecutionin interfaceadams.core.Stoppable- Overrides:
stopExecutionin classadams.flow.core.AbstractActor
-
-