adams.flow.transformer.WekaRandomSplit

Name

adams.flow.transformer.WekaRandomSplit

Synopsis

Splits a dataset into a training and test set according to a specified split percentage. Randomization can be suppressed using the 'preserve order' option.
The training set can be accessed in the container with 'Train' and the test set with 'Test'.
Depending on the split generator in use, more than one container may be output.

Additional information

Flow input/output:
- input: weka.core.Instances
- output: adams.flow.container.WekaTrainTestSetContainer

Container information:
- adams.flow.container.WekaTrainTestSetContainer:
   - Train: training set; weka.core.Instances
   - Test: test set; weka.core.Instances
   - Seed: seed value; java.lang.Long
   - FoldNumber: current fold (1-based); java.lang.Integer
   - FoldCount: total number of folds; java.lang.Integer
   - Train original indices: original indices (0-based, train); array of int
   - Test original indices: original indices (0-based, test); array of int

Options

loggingLevel

The logging level for outputting errors and debugging output.

command-line -logging-level <OFF|SEVERE|WARNING|INFO|CONFIG|FINE|FINER|FINEST>

default WARNING

min-user-mode Expert
name

The name of the actor.

command-line -name <java.lang.String>

default WekaRandomSplit
annotations

The annotations to attach to this actor.

command-line -annotation <adams.core.base.BaseAnnotation>

default
skip

If set to true, transformation is skipped and the input token is just forwarded as it is.

command-line -skip <boolean>

default false
stopFlowOnError

If set to true, the flow execution at this level gets stopped in case this actor encounters an error; the error gets propagated; useful for critical actors.

command-line -stop-flow-on-error <boolean>

default false

min-user-mode Expert
silent

If enabled, then no errors are output in the console; Note: the enclosing actor handler must have this enabled as well.

command-line -silent <boolean>

default false

min-user-mode Expert
outputArray

If enabled, the splits are output as array rather than one-by-one.

command-line -output-array <boolean>

default false
preserveOrder

If set to true, then the order is preserved by suppressing randomization; overrides the value defined by the split generator scheme.

command-line -preserve-order <boolean>

default false
seed

The seed value for the randomization; overrides the value defined by the split generator scheme.

command-line -seed <long>

default 1
percentage

The percentage for the split (between 0 and 1); overrides the value defined by the split generator scheme.

command-line -percentage <double>

default 0.66

minimum 1.0E-4

maximum 0.9999
createView

If enabled, views of the dataset are created instead of actual copies; overrides the value defined by the split generator scheme.

command-line -create-view <boolean>

default false
generator

The scheme to use for generating the split; the actor options take precedence over the scheme's ones.

command-line -generator <weka.classifiers.RandomSplitGenerator>

default weka.classifiers.DefaultRandomSplitGenerator

command-line	`-logging-level <OFF\|SEVERE\|WARNING\|INFO\|CONFIG\|FINE\|FINER\|FINEST>`
default	`WARNING`
min-user-mode	`Expert`

command-line	`-name <java.lang.String>`
default	`WekaRandomSplit`

command-line	`-annotation <adams.core.base.BaseAnnotation>`
default

command-line	`-stop-flow-on-error <boolean>`
default	`false`
min-user-mode	`Expert`

command-line	`-silent <boolean>`
default	`false`
min-user-mode	`Expert`

command-line	`-percentage <double>`
default	`0.66`
minimum	`1.0E-4`
maximum	`0.9999`

command-line	`-generator <weka.classifiers.RandomSplitGenerator>`
default	`weka.classifiers.DefaultRandomSplitGenerator`

command-line	`-seed <long>`
default	`1`