adams.flow.transformer.WekaInstancesMerge

Name

adams.flow.transformer.WekaInstancesMerge

Synopsis

Merges multiple datasets, either from file or using Instances/Instance objects.
If no 'ID' attribute is named, then all datasets must contain the same number of rows.
Attributes can be excluded from ending up in the final dataset via a regular expression. They can also be prefixed with name and/or index.

Additional information

Flow input/output:
- input: java.lang.String[], java.io.File[], weka.core.Instance[], weka.core.Instances[]
- output: weka.core.Instances

Options

loggingLevel

The logging level for outputting errors and debugging output.

command-line -logging-level <OFF|SEVERE|WARNING|INFO|CONFIG|FINE|FINER|FINEST>

default WARNING

min-user-mode Expert
name

The name of the actor.

command-line -name <java.lang.String>

default WekaInstancesMerge
annotations

The annotations to attach to this actor.

command-line -annotation <adams.core.base.BaseAnnotation>

default
skip

If set to true, transformation is skipped and the input token is just forwarded as it is.

command-line -skip <boolean>

default false
stopFlowOnError

If set to true, the flow execution at this level gets stopped in case this actor encounters an error; the error gets propagated; useful for critical actors.

command-line -stop-flow-on-error <boolean>

default false

min-user-mode Expert
silent

If enabled, then no errors are output in the console; Note: the enclosing actor handler must have this enabled as well.

command-line -silent <boolean>

default false

min-user-mode Expert
usePrefix

Whether to prefix the attribute names of each dataset with an index and an optional string.

command-line -use-prefix <boolean>

default false
addIndex

Whether to add the index of the dataset to the prefix.

command-line -add-index <boolean>

default false
remove

If true, only keep instances where data is available from each source.

command-line -remove <boolean>

default false
prefix

The optional prefix string to prefix the index number with (in case prefixes are used); '@' is a placeholder for the relation name.

command-line -prefix <java.lang.String>

default dataset
prefixSeparator

The separator string between the generated prefix and the original attribute name.

command-line -prefix-separator <java.lang.String>

default -
excludedAttributes

The regular expression used on the attribute names, to determine whether an attribute should be excluded or not (matching sense can be inverted); leave empty to include all attributes.

command-line -exclude-atts <java.lang.String>

default
invertMatchingSense

Whether to invert the matching sense of excluding attributes, ie, the regular expression is used for including attributes.

command-line -invert <boolean>

default false
uniqueID

The name of the attribute (string/numeric) used for uniquely identifying rows among the datasets.

command-line -unique-id <java.lang.String>

default
keepOnlySingleUniqueID

If enabled, only a single instance of the unique ID attribute is kept.

command-line -keep-only-single-unique-id <boolean>

default false
strict

If enabled, ensures that IDs in unique ID column are truly unique.

command-line -strict <boolean>

default false

command-line	`-logging-level <OFF\|SEVERE\|WARNING\|INFO\|CONFIG\|FINE\|FINER\|FINEST>`
default	`WARNING`
min-user-mode	`Expert`

command-line	`-name <java.lang.String>`
default	`WekaInstancesMerge`

command-line	`-annotation <adams.core.base.BaseAnnotation>`
default

command-line	`-stop-flow-on-error <boolean>`
default	`false`
min-user-mode	`Expert`

command-line	`-silent <boolean>`
default	`false`
min-user-mode	`Expert`

command-line	`-prefix-separator <java.lang.String>`
default	`-`

command-line	`-keep-only-single-unique-id <boolean>`
default	`false`

Name

Synopsis

Additional information

See also

Options