adams.flow.transformer.WekaInstancesMerge
Merges multiple datasets, either from file or using Instances/Instance objects.
If no 'ID' attribute is named, then all datasets must contain the same number of rows.
Attributes can be excluded from ending up in the final dataset via a regular expression. They can also be prefixed with name and/or index.
Flow input/output:
- input: java.lang.String[], java.io.File[], weka.core.Instance[], weka.core.Instances[]
- output: weka.core.Instances
The logging level for outputting errors and debugging output.
command-line | -logging-level <OFF|SEVERE|WARNING|INFO|CONFIG|FINE|FINER|FINEST> |
default | WARNING |
min-user-mode | Expert |
The name of the actor.
command-line | -name <java.lang.String> |
default | WekaInstancesMerge |
The annotations to attach to this actor.
command-line | -annotation <adams.core.base.BaseAnnotation> |
default |
|
If set to true, transformation is skipped and the input token is just forwarded as it is.
command-line | -skip <boolean> |
default | false |
If set to true, the flow execution at this level gets stopped in case this actor encounters an error; the error gets propagated; useful for critical actors.
command-line | -stop-flow-on-error <boolean> |
default | false |
min-user-mode | Expert |
If enabled, then no errors are output in the console; Note: the enclosing actor handler must have this enabled as well.
command-line | -silent <boolean> |
default | false |
min-user-mode | Expert |
Whether to prefix the attribute names of each dataset with an index and an optional string.
command-line | -use-prefix <boolean> |
default | false |
Whether to add the index of the dataset to the prefix.
command-line | -add-index <boolean> |
default | false |
If true, only keep instances where data is available from each source.
command-line | -remove <boolean> |
default | false |
The optional prefix string to prefix the index number with (in case prefixes are used); '@' is a placeholder for the relation name.
command-line | -prefix <java.lang.String> |
default | dataset |
The separator string between the generated prefix and the original attribute name.
command-line | -prefix-separator <java.lang.String> |
default | - |
The regular expression used on the attribute names, to determine whether an attribute should be excluded or not (matching sense can be inverted); leave empty to include all attributes.
command-line | -exclude-atts <java.lang.String> |
default |
|
Whether to invert the matching sense of excluding attributes, ie, the regular expression is used for including attributes.
command-line | -invert <boolean> |
default | false |
The name of the attribute (string/numeric) used for uniquely identifying rows among the datasets.
command-line | -unique-id <java.lang.String> |
default |
|
If enabled, only a single instance of the unique ID attribute is kept.
command-line | -keep-only-single-unique-id <boolean> |
default | false |
If enabled, ensures that IDs in unique ID column are truly unique.
command-line | -strict <boolean> |
default | false |