Package adams.flow.transformer
Class SpreadSheetAggregate
-
- All Implemented Interfaces:
AdditionalInformationHandler
,CleanUpHandler
,Destroyable
,GlobalInfoSupporter
,LoggingLevelHandler
,LoggingSupporter
,OptionHandler
,QuickInfoSupporter
,ShallowCopySupporter<Actor>
,SizeOfHandler
,Stoppable
,StoppableWithFeedback
,VariablesInspectionHandler
,VariableChangeListener
,Actor
,ErrorHandler
,InputConsumer
,OutputProducer
,Serializable
,Comparable
public class SpreadSheetAggregate extends AbstractSpreadSheetTransformer
Aggregates rows (min, max, avg, etc) in a spreadsheet using key columns.
All numeric columns in the specified aggregrate range (excluding the key columns) get aggregated. For each of the specified aggregates a new column is generated.
If no key column(s) provided, the complete spreadsheet is used for aggregation.
Missing cells get ignored.
COUNT doesn't need any numeric columns.
Note: A single non-numeric cell makes a column a non-numeric one!
Input/output:
- accepts:
adams.data.spreadsheet.SpreadSheet
- generates:
adams.data.spreadsheet.SpreadSheet
-logging-level <OFF|SEVERE|WARNING|INFO|CONFIG|FINE|FINER|FINEST> (property: loggingLevel) The logging level for outputting errors and debugging output. default: WARNING min-user-mode: Expert
-name <java.lang.String> (property: name) The name of the actor. default: SpreadSheetAggregate
-annotation <adams.core.base.BaseAnnotation> (property: annotations) The annotations to attach to this actor. default:
-skip <boolean> (property: skip) If set to true, transformation is skipped and the input token is just forwarded as it is. default: false
-stop-flow-on-error <boolean> (property: stopFlowOnError) If set to true, the flow execution at this level gets stopped in case this actor encounters an error; the error gets propagated; useful for critical actors. default: false min-user-mode: Expert
-silent <boolean> (property: silent) If enabled, then no errors are output in the console; Note: the enclosing actor handler must have this enabled as well. default: false min-user-mode: Expert
-key-columns <adams.data.spreadsheet.SpreadSheetColumnRange> (property: keyColumns) The columns to use as keys for identifying rows in the spreadsheets; if left empty, all rows are used. default: example: A range is a comma-separated list of single 1-based indices or sub-ranges of indices ('start-end'); 'inv(...)' inverts the range '...'; column names (case-sensitive) as well as the following placeholders can be used: first, second, third, last_2, last_1, last; numeric indices can be enforced by preceding them with '#' (eg '#12'); column names can be surrounded by double quotes.
-aggregate-columns <adams.data.spreadsheet.SpreadSheetColumnRange> (property: aggregateColumns) The columns to aggregate (only numeric ones will be used). default: first-last example: A range is a comma-separated list of single 1-based indices or sub-ranges of indices ('start-end'); 'inv(...)' inverts the range '...'; column names (case-sensitive) as well as the following placeholders can be used: first, second, third, last_2, last_1, last; numeric indices can be enforced by preceding them with '#' (eg '#12'); column names can be surrounded by double quotes.
-aggregate <COUNT|SUM|MIN|MAX|RANGE|AVERAGE|MEDIAN|STDEV|STDEVP|INTERQUARTILE|QUARTILE25|QUARTILE75> [-aggregate ...] (property: aggregates) The aggregates to calculate and introduce as columns. default: SUM
- Author:
- fracpete (fracpete at waikato dot ac dot nz)
- See Also:
- Serialized Form
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static class
SpreadSheetAggregate.Aggregate
The types of aggregates to generate.
-
Field Summary
Fields Modifier and Type Field Description protected SpreadSheetColumnRange
m_AggregateColumns
the range of columns to aggregate.protected SpreadSheetAggregate.Aggregate[]
m_Aggregates
the aggregates to generate.protected SpreadSheetColumnRange
m_KeyColumns
the range of column indices to use as key for identifying a row.-
Fields inherited from class adams.flow.transformer.AbstractTransformer
BACKUP_INPUT, BACKUP_OUTPUT, m_InputToken, m_OutputToken
-
Fields inherited from class adams.flow.core.AbstractActor
m_Annotations, m_BackupState, m_DetectedObjectVariables, m_DetectedVariables, m_ErrorHandler, m_Executed, m_Executing, m_ExecutionListeningSupporter, m_FullName, m_LoggingPrefix, m_Name, m_Parent, m_ScopeHandler, m_Self, m_Silent, m_Skip, m_StopFlowOnError, m_StopMessage, m_Stopped, m_StorageHandler, m_VariablesUpdated
-
Fields inherited from class adams.core.option.AbstractOptionHandler
m_OptionManager
-
Fields inherited from class adams.core.logging.LoggingObject
m_Logger, m_LoggingIsEnabled, m_LoggingLevel
-
Fields inherited from interface adams.flow.core.Actor
FILE_EXTENSION, FILE_EXTENSION_GZ
-
-
Constructor Summary
Constructors Constructor Description SpreadSheetAggregate()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description String
aggregateColumnsTipText()
Returns the tip text for this property.String
aggregatesTipText()
Returns the tip text for this property.protected HashMap<SpreadSheetAggregate.Aggregate,Number>
computeAggregates(SpreadSheet input, List<Integer> subset, int index)
Computes the aggregates.void
defineOptions()
Adds options to the internal list of options.protected String
doExecute()
Executes the flow item.SpreadSheetColumnRange
getAggregateColumns()
Returns the colums that should get aggregated.SpreadSheetAggregate.Aggregate[]
getAggregates()
Returns the aggregates to calculate.SpreadSheetColumnRange
getKeyColumns()
Returns the colums that identify a rowxString
getQuickInfo()
Returns a quick info about the actor, which will be displayed in the GUI.String
globalInfo()
Returns a string describing the object.String
keyColumnsTipText()
Returns the tip text for this property.void
setAggregateColumns(SpreadSheetColumnRange value)
Sets the colums that should get aggregated.void
setAggregates(SpreadSheetAggregate.Aggregate[] value)
Sets the aggregates to calculate.void
setKeyColumns(SpreadSheetColumnRange value)
Sets the colums that identify a row.-
Methods inherited from class adams.flow.transformer.AbstractSpreadSheetTransformer
accepts, generates
-
Methods inherited from class adams.flow.transformer.AbstractTransformer
backupState, currentInput, execute, hasInput, hasPendingOutput, input, output, postExecute, restoreState, wrapUp
-
Methods inherited from class adams.flow.core.AbstractActor
annotationsTipText, canInspectOptions, canPerformSetUpCheck, cleanUp, compareTo, configureLogger, destroy, equals, finalUpdateVariables, findVariables, findVariables, forceVariables, forCommandLine, forName, forName, getAdditionalInformation, getAnnotations, getDefaultName, getDetectedVariables, getErrorHandler, getFlowActors, getFlowExecutionListeningSupporter, getFullName, getName, getNextSibling, getParent, getParentComponent, getPreviousSibling, getRoot, getScopeHandler, getSilent, getSkip, getStopFlowOnError, getStopMessage, getStorageHandler, getVariables, handleError, handleException, hasErrorHandler, hasStopMessage, index, initialize, isBackedUp, isExecuted, isExecuting, isFinished, isHeadless, isStopped, nameTipText, performSetUpChecks, performVariableChecks, preExecute, pruneBackup, pruneBackup, reset, setAnnotations, setErrorHandler, setName, setParent, setSilent, setSkip, setStopFlowOnError, setUp, setVariables, shallowCopy, shallowCopy, silentTipText, sizeOf, skipTipText, stopExecution, stopExecution, stopFlowOnErrorTipText, updateDetectedVariables, updatePrefix, updateVariables, variableChanged
-
Methods inherited from class adams.core.option.AbstractOptionHandler
cleanUpOptions, finishInit, getDefaultLoggingLevel, getOptionManager, loggingLevelTipText, newOptionManager, setLoggingLevel, toCommandLine, toString
-
Methods inherited from class adams.core.logging.LoggingObject
getLogger, getLoggingLevel, initializeLogging, isLoggingEnabled
-
Methods inherited from class java.lang.Object
clone, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
-
Methods inherited from interface adams.flow.core.Actor
cleanUp, compareTo, destroy, equals, findVariables, getAnnotations, getDefaultName, getDetectedVariables, getErrorHandler, getFlowExecutionListeningSupporter, getFullName, getName, getNextSibling, getParent, getParentComponent, getPreviousSibling, getRoot, getScopeHandler, getSilent, getSkip, getStopFlowOnError, getStopMessage, getStorageHandler, getVariables, handleError, hasErrorHandler, hasStopMessage, index, isExecuted, isFinished, isHeadless, isStopped, setAnnotations, setErrorHandler, setName, setParent, setSilent, setSkip, setStopFlowOnError, setUp, setVariables, shallowCopy, shallowCopy, sizeOf, stopExecution, stopExecution, toCommandLine, variableChanged
-
Methods inherited from interface adams.core.AdditionalInformationHandler
getAdditionalInformation
-
Methods inherited from interface adams.core.logging.LoggingLevelHandler
getLoggingLevel, setLoggingLevel
-
Methods inherited from interface adams.core.logging.LoggingSupporter
getLogger, isLoggingEnabled
-
Methods inherited from interface adams.core.option.OptionHandler
cleanUpOptions, getOptionManager
-
Methods inherited from interface adams.core.VariablesInspectionHandler
canInspectOptions
-
-
-
-
Field Detail
-
m_KeyColumns
protected SpreadSheetColumnRange m_KeyColumns
the range of column indices to use as key for identifying a row.
-
m_AggregateColumns
protected SpreadSheetColumnRange m_AggregateColumns
the range of columns to aggregate.
-
m_Aggregates
protected SpreadSheetAggregate.Aggregate[] m_Aggregates
the aggregates to generate.
-
-
Method Detail
-
globalInfo
public String globalInfo()
Returns a string describing the object.- Specified by:
globalInfo
in interfaceGlobalInfoSupporter
- Specified by:
globalInfo
in classAbstractOptionHandler
- Returns:
- a description suitable for displaying in the gui
-
defineOptions
public void defineOptions()
Adds options to the internal list of options.- Specified by:
defineOptions
in interfaceOptionHandler
- Overrides:
defineOptions
in classAbstractActor
-
setKeyColumns
public void setKeyColumns(SpreadSheetColumnRange value)
Sets the colums that identify a row.- Parameters:
value
- the range
-
getKeyColumns
public SpreadSheetColumnRange getKeyColumns()
Returns the colums that identify a rowx- Returns:
- the range
-
keyColumnsTipText
public String keyColumnsTipText()
Returns the tip text for this property.- Returns:
- tip text for this property suitable for displaying in the GUI or for listing the options.
-
setAggregateColumns
public void setAggregateColumns(SpreadSheetColumnRange value)
Sets the colums that should get aggregated.- Parameters:
value
- the range
-
getAggregateColumns
public SpreadSheetColumnRange getAggregateColumns()
Returns the colums that should get aggregated.- Returns:
- the range
-
aggregateColumnsTipText
public String aggregateColumnsTipText()
Returns the tip text for this property.- Returns:
- tip text for this property suitable for displaying in the GUI or for listing the options.
-
setAggregates
public void setAggregates(SpreadSheetAggregate.Aggregate[] value)
Sets the aggregates to calculate.- Parameters:
value
- the aggregates
-
getAggregates
public SpreadSheetAggregate.Aggregate[] getAggregates()
Returns the aggregates to calculate.- Returns:
- the aggregates
-
aggregatesTipText
public String aggregatesTipText()
Returns the tip text for this property.- Returns:
- tip text for this property suitable for displaying in the GUI or for listing the options.
-
getQuickInfo
public String getQuickInfo()
Returns a quick info about the actor, which will be displayed in the GUI.- Specified by:
getQuickInfo
in interfaceActor
- Specified by:
getQuickInfo
in interfaceQuickInfoSupporter
- Overrides:
getQuickInfo
in classAbstractActor
- Returns:
- null if no info available, otherwise short string
-
computeAggregates
protected HashMap<SpreadSheetAggregate.Aggregate,Number> computeAggregates(SpreadSheet input, List<Integer> subset, int index)
Computes the aggregates.- Parameters:
input
- the original sheetsubset
- the subset of rows to use for the computation, null if all rowsindex
- the column in the original spreadsheet- Returns:
- the computed values
-
doExecute
protected String doExecute()
Executes the flow item.- Specified by:
doExecute
in classAbstractActor
- Returns:
- null if everything is fine, otherwise error message
-
-