java.lang.Object
  weka.experiment.ExplicitTestsetResultProducer
public class ExplicitTestsetResultProducer
Loads the external test set and calls the appropriate SplitEvaluator to generate some results.
The filename of the test set is constructed as follows:
<dir> + / + <prefix> + <relation-name> + <suffix>
The relation-name can be modified by using the regular expression to replace the matching sub-string with a specified replacement string. In order to get rid of the string that the Weka filters add to the end of the relation name, just use '.*-weka' as the regular expression to find.
The suffix determines the type of file to load, i.e., one is not restricted to ARFF files. As long as Weka recognizes the extension specified in the suffix, the data will be loaded with one of Weka's converters.
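The filename rule described above can be sketched in plain Java. This is only an illustration, not the class's actual implementation: the class name `TestsetFilenameSketch` and the helper `buildTestsetFilename` are hypothetical, and the sketch assumes that the find/replace step behaves like `String.replaceAll` and that an empty find expression is skipped.

```java
public class TestsetFilenameSketch {

    /**
     * Builds <dir> + "/" + <prefix> + <relation-name> + <suffix>.
     * The find/replace step is applied to the relation name first
     * (assumption: String.replaceAll semantics); an empty find
     * expression leaves the relation name unchanged.
     */
    static String buildTestsetFilename(String dir, String prefix, String relationName,
                                       String suffix, String find, String replace) {
        String name = relationName;
        if (!find.isEmpty()) {
            name = name.replaceAll(find, replace);
        }
        return dir + "/" + prefix + name + suffix;
    }

    public static void main(String[] args) {
        // Strip a trailing "_train" from the relation name, then use the default suffix.
        System.out.println(buildTestsetFilename(
                "testsets", "", "iris_train", "_test.arff", "_train$", ""));
        // prints "testsets/iris_test.arff"

        // No find expression, a prefix, and a non-ARFF suffix.
        System.out.println(buildTestsetFilename(
                ".", "holdout-", "iris", ".csv", "", ""));
        // prints "./holdout-iris.csv"
    }
}
```

Under `replaceAll` semantics, characters such as `$` and `\` have a special meaning in the replacement string, so a literal replacement containing them would need escaping.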
-D Save raw split evaluator output.
-O <file/directory name/path> The filename where raw output will be stored. If a directory name is specified, then individual outputs will be gzipped; otherwise all output will be zipped to the named file. Use in conjunction with -D. (default: splitEvalutorOut.zip)
-W <class name> The full class name of a SplitEvaluator. eg: weka.experiment.ClassifierSplitEvaluator
-R Set when data is to be randomized.
-dir <directory> The directory containing the test sets. (default: current directory)
-prefix <string> An optional prefix for the test sets (before the relation name). (default: empty string)
-suffix <string> The suffix to append to the test set. (default: _test.arff)
-find <regular expression> The regular expression to search the relation name with. Not used if an empty string. (default: empty string)
-replace <string> The replacement string for all the matches of '-find'. (default: empty string)
Options specific to split evaluator weka.experiment.ClassifierSplitEvaluator:
-W <class name> The full class name of the classifier. eg: weka.classifiers.bayes.NaiveBayes
-C <index> The index of the class for which IR statistics are to be output. (default 1)
-I <index> The index of an attribute to output in the results. This attribute should identify an instance in order to know which instances are in the test set of a cross validation. If 0, no output (default 0).
-P Add target and prediction columns to the result for each fold.
Options specific to classifier weka.classifiers.rules.ZeroR:
-D If set, classifier is run in debug mode and may output additional info to the console.
All options after -- will be passed to the split evaluator.
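As a rough illustration of how the options above fit together, the following sketch assembles an option array in plain Java. The class `ProducerOptionsSketch` and the helper `buildOptions` are hypothetical names introduced here; the flags themselves are the ones listed above. The resulting array is the kind of argument that setOptions(String[] options) expects, with everything after `--` intended for the split evaluator.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class ProducerOptionsSketch {

    /**
     * Assembles producer options followed by "--" and the options
     * that are meant for the split evaluator.
     */
    static String[] buildOptions(String testsetDir, String suffix,
                                 String evaluatorClass, String[] evaluatorOptions) {
        List<String> opts = new ArrayList<>();
        opts.add("-dir");
        opts.add(testsetDir);
        opts.add("-suffix");
        opts.add(suffix);
        opts.add("-W");
        opts.add(evaluatorClass);
        // Everything after "--" is passed on to the split evaluator.
        opts.add("--");
        opts.addAll(Arrays.asList(evaluatorOptions));
        return opts.toArray(new String[0]);
    }

    public static void main(String[] args) {
        String[] options = buildOptions(
                "testsets",
                "_test.arff",
                "weka.experiment.ClassifierSplitEvaluator",
                new String[] {"-W", "weka.classifiers.rules.ZeroR"});
        System.out.println(String.join(" ", options));
        // prints "-dir testsets -suffix _test.arff -W weka.experiment.ClassifierSplitEvaluator -- -W weka.classifiers.rules.ZeroR"
    }
}
```

Note that -W appears twice with different meanings: before `--` it names the SplitEvaluator, after `--` it names the classifier.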
| Field Summary | |
|---|---|
| static String | DATASET_FIELD_NAME - The name of the key field containing the dataset name. |
| static String | DEFAULT_SUFFIX - The default suffix. |
| static String | RUN_FIELD_NAME - The name of the key field containing the run number. |
| static String | TIMESTAMP_FIELD_NAME - The name of the result field containing the timestamp. |
| Constructor Summary | |
|---|---|
| ExplicitTestsetResultProducer() | |
| Method Summary | |
|---|---|
| void | doRun(int run) - Gets the results for a specified run number. |
| void | doRunKeys(int run) - Gets the keys for a specified run number. |
| Enumeration | enumerateMeasures() - Returns an enumeration of any additional measure names that might be in the SplitEvaluator. |
| String | getCompatibilityState() - Gets a description of the internal settings of the result producer, sufficient for distinguishing a ResultProducer instance from another with different settings (ignoring those settings set through this interface). |
| String[] | getKeyNames() - Gets the names of each of the columns produced for a single run. |
| Object[] | getKeyTypes() - Gets the data types of each of the columns produced for a single run. |
| double | getMeasure(String additionalMeasureName) - Returns the value of the named measure. |
| String[] | getOptions() - Gets the current settings of the result producer. |
| File | getOutputFile() - Get the value of OutputFile. |
| boolean | getRandomizeData() - Get if dataset is to be randomized. |
| boolean | getRawOutput() - Get if raw split evaluator output is to be saved. |
| String | getRelationFind() - Returns the currently set regular expression to use on the relation name. |
| String | getRelationReplace() - Returns the currently set replacement string to use on the relation name. |
| String[] | getResultNames() - Gets the names of each of the columns produced for a single run. |
| Object[] | getResultTypes() - Gets the data types of each of the columns produced for a single run. |
| String | getRevision() - Returns the revision string. |
| SplitEvaluator | getSplitEvaluator() - Get the SplitEvaluator. |
| File | getTestsetDir() - Returns the currently set directory for the test sets. |
| String | getTestsetPrefix() - Returns the currently set prefix. |
| String | getTestsetSuffix() - Returns the currently set suffix. |
| static Double | getTimestamp() - Gets a Double representing the current date and time. |
| String | globalInfo() - Returns a string describing this result producer. |
| Enumeration | listOptions() - Returns an enumeration describing the available options. |
| String | outputFileTipText() - Returns the tip text for this property. |
| void | postProcess() - Perform any postprocessing. |
| void | preProcess() - Prepare to generate results. |
| String | randomizeDataTipText() - Returns the tip text for this property. |
| String | rawOutputTipText() - Returns the tip text for this property. |
| String | relationFindTipText() - Returns the tip text for this property. |
| String | relationReplaceTipText() - Returns the tip text for this property. |
| void | setAdditionalMeasures(String[] additionalMeasures) - Set a list of method names for additional measures to look for in SplitEvaluators. |
| void | setInstances(Instances instances) - Sets the dataset that results will be obtained for. |
| void | setOptions(String[] options) - Parses a given list of options. |
| void | setOutputFile(File value) - Set the value of OutputFile. |
| void | setRandomizeData(boolean value) - Set to true if dataset is to be randomized. |
| void | setRawOutput(boolean value) - Set to true if raw split evaluator output is to be saved. |
| void | setRelationFind(String value) - Sets the regular expression to use on the relation name. |
| void | setRelationReplace(String value) - Sets the replacement string to use on the relation name. |
| void | setResultListener(ResultListener listener) - Sets the object to send results of each run to. |
| void | setSplitEvaluator(SplitEvaluator value) - Set the SplitEvaluator. |
| void | setTestsetDir(File value) - Sets the directory to use for the test sets. |
| void | setTestsetPrefix(String value) - Sets the prefix to use for the test sets. |
| void | setTestsetSuffix(String value) - Sets the suffix to use for the test sets. |
| String | splitEvaluatorTipText() - Returns the tip text for this property. |
| String | testsetDirTipText() - Returns the tip text for this property. |
| String | testsetPrefixTipText() - Returns the tip text for this property. |
| String | testsetSuffixTipText() - Returns the tip text for this property. |
| String | toString() - Gets a text description of the result producer. |
| Methods inherited from class java.lang.Object |
|---|
equals, getClass, hashCode, notify, notifyAll, wait, wait, wait |
| Field Detail |
|---|
public static final String DEFAULT_SUFFIX
public static String DATASET_FIELD_NAME
public static String RUN_FIELD_NAME
public static String TIMESTAMP_FIELD_NAME
| Constructor Detail |
|---|
public ExplicitTestsetResultProducer()
| Method Detail |
|---|
public String globalInfo()
public Enumeration listOptions()
listOptions in interface OptionHandler
public void setOptions(String[] options)
throws Exception
-D Save raw split evaluator output.
-O <file/directory name/path> The filename where raw output will be stored. If a directory name is specified, then individual outputs will be gzipped; otherwise all output will be zipped to the named file. Use in conjunction with -D. (default: splitEvalutorOut.zip)
-W <class name> The full class name of a SplitEvaluator. eg: weka.experiment.ClassifierSplitEvaluator
-R Set when data is to be randomized.
-dir <directory> The directory containing the test sets. (default: current directory)
-prefix <string> An optional prefix for the test sets (before the relation name). (default: empty string)
-suffix <string> The suffix to append to the test set. (default: _test.arff)
-find <regular expression> The regular expression to search the relation name with. Not used if an empty string. (default: empty string)
-replace <string> The replacement string for all the matches of '-find'. (default: empty string)
Options specific to split evaluator weka.experiment.ClassifierSplitEvaluator:
-W <class name> The full class name of the classifier. eg: weka.classifiers.bayes.NaiveBayes
-C <index> The index of the class for which IR statistics are to be output. (default 1)
-I <index> The index of an attribute to output in the results. This attribute should identify an instance in order to know which instances are in the test set of a cross validation. If 0, no output (default 0).
-P Add target and prediction columns to the result for each fold.
Options specific to classifier weka.classifiers.rules.ZeroR:
-D If set, classifier is run in debug mode and may output additional info to the console.
All options after -- will be passed to the split evaluator.
setOptions in interface OptionHandler
options - the list of options as an array of strings
Exception - if an option is not supported
public String[] getOptions()
getOptions in interface OptionHandler
public void setInstances(Instances instances)
setInstances in interface ResultProducer
instances - a value of type 'Instances'.
public void setAdditionalMeasures(String[] additionalMeasures)
setAdditionalMeasures in interface ResultProducer
additionalMeasures - an array of measure names, null if none
public Enumeration enumerateMeasures()
enumerateMeasures in interface AdditionalMeasureProducer
public double getMeasure(String additionalMeasureName)
getMeasure in interface AdditionalMeasureProducer
additionalMeasureName - the name of the measure to query for its value
IllegalArgumentException - if the named measure is not supported
public void setResultListener(ResultListener listener)
setResultListener in interface ResultProducer
listener - a value of type 'ResultListener'
public static Double getTimestamp()
public void preProcess()
throws Exception
preProcess in interface ResultProducer
Exception - if an error occurs during preprocessing.
public void postProcess()
throws Exception
postProcess in interface ResultProducer
Exception - if an error occurs
public void doRunKeys(int run)
throws Exception
doRunKeys in interface ResultProducer
run - the run number to get keys for.
Exception - if a problem occurs while getting the keys
public void doRun(int run)
throws Exception
doRun in interface ResultProducer
run - the run number to get results for.
Exception - if a problem occurs while getting the results
public String[] getKeyNames()
getKeyNames in interface ResultProducer
public Object[] getKeyTypes()
getKeyTypes in interface ResultProducer
public String[] getResultNames()
getResultNames in interface ResultProducer
public Object[] getResultTypes()
getResultTypes in interface ResultProducer
public String getCompatibilityState()
getCompatibilityState in interface ResultProducer
public String outputFileTipText()
public File getOutputFile()
public void setOutputFile(File value)
value - Value to assign to OutputFile.
public String randomizeDataTipText()
public boolean getRandomizeData()
public void setRandomizeData(boolean value)
value - true if dataset is to be randomized
public String rawOutputTipText()
public boolean getRawOutput()
public void setRawOutput(boolean value)
value - true if output is to be saved
public String splitEvaluatorTipText()
public SplitEvaluator getSplitEvaluator()
public void setSplitEvaluator(SplitEvaluator value)
value - new SplitEvaluator to use.
public String testsetDirTipText()
public File getTestsetDir()
public void setTestsetDir(File value)
value - the directory to use
public String testsetPrefixTipText()
public String getTestsetPrefix()
public void setTestsetPrefix(String value)
value - the prefix
public String testsetSuffixTipText()
public String getTestsetSuffix()
public void setTestsetSuffix(String value)
value - the suffix
public String relationFindTipText()
public String getRelationFind()
public void setRelationFind(String value)
value - the regular expression
public String relationReplaceTipText()
public String getRelationReplace()
public void setRelationReplace(String value)
value - the replacement string
public String toString()
toString in class Object
public String getRevision()
getRevision in interface RevisionHandler