java.lang.Object
  weka.filters.Filter
    weka.filters.SimpleFilter
      weka.filters.SimpleBatchFilter
        weka.filters.unsupervised.instance.RemoveDuplicates
public class RemoveDuplicates
Removes all duplicate instances.
Valid options are:
-include-class Whether to include the class attribute in the comparison as well.
-randomize Whether to randomize the data after the removal process.
-S <int> Specifies the seed value for randomization. (default: 42)
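The effect of the three options can be illustrated with a minimal, stand-alone sketch. It does not use Weka and is not the filter's actual implementation: the class `RemoveDuplicatesSketch`, the representation of an instance as a list of strings, the placement of the class value in the last position, and the choice to keep the first occurrence of each duplicate are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashSet;
import java.util.List;
import java.util.Random;
import java.util.Set;

/** Hypothetical stand-alone sketch; not part of Weka. */
class RemoveDuplicatesSketch {

    /**
     * Drops rows that repeat an earlier row, keeping the first occurrence.
     * Assumption: each row is non-empty and its last element is the class value.
     */
    static List<List<String>> removeDuplicates(List<List<String>> rows,
                                               boolean includeClass,
                                               boolean randomize,
                                               int seed) {
        Set<List<String>> seen = new HashSet<>();
        List<List<String>> result = new ArrayList<>();
        for (List<String> row : rows) {
            // Comparison key: the whole row, or the row without its class value.
            List<String> key = includeClass
                    ? new ArrayList<>(row)
                    : new ArrayList<>(row.subList(0, row.size() - 1));
            if (seen.add(key)) {
                result.add(row);
            }
        }
        if (randomize) {
            // A fixed seed makes the shuffled order reproducible.
            Collections.shuffle(result, new Random(seed));
        }
        return result;
    }

    public static void main(String[] args) {
        List<List<String>> rows = List.of(
                List.of("sunny", "hot", "yes"),
                List.of("sunny", "hot", "no"),
                List.of("rainy", "mild", "yes"),
                List.of("sunny", "hot", "yes"));
        // Class value ignored: rows 2 and 4 both duplicate row 1.
        System.out.println(removeDuplicates(rows, false, false, 42).size()); // prints 2
        // Class value compared: only row 4 duplicates row 1.
        System.out.println(removeDuplicates(rows, true, false, 42).size());  // prints 3
    }
}
```

In this sketch, leaving the class value out of the comparison removes more rows, because rows that differ only in their class value count as duplicates.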
| Field Summary | |
|---|---|
| protected boolean | m_IncludeClass: whether to take the class into account. |
| protected boolean | m_Randomize: whether to randomize the data after the removal. |
| protected int | m_Seed: the seed value for the randomization. |
| Fields inherited from class weka.filters.SimpleFilter |
|---|
| m_Debug |
| Fields inherited from class weka.filters.Filter |
|---|
| m_FirstBatchDone, m_InputRelAtts, m_InputStringAtts, m_NewBatch, m_OutputRelAtts, m_OutputStringAtts |
| Constructor Summary | |
|---|---|
| RemoveDuplicates() | |
| Method Summary | |
|---|---|
| protected weka.core.Instances | determineOutputFormat(weka.core.Instances inputFormat): Determines the output format based on the input format and returns it. |
| weka.core.Capabilities | getCapabilities(): Returns the Capabilities of this filter. |
| boolean | getIncludeClass(): Returns whether to include the class attribute in the comparison. |
| String[] | getOptions(): Gets the current settings of the filter. |
| boolean | getRandomize(): Returns whether to randomize the data after the removal. |
| String | getRevision(): Returns the revision string. |
| int | getSeed(): Gets the seed for random number generation. |
| String | globalInfo(): Returns a string describing this filter. |
| String | includeClassTipText(): Returns the tip text for this property. |
| Enumeration | listOptions(): Returns an enumeration describing the available options. |
| static void | main(String[] args): Main method for running this filter. |
| protected weka.core.Instances | process(weka.core.Instances instances): Processes the given data (may change the provided dataset) and returns the modified version. |
| String | randomizeTipText(): Returns the tip text for this property. |
| String | seedTipText(): Returns the tip text for this property. |
| void | setIncludeClass(boolean value): Sets whether to include the class attribute in the comparison. |
| void | setOptions(String[] options): Parses a given list of options. |
| void | setRandomize(boolean value): Sets whether to randomize the data after the removal. |
| void | setSeed(int value): Sets the seed for random number generation. |
| Methods inherited from class weka.filters.SimpleBatchFilter |
|---|
| batchFinished, hasImmediateOutputFormat, input |
| Methods inherited from class weka.filters.SimpleFilter |
|---|
| debugTipText, getDebug, reset, setDebug, setInputFormat |
| Methods inherited from class weka.filters.Filter |
|---|
| batchFilterFile, bufferInput, copyValues, copyValues, filterFile, flushInput, getCapabilities, getInputFormat, getOutputFormat, initInputLocators, initOutputLocators, inputFormatPeek, isFirstBatchDone, isNewBatch, isOutputFormatDefined, makeCopies, makeCopy, mayRemoveInstanceAfterFirstBatchDone, numPendingOutput, output, outputFormatPeek, outputPeek, push, resetQueue, runFilter, setOutputFormat, testInputFormat, toString, useFilter, wekaStaticWrapper |
| Methods inherited from class java.lang.Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
| Field Detail |
|---|
protected boolean m_IncludeClass
protected boolean m_Randomize
protected int m_Seed
| Constructor Detail |
|---|
public RemoveDuplicates()
| Method Detail |
|---|
public String globalInfo()
globalInfo in class weka.filters.SimpleFilter
public Enumeration listOptions()
listOptions in interface weka.core.OptionHandler
listOptions in class weka.filters.SimpleFilter
public void setOptions(String[] options)
throws Exception
-include-class Whether to include the class attribute in the comparison as well.
-randomize Whether to randomize the data after the removal process.
-S <int> Specifies the seed value for randomization. (default: 42)
setOptions in interface weka.core.OptionHandler
setOptions in class weka.filters.SimpleFilter
options - the list of options as an array of strings
Exception - if an option is not supported
public String[] getOptions()
getOptions in interface weka.core.OptionHandler
getOptions in class weka.filters.SimpleFilter
public void setIncludeClass(boolean value)
value - if true, the class attribute is included in the comparison
public boolean getIncludeClass()
public String includeClassTipText()
public void setRandomize(boolean value)
value - if true, the data is randomized after the removal
public boolean getRandomize()
public String randomizeTipText()
public void setSeed(int value)
setSeed in interface weka.core.Randomizable
value - the seed
public int getSeed()
getSeed in interface weka.core.Randomizable
public String seedTipText()
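The option strings accepted by setOptions correspond to the three properties documented above (include class, randomize, seed). The stand-alone sketch below shows one way such a mapping could look. It does not use Weka and does not reproduce the filter's own parsing: the class `RemoveDuplicatesOptionsSketch`, its field names, the use of IllegalArgumentException, and the decision to always emit `-S` when writing the options back are assumptions, and any options inherited from the superclasses are ignored.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical settings holder; not part of Weka. */
class RemoveDuplicatesOptionsSketch {
    boolean includeClass = false;
    boolean randomize = false;
    int seed = 42; // documented default for -S

    /** Parses the three documented options and rejects anything else. */
    static RemoveDuplicatesOptionsSketch parse(String[] options) {
        RemoveDuplicatesOptionsSketch s = new RemoveDuplicatesOptionsSketch();
        for (int i = 0; i < options.length; i++) {
            switch (options[i]) {
                case "-include-class":
                    s.includeClass = true;
                    break;
                case "-randomize":
                    s.randomize = true;
                    break;
                case "-S":
                    if (i + 1 >= options.length) {
                        throw new IllegalArgumentException("-S requires an integer value");
                    }
                    s.seed = Integer.parseInt(options[++i]); // consume the value
                    break;
                default:
                    throw new IllegalArgumentException("Unsupported option: " + options[i]);
            }
        }
        return s;
    }

    /** Writes the settings back as an option array (assumption: -S is always emitted). */
    String[] toOptions() {
        List<String> result = new ArrayList<>();
        if (includeClass) {
            result.add("-include-class");
        }
        if (randomize) {
            result.add("-randomize");
        }
        result.add("-S");
        result.add(Integer.toString(seed));
        return result.toArray(new String[0]);
    }

    public static void main(String[] args) {
        RemoveDuplicatesOptionsSketch s = parse(new String[] {"-randomize", "-S", "7"});
        System.out.println(Arrays.toString(s.toOptions())); // prints [-randomize, -S, 7]
    }
}
```

Parsing the array returned by `toOptions()` yields the same settings again, which is the round trip that a setOptions and getOptions pair is normally expected to support.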
public weka.core.Capabilities getCapabilities()
getCapabilities in interface weka.core.CapabilitiesHandler
getCapabilities in class weka.filters.Filter
See also: Capabilities
protected weka.core.Instances determineOutputFormat(weka.core.Instances inputFormat)
throws Exception
determineOutputFormat in class weka.filters.SimpleFilter
inputFormat - the input format to base the output format on
Exception - in case the determination goes wrong
protected weka.core.Instances process(weka.core.Instances instances)
throws Exception
process in class weka.filters.SimpleFilter
instances - the data to process
Exception - in case the processing goes wrong
public String getRevision()
getRevision in interface weka.core.RevisionHandler
getRevision in class weka.filters.Filter
public static void main(String[] args)
args - should contain arguments to the filter: use -h for help