|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectorg.apache.hadoop.conf.Configured
weka.hadoop.HadoopExperiment
public class HadoopExperiment
| Nested Class Summary | |
|---|---|
static class |
HadoopExperiment.Map
Mapper class that read a whole file as input if its size is small, or read N lines as an inputsplit if input file is large. |
static class |
HadoopExperiment.Reduce
Collect text data from Mapper and write to output file. |
| Field Summary | |
|---|---|
protected int |
folderCount
record total input size, datasets * algorithms * repetition * folds |
protected int |
inputSize
record total input size, datasets * algorithms * repetition * folds |
protected String[] |
m_AdditionalMeasures
|
protected int |
m_attID
Attribute index of instance identifier (default -1) |
protected Classifier |
m_Classifier
The classifier used for evaluation |
static HadoopExperiment |
m_Exp
HadoopExperiment object to be used in Map/reduce classes |
protected int |
m_IRclass
Class index for information retrieval statistics (default 0) |
protected int |
m_NumFolds
The number of folds in the cross-validation |
protected boolean |
m_predTargetColumn
Flag for prediction and target columns output. |
protected int |
m_Repetition
Repetition number |
protected InstancesResultListener |
m_ResultListener
|
protected CrossValidationResultProducer |
m_RP
Default ResultProducer |
protected ClassifierSplitEvaluator |
m_SplitEvaluator
two different Split Evaluators |
protected RegressionSplitEvaluator |
m_SplitEvaluator2
|
protected ArrayList<String> |
measures
Store additional measurements value |
protected String |
num
String value of input lines split size for hadoop |
protected static int |
NUM_IR_STATISTICS
The number of IR statistics |
protected static int |
NUM_UNWEIGHTED_IR_STATISTICS
The number of unweighted averaged IR statistics |
protected static int |
NUM_WEIGHTED_IR_STATISTICS
The number of averaged IR statistics |
protected static int |
Regression_RESULT_SIZE
|
protected static int |
RESULT_SIZE
The length of a result |
protected String |
uniqueFile
|
protected String |
uniqueFolder
|
| Constructor Summary | |
|---|---|
HadoopExperiment()
|
|
| Method Summary | |
|---|---|
static void |
determineLinesPerMap(int number)
Method to determine how many lines to read per input split. |
static Double |
getTimestamp()
Get current time info. |
static void |
main(String[] args)
Main method to run Hadoop experiment. |
int |
run(String[] args)
Setting up hadoop job and run |
| Methods inherited from class org.apache.hadoop.conf.Configured |
|---|
getConf, setConf |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Methods inherited from interface org.apache.hadoop.conf.Configurable |
|---|
getConf, setConf |
| Field Detail |
|---|
protected static final int RESULT_SIZE
protected static final int Regression_RESULT_SIZE
protected static final int NUM_IR_STATISTICS
protected static final int NUM_WEIGHTED_IR_STATISTICS
protected static final int NUM_UNWEIGHTED_IR_STATISTICS
protected int m_NumFolds
protected int m_Repetition
protected int m_attID
protected int m_IRclass
protected CrossValidationResultProducer m_RP
protected InstancesResultListener m_ResultListener
protected boolean m_predTargetColumn
protected Classifier m_Classifier
protected ClassifierSplitEvaluator m_SplitEvaluator
protected RegressionSplitEvaluator m_SplitEvaluator2
protected ArrayList<String> measures
protected String[] m_AdditionalMeasures
protected String num
protected String uniqueFile
protected String uniqueFolder
protected int inputSize
protected int folderCount
public static HadoopExperiment m_Exp
| Constructor Detail |
|---|
public HadoopExperiment()
| Method Detail |
|---|
public static Double getTimestamp()
public int run(String[] args)
throws Exception
run in interface org.apache.hadoop.util.Toolargs - type of String[]
Exceptionpublic static void determineLinesPerMap(int number)
number - an total amount of lines will occur in
the final output file, given knowledge of datasets, algorithms, folds and runs number,.
public static void main(String[] args)
throws Exception
args - type of String[]
Exception
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||