java.lang.Object
weka.core.converters.AbstractLoader
weka.core.converters.AbstractFileLoader
weka.core.converters.CSVLoader
public class CSVLoader
Reads a source that is in comma-separated format (the default). The column separator can be changed from a comma to a tab or another character. Assumes that the first row in the file determines the number and names of the attributes.
Valid options are:
-H No header row present in the data.
-N <range> The range of attributes to force type to be NOMINAL. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-S <range> The range of attributes to force type to be STRING. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-D <range> The range of attributes to force type to be DATE. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-format <date format> The date formatting string to use to parse date values. (default: "yyyy-MM-dd'T'HH:mm:ss")
-M <str> The string representing a missing value. (default: ?)
-F <separator> The field separator to be used. '\t' can be used as well. (default: ',')
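The range syntax accepted by -N, -S and -D can be illustrated with a small stand-alone sketch. It is not Weka's implementation: the class RangeSketch and its method expand are hypothetical names, and the sketch assumes the indices in the range string are 1-based and that an explicit range (not the -none- default) is supplied.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper, not part of Weka: expands a range string into 0-based indices. */
class RangeSketch {

    /**
     * Expands a range such as "1,4,5-27,50-last" for a data set with
     * numAttributes columns. Assumes 1-based indices in the string and
     * returns 0-based indices in the order they are listed.
     */
    static List<Integer> expand(String range, int numAttributes) {
        List<Integer> indices = new ArrayList<>();
        for (String part : range.split(",")) {
            // A part is either a single index ("4") or a span ("5-27").
            String[] bounds = part.trim().split("-", 2);
            int from = toIndex(bounds[0], numAttributes);
            int to = bounds.length == 2 ? toIndex(bounds[1], numAttributes) : from;
            for (int i = from; i <= to; i++) {
                indices.add(i);
            }
        }
        return indices;
    }

    /** Maps "first", "last", or a 1-based number to a 0-based index. */
    private static int toIndex(String token, int numAttributes) {
        String t = token.trim();
        if (t.equals("first")) {
            return 0;
        }
        if (t.equals("last")) {
            return numAttributes - 1;
        }
        int oneBased = Integer.parseInt(t);
        if (oneBased < 1 || oneBased > numAttributes) {
            throw new IllegalArgumentException("Index out of range: " + t);
        }
        return oneBased - 1;
    }

    public static void main(String[] args) {
        // Ten attributes: prints [0, 3, 4, 5, 6, 8, 9]
        System.out.println(expand("1,4,5-7,9-last", 10));
    }
}
```

Under these assumptions, "first-last" selects every attribute, so passing it to -S would force all columns to type STRING.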
See Also: Loader, Serialized Form
| Field Summary | |
|---|---|
| static String | FILE_EXTENSION: the file extension. |
| Fields inherited from class weka.core.converters.AbstractFileLoader |
|---|
| FILE_EXTENSION_COMPRESSED |
| Fields inherited from interface weka.core.converters.Loader |
|---|
| BATCH, INCREMENTAL, NONE |
| Constructor Summary | |
|---|---|
| CSVLoader(): default constructor. | |
| Method Summary | |
|---|---|
| String | dateAttributesTipText(): Returns the tip text for this property. |
| String | dateFormatTipText(): Returns the tip text for this property. |
| String | fieldSeparatorTipText(): Returns the tip text for this property. |
| Instances | getDataSet(): Returns the full data set. |
| String | getDateAttributes(): Returns the current attribute range to be forced to type date. |
| String | getDateFormat(): Gets the format to use for parsing date values. |
| String | getFieldSeparator(): Returns the character used as column separator. |
| String | getFileDescription(): Returns a description of the file type. |
| String | getFileExtension(): Gets the file extension used for CSV files. |
| String[] | getFileExtensions(): Gets all the file extensions used for this type of file. |
| String | getMissingValue(): Returns the current placeholder for missing values. |
| Instance | getNextInstance(Instances structure): CSVLoader is unable to process a data set incrementally. |
| boolean | getNoHeaderRowPresent(): Gets whether there is no header row in the data. |
| String | getNominalAttributes(): Returns the current attribute range to be forced to type nominal. |
| String[] | getOptions(): Gets the current settings of the Loader. |
| String | getRevision(): Returns the revision string. |
| String | getStringAttributes(): Returns the current attribute range to be forced to type string. |
| Instances | getStructure(): Determines and returns (if possible) the structure (internally the header) of the data set as an empty set of instances. |
| String | globalInfo(): Returns a string describing this loader. |
| Enumeration | listOptions(): Returns an enumeration describing the available options. |
| static void | main(String[] args): Main method. |
| String | missingValueTipText(): Returns the tip text for this property. |
| String | noHeaderRowPresentTipText(): Returns the tip text for this property. |
| String | nominalAttributesTipText(): Returns the tip text for this property. |
| void | reset(): Resets the Loader ready to read a new data set or the same data set again. |
| void | setDateAttributes(String value): Sets the attribute range to be forced to type date. |
| void | setDateFormat(String value): Sets the format to use for parsing date values. |
| void | setFieldSeparator(String value): Sets the character used as column separator. |
| void | setMissingValue(String value): Sets the placeholder for missing values. |
| void | setNoHeaderRowPresent(boolean b): Sets whether there is no header row in the data. |
| void | setNominalAttributes(String value): Sets the attribute range to be forced to type nominal. |
| void | setOptions(String[] options): Parses a given list of options. |
| void | setSource(File file): Resets the Loader object and sets the source of the data set to be the supplied File object. |
| void | setSource(InputStream input): Resets the Loader object and sets the source of the data set to be the supplied Stream object. |
| void | setStringAttributes(String value): Sets the attribute range to be forced to type string. |
| String | stringAttributesTipText(): Returns the tip text for this property. |
| Methods inherited from class weka.core.converters.AbstractFileLoader |
|---|
| getUseRelativePath, retrieveFile, runFileLoader, setEnvironment, setFile, setUseRelativePath, useRelativePathTipText |
| Methods inherited from class weka.core.converters.AbstractLoader |
|---|
| setRetrieval |
| Methods inherited from class java.lang.Object |
|---|
| equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public static String FILE_EXTENSION
| Constructor Detail |
|---|
public CSVLoader()
| Method Detail |
|---|
public String getFileExtension()
getFileExtension in interface FileSourcedConverter
public String getFileDescription()
getFileDescription in interface FileSourcedConverter
public String[] getFileExtensions()
getFileExtensions in interface FileSourcedConverter
public String globalInfo()
public Enumeration listOptions()
listOptions in interface OptionHandler
public void setOptions(String[] options)
throws Exception
-H No header row present in the data.
-N <range> The range of attributes to force type to be NOMINAL. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-S <range> The range of attributes to force type to be STRING. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-D <range> The range of attributes to force type to be DATE. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-format <date format> The date formatting string to use to parse date values. (default: "yyyy-MM-dd'T'HH:mm:ss")
-M <str> The string representing a missing value. (default: ?)
-F <separator> The field separator to be used. '\t' can be used as well. (default: ',')
setOptions in interface OptionHandler
options - the list of options as an array of strings
Exception - if an option is not supported
public String[] getOptions()
getOptions in interface OptionHandler
public String noHeaderRowPresentTipText()
public void setNoHeaderRowPresent(boolean b)
b - true if there is no header row in the data
public boolean getNoHeaderRowPresent()
public void setNominalAttributes(String value)
value - the range
public String getNominalAttributes()
public String nominalAttributesTipText()
public void setStringAttributes(String value)
value - the range
public String getStringAttributes()
public String stringAttributesTipText()
public void setDateAttributes(String value)
value - the range
public String getDateAttributes()
public String dateAttributesTipText()
public void setDateFormat(String value)
value - the format to use.
public String getDateFormat()
public String dateFormatTipText()
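The default date format string reads like a java.text.SimpleDateFormat pattern. The sketch below assumes that interpretation and shows which values such a pattern would accept. DateFormatSketch and its methods are hypothetical names, and the strict (non-lenient) parsing is a choice made for this sketch, not a documented property of CSVLoader.

```java
import java.text.ParseException;
import java.text.SimpleDateFormat;
import java.util.Date;
import java.util.Locale;

/** Hypothetical helper, not part of Weka: checks values against a date pattern. */
class DateFormatSketch {

    /** The default pattern listed for the -format option. */
    static final String DEFAULT_FORMAT = "yyyy-MM-dd'T'HH:mm:ss";

    /** Parses a value with the given pattern; strict parsing is this sketch's choice. */
    static Date parse(String value, String pattern) throws ParseException {
        SimpleDateFormat format = new SimpleDateFormat(pattern, Locale.ROOT);
        format.setLenient(false);
        return format.parse(value);
    }

    /** Returns true if the value can be parsed with the pattern. */
    static boolean matches(String value, String pattern) {
        try {
            parse(value, pattern);
            return true;
        } catch (ParseException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(matches("2011-03-15T08:30:00", DEFAULT_FORMAT)); // true
        System.out.println(matches("15/03/2011", DEFAULT_FORMAT));          // false
        System.out.println(matches("15/03/2011", "dd/MM/yyyy"));            // true
    }
}
```

If a file stores dates as 15/03/2011, a pattern such as dd/MM/yyyy passed to setDateFormat would be the matching choice under this assumption.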
public void setMissingValue(String value)
value - the placeholder
public String getMissingValue()
public String missingValueTipText()
public void setFieldSeparator(String value)
value - the character to use
public String getFieldSeparator()
public String fieldSeparatorTipText()
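How the field separator and the missing-value placeholder interact can be shown with a simplified, stand-alone sketch. It is not CSVLoader's parser: SeparatorSketch and its methods are hypothetical, quoted fields are not handled, and trimming each cell is an assumption of the sketch. The only behavior taken from the option descriptions is that the text '\t' stands for a tab and that the placeholder (default ?) marks a missing value.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

/** Hypothetical helper, not part of Weka: splits one row into cells. */
class SeparatorSketch {

    /** Converts the option value to the real separator; the two characters \t mean a tab. */
    static String resolveSeparator(String optionValue) {
        return optionValue.equals("\\t") ? "\t" : optionValue;
    }

    /**
     * Splits a row on the separator and maps the missing-value placeholder
     * to null. Simplification: quoted fields containing the separator are
     * not supported here.
     */
    static List<String> splitRow(String row, String optionValue, String missingValue) {
        String separator = resolveSeparator(optionValue);
        List<String> cells = new ArrayList<>();
        // Pattern.quote treats the separator literally; -1 keeps trailing empty cells.
        for (String cell : row.split(Pattern.quote(separator), -1)) {
            String trimmed = cell.trim();
            cells.add(trimmed.equals(missingValue) ? null : trimmed);
        }
        return cells;
    }

    public static void main(String[] args) {
        // Tab-separated row with one missing value: prints [5.1, null, setosa]
        System.out.println(splitRow("5.1\t?\tsetosa", "\\t", "?"));
    }
}
```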
public void setSource(InputStream input)
throws IOException
setSource in interface Loader
setSource in class AbstractLoader
input - the input stream
IOException - if an error occurs
public void setSource(File file)
throws IOException
setSource in interface Loader
setSource in class AbstractFileLoader
file - the source file.
IOException - if an error occurs
public Instances getStructure()
throws IOException
getStructure in interface Loader
getStructure in class AbstractLoader
IOException - if an error occurs
public Instances getDataSet()
throws IOException
getDataSet in interface Loader
getDataSet in class AbstractLoader
IOException - if there is no source or parsing fails
public Instance getNextInstance(Instances structure)
throws IOException
getNextInstance in interface Loader
getNextInstance in class AbstractLoader
structure - ignored
IOException - always. CSVLoader is unable to process a data set incrementally.
public void reset()
throws IOException
reset in interface Loader
reset in class AbstractFileLoader
IOException - if something goes wrong
public String getRevision()
getRevision in interface RevisionHandler
public static void main(String[] args)
args - should contain the name of an input file.