|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectweka.experiment.NoiseCurveCrossValidationResultProducer
Does a N-fold cross-validation, but generates a Noise Curve by also varying the number amount of Noise. Always uses the same N-fold test set for testing.
Field Summary | |
static java.lang.String |
DATASET_FIELD_NAME
|
static java.lang.String |
FOLD_FIELD_NAME
|
protected java.lang.String[] |
m_AdditionalMeasures
The names of any additional measures to look for in SplitEvaluators |
protected java.util.Vector |
m_AttributeStats
Store Statistics of Attributes |
protected boolean |
m_classNoise
Add noise to Class Labels in Training Set |
protected boolean |
m_classNoiseTest
Add noise to Class Labels in Testing Set |
protected int |
m_CurrentSize
Dataset size for the runs, we take the full dataset |
protected boolean |
m_debugOutput
Save raw output of split evaluators --- for debugging purposes |
protected boolean |
m_featureMiss
Set features missing, do not include Class as a Feature in Training Set |
protected boolean |
m_featureMissTest
Set features missing, do not include Class as a Feature in Testing Set |
protected boolean |
m_featureNoise
Add noise to Features, do not include Class as a Feature in Training Set |
protected boolean |
m_featureNoiseTest
Add noise to Features, do not include Class as a Feature in Testing Set |
protected Instances |
m_Instances
The dataset of interest |
protected int |
m_NumFolds
The number of folds in the cross-validation |
protected java.io.File |
m_OutputFile
The destination output file/directory for raw output |
protected double[] |
m_PlotPoints
The specific points to plot, either integers representing specific numbers of training examples, or decimal fractions representing percentages of the full training set -- ONLY INTEGERS SUPPORTED |
protected java.util.Random |
m_Random
Random Number, used for randomization in each run |
protected ResultListener |
m_ResultListener
The ResultListener to send results to |
protected SplitEvaluator |
m_SplitEvaluator
The SplitEvaluator used to generate results |
protected OutputZipper |
m_ZipDest
The output zipper to use for saving raw splitEvaluator output |
static java.lang.String |
NOISE_FIELD_NAME
|
static java.lang.String |
RUN_FIELD_NAME
|
static java.lang.String |
STEP_FIELD_NAME
|
static java.lang.String |
TIMESTAMP_FIELD_NAME
|
Constructor Summary | |
NoiseCurveCrossValidationResultProducer()
|
Method Summary | |
void |
addClassNoise(Instances train,
Instances test,
int noiseLevel)
|
void |
addFeatureMiss(Instances train,
Instances test,
int noiseLevel)
|
void |
addFeatureNoise(Instances train,
Instances test,
int noiseLevel)
|
java.lang.String |
classNoiseTestTipText()
Returns the tip text for this property |
java.lang.String |
classNoiseTipText()
Returns the tip text for this property |
void |
doRun(int run)
Gets the results for a specified run number. |
void |
doRunKeys(int run)
Gets the keys for a specified run number. |
java.util.Enumeration |
enumerateMeasures()
Returns an enumeration of any additional measure names that might be in the SplitEvaluator |
java.lang.String |
featureMissTestTipText()
Returns the tip text for this property |
java.lang.String |
featureMissTipText()
Returns the tip text for this property |
java.lang.String |
featureNoiseTestTipText()
Returns the tip text for this property |
java.lang.String |
featureNoiseTipText()
Returns the tip text for this property |
boolean |
getclassNoise()
Get if Noise is to be added to Class |
boolean |
getclassNoiseTest()
Get if Noise is to be added to Class in Testing Set |
java.lang.String |
getCompatibilityState()
Gets a description of the internal settings of the result producer, sufficient for distinguishing a ResultProducer instance from another with different settings (ignoring those settings set through this interface). |
boolean |
getfeatureMiss()
Get if Features are to be set Missing |
boolean |
getfeatureMissTest()
Get if Features are to be set Missing in Testing Set |
boolean |
getfeatureNoise()
Get if Noise to be added in Features |
boolean |
getfeatureNoiseTest()
Get if Noise is to be added to Feature in Testing Set |
java.lang.String[] |
getKeyNames()
Gets the names of each of the columns produced for a single run. |
java.lang.Object[] |
getKeyTypes()
Gets the data types of each of the columns produced for a single run. |
double |
getMeasure(java.lang.String additionalMeasureName)
Returns the value of the named measure |
int |
getNumFolds()
Get the value of NumFolds. |
java.lang.String[] |
getOptions()
Gets the current settings of the result producer. |
java.io.File |
getOutputFile()
Get the value of OutputFile. |
java.lang.String |
getPlotPoints()
Get the value of PlotPoints. |
boolean |
getRawOutput()
Get if raw split evaluator output is to be saved |
java.lang.String[] |
getResultNames()
Gets the names of each of the columns produced for a single run. |
java.lang.Object[] |
getResultTypes()
Gets the data types of each of the columns produced for a single run. |
SplitEvaluator |
getSplitEvaluator()
Get the SplitEvaluator. |
static java.lang.Double |
getTimestamp()
Gets a Double representing the current date and time. |
java.lang.String |
globalInfo()
Returns a string describing this result producer |
protected static boolean |
isInteger(double val)
Return true if the given double represents an integer value |
java.util.Enumeration |
listOptions()
Returns an enumeration describing the available options.. |
static void |
main(java.lang.String[] args)
|
protected int |
maxTrainSize()
Get the maximum size of the training set based on maximum training set size from the n-fold CV |
java.lang.String |
numFoldsTipText()
Returns the tip text for this property |
java.lang.String |
outputFileTipText()
Returns the tip text for this property |
protected double[] |
parsePlotPoints(java.lang.String plotPoints)
Parse a string of doubles separated by commas or spaces into a sorted array of doubles |
protected int |
plotPoint(int i)
Return the amount of noise for the ith point on the curve for plotPoints as specified. |
java.lang.String |
plotPointsTipText()
Returns the tip text for this property |
void |
postProcess()
Perform any postprocessing. |
void |
preProcess()
Prepare to generate results. |
java.lang.String |
rawOutputTipText()
Returns the tip text for this property |
void |
setAdditionalMeasures(java.lang.String[] additionalMeasures)
Set a list of method names for additional measures to look for in SplitEvaluators. |
void |
setclassNoise(boolean d)
Set to true if Noise is to be added to Class |
void |
setclassNoiseTest(boolean d)
Set to true if Noise is to be added to Class in Testing |
void |
setfeatureMiss(boolean d)
Set to true if Features are to be set Missing |
void |
setfeatureMissTest(boolean d)
Set to true if Features are to be set Missing in Testing |
void |
setfeatureNoise(boolean d)
Set to true if Noise is to be added to Features |
void |
setfeatureNoiseTest(boolean d)
Set to true if Noise is to be added in Fetures in Testing |
void |
setInstances(Instances instances)
Sets the dataset that results will be obtained for. |
void |
setNumFolds(int newNumFolds)
Set the value of NumFolds. |
void |
setOptions(java.lang.String[] options)
Parses a given list of options. |
void |
setOutputFile(java.io.File newOutputFile)
Set the value of OutputFile. |
void |
setPlotPoints(java.lang.String plotPoints)
Set the value of PlotPoints. |
void |
setRawOutput(boolean d)
Set to true if raw split evaluator output is to be saved |
void |
setResultListener(ResultListener listener)
Sets the object to send results of each run to. |
void |
setSplitEvaluator(SplitEvaluator newSplitEvaluator)
Set the SplitEvaluator. |
java.lang.String |
splitEvaluatorTipText()
Returns the tip text for this property |
java.lang.String |
toString()
Gets a text descrption of the result producer. |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
Field Detail |
protected Instances m_Instances
protected ResultListener m_ResultListener
protected int m_NumFolds
protected boolean m_debugOutput
protected boolean m_classNoise
protected boolean m_featureNoise
protected boolean m_featureMiss
protected boolean m_classNoiseTest
protected boolean m_featureNoiseTest
protected boolean m_featureMissTest
protected OutputZipper m_ZipDest
protected java.io.File m_OutputFile
protected SplitEvaluator m_SplitEvaluator
protected java.lang.String[] m_AdditionalMeasures
protected java.util.Vector m_AttributeStats
protected double[] m_PlotPoints
protected int m_CurrentSize
protected java.util.Random m_Random
public static java.lang.String DATASET_FIELD_NAME
public static java.lang.String RUN_FIELD_NAME
public static java.lang.String FOLD_FIELD_NAME
public static java.lang.String TIMESTAMP_FIELD_NAME
public static java.lang.String STEP_FIELD_NAME
public static java.lang.String NOISE_FIELD_NAME
Constructor Detail |
public NoiseCurveCrossValidationResultProducer()
Method Detail |
public java.lang.String globalInfo()
public void setInstances(Instances instances)
setInstances
in interface ResultProducer
instances
- a value of type 'Instances'.public void setResultListener(ResultListener listener)
setResultListener
in interface ResultProducer
listener
- a value of type 'ResultListener'public void setAdditionalMeasures(java.lang.String[] additionalMeasures)
setAdditionalMeasures
in interface ResultProducer
additionalMeasures
- a list of method namespublic java.util.Enumeration enumerateMeasures()
enumerateMeasures
in interface AdditionalMeasureProducer
public double getMeasure(java.lang.String additionalMeasureName)
getMeasure
in interface AdditionalMeasureProducer
additionalMeasureName
- the name of the measure to query for its value
public static java.lang.Double getTimestamp()
public void preProcess() throws java.lang.Exception
preProcess
in interface ResultProducer
java.lang.Exception
- if an error occurs during preprocessing.public void postProcess() throws java.lang.Exception
postProcess
in interface ResultProducer
java.lang.Exception
- if an error occurspublic void doRunKeys(int run) throws java.lang.Exception
doRunKeys
in interface ResultProducer
run
- the run number to get keys for.
java.lang.Exception
- if a problem occurs while getting the keysprotected int maxTrainSize()
public void doRun(int run) throws java.lang.Exception
doRun
in interface ResultProducer
run
- the run number to generate results for.
java.lang.Exception
- if a problem occurs while getting the resultsprotected int plotPoint(int i)
protected static boolean isInteger(double val)
public java.lang.String[] getKeyNames()
getKeyNames
in interface ResultProducer
public java.lang.Object[] getKeyTypes()
getKeyTypes
in interface ResultProducer
public java.lang.String[] getResultNames()
getResultNames
in interface ResultProducer
public java.lang.Object[] getResultTypes()
getResultTypes
in interface ResultProducer
public java.lang.String getCompatibilityState()
getCompatibilityState
in interface ResultProducer
public java.lang.String outputFileTipText()
public java.io.File getOutputFile()
public void setOutputFile(java.io.File newOutputFile)
public java.lang.String numFoldsTipText()
public int getNumFolds()
public void setNumFolds(int newNumFolds)
public java.lang.String plotPointsTipText()
public java.lang.String getPlotPoints()
public void setPlotPoints(java.lang.String plotPoints)
protected double[] parsePlotPoints(java.lang.String plotPoints)
public java.lang.String rawOutputTipText()
public boolean getRawOutput()
public void setRawOutput(boolean d)
public java.lang.String classNoiseTipText()
public boolean getclassNoise()
public void setclassNoise(boolean d)
public java.lang.String featureNoiseTipText()
public boolean getfeatureNoise()
public void setfeatureNoise(boolean d)
public java.lang.String featureMissTipText()
public boolean getfeatureMiss()
public void setfeatureMiss(boolean d)
public java.lang.String classNoiseTestTipText()
public boolean getclassNoiseTest()
public void setclassNoiseTest(boolean d)
public java.lang.String featureNoiseTestTipText()
public boolean getfeatureNoiseTest()
public void setfeatureNoiseTest(boolean d)
public java.lang.String featureMissTestTipText()
public boolean getfeatureMissTest()
public void setfeatureMissTest(boolean d)
public java.lang.String splitEvaluatorTipText()
public SplitEvaluator getSplitEvaluator()
public void setSplitEvaluator(SplitEvaluator newSplitEvaluator)
public java.util.Enumeration listOptions()
listOptions
in interface OptionHandler
public void setOptions(java.lang.String[] options) throws java.lang.Exception
-X num_folds
The number of folds to use for the cross-validation.
-D
Specify that raw split evaluator output is to be saved.
-O file/directory name
Specify the file or directory to which raw split evaluator output
is to be saved. If a directory is specified, then each output string
is saved as an individual gzip file. If a file is specified, then
each output string is saved as an entry in a zip file.
-W classname
Specify the full class name of the split evaluator.
-N Add Noise to Class in Training -n Add Noise to Class in Testing -F Add Noise to Features in Training -f Add Noise to Features in Testing -M Set Features Missing in Training -m Set Features Missing in Testing All option after -- will be passed to the split evaluator.
setOptions
in interface OptionHandler
options
- the list of options as an array of strings
java.lang.Exception
- if an option is not supportedpublic java.lang.String[] getOptions()
getOptions
in interface OptionHandler
public java.lang.String toString()
public void addClassNoise(Instances train, Instances test, int noiseLevel) throws java.lang.Exception
java.lang.Exception
public void addFeatureNoise(Instances train, Instances test, int noiseLevel) throws java.lang.Exception
java.lang.Exception
public void addFeatureMiss(Instances train, Instances test, int noiseLevel) throws java.lang.Exception
java.lang.Exception
public static void main(java.lang.String[] args)
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |