|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectweka.deduping.Deduper
An abstract class that takes a set of objects and identifies disjoint subsets of duplicates
Field Summary | |
protected java.util.ArrayList |
m_statistics
An arraylist of Object arrays containing statistics |
Constructor Summary | |
Deduper()
|
Method Summary | |
abstract void |
buildDeduper(Instances trainInstances,
Instances testInstances)
Given training data, build the metrics required by the deduper |
abstract void |
findDuplicates(Instances testInstances,
int numObjects)
Identify duplicates within the testing data |
static Deduper |
forName(java.lang.String deduperName,
java.lang.String[] options)
|
java.util.ArrayList |
getStatistics()
Return the list of statistics collected during deduping |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
protected java.util.ArrayList m_statistics
Constructor Detail |
public Deduper()
Method Detail |
public abstract void buildDeduper(Instances trainInstances, Instances testInstances) throws java.lang.Exception
java.lang.Exception
public abstract void findDuplicates(Instances testInstances, int numObjects) throws java.lang.Exception
testInstances
- a set of instances among which to identify duplicatesnumObjects
- the number of "true object" sets to create
java.lang.Exception
public static Deduper forName(java.lang.String deduperName, java.lang.String[] options) throws java.lang.Exception
java.lang.Exception
public java.util.ArrayList getStatistics()
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |