|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectcom.rapidminer.tools.AbstractObservable<Operator>
com.rapidminer.operator.Operator
com.rapidminer.operator.AbstractExampleSetProcessing
com.rapidminer.operator.preprocessing.outlier.AbstractOutlierDetection
com.rapidminer.operator.preprocessing.outlier.DKNOutlierOperator
public class DKNOutlierOperator
This operator performs a D^k_n Outlier Search according to the outlier detection approach recommended by Ramaswamy, Rastogi and Shim in "Efficient Algorithms for Mining Outliers from Large Data Sets". It is primarily a statistical outlier search based on a distance measure similar to the DB(p,D)-Outlier Search from Knorr and Ng. But it utilizes a distance search through the k-th nearest neighbourhood, so it implements some sort of locality as well.
The method states, that those objects with the largest distance to their k-th nearest neighbours are likely to be outliers respective to the data set, because it can be assumed, that those objects have a more sparse neighbourhood than the average objects. As this effectively provides a simple ranking over all the objects in the data set according to the distance to their k-th nearest neighbours, the user can specify a number of n objects to be the top-n outliers in the data set.
The operator supports cosine, sine or squared distances in addition to the euclidian distance which can be specified by a distance parameter. The Operator takes an example set and passes it on with an boolean top-n D^k outlier status in a new boolean-valued special outlier attribute indicating true (outlier) and false (no outlier).
Field Summary | |
---|---|
static java.lang.String |
PARAMETER_DISTANCE_FUNCTION
The parameter name for "choose which distance function will be used for calculating " |
static java.lang.String |
PARAMETER_NUMBER_OF_NEIGHBORS
The parameter name for "Specifies the k value for the k-th nearest neighbours to be the analyzed. |
static java.lang.String |
PARAMETER_NUMBER_OF_OUTLIERS
The parameter name for "The number of top-n Outliers to be looked for. |
Constructor Summary | |
---|---|
DKNOutlierOperator(OperatorDescription description)
|
Method Summary | |
---|---|
ExampleSet |
apply(ExampleSet eSet)
This method implements the main functionality of the Operator but can be considered as a sort of wrapper to pass the RapidMiner operator chain data deeper into the search space class, so do not expect a lot of things happening here. |
protected java.util.Set<java.lang.String> |
getOutlierValues()
|
java.util.List<ParameterType> |
getParameterTypes()
Returns a list of ParameterTypes describing the parameters of this operator. |
ResourceConsumptionEstimator |
getResourceConsumptionEstimator()
Subclasses can override this method if they are able to estimate the consumed resources (CPU time and memory), based on their input. |
Methods inherited from class com.rapidminer.operator.preprocessing.outlier.AbstractOutlierDetection |
---|
modifyMetaData, writesIntoExistingData |
Methods inherited from class com.rapidminer.operator.AbstractExampleSetProcessing |
---|
doWork, getExampleSetInputPort, getExampleSetOutputPort, getInputPort, getRequiredMetaData, shouldAutoConnect |
Methods inherited from class com.rapidminer.tools.AbstractObservable |
---|
addObserver, addObserverAsFirst, fireUpdate, removeObserver |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
Field Detail |
---|
public static final java.lang.String PARAMETER_NUMBER_OF_NEIGHBORS
public static final java.lang.String PARAMETER_NUMBER_OF_OUTLIERS
public static final java.lang.String PARAMETER_DISTANCE_FUNCTION
Constructor Detail |
---|
public DKNOutlierOperator(OperatorDescription description)
Method Detail |
---|
public ExampleSet apply(ExampleSet eSet) throws OperatorException
apply
in class AbstractExampleSetProcessing
OperatorException
public java.util.List<ParameterType> getParameterTypes()
Operator
getParameterTypes
in interface ParameterHandler
getParameterTypes
in class Operator
protected java.util.Set<java.lang.String> getOutlierValues()
getOutlierValues
in class AbstractOutlierDetection
public ResourceConsumptionEstimator getResourceConsumptionEstimator()
Operator
getResourceConsumptionEstimator
in interface ResourceConsumer
getResourceConsumptionEstimator
in class Operator
|
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |