org.apache.nutch.indexer
Class DeleteDuplicates.IndexedDoc.ByHashScore

java.lang.Object
  extended byorg.apache.nutch.io.WritableComparator
      extended byorg.apache.nutch.indexer.DeleteDuplicates.IndexedDoc.ByHashScore
All Implemented Interfaces:
Comparator
Enclosing class:
DeleteDuplicates.IndexedDoc

public static class DeleteDuplicates.IndexedDoc.ByHashScore
extends WritableComparator

Order equal hashes by decreasing score and increasing urlLen.


Constructor Summary
DeleteDuplicates.IndexedDoc.ByHashScore()
           
 
Method Summary
 int compare(byte[] b1, int s1, int l1, byte[] b2, int s2, int l2)
          Optimization hook.
 
Methods inherited from class org.apache.nutch.io.WritableComparator
compare, compare, compareBytes, define, get, getKeyClass, newKey, readFloat, readInt, readLong, readUnsignedShort
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 
Methods inherited from interface java.util.Comparator
equals
 

Constructor Detail

DeleteDuplicates.IndexedDoc.ByHashScore

public DeleteDuplicates.IndexedDoc.ByHashScore()
Method Detail

compare

public int compare(byte[] b1,
                   int s1,
                   int l1,
                   byte[] b2,
                   int s2,
                   int l2)
Description copied from class: WritableComparator
Optimization hook. Override this to make SequenceFile.Sorter's scream.

The default implementation reads the data into two WritableComparables (using Writable.readFields(DataInput), then calls WritableComparator.compare(WritableComparable,WritableComparable).

Overrides:
compare in class WritableComparator


Copyright © 2006 The Apache Software Foundation