Package org.apache.nutch.indexer

Maintain Lucene full-text indexes.

See:
          Description

Interface Summary
IndexingFilter Extension point for indexing.
 

Class Summary
DeleteDuplicates Deletes duplicate documents in a set of Lucene indexes.
DeleteDuplicates.IndexedDoc The key used in sorting for duplicates.
DeleteDuplicates.IndexedDoc.ByHashDoc Order equal hashes by decreasing index and document.
DeleteDuplicates.IndexedDoc.ByHashScore Order equal hashes by decreasing score and increasing urlLen.
HighFreqTerms Lists the most frequent terms in an index.
IndexingFilters Creates and caches IndexingFilter implementing plugins.
IndexMerger IndexMerger creates an index for the output corresponding to a single fetcher run.
IndexOptimizer  
IndexSegment Creates an index for the output corresponding to a single fetcher run.
NutchSimilarity Similarity implementatation used by Nutch indexing and search.
 

Exception Summary
IndexingException  
 

Package org.apache.nutch.indexer Description

Maintain Lucene full-text indexes.



Copyright © 2006 The Apache Software Foundation