org.apache.nutch.indexer
Class IndexSegment

java.lang.Object
  extended byorg.apache.nutch.indexer.IndexSegment

public class IndexSegment
extends Object

Creates an index for the output corresponding to a single fetcher run.


Field Summary
static String DONE_NAME
           
static Logger LOG
           
static int LOG_STEP
           
 
Constructor Summary
IndexSegment(NutchFileSystem nfs, long maxDocs, File srcDir, File localWorkingDir)
          Index a segment in the given NFS.
 
Method Summary
static float calculateBoost(float pageScore, float scorePower, boolean boostByLinkCount, int linkCount)
           
 void indexPages()
           
static void main(String[] args)
          Create an index for the input files in the named directory.
 void setScorePower(float power)
          Determines the power of link analyis scores.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

DONE_NAME

public static final String DONE_NAME
See Also:
Constant Field Values

LOG

public static final Logger LOG

LOG_STEP

public static int LOG_STEP
Constructor Detail

IndexSegment

public IndexSegment(NutchFileSystem nfs,
                    long maxDocs,
                    File srcDir,
                    File localWorkingDir)
Index a segment in the given NFS.

Method Detail

setScorePower

public void setScorePower(float power)
Determines the power of link analyis scores. Each pages's boost is set to scorescorePower where score is its link analysis score and scorePower is the value passed to this method.


indexPages

public void indexPages()
                throws Exception
Throws:
Exception

calculateBoost

public static float calculateBoost(float pageScore,
                                   float scorePower,
                                   boolean boostByLinkCount,
                                   int linkCount)

main

public static void main(String[] args)
                 throws Exception
Create an index for the input files in the named directory.

Throws:
Exception


Copyright © 2006 The Apache Software Foundation