org.apache.nutch.tools
Class WebDBAdminTool

java.lang.Object
  extended byorg.apache.nutch.tools.WebDBAdminTool

public class WebDBAdminTool
extends Object

The WebDBAdminTool is for Nutch administrators who need special access to the webdb. It allows for finer editing of the stored values.

Author:
Mike Cafarella

Field Summary
static Logger LOG
           
 
Constructor Summary
WebDBAdminTool(IWebDBReader reader)
           
 
Method Summary
 void emitTopK(int k)
          Emit the top K-rated Pages.
static void main(String[] argv)
          This tool performs a number of generic db management tasks.
 void scoreDump()
          Emit each page's score and link data
 void textDump(String dumpName)
          Emit the webdb to 2 text files.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

LOG

public static final Logger LOG
Constructor Detail

WebDBAdminTool

public WebDBAdminTool(IWebDBReader reader)
Method Detail

textDump

public void textDump(String dumpName)
              throws IOException
Emit the webdb to 2 text files.

Throws:
IOException

emitTopK

public void emitTopK(int k)
              throws IOException
Emit the top K-rated Pages.

Throws:
IOException

scoreDump

public void scoreDump()
               throws IOException
Emit each page's score and link data

Throws:
IOException

main

public static void main(String[] argv)
                 throws FileNotFoundException,
                        IOException
This tool performs a number of generic db management tasks. Right now, it only emits the text-format database.

Throws:
FileNotFoundException
IOException


Copyright © 2006 The Apache Software Foundation