org.apache.nutch.net
Class PrefixURLFilter
java.lang.Object
org.apache.nutch.net.PrefixURLFilter
- All Implemented Interfaces:
- URLFilter
- public class PrefixURLFilter
- extends Object
- implements URLFilter
Filters URLs based on a file of URL prefixes. The file name is taken from either
(1) the property "urlfilter.prefix.file" in ./conf/nutch-default.xml, or
(2) the attribute "file" in the plugin.xml of this plugin.
If the attribute "file" is defined, it takes precedence over the property.
The format of this file is one URL prefix per line.
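To illustrate the one-prefix-per-line format, the following is a minimal sketch of how such a file could be read. It is not the Nutch implementation: the class name PrefixFileSketch, the method readPrefixes, and the decision to skip blank lines and lines starting with '#' are assumptions made for this example only.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration; not part of Nutch.
class PrefixFileSketch {

    // Reads one URL prefix per line from the given reader.
    // Assumption: blank lines and lines starting with '#' are skipped.
    static List<String> readPrefixes(Reader reader) throws IOException {
        List<String> prefixes = new ArrayList<>();
        BufferedReader in = new BufferedReader(reader);
        String line;
        while ((line = in.readLine()) != null) {
            String trimmed = line.trim();
            if (trimmed.isEmpty() || trimmed.startsWith("#")) {
                continue;
            }
            prefixes.add(trimmed);
        }
        return prefixes;
    }

    public static void main(String[] args) throws IOException {
        // Sample file content with made-up prefixes.
        String sample = "# allowed sites\n"
                + "http://example.org/\n"
                + "\n"
                + "https://example.com/docs/\n";
        for (String prefix : readPrefixes(new StringReader(sample))) {
            System.out.println(prefix);
        }
    }
}
```

Reading from a Reader rather than a file name keeps the sketch easy to try with in-memory text; a real prefix file could be passed in through a java.io.FileReader.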
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
PrefixURLFilter
public PrefixURLFilter()
throws IOException
PrefixURLFilter
public PrefixURLFilter(String filename)
throws IOException
filter
public String filter(String url)
- Specified by:
filter in interface URLFilter
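The prefix check behind filter can be sketched as follows. This is an illustration, not the class's actual code: it assumes that an accepted URL is returned unchanged and a rejected URL yields null, and it uses a plain linear scan with String.startsWith, whereas a real implementation may use a more efficient structure such as a trie. The class name PrefixMatchSketch and the sample prefixes are hypothetical.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for illustration; not part of Nutch.
class PrefixMatchSketch {

    private final List<String> prefixes;

    PrefixMatchSketch(List<String> prefixes) {
        this.prefixes = prefixes;
    }

    // Assumed contract: return the URL if it starts with any configured
    // prefix, otherwise return null to signal rejection.
    String filter(String url) {
        for (String prefix : prefixes) {
            if (url.startsWith(prefix)) {
                return url;
            }
        }
        return null;
    }

    public static void main(String[] args) {
        PrefixMatchSketch sketch = new PrefixMatchSketch(
                Arrays.asList("http://example.org/", "https://example.com/docs/"));
        // Accepted: starts with the first prefix.
        System.out.println(sketch.filter("http://example.org/index.html"));
        // Rejected: no prefix matches, so the result is null.
        System.out.println(sketch.filter("ftp://example.org/file.txt"));
    }
}
```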
main
public static void main(String[] args)
throws IOException
- Throws:
IOException
Copyright © 2006 The Apache Software Foundation