org.apache.nutch.net
Class PrefixURLFilter
java.lang.Object
org.apache.nutch.net.PrefixURLFilter
- All Implemented Interfaces:
- URLFilter
- public class PrefixURLFilter
- extends Object
- implements URLFilter
Filters URLs based on a file of URL prefixes. The file is named by
(1) property "urlfilter.prefix.file" in ./conf/nutch-default.xml, and
(2) attribute "file" in plugin.xml of this plugin
Attribute "file" has higher precedence if defined.
The format of this file is one URL prefix per line.
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
PrefixURLFilter
public PrefixURLFilter()
throws IOException
PrefixURLFilter
public PrefixURLFilter(String filename)
throws IOException
filter
public String filter(String url)
- Specified by:
filter in interface URLFilter
main
public static void main(String[] args)
throws IOException
- Throws:
IOException
Copyright © 2006 The Apache Software Foundation