org.apache.nutch.db
Class WebDBAnchors
java.lang.Object
org.apache.nutch.db.WebDBAnchors
- public class WebDBAnchors
- extends Object
Utility that extracts the set of anchor texts for a URL from the database.
Method Summary |
String[] |
getAnchors(UTF8 url)
Return the anchor texts of links in the db that point to this URL. |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
WebDBAnchors
public WebDBAnchors(IWebDBReader db)
- Construct for the named db.
getAnchors
public String[] getAnchors(UTF8 url)
throws IOException
- Return the anchor texts of links in the db that point to this URL.
Performs duplicate elimination by source site, so that a single site
cannot generate multiple, identical anchors for a URL.
- Throws:
IOException
Copyright © 2006 The Apache Software Foundation