|
|||||||||||
| PREV NEXT | FRAMES NO FRAMES | ||||||||||
| Packages that use HTMLMetaTags | |
| org.apache.nutch.analysis.lang | Text document language identifier. |
| org.apache.nutch.parse | |
| org.apache.nutch.parse.html | An HTML document parsing plugin. |
| org.apache.nutch.parse.js | |
| org.creativecommons.nutch | Sample plugins that parse and index Creative Commons medadata. |
| Uses of HTMLMetaTags in org.apache.nutch.analysis.lang |
| Methods in org.apache.nutch.analysis.lang with parameters of type HTMLMetaTags | |
Parse |
HTMLLanguageParser.filter(Content content,
Parse parse,
HTMLMetaTags metaTags,
DocumentFragment doc)
Scan the HTML document looking at possible indications of content language. |
| Uses of HTMLMetaTags in org.apache.nutch.parse |
| Methods in org.apache.nutch.parse with parameters of type HTMLMetaTags | |
Parse |
HtmlParseFilter.filter(Content content,
Parse parse,
HTMLMetaTags metaTags,
DocumentFragment doc)
Adds metadata or otherwise modifies a parse of HTML content, given the DOM tree of a page. |
static Parse |
HtmlParseFilters.filter(Content content,
Parse parse,
HTMLMetaTags metaTags,
DocumentFragment doc)
Run all defined filters. |
| Uses of HTMLMetaTags in org.apache.nutch.parse.html |
| Methods in org.apache.nutch.parse.html with parameters of type HTMLMetaTags | |
static void |
HTMLMetaProcessor.getMetaTags(HTMLMetaTags metaTags,
Node node,
URL currURL)
Sets the indicators in robotsMeta to appropriate
values, based on any META tags found under the given
node. |
| Uses of HTMLMetaTags in org.apache.nutch.parse.js |
| Methods in org.apache.nutch.parse.js with parameters of type HTMLMetaTags | |
Parse |
JSParseFilter.filter(Content content,
Parse parse,
HTMLMetaTags metaTags,
DocumentFragment doc)
|
| Uses of HTMLMetaTags in org.creativecommons.nutch |
| Methods in org.creativecommons.nutch with parameters of type HTMLMetaTags | |
Parse |
CCParseFilter.filter(Content content,
Parse parse,
HTMLMetaTags metaTags,
DocumentFragment doc)
Adds metadata or otherwise modifies a parse of an HTML document, given the DOM tree of a page. |
|
|||||||||||
| PREV NEXT | FRAMES NO FRAMES | ||||||||||