| Packages that use Parser | |
| org.apache.nutch.parse | |
| org.apache.nutch.parse.html | An HTML document parsing plugin. |
| org.apache.nutch.parse.js | |
| org.apache.nutch.parse.msword | A Word document parsing plugin. |
| org.apache.nutch.parse.pdf | A PDF parsing plugin. |
| org.apache.nutch.parse.text | A plain text parsing plugin. |
| Uses of Parser in org.apache.nutch.parse |
| Methods in org.apache.nutch.parse that return Parser | |
static Parser |
ParserFactory.getParser(String contentType, String url)
Returns the appropriate Parser implementation for the given content type and URL. |
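The lookup described for ParserFactory.getParser can be pictured as a dispatch on content type with the URL as a secondary hint. The following is a minimal, self-contained sketch of that idea and not the Nutch implementation: the class ParserDispatchSketch, its pick method, the URL-suffix fallback, and the text/html and text/plain mappings are all assumptions made for illustration. Only the application/msword and application/pdf content types and the parser class names come from this page.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical stand-in for illustration only; this is NOT Nutch's ParserFactory.
final class ParserDispatchSketch {

    // Content type -> parser class name. The msword and pdf entries are stated
    // on this page; the text/html and text/plain entries are assumptions.
    private static final Map<String, String> BY_CONTENT_TYPE = new LinkedHashMap<>();

    // URL suffix -> parser class name. This whole fallback table is an assumption.
    private static final Map<String, String> BY_SUFFIX = new LinkedHashMap<>();

    static {
        BY_CONTENT_TYPE.put("text/html", "HtmlParser");
        BY_CONTENT_TYPE.put("application/msword", "MSWordParser");
        BY_CONTENT_TYPE.put("application/pdf", "PdfParser");
        BY_CONTENT_TYPE.put("text/plain", "TextParser");

        BY_SUFFIX.put(".html", "HtmlParser");
        BY_SUFFIX.put(".doc", "MSWordParser");
        BY_SUFFIX.put(".pdf", "PdfParser");
        BY_SUFFIX.put(".txt", "TextParser");
    }

    private ParserDispatchSketch() {
        // Static helper; not meant to be instantiated.
    }

    // Returns the name of the parser class that would handle the input,
    // or null when neither the content type nor the URL suffix is recognized.
    static String pick(String contentType, String url) {
        if (contentType != null) {
            // Drop parameters such as "; charset=UTF-8" before the lookup.
            String base = contentType.split(";", 2)[0].trim().toLowerCase(Locale.ROOT);
            String byType = BY_CONTENT_TYPE.get(base);
            if (byType != null) {
                return byType;
            }
        }
        if (url != null) {
            // Fall back to a case-insensitive match on the URL suffix.
            String lower = url.toLowerCase(Locale.ROOT);
            for (Map.Entry<String, String> entry : BY_SUFFIX.entrySet()) {
                if (lower.endsWith(entry.getKey())) {
                    return entry.getValue();
                }
            }
        }
        return null;
    }
}
```

In this sketch the content type takes precedence and the URL is consulted only when the type is missing or unrecognized. Whether the real factory orders its checks this way is not stated on this page.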
| Uses of Parser in org.apache.nutch.parse.html |
| Classes in org.apache.nutch.parse.html that implement Parser | |
class |
HtmlParser
|
| Uses of Parser in org.apache.nutch.parse.js |
| Classes in org.apache.nutch.parse.js that implement Parser | |
class |
JSParseFilter
This class is a heuristic link extractor for JavaScript files and code snippets. |
| Uses of Parser in org.apache.nutch.parse.msword |
| Classes in org.apache.nutch.parse.msword that implement Parser | |
class |
MSWordParser
Parser for MIME type application/msword. |
| Uses of Parser in org.apache.nutch.parse.pdf |
| Classes in org.apache.nutch.parse.pdf that implement Parser | |
class |
PdfParser
Parser for MIME type application/pdf. |
| Uses of Parser in org.apache.nutch.parse.text |
| Classes in org.apache.nutch.parse.text that implement Parser | |
class |
TextParser
|