org.apache.nutch.mapReduce.lib
Class RegexMapper

java.lang.Object
  extended by org.apache.nutch.mapReduce.lib.RegexMapper
All Implemented Interfaces:
Configurable, Mapper

public class RegexMapper
extends Object
implements Mapper

A Mapper that extracts text matching a regular expression.
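
The extraction idea can be illustrated with a minimal sketch built only on java.util.regex. The class RegexExtractSketch, its extract method, and the group parameter are hypothetical names introduced for illustration; they are not part of Nutch, and this page does not document how RegexMapper is implemented internally.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Nutch: collects every match of a
// regular expression found in a single input string.
class RegexExtractSketch {

    // Returns the text of the given capture group for each match, in order
    // of appearance. Group 0 means the whole match.
    static List<String> extract(Pattern pattern, int group, String text) {
        List<String> matches = new ArrayList<>();
        Matcher matcher = pattern.matcher(text);
        while (matcher.find()) {
            matches.add(matcher.group(group));
        }
        return matches;
    }
}
```

Calling RegexExtractSketch.extract(Pattern.compile("[a-z]+"), 0, "foo 12 bar") yields the two matches foo and bar; an input with no match yields an empty list.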


Constructor Summary
RegexMapper()
           
 
Method Summary
 void configure(JobConf job)
          Initializes a new instance from a JobConf.
 void map(WritableComparable key, Writable value, OutputCollector output)
          Maps a single input key/value pair into intermediate key/value pairs.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

RegexMapper

public RegexMapper()

Method Detail

configure

public void configure(JobConf job)
Description copied from interface: Configurable
Initializes a new instance from a JobConf.

Specified by:
configure in interface Configurable
Parameters:
job - the configuration

map

public void map(WritableComparable key,
                Writable value,
                OutputCollector output)
         throws IOException
Description copied from interface: Mapper
Maps a single input key/value pair into intermediate key/value pairs. Output pairs need not be of the same types as input pairs. A given input pair may map to zero or many output pairs. Output pairs are collected with calls to OutputCollector.collect(WritableComparable,Writable).

Specified by:
map in interface Mapper
Parameters:
key - the key
value - the value
output - collects mapped keys and values
Throws:
IOException
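
The map contract described above (zero or many output pairs per input pair, with output types independent of input types) can be sketched with simplified stand-ins. Everything here is an assumption made for illustration: MapContractSketch, its nested Collector interface, the use of java.util.Properties in place of JobConf, and the property name sketch.regex are all hypothetical and do not reflect the actual Nutch classes or configuration keys.

```java
import java.io.IOException;
import java.util.Properties;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch, not the Nutch implementation: a mapper-like class
// that is configured once and then emits one pair per regex match.
class MapContractSketch {

    // Simplified stand-in for OutputCollector (String keys, Long values).
    interface Collector {
        void collect(String key, Long value) throws IOException;
    }

    private Pattern pattern;

    // Stand-in for configure(JobConf): compile the pattern once up front.
    // "sketch.regex" is a made-up property name used only in this example.
    void configure(Properties conf) {
        pattern = Pattern.compile(conf.getProperty("sketch.regex", "\\w+"));
    }

    // One input pair may produce zero, one, or many output pairs.
    // The input key is ignored; each match is emitted with a count of 1.
    void map(Long key, String value, Collector output) throws IOException {
        Matcher matcher = pattern.matcher(value);
        while (matcher.find()) {
            output.collect(matcher.group(), 1L);
        }
    }
}
```

Emitting a constant count of 1 per match is a choice made for this sketch so that a downstream reducer could sum occurrences; the page itself does not state what value RegexMapper emits.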


Copyright © 2006 The Apache Software Foundation