org.apache.nutch.crawl
Class WebTableReader.WebTableRegexMapper
java.lang.Object
org.apache.hadoop.mapreduce.Mapper<K1,V1,K2,V2>
org.apache.gora.mapreduce.GoraMapper<String,WebPage,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
org.apache.nutch.crawl.WebTableReader.WebTableRegexMapper
Enclosing class: WebTableReader
public static class WebTableReader.WebTableRegexMapper
extends org.apache.gora.mapreduce.GoraMapper<String,WebPage,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
Filters the entries from the table based on a regular expression.
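The filtering idea described above can be illustrated with a minimal, standalone sketch. It does not use the Nutch, Gora, or Hadoop classes: the class name `RegexFilterSketch`, the helper `filterKeys`, and the in-memory `Map` standing in for the table are all hypothetical. It also assumes that the pattern is compiled once up front (as a `setup` step might do) and that a key must match the whole pattern to be kept. The actual mapper may behave differently.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

public class RegexFilterSketch {

    // Hypothetical helper: returns the keys of the entries whose key
    // fully matches the given regular expression.
    static List<String> filterKeys(Map<String, String> table, String regex) {
        // Compile the pattern once, before iterating over the entries.
        Pattern pattern = Pattern.compile(regex);
        List<String> kept = new ArrayList<>();
        for (Map.Entry<String, String> entry : table.entrySet()) {
            // Assumption: whole-string match (matches), not substring search (find).
            if (pattern.matcher(entry.getKey()).matches()) {
                kept.add(entry.getKey());
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Illustrative data only; a real table would hold WebPage rows.
        Map<String, String> table = new LinkedHashMap<>();
        table.put("http://nutch.apache.org/", "page A");
        table.put("http://example.com/", "page B");
        System.out.println(filterKeys(table, ".*apache\\.org.*"));
    }
}
```

In this sketch, a pattern such as `.+` keeps every entry, so it would be a natural "no filter" default.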
Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.Mapper
org.apache.hadoop.mapreduce.Mapper.Context
Method Summary
protected void map(String key, WebPage value, org.apache.hadoop.mapreduce.Mapper.Context context)
protected void setup(org.apache.hadoop.mapreduce.Mapper.Context context)
Methods inherited from class org.apache.gora.mapreduce.GoraMapper
initMapperJob, initMapperJob, initMapperJob, initMapperJob, initMapperJob
Methods inherited from class org.apache.hadoop.mapreduce.Mapper
cleanup, run
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
WebTableReader.WebTableRegexMapper
public WebTableReader.WebTableRegexMapper()
map
protected void map(String key, WebPage value, org.apache.hadoop.mapreduce.Mapper.Context context) throws IOException, InterruptedException
Overrides: map in class org.apache.hadoop.mapreduce.Mapper<String,WebPage,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
Throws: IOException, InterruptedException
setup
protected void setup(org.apache.hadoop.mapreduce.Mapper.Context context) throws IOException, InterruptedException
Overrides: setup in class org.apache.hadoop.mapreduce.Mapper<String,WebPage,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
Throws: IOException, InterruptedException
Copyright © 2013 The Apache Software Foundation