org.apache.nutch.crawl
Class PartitionUrlByHost

java.lang.Object
  extended by org.apache.nutch.crawl.PartitionUrlByHost
All Implemented Interfaces:
org.apache.hadoop.mapred.JobConfigurable, org.apache.hadoop.mapred.Partitioner<org.apache.hadoop.io.Text,org.apache.hadoop.io.Writable>

public class PartitionUrlByHost
extends Object
implements org.apache.hadoop.mapred.Partitioner<org.apache.hadoop.io.Text,org.apache.hadoop.io.Writable>

Partition urls by hostname.


Constructor Summary
PartitionUrlByHost()
           
 
Method Summary
 void close()
           
 void configure(org.apache.hadoop.mapred.JobConf job)
           
 int getPartition(org.apache.hadoop.io.Text key, org.apache.hadoop.io.Writable value, int numReduceTasks)
          Hash by hostname.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

PartitionUrlByHost

public PartitionUrlByHost()
Method Detail

configure

public void configure(org.apache.hadoop.mapred.JobConf job)
Specified by:
configure in interface org.apache.hadoop.mapred.JobConfigurable

close

public void close()

getPartition

public int getPartition(org.apache.hadoop.io.Text key,
                        org.apache.hadoop.io.Writable value,
                        int numReduceTasks)
Hash by hostname.

Specified by:
getPartition in interface org.apache.hadoop.mapred.Partitioner<org.apache.hadoop.io.Text,org.apache.hadoop.io.Writable>


Copyright © 2006 The Apache Software Foundation