Interface | Description |
---|---|
URLFilter | Interface used to limit which URLs enter Nutch. |
URLNormalizer | Interface used to convert URLs to normal form and optionally perform substitutions. |

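
The filtering idea described above can be sketched in a few lines. This is only an illustration, not the Nutch interface itself: `SimpleUrlFilter`, `applyAll`, and the two example rules are hypothetical names, and the sketch assumes a convention in which a filter returns the URL to accept it and `null` to reject it.

```java
import java.util.Arrays;
import java.util.List;

public class UrlFilterSketch {

    /** Hypothetical stand-in for a URL filter: returns the URL to keep it, or null to reject it. */
    public interface SimpleUrlFilter {
        String filter(String url);
    }

    /** Runs the filters in order (logical AND); stops as soon as one rejects the URL. */
    public static String applyAll(List<SimpleUrlFilter> filters, String url) {
        String current = url;
        for (SimpleUrlFilter f : filters) {
            if (current == null) {
                break; // an earlier filter rejected the URL
            }
            current = f.filter(current);
        }
        return current;
    }

    public static void main(String[] args) {
        // Example rule 1 (assumption): only http and https URLs may enter.
        SimpleUrlFilter httpOnly =
            u -> (u.startsWith("http://") || u.startsWith("https://")) ? u : null;
        // Example rule 2 (assumption): skip common image suffixes.
        SimpleUrlFilter noImages =
            u -> (u.endsWith(".png") || u.endsWith(".jpg")) ? null : u;

        List<SimpleUrlFilter> filters = Arrays.asList(httpOnly, noImages);

        System.out.println(applyAll(filters, "https://example.org/index.html")); // accepted, printed unchanged
        System.out.println(applyAll(filters, "ftp://example.org/file.txt"));     // rejected, prints "null"
    }
}
```

Returning `null` for rejection keeps each rule a plain function from string to string, so rules can be combined without a separate accept/reject type.
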
Class | Description |
---|---|
URLFilterChecker | Checks one given filter or all filters. |
URLFilters | Creates and caches URLFilter implementing plugins. |
URLNormalizerChecker | Checks one given normalizer or all normalizers. |
URLNormalizers | This class uses a "chained filter" pattern to run defined normalizers. |

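
As a rough illustration of the "chained filter" pattern mentioned for URLNormalizers, the sketch below passes a URL through a list of normalizers, feeding each one the output of the previous one. It does not use the Nutch classes: the class name, `normalizeAll`, and the two example normalizers are hypothetical, and the host handling is deliberately simplified (it assumes no user-info component in the URL).

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.function.UnaryOperator;

public class UrlNormalizerChainSketch {

    /** Example normalizer (assumption): drops the fragment part, if any. */
    public static String stripFragment(String url) {
        int hash = url.indexOf('#');
        return hash < 0 ? url : url.substring(0, hash);
    }

    /** Example normalizer (assumption): lower-cases the scheme and host, leaving path and query untouched. */
    public static String lowerCaseSchemeAndHost(String url) {
        int schemeEnd = url.indexOf("://");
        if (schemeEnd < 0) {
            return url; // not a hierarchical URL, leave as is
        }
        int end = schemeEnd + 3;
        // The authority ends at the first '/', '?' or '#', or at the end of the string.
        while (end < url.length() && "/?#".indexOf(url.charAt(end)) < 0) {
            end++;
        }
        return url.substring(0, end).toLowerCase(Locale.ROOT) + url.substring(end);
    }

    /** Runs the chain in order; each normalizer receives the previous normalizer's output. */
    public static String normalizeAll(List<UnaryOperator<String>> chain, String url) {
        String current = url;
        for (UnaryOperator<String> normalizer : chain) {
            if (current == null) {
                break; // a normalizer gave up on the URL
            }
            current = normalizer.apply(current);
        }
        return current;
    }

    public static void main(String[] args) {
        List<UnaryOperator<String>> chain = new ArrayList<>();
        chain.add(UrlNormalizerChainSketch::stripFragment);
        chain.add(UrlNormalizerChainSketch::lowerCaseSchemeAndHost);

        // prints "http://example.org/Path?q=1"
        System.out.println(normalizeAll(chain, "HTTP://Example.ORG/Path?q=1#section"));
    }
}
```

Order matters in such a chain: stripping the fragment first means later steps never have to consider it.
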
Exception | Description |
---|---|
URLFilterException | |

URL filters and normalizers.

Copyright © 2015 The Apache Software Foundation