public class Subcollection extends org.apache.hadoop.conf.Configured implements URLFilter
Modifier and Type | Field and Description |
---|---|
static String |
TAG_BLACKLIST |
static String |
TAG_COLLECTION |
static String |
TAG_COLLECTIONS |
static String |
TAG_ID |
static String |
TAG_KEY |
static String |
TAG_NAME |
static String |
TAG_WHITELIST |
X_POINT_ID
Constructor and Description |
---|
Subcollection(org.apache.hadoop.conf.Configuration conf) |
Subcollection(String id,
String name,
org.apache.hadoop.conf.Configuration conf)
public Constructor
|
Subcollection(String id,
String name,
String key,
org.apache.hadoop.conf.Configuration conf)
public Constructor
|
Modifier and Type | Method and Description |
---|---|
String |
filter(String urlString)
Simple "indexOf" currentFilter for matching patterns.
|
String |
getBlackListString()
Returns blacklist String
|
String |
getId() |
String |
getKey() |
String |
getName() |
List<String> |
getWhiteList()
Returns whitelist
|
String |
getWhiteListString()
Returns whitelist String
|
void |
initialize(Element collection)
Initialize Subcollection from dom element
|
protected void |
parseList(List<String> list,
String text)
Create a list of patterns from chunk of text, patterns are separated with
newline
|
void |
setBlackList(String list)
Set contents of blacklist from String
|
void |
setWhiteList(ArrayList<String> whiteList) |
void |
setWhiteList(String list)
Set contents of whitelist from String
|
public static final String TAG_COLLECTIONS
public static final String TAG_COLLECTION
public static final String TAG_WHITELIST
public static final String TAG_BLACKLIST
public static final String TAG_NAME
public static final String TAG_KEY
public static final String TAG_ID
public Subcollection(String id, String name, org.apache.hadoop.conf.Configuration conf)
id
- id of SubCollectionname
- name of SubCollectionpublic Subcollection(String id, String name, String key, org.apache.hadoop.conf.Configuration conf)
id
- id of SubCollectionname
- name of SubCollectionpublic Subcollection(org.apache.hadoop.conf.Configuration conf)
public String getName()
public String getKey()
public String getId()
public String getWhiteListString()
public String getBlackListString()
public void setWhiteList(ArrayList<String> whiteList)
whiteList
- The whiteList to set.public String filter(String urlString)
rules for evaluation are as follows: 1. if pattern matches in blacklist then url is rejected 2. if pattern matches in whitelist then url is allowed 3. url is rejected
filter
in interface URLFilter
URLFilter.filter(java.lang.String)
public void initialize(Element collection)
collection
- protected void parseList(List<String> list, String text)
list
- text
- public void setBlackList(String list)
list
- the blacklist contentspublic void setWhiteList(String list)
list
- the whitelist contentsCopyright © 2014 The Apache Software Foundation