A
AbstractDroid<T extends Task>
- Class in
org.apache.droids
Manage common tasks in standard Droids
AbstractDroid(TaskQueue<T>, TaskMaster<T>)
- Constructor for class org.apache.droids.
AbstractDroid
accept(String)
- Method in class org.apache.droids.helper.factories.
URLFiltersFactory
Run all defined filters.
accept(String, String)
- Method in class org.apache.droids.helper.factories.
URLFiltersFactory
Run a specific filter class.
accept()
- Method in class org.apache.droids.net.
RegexRule
Returns whether this rule is used for filtering in or filtering out.
AGENT
- Variable in class org.apache.http.
PostFile
The AGENT name
awaitTermination(long, TimeUnit)
- Method in interface org.apache.droids.api.
TaskMaster
awaitTermination(long, TimeUnit)
- Method in class org.apache.droids.impl.
MultiThreadedTaskMaster
B
bufferSize
- Variable in class org.apache.droids.handle.
Save
C
clear()
- Method in class org.apache.droids.impl.
SimpleTaskQueue
clear()
- Method in class org.apache.droids.impl.
SimpleTaskQueueWithHistory
contentTypes
- Static variable in class org.apache.droids.protocol.
MediaType
Deprecated.
Officially known contentTypes
CrawlingDroid
- Class in
org.apache.droids.robot.crawler
CrawlingDroid(TaskQueue<Link>, TaskMaster<Link>)
- Constructor for class org.apache.droids.robot.crawler.
CrawlingDroid
CrawlingWorker
- Class in
org.apache.droids.robot.crawler
CrawlingWorker(CrawlingDroid)
- Constructor for class org.apache.droids.robot.crawler.
CrawlingWorker
D
delaySpread
- Variable in class org.apache.droids.delay.
RandomDelayTimer
DelayTimer
- Interface in
org.apache.droids.api
Define the timer delay interface.
Droid<T extends Task>
- Interface in
org.apache.droids.api
Interface for a droid.
DroidFactory<T extends Task>
- Class in
org.apache.droids.helper.factories
Factory that looks up a droid by its name and returns it.
DroidFactory()
- Constructor for class org.apache.droids.helper.factories.
DroidFactory
DroidsException
- Exception in
org.apache.droids.exception
Wrapper object to limit the number of different exceptions we can throw.
DroidsException(String)
- Constructor for exception org.apache.droids.exception.
DroidsException
Constructs a new exception with the specified detail message.
DroidsException(Throwable)
- Constructor for exception org.apache.droids.exception.
DroidsException
For more information, see Exception.
E
execute(T)
- Method in interface org.apache.droids.api.
Worker
execute(Link)
- Method in class org.apache.droids.robot.crawler.
CrawlingWorker
execute(FileTask)
- Method in class org.apache.droids.robot.walker.
FileWorker
existUrl(URL)
- Static method in class org.apache.droids.net.
UrlHelper
Does the url exist?
F
FileProtocol
- Class in
org.apache.droids.protocol.file
FileProtocol()
- Constructor for class org.apache.droids.protocol.file.
FileProtocol
FileTask
- Class in
org.apache.droids.robot.walker
FileTask(File, int)
- Constructor for class org.apache.droids.robot.walker.
FileTask
FileWorker
- Class in
org.apache.droids.robot.walker
FileWorker(TaskQueue<FileTask>)
- Constructor for class org.apache.droids.robot.walker.
FileWorker
filter(String)
- Method in interface org.apache.droids.api.
URLFilter
Transforms the URL: can pass the original URL through or "delete" the URL by returning null
filter(String)
- Method in class org.apache.droids.net.
RegexURLFilter
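The filter(String) contract described above (return the URL to keep it, return null to drop it) can be illustrated with a minimal, self-contained sketch. The names UrlFilterSketch, SimpleUrlFilter and accepting are hypothetical stand-ins invented for this example; they are not part of the Droids API, and the regular-expression matching shown here is an assumption about how such a filter might be written.

```java
import java.util.regex.Pattern;

/** Hypothetical illustration of the "return the URL or null" filter contract. */
final class UrlFilterSketch {

    /** Stand-in interface (assumption): same shape as a filter(String) method. */
    interface SimpleUrlFilter {
        /** Returns the URL to keep it, or null to "delete" it. */
        String filter(String url);
    }

    /** Builds a filter that keeps only URLs matching the given regular expression. */
    static SimpleUrlFilter accepting(String regex) {
        Pattern pattern = Pattern.compile(regex);
        return url -> pattern.matcher(url).find() ? url : null;
    }

    public static void main(String[] args) {
        SimpleUrlFilter filter = accepting("^https?://example\\.org/");
        // A matching URL is passed through unchanged.
        System.out.println(filter.filter("http://example.org/index.html"));
        // A non-matching URL is dropped by returning null.
        System.out.println(filter.filter("ftp://example.org/file.bin"));
    }
}
```

A caller would typically skip any URL for which the filter returns null before adding it to the queue.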
findRobotsUrl(URL)
- Static method in class org.apache.droids.net.
UrlHelper
Search the url of the robots.txt that is responsible for the given base url.
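As a rough illustration of locating the robots.txt for a base URL, the sketch below assumes the file lives at the root of the same protocol, host and port, which is the usual robots exclusion convention. RobotsUrlSketch and robotsUrlFor are hypothetical names for this example only; the actual lookup performed by UrlHelper may differ.

```java
import java.net.MalformedURLException;
import java.net.URL;

/** Hypothetical sketch: derive the robots.txt location from a base URL. */
final class RobotsUrlSketch {

    /** Keeps protocol, host and port of the base URL and replaces the path. */
    static URL robotsUrlFor(URL base) throws MalformedURLException {
        return new URL(base.getProtocol(), base.getHost(), base.getPort(), "/robots.txt");
    }

    /** Convenience wrapper for string input; wraps the checked exception. */
    static String robotsUrlFor(String base) {
        try {
            return robotsUrlFor(new URL(base)).toString();
        } catch (MalformedURLException e) {
            throw new IllegalArgumentException("Not a valid URL: " + base, e);
        }
    }

    public static void main(String[] args) {
        System.out.println(robotsUrlFor("http://example.org:8080/docs/index.html"));
    }
}
```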
finished()
- Method in interface org.apache.droids.api.
Droid
Invoked when the droid has completed.
finished()
- Method in class org.apache.droids.robot.crawler.
CrawlingDroid
finished()
- Method in class org.apache.droids.robot.walker.
SimpleWalkingDroid
from
- Variable in class org.apache.droids.protocol.
HttpBase
G
GaussianRandomDelayTimer
- Class in
org.apache.droids.delay
GaussianRandomDelayTimer()
- Constructor for class org.apache.droids.delay.
GaussianRandomDelayTimer
GaussianRandomDelayTimer(int, int)
- Constructor for class org.apache.droids.delay.
GaussianRandomDelayTimer
GenericFactory<T>
- Class in
org.apache.droids.helper.factories
Basically all factories till now extend this generic factory.
GenericFactory()
- Constructor for class org.apache.droids.helper.factories.
GenericFactory
getAnchorText()
- Method in interface org.apache.droids.api.
Link
getAnchorText()
- Method in class org.apache.droids.
LinkTask
getCompletedTasks()
- Method in interface org.apache.droids.api.
TaskMaster
getCompletedTasks()
- Method in class org.apache.droids.impl.
MultiThreadedTaskMaster
getContentType(String)
- Method in interface org.apache.droids.api.
Protocol
Returns the content type of the url
getContentType()
- Method in exception org.apache.droids.exception.
ParserNotFoundException
If not constructed via message only, it will return the content type which has caused the problem
getContentType(String)
- Method in class org.apache.droids.protocol.file.
FileProtocol
getContentType(String)
- Method in class org.apache.droids.protocol.
HttpBase
Will analyze and return the content type of the given url.
getData()
- Method in interface org.apache.droids.api.
Parse
Other data extracted from the page.
getData()
- Method in class org.apache.droids.parse.
ParseImpl
getDelayMillis()
- Method in interface org.apache.droids.api.
DelayTimer
Returns the value of the delay between requests.
getDelayMillis()
- Method in class org.apache.droids.delay.
GaussianRandomDelayTimer
getDelayMillis()
- Method in class org.apache.droids.delay.
RandomDelayTimer
getDelayMillis()
- Method in class org.apache.droids.delay.
SimpleDelayTimer
Gets the delay time.
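The delay timers listed in this index expose a getDelayMillis() method, and RandomDelayTimer carries minimumDelay and delaySpread fields. One plausible reading, shown in the sketch below, is that the delay is the minimum plus a random offset below the spread. This formula is an assumption made for illustration, and DelayTimerSketch, SimpleDelay and RandomDelay are hypothetical names, not the Droids classes.

```java
import java.util.Random;

/** Hypothetical sketch of a randomized politeness delay. */
final class DelayTimerSketch {

    /** Stand-in interface (assumption): same shape as getDelayMillis(). */
    interface SimpleDelay {
        long getDelayMillis();
    }

    /** Returns minimumDelay plus a uniform random offset in [0, delaySpread). */
    static final class RandomDelay implements SimpleDelay {
        private final int minimumDelay;
        private final int delaySpread;
        private final Random random;

        RandomDelay(int minimumDelay, int delaySpread, Random random) {
            this.minimumDelay = minimumDelay;
            this.delaySpread = delaySpread;
            this.random = random;
        }

        @Override
        public long getDelayMillis() {
            // Random.nextInt(bound) requires a positive bound, so guard a zero spread.
            int offset = delaySpread > 0 ? random.nextInt(delaySpread) : 0;
            return (long) minimumDelay + offset;
        }
    }

    public static void main(String[] args) throws InterruptedException {
        SimpleDelay delay = new RandomDelay(100, 50, new Random());
        long millis = delay.getDelayMillis();
        System.out.println("Sleeping for " + millis + " ms before the next request");
        Thread.sleep(millis);
    }
}
```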
getDelaySpread()
- Method in class org.apache.droids.delay.
RandomDelayTimer
getDelayTimer()
- Method in class org.apache.droids.impl.
MultiThreadedTaskMaster
getDepth()
- Method in interface org.apache.droids.api.
Task
getDepth()
- Method in class org.apache.droids.
LinkTask
getDepth()
- Method in class org.apache.droids.robot.walker.
FileTask
getDroid(String)
- Method in class org.apache.droids.helper.factories.
DroidFactory
Lookup a droid by its name and return it.
getElements()
- Method in class org.apache.droids.parse.html.
HtmlParser
getEventFactory()
- Method in class org.apache.droids.helper.
StAX
Get the ready-to-use EventFactory
getEventParser(InputStream)
- Method in class org.apache.droids.helper.
StAX
Get an event Parser based on the incoming stream
getFile()
- Method in class org.apache.droids.robot.walker.
FileTask
getFilteredOutlinks(Parse)
- Method in class org.apache.droids.robot.crawler.
CrawlingWorker
getFiltersFactory()
- Method in class org.apache.droids.robot.crawler.
CrawlingDroid
getFinishedWorking()
- Method in interface org.apache.droids.api.
TaskMaster
getFinishedWorking()
- Method in class org.apache.droids.impl.
MultiThreadedTaskMaster
getFrom()
- Method in interface org.apache.droids.api.
Link
From where the link was created
getFrom()
- Method in class org.apache.droids.
LinkTask
getFrom()
- Method in class org.apache.droids.protocol.
HttpBase
Returns the email address of the bot.
getHandlerFactory()
- Method in class org.apache.droids.robot.crawler.
CrawlingDroid
getId()
- Method in interface org.apache.droids.api.
Task
The id of the task.
getId()
- Method in class org.apache.droids.
LinkTask
getId()
- Method in class org.apache.droids.robot.walker.
FileTask
getLastCompletedTask()
- Method in interface org.apache.droids.api.
TaskMaster
getLastCompletedTask()
- Method in class org.apache.droids.impl.
MultiThreadedTaskMaster
getLastModifedDate()
- Method in class org.apache.droids.
LinkTask
getLastModifiedDate()
- Method in interface org.apache.droids.api.
Link
last modified date
getLastModifiedDate()
- Method in class org.apache.droids.
LinkTask
getMap()
- Method in class org.apache.droids.helper.factories.
GenericFactory
Get the register which contains all components.
getMaxDepth()
- Method in class org.apache.droids.impl.
SimpleTaskValidator
getMaxSize()
- Method in class org.apache.droids.impl.
SimpleTaskQueue
getMaxThreads()
- Method in class org.apache.droids.impl.
MultiThreadedTaskMaster
Get number of maximum allowed threads
getMinimumDelay()
- Method in class org.apache.droids.delay.
RandomDelayTimer
getNewWorker()
- Method in interface org.apache.droids.api.
Droid
Ask the droid for a new worker
getNewWorker()
- Method in class org.apache.droids.robot.crawler.
CrawlingDroid
getNewWorker()
- Method in class org.apache.droids.robot.walker.
SimpleWalkingDroid
getNewWorker()
- Method in interface org.apache.droids.robot.walker.
WalkingDroid
getOutlinks()
- Method in class org.apache.droids.
ParseData
Get the outlinks of the page.
getOutputDir()
- Method in class org.apache.droids.handle.
Save
Get the directory where we want to save the stream.
getParse(InputStream, Link)
- Method in interface org.apache.droids.api.
Parser
Creates the parse for some content.
getParse(InputStream, Link)
- Method in class org.apache.droids.parse.html.
HtmlParser
getParser(String)
- Method in class org.apache.droids.helper.factories.
ParserFactory
Lookup a parser by its identifier (content type) and return it.
getParser(InputStream)
- Method in class org.apache.droids.helper.
StAX
Get a stream Parser based on the incoming stream
getParserFactory()
- Method in class org.apache.droids.robot.crawler.
CrawlingDroid
getProtocol(String)
- Method in class org.apache.droids.helper.factories.
ProtocolFactory
Will lookup a protocol based on the underlying uri
getProtocolFactory()
- Method in class org.apache.droids.robot.crawler.
CrawlingDroid
getQueue()
- Method in class org.apache.droids.
AbstractDroid
getRefer()
- Method in class org.apache.droids.protocol.
HttpBase
Return the refer URI that the bot is sent from.
getResponseBodyAsStream()
- Method in class org.apache.http.
PostFile
getSize()
- Method in interface org.apache.droids.api.
TaskQueue
How many tasks do we have left in the queue.
getSize()
- Method in class org.apache.droids.impl.
SimpleTaskQueue
getsolrBase()
- Method in class org.apache.http.
PostFile
getSrc()
- Method in class org.apache.http.
PostFile
getStartTime()
- Method in interface org.apache.droids.api.
TaskMaster
getStartTime()
- Method in class org.apache.droids.impl.
MultiThreadedTaskMaster
getStreamWriter(OutputStream)
- Method in class org.apache.droids.helper.
StAX
Get a stream writer based on the incoming stream
getTask(String)
- Method in interface org.apache.droids.api.
TaskQueue
Return the task that is identified with the given id
getTask(String)
- Method in class org.apache.droids.impl.
SimpleTaskQueue
getTaskDate()
- Method in interface org.apache.droids.api.
Task
When was the task created
getTaskDate()
- Method in class org.apache.droids.
LinkTask
getTaskDate()
- Method in class org.apache.droids.robot.walker.
FileTask
getTaskMaster()
- Method in class org.apache.droids.
AbstractDroid
getTaskMaster()
- Method in interface org.apache.droids.api.
Droid
Get the task master
getTaskValidator()
- Method in class org.apache.droids.impl.
SimpleTaskQueue
getText()
- Method in interface org.apache.droids.api.
Parse
The textual content of the page.
getText()
- Method in class org.apache.droids.parse.
ParseImpl
getTimeout()
- Method in class org.apache.droids.protocol.
HttpBase
Get the timeout we want for the connection.
getTo()
- Method in interface org.apache.droids.api.
Link
Where the link is pointing to
getTo()
- Method in class org.apache.droids.
LinkTask
getURI()
- Method in interface org.apache.droids.api.
Link
getURI()
- Method in class org.apache.droids.
LinkTask
getUrl()
- Method in exception org.apache.droids.exception.
ParserNotFoundException
If not constructed via message only it will return the url which has caused the problem
getUrl()
- Method in exception org.apache.droids.exception.
ProtocolNotFoundException
Will return the url which has caused the problem
getUrlPrefix(URL)
- Static method in class org.apache.droids.net.
UrlHelper
Creating a valid protocol prefix.
getUserAgent()
- Method in class org.apache.droids.protocol.
HttpBase
Get the name of our UserAgent
getWriter(OutputStream)
- Method in class org.apache.droids.helper.
StAX
Get an event writer based on the incoming stream
H
handle(InputStream, URL, Parse)
- Method in interface org.apache.droids.api.
Handler
handle(InputStream, URL, Parse)
- Method in class org.apache.droids.handle.
Save
handle(InputStream, URL, Parse)
- Method in class org.apache.droids.handle.
Sysout
handle(InputStream, URL, Parse)
- Method in class org.apache.droids.helper.factories.
HandlerFactory
Will traverse all registered handlers and execute them.
handle(Parse, InputStream, Link)
- Method in class org.apache.droids.robot.crawler.
CrawlingWorker
Handler
- Interface in
org.apache.droids.api
A handler is a component that uses the stream, the parse and url to invoke arbitrary business logic on the objects.
HandlerFactory
- Class in
org.apache.droids.helper.factories
Factory that will traverse all registered handlers and execute them.
HandlerFactory()
- Constructor for class org.apache.droids.helper.factories.
HandlerFactory
hasNext()
- Method in interface org.apache.droids.api.
TaskQueue
Do we have more tasks waiting for service
hasNext()
- Method in class org.apache.droids.impl.
SimpleTaskQueue
HtmlParser
- Class in
org.apache.droids.parse.html
HtmlParser()
- Constructor for class org.apache.droids.parse.html.
HtmlParser
Http
- Class in
org.apache.droids.protocol.http
Simple implementation for http protocol.
Http()
- Constructor for class org.apache.droids.protocol.http.
Http
HttpBase
- Class in
org.apache.droids.protocol
Helper class that provides basic methods like returning the agent string and content type.
HttpBase()
- Constructor for class org.apache.droids.protocol.
HttpBase
I
init()
- Method in interface org.apache.droids.api.
Droid
Initialize the queue.
init()
- Method in class org.apache.droids.robot.crawler.
CrawlingDroid
init()
- Method in class org.apache.droids.robot.walker.
SimpleWalkingDroid
InvalidTaskException
- Exception in
org.apache.droids.exception
InvalidTaskException(String)
- Constructor for exception org.apache.droids.exception.
InvalidTaskException
isAllowed(String)
- Method in interface org.apache.droids.api.
Protocol
Some protocols (like http) offer a mechanism to evaluate whether the client can request a given url (in http this is the robots.txt configuration)
isAllowed(String)
- Method in interface org.apache.droids.norobots.
Rule
Boolean.TRUE means it is allowed.
isAllowed(String)
- Method in class org.apache.droids.protocol.file.
FileProtocol
isAllowed(String)
- Method in class org.apache.droids.protocol.http.
Http
isContentType(String)
- Static method in class org.apache.droids.protocol.
MediaType
Deprecated.
Test whether a given type is in our array of known media types.
isForceAllow()
- Method in class org.apache.droids.protocol.http.
Http
You can force that a site is allowed (ignoring the robots.txt).
isIncludeHost()
- Method in class org.apache.droids.handle.
Save
Do we want to prefix the export dir with the host name.
isUrlAllowed(URL)
- Method in class org.apache.droids.norobots.
NoRobotClient
Decide if the parsed website will allow this URL to be seen.
L
Link
- Interface in
org.apache.droids.api
Simple extension of a Task.
LinkTask
- Class in
org.apache.droids
LinkTask(Link, URI, int)
- Constructor for class org.apache.droids.
LinkTask
log
- Variable in class org.apache.droids.handle.
WriterHandler
log
- Variable in class org.apache.droids.helper.factories.
GenericFactory
log
- Variable in class org.apache.droids.helper.
Loggable
Loggable
- Class in
org.apache.droids.helper
Simple wrapper class for easier debugging and logging.
Loggable()
- Constructor for class org.apache.droids.helper.
Loggable
M
main(String[])
- Static method in class org.apache.droids.robot.crawler.
CrawlingDroid
main(String[])
- Static method in class org.apache.droids.robot.walker.
SimpleWalkingDroid
match(String)
- Method in class org.apache.droids.net.
RegexRule
Checks if a url matches this rule.
MediaType
- Class in
org.apache.droids.protocol
Deprecated.
Soon to be replaced via the tika mediaType support
merge(T)
- Method in interface org.apache.droids.api.
TaskQueue
Add a Task to the queue...
merge(Collection<? extends T>)
- Method in interface org.apache.droids.api.
TaskQueue
Merge a given list of tasks with the current queue.
merge(T)
- Method in class org.apache.droids.impl.
SimpleTaskQueue
merge(Collection<? extends T>)
- Method in class org.apache.droids.impl.
SimpleTaskQueue
Tasks with duplicate keys will be overwritten
merge(T)
- Method in class org.apache.droids.impl.
SimpleTaskQueueWithHistory
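Two merge behaviours are described in this index: SimpleTaskQueue overwrites tasks with duplicate keys, and SimpleTaskQueueWithHistory ignores tasks it has already seen. The sketch below illustrates both ideas under stated assumptions: tasks are keyed by their id, and the history is a set of ids. TaskQueueSketch, SimpleTask, KeyedQueue and KeyedQueueWithHistory are hypothetical names for this example and do not reproduce the Droids implementation.

```java
import java.util.HashSet;
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;

/** Hypothetical sketch of keyed task queues with and without history. */
final class TaskQueueSketch {

    /** Minimal task stand-in (assumption): an id plus a crawl depth. */
    static final class SimpleTask {
        final String id;
        final int depth;

        SimpleTask(String id, int depth) {
            this.id = id;
            this.depth = depth;
        }
    }

    /** Queue keyed by task id: merging a duplicate id overwrites the older task. */
    static class KeyedQueue {
        private final Map<String, SimpleTask> tasks = new LinkedHashMap<>();

        void merge(SimpleTask task) {
            tasks.put(task.id, task);
        }

        boolean hasNext() {
            return !tasks.isEmpty();
        }

        /** Removes and returns the oldest waiting task, or null if the queue is empty. */
        SimpleTask next() {
            Iterator<SimpleTask> it = tasks.values().iterator();
            if (!it.hasNext()) {
                return null;
            }
            SimpleTask task = it.next();
            it.remove();
            return task;
        }

        int getSize() {
            return tasks.size();
        }
    }

    /** Variant that remembers every id it has accepted and ignores repeats. */
    static final class KeyedQueueWithHistory extends KeyedQueue {
        private final Set<String> seen = new HashSet<>();

        @Override
        void merge(SimpleTask task) {
            // Set.add returns false when the id was already recorded.
            if (seen.add(task.id)) {
                super.merge(task);
            }
        }
    }

    public static void main(String[] args) {
        KeyedQueue queue = new KeyedQueueWithHistory();
        queue.merge(new SimpleTask("http://example.org/", 0));
        queue.next();
        queue.merge(new SimpleTask("http://example.org/", 1)); // ignored: already seen
        System.out.println("Tasks waiting: " + queue.getSize());
    }
}
```

The history variant trades memory for the guarantee that a crawl does not revisit the same id, which matters when pages link back to each other.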
minimumDelay
- Variable in class org.apache.droids.delay.
RandomDelayTimer
MultiThreadedTaskMaster<T extends Task>
- Class in
org.apache.droids.impl
MultiThreadedTaskMaster()
- Constructor for class org.apache.droids.impl.
MultiThreadedTaskMaster
N
next()
- Method in interface org.apache.droids.api.
TaskQueue
Return the next task that is waiting for service
next()
- Method in class org.apache.droids.impl.
SimpleTaskQueue
NoRobotClient
- Class in
org.apache.droids.norobots
A Client which may be used to decide which urls on a website may be looked at, according to the norobots specification located at: http://www.robotstxt.org/wc/norobots-rfc.html
NoRobotClient(String)
- Constructor for class org.apache.droids.norobots.
NoRobotClient
Create a Client for a particular user-agent name.
NoRobotException
- Exception in
org.apache.droids.norobots
Application exception for anything that might go wrong in the checking of a robots.txt file.
NoRobotException(String)
- Constructor for exception org.apache.droids.norobots.
NoRobotException
NoRobotException(String, Throwable)
- Constructor for exception org.apache.droids.norobots.
NoRobotException
O
openStream(String)
- Method in interface org.apache.droids.api.
Protocol
Return the stream representation of the url
openStream(String)
- Method in class org.apache.droids.protocol.file.
FileProtocol
openStream(String)
- Method in class org.apache.droids.protocol.http.
Http
org.apache.droids
- package org.apache.droids
This package is the principal package for Apache Droids.
org.apache.droids.api
- package org.apache.droids.api
This package defines all interfaces that we are using for droids.
org.apache.droids.delay
- package org.apache.droids.delay
This package is the principal package for Apache Droids Delay Timers.
org.apache.droids.exception
- package org.apache.droids.exception
This package defines some custom exceptions that we are using in droids.
org.apache.droids.handle
- package org.apache.droids.handle
This package contains some basic implementations of various handlers.
org.apache.droids.helper
- package org.apache.droids.helper
This package contains various helpers.
org.apache.droids.helper.factories
- package org.apache.droids.helper.factories
This package contains all core factories that we use in Droids.
org.apache.droids.impl
- package org.apache.droids.impl
org.apache.droids.net
- package org.apache.droids.net
This package contains various helpers for working with protocols and network communication.
org.apache.droids.norobots
- package org.apache.droids.norobots
Using norobots-rfc
org.apache.droids.parse
- package org.apache.droids.parse
This package contains various helpers and implementations around parsing.
org.apache.droids.parse.html
- package org.apache.droids.parse.html
This package contains various parsers.
org.apache.droids.protocol
- package org.apache.droids.protocol
This package contains various classes that support protocol-specific implementations.
org.apache.droids.protocol.file
- package org.apache.droids.protocol.file
This package contains various file protocol implementations.
org.apache.droids.protocol.http
- package org.apache.droids.protocol.http
This package contains various http protocol implementations.
org.apache.droids.robot.crawler
- package org.apache.droids.robot.crawler
org.apache.droids.robot.walker
- package org.apache.droids.robot.walker
org.apache.http
- package org.apache.http
P
Parse
- Interface in
org.apache.droids.api
Wrapper object that encapsulates the result of the parsing of the underlying document.
parse(URL)
- Method in class org.apache.droids.norobots.
NoRobotClient
Head to a website and suck in their robots.txt file.
ParseData
- Class in
org.apache.droids
The result object that is filled by a parser
ParseData(Collection<Link>)
- Constructor for class org.apache.droids.
ParseData
Create a new instance of Parse data for the given outlinks
ParseImpl
- Class in
org.apache.droids.parse
Default implementation of Parse
ParseImpl(String, ParseData)
- Constructor for class org.apache.droids.parse.
ParseImpl
Create a new instance of a Parse for the given text and ParseData
Parser
- Interface in
org.apache.droids.api
Simple parser whose only requirement is to return a parse object.
ParserFactory
- Class in
org.apache.droids.helper.factories
Factory that will lookup a parser by its identifier and return it.
ParserFactory()
- Constructor for class org.apache.droids.helper.factories.
ParserFactory
ParserNotFoundException
- Exception in
org.apache.droids.exception
ParserNotFoundException gives a detailed exception for problems that can occur while parsing a task.
ParserNotFoundException(String, String)
- Constructor for exception org.apache.droids.exception.
ParserNotFoundException
Create an exception for the given url and content type
ParserNotFoundException(String, String, String)
- Constructor for exception org.apache.droids.exception.
ParserNotFoundException
Create an exception for the given url and content type
ParserNotFoundException(String)
- Constructor for exception org.apache.droids.exception.
ParserNotFoundException
Constructs a new exception with the specified detail message.
parseText(String)
- Method in class org.apache.droids.norobots.
NoRobotClient
pipe(Reader, Writer)
- Static method in class org.apache.droids.handle.
WriterHandler
Pipes everything from the reader to the writer via a buffer
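Piping a reader to a writer via a buffer, as described above, usually comes down to a read and write loop over a fixed-size char array. The sketch below shows that general technique; PipeSketch, its three-argument pipe method and the pipeToString helper are hypothetical names for this example, and the explicit bufferSize parameter is an assumption rather than the signature of WriterHandler.pipe.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.StringWriter;
import java.io.UncheckedIOException;
import java.io.Writer;

/** Hypothetical sketch of copying characters from a reader to a writer. */
final class PipeSketch {

    /** Copies everything from the reader to the writer in bufferSize chunks. */
    static void pipe(Reader reader, Writer writer, int bufferSize) throws IOException {
        char[] buffer = new char[bufferSize];
        int read;
        // read() returns -1 once the end of the stream is reached.
        while ((read = reader.read(buffer)) != -1) {
            writer.write(buffer, 0, read);
        }
        writer.flush();
    }

    /** Convenience helper for in-memory use; wraps the checked exception. */
    static String pipeToString(String input, int bufferSize) {
        StringWriter out = new StringWriter();
        try {
            pipe(new StringReader(input), out, bufferSize);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(pipeToString("hello droids", 4));
    }
}
```

Writing only the first `read` characters of the buffer on each pass is what keeps a short final chunk from carrying stale data from the previous pass.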
post(String, String)
- Method in class org.apache.http.
PostFile
PostFile
- Class in
org.apache.http
PostFile(String, InputStream)
- Constructor for class org.apache.http.
PostFile
prepareConnection(URL)
- Method in class org.apache.droids.protocol.
HttpBase
Will prepare a HttpURLConnection with the userAgent, from, the refer and the timeout
processAllTasks(TaskQueue<T>, Droid<T>)
- Method in interface org.apache.droids.api.
TaskMaster
processAllTasks(TaskQueue<T>, Droid<T>)
- Method in class org.apache.droids.impl.
MultiThreadedTaskMaster
The queue has been initialized
Protocol
- Interface in
org.apache.droids.api
The protocol interface is a wrapper to hide the underlying implementation of the communication at protocol level.
ProtocolFactory
- Class in
org.apache.droids.helper.factories
Factory that will lookup a protocol plugin and return it.
ProtocolFactory()
- Constructor for class org.apache.droids.helper.factories.
ProtocolFactory
ProtocolNotFoundException
- Exception in
org.apache.droids.exception
If we do not have any instance of a protocol registered for the given url.
ProtocolNotFoundException(String)
- Constructor for exception org.apache.droids.exception.
ProtocolNotFoundException
Create an exception for the given url
ProtocolNotFoundException(String, String)
- Constructor for exception org.apache.droids.exception.
ProtocolNotFoundException
Create an exception for the given url and detailed message
Q
queue
- Variable in class org.apache.droids.
AbstractDroid
R
random
- Variable in class org.apache.droids.delay.
RandomDelayTimer
RandomDelayTimer
- Class in
org.apache.droids.delay
RandomDelayTimer()
- Constructor for class org.apache.droids.delay.
RandomDelayTimer
RandomDelayTimer(int, int)
- Constructor for class org.apache.droids.delay.
RandomDelayTimer
refer
- Variable in class org.apache.droids.protocol.
HttpBase
RegexRule
- Class in
org.apache.droids.net
A generic regular expression rule.
RegexRule(boolean)
- Constructor for class org.apache.droids.net.
RegexRule
Constructs a new regular expression rule.
RegexURLFilter
- Class in
org.apache.droids.net
Regular expression implementation of a URLFilter.
RegexURLFilter()
- Constructor for class org.apache.droids.net.
RegexURLFilter
resolve(String)
- Method in class org.apache.droids.helper.factories.
GenericFactory
Will lookup which component is linked to the name and will return it.
Rule
- Interface in
org.apache.droids.norobots
A robots.txt rule.
S
Save
- Class in
org.apache.droids.handle
Handler which is writing the stream to the file system.
Save()
- Constructor for class org.apache.droids.handle.
Save
setAnchorText(String)
- Method in class org.apache.droids.
LinkTask
setDelayMillis(long)
- Method in class org.apache.droids.delay.
SimpleDelayTimer
Sets the delay time.
setDelaySpread(int)
- Method in class org.apache.droids.delay.
RandomDelayTimer
setDelayTimer(DelayTimer)
- Method in class org.apache.droids.impl.
MultiThreadedTaskMaster
setElements(Map<String, String>)
- Method in class org.apache.droids.parse.html.
HtmlParser
setFile(String)
- Method in class org.apache.droids.net.
RegexURLFilter
setFiltersFactory(URLFiltersFactory)
- Method in class org.apache.droids.robot.crawler.
CrawlingDroid
setForceAllow(boolean)
- Method in class org.apache.droids.protocol.http.
Http
You can force that a site is allowed (ignoring the robots.txt).
setFrom(String)
- Method in class org.apache.droids.protocol.
HttpBase
Set the email address of the bot.
setHandlerFactory(HandlerFactory)
- Method in class org.apache.droids.robot.crawler.
CrawlingDroid
setIncludeHost(boolean)
- Method in class org.apache.droids.handle.
Save
Do we want to prefix the export dir with the host name.
setInitialFiles(Collection<File>)
- Method in class org.apache.droids.robot.walker.
SimpleWalkingDroid
setInitialFiles(Collection<File>)
- Method in interface org.apache.droids.robot.walker.
WalkingDroid
setInitialLocations(Collection<String>)
- Method in class org.apache.droids.robot.crawler.
CrawlingDroid
setLastModifedDate(Date)
- Method in class org.apache.droids.
LinkTask
setLinksTo(Collection<URI>)
- Method in class org.apache.droids.
LinkTask
setMap(Map)
- Method in class org.apache.droids.helper.factories.
GenericFactory
Set the register which contains all components.
setMaxDepth(int)
- Method in class org.apache.droids.impl.
SimpleTaskValidator
setMaxSize(int)
- Method in class org.apache.droids.impl.
SimpleTaskQueue
setMaxThreads(int)
- Method in class org.apache.droids.impl.
MultiThreadedTaskMaster
Adjust number of allowed threads
setMinimumDelay(int)
- Method in class org.apache.droids.delay.
RandomDelayTimer
setOutputDir(String)
- Method in class org.apache.droids.handle.
Save
Set the directory where we want to save the stream.
setParserFactory(ParserFactory)
- Method in class org.apache.droids.robot.crawler.
CrawlingDroid
setProtocolFactory(ProtocolFactory)
- Method in class org.apache.droids.robot.crawler.
CrawlingDroid
setRefer(String)
- Method in class org.apache.droids.protocol.
HttpBase
Set the refer URI that the bot is sent from.
setTaskValidator(TaskValidator<T>)
- Method in class org.apache.droids.impl.
SimpleTaskQueue
setTimeout(int)
- Method in class org.apache.droids.protocol.
HttpBase
Set the timeout we want for the connection.
setUserAgent(String)
- Method in class org.apache.droids.protocol.
HttpBase
Set the name of our UserAgent
shutdownAndAwaitTermination()
- Method in class org.apache.droids.impl.
MultiThreadedTaskMaster
Shutdown all threads, close the pools and leave.
SimpleDelayTimer
- Class in
org.apache.droids.delay
SimpleDelayTimer()
- Constructor for class org.apache.droids.delay.
SimpleDelayTimer
SimpleDelayTimer(long)
- Constructor for class org.apache.droids.delay.
SimpleDelayTimer
SimpleTaskQueue<T extends Task>
- Class in
org.apache.droids.impl
A simple
SimpleTaskQueue()
- Constructor for class org.apache.droids.impl.
SimpleTaskQueue
Simple queue constructor.
SimpleTaskQueueWithHistory<T extends Task>
- Class in
org.apache.droids.impl
Extend the task queue to ignore any tasks we have already seen
SimpleTaskQueueWithHistory()
- Constructor for class org.apache.droids.impl.
SimpleTaskQueueWithHistory
Simple queue constructor.
SimpleTaskValidator<T extends Task>
- Class in
org.apache.droids.impl
A simple
SimpleTaskValidator()
- Constructor for class org.apache.droids.impl.
SimpleTaskValidator
SimpleTaskValidator(int)
- Constructor for class org.apache.droids.impl.
SimpleTaskValidator
SimpleWalkingDroid
- Class in
org.apache.droids.robot.walker
SimpleWalkingDroid(TaskQueue<FileTask>, TaskMaster<FileTask>)
- Constructor for class org.apache.droids.robot.walker.
SimpleWalkingDroid
start()
- Method in class org.apache.droids.
AbstractDroid
start()
- Method in interface org.apache.droids.api.
Droid
Invoke an instance of the worker used in the droid
start()
- Method in class org.apache.droids.robot.crawler.
CrawlingDroid
statusCode()
- Method in class org.apache.http.
PostFile
StAX
- Class in
org.apache.droids.helper
Helper class that eases the usage of StAX in your plugins.
StAX()
- Constructor for class org.apache.droids.helper.
StAX
Easy helper to get StAX based parser and writer.
Sysout
- Class in
org.apache.droids.handle
Handler that writes the stream to the sysout.
Sysout()
- Constructor for class org.apache.droids.handle.
Sysout
T
Task
- Interface in
org.apache.droids.api
A task is a working instruction for a droid.
taskMaster
- Variable in class org.apache.droids.
AbstractDroid
TaskMaster<T extends Task>
- Interface in
org.apache.droids.api
Responsible for running all the tasks
TaskQueue<T extends Task>
- Interface in
org.apache.droids.api
A queue is the data structure where the different tasks are waiting for service.
TaskValidator<T extends Task>
- Interface in
org.apache.droids.api
timeout
- Variable in class org.apache.droids.protocol.
HttpBase
U
URLFilter
- Interface in
org.apache.droids.api
Filter to limit the urls that we want to allow in our queue.
URLFiltersFactory
- Class in
org.apache.droids.helper.factories
Factory that will traverse all registered filters and execute them.
URLFiltersFactory()
- Constructor for class org.apache.droids.helper.factories.
URLFiltersFactory
UrlHelper
- Class in
org.apache.droids.net
Helper class that offers a couple of methods to work with urls
userAgent
- Variable in class org.apache.droids.protocol.
HttpBase
V
validateTask(T)
- Method in interface org.apache.droids.api.
TaskValidator
This will take a task and make sure it is valid.
validateTask(T)
- Method in class org.apache.droids.impl.
SimpleTaskQueue
validateTask(T)
- Method in class org.apache.droids.impl.
SimpleTaskValidator
W
WalkingDroid
- Interface in
org.apache.droids.robot.walker
Worker<T extends Task>
- Interface in
org.apache.droids.api
A worker is the unit that is doing the actual work.
WriterHandler
- Class in
org.apache.droids.handle
Wrapper that allows you to pipe a stream from a reader to a writer via a buffer
WriterHandler()
- Constructor for class org.apache.droids.handle.
WriterHandler
Copyright © 2008 The Apache Software Foundation