All Classes
-
All Classes Interface Summary Class Summary Exception Summary Class Description AbortChecker This class furnishes an abort signal whenever the job activity says it should.AuthenticationCredentials This interface describes immutable classes which represents authentication information for all kinds of authentication.CookieManager This class manages the database table into which we write cookies.CookieManager.CookiesCacheClass Cache class for robots.CookieManager.CookiesDescription This is the object description for a session key object.CookieManager.CookiesExecutor This is the executor object for locating cookies session objects.CookieManager.DynamicCookieSet This is a set of cookies, built dynamically.CookieSet This class represents a bunch of cookiesCredentialsDescription This class describes credential information pulled from a configuration.CredentialsDescription.BasicCredential Basic type credentialsCredentialsDescription.CredentialsItem Class representing an individual credential item.CredentialsDescription.LoginParameterIterator LoginParameter iteratorCredentialsDescription.NTLMCredential NTLM-style credentialsCredentialsDescription.SessionCredential Session credentialsCredentialsDescription.SessionCredentialItem Session credential helper classCredentialsDescription.SessionCredentialParameter Session credential parameter classDataCache This class is a cache of a specific URL's data.DataCache.DocumentData This class represents everything we need to know about a document that's getting passed from the getDocumentVersions() phase to the processDocuments() phase.DNSManager This class manages the database table into which we DNS entries for hosts.DNSManager.DNSCacheClass Cache class for robots.DNSManager.DNSInfo This is a cached data item.DNSManager.HostDescription This is the object description for a robots host object.DNSManager.HostExecutor This is the executor object for locating robots host objects.FindContentHandler This class is the handler for HTML content grepping during state transitionsFindHandler This class is used to discover links in a session login contextFindHTMLFormHandler This class is the handler for HTML form parsing during state transitionsFindHTMLHrefHandler This class is the handler for HTML parsing during state transitionsFindPreferredRedirectionHandler This class is the handler for redirection handling during state transitionsFindRedirectionHandler This class is the handler for redirection parsing during state transitionsFormData This interface describes the form data gleaned from an HTML page.FormDataAccumulator This class accumulates form data and allows overridesFormDataAccumulator.FormItemIterator Iterator over FormItemsFormDataElement This interface describes individual form data elements, for form submission.FormItem This class provides an individual data itemFormParseState This class interprets the tag stream generated by the BasicParseState class, and keeps track of the form tags.IDiscoveredLinkHandler This interface describes the functionality needed by a link extractor to note a discovered link.IHTMLHandler This interface describes the functionality needed by an HTML processor in order to handle an HTML document.IMetaTagHandler This interface describes the functionality needed by a parser to handle metadata tags.IRedirectionHandler This interface describes the functionality needed by an redirection processor in order to handle a redirection.IThrottledConnection This interface represents an established connection to a URL.IXMLHandler This interface describes the functionality needed by an XML processor in order to handle an XML document.LinkParseState This class recognizes and interprets all linksLoginCookies This interface describes cookies obtained during sequential authentication.LoginParameters This interface describes login parameters to be used to submit a page during sequential authentication.Messages MetaParseState This class recognizes and interprets all meta tagsPageCredentials This interface describes immutable classes which represents authentication information for page-based authentication.RobotsManager This class manages the database table into which we write robots.txt files for hosts.RobotsManager.HostDescription This is the object description for a robots host object.RobotsManager.HostExecutor This is the executor object for locating robots host objects.RobotsManager.Record This class represents a record in a robots.txt file.RobotsManager.RobotsCacheClass Cache class for robots.RobotsManager.RobotsData This is a cached data item.ScriptParseState This class interprets the tag stream generated by the HTMLParseState class, and causes script sections to be skippedSequenceCredentials This interface describes immutable classes which represents authentication information for sequence-based authentication.ThrottleDescription This class describes complex throttling criteria pulled from a configuration.ThrottleDescription.ThrottleItem Class representing an individual throttle item.ThrottledFetcher This class uses httpclient to fetch stuff from webservers.ThrottledFetcher.ConnectionPool Each connection pool has identical connections we can draw on.ThrottledFetcher.ConnectionPoolKey Connection pool keyThrottledFetcher.ExecuteMethodThread This thread does the actual socket communication with the server.ThrottledFetcher.LaxBrowserCompatSpecProvider Class to create a cookie spec.ThrottledFetcher.OurBasicCookieStore ThrottledFetcher.PoolException Pool exception classThrottledFetcher.ThrottledConnection Throttled connections.ThrottledFetcher.ThrottledInputstream This class throttles an input stream based on the specified byte rate parameters.ThrottledFetcher.WaitException Wait exception classTrustsDescription This class describes trust information pulled from a configuration.TrustsDescription.TrustsItem Class representing an individual credential item.WebcrawlerConfig Constants for the Webcrawler connector configuration.WebcrawlerConnector This is the Web Crawler implementation of the IRepositoryConnector interface.WebcrawlerConnector.CanonicalizationPolicies Class representing a list of canonicalization rulesWebcrawlerConnector.CanonicalizationPolicy Class representing a URL regular expression match, for the purposes of determining canonicalization policyWebcrawlerConnector.EvaluatorToken Evaluator token.WebcrawlerConnector.EvaluatorTokenStream Token stream.WebcrawlerConnector.FetchStatus WebcrawlerConnector.MappingRule Class representing a mapping ruleWebcrawlerConnector.MappingRules Class that represents all mappingsWebcrawlerConnector.NameValue Name/value classWebURL Replacement class for java.net.URI, which is broken in many ways.