ir.webutils
Class HTMLPageRetriever

java.lang.Object
  extended by ir.webutils.HTMLPageRetriever
Direct Known Subclasses:
SafeHTMLPageRetriever

public class HTMLPageRetriever
extends java.lang.Object

HTMLPageRetriever lets clients download web pages from URLs. This is the default implementation: it downloads the page at the given URL and performs no other processing. The class maintains no state, so subclasses have no inherited fields to initialize or manage.
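
Because the class holds no state, a subclass can usually just override getHTMLPage and defer to the parent for the actual download. The following is a minimal sketch of that pattern, not code from the library. The Link, HTMLPage, PathDisallowedException and HTMLPageRetriever declarations are simplified stand-ins so the example compiles on its own; only the getHTMLPage signature is taken from this page. The constructors, the getUrl and getText accessors, the checked-exception assumption, the PrivatePathFilteringRetriever class and its "/private/" rule are all hypothetical.

```java
// Stand-ins for the ir.webutils types. Only the getHTMLPage signature comes
// from this page; constructors, accessors and bodies are assumptions.
class Link {
    private final String url;
    Link(String url) { this.url = url; }
    String getUrl() { return url; }
}

class HTMLPage {
    private final Link link;
    private final String text;
    HTMLPage(Link link, String text) { this.link = link; this.text = text; }
    Link getLink() { return link; }
    String getText() { return text; }
}

// Assumed here to be a checked exception.
class PathDisallowedException extends Exception {
    PathDisallowedException(String message) { super(message); }
}

class HTMLPageRetriever {
    // Stand-in body: returns canned text instead of downloading anything.
    public HTMLPage getHTMLPage(Link link) throws PathDisallowedException {
        return new HTMLPage(link, "<html>stub for " + link.getUrl() + "</html>");
    }
}

// Hypothetical subclass: refuses links under "/private/", otherwise defers
// to the parent. It declares no fields and no constructor of its own.
class PrivatePathFilteringRetriever extends HTMLPageRetriever {
    @Override
    public HTMLPage getHTMLPage(Link link) throws PathDisallowedException {
        if (link.getUrl().contains("/private/")) {
            throw new PathDisallowedException("Disallowed path: " + link.getUrl());
        }
        return super.getHTMLPage(link);
    }
}
```

Since the subclass adds no fields, a single instance could be shared wherever an HTMLPageRetriever is expected.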


Constructor Summary
HTMLPageRetriever()
          Constructs an HTMLPageRetriever object.
 
Method Summary
 HTMLPage getHTMLPage(Link link)
          Downloads a web page from a given URL.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

HTMLPageRetriever

public HTMLPageRetriever()
Constructs an HTMLPageRetriever object. Subclasses that want to behave as singletons do not need to override this constructor.

Method Detail

getHTMLPage

public HTMLPage getHTMLPage(Link link)
                     throws PathDisallowedException
Downloads a web page from a given URL.

Parameters:
link - The Link with the URL to download the page from.
Returns:
An HTML page representing the page downloaded from the Link.
Throws:
PathDisallowedException
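
A caller has to account for PathDisallowedException when invoking getHTMLPage. The sketch below shows one way to do that, assuming the exception is checked. As before, Link, HTMLPage, PathDisallowedException and HTMLPageRetriever are simplified stand-ins with assumed constructors and accessors so the block compiles by itself; RetrieverDemo and tryFetch are hypothetical names, not part of ir.webutils.

```java
import java.util.Optional;

// Stand-ins for the ir.webutils types. Only the getHTMLPage signature comes
// from this page; everything else is an assumption.
class Link {
    private final String url;
    Link(String url) { this.url = url; }
    String getUrl() { return url; }
}

class HTMLPage {
    private final Link link;
    HTMLPage(Link link) { this.link = link; }
    Link getLink() { return link; }
}

class PathDisallowedException extends Exception {
    PathDisallowedException(String message) { super(message); }
}

class HTMLPageRetriever {
    // Stand-in body: the real class downloads the page over the network.
    public HTMLPage getHTMLPage(Link link) throws PathDisallowedException {
        return new HTMLPage(link);
    }
}

class RetrieverDemo {
    // Returns the page, or an empty Optional when the retriever refuses the link.
    static Optional<HTMLPage> tryFetch(HTMLPageRetriever retriever, Link link) {
        try {
            return Optional.of(retriever.getHTMLPage(link));
        } catch (PathDisallowedException e) {
            System.err.println("Skipping " + link.getUrl() + ": " + e.getMessage());
            return Optional.empty();
        }
    }
}
```

Returning an Optional lets a crawl loop skip refused links without stopping. A caller that needs to know why a link was refused could rethrow or log the exception instead.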