java.lang.Object
  |
  +--ir.webutils.Spider
        |
        +--ir.webutils.DirectorySpider
Spider that limits itself to the directory it started in.
Fields inherited from class ir.webutils.Spider
    count, linksToVisit, maxCount, saveDir, slow, visited, webpr

Constructor Summary
    DirectorySpider()

Method Summary
    java.util.List   getNewLinks(HTMLPage page)
        Gets links from the page that are in or below the starting directory.
    protected void   handleUCommandLineOption(java.lang.String value)
        Sets the initial URL from the "-u" argument, then calls the corresponding superclass method.
    static void      main(java.lang.String[] args)
        Spider the web according to the following command options, but only below the start URL directory.

Methods inherited from class ir.webutils.Spider
    doCrawl, go, handleCCommandLineOption, handleDCommandLineOption, handleSafeCommandLineOption, handleSlowCommandLineOption, linkToHTMLPage, processArgs, processPage

Methods inherited from class java.lang.Object
    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
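
The summary for getNewLinks says only that links must be "in or below the starting directory"; it does not show how that test is made. The following is a minimal sketch of one way such a test could work, assuming links are plain URL strings and the starting directory is a URL prefix ending in "/". The class DirectoryLinkFilter and the method linksInOrBelow are hypothetical names used for illustration and are not part of ir.webutils.

```java
import java.net.URI;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper for illustration only; not part of ir.webutils.
class DirectoryLinkFilter {

    /**
     * Keeps the links whose normalized form starts with startDirectory.
     * Assumes startDirectory ends with "/" (e.g. "http://example.com/course/")
     * and that every link is a syntactically valid absolute URL;
     * URI.create throws IllegalArgumentException otherwise.
     */
    static List<String> linksInOrBelow(String startDirectory, List<String> links) {
        List<String> kept = new ArrayList<>();
        for (String link : links) {
            // normalize() resolves "." and ".." segments, so a link such as
            // .../course/../private/x.html cannot pass the prefix test.
            String normalized = URI.create(link).normalize().toString();
            if (normalized.startsWith(startDirectory)) {
                kept.add(normalized);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<String> links = Arrays.asList(
                "http://example.com/course/notes/week1.html",   // below the directory
                "http://example.com/course/../private/x.html",  // escapes via ".."
                "http://other.org/course/a.html",               // different host
                "http://example.com/courses/b.html");           // sibling directory
        // Only the first link survives the filter.
        System.out.println(linksInOrBelow("http://example.com/course/", links));
    }
}
```

Requiring the trailing "/" on the prefix is what keeps a sibling directory such as /courses/ from matching /course/.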
Constructor Detail

DirectorySpider
    public DirectorySpider()

Method Detail

getNewLinks
    public java.util.List getNewLinks(HTMLPage page)
    Overrides:
        getNewLinks in class Spider
    Parameters:
        page - The page to take links from.
    Returns:
        The links from the page that are in or below the directory of the first page.

handleUCommandLineOption
    protected void handleUCommandLineOption(java.lang.String value)
    Overrides:
        handleUCommandLineOption in class Spider
    Parameters:
        value - The value of the "-u" command line argument.

main
    public static void main(java.lang.String[] args)
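
Restricting a crawl to the start URL's directory presupposes that the directory can be derived from the "-u" value. The page does not say how DirectorySpider does this, so the sketch below is only one plausible approach: strip everything after the last "/" in the URL path. The class StartDirectory and its method of are hypothetical names, not part of ir.webutils.

```java
import java.net.URI;

// Hypothetical helper for illustration only; not part of ir.webutils.
class StartDirectory {

    /**
     * Returns the directory portion of an absolute hierarchical URL,
     * always ending in "/". Assumption: a last path segment without a
     * trailing slash is treated as a file name, so ".../course" yields
     * the parent directory while ".../course/" yields the directory itself.
     * Any query string or fragment is discarded.
     */
    static String of(String startUrl) {
        URI uri = URI.create(startUrl).normalize();
        String path = uri.getPath();
        if (path == null || path.isEmpty()) {
            path = "/"; // bare host such as http://example.com
        }
        // Keep everything up to and including the last '/'.
        String dirPath = path.substring(0, path.lastIndexOf('/') + 1);
        return uri.getScheme() + "://" + uri.getAuthority() + dirPath;
    }

    public static void main(String[] args) {
        System.out.println(of("http://example.com/course/index.html")); // http://example.com/course/
        System.out.println(of("http://example.com/course/"));           // http://example.com/course/
        System.out.println(of("http://example.com"));                   // http://example.com/
    }
}
```

A prefix computed this way could then serve as the directory against which candidate links are compared.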