public class DirectorySpider extends Spider
Constructor and Description |
---|
DirectorySpider() |
Modifier and Type | Method and Description |
---|---|
java.util.List<Link> |
getNewLinks(HTMLPage page)
Gets links from the page that are in or below the starting
directory.
|
protected void |
handleUCommandLineOption(java.lang.String value)
Sets the initial URL from the "-u" argument, then calls the
corresponding superclass method.
|
static void |
main(java.lang.String[] args)
Spider the web according to the following command options,
but only below the start URL directory.
|
doCrawl, go, handleCCommandLineOption, handleDCommandLineOption, handleSafeCommandLineOption, handleSlowCommandLineOption, indexPage, linkToHTMLPage, processArgs
public java.util.List<Link> getNewLinks(HTMLPage page)
getNewLinks
in class Spider
page
- The current page.page
that are in or below the
directory of the first page.protected void handleUCommandLineOption(java.lang.String value)
handleUCommandLineOption
in class Spider
value
- The value of the "-u" command line argument.public static void main(java.lang.String[] args)