public class SiteSpider extends Spider
Constructor and Description |
---|
SiteSpider() |
Modifier and Type | Method and Description |
---|---|
java.util.List<Link> |
getNewLinks(HTMLPage page)
Gets links from the given page that are on the same host as the
page.
|
static void |
main(java.lang.String[] args)
Spider the web according to the following command options,
but stay within the given site (same URL host).
|
doCrawl, go, handleCCommandLineOption, handleDCommandLineOption, handleSafeCommandLineOption, handleSlowCommandLineOption, handleUCommandLineOption, indexPage, linkToHTMLPage, processArgs
public java.util.List<Link> getNewLinks(HTMLPage page)
getNewLinks
in class Spider
page
- The current page.page
that have the same
host as url
.public static void main(java.lang.String[] args)