java.lang.Object
  ir.webutils.HTMLPage
      ir.webutils.SafeHTMLPage
public final class SafeHTMLPage
SafeHTMLPage is an immutable representation of a web page that also
records whether or not the page can be indexed. This class is intended
to be used in conjunction with SafeHTMLPageRetriever to help clients
write spiders that obey both the Robots Exclusion Protocol and the
Robots META tags.
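As a rough illustration of how a spider might consult this flag, the sketch below filters a list of pages down to those that may be indexed. It is not the ir.webutils code: the nested `Link`, `HTMLPage`, and `SafeHTMLPage` classes are simplified stand-ins that only mirror the constructor and `indexAllowed()` signature documented on this page, and `SafeHTMLPageSketch` and `indexablePages` are hypothetical names introduced for the example.

```java
import java.util.ArrayList;
import java.util.List;

final class SafeHTMLPageSketch {

    /** Hypothetical stand-in for ir.webutils.Link; here it only wraps a URL string. */
    static final class Link {
        private final String url;
        Link(String url) { this.url = url; }
        @Override public String toString() { return url; }
    }

    /** Hypothetical stand-in for ir.webutils.HTMLPage. */
    static class HTMLPage {
        private final Link link;
        private final String text;
        HTMLPage(Link link, String text) { this.link = link; this.text = text; }
        Link getLink() { return link; }
        String getText() { return text; }
        /** Assumption: a plain page carries no META information, so indexing is allowed. */
        boolean indexAllowed() { return true; }
    }

    /** Stand-in mirroring the documented constructor SafeHTMLPage(Link, String, boolean). */
    static final class SafeHTMLPage extends HTMLPage {
        private final boolean index;
        SafeHTMLPage(Link link, String text, boolean index) {
            super(link, text);
            this.index = index;
        }
        @Override boolean indexAllowed() { return index; }
    }

    /** Returns only the pages whose indexAllowed() is true, preserving order. */
    static List<HTMLPage> indexablePages(List<? extends HTMLPage> pages) {
        List<HTMLPage> result = new ArrayList<>();
        for (HTMLPage page : pages) {
            if (page.indexAllowed()) {
                result.add(page);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<HTMLPage> crawled = new ArrayList<>();
        crawled.add(new SafeHTMLPage(new Link("http://example.com/a"), "public text", true));
        crawled.add(new SafeHTMLPage(new Link("http://example.com/b"), "private text", false));
        // Only pages that permit indexing are handed to the (hypothetical) indexer.
        for (HTMLPage page : indexablePages(crawled)) {
            System.out.println("index: " + page.getLink());
        }
    }
}
```

Because the flag is fixed at construction time, a spider can safely pass such pages between threads or cache them without re-checking the META tags.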
Field Summary |
---|
Fields inherited from class ir.webutils.HTMLPage |
---|
link, outLinks, text |
Constructor Summary | |
---|---|
SafeHTMLPage(Link link,
java.lang.String text,
boolean index)
Constructs a SafeHTMLPage with the given link,
text, and an indication of whether or not indexing is allowed. |
Method Summary | |
---|---|
boolean |
indexAllowed()
Indicates whether indexing is allowed, that is, whether it has not been disallowed by a Robots META tag. |
Methods inherited from class ir.webutils.HTMLPage |
---|
addEndSlash, empty, getLink, getOutLinks, getText, setOutLinks, write |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public SafeHTMLPage(Link link, java.lang.String text, boolean index)
Constructs a SafeHTMLPage with the given link,
text, and an indication of whether or not indexing is allowed.
Parameters:
link - A Link object representing the given page.
text - The text of the page.
index - Should be true if and only if the page can be indexed.
Method Detail |
---|
public boolean indexAllowed()
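Indicates whether indexing is allowed for this page, that is, whether it
has not been disallowed by a Robots META tag. This applies to pages
produced by a retriever that supports Robots META tags, such as
SafeHTMLPageRetriever.
Overrides:
indexAllowed in class HTMLPage
Returns:
true if and only if the page can be indexed.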
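This page does not say how the `index` flag is computed. As a sketch only, and under the assumption that a retriever derives it from the `content` attribute of a `<meta name="robots">` tag, the check could look like the following. `RobotsMetaSketch` and its `indexAllowed(String)` helper are hypothetical names, not part of ir.webutils; the directive names `noindex` and `none` are the conventional Robots META values that forbid indexing.

```java
import java.util.Locale;

final class RobotsMetaSketch {

    /**
     * Returns false when the comma-separated directive list contains
     * "noindex" or "none" (case-insensitive); returns true otherwise,
     * including when no META content was found (null).
     */
    static boolean indexAllowed(String metaContent) {
        if (metaContent == null) {
            return true; // assumption: no Robots META tag means indexing is permitted
        }
        for (String directive : metaContent.split(",")) {
            String d = directive.trim().toLowerCase(Locale.ROOT);
            if (d.equals("noindex") || d.equals("none")) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        System.out.println(indexAllowed("noindex, follow"));
        System.out.println(indexAllowed("index, nofollow"));
    }
}
```

The resulting boolean is the kind of value that would be passed as the `index` argument of the SafeHTMLPage constructor.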