Class | Description |
---|---|
Document |
Docment is an abstract class that provides for tokenization
of a document with stop-word removal and an iterator-like interface
similar to StringTokenizer.
|
DocumentIterator |
An object for iterating over a set of documents in a directory.
|
DocumentReference |
A simple data structure for storing a reference to a document file
that includes information on the length of its document vector.
|
Feedback |
Gets and stores information about relevance feedback from the user and computes
an updated query based on original query and retrieved documents that are
rated relevant and irrelevant.
|
FileDocument |
A Document stored as a file.
|
HashMapVector |
A data structure for a term vector for a document stored
as a HashMap that maps tokens to Weight's that store the
weight of that token in the document.
|
HTMLFileDocument |
An HTML file document where HTML commands are removed
from the token stream.
|
InvertedIndex |
An inverted index for vector-space information retrieval.
|
Retrieval |
A lightweight object for storing information about a retrieved Document.
|
TextFileDocument |
A normal ASCII text file Document
|
TextStringDocument |
A simple document represented by a String
|
TokenInfo |
A lightweight object for storing information about a token (a.k.a word, term)
in an inverted index.
|
TokenOccurrence |
A lightweight object for storing information about an occurrence of a token (a.k.a word, term)
in a Document.
|
For command line interfaces see the main methods of the following classes: