|
Terrier IR Platform 2.2.1 |
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object uk.ac.gla.terrier.structures.CollectionStatistics
public class CollectionStatistics
This class provides basic statistics for the indexed
collection of documents, such as the average length of documents,
or the total number of documents in the collection.
After indexing, statistics are saved in the PREFIX.log file, along
with the classes that should be used for the Lexicon, the DocumentIndex,
the DirectIndex and the InvertedIndex. This means that an index knows
how it was build and how it should be opened again.
Constructor Summary | |
---|---|
CollectionStatistics(int numDocs,
int numTerms,
long numTokens,
long numPointers)
|
Method Summary | |
---|---|
double |
getAverageDocumentLength()
Returns the documents' average length. |
int |
getNumberOfDocuments()
Returns the total number of documents in the collection. |
long |
getNumberOfPointers()
Returns the total number of pointers in the collection. |
long |
getNumberOfTokens()
Returns the total number of tokens in the collection. |
int |
getNumberOfUniqueTerms()
Returns the total number of unique terms in the lexicon. |
Methods inherited from class java.lang.Object |
---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public CollectionStatistics(int numDocs, int numTerms, long numTokens, long numPointers)
Method Detail |
---|
public double getAverageDocumentLength()
public int getNumberOfDocuments()
public long getNumberOfPointers()
public long getNumberOfTokens()
public int getNumberOfUniqueTerms()
|
Terrier IR Platform 2.2.1 |
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |