CAdditionalDocumentInformation Class Reference

#include <CAdditionalDocumentInformation.h>

List of all members.

Public Member Functions

 CAdditionalDocumentInformation (const string &inName="N.N.")
 Constructor sets filename.
void setFileNameBase (const string &inName)
 ...If neccesary set filename later
void resetDF ()
 Reset mMaximumDF and mDFSquareSum.
void adjustDF (double inDF)
 Add the right things to mMaximumDF and mDFSquareSum.
void resetSquareDFLogICF ()
 reset mSquareDFLogICFSum
void adjustSquareDFLogICF (double)
 add the right things to mSquareDFLogICFSum
Accessors
double getMaximumDF () const
double getDFSquareSum () const
double getSquareDFLogICFSum () const
bool output (ostream &outStream) const
bool output () const
bool input (istream &inStream)
bool input ()

Protected Attributes

string mFileNameBase
 Filename of the document (from which the ADI file will be built).
double mMaximumDF
 Maximum Document frequency of a feature for the whole document.
double mDFSquareSum
 Sum of squared document frequencies off all features of the document.
double mSquareDFLogICFSum
 Sum of (DF*DF*log(ICF)) for all features of the document.


Detailed Description

Additional document information (ADI) is information is needed for weighting schemes which take into account properties of whole documents. Things like e.g. the euclidean length of a vector have to be calculated beforehand.

Definition at line 56 of file CAdditionalDocumentInformation.h.


Constructor & Destructor Documentation

CAdditionalDocumentInformation::CAdditionalDocumentInformation ( const string &  inName = "N.N."  ) 

Constructor sets filename.

Definition at line 61 of file CAdditionalDocumentInformation.cc.

References resetDF(), resetSquareDFLogICF(), and setFileNameBase().

00061                                                                                   {
00062   setFileNameBase(inName);
00063   resetDF();
00064   resetSquareDFLogICF();
00065 };


Member Function Documentation

void CAdditionalDocumentInformation::setFileNameBase ( const string &  inName  ) 

...If neccesary set filename later

Definition at line 81 of file CAdditionalDocumentInformation.cc.

References mFileNameBase, and resetDF().

Referenced by CAdditionalDocumentInformation().

00081                                                                         {
00082   mFileNameBase=inName;
00083   resetDF();
00084 };

void CAdditionalDocumentInformation::resetDF (  ) 

Reset mMaximumDF and mDFSquareSum.

Definition at line 100 of file CAdditionalDocumentInformation.cc.

References mDFSquareSum, and mMaximumDF.

Referenced by CAdditionalDocumentInformation(), and setFileNameBase().

00100                                             {
00101   mMaximumDF=0;
00102   mDFSquareSum=0;
00103 };

void CAdditionalDocumentInformation::adjustDF ( double  inDF  ) 

Add the right things to mMaximumDF and mDFSquareSum.

Definition at line 119 of file CAdditionalDocumentInformation.cc.

References mDFSquareSum, and mMaximumDF.

00119                                                         {
00120   mDFSquareSum+=inDF*inDF;
00121   mMaximumDF=max(inDF,mMaximumDF);
00122 };

void CAdditionalDocumentInformation::resetSquareDFLogICF (  ) 

reset mSquareDFLogICFSum

Definition at line 176 of file CAdditionalDocumentInformation.cc.

References mSquareDFLogICFSum.

Referenced by CAdditionalDocumentInformation().

00176                                                         {
00177   mSquareDFLogICFSum=0;
00178 };

void CAdditionalDocumentInformation::adjustSquareDFLogICF ( double  inValue  ) 

add the right things to mSquareDFLogICFSum

Definition at line 194 of file CAdditionalDocumentInformation.cc.

References mSquareDFLogICFSum.

00194                                                                        {
00195   mSquareDFLogICFSum+=inValue;
00196 };

double CAdditionalDocumentInformation::getMaximumDF (  )  const

Definition at line 138 of file CAdditionalDocumentInformation.cc.

References mMaximumDF.

00138                                                         {
00139   assert(mMaximumDF);
00140   return mMaximumDF;
00141 };

double CAdditionalDocumentInformation::getDFSquareSum (  )  const

Definition at line 157 of file CAdditionalDocumentInformation.cc.

References mDFSquareSum.

00157                                                           {
00158   assert(mDFSquareSum);
00159   return mDFSquareSum;
00160 };

double CAdditionalDocumentInformation::getSquareDFLogICFSum (  )  const

Definition at line 212 of file CAdditionalDocumentInformation.cc.

References mSquareDFLogICFSum.

00212                                                                 {
00213   return mSquareDFLogICFSum;
00214 };

bool CAdditionalDocumentInformation::output ( ostream &  outStream  )  const

Definition at line 253 of file CAdditionalDocumentInformation.cc.

References mDFSquareSum, mMaximumDF, and mSquareDFLogICFSum.

00253                                                                   {
00254 #ifdef PRINT
00255   cout << "[>mSquareDFLogICFSum" 
00256        << mMaximumDF
00257        << ",square"
00258        << mDFSquareSum
00259        << ","
00260        << mSquareDFLogICFSum
00261        << "]" 
00262        << flush 
00263        << endl;
00264 #endif
00265   outStream << mMaximumDF
00266       << " " 
00267       << mDFSquareSum
00268       << " " 
00269       << mSquareDFLogICFSum;
00270  return outStream;
00271 };

bool CAdditionalDocumentInformation::output (  )  const

Definition at line 230 of file CAdditionalDocumentInformation.cc.

References mFileNameBase.

00230                                                 {
00231   ofstream 
00232     lStream(string(mFileNameBase+".ADI").c_str());
00233 
00234   
00235 
00236   return output(lStream);
00237 };

bool CAdditionalDocumentInformation::input ( istream &  inStream  ) 

Definition at line 287 of file CAdditionalDocumentInformation.cc.

References mDFSquareSum, mMaximumDF, and mSquareDFLogICFSum.

00287                                                            {
00288   inStream >> mMaximumDF
00289      >> mDFSquareSum
00290      >> mSquareDFLogICFSum;
00291 #ifdef PRINT
00292   cout << "[Max" 
00293        << mMaximumDF
00294        << ",square"
00295        << mDFSquareSum
00296        << ",IDFlog.."
00297        << mSquareDFLogICFSum
00298        << "]" 
00299        << flush 
00300        << endl;
00301 #endif
00302   return inStream;
00303 };

bool CAdditionalDocumentInformation::input (  ) 

Definition at line 319 of file CAdditionalDocumentInformation.cc.

References mFileNameBase.

00319                                           {
00320   ifstream 
00321     lStream(string(mFileNameBase+".ADI").c_str());
00322   return input(lStream);
00323 };


Member Data Documentation

Filename of the document (from which the ADI file will be built).

Definition at line 59 of file CAdditionalDocumentInformation.h.

Referenced by input(), output(), and setFileNameBase().

Maximum Document frequency of a feature for the whole document.

Definition at line 61 of file CAdditionalDocumentInformation.h.

Referenced by adjustDF(), getMaximumDF(), input(), output(), and resetDF().

Sum of squared document frequencies off all features of the document.

Definition at line 63 of file CAdditionalDocumentInformation.h.

Referenced by adjustDF(), getDFSquareSum(), input(), output(), and resetDF().

Sum of (DF*DF*log(ICF)) for all features of the document.

Definition at line 65 of file CAdditionalDocumentInformation.h.

Referenced by adjustSquareDFLogICF(), getSquareDFLogICFSum(), input(), output(), and resetSquareDFLogICF().


The documentation for this class was generated from the following files:

Generated on Wed Jan 7 00:31:04 2009 for Gift by  doxygen 1.5.6