Re: [help-GIFT] Re: Clarification on inverted file

From: David Squire
Subject: Re: [help-GIFT] Re: Clarification on inverted file
Date: Mon, 20 Aug 2001 20:10:58 +1000

Wolfgang Mueller wrote:

> MARS is strongly inspired by text retrieval,
> but modifies the retrieval scheme, basing the weighting not on the document
> frequency but on the standard deviation of the term frequency.

I haven't got the article in front of me, but if I recall correctly they didn't
use standard deviations of term frequencies, but rather std. devs. of
continuous-valued features. This would mean that features that took on a wide
range of values in the query (and hence had a large std. dev.) would get a low
weight.
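
As a rough illustration of that weighting idea, here is a minimal sketch that
weights each continuous-valued feature by the inverse of its standard deviation
over the feature vectors making up the query. It is not taken from the MARS
article: the function name, the eps guard and the normalisation to sum 1 are
all assumptions made for the example.

```python
import statistics


def inverse_std_weights(query_features, eps=1e-9):
    """Weight each feature dimension by 1 / std. dev. over the query vectors.

    query_features: list of equal-length feature vectors, one per query item.
    eps: hypothetical guard against division by zero for constant features.
    """
    n_dims = len(query_features[0])
    weights = []
    for d in range(n_dims):
        values = [vec[d] for vec in query_features]
        sigma = statistics.pstdev(values)  # population standard deviation
        weights.append(1.0 / (sigma + eps))
    total = sum(weights)
    # Normalise so the weights sum to 1 (an assumption, for readability only).
    return [w / total for w in weights]


# Feature 0 barely varies across the query; feature 1 varies widely.
query = [[0.50, 0.1], [0.52, 0.9], [0.51, 0.5]]
weights = inverse_std_weights(query)
print(weights)  # feature 0 gets the larger weight
```

The feature that is consistent across the query dominates the weighting, while
the widely varying one contributes little.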

This is clearly related to the term frequency idea, since if the features were
quantized à la Viper, then features with a low std. dev. would tend to get high
term frequencies for the quantiles around the mean.
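
The connection can be sketched as follows: quantize a continuous feature into
bins, treat each bin as a "term", and count how often each term occurs. The
uniform binning over [0, 1], the bin count and the sample values are
assumptions made for the example; Viper's actual quantization is not
reproduced here.

```python
from collections import Counter


def quantize(value, n_bins=10, lo=0.0, hi=1.0):
    """Map a continuous value to a bin index (uniform bins, an assumption)."""
    idx = int((value - lo) / (hi - lo) * n_bins)
    return min(max(idx, 0), n_bins - 1)  # clamp to the valid bin range


def max_term_frequency(values, n_bins=10):
    """Highest count of any single bin ("term") among the quantized values."""
    counts = Counter(quantize(v, n_bins) for v in values)
    return max(counts.values())


# Low std. dev.: values cluster around the mean and share one bin.
tight = [0.52, 0.54, 0.55, 0.56, 0.53, 0.57]
# High std. dev.: values are scattered over many bins.
spread = [0.05, 0.25, 0.45, 0.65, 0.85, 0.95]

print(max_term_frequency(tight), max_term_frequency(spread))
```

The tightly clustered feature piles its occurrences into the bin around its
mean, giving that term a high frequency, whereas the widely spread feature
yields many terms that each occur rarely.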



--  Dr. David McG. Squire, Postgraduate Research Coordinator (Caulfield)
Computer Science and Software Engineering, Monash University, Australia
Do/Don't want HTML mail? Let me know.
