help-gift
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [help-GIFT] Unpacking features in more user friendly format


From: Epa Uwimana
Subject: Re: [help-GIFT] Unpacking features in more user friendly format
Date: Mon, 12 Mar 2007 19:54:58 -0400

Dear Dr. Muller,

I looked at the generate inverted file but it is still not clear. 
Whenever you have a chance, please help or anybody else who can help. 
I read in gift_features.h about  FREQ_MAX 0xff for the maximum 
frequency and I am wondering if there is some kind of logic I might 
need to apply to the big numbers that I am getting to make them look 
normal. 

Again I need to read the .fts and print the output with FeatureID and 
it's frequency.

Thanks again Dr. Muller

Epa Uwimana


---- Message from Wolfgang Mueller <address@hidden
bamberg.de> at 2007-03-09 11:34:08 ------
Dear Epa,

It depends a lot on what sizes you have. This perl is 32-bit centric.

If you want to be sure to read the same features as GIFT does, look at
gift-generate-inverted-files and see what it uses to read the .fts
files. I am currently not in a position to do this by myself. If
problems persist, please kick me again end of next week or beginning of
the week after.

Best,
Wolfgang

On Fri, 09 Mar 2007, Epa wrote:

> Hey,
> 
> I read GIFT literature and I have an idea of what GIFT does and what 
I can 
> expect from GIFT. Using Perl script, I am trying to unpack the .fts 
files so I 
> can see featureID and their frequencies. I used the script from Dr. 
Muller 
> with small modification so I can sort features in ascending order 
but I am 
> wary of accepting the result.
> 
> Here is my my script:
> 
>        open (FILE,shift)|| die "can't open the file $file";
>        %features= ();
>        read (FILE,$lString,4) || die ("Cannot read the file $file");
>          while (read FILE,$lString,8|| die ("Cannot read the file 
$file")){
>          my ($FeatureID,$Frequency)=unpack("If",$lString);
>                if ($Frequency>0){
>                $features{"$FeatureID"}=$Frequency;
>                                 }
>          }
>   
>         foreach $key (sort{$a<=>$b} keys %features){
>               print "$key:$features{$key}";}
> 
> My output seems to have some big numbers both in FeatureID and 
Frequencies and 
> I am wondering if this is accurate or something went wrong during 
features 
> extraction.
> Here is an example of some of the features form one .fts file:
> 
> 0:3.48554469996741e+28
> 1006665857:1.22526735123633e-40
> 4181688320:2.15799963506022e-43
> 4215242752:1.82168800362226e-43
> 
> I would appreciate some help or advice on how to convert the .fts 
into a more 
> user friendly because I am questioning these bif numbers in my 
output.
> 
> Thank you very much
> 
> Epa Uwimana
>  
> 
> 
>  
> 
> 
> 
> 
> 
> _______________________________________________
> help-GIFT mailing list
> address@hidden
> http://lists.gnu.org/mailman/listinfo/help-gift
--
Dr. Wolfgang Mueller
LS Medieninformatik
Universitaet Bamberg


_______________________________________________
help-GIFT mailing list
address@hidden
http://lists.gnu.org/mailman/listinfo/help-gift






reply via email to

[Prev in Thread] Current Thread [Next in Thread]