bug-gnubg
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-gnubg] Absolute FIBS rating formula


From: Michael Petch
Subject: Re: [Bug-gnubg] Absolute FIBS rating formula
Date: Sun, 27 Jul 2014 14:56:33 -0600

Somewhere that may be help you is posting this to fibsboard.com forums.


Michael Petch
CApp::Sysware Consulting Ltd.
OpenPGP FingerPrint=D81C 6A0D 987E 7DA5 3219 6715 466A 2ACE 5CAE 3304
Sent from my mobile


-------- Original message --------
From: Philippe Michel
Date:2014-07-27 14:26 (GMT-07:00)
To: Michael Petch
Cc: address@hidden
Subject: Absolute FIBS rating formula

On Sat, 26 Jul 2014, Michael Petch wrote:

> I will be building the next release of GNUbg (1.03.000) next Sunday
> evening. So any changes that one may have should be put in as soon as
> possible so testing can occur.

I have redone the simulations from Kees van den Doel that led to the
"Error based abs. FIBS rating" in match statistics :
http://people.cs.ubc.ca/~kvdoel/tmp/ratings/

This number became quite inaccurate years ago when the definition of a
"close cube decision" changed. The cube errors rating loss is
underestimated by a lot. On the other hand the offset used to get the
absolute rating is 2050 when the rating of the bots based on gnubg 2ply
is more like 2150.

The old formula is :

R = a2(N)*EPM+b(N)*EPC,
where
a2(N) = 8798 + 25526/N,
and
b(N) = 863 - 519/N.

and what I got is :

a2(N) = 11877 + 32191/N,
and
b(N) = 4766 - 5285/N.

As you can see, the grading is tougher and the higher base rating of gnubg
may or may not compensate for it depending on the level of play.


I'd be interested if someone who has a fair database of FIBS matches (I
don't) could check if these new values are realistic. I'm not familiar
with the Python API, but I suppose it should be possible to write a short
script that loads an already analyzed match and shows the FIBS rating from
both formulas.

I'd be interested by more accurate data from people running bots on FIBS,
preferably if they use the 1.0 weights file : what is their average rating
for various levels of play ? And are they cheated by droppers in a
significant way ?


reply via email to

[Prev in Thread] Current Thread [Next in Thread]