Re: [Bug-gnubg] GNU Backgammon overview/background

bug-gnubg

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-gnubg] GNU Backgammon overview/background

From:	Joseph Heled
Subject:	Re: [Bug-gnubg] GNU Backgammon overview/background
Date:	Tue, 11 Nov 2003 15:21:53 +1300
User-agent:	Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.5) Gecko/20031007



Øystein Johansen wrote:

The race net is not trained with TD at all! (or maybe it was back inlast century, but it played horrible). Just think about it. When there'sno contact, taking the opponents next roll move and is quite independentof the players move. The weights will therefore not converge to anyvalues at all. Same thing when it comes to evaluation of race position.1-ply evaluation of a race is just a waste of time, since the opponentsroll and move won't affect your best move. (..well, maybe some smallpoint...)
The race net is therefore trained against the OSR evaluator. OSR is theOne Side Rollout algorithm. It simply rolls out the position at one sideusing heuristic move rules. For each game it rolls out it keeps track ofhow many rolls it used to get off. Then it gets a roll distribution forthe position. The same is done for the other side, and the winningprobability is calculated is the same way as the 1 sided bearoffdatabase. This OSR algorithm is used to train the race network.

Almost all of the above is not true. The race net is trained exactly inthe same way as the other nets. The OSR method can serve as a steppingstone step only. I was surprised (and slightly dismayed) when it turnedout OSR does a not-so-good job at checkers play (based on my rolloutbenchmark). The reason seems to be that the OSR heuristic misses manycorrect plays. While each error is small, the accumulative effect can bebig enough to choose the weaker of two close plays, which in racesituations is most of the time.


-Joseph

[Prev in Thread]

Current Thread

[Next in Thread]

[Bug-gnubg] GNU Backgammon overview/background, Thomas Hauk, 2003/11/10
- Re: [Bug-gnubg] GNU Backgammon overview/background, Øystein Johansen, 2003/11/10
  - Re: [Bug-gnubg] GNU Backgammon overview/background, Joseph Heled <=
  - Re: [Bug-gnubg] GNU Backgammon overview/background, Thomas Hauk, 2003/11/12
    - Re: [Bug-gnubg] GNU Backgammon overview/background, Øystein Johansen, 2003/11/12
    - Re: [Bug-gnubg] GNU Backgammon overview/background, Joseph Heled, 2003/11/16
- Re: [Bug-gnubg] GNU Backgammon overview/background, Jim Segrave, 2003/11/11
  - Re: [Bug-gnubg] GNU Backgammon overview/background, Thomas Hauk, 2003/11/12
    - Re: [Bug-gnubg] GNU Backgammon overview/background, Øystein Johansen, 2003/11/12
    - Re: [Bug-gnubg] GNU Backgammon overview/background, Jim Segrave, 2003/11/12
    - Re: [Bug-gnubg] GNU Backgammon overview/background, Thomas Hauk, 2003/11/12
    - RE: [Bug-gnubg] GNU Backgammon overview/background, David Montgomery, 2003/11/12
    - Re: [Bug-gnubg] GNU Backgammon overview/background, Jim Segrave, 2003/11/13

Prev by Date: RE: [Bug-gnubg] Rating and grading
Next by Date: Re: [Bug-gnubg] Rating and grading
Previous by thread: Re: [Bug-gnubg] GNU Backgammon overview/background
Next by thread: Re: [Bug-gnubg] GNU Backgammon overview/background
Index(es):
- Date
- Thread