gnugo-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[gnugo-devel] Fw: [computer-go] Is Rémi c orrect?


From: terry mcintyre
Subject: [gnugo-devel] Fw: [computer-go] Is Rémi c orrect?
Date: Wed, 6 Feb 2008 13:53:47 -0800 (PST)

This is interesting. Any comments as to why level 8 should be better than level 
10?

Thanks for your attention.
 
Terry McIntyre <address@hidden> 
----- Forwarded Message ----
From: Weston Markham <address@hidden>
To: computer-go <address@hidden>
Sent: Wednesday, February 6, 2008 1:49:44 PM
Subject: Re: [computer-go] Is Rémi correct?


I 
know 
that 
other 
people 
have 
mentioned 
this 
sort 
of 
thing 
already,
but 
the 
result 
of 
level 
8 
being 
better 
than 
level 
10 
matches 
my 
own
experience 
with 
slightly 
older 
versions 
of 
gnugo.  
As 
I 
recall, 
8 
was
the 
best, 
9 
a 
little 
worse, 
and 
10 
worse 
again.  
Increasing 
the 
level
seems 
to 
improve 
play 
after 
that, 
but 
it 
dramatically 
increases 
the
time.

Weston

On 
Feb 
6, 
2008 
12:48 
PM, 
Don 
Dailey 
<address@hidden> 
wrote:
> 
Here 
is 
an 
update 
from 
the 
new 
1000 
game 
test 
using 
gungo 
at 
level 
8
> 
instead 
of 
10.
>
> 
Rank 
Name  
  
  
  
  
 
Elo  
  
+  
  
- 
games 
score 
oppo. 
draws
>  
  
1 
Gnugo-3.7.11  
1800  
 
34  
 
30  
2186  
 
97%  
1137  
  
0%
>  
  
2 
Mogo_03  
  
  
 
1507  
 
48  
 
56  
 
186  
 
16%  
1800  
  
0%
>  
  
3 
Mogo_02  
  
  
 
1202  
 
43  
 
51  
1000  
  
3%  
1800  
  
0%
>  
  
4 
Mogo_01  
  
  
 
1003  
 
70  
 
96  
1000  
  
1%  
1800  
  
0%
>
> 
The 
test, 
at 
this 
point, 
seems 
to 
indicate 
that 
gnugo 
at 
level 
8 
is
> 
stronger 
than 
at 
level 
10 
because 
mogo 
is 
not 
doing 
as 
well 
as 
in 
the
> 
previous 
test.  
  
It 
will 
be 
more 
meaningful 
when 
we 
get 
to 
levels 
close
> 
to 
gnugo's 
strength.
>
> 
- 
Don
>
>
> 
> 
As 
promised,  
to 
answer 
Rémi, 
I 
did 
a 
study 
with 
mogo 
vs 
Gnu 
at 
various
> 
> 
levels.  
 
There 
is 
NO 
self 
play 
involved, 
Gnugo-3.7.11 
is 
the 
only
> 
> 
opponent 
for 
progressively 
higher 
rated 
version 
of 
Mogo.
> 
>
> 
> 
Here 
are 
the 
raw 
results 
so 
far:
> 
>
> 
> 
Rank 
Name  
  
  
  
  
 
Elo  
  
+  
  
- 
games 
score 
oppo. 
draws
> 
>  
  
1 
Mogo_10  
  
  
 
2319  
 
72  
 
60  
 
500  
 
95%  
1800  
  
0%
> 
>  
  
2 
Mogo_11  
  
  
 
2284  
 
94  
 
74  
 
259  
 
94%  
1800  
  
0%
> 
>  
  
3 
Mogo_09  
  
  
 
2234  
 
57  
 
49  
 
500  
 
92%  
1800  
  
0%
> 
>  
  
4 
Mogo_08  
  
  
 
2124  
 
43  
 
39  
 
500  
 
87%  
1800  
  
0%
> 
>  
  
5 
Mogo_07  
  
  
 
2016  
 
35  
 
33  
 
500  
 
78%  
1800  
  
0%
> 
>  
  
6 
Mogo_06  
  
  
 
1961  
 
32  
 
30  
 
500  
 
72%  
1800  
  
0%
> 
>  
  
7 
Mogo_05  
  
  
 
1814  
 
28  
 
28  
 
500  
 
52%  
1800  
  
0%
> 
>  
  
8 
Gnugo-3.7.11  
1800  
 
13  
 
13  
5259  
 
44%  
1823  
  
0%
> 
>  
  
9 
Mogo_04  
  
  
 
1711  
 
29  
 
29  
 
500  
 
37%  
1800  
  
0%
> 
>  
 
10 
Mogo_03  
  
  
 
1534  
 
35  
 
38  
 
500  
 
18%  
1800  
  
0%
> 
>  
 
11 
Mogo_02  
  
  
 
1281  
 
60  
 
72  
 
500  
  
5%  
1800  
  
0%
> 
>  
 
12 
Mogo_01  
  
  
 
1004  
115  
178  
 
500  
  
1%  
1800  
  
0%
> 
>
> 
>
> 
> 
The 
issue 
is 
whether 
self-play 
results 
distort 
the 
rating 
of 
programs.
> 
> 
In 
this 
case, 
we 
are 
only 
testing 
whether 
it 
distorts 
the 
ratings 
of
> 
> 
Mogo 
since 
no 
other 
programs 
were 
tested.
> 
>
> 
> 
In 
the 
following 
table,  
I 
played 
up 
to 
500 
games 
between 
Gnugo 
and 
Mogo
> 
> 
at 
various 
levels.  
 
The 
levels 
are 
the 
exact 
levels 
that 
correspond 
to
> 
> 
the 
big 
scalability 
study.  
  
  
In 
the 
middle 
column 
I 
listed 
the
> 
> 
ratings 
as 
computed 
by 
bayeselo 
in 
games 
against  
ONLY 
Gnugo 
and 
set 
the
> 
> 
default 
rating 
of 
Gnugo 
to 
1800, 
just 
as 
in 
the 
study.
> 
>
> 
> 
Unfortunately,  
I 
used 
level 
10 
in 
the 
gnugo 
only 
games 
but 
in 
the 
big
> 
> 
study 
we 
use 
level 
8.  
 
It's 
my 
understanding 
there 
is 
little 
difference
> 
> 
between 
these 
2 
but 
we 
can 
probably 
assume 
Mogo 
might 
be 
a 
little 
better
> 
> 
than 
indicated 
relative 
to 
the 
big 
scalability 
study.
> 
>
> 
> 
It 
looks 
like 
there 
indeed 
is 
a 
lot 
of 
distortion 
at 
the 
low 
end 
of 
the
> 
> 
scale.  
Mogo 
seems 
much 
stronger 
at 
low 
levels 
than 
the 
larger
> 
> 
scalability 
study 
indicated.
> 
>
> 
> 
At 
the 
higher 
levels,  
we 
also 
get 
a 
mismatch,  
where 
Mogo's 
rating
> 
> 
doesn't 
seem 
as 
high 
when 
playing 
only 
Gnugo.  
 
This 
is 
as 
Rémi
> 
> 
claims.
> 
>
> 
> 
One 
thing 
to 
note 
is 
that 
at 
higher 
levels 
it's 
more 
difficult 
to 
get 
an
> 
> 
accurate 
rating.  
Mogo_10 
is 
winning 
95% 
of 
it's 
games 
against 
Gnugo,
> 
> 
and 
an 
extra 
win 
or 
loss 
every 
few 
games 
can 
make 
a 
lot 
of 
difference.
> 
> 
However 
I 
am 
inclined 
to 
believe 
this 
is 
real 
since 
it 
seems 
to 
hold 
for
> 
> 
several 
upper 
levels.  
 
At 
level 
7 
it's 
only 
42 
ELO, 
but 
at 
levels
> 
> 
beyond 
this 
it's 
over 
100 
ELO.
> 
>
> 
> 
I've 
never 
doubted 
that 
there 
is 
some 
intransivity 
between 
programs, 
but
> 
> 
I 
am 
a 
little 
surprised 
that 
it 
is 
this 
much.  
Even 
if 
the 
comparison 
is
> 
> 
slightly 
unfair 
due 
to 
Mogo 
playing 
a 
stronger 
version 
of 
Gnugo 
in 
this
> 
> 
study,  
it's 
still 
seems 
like 
it 
must 
be 
at 
least 
100 
ELO.
> 
>
> 
>
> 
> 
vers  
vs 
Gnu  
Study
> 
> 
----  
------  
-----
> 
>  
 
01  
  
1004  
  
688
> 
>  
 
02  
  
1281  
 
1093
> 
>  
 
03  
  
1534  
 
1331
> 
>  
 
04  
  
1711  
 
1554
> 
>  
 
05  
  
1814  
 
1751
> 
>  
 
06  
  
1961  
 
1971
> 
>  
 
07  
  
2016  
 
2058
> 
>  
 
08  
  
2124  
 
2270
> 
>  
 
09  
  
2234  
 
2347
> 
>  
 
10  
  
2319  
 
2470
> 
>
> 
>
> 
> 
My 
suggestion 
to 
improve 
this 
situation 
is 
to 
play 
a 
few 
thousands 
games
> 
> 
against 
a 
well 
rated 
Gnugo 
and 
set 
up 
mogo 
as 
a 
second 
anchor.
> 
>
> 
> 
- 
Don
> 
>
> 
>
> 
>
> 
> 
_______________________________________________
> 
> 
computer-go 
mailing 
list
> 
> 
address@hidden
> 
> 
http://www.computer-go.org/mailman/listinfo/computer-go/
> 
>
> 
>
> 
_______________________________________________
> 
computer-go 
mailing 
list
> 
address@hidden
> 
http://www.computer-go.org/mailman/listinfo/computer-go/
>
_______________________________________________
computer-go 
mailing 
list
address@hidden
http://www.computer-go.org/mailman/listinfo/computer-go/






      
____________________________________________________________________________________
Looking for last minute shopping deals?  
Find them fast with Yahoo! Search.  
http://tools.search.yahoo.com/newsearch/category.php?category=shopping




reply via email to

[Prev in Thread] Current Thread [Next in Thread]