BBO Discussion Forums: One for the math nerds - BBO Discussion Forums

Jump to content

Page 1 of 1
  • You cannot start a new topic
  • You cannot reply to this topic

One for the math nerds DD, SD and variances

#1 User is offline   1eyedjack 

  • PipPipPipPipPipPipPipPip
  • Group: Advanced Members
  • Posts: 6,575
  • Joined: 2004-March-12
  • Gender:Male
  • Location:UK

Posted 2011-July-02, 06:34

We know that on a given hand the double-dummy result is not necessarily the best single-dummy result.
Consequently, it is dangerous (on a given hand) to rely on a comparison of the actual result against the DD result as being a guide to how well the hand was managed.
It is also fairly obvious (to me) that over the long term, both opposing sides will benefit or lose out to the same extent from such discrepancies.
I should expect, therefore that if you calculate your average IMP per board against the double dummy result it will give a fair approximation to how well you have played, and, furthermore, the larger the hand population over which the average is calculated, so the closer will be the approximation to the true measure.

What interests me (and is beyond my capability) is to arrive at an approximate function that shows how that discrepancy varies with the population size.

Any takers?
Psych (pron. saik): A gross and deliberate misstatement of honour strength and/or suit length. Expressly permitted under Law 73E but forbidden contrary to that law by Acol club tourneys.

Psyche (pron. sahy-kee): The human soul, spirit or mind (derived, personification thereof, beloved of Eros, Greek myth).
Masterminding (pron. mPosted ImagesPosted ImagetPosted Imager-mPosted ImagendPosted Imageing) tr. v. - Any bid made by bridge player with which partner disagrees.

"Gentlemen, when the barrage lifts." 9th battalion, King's own Yorkshire light infantry,
2000 years earlier: "morituri te salutant"

"I will be with you, whatever". Blair to Bush, precursor to invasion of Iraq
0

#2 User is offline   hotShot 

  • Axxx Axx Axx Axx
  • PipPipPipPipPipPipPip
  • Group: Advanced Members
  • Posts: 2,976
  • Joined: 2003-August-31
  • Gender:Male

Posted 2011-July-02, 07:27

First of all you should be aware (and I'm sure you are) that the deviation, gets generally smaller, the higher the contract is.
This is caused by the fact that the opening lead will be better, the more information is available at the time of the lead.
Since the DD-Solver has perfect information and the human player does not know much (e.g. 1NT p p p).


The function you are looking for does not only depend on the population size. A key factor is the question you want to answer.
If you want do generate informations about something equally likely, than a small sample will be enough.
If you want to analyze 2 possible lines of play and one has 68% and the other has 70%, you will need big sample size to be sure that there is a significant difference between the 2.
0

#3 User is online   helene_t 

  • The Abbess
  • PipPipPipPipPipPipPipPipPipPipPip
  • Group: Advanced Members
  • Posts: 17,198
  • Joined: 2004-April-22
  • Gender:Female
  • Location:Copenhagen, Denmark
  • Interests:History, languages

Posted 2011-July-02, 09:07

If the SD for TOTPOINTS_TO_IMPS(DDresult-actualresult) for a single board is s then for the average IMPs across n boards it is s/sqrt(n).

As hotshot explains, s is not the same for each board, but the formula still holds. However, the s you should use is sqrt(E(s^2)), i.e. the squareroot of the average variance (variance = squared standard deviation)*.

*Assuming the mean difference of the two is independent of the variance. If you play against a weaker team then the mean of TOTPOINTS_TO_IMPS(DDresult-actualresult) will be positive but it could be that you are more likely to have a flat board against them when the actual result at your table is likely to be the same as the DD.
The world would be such a happy place, if only everyone played Acol :) --- TramTicket
0

#4 User is offline   FrancesHinden 

  • Limit bidder
  • PipPipPipPipPipPipPipPipPip
  • Group: Advanced Members
  • Posts: 8,482
  • Joined: 2004-November-02
  • Gender:Female
  • Location:England
  • Interests:Bridge, classical music, skiing... but I spend more time earning a living than doing any of those

Posted 2011-July-02, 11:41

 1eyedjack, on 2011-July-02, 06:34, said:

I should expect, therefore that if you calculate your average IMP per board against the double dummy result it will give a fair approximation to how well you have played..


All other debates apart, I would correct that to "a fair approximation of how much better you have played than your opponents"
0

#5 User is offline   nigel_k 

  • PipPipPipPipPipPip
  • Group: Advanced Members
  • Posts: 2,207
  • Joined: 2009-April-26
  • Gender:Male
  • Location:Wellington, NZ

Posted 2011-July-03, 15:08

I don't necessarily think this is a terrible idea but you need to be aware that for high level contracts, you will need to overbid to maximize your score in comparison to the double dummy result.

For example, suppose there is a slam that needs a two way guess for a queen and another suit not to break 4-0 (90%). Single dummy this is a 45% slam and double dummy it is a 90% slam. Let's say you are non-vulnerable. If you stay out of the slam, your expectation against the double dummy result is -8.8 IMPS. If you bid the slam your expectation is -7.4 IMPS.

Or you could just not use the double dummy comparison for slam hands. Instead work out the chance of making and award yourself zero if the slam is 50% or better and otherwise your expected IMP loss assuming they stay out at the other table.
2

#6 User is offline   inquiry 

  • PipPipPipPipPipPipPipPipPipPip
  • Group: Admin
  • Posts: 14,566
  • Joined: 2003-February-13
  • Gender:Male
  • Location:Amelia Island, FL
  • Interests:Bridge, what else?

Posted 2011-July-03, 17:22

You may want to consider the study by Peter Cheung's 2002 study using bridge hands from OKbridge (using bridgebrowser) and actual results at the table and compared to double dummy analysis.


This was his summary

"The most important general finding is that double dummy analysis is very accurate as compared to actual play from OKBridge. The overall total number of tricks taken by the declarer is 9.21 (9.22 for imp and 9.20 for mp). The double dummy analysis of the same deal produce 9.11 (9.12 for imp and 9.11 for mp). So actual play by OKBridge player takes 0.1 tricks more then the double dummy analysis result. This is from 383,000 deals and over 25 million plays."

See Peter Cheung's website for more delatils (click here)


--Ben--

#7 User is offline   jogs 

  • PipPipPipPipPipPip
  • Group: Advanced Members
  • Posts: 1,316
  • Joined: 2011-March-01
  • Gender:Male
  • Interests:student of the game

Posted 2011-July-03, 21:40

 inquiry, on 2011-July-03, 17:22, said:

You may want to consider the study by Peter Cheung's 2002 study using bridge hands from OKbridge (using bridgebrowser) and actual results at the table and compared to double dummy analysis.


This was his summary

"The most important general finding is that double dummy analysis is very accurate as compared to actual play from OKBridge. The overall total number of tricks taken by the declarer is 9.21 (9.22 for imp and 9.20 for mp). The double dummy analysis of the same deal produce 9.11 (9.12 for imp and 9.11 for mp). So actual play by OKBridge player takes 0.1 tricks more then the double dummy analysis result. This is from 383,000 deals and over 25 million plays."

See Peter Cheung's website for more delatils (click here)



http://crystalwebsit...ouble_dummy.htm

Cheung didn't show a valid proof of that conclusion. Those means only show that the error biases balance out. He needs to show that the errors of the estimates are small and the variances or the errors are small.
2

#8 User is offline   1eyedjack 

  • PipPipPipPipPipPipPipPip
  • Group: Advanced Members
  • Posts: 6,575
  • Joined: 2004-March-12
  • Gender:Male
  • Location:UK

Posted 2011-July-03, 22:10

 nigel_k, on 2011-July-03, 15:08, said:

I don't necessarily think this is a terrible idea but you need to be aware that for high level contracts, you will need to overbid to maximize your score in comparison to the double dummy result.

For example, suppose there is a slam that needs a two way guess for a queen and another suit not to break 4-0 (90%). Single dummy this is a 45% slam and double dummy it is a 90% slam. Let's say you are non-vulnerable. If you stay out of the slam, your expectation against the double dummy result is -8.8 IMPS. If you bid the slam your expectation is -7.4 IMPS.

Or you could just not use the double dummy comparison for slam hands. Instead work out the chance of making and award yourself zero if the slam is 50% or better and otherwise your expected IMP loss assuming they stay out at the other table.


Instinctively, on balance I think you are right. But it is not all one-way traffic. You don't always have a 2-way finesse in a slam, and DD they will always find the killing lead if there is one to be found.
Psych (pron. saik): A gross and deliberate misstatement of honour strength and/or suit length. Expressly permitted under Law 73E but forbidden contrary to that law by Acol club tourneys.

Psyche (pron. sahy-kee): The human soul, spirit or mind (derived, personification thereof, beloved of Eros, Greek myth).
Masterminding (pron. mPosted ImagesPosted ImagetPosted Imager-mPosted ImagendPosted Imageing) tr. v. - Any bid made by bridge player with which partner disagrees.

"Gentlemen, when the barrage lifts." 9th battalion, King's own Yorkshire light infantry,
2000 years earlier: "morituri te salutant"

"I will be with you, whatever". Blair to Bush, precursor to invasion of Iraq
0

#9 User is online   helene_t 

  • The Abbess
  • PipPipPipPipPipPipPipPipPipPipPip
  • Group: Advanced Members
  • Posts: 17,198
  • Joined: 2004-April-22
  • Gender:Female
  • Location:Copenhagen, Denmark
  • Interests:History, languages

Posted 2011-July-04, 02:28

 jogs, on 2011-July-03, 21:40, said:

http://crystalwebsit...ouble_dummy.htm

Cheung didn't show a valid proof of that conclusion. Those means only show that the error biases balance out. He needs to show that the errors of the estimates are small and the variances or the errors are small.

I think that an interesting hypothesis to test would be if the mean result for each board is generally close the DD result. And if not, if there are particular subsets of the deals on which the declarer has an advantage relative to DD.
The world would be such a happy place, if only everyone played Acol :) --- TramTicket
0

#10 User is offline   hotShot 

  • Axxx Axx Axx Axx
  • PipPipPipPipPipPipPip
  • Group: Advanced Members
  • Posts: 2,976
  • Joined: 2003-August-31
  • Gender:Male

Posted 2011-July-04, 05:30

Has anybody read this?

http://ieeexplore.ie...rnumber=4749256
1

#11 User is offline   jogs 

  • PipPipPipPipPipPip
  • Group: Advanced Members
  • Posts: 1,316
  • Joined: 2011-March-01
  • Gender:Male
  • Interests:student of the game

Posted 2011-July-04, 07:09

Cheung said in 2002.

Quote

It is likely within the next 3-5 years such a bidding system used by a computer program will be able to beat the best player in the world.


That hasn't happened. It's unlikely any bidding system by computers would be that successful until someone is willing to spend as much money on bridge as IBM has spent studying chess.
1

#12 User is offline   Wackojack 

  • PipPipPipPipPip
  • Group: Full Members
  • Posts: 925
  • Joined: 2004-September-13
  • Gender:Male
  • Location:England
  • Interests:I have discovered that the water cooler is a chrono-synclastic infundibulum

Posted 2011-July-04, 13:57

As Frances said your comparison with the double dummy result reflects your ability compared with that of your opponents. Although I don't think it gets near to answering your question and I am not sure what your question really was; FWIIW I kept a record of playing a forcing pass system on BBO. The fact that we were up in imps was probably our opps not having the luxury of working out how to defend against it.
Imps as Imps as
The forcing pass record to 30 March 2010
376 Hands played 376
Imps per hand against field 0.33
As Declarer 0.41
As defender 0.23
Imps per hand against Par 0.42

The gains against par were mainly phantom sacs by opps and better sacs by us.
May 2003: Mission accomplished
Oct 2006: Mission impossible
Soon: Mission illegal
0

#13 User is offline   jogs 

  • PipPipPipPipPipPip
  • Group: Advanced Members
  • Posts: 1,316
  • Joined: 2011-March-01
  • Gender:Male
  • Interests:student of the game

Posted 2011-July-04, 14:55

 helene_t, on 2011-July-04, 02:28, said:

I think that an interesting hypothesis to test would be if the mean result for each board is generally close the DD result. And if not, if there are particular subsets of the deals on which the declarer has an advantage relative to DD.


Have only seen a study on 1NT when both sides have 20 HCP. DD Declarer made less than 6.2 tricks. Author didn't publish that declarer made in actual play. But suspect declarer's mean tricks would be between 6.8 to 7. In this contract defense has a huge opening lead advantage DD.
0

Page 1 of 1
  • You cannot start a new topic
  • You cannot reply to this topic

3 User(s) are reading this topic
0 members, 3 guests, 0 anonymous users