Carlo Rolloutによって推定 – 予め計算しておいたk = 4個の戦略の中から戦略 を選択,葉ノード以降はその戦略に従って⾏動 ▪ ポーカーでよく使われるAbstractionを⽤いて計算 された戦略 ▪ 降りることに特化した戦略 ▪ コールすることに特化した戦略 ▪ レイズすることに特化した戦略 – ノードのvalueが戦略に依存したものであることを 近似的に表現 Fig. 4. Real-time search in Pluribus. The subgame sh nodes indicates that the player to act does not know w information subgame. Right: The transformed subga strategy. An initial chance node reaches each root nod reached in the previously-computed strategy profile (o time in the hand that real-time search is conducted). T which each player still in the hand chooses among k chooses. For simplicity, 2 k = in the figure. In Pluribu selection of a continuation strategy for that player f terminal node (whose value is estimated by rolling continuation strategies the players chose). Fig. 4. Real-time search in Pluribus. The subgame shows just two players for simplicity. A dashed line between nodes indicates that the player to act does not know which of the two nodes she is in. Left: The original imperfect- information subgame. Right: The transformed subgame that is searched in real time to determine a player’s