AI Case Study

Deep Stack's poker AI delivered statistically significant wins in Texas Hold 'Em poker games through deep learning.

Deep Stack is one of several AI tools set up to play poker - this is a challenge because of the lack of full information that might be available in a game like chess. Hence the bluff element. Deep Stack was able to deliver a statistically significant levels of wins against professional poker players.

Industry

Consumer Goods And Services

Entertainment And Sports

Project Overview

"The aim of traditional game-playing AIs is to calculate the possible results of a game as far as possible and then rank the strategy options using a formula that searches data from other winning games. The downside to this method is that in order to compress the available data, algorithms sometimes group together strategies that don’t actually work. Poker AI, DeepStack, avoids abstracting data by only calculating ahead a few steps rather than an entire game. The program continuously recalculates its algorithms as new information is acquired. When the AI needs to act before the opponent makes a bet or holds and does not receive new information, deep learning steps in. Neural networks, the systems that enact the knowledge acquired by deep learning, can help limit the potential situations factored by the algorithms because they have been trained on the behavior in the game. This makes the AI’s reaction both faster and more accurate ... In order to train DeepStack’s neural networks, researchers required the program to solve more than 10 million randomly generated poker game situations.

To test DeepStack, the researchers pitted it last year against a pool of 33 professional poker players selected by the International Federation of Poker. Over the course of 4 weeks, the players challenged the program to 44,852 games of heads-up no-limit Texas Hold ‘em, a two-player version of the game in which participants can bet as much money as they have. After using a formula to eliminate instances where luck, not strategy, caused a win, researchers found that DeepStack’s final win rate was 486 milli-big-blinds per game . A milli- big-blind is one-thousandth of the bet required to win a game. That’s nearly 10 times that of what professional poker players consider a sizable margin, the team reports this week in Science."

Reported Results

"DeepStack’s final win rate was 486 milli-big-blinds per game. A milli- big-blind is one-thousandth of the bet required to win a game. That’s nearly 10 times what professional poker players consider a sizeable margin"

Technology

Function

Operations

General Operations

Background

According to Science Mag:
"In this version of poker, two or more players are randomly dealt two face-down cards. At the introduction of each new set of public cards, players are asked to bet, hold, or abandon the money at stake on the table. Because of the random nature of the game and two initial private cards, players' bets are predicated on guessing what their opponent might do. Unlike chess, where a winning strategy can be deduced from the state of the board and all the opponent’s potential moves, Hold ‘em requires what we commonly call intuition."

Benefits

Data