Back to Portfolio

Royal Hold'em AI

Deep CFR Nash Equilibrium Self-Play RL
How It Works

Built with Deep Counterfactual Regret Minimization (Deep CFR), a reinforcement learning technique that approximates Nash Equilibrium by iteratively playing against itself and training deep neural networks to minimize regret over past decisions.

Nash Equilibrium
A strategy no opponent can exploit — provably optimal in a game-theoretic sense.
Self-Play
Zero human data. The AI discovers strategy entirely by playing millions of games against itself.
12 Neural Networks
8 advantage nets learn regret per betting round. 4 strategy nets accumulate the final equilibrium policy.
C++ + PyTorch
Parallel C++ game tree traversal runs concurrently with PyTorch gradient updates every iteration.
0
Your Wins
vs
0
AI Wins
P1

You

Waiting…
Stack 100
Bet 0
Preflop
POT 0
Community cards will appear here
AI

Deep CFR AI

Nash Equilibrium Strategy
Stack 100
Bet 0
Decision Breakdown

AI decision breakdown will appear here after the first move.

You'll see the exact probability the AI assigned to each action and the raw neural network outputs.

Welcome to Royal Hold'em

Royal Hold'em plays identically to Texas Hold'em, but uses a stripped 20-card deck containing only Tens, Jacks, Queens, Kings, and Aces.

Both players start with 100 chips. The small blind is 1 chip and the big blind is 2 chips.

Because there are fewer cards, hand values are heavily inflated. Two Pair and Three of a Kind are extremely common. In this scenario, you hit a Royal Flush!

A♠
A♠
K♠
K♠
P1

You (Royal Flush)

Stack 84
Bet 6
Flop
POT 12
Q♠
Q♠
J♠
J♠
10♠
10♠
AI

Deep CFR

Bet 6
Stack 84