Royal Hold'em

How It Works

The AI is built with Deep Counterfactual Regret Minimization (Deep CFR), a self-play learning technique that approximates a Nash equilibrium. It repeatedly plays against itself and trains deep neural networks to minimize regret over its past decisions.
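To make "minimizing regret" concrete, here is a minimal sketch of regret matching, the rule CFR-style methods use to turn per-action regret estimates into a playing strategy. The action order (fold, call, raise) and the numbers are hypothetical, and the uniform fallback when no regret is positive is an assumption of this sketch rather than a confirmed detail of this AI.

```python
def regret_matching(regrets):
    """Turn per-action regrets into a probability distribution."""
    # Only actions we regret NOT having played more often get any weight.
    positive = [max(r, 0.0) for r in regrets]
    total = sum(positive)
    if total > 0:
        return [p / total for p in positive]
    # Assumption: with no positive regret, fall back to uniform play.
    return [1.0 / len(regrets)] * len(regrets)


# Hypothetical regrets for (fold, call, raise).
print(regret_matching([-2.0, 1.0, 3.0]))  # [0.0, 0.25, 0.75]
```

Actions with larger positive regret are played more often on the next iteration, which is what gradually pushes the strategy toward equilibrium.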

Note: Starting chips are reset to 100 at the beginning of every hand. This keeps the AI's processing manageable and its strategy close to optimal.

Nash Equilibrium

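A strategy that no opponent can exploit. In a two-player zero-sum game it is provably optimal in the game-theoretic sense: no counter-strategy can gain more against it than the value of the game.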
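A small illustration of what "cannot be exploited" means, using rock-paper-scissors as a stand-in because poker is far too large to fit in a few lines. The sketch computes the best payoff any opponent can earn against a fixed mixed strategy; the payoff matrix and the function name are illustrative and not part of this project.

```python
# Row player's payoff in rock-paper-scissors; the game is zero-sum.
PAYOFF = [
    [0, -1, 1],   # rock     vs rock, paper, scissors
    [1, 0, -1],   # paper
    [-1, 1, 0],   # scissors
]


def best_response_value(opponent_strategy):
    """Highest expected payoff any single action earns against a fixed mix."""
    return max(
        sum(PAYOFF[a][b] * opponent_strategy[b] for b in range(3))
        for a in range(3)
    )


uniform = [1 / 3, 1 / 3, 1 / 3]      # the Nash equilibrium of this game
rock_heavy = [0.5, 0.25, 0.25]       # a hypothetical exploitable strategy

print(best_response_value(uniform))     # 0.0
print(best_response_value(rock_heavy))  # 0.25
```

Against the equilibrium mix the best counter-strategy gains nothing, while the rock-heavy mix gives up 0.25 per game to an opponent who always plays paper.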

Self-Play

Zero human data. The AI discovers strategy entirely by playing millions of games against itself.
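The self-play idea can be sketched on a toy game. Below, two regret-matching players repeatedly face each other at rock-paper-scissors with no example games to learn from, and their average strategies drift toward the equilibrium (uniform) mix. This is a tabular illustration under simplifying assumptions (exact expected payoffs, no neural networks, a hypothetical initial bias toward rock), not the training loop used for the poker AI.

```python
# Payoff for the player choosing the row action; by symmetry of the game
# the same table serves both players.
PAYOFF = [[0, -1, 1], [1, 0, -1], [-1, 1, 0]]


def regret_matching(regrets):
    positive = [max(r, 0.0) for r in regrets]
    total = sum(positive)
    n = len(regrets)
    return [p / total for p in positive] if total > 0 else [1.0 / n] * n


def self_play(iterations=10_000):
    # Player 0 starts with a hypothetical bias toward rock so that the run
    # is not trivially symmetric from the first step.
    regrets = [[1.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
    strategy_sums = [[0.0] * 3, [0.0] * 3]
    for _ in range(iterations):
        strategies = [regret_matching(r) for r in regrets]
        for player in (0, 1):
            opponent = strategies[1 - player]
            # Expected payoff of each pure action against the opponent's mix.
            values = [
                sum(PAYOFF[a][b] * opponent[b] for b in range(3))
                for a in range(3)
            ]
            expected = sum(p * v for p, v in zip(strategies[player], values))
            for a in range(3):
                regrets[player][a] += values[a] - expected
                strategy_sums[player][a] += strategies[player][a]
    # The average strategy, not the latest one, approaches equilibrium.
    return [[s / iterations for s in sums] for sums in strategy_sums]


for average in self_play():
    print([round(p, 3) for p in average])
```

Both printed strategies should sit close to one third per action, even though neither player was ever shown how the game "should" be played.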

12 Neural Networks

Eight advantage networks learn regret values for each betting round. Four strategy networks accumulate the average policy, which serves as the final approximate equilibrium strategy.
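One plausible way to arrive at these counts, shown purely as an assumption: one advantage network per player per betting round (2 x 4 = 8) and one strategy network per betting round (4). The round names and the string placeholders standing in for real networks are hypothetical, and the page does not confirm this exact layout.

```python
from itertools import product

# Assumption: four betting rounds and two players (you and the AI).
ROUNDS = ["preflop", "flop", "turn", "river"]
PLAYERS = [0, 1]

# Placeholders stand in for real network objects.
advantage_nets = {
    (player, rnd): f"advantage_net_p{player}_{rnd}"
    for player, rnd in product(PLAYERS, ROUNDS)
}
strategy_nets = {rnd: f"strategy_net_{rnd}" for rnd in ROUNDS}

print(len(advantage_nets), len(strategy_nets))  # 8 4
```

Under this reading, a decision on the turn would query one network for the turn only, so each network specializes in a single stage of the hand.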

C++ + PyTorch

Parallel C++ game tree traversal runs concurrently with PyTorch gradient updates every iteration.
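The overlap between traversal and training can be pictured as a producer-consumer pipeline. The sketch below uses Python threads and a bounded queue as stand-ins: workers "traverse" by emitting dummy samples while a trainer groups them into batches as they arrive. All names, sizes, and the use of Python threads are assumptions for illustration; the actual project does the traversal in C++ and its internals are not shown on this page.

```python
import queue
import threading


def traverse(worker_id, n_samples, out):
    # Stand-in for game tree traversal: emit (worker_id, index) samples.
    for i in range(n_samples):
        out.put((worker_id, i))
    out.put(None)  # sentinel: this worker is done


def train(inbox, batch_size, n_workers, batches):
    # Stand-in for gradient updates: consume samples while workers still run.
    batch, finished = [], 0
    while finished < n_workers:
        item = inbox.get()
        if item is None:
            finished += 1
            continue
        batch.append(item)
        if len(batch) == batch_size:
            batches.append(batch)  # a real trainer would take a step here
            batch = []
    if batch:
        batches.append(batch)


def run(n_workers=4, n_samples=250, batch_size=100):
    samples = queue.Queue(maxsize=1000)  # bounded so workers cannot run far ahead
    batches = []
    trainer = threading.Thread(
        target=train, args=(samples, batch_size, n_workers, batches)
    )
    workers = [
        threading.Thread(target=traverse, args=(w, n_samples, samples))
        for w in range(n_workers)
    ]
    trainer.start()
    for w in workers:
        w.start()
    for w in workers:
        w.join()
    trainer.join()
    return batches


batches = run()
print(len(batches), sum(len(b) for b in batches))  # 10 1000
```

The bounded queue is the design point worth noting: it lets both sides work at the same time while preventing the traversal side from piling up unbounded data if training falls behind.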

Decision Breakdown

AI decision breakdown will appear here after the first move.

You'll see the exact probability the AI assigned to each action and the raw neural network outputs.
