News GTO Wizard AI Outperforms GPT-5 and Grok 4 in New Benchmark

Welcome to Poker Community Forums!

Welcome to PCF, the online poker forums. Grind your way to the top of the leaderboards on StockPoker and DarkSidePoker, and sharpen your strategy alongside the community. Earn badges by finishing in the top 3 on the PCF/StockPoker leaderboard, Shark Tank leaderboard, or DarkSidePoker leaderboard. Visit the leaderboard pages to learn how to qualify for the monthly leaderboard freerolls.

Join PCF! Login

Cpvr

PCF Prodigy
Staff member
Admin
Joined
Jan 23, 2024
Messages
1,200
Reaction score
280
In the rapidly evolving world of artificial intelligence, a common question in the poker industry has emerged: when will AI be good enough to consistently beat its human counterparts?

Humans were first pitted against AI back in 2019, with the first AI to beat Humans"]Pluribus besting a team of human players, becoming the first AI model to do so. Then, just last year nine AI models battled it out over almost 4,000 hands to find out who was best. While Meta's LLAMA 4 went broke, OpenAI o3 emerged victorious.

However, the frontier of poker and artificial intelligence has a new top model: GTO Wizard AI.

What is GTO Wizard AI?​

Their new GTO Wizard AI model is a state-of-the-art poker agent that powers all the site's custom solutions. Rather than being built off a general-purpose model, GTO Wizard AI was originally developed as Ruse AI by Canadian programmers Marc-Antoine Provost and Philippe Beardsell. This technology was acquired by GTO Wizard in 2023.

Unlike earlier bots like Slumbot (the 2018 Annual Computer Poker Competition (ACPC) champion), which relied on massive, pre-computed strategies, the GTO Wizard AI model does not store a complete poker strategy before play. Rather, it was trained against itself of hundreds of millions of hands, gradually learning which plays led to the highest expected value.

"Through deep reinforcement learning," says GTO Wizard, "GTO Wizard AI considers each particular situation as it arises during play and solves it in real-time, in a matter of seconds."

This approach was vindicated after GTO Wizard AI took on Slumbot in a controlled 150,000-hand match; GTO Wizard AI recorded a win rate of 19.4 bb/100 against Slumbot.

The outcome was as dramatic as it was surprising: GTO Wizard AI achieved a win-rate of 19.4BB/100 over the course of the match. For context, a world-class human professional typically aims for a win rate of 5 bb/100. If the stakes were $50/$100, with 200 hands of heads-up played per hour, GTO Wizard AI would have won $19.4 per hand at an hourly win rate of $3,880.

GTO Wizard AI

New AI Poker Benchmark​

But this isn't the only model that GTO Wizard AI has taken on and beaten.

New benchmark results provide the first standardized comparison between "frontier" Large Language Models (LLMs) and specialized poker agents. The data reveals that, while general AI has made massive leaps in reasoning, it still lacks the specific strategic depth required to beat the world’s leading poker solver.

Solving the "Luck" Factor with AIVAT​

How does GTO Wizard know these rankings are accurate and not just a run of hot cards? The benchmark utilizes AIVAT, a sophisticated variance-reduction technology. Because poker is naturally high-variance, it usually takes hundreds of thousands of hands to reach a statistically significant conclusion. AIVAT reduces this requirement by 10x, enabling researchers to assess an agent's "luck-adjusted" performance much more efficiently.

Challenge the Wizard: API Access Now Live​

GTO Wizard is now providing API access to allow independent developers and researchers to submit their own models for evaluation. This move aims to foster more transparent competition in the AI space. Developers can integrate their agents directly into the evaluation platform to compete in real-time. The API allows for hand simulation and result retrieval without exposing the solver’s internal capabilities.

In order to take on GTO Wizard AI, they must play a minimum of 2,500 hands of Heads-Up No-Limit Hold'em, with 200bb stacks that reset every hand. The API will limit usage to 100,000 hands per month.

As the industry moves toward Heads-Up Pot-Limit Omaha (PLO) benchmarks in the near future, the message from GTO Wizard is clear: the era of "claiming" to be the best is over. Now, you have to prove it on the leaderboard.


Source: https://www.pokernews.com/news/2026...s-gpt-5-and-grok-4-in-new-benchmark-51020.htm
 
AI has gotten to the level that one would just be observing 😂 But we shouldn't lie, those tools are not some magic wand. If your basics are not strong, even the best AI will not save you. With how tech is moving fast now, you would think the era of humans has gone, but our experience still matters.
 
Join Poker Community (PCF) on Discord! lbrew1 streams StockPoker on Twitch!

Members online

No members online now.
PCF Bankroll Starter $25 Freebuy on Stock Poker
Back
Top