Completely different AI labs have completely different priorities. OpenAI has historically centered on client customers, for example, whereas its rival Anthropic tends to focus on enterprises. Elon Muskβs xAI, we found lately, has been inserting specific emphasis on video-game walkthroughs.
On Friday, Enterprise Insiderβs Grace Kay revealed an in depth and far-reaching report about xAI, the AI startup lately acquired by SpaceX, with specific emphasis on how Musk is making life tough for workers. However this specific anecdote stood out:
In a single occasion final yr, a mannequin launch was delayed for a number of days as a result of Musk was dissatisfied with how the chatbot answered detailed questions in regards to the online game βBaldurβs Gate,β in response to individuals acquainted with the matter. Excessive-level engineers had been pulled from different initiatives to enhance the responses earlier than launch, they stated.
After all, you may think about the frustration of any revered and skilled engineer who reveals as much as work considering heβll be tackling basic issues of data and machine intelligence, solely to be sidetracked into serving to a 54-year-old man beat his online game. However the anecdote raises an much more urgent query: Did Musk find yourself getting the gaming expertise he wished?
To reply that query, our resident RPG fanatic Ram Iyer put collectively a set of 5 common questions on Baldurβs Gate, which we ran towards xAI and the three main fashions in a type of quasi-benchmark that Iβve determined to name βBaldurBench.β
Within the curiosity of journalistic transparency, Iβve made all of the chat transcripts public, so you may see them right here: Grok, ChatGPT, Claude, and Gemini.
First, the excellent news: Grok truly provides fairly good info. Its responses had been a bit dense with gamer jargon β βsave-scummingβ as an alternative of saving and βDPSβ as an alternative of injury β however the solutions had been each helpful and well-informed, supplied you knew what it was speaking about. Grok additionally actually loves tables and theorycraft, which is about what you’d anticipate.
There are many Baldurβs Gate guides on the market and the fashions had been usually drawing from the identical ones, so the most important variations had been stylistic. ChatGPT prefers bulleted lists and sentence fragments, whereas Gemini likes to daring necessary phrases.
Techcrunch occasion
Boston, MA
|
June 9, 2026
The largest shock was Claude, which was significantly involved about giving me info that might spoil my expertise of the sport. Once I requested about good celebration compositions, it closed the steerage by saying, βDonβt stress an excessive amount of and simply play what sounds enjoyable to you.β Thanks, Claude!
Itβs necessary to remember, it is a topic space we all know (because of Enterprise Insiderβs reporting) that xAI has particularly centered on reaching parity. So we shouldnβt learn an excessive amount of into the truth that, after the reported dash, Grokβs recommendation turned out about the identical as the opposite fashions. Nonetheless, itβs good to know xAI could make it work if it tries.
Β





