So, Grok 3 is expected tonight at 8 PM Pacific Time and with some apprehension I will get into the dangerous territory of predicting technology. I am not trying to be too precise here, but wanted to set forth my expectations and signals I am looking for given that I have been following LLMs keenly over the last 3 years.

What we know about Grok 3 and xAI .. with some thoughts on what it could mean

The billionaire tech mogul founded xAI as a challenger to Microsoft-backed OpenAI and Alphabet's Google. Musk also co-founded OpenAI.

100,000 H100 GPU Cluster

Grok 3 is being trained on a 100,000 Nvidia H100 GPU Cluster. GPT 4 was supposed to have been trained on 24,000 GPUs. There was buzz around the new architecture which allowed for a 100,000 GPU cluster to achieve coherence through some innovation at the Network Link layer. Meaning: If scaling laws hold, this should be a substantially better model. I am assuming that xAI has done a good job of Data and Algos, and the scaling laws should deliver better results over GPT4 other things being equal.

Data from Twitter

With ChatGPT some of the core data sources have been protective of their data including Reddit, Twitter among others. Twitter/X have a near monopoly of high velocity of data. It will be interesting to see the quality of Grok 3 given this advantage. TikTok is supposed to deliver better recommendations because

Model vs. Systems

All the buzz is still for the models. The real breakthrough is going to happen at the systems level where all this tech is put to work on internal data stores that companies possess. While the model vendors are going for the level of power where they could capture the most value, it remains to be seen how much value enterprises are ready to cede to the model vendors and adopt models in a way that gives significant structural pricing power for the model vendors.

Productization

Beyond the model, the competition is also at the level of products and OpenAI is clearly focused on leading with products besides their research. Elon and xAI also have a product pedigree, so it would be interesting to see what productized features they deliver besides a good model.

Elon Musk’s reputation

Elon Musk has delivered ground breaking technologies including with Tesla, SpaceX and Starlink – mostly pioneers though there have been some small sized incumbents in those spaces. This is an epic attempt to catch up after starting late, and given all the drama around the conflict with Sam Altman and Elon’s role in the new administration this is going to matter a lot for his reputation. This may not mean much for technology progress, but if Elon’s reputation for being a genius is maintained by the Grok 3 effort it could push the SOTA significantly.

DeepSeek effort

Given that Grok 3 is being released post DeepSeek and OpenAI’s o1 & o3-mini models, it needs to have a reinforcement learning and a test time compute story. I would be surprised if they release a model post DeepSeek and be branded as behind the curve. It would be interesting to observe the approach to RL that Grok 3 takes – between the technologies used (GRPO etc.), their openness to showing reasoning traces, and their approach to Mixture of Experts. We have not seen much being published by the xAI team, and it will be interesting to see their openness in sharing their methods too.

Marketing

Anthropic has a quasi-academic flavor to its product releases. OpenAI is increasingly product focused and putting together a smorgasbord of capability demos (12 days of Shipmas is an example). DeepSeek and Meta have published a lot of details of their training and also have done open-source and open-weights to a reasonable extent. Grok 2 was a hurried launch. I am curious to see how the marketing for Grok 3 works – the capabilities they highlight, the methods they create buzz, the use cases they focus on etc. I want to see the vibe of the brand .. which has been subdued in Grok 2.

So those are my thoughts as we wait for Grok 3 to be unveiled. I will follow up post launch with my thoughts.

Book Your AI Consultation Today!

GenAI

AI Innovation

Practical AI Mastery Workshop

AI Strategy

Vector Database Consulting Services

Hybrid Retrieval

Vibe Coding

AI Guidance

Power Platform

Custom Application Development

Power Platform Support

Innovation Partner COE

HCL Notes

Application Maintenance

Expectations around Grok 3

100,000 H100 GPU Cluster

Data from Twitter

Model vs. Systems

Productization

Elon Musk’s reputation

DeepSeek effort

Marketing

more similar articles

AI Didn’t Make Me 10x Faster. It Just Moved the Work.

Automation in the Age of AI: The Foundation Still Matters

Analyzing India’s Potential as an AI Leader: From Digital Services Hub to Global AI Powerhouse

Exploring Agentic RAG in Healthcare

reliable, efficient & secured
– tailored to your needs !

Contact us at:

Our Services

Thought Leadership

about us

help & support

Expectations around Grok 3

100,000 H100 GPU Cluster

Data from Twitter

Model vs. Systems

Productization

Elon Musk’s reputation

DeepSeek effort

Marketing

more similar articles

reliable, efficient & secured– tailored to your needs !

Contact us at:

Our Services

Thought Leadership

about us

help & support

reliable, efficient & secured
– tailored to your needs !