Insights

The official Llama API is now accelerated by Groq. Served on the world’s most efficient inference chip, it’s the fastest way to run the world’s most trusted openly available models with no tradeoffs. In collaboration with Meta, a limited free preview is live now. What Is It? The official Llama...

Build with access to the internet and the ability to run code with a one line change to your model string. Compound Beta is Groq’s first compound AI system, released under preview on GroqCloud™. It combines openly available models already supported on our platform with built-in tool use, starting with...

Meta’s Llama 4 Scout and Maverick models are live today on GroqCloud™, giving developers and enterprises day-zero access to the most advanced open-source AI models available. Today, Meta released the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences. With Llama 4...