Open source AI models

The best open source AI models, ranked and compared

A browsable guide to the top open source large language models. Compare context, license, speed, and intelligence, and run any of them in Gumloop.

Talk to Sales

The best open source LLMs

Each model is open weight and available to run as an agent in Gumloop. Rankings weigh quality, cost, and how well each one handles real agent work.

1

GLM-5.2

Z.ai
Best overall
Available in Gumloop

GLM-5.2 from Z.ai is the top rated open source model in Gumloop. It pairs a million-token context with strong coding and reasoning, and it is built to chain tools and actions across long agentic tasks.

  • Million-token context
  • Top tier coding and reasoning
  • Built for agentic tool use

Flagship MoE model with 1M context for coding, reasoning, and agentic workflows.

Speed
Intelligence
ProviderZ.ai
Context1.0M tokens | 786k words
LicenseMIT
Released2026

Tool Calling
Reasoning
2

DeepSeek V4 Pro

DeepSeek
Best for reasoning
Available in Gumloop

DeepSeek V4 Pro is the most capable DeepSeek model, built for complex reasoning, coding, and long-context agent workflows. It matches frontier quality on hard math and logic tasks at a fraction of the cost.

  • Million-token context
  • Strong chain-of-thought reasoning
  • Competitive coding at low cost

Most capable DeepSeek model for complex reasoning, coding, and long-context agent workflows.

Speed
Intelligence
ProviderDeepSeek
Context1.0M tokens | 786k words
LicenseMIT
Released2026

Tool Calling
Reasoning
3

Kimi K2.7 Code

Moonshot
Best for coding
Available in Gumloop

Kimi K2.7 Code from Moonshot is tuned for agentic coding and long-horizon software engineering. It plans multi-step edits across large codebases without losing track.

  • Built for long-horizon coding
  • Agentic, multi-step edits
  • Tool calling and vision

Coding-focused agentic model for long-horizon software engineering tasks.

Speed
Intelligence
ProviderMoonshot
Context262k tokens | 196k words
LicenseModified MIT
Released2026

Tool Calling
Vision
4

Qwen3.5 397B

Qwen
Best for multilingual
Available in Gumloop

Qwen3.5 397B from Alibaba is a 397B mixture-of-experts model with top-tier reasoning and broad multilingual coverage. It is a strong open-weight choice for work that spans many languages.

  • 397B mixture-of-experts
  • Top tier reasoning
  • Broad multilingual coverage

Flagship model with top-tier reasoning and multilingual capabilities.

Speed
Intelligence
ProviderQwen
Context262k tokens | 196k words
LicenseApache 2.0
Released2026

Vision
5

DeepSeek V4 Flash

DeepSeek
Fastest
Available in Gumloop

DeepSeek V4 Flash trades a little raw intelligence for speed. It keeps the million-token context and reasoning of the Pro model while responding fast, which makes it a fit for high-volume agent work.

  • Fast responses
  • Million-token context
  • Tool calling and reasoning

Fast reasoning model for long-context agent workflows.

Speed
Intelligence
ProviderDeepSeek
Context1.0M tokens | 786k words
LicenseMIT
Released2026

Tool Calling
Reasoning
6

MiniMax M3

MiniMax
Best multimodal
Available in Gumloop

MiniMax M3 is a native multimodal model built for agentic coding and tool use. It reads images alongside text and pairs that with a long context window for large, mixed-media tasks.

  • Native multimodal
  • Agentic coding and tool use
  • Long context window

Native multimodal model for agentic coding and tool use.

Speed
Intelligence
ProviderMiniMax
Context524k tokens | 393k words
LicenseApache 2.0
Released2026

Tool Calling
Vision
7

Kimi K2.6

Moonshot
Best for multi-agent
Available in Gumloop

Kimi K2.6 from Moonshot is a multimodal model aimed at long-horizon coding, UI and UX generation, and multi-agent orchestration. It is a fit for complex workflows that hand work between several agents.

  • Multimodal input
  • Long-horizon coding and UI generation
  • Multi-agent orchestration

Multimodal model for long-horizon coding, UI/UX generation, and multi-agent orchestration.

Speed
Intelligence
ProviderMoonshot
Context262k tokens | 196k words
LicenseModified MIT
Released2026

Vision

Open source models compared

Specs side by side, so you can match a model to your task at a glance.

ModelProviderContextLicenseIntelligenceBest forIn Gumloop
GLM-5.2Z.ai1.0M tokensMIT5 / 5Best overall
DeepSeek V4 ProDeepSeek1.0M tokensMIT5 / 5Best for reasoning
Kimi K2.7 CodeMoonshot262k tokensModified MIT5 / 5Best for coding
Qwen3.5 397BQwen262k tokensApache 2.05 / 5Best for multilingual
DeepSeek V4 FlashDeepSeek1.0M tokensMIT4 / 5Fastest
MiniMax M3MiniMax524k tokensApache 2.05 / 5Best multimodal
Kimi K2.6Moonshot262k tokensModified MIT5 / 5Best for multi-agent

Open source vs frontier models

Frontier models from OpenAI, Anthropic, and Google still lead on the hardest tasks. Open source models win on cost, control, and customization, and they close the quality gap with every release.

Open source modelsFrontier models
CostLow per-token cost, often a fraction of frontier pricing.Premium pricing on the strongest tiers.
Control and privacyOpen weights you can self-host or run under Zero Data Retention.Hosted by the provider, with data handling set by their policy.
CustomizationFine-tune and adapt the weights to your domain.Limited to provider tuning and prompting.
Peak capabilityClosing the gap fast, and already ahead on cost-to-quality.Still leads on the hardest frontier benchmarks.
HostingRun them yourself, through a provider, or inside Gumloop.Provider API only.

Understanding open source AI models

What are open source AI models?

Open source AI models are large language models whose weights are published under a license that lets you download, run, and usually modify them. Rather than calling a single provider API, you can host the model yourself, run it through a hosting provider, or use it inside a platform like Gumloop.

That openness is why these models have spread so fast. Anyone can inspect them, fine-tune them on their own data, and run them wherever their data needs to stay.

Open weights vs fully open source

Most models people call open source today are open weight. The trained weights are downloadable, so you can run and fine-tune the model, but the training data and full recipe may not be released.

Fully open source goes further and also publishes the training code and data. For practical use the license matters most. Apache 2.0 and MIT are permissive and safe for commercial work, while some community licenses add conditions worth reading before you self-host.

How open source LLMs work

These models are trained on large text corpora to predict the next token, then refined with instruction tuning and reinforcement learning so they follow directions and use tools. Many of the strongest open source models use a mixture-of-experts design, which routes each token through a small slice of a much larger network. That keeps quality high while holding inference cost down.

The newest open source models also stream their reasoning, so you can watch them plan a multi-step task instead of waiting for a single final answer.

How to choose an open source model

Start with the task. Match the context window to the size of your inputs, check the license if you plan to self-host or fine-tune, and weigh cost against the quality the work actually needs.

For most agent work a strong general model is the right default, then reach for a coding-tuned or long-context model when the task calls for it. In Gumloop you can switch the model behind an agent at any time, so it is easy to test a few against your own workload.

Open source AI models FAQ

Run open source models in Gumloop

Pick a model, connect your tools, and put an AI agent to work. No infrastructure to manage.
Talk to Sales
Gradient