14
Models
6
Makers
1M
Max Context
9/14
Open Weights
New since April: Kimi K2.6 (Apr 20) · MiMo-V2.5 & V2.5-Pro replace the V2 line (Apr 22) · DeepSeek V4 Pro & Flash (Apr 24) · Qwen3.7 Max, Qwen3.7 Plus & Qwen3.6 Plus join (May 20) · MiniMax M3 (Jun 1). Six-week window, four new ~1T-param sparse-attention flagships with 1M context.
The Go Subscription
$5 first month → $10/mo after
All 14 models, one subscription. Works with OpenCode or any coding agent. Top-up credit available. Cancel any time. Higher tiers (5x → 100x) multiply the limits. Free tier: 200 requests / 5h on Big Pickle + free models.
Subscribe to Go →
Requests per 5 hours — Go 1x tier (log scale)
DeepSeek V4 Flash
31,650
MiMo-V2.5
30,100
Qwen3.7 Plus
4,300
DeepSeek V4 Pro
3,450
MiniMax M2.7
3,400
Qwen3.6 Plus
3,300
MiMo-V2.5-Pro
3,250
MiniMax M3
3,200
Kimi K2.6
1,150
Qwen3.7 Max
950
GLM-5.1
880
Big Pickle + free
200
GLM-5, Kimi K2.5 and MiniMax M2.5 are included but not listed on the published limits chart. The throttle is an inverse capability signal — flagships cost more to serve.
MiniMax
MiniMax M3
New · Jun 1 Multimodal
Newest model in the lineup. MiniMax Sparse Attention (MSA) makes 1M-token context practical — 1/20th per-token compute vs prior gen, 9× faster prefill, 15× faster decode. Native text+image+video input. Open weights promised within days of launch.
ParamsNot disclosed
Context1M (512K min)
ReleasedJun 1, 2026
LicenseOpen (pending)
Key Benchmarks
SWE-Pro 59.0% Terminal-Bench 2.1 66.0% OSWorld 70.0% Beats GPT-5.5 / Gemini 3.1 Pro
Alibaba (Qwen)
Qwen3.7 Max
New · May 20 Proprietary
Best SWE-Bench Pro score in the Go lineup (60.6%). Flagship agent model — ran a 35-hour autonomous kernel-optimization session: 1,158 tool calls, 432 evaluations, 10x speedup, zero human intervention. Highest-ranked Chinese model on the AA Intelligence Index (56.6).
ParamsNot disclosed
Context1M / 65K out
ReleasedMay 20, 2026
LicenseProprietary
Key Benchmarks
SWE-Pro 60.6% ★ Terminal-Bench 69.7 MCP-Atlas 76.4 GPQA-D 92.4
Moonshot AI
Kimi K2.6
New · Apr 20 Mod. MIT
Native multimodal agentic flagship. 1T MoE with ~32B active; MLA keeps long-context VRAM cheap. Agent-swarm capable: scales to 300 sub-agents executing 4,000 coordinated steps. Beats Opus 4.6 and GPT-5.4 on SWE-Bench Pro.
Params1T / 32B active
Context256K
ReleasedApr 20, 2026
LicenseModified MIT
Key Benchmarks
SWE-Pro 58.6% HLE w/ tools 54.0% 300-agent swarm
DeepSeek
DeepSeek V4 Pro
New · Apr 24 Open Weights
Biggest model in the lineup: 1.6T params, 49B active. CSA+HCA hybrid attention — at 1M context it needs only 27% of the FLOPs and 10% of the KV cache of V3.2. Pro-Max variant posts the highest coding scores of any model (80.6% SWE-Verified, 93.5 LiveCodeBench).
Params1.6T / 49B active
Context1M
ReleasedApr 24, 2026
LicenseOpen Weights
Key Benchmarks
SWE-Verified 80.6% (Max) LiveCodeBench 93.5 MRCR-1M 83.5%
DeepSeek
DeepSeek V4 Flash
New · Apr 24 Open Weights
The throughput king of Go — 31,650 requests per 5 hours, the highest allowance in the plan. Efficiency-optimized MoE (284B total, just 13B active) with the same 1M-token context as Pro. The default pick for high-volume agent loops.
Params284B / 13B active
Context1M
ReleasedApr 24, 2026
LicenseOpen Weights
Why It Matters
Highest Go limit 13B active = cheap serving 1M ctx at Flash price
Xiaomi
MiMo-V2.5-Pro
New · Apr 22 Open Source
Replaces both MiMo-V2-Pro and V2-Omni: one open-source model for reasoning AND full multimodality (image, audio, video, text). 1.02T MoE, hybrid attention + 3-layer multi-token prediction. Uses 40–60% fewer tokens than Opus 4.6 for comparable agentic scores. UltraSpeed variant hits 1,000 tok/s.
Params1.02T / 42B active
Context1M
ReleasedApr 22, 2026
LicenseOpen Source
Key Benchmarks
SWE-Pro 57.2% Token-efficient ClawEval UltraSpeed 1000 tok/s
Xiaomi
MiMo-V2.5
New · Apr 22 Multimodal
The efficient sibling in Xiaomi's V2.5 public-beta series (which also includes TTS and ASR variants). Second-highest allowance on Go at 30,100 requests per 5 hours — built to be hammered in agent loops where Pro-class reasoning isn't needed.
ParamsNot disclosed
Context
ReleasedApr 22, 2026
InputMultimodal
Why It Matters
30,100 req/5h on Go Volume workhorse
Alibaba (Qwen)
Qwen3.7 Plus
New · May 2026 Vision
Multimodal variant of the 3.7 series with vision input at a lower price point. The budget pick for high-volume, routine workloads — Max is the heavy hitter, Plus is what you run all day. Generous 4,300 req/5h on Go.
ParamsNot disclosed
InputText + Vision
ReleasedMay 2026
LicenseProprietary
Why It Matters
GUI-agent capable 4,300 req/5h on Go
Alibaba (Qwen)
Qwen3.6 Plus
Proprietary
Prior-generation Plus, kept in the lineup as a proven high-volume option (3,300 req/5h). Solid general coding and chat at minimal cost — the safe fallback when 3.7 capacity is constrained.
ParamsNot disclosed
Context
GenerationQwen 3.6
LicenseProprietary
Why It Matters
3,300 req/5h on Go Fallback capacity
Zhipu AI (Z.ai)
GLM-5.1
Reasoning MIT
Long-horizon agentic flagship. Can work autonomously for up to 8 hours on a single task. +20.4pts CyberGym, +7.3pts Terminal-Bench vs GLM-5. Trained entirely on Huawei Ascend chips.
Params744B / ~44B active
Context200K
ReleasedApr 7, 2026
LicenseMIT
Key Benchmarks
SWE-Pro 58.4% CyberGym 68.7% Terminal-Bench 63.5%
Zhipu AI (Z.ai)
GLM-5
Reasoning MIT
First open-weights model to break AA Intelligence Index 50. DeepSeek Sparse Attention for 200K context. Superseded by 5.1 for agentic work but still a strong general reasoner.
Params744B / 44B active
Context200K
ReleasedFeb 11, 2026
LicenseMIT
Key Benchmarks
AA Index 50 SWE-Verified 77.8% AIME 95.4%
Moonshot AI
Kimi K2.5
Multimodal Open Weights
Native multimodal agentic model built via continual pretraining on ~15T visual+text tokens atop K2. MoonViT 400M vision encoder. State-of-the-art visual coding and BrowseComp w/ ctx (74.9%).
ParamsMoE (K2 base)
Context262K
ReleasedJan 27, 2026
LicenseOpen Weights
Key Benchmarks
HLE w/ tools 50.2% AIME 96.1% BrowseComp 74.9%
MiniMax
MiniMax M2.7
Reasoning Not Yet Open
First model deeply participating in its own evolution. Self-improving with Agent Teams and dynamic tool search. Highest GDPval-AA Elo (1495) among open-source models. 97% skill adherence.
ParamsNot disclosed
Context200K
ReleasedMar 18, 2026
LicenseTBD
Key Benchmarks
AA Index 50 GDPval-AA 1495 Elo SWE-Pro 56.2%
MiniMax
MiniMax M2.5
Reasoning MIT
Open-source workhorse with best-in-class SWE-Bench Verified (80.2%) at launch. ~100 tok/s inference. Matching Opus-class coding at 1/20th the cost — still the value benchmark of the lineup.
Params~229B / 10B active
Context200K
ReleasedFeb 12, 2026
LicenseMIT
Key Benchmarks
SWE-Verified 80.2% Multi-SWE 51.3% BrowseComp 76.3%

Quick Comparison

Model Maker Params Context License Best At Price (in/out) Go req/5h
Qwen3.7 MaxAlibaba1MProprietarySWE-Pro (60.6%), 35h autonomy$2.50/$7.50950
MiniMax M3MiniMax1MOpen (pending)SWE-Pro 59.0%, MSA speed$0.60/$2.403,200
Kimi K2.6Moonshot1T/32B256KMod. MITAgent swarm, multimodal coding$0.95/$4.001,150
DeepSeek V4 ProDeepSeek1.6T/49B1MOpen Wt.Coding (80.6% SWE-V, Max)$1.74/$3.483,450
DeepSeek V4 FlashDeepSeek284B/13B1MOpen Wt.Throughput, volume agents31,650
MiMo-V2.5-ProXiaomi1.02T/42B1MOpen SourceToken efficiency, multimodal$1/$33,250
MiMo-V2.5XiaomiVolume workhorse30,100
Qwen3.7 PlusAlibabaProprietaryVision, budget volume4,300
Qwen3.6 PlusAlibabaProprietaryFallback capacity3,300
GLM-5.1Zhipu AI~744B/44B200KMITLong-horizon (8h autonomy)Z.ai API880
GLM-5Zhipu AI744B/44B200KMITAA Index, SWE-VerifiedContact Z.ai
Kimi K2.5MoonshotMoE262KOpen Wt.Multimodal, BrowseComp$0.38/$1.72
MiniMax M2.7MiniMax200KTBDSelf-evolution, GDPval-AA$0.30/$1.203,400
MiniMax M2.5MiniMax229B/10B200KMITSWE-Verified (80.2%), Cost$0.15/$1.20
Go req/5h = requests per 5-hour window on the Go 1x tier ($10/mo). Higher tiers (5x–100x) multiply these limits. "—" = included in Go but not on the published limits chart. API prices are list prices on OpenRouter/Fireworks/first-party, not what Go charges.
Get OpenCode Go — $5 first month →

Data sourced from opencode.ai/go, official releases, HuggingFace, OpenRouter, and Artificial Analysis. Updated June 9, 2026.

Built with here.now · VE LAB brand system