2026-06-16 zhipu
Zhipu released GLM-5.2 weights under MIT, with a 1M context, a long-horizon focus, and a tunable thinking budget. Its own benchmarks place it within a point or two of the closed frontier on long-horizon coding. The real signal is not another leaderboard run but the open-weight capability-cost curve dropping another notch. Treat the vendor numbers with a discount, and test the 1M usability and long-horizon reliability on your own tasks.
Read analysis 2026-06-14 zhipu
Zhipu released GLM-5.2 and declared it fully open the same week Anthropic's Fable was pulled. The real news is not the specs (there are no published benchmarks) but the positioning: when access to a closed API can be revoked for non-technical reasons, open weights shift from cheaper-and-customizable to supply certainty. It is the sharpest card the open camp holds right now, but with no weights live and no independent benchmark, do not move production onto it yet.
Read analysis 2026-06-10 deepseek
DeepSeek V4 matters because it turns 1M context from a capability demo into a cost, routing, and product-default problem for builders.
Read analysis 2026-06-10 deepseek
DeepSeek V4 pressures closed frontier models by pairing open weights with same-day API availability, compatibility, and a clear migration path.
Read analysis 2026-06-10 minimax
MiniMax M3's real signal is not another 1M context window; it is MSA trying to lower long-context cost before serving tricks begin.
Read analysis 2026-06-10 minimax
M3's real signal is MSA cutting per-token compute at 1M context to 1/20 of the prior generation, with 15x faster decoding. The cost curve of long-context agents is pushed down by a Chinese lab. But the weights were not open on launch day; 'open source in 10 days' is the sincerity test.
Read analysis 2026-06-10 minimax
M3's hard part is not the model card; it is whether vLLM and the broader serving stack can support MSA's block-sparse attention efficiently.
Read analysis