CallMissed Blog

Insights on AI communication, voice agents, WhatsApp automation, and the future of customer engagement.

#Open Weights7 postsClear filter ×
5 min read
ArticleMay 8, 2026

Qwen 3.5: Alibaba's Multilingual Powerhouse

Alibaba's Qwen line has quietly become the multilingual default for the open-weight world. The Qwen 3.5 release in February 2026 cemented that — the family now spans 201 languages and dialects, leads instruction-following benchmarks, and sets a new baseline for what an open-weight model can do acros…

5 min read
ArticleMay 8, 2026

On-Device AI in 2026: Apple Intelligence, Phi, and the Local LLM Renaissance

For most of LLMs' history, "local model" meant either "demo-quality" or "you own a GPU." In 2026 that has shifted. Small models tuned for consumer hardware are crossing the threshold of usefulness — not parity with frontier models, but good enough that real apps are shipping with on-device inference…

5 min read
GuideMay 8, 2026

How Llama 4's Mixture-of-Experts Architecture Works

Meta's Llama 4 family is the first Llama generation to ship as a Mixture-of-Experts (MoE) architecture. That single design choice explains most of what's different about Scout and Maverick — including why both have "17 billion active parameters" but very different total parameter counts, and why the…

6 min read
ReviewMay 8, 2026

Mistral Medium 3.5: One Model, Three Product Lines

Mistral released Medium 3.5 on April 29, 2026, and the most interesting thing about it isn't a benchmark number — it's the strategy. Where every other open-weight flagship in 2026 has gone Mixture-of-Experts, Mistral Medium 3.5 is dense, 128 billion parameters, with a 256K context window. And it con…

5 min read
ArticleMay 8, 2026

DeepSeek R2: The Open-Source Reasoning Surprise

DeepSeek's R2 is the model that made open-weight reasoning a real category in 2026. Reasoning models — the variants that explicitly think before answering — were a closed-vendor club through 2025. R2 changed that: a 32B-parameter open-weight checkpoint that runs on a single 24GB consumer GPU and cle…

5 min read
ArticleMay 8, 2026

Gemma 4: Google's Open-Weight Push for 2026

Google's Gemma line has always been the open-weight cousin to the closed-source Gemini family — same training pipeline, same research lineage, public weights, permissive license. Gemma 4 is the 2026 release, and the headline is that the 31B dense variant beats Llama 4 Scout on most reasoning benchma…

6 min read
ComparisonMay 8, 2026

MoE vs Dense Models in 2026: Which Architecture Wins

The architecture wars are mostly settled in 2026 — but not in the way 2024's debates predicted. Mixture-of-Experts dominates the 100B+ flagship class: DeepSeek V4, Llama 4 Maverick, Qwen 3.5 397B-A17, Mistral Large 3 — all sparse MoE. Meanwhile, dense holds the mid-tier: Mistral Medium 3.5 at 128B i…