CallMissed Blog
Insights on AI communication, voice agents, WhatsApp automation, and the future of customer engagement.
5 min read
ArticleMay 8, 2026
Gemini 3.1 Pro Benchmarks Explained: ARC-AGI-2 and Beyond
On February 19, 2026, Google released Gemini 3.1 Pro and the benchmark headline that followed was unusual: a verified score of 77.1% on ARC-AGI-2, more than double the previous Gemini 3 Pro number on the same test. ARC-AGI-2 is a benchmark designed to be hard for memorization, so a jump that size is…
5 min read
ArticleMay 8, 2026
DeepSeek R2: The Open-Source Reasoning Surprise
DeepSeek's R2 is the model that made open-weight reasoning a real category in 2026. Reasoning models — the variants that explicitly think before answering — were a closed-vendor club through 2025. R2 changed that: a 32B-parameter open-weight checkpoint that runs on a single 24GB consumer GPU and cle…