Simon Willison Emerging AI Innovations Apr 16

Qwen3.6-35B-A3B on my laptop drew me a better pelican than Claude Opus 4.7

★★★★★ significance 2/5

The author compares the image generation capabilities of the new Qwen3.6-35B-A3B and Claude Opus 4.7 models using a specific 'pelican riding a bicycle' benchmark. The comparison highlights differences in how the models interpret complex prompts and render specific details.

Why it matters Small-scale, open-weight models are increasingly challenging the visual reasoning and generation capabilities of top-tier proprietary frontier models.

Read the original at Simon Willison

Entities mentioned

Anthropic Alibaba

Related coverage

arXiv cs.CLAu-M-ol: A Unified Model for Medical Audio and Language Understanding
Simon WillisonIntroducing talkie: a 13B vintage language model from 1930
Hugging FaceAdaptive Ultrasound Imaging with Physics-Informed NV-Raw2Insights-US AI
Simon Willisonmicrosoft/VibeVoice
WIRED AIThe Man Behind AlphaGo Thinks AI Is Taking the Wrong Path

Qwen3.6-35B-A3B on my laptop drew me a better pelican than Claude Opus 4.7

Entities mentioned

Tags

Related coverage