The 8088 The 8088 ← All news
Simon Willison Emerging AI Innovations Apr 16

Qwen3.6-35B-A3B on my laptop drew me a better pelican than Claude Opus 4.7

★★★★★ significance 2/5

The author compares the image generation capabilities of the new Qwen3.6-35B-A3B and Claude Opus 4.7 models using a specific 'pelican riding a bicycle' benchmark. The comparison highlights differences in how the models interpret complex prompts and render specific details.

Why it matters Small-scale, open-weight models are increasingly challenging the visual reasoning and generation capabilities of top-tier proprietary frontier models.
Read the original at Simon Willison

Entities mentioned

Anthropic Alibaba

Tags

#qwen #claude #llm #benchmarking #image generation

Related coverage