The 8088 The 8088 ← All news
Simon Willison Emerging AI Innovations Apr 24

DeepSeek V4 - almost on the frontier, a fraction of the price

★★★★ significance 4/5

DeepSeek has released the first two models of its highly anticipated V4 series: DeepSeek-V4-Pro and DeepSeek-V4-Flash. These models feature a 1 million token context window and use a Mixture of Experts architecture, with the Pro version being one of the largest open weights models available.

Why it matters High-parameter frontier performance is decoupling from extreme compute costs, challenging the dominance of Western-centric, high-cost proprietary models.
Read the original at Simon Willison

Entities mentioned

DeepSeek

Tags

#deepseek #open weights #llm #mixture of experts #v4

Related coverage