DeepSeek Launches V4 Flash and Pro Models, Claiming to Close Gap with Frontier AI

DeepSeek’s V4 Launch Targets Frontier AI Performance

Chinese AI lab DeepSeek released preview versions of its next‑generation models—V4 Flash and V4 Pro—promising to "close the gap" with the most advanced proprietary systems on reasoning benchmarks.

Million‑Token Context and Mixture‑of‑Experts Architecture

Both models employ a mixture‑of‑experts design that activates only a subset of parameters per task, enabling a context window of 1 million tokens. This capacity allows developers to feed entire codebases or lengthy documents into a single prompt without truncation.

Parameter Counts, Active Units, and Pricing Breakdown

V4 Pro: 1.6 trillion total parameters, 49 billion active at inference – the largest open‑weight model to date.
V4 Flash: 284 billion total parameters, 13 billion active.
Pricing (per million tokens): V4 Flash – $0.14 input, $0.28 output.
V4 Pro – $0.145 input, $3.48 output.
Both models undercut comparable offerings from OpenAI (GPT‑5.x), Google (Gemini 3.x) and Anthropic (Claude 4.x).

Open‑Weight Competition and Geopolitical Backdrop

The launch arrives a day after the U.S. accused China of large‑scale AI IP theft. DeepSeek itself faces allegations of “distilling” proprietary models from Anthropic and OpenAI, intensifying scrutiny on its rapid scaling.

Future Trajectory for DeepSeek and the Open‑Source AI Market

If the performance claims hold, DeepSeek could force closed‑source leaders to reconsider pricing and openness strategies. However, a noted lag of 3‑6 months on knowledge tests suggests the lab must accelerate research to keep pace with frontier models like GPT‑5.4 and Gemini 3.1.