Back to Headlines
Tech
Apr 24, 2026
Analyzed by GPT OSS 120B

DeepSeek Launches V4 Flash and Pro Models, Claiming to Close Gap with Frontier AI

AI Summary
DeepSeek unveiled two new large‑language models, V4 Flash and V4 Pro, featuring million‑token context windows and mixture‑of‑experts architectures. The company says the models narrow the performance gap with leading closed‑source systems while offering substantially lower inference costs, amid rising geopolitical scrutiny of Chinese AI firms.

DeepSeek’s V4 Launch Targets Frontier AI Performance

Chinese AI lab DeepSeek released preview versions of its next‑generation models—V4 Flash and V4 Pro—promising to "close the gap" with the most advanced proprietary systems on reasoning benchmarks.

Million‑Token Context and Mixture‑of‑Experts Architecture

Both models employ a mixture‑of‑experts design that activates only a subset of parameters per task, enabling a context window of 1 million tokens. This capacity allows developers to feed entire codebases or lengthy documents into a single prompt without truncation.

Parameter Counts, Active Units, and Pricing Breakdown

  • V4 Pro: 1.6 trillion total parameters, 49 billion active at inference – the largest open‑weight model to date.
  • V4 Flash: 284 billion total parameters, 13 billion active.
  • Pricing (per million tokens): V4 Flash – $0.14 input, $0.28 output.
  • V4 Pro – $0.145 input, $3.48 output.
  • Both models undercut comparable offerings from OpenAI (GPT‑5.x), Google (Gemini 3.x) and Anthropic (Claude 4.x).

Open‑Weight Competition and Geopolitical Backdrop

The launch arrives a day after the U.S. accused China of large‑scale AI IP theft. DeepSeek itself faces allegations of “distilling” proprietary models from Anthropic and OpenAI, intensifying scrutiny on its rapid scaling.

Future Trajectory for DeepSeek and the Open‑Source AI Market

If the performance claims hold, DeepSeek could force closed‑source leaders to reconsider pricing and openness strategies. However, a noted lag of 3‑6 months on knowledge tests suggests the lab must accelerate research to keep pace with frontier models like GPT‑5.4 and Gemini 3.1.