Models Added

Released: 2026-04-24

## DeepSeek V4 Pro DeepSeek V4 Pro is a large-scale Mixture-of-Experts (MoE) model with 1.6T total parameters and 49B activated per token, supporting a 1M-token context window for advanced reasoning and long-horizon workflows. It delivers strong performance across knowledge, mathematics, and software engineering tasks, making it suitable for complex, real-world applications. ## DeepSeek V4 Flash DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts (MoE) model with 284B total parameters and 13B activated per token, designed for fast inference and high-throughput workloads. It supports a 1M-token context window, enabling large-scale reasoning and long-context processing. **Enjoy them.**