DeepSeek API: Pricing and Model Specifications for V4 Flash and Pro
By nateb2022
Summary
This page of DeepSeek's API documentation details the pricing and model specifications for DeepSeek-V4-Flash and DeepSeek-V4-Pro. It covers token-based billing (priced per 1M tokens), model versions, base URLs for the OpenAI and Anthropic API formats, thinking mode support, context length (1M tokens), and maximum output limits. The content is purely technical documentation with no narrative or editorial elements.
Key quotes
The prices listed below are in units of per 1M tokens.
A token, the smallest unit of text that the model recognizes, can be a word, a number, or even a punctuation mark.
We will bill based on the total number of input and output tokens by the model.
Supports both non-thinking and thinking (default) modes
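The billing quotes above (prices per 1M tokens, charged on input and output token counts) suggest a simple cost estimate. The following is a minimal sketch, not DeepSeek's actual billing logic: the function name `estimate_cost`, the split into separate input and output rates, and the example prices are all assumptions for illustration, since this summary does not reproduce the real price table.

```python
from decimal import Decimal

# Prices are quoted per 1M tokens, so token counts are scaled by this unit.
TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_1m: Decimal,
    output_price_per_1m: Decimal,
) -> Decimal:
    """Estimate the charge for one request.

    Hypothetical helper: assumes input and output tokens are billed at
    separate per-1M rates. Look up the real rates on the pricing page.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * input_price_per_1m
        + Decimal(output_tokens) * output_price_per_1m
    )
    return total / TOKENS_PER_PRICE_UNIT


# Placeholder prices, NOT DeepSeek's real rates.
example_cost = estimate_cost(
    input_tokens=250_000,
    output_tokens=50_000,
    input_price_per_1m=Decimal("1.00"),
    output_price_per_1m=Decimal("2.00"),
)
print(example_cost)
```

Using `Decimal` rather than `float` keeps currency arithmetic exact. Any further pricing distinctions the real page may draw are not modeled here.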